[release-ocm-2.14] MGMT-23971: Add a timeout for installing-pending-user-action#10260

Merged
openshift-merge-bot[bot] merged 6 commits into openshift:release-ocm-2.14 from carbonin:2.14-pending-timeout
May 6, 2026

@carbonin
Member

This is a manual cherry-pick of #10202

Hosts in `installing-pending-user-action` were stalling entire clusters
even if the cluster would install fine without them. This commit adds a
timeout for hosts in this state so that the cluster can succeed when the
required minimum number of nodes have already installed.

This is especially important in cases where very large clusters are
being installed (~100 nodes). In these kinds of cases, one or two worker
nodes shouldn't force the user to reinstall the entire thing if they
don't want to monitor the multi-hour install process for hosts failing
to reboot.

Resolves https://redhat.atlassian.net/browse/MGMT-23971

Assisted-By: Claude Code

The mapping was slightly more efficient, but for small lists of hosts
and small lists of statuses this change is fine and much easier to read.

These will be needed so that the hosts package can evaluate if a cluster
is ready to move install states.

Without this it would be possible for a host to time out in
`installing-pending-user-action` when it was a host that was required for
the entire cluster to succeed (for example, control plane nodes, or
workers in non-compact clusters).

If enough other hosts have installed for the cluster to finish _and_ the
host has spent over an hour in pending-user-action then it will time out
to allow the user to use the cluster.
@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Apr 29, 2026

openshift-ci-robot commented Apr 29, 2026

@carbonin: This pull request references MGMT-23971 which is a valid jira issue.

In response to this:

This is a manual cherry-pick of #10202

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.


coderabbitai Bot commented Apr 29, 2026

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: d5d95bf3-5514-43bf-84d6-1a8414e93b72

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

@openshift-ci openshift-ci Bot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label Apr 29, 2026
@openshift-ci openshift-ci Bot requested review from oourfali and pastequo April 29, 2026 14:52

openshift-ci Bot commented Apr 29, 2026

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: carbonin

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci Bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Apr 29, 2026

codecov Bot commented Apr 29, 2026

Codecov Report

❌ Patch coverage is 83.82353% with 11 lines in your changes missing coverage. Please review.
✅ Project coverage is 42.88%. Comparing base (fc109f5) to head (e728cc9).
⚠️ Report is 2 commits behind head on release-ocm-2.14.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| internal/host/transition.go | 42.10% | 6 Missing and 5 partials ⚠️ |

Additional details and impacted files

@@                Coverage Diff                @@
##           release-ocm-2.14   #10260   +/-   ##
=================================================
  Coverage             42.87%   42.88%           
=================================================
  Files                   380      380           
  Lines                 67872    67887   +15     
=================================================
+ Hits                  29102    29111    +9     
- Misses                36150    36152    +2     
- Partials               2620     2624    +4     

| Files with missing lines | Coverage Δ |
|---|---|
| internal/cluster/common.go | 72.03% <ø> (-4.32%) ⬇️ |
| internal/cluster/transition.go | 73.47% <100.00%> (-1.11%) ⬇️ |
| internal/common/common.go | 37.84% <100.00%> (+5.82%) ⬆️ |
| internal/host/common.go | 88.88% <ø> (ø) |
| internal/host/config.go | 100.00% <ø> (ø) |
| internal/host/statemachine.go | 100.00% <100.00%> (ø) |
| internal/host/transition.go | 59.15% <42.10%> (-0.72%) ⬇️ |

... and 3 files with indirect coverage changes

Member Author

carbonin commented May 5, 2026

/test subsystem-aws

Contributor

gamli75 commented May 6, 2026

/lgtm

@openshift-ci openshift-ci Bot added the lgtm Indicates that a PR is ready to be merged. label May 6, 2026
Contributor

gamli75 commented May 6, 2026

/test e2e-ai-operator-ztp


openshift-ci Bot commented May 6, 2026

@carbonin: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@openshift-merge-bot openshift-merge-bot Bot merged commit 18c02d3 into openshift:release-ocm-2.14 May 6, 2026
15 checks passed
Labels

approved: Indicates a PR has been approved by an approver from all required OWNERS files.
jira/valid-reference: Indicates that this PR references a valid Jira ticket of any type.
lgtm: Indicates that a PR is ready to be merged.
size/XL: Denotes a PR that changes 500-999 lines, ignoring generated files.
