Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

increase status test timeouts for OCP deployment #15

Open
xtophd opened this issue Mar 6, 2021 · 1 comment
Open

increase status test timeouts for OCP deployment #15

xtophd opened this issue Mar 6, 2021 · 1 comment

Comments

@xtophd
Copy link
Owner

xtophd commented Mar 6, 2021

part of the playbooks finishing process that

  • watches the cluster deployment progress,
  • checks for nodes ready,
  • checks for and approves CSR requests,
  • checks for clusteroperator availability,
  • etc ...

the timeouts are apparently too short for OCP 4.7.

also note that even previously the timeouts are too short for deployments with very slow download links.

3 major issues to consider:

  • if timeouts expire and playbooks exit during the bastion buildout (ie: during downloads), the automated deployment ends in failure
  • if timeouts expire and playbooks exit during the wait for "nodes ready", the automated detecting and approval of CSR requests ends and the automated deployment ends, but is manually recoverable
  • if timeouts expire while waiting for "clusteroperators available", the playbooks end with a failure but the cluster will continue to deploy until successful (probably).
@xtophd
Copy link
Owner Author

xtophd commented Mar 6, 2021

one possibility is to add a set of unique timer values to the openshift version config. individual users could then modify a multiplier parameter in the master-config.yml to alter default values to something more accommodating to their environment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant