Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

✨ ROSA: Cleanup #4850

Merged
merged 2 commits into from
Mar 12, 2024
Merged

Conversation

muraee
Copy link
Contributor

@muraee muraee commented Mar 6, 2024

What type of PR is this?

/kind cleanup

What this PR does / why we need it:

  • Group ROSA clusterCompute config into 1 field DefaultMachinePoolSpec
  • Make ROSAControlePlane force deletion optional through the annotation controlplane.cluster.x-k8s.io/rosacontrolplane-force-delete

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #

Special notes for your reviewer:

Checklist:

  • squashed commits
  • includes documentation
  • includes emojis
  • adds unit tests
  • adds or updates e2e tests

Release note:

NONE

@k8s-ci-robot k8s-ci-robot added kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Mar 6, 2024
@k8s-ci-robot k8s-ci-robot added needs-priority size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Mar 6, 2024
@muraee muraee changed the title ROSA: Cleanup ✨ ROSA: Cleanup Mar 6, 2024
@k8s-ci-robot k8s-ci-robot added release-note-none Denotes a PR that doesn't merit a release note. and removed do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. labels Mar 6, 2024
@muraee muraee force-pushed the rosa-cleanup-api branch from 14cddba to 0faf93c Compare March 6, 2024 14:13
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Mar 6, 2024
@muraee muraee force-pushed the rosa-cleanup-api branch from 0faf93c to 54d8299 Compare March 6, 2024 14:34
@k8s-ci-robot k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Mar 6, 2024
@muraee muraee force-pushed the rosa-cleanup-api branch from 54d8299 to 0c8b94b Compare March 6, 2024 14:58
Comment on lines 159 to 172
// ClusterComputeSpec defines the configuration for the required worker nodes provisioned as part of the cluster creation.
type ClusterComputeSpec struct {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this for the default workers pool configuration?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we call this struct something different then? For example defaultNodePool and specify why/when this configuration is needed? are we also planning to surface this information as a MachinePool?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't want to call it defaultNodePool since OCM can create multiple nodepools (one per az).
the rosa cli flag for this is called computeMachineType, so I thought this will be less confusing for customer transition from the cli, wdyt?

specify why/when this configuration is needed

InstanceType and Autoscaling are optional, only the availabilityZones field is required, we could probably infer this from the subnetIDs field in the future.

are we also planning to surface this information as a MachinePool?

At first I was planing to do that when I thought only 1 default nodepool will be created, but given the fact that multiple default nodepools can be created. it didn't seem like a good idea anymore. the logic is more complicated and the resulting ROSAMachinePool CRs will be different from CRs created manually because OCM limits what fields you can edit on the defaul machinepools and also prevents you from deleting them.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

so I thought this will be less confusing for customer transition from the cli, wdyt?

What's the purpose of these configuration field? My understanding was that ROSA uses these fields to create a default pool for each availability zone?

only the availabilityZones field is required

Do we want to move availabilityZones outside given that's a required field?

we could probably infer this from the subnetIDs field in the future

Is there an issue open for it?

CRs will be different from CRs created manually because OCM limits what fields you can edit on the defaul machinepools and also prevents you from deleting them.

From a Cluster API user experience perspective seems like we might want to fix these behaviors relatively soon.

Are we planning to delete these machine pools eventually once OCM APIs support to create them with zero replicas?

If we don't surface them, users would incur in additional costs hidden away from their configuration, which isn't ideal.

Copy link
Contributor Author

@muraee muraee Mar 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What's the purpose of these configuration field? My understanding was that ROSA uses these fields to create a default pool for each availability zone?

correct

Is there an issue open for it?

#4459

Do we want to move availabilityZones outside given that's a required field?
If we don't surface them, users would incur in additional costs hidden away from their configuration, which isn't ideal.

since its difficult to surface those default machinepools today, my idea was to have this required field there, so you would always get the computeSpec field in your manifest so you know that you are getting compute resources created per az you specify.
This is not great, but the rosa cli will create the same default machinepools when you do rosa create cluster --avaialbity-zones az1, az2 ... so users of the cli are familiar with this behavior.

The plan is for OCM/ROSA to stop creating default machinePools altogether. Then we would just need to drop this clusterCompute field in CAPI.

@muraee
Copy link
Contributor Author

muraee commented Mar 6, 2024

/retest-required

@muraee muraee force-pushed the rosa-cleanup-api branch 2 times, most recently from ccb113b to 53fd6f2 Compare March 12, 2024 17:45
@muraee
Copy link
Contributor Author

muraee commented Mar 12, 2024

/retest-required

muraee added 2 commits March 12, 2024 19:50
- group rosa clusterCompute config
- make force deletion optional through annotation
- report deleting status in condition
- fixed machinepool triggering an upgrade when upgrading controlPlane
@muraee muraee force-pushed the rosa-cleanup-api branch from 53fd6f2 to d60bc63 Compare March 12, 2024 18:50
Copy link
Member

@vincepri vincepri left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/approve
/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Mar 12, 2024
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: vincepri

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 12, 2024
@k8s-ci-robot k8s-ci-robot merged commit 52469b2 into kubernetes-sigs:main Mar 12, 2024
22 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. lgtm "Looks good to me", indicates that a PR is ready to be merged. needs-priority release-note-none Denotes a PR that doesn't merit a release note. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants