
chore: Enable migration tests for clusters in legacy schema #2975

Open
wants to merge 13 commits into master

Conversation

lantoli
Member

@lantoli commented Jan 16, 2025

Description

Enable migration tests for clusters in legacy schema. With this PR, all migration tests for TPF are passing.

Link to any related issue(s): CLOUDP-295165

Type of change:

  • Bug fix (non-breaking change which fixes an issue). Please, add the "bug" label to the PR.
  • New feature (non-breaking change which adds functionality). Please, add the "enhancement" label to the PR. A migration guide must be created or updated if the new feature will go in a major version.
  • Breaking change (fix or feature that would cause existing functionality to not work as expected). Please, add the "breaking change" label to the PR. A migration guide must be created or updated.
  • This change requires a documentation update
  • Documentation fix/enhancement

Required Checklist:

  • I have signed the MongoDB CLA
  • I have read the contributing guides
  • I have checked that this change does not generate any credentials and that they are NOT accidentally logged anywhere.
  • I have added tests that prove my fix is effective or that my feature works per HashiCorp requirements
  • I have added any necessary documentation (if appropriate)
  • I have run make fmt and formatted my code
  • If changes include deprecations or removals I have added appropriate changelog entries.
  • If changes include removal or addition of 3rd party GitHub actions, I updated our internal document. Reach out to the APIx Integration slack channel to get access to the internal document.

Further comments

@@ -29,7 +29,6 @@ func TestMigAdvancedCluster_singleShardedMultiCloud(t *testing.T) {
}

func TestMigAdvancedCluster_symmetricGeoShardedOldSchema(t *testing.T) {
acc.SkipIfAdvancedClusterV2Schema(t) // unexpected update and then: error operation not permitted, nums_shards from 1 -> > 1
lantoli (Member Author)
the last mig test to enable for TPF


// sendLegacySchemaRequestToRead sets ClusterID to a special value so Read can know whether it must use legacy schema.
// private state can't be used here because it's not available in Move Upgrader.
// ClusterID is computed (not optional) so the value will be overridden in Read and the special value won't ever appear in the state file.
lantoli (Member Author)
Let me know if there are any questions about why we use ClusterID as a side channel to communicate between State Move / Upgrader and Read. Also, if you have a better idea, please let me know.

Collaborator
If we go with this option, do you think we can add a check that ClusterID != forceLegacySchema in one of the tests that exercise the upgrade?
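
(For illustration only, a minimal sketch of such a check, assuming a terraform-plugin-testing migration test; the helper name and the resource address are assumptions, and forceLegacySchema mirrors the sentinel used in this PR.)

import (
	"fmt"

	"github.com/hashicorp/terraform-plugin-testing/helper/resource"
)

// checkClusterIDIsReal is a hypothetical TestCheckFunc asserting that the
// sentinel value never ends up in the final state after the upgrade/move.
func checkClusterIDIsReal(resourceName string) resource.TestCheckFunc {
	return resource.TestCheckResourceAttrWith(resourceName, "cluster_id", func(value string) error {
		if value == "" || value == "forceLegacySchema" {
			return fmt.Errorf("cluster_id has unexpected value %q", value)
		}
		return nil
	})
}

It could then be added to the existing checks, e.g. checkClusterIDIsReal("mongodbatlas_advanced_cluster.test") (assumed resource address).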

@lantoli marked this pull request as ready for review January 17, 2025 11:23
@lantoli requested a review from a team as a code owner January 17, 2025 11:23
Comment on lines +190 to +192
func sendLegacySchemaRequestToRead(model *TFModel) {
model.ClusterID = types.StringValue("forceLegacySchema")
}
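
(For context, a minimal sketch of the Read-side counterpart; the name receivedLegacySchemaRequestInRead appears later in this review, but the body shown here is an assumption, not code from the PR.)

const forceLegacySchema = "forceLegacySchema" // assumed constant; the PR may inline the literal

// Sketch: Read checks whether the move/upgrade code paths planted the sentinel in
// ClusterID; if so it uses the legacy (num_shards-based) schema handling, and the
// computed ClusterID is later overwritten with the real value from the API.
func receivedLegacySchemaRequestInRead(model *TFModel) bool {
	return model.ClusterID.ValueString() == forceLegacySchema
}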
Member
As an alternative, is it too complex to populate the replication spec list with only the bare minimum so our existing logic detects the legacy sharding config (objects with num_shards)? With this approach we would avoid receivedLegacySchemaRequestInRead using cluster_id, which looks more hacky.

@@ -39,101 +41,149 @@ func stateUpgraderFromV1(ctx context.Context, req resource.UpgradeStateRequest,
setStateResponse(ctx, &resp.Diagnostics, req.RawState, &resp.State)
}

func setStateResponse(ctx context.Context, diags *diag.Diagnostics, stateIn *tfprotov6.RawState, stateOut *tfsdk.State) {
rawStateValue, err := stateIn.UnmarshalWithOpts(tftypes.Object{
// Minimum attributes needed from source schema. Read will fill in the rest
Collaborator
Non-blocking comment: in the future, when new attributes are added, will there be some compile-time or test failure if that field needs to be specified here?

lantoli (Member Author)
In line 69 we're using IgnoreUndefinedAttributes: true, which means we're flexible about the schema. If you look at the code below, the only truly mandatory attributes are project_id and cluster; the others we use on a best-effort basis, and it's ok if they don't come (e.g. later, when moving from flex cluster).

So the schema doesn't fail if some attribute doesn't exist; later, when the value is read, it will be null.
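
(For readers following the thread, a minimal sketch of the unmarshal call being discussed, using terraform-plugin-go's tfprotov6/tftypes API; the attribute set shown is an assumption and only illustrates how undefined or missing attributes are tolerated.)

// Sketch: IgnoreUndefinedAttributes allows the raw state to contain attributes not
// listed in this tftypes.Object; listed attributes missing from the raw state
// simply come back as null when they are read later.
rawStateValue, err := stateIn.UnmarshalWithOpts(tftypes.Object{
	AttributeTypes: map[string]tftypes.Type{
		"project_id": tftypes.String, // assumed minimal attribute set for illustration
		"name":       tftypes.String,
	},
}, tfprotov6.UnmarshalOpts{
	ValueFromJSONOpts: tftypes.ValueFromJSONOpts{IgnoreUndefinedAttributes: true},
})
if err != nil {
	diags.AddError("Unable to parse state", err.Error())
	return
}
_ = rawStateValue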

Collaborator
"it's ok if they don't come"
OK, then why are we even populating them? What is the advantage of populating them? Answering this question should also help me understand the consequence of:
"later, when the value is read, it will be null"

lantoli (Member Author) Jan 17, 2025

If the previous version/resource has them and we don't set them, then there will be a plan change. E.g. if timeouts or retain_backups_enabled is defined in SDKv2 and the user wants to migrate to TPF, they will get a plan change saying that timeouts/retain_backups_enabled will be deleted. (We don't want plan changes when upgrading from SDKv2 to TPF, or when using a moved block from cluster to the TPF adv_cluster.)

In the case that will come later with moving from flex cluster to adv_cluster, for example, those attributes don't exist, so it's ok not to send them.
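
(A minimal sketch of that best-effort copying; the helper name copyOptionalBool and the exact field handling are assumptions, not code from this PR.)

// copyOptionalBool copies a bool attribute such as retain_backups_enabled from the
// old raw state object only when it is present, known, and non-null; otherwise the
// destination stays null, so sources that never had the attribute (e.g. flex
// clusters) don't produce a spurious plan change.
func copyOptionalBool(attrs map[string]tftypes.Value, key string, dst *types.Bool) {
	v, ok := attrs[key]
	if !ok || v.IsNull() || !v.IsKnown() {
		return
	}
	var b bool
	if err := v.As(&b); err == nil {
		*dst = types.BoolValue(b)
	}
}

(Here types is github.com/hashicorp/terraform-plugin-framework/types and tftypes is github.com/hashicorp/terraform-plugin-go/tftypes.)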

lantoli (Member Author)
It's to avoid plan changes, by filling in attributes that Read can't fill in by itself.
