
neonvm: prevent migrating VMs to nodes with non-matching architecture #1123

Open · wants to merge 3 commits into main from misha/architecture-fence-for-livemigration
Conversation

@mikhail-sakhnov (Contributor) commented Oct 24, 2024

Add node affinity to the VM spec during the live migration pending state, if it doesn't already have one.

#1083
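
For context, here is a minimal sketch of the affinity term this change pins onto the VM, built from the corev1 types. The helper name archAffinityFor is hypothetical; the PR's actual logic lives in addArchitectureAffinity.

import (
	corev1 "k8s.io/api/core/v1"
)

// archAffinityFor builds a required node affinity that restricts scheduling
// to nodes of the given architecture via the well-known kubernetes.io/arch label.
func archAffinityFor(arch string) *corev1.Affinity {
	return &corev1.Affinity{
		NodeAffinity: &corev1.NodeAffinity{
			RequiredDuringSchedulingIgnoredDuringExecution: &corev1.NodeSelector{
				NodeSelectorTerms: []corev1.NodeSelectorTerm{{
					MatchExpressions: []corev1.NodeSelectorRequirement{{
						Key:      "kubernetes.io/arch",
						Operator: corev1.NodeSelectorOpIn,
						// e.g. sourceNode.Status.NodeInfo.Architecture
						Values: []string{arch},
					}},
				}},
			},
		},
	}
}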


github-actions bot commented Oct 24, 2024

No changes to the coverage.


@mikhail-sakhnov force-pushed the misha/architecture-fence-for-livemigration branch 2 times, most recently from 5446383 to a036d85 on October 24, 2024 09:46
@mikhail-sakhnov marked this pull request as ready for review on October 24, 2024 09:46
@mikhail-sakhnov force-pushed the misha/architecture-fence-for-livemigration branch 3 times, most recently from 893c635 to 8e751ea on October 31, 2024 00:58
@sharnoff self-assigned this Oct 31, 2024
@sharnoff (Member) left a comment:

Some comments

pkg/neonvm/controllers/vmmigration_controller.go (outdated)
if vm.Spec.Affinity != nil && vm.Spec.Affinity.NodeAffinity != nil && vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution != nil {
	for _, term := range vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution.NodeSelectorTerms {
		for _, expr := range term.MatchExpressions {
			if expr.Key == "kubernetes.io/arch" && expr.Operator == corev1.NodeSelectorOpIn && len(expr.Values) > 0 {
@sharnoff (Member):
question: does len(expr.Values) > 0 mean that this function will return true for VMs that have an architecture affinity for e.g. "either amd64 or arm64"? IIUC, would that mean that we could accidentally migrate between x86 and ARM?

@mikhail-sakhnov (Contributor, Author):
Values are checked in the loop body, if I understand your question correctly.

Contributor:
I think what Em means is that for the following input:

{
	Key:      "kubernetes.io/arch",
	Operator: corev1.NodeSelectorOpIn,
	Values:   []string{"x86", "arm64"},
}

The function as written will return true, even though this affinity doesn't prevent us from migrating between x86 and arm.
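
To make that concrete, here is a hedged test sketch of the loophole. The test itself is hypothetical and assumes vm.Spec.Affinity mirrors corev1.Affinity (as the snippet above suggests); the vmv1 import path is assumed.

import (
	"testing"

	corev1 "k8s.io/api/core/v1"
	vmv1 "github.com/neondatabase/autoscaling/neonvm/apis/neonvm/v1" // path assumed
)

func TestHasArchitectureAffinity_MultiValue(t *testing.T) {
	vm := &vmv1.VirtualMachine{}
	vm.Spec.Affinity = &corev1.Affinity{
		NodeAffinity: &corev1.NodeAffinity{
			RequiredDuringSchedulingIgnoredDuringExecution: &corev1.NodeSelector{
				NodeSelectorTerms: []corev1.NodeSelectorTerm{{
					MatchExpressions: []corev1.NodeSelectorRequirement{{
						Key:      "kubernetes.io/arch",
						Operator: corev1.NodeSelectorOpIn,
						Values:   []string{"amd64", "arm64"}, // allows both architectures
					}},
				}},
			},
		},
	}
	node := &corev1.Node{}
	node.Status.NodeInfo.Architecture = "amd64"

	// hasArchitectureAffinity returns true here, so no single-arch affinity
	// gets added, yet the scheduler could still place the target pod on arm64.
	if !hasArchitectureAffinity(vm, node) {
		t.Fatal("multi-value arch expression was treated as a fence")
	}
}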

tests/e2e/vm-migration/01-assert.yaml
Comment on lines +187 to +195
if err := addArchitectureAffinity(vm, sourceNode); err != nil {
	log.Error(err, "Failed to add architecture affinity to VM", "sourceNodeStatus", sourceNode.Status)
}
if err = r.Update(ctx, vm); err != nil {
	log.Error(err, "Failed to update node affinity of the source vm")
	return ctrl.Result{RequeueAfter: time.Second}, err
}
@sharnoff (Member):
question: IIUC, does this mean that the VM will permanently get affinity for the node it starts with?

Is there a reason we can't just do it on the target pod for the migration? (I guess, potential race conditions?)

@mikhail-sakhnov (Contributor, Author):
I don't think there is a particular reason; I just feel it is a bit safer to put the guard on the very first step of the migration.

Plus, if we're going to have multi-arch clusters, it's a bit unsafe to even have a VM without architecture affinity; maybe later it should even be added during VM creation, in the vmcontroller?

Contributor:
> it is a bit unsafe to even have VM without architecture affinity

Do we know of any particular reason it might be unsafe? The same VM can create its pod on x86 the first time and, if that pod is deleted, have it recreated on an arm node. We could potentially utilize this to auto-balance between architectures.

Although a regular restart of endpoints means recreating the VM object anyway (right?), so this might not matter much.

@mikhail-sakhnov (Contributor, Author):
> We can potentially utilize this to auto-balance between architectures.

If we can do that, why even prevent migration between architectures? I was under the impression that postgres can't safely migrate from one arch to another because of the data files; I need to double-check that.

Contributor:
> We can potentially utilize this to auto-balance between architectures.

> if we can do it, why even prevent migration between architectures?

Having VM objects without affinity doesn't mean migrating between architectures. Pods would still have a defined architecture; the architecture would be undefined only when there is no pod. This means that if a pod is deleted for whatever reason, we can recreate it with a different architecture. But I am not sure this is a practical scenario.

> postgres can't safely migrate from one arch to another

It's not even postgres, it's the whole VM: all memory, including executable code, registers, etc.

@sharnoff assigned mikhail-sakhnov and unassigned sharnoff Nov 18, 2024
@mikhail-sakhnov force-pushed the misha/architecture-fence-for-livemigration branch from a0219ef to c5ba550 on November 26, 2024 12:27
@mikhail-sakhnov force-pushed the misha/architecture-fence-for-livemigration branch from c5ba550 to f5246f7 on November 26, 2024 12:43
Comment on lines +777 to +792
func hasArchitectureAffinity(vm *vmv1.VirtualMachine, sourceNode *corev1.Node) bool {
	if vm.Spec.Affinity != nil && vm.Spec.Affinity.NodeAffinity != nil && vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution != nil {
		for _, term := range vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution.NodeSelectorTerms {
			for _, expr := range term.MatchExpressions {
				if expr.Key == "kubernetes.io/arch" && expr.Operator == corev1.NodeSelectorOpIn && len(expr.Values) > 0 {
					for _, value := range expr.Values {
						if value == sourceNode.Status.NodeInfo.Architecture {
							return true
						}
					}
				}
			}
		}
	}
	return false
}
Contributor:
This function can be rewritten to reduce the maximum nesting level from 7 to 4, something like below.

This implementation also doesn't use sourceNode.

Suggested change
func hasArchitectureAffinity(vm *vmv1.VirtualMachine, sourceNode *corev1.Node) bool {
	if vm.Spec.Affinity != nil && vm.Spec.Affinity.NodeAffinity != nil && vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution != nil {
		for _, term := range vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution.NodeSelectorTerms {
			for _, expr := range term.MatchExpressions {
				if expr.Key == "kubernetes.io/arch" && expr.Operator == corev1.NodeSelectorOpIn && len(expr.Values) > 0 {
					for _, value := range expr.Values {
						if value == sourceNode.Status.NodeInfo.Architecture {
							return true
						}
					}
				}
			}
		}
	}
	return false
}

func hasArchitectureAffinity(vm *vmv1.VirtualMachine, sourceNode *corev1.Node) bool {
	if vm.Spec.Affinity == nil || vm.Spec.Affinity.NodeAffinity == nil || vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution == nil {
		return false
	}
	terms := vm.Spec.Affinity.NodeAffinity.RequiredDuringSchedulingIgnoredDuringExecution.NodeSelectorTerms
	for _, term := range terms {
		for _, expr := range term.MatchExpressions {
			if expr.Key == "kubernetes.io/arch" &&
				expr.Operator == corev1.NodeSelectorOpIn &&
				len(expr.Values) == 1 {
				return true
			}
		}
	}
	return false
}


