Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement JMX E2E Tests in Terraform Framework #443

Merged
merged 103 commits into from
Dec 30, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
103 commits
Select commit Hold shift + click to select a range
6f320b0
Preliminary testing.
musa-asad Nov 29, 2024
5727809
Support environment.
musa-asad Dec 1, 2024
912d98e
Fix file positioning.
musa-asad Dec 1, 2024
5d0362f
Fix file positioning.
musa-asad Dec 1, 2024
6740715
Fix values.
musa-asad Dec 1, 2024
c401c0e
Fix otel and prometheus config set-up.
musa-asad Dec 2, 2024
e8d4901
Remove jmx files.
musa-asad Dec 2, 2024
22186ed
Add jmx testing.
musa-asad Dec 2, 2024
7a02bc2
Merge branch 'main' into e2e
musa-asad Dec 4, 2024
b5b2fb9
Allow to choose e2e or regular tests.
musa-asad Dec 8, 2024
7672929
Merge remote-tracking branch 'origin/e2e' into e2e-jmx
musa-asad Dec 8, 2024
9a830a0
Add tomcat sessions check.
musa-asad Dec 8, 2024
5795103
Add TA.
musa-asad Dec 9, 2024
6a5f26e
Update jmx_test.go.
musa-asad Dec 9, 2024
c967e3d
Port foward and fix kafka.
musa-asad Dec 9, 2024
63b6ba8
Add CI.
musa-asad Dec 9, 2024
dfd94e0
Add TA.
musa-asad Dec 9, 2024
da21e5a
Merge branch 'main' into e2e
musa-asad Dec 9, 2024
82f7159
Merge branch 'e2e' into e2e-jmx
musa-asad Dec 9, 2024
ad76db4
Update to 2h.
musa-asad Dec 10, 2024
d918ffb
Remove port forwarding.
musa-asad Dec 11, 2024
987b005
Fix providers.
musa-asad Dec 11, 2024
9ea9eb2
Use loadbalancer.
musa-asad Dec 11, 2024
6f8d7d1
Fix json.
musa-asad Dec 12, 2024
ba53b14
Make 2 hours
musa-asad Dec 12, 2024
f5d2733
Fix providers.tf
musa-asad Dec 12, 2024
ee01d1b
Merge branch 'e2e' into e2e-jmx
musa-asad Dec 12, 2024
b6416ae
Add auth for IAM role.
musa-asad Dec 13, 2024
5714a7e
Verify cluster access.
musa-asad Dec 13, 2024
9a4bb6e
Remove force.
musa-asad Dec 13, 2024
8628c7d
Add assume role.
musa-asad Dec 13, 2024
20fa9be
Update JMX tests.
musa-asad Dec 13, 2024
cfc6fe1
Consumer only metrics.
musa-asad Dec 13, 2024
ff5cb0a
Fix formatting.
musa-asad Dec 13, 2024
82f9d34
Remove unnecessary resources.
musa-asad Dec 13, 2024
e6b5faa
Fix formatting.
musa-asad Dec 13, 2024
17657f0
Remove unnecessary resources.
musa-asad Dec 13, 2024
1969fe0
5 minutes.
musa-asad Dec 13, 2024
f18fd26
Merge branch 'e2e' into e2e-jmx
musa-asad Dec 13, 2024
e0fbd7e
Remove redundant logic.
musa-asad Dec 13, 2024
537cb6f
Revert "Remove redundant logic."
musa-asad Dec 13, 2024
43b7dae
Remove redundant logic.
musa-asad Dec 13, 2024
d2b7860
Remove redundant logic.
musa-asad Dec 13, 2024
371e8ca
Merge branch 'e2e' into e2e-jmx
musa-asad Dec 13, 2024
18b1257
Use older version of Kafka and update annotations/config to include K…
musa-asad Dec 15, 2024
67f3247
Missing.
musa-asad Dec 15, 2024
497c79b
Allow for float metrics.
musa-asad Dec 15, 2024
65272af
Set to 1 node.
musa-asad Dec 16, 2024
9f9e9db
Use 0.01.
musa-asad Dec 16, 2024
4ea25c2
Consumer only metrics.
musa-asad Dec 16, 2024
dbea081
Update nodes.
musa-asad Dec 16, 2024
08c2780
Merge branch 'e2e' into e2e-jmx
musa-asad Dec 16, 2024
a0c088e
Merge branch 'main' into e2e-jmx
musa-asad Dec 16, 2024
37dcf0b
Allow node configuration.
musa-asad Dec 16, 2024
cf4a54c
Fix spacing.
musa-asad Dec 16, 2024
58101e3
Add license.
musa-asad Dec 16, 2024
0db3337
Check for max metric value.
musa-asad Dec 17, 2024
803bad7
Simplify
musa-asad Dec 17, 2024
3611226
Fix imports.
musa-asad Dec 17, 2024
123f8e5
Prevent test from running when it doesn't need to.
musa-asad Dec 17, 2024
9591142
Update logic for max metric value.
musa-asad Dec 17, 2024
6fcde7e
Improve logic to check nodes being ran.
musa-asad Dec 17, 2024
0533bfa
Delete loadbalancer after complete
musa-asad Dec 17, 2024
c995fc7
constant
musa-asad Dec 17, 2024
5efca7c
constant
musa-asad Dec 17, 2024
a10dc57
Merge branch 'main' into e2e-jmx
musa-asad Dec 18, 2024
b9ae22b
Merge branch 'main' into e2e-jmx
musa-asad Dec 18, 2024
74c44df
address initial comments
musa-asad Dec 18, 2024
5d0af54
addressed comments
musa-asad Dec 19, 2024
917a606
change generator
musa-asad Dec 19, 2024
975168c
clean
musa-asad Dec 19, 2024
28e5c55
update sample
musa-asad Dec 20, 2024
8bf4517
fix kafka
musa-asad Dec 20, 2024
8e94f74
minor adjustments
musa-asad Dec 20, 2024
2fbb559
check annotations
musa-asad Dec 20, 2024
4360691
clean up
musa-asad Dec 20, 2024
7ab4f90
30m
musa-asad Dec 20, 2024
91aa2ad
clean function
musa-asad Dec 20, 2024
953272d
spacing
musa-asad Dec 20, 2024
545a256
move variable
musa-asad Dec 20, 2024
67c6371
move variable
musa-asad Dec 20, 2024
78287c6
remove bad exit
musa-asad Dec 20, 2024
bf1c276
remove quotes
musa-asad Dec 20, 2024
9799dc9
adjust waits
musa-asad Dec 20, 2024
13a6985
fix for ci dimensions
musa-asad Dec 20, 2024
e125678
fix for ci dimensions
musa-asad Dec 20, 2024
1e29443
fix for ci dimensions
musa-asad Dec 20, 2024
a8f40e2
typo
musa-asad Dec 20, 2024
2fb3b53
remove var
musa-asad Dec 20, 2024
bef2af6
switch to nodeport
musa-asad Dec 22, 2024
8cb81ce
5 min
musa-asad Dec 22, 2024
bf73309
revision 3
musa-asad Dec 27, 2024
8fe5e55
move and rename kubernetes.go
musa-asad Dec 27, 2024
fadc785
added var for deployment strategy
musa-asad Dec 27, 2024
9dbaa09
increase wait
musa-asad Dec 27, 2024
369da5c
Revert "increase wait"
musa-asad Dec 27, 2024
94b3d9c
added ami_type
musa-asad Dec 27, 2024
21c2eb1
add deployment strategy
musa-asad Dec 27, 2024
e620f14
fmt
musa-asad Dec 27, 2024
f65adc6
move function to utils
musa-asad Dec 27, 2024
fc9f53b
spacing
musa-asad Dec 27, 2024
f1dc36f
remove flaky node names logic
musa-asad Dec 30, 2024
52f3a86
set back to 3h
musa-asad Dec 30, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 2 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -10,4 +10,5 @@ terraform.*
**/.terraform/*
/test/**/final_*.yml
coverage.txt
generator/resources/*complete*.json
generator/resources/*complete*.json
terraform/eks/e2e/helm-charts
18 changes: 10 additions & 8 deletions environment/eksdeploymenttype/eks_deployment_type.go
Original file line number Diff line number Diff line change
Expand Up @@ -8,20 +8,22 @@ import "strings"
type EKSDeploymentType string

const (
DAEMON EKSDeploymentType = "DAEMON"
REPLICA EKSDeploymentType = "REPLICA"
SIDECAR EKSDeploymentType = "SIDECAR"
DAEMON EKSDeploymentType = "DAEMON"
REPLICA EKSDeploymentType = "REPLICA"
SIDECAR EKSDeploymentType = "SIDECAR"
STATEFUL EKSDeploymentType = "STATEFUL"
)

var (
ecsDeploymentTypes = map[string]EKSDeploymentType{
"DAEMON": DAEMON,
"REPLICA": REPLICA,
"SIDECAR": SIDECAR,
eksDeploymentTypes = map[string]EKSDeploymentType{
"DAEMON": DAEMON,
"REPLICA": REPLICA,
"SIDECAR": SIDECAR,
"STATEFUL": STATEFUL,
}
)

func FromString(str string) (EKSDeploymentType, bool) {
c, ok := ecsDeploymentTypes[strings.ToUpper(str)]
c, ok := eksDeploymentTypes[strings.ToUpper(str)]
return c, ok
}
4 changes: 4 additions & 0 deletions environment/metadata.go
Original file line number Diff line number Diff line change
Expand Up @@ -47,6 +47,7 @@ type MetaData struct {
AmpWorkspaceId string
Region string
K8sVersion string
Destroy bool
HelmChartsBranch string
CloudwatchAgentRepository string
CloudwatchAgentTag string
Expand Down Expand Up @@ -87,6 +88,7 @@ type MetaDataStrings struct {
AmpWorkspaceId string
Region string
K8sVersion string
Destroy bool
HelmChartsBranch string
CloudwatchAgentRepository string
CloudwatchAgentTag string
Expand Down Expand Up @@ -136,6 +138,7 @@ func registerEKSData(d *MetaDataStrings) {
func registerEKSE2ETestData(dataString *MetaDataStrings) {
flag.StringVar(&(dataString.Region), "region", "", "AWS region")
flag.StringVar(&(dataString.K8sVersion), "k8s_version", "", "Kubernetes version")
flag.BoolVar(&(dataString.Destroy), "destroy", false, "Whether to run in destroy mode (true/false)")
flag.StringVar(&(dataString.HelmChartsBranch), "helm_charts_branch", "", "Helm charts branch")
flag.StringVar(&(dataString.CloudwatchAgentRepository), "cloudwatch_agent_repository", "", "CloudWatch Agent repository")
flag.StringVar(&(dataString.CloudwatchAgentTag), "cloudwatch_agent_tag", "", "CloudWatch Agent tag")
Expand Down Expand Up @@ -318,6 +321,7 @@ func GetEnvironmentMetaData() *MetaData {
metaDataStorage.AmpWorkspaceId = registeredMetaDataStrings.AmpWorkspaceId
metaDataStorage.Region = registeredMetaDataStrings.Region
metaDataStorage.K8sVersion = registeredMetaDataStrings.K8sVersion
metaDataStorage.Destroy = registeredMetaDataStrings.Destroy
metaDataStorage.HelmChartsBranch = registeredMetaDataStrings.HelmChartsBranch
metaDataStorage.CloudwatchAgentRepository = registeredMetaDataStrings.CloudwatchAgentRepository
metaDataStorage.CloudwatchAgentTag = registeredMetaDataStrings.CloudwatchAgentTag
Expand Down
5 changes: 5 additions & 0 deletions generator/resources/eks_e2e_jmx_test_matrix.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
[
{
"nodes": 2
}
]
5 changes: 0 additions & 5 deletions generator/resources/eks_e2e_test_matrix.json

This file was deleted.

4 changes: 3 additions & 1 deletion generator/test_case_generator.go
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,8 @@ type matrixRow struct {
CaCertPath string `json:"caCertPath"`
ValuesPerMinute int `json:"values_per_minute"` // Number of metrics to be sent or number of log lines to write
K8sVersion string `json:"k8sVersion"`
Nodes int `json:"nodes"`
DeploymentStrategy string `json:"deploymentStrategy"`
TerraformDir string `json:"terraform_dir"`
UseSSM bool `json:"useSSM"`
ExcludedTests string `json:"excludedTests"`
Expand Down Expand Up @@ -247,7 +249,7 @@ var testTypeToTestConfig = map[string][]testConfig{
}

var testTypeToTestConfigE2E = map[string][]testConfig{
"eks_e2e": {
"eks_e2e_jmx": {
{testDir: "../../../test/e2e/jmx"},
},
}
Expand Down
35 changes: 32 additions & 3 deletions terraform/eks/e2e/main.tf
Original file line number Diff line number Diff line change
Expand Up @@ -39,10 +39,10 @@ resource "aws_eks_node_group" "this" {
min_size = var.nodes
}

ami_type = "AL2_x86_64"
ami_type = var.ami_type
capacity_type = "ON_DEMAND"
disk_size = 20
instance_types = ["t3a.medium"]
instance_types = [var.instance_type]

depends_on = [
aws_iam_role_policy_attachment.node_CloudWatchAgentServerPolicy,
Expand All @@ -52,6 +52,15 @@ resource "aws_eks_node_group" "this" {
]
}

resource "aws_security_group_rule" "nodeport_inbound" {
musa-asad marked this conversation as resolved.
Show resolved Hide resolved
type = "ingress"
from_port = 30080
to_port = 30080
protocol = "tcp"
cidr_blocks = ["0.0.0.0/0"]
security_group_id = aws_eks_cluster.this.vpc_config[0].cluster_security_group_id
}

resource "aws_iam_role" "node_role" {
name = "${local.cluster_name}-Worker-Role-${module.common.testing_id}"

Expand Down Expand Up @@ -109,6 +118,13 @@ resource "null_resource" "helm_charts" {
resource "null_resource" "validator" {
depends_on = [aws_eks_cluster.this, aws_eks_node_group.this, null_resource.helm_charts]

triggers = {
cluster_name = aws_eks_cluster.this.name
region = var.region
test_dir = var.test_dir
eks_deployment_strategy = var.eks_deployment_strategy
}

provisioner "local-exec" {
command = <<-EOT
echo "Validating K8s resources and metrics"
Expand All @@ -117,7 +133,7 @@ resource "null_resource" "validator" {
-k8s_version=${var.k8s_version} \
-eksClusterName=${aws_eks_cluster.this.name} \
-computeType=EKS \
-eksDeploymentStrategy=DAEMON \
-eksDeploymentStrategy=${var.eks_deployment_strategy} \
-helm_charts_branch=${var.helm_charts_branch} \
-cloudwatch_agent_repository=${var.cloudwatch_agent_repository} \
-cloudwatch_agent_tag=${var.cloudwatch_agent_tag} \
Expand All @@ -134,4 +150,17 @@ resource "null_resource" "validator" {
-sample_app="${var.test_dir}/${var.sample_app}"
EOT
}

provisioner "local-exec" {
when = destroy
command = <<-EOT
echo "Running cleanup for K8s resources"
go test -timeout 30m -v ${self.triggers.test_dir} \
-destroy \
-region=${self.triggers.region} \
-eksClusterName=${self.triggers.cluster_name} \
-computeType=EKS \
-eksDeploymentStrategy=${self.triggers.eks_deployment_strategy}
EOT
}
}
17 changes: 16 additions & 1 deletion terraform/eks/e2e/variables.tf
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,17 @@ variable "cluster_name" {

variable "nodes" {
type = number
default = 1
default = 2
}

variable "ami_type" {
musa-asad marked this conversation as resolved.
Show resolved Hide resolved
type = string
default = "AL2_x86_64"
}

variable "instance_type" {
type = string
default = "t3a.medium"
}

variable "helm_charts_branch" {
Expand Down Expand Up @@ -95,3 +105,8 @@ variable "sample_app" {
type = string
default = ""
}

variable "eks_deployment_strategy" {
type = string
default = "DAEMON"
}
141 changes: 141 additions & 0 deletions test/e2e/envutils.go
Original file line number Diff line number Diff line change
@@ -0,0 +1,141 @@
// Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
// SPDX-License-Identifier: MIT

package e2e

import (
"fmt"
"os"
"os/exec"
"path/filepath"
"strings"

"github.com/aws/amazon-cloudwatch-agent-test/environment"
"github.com/aws/amazon-cloudwatch-agent-test/util/awsservice"
)

//------------------------------------------------------------------------------
// Environment Setup
//------------------------------------------------------------------------------

func InitializeEnvironment(env *environment.MetaData) error {
if env.Region != "us-west-2" {
if err := awsservice.ConfigureAWSClients(env.Region); err != nil {
return fmt.Errorf("failed to reconfigure AWS clients: %v", err)
}
fmt.Printf("AWS clients reconfigured to use region: %s\n", env.Region)
} else {
fmt.Printf("Using default testing region: us-west-2\n")
}

fmt.Println("Applying K8s resources...")
if err := ApplyResources(env); err != nil {
return fmt.Errorf("failed to apply K8s resources: %v", err)
}

return nil
}

//------------------------------------------------------------------------------
// K8s Resource Management Functions
//------------------------------------------------------------------------------

func ApplyResources(env *environment.MetaData) error {
updateKubeconfig := exec.Command("aws", "eks", "update-kubeconfig", "--name", env.EKSClusterName)
output, err := updateKubeconfig.CombinedOutput()
if err != nil {
return fmt.Errorf("failed to update kubeconfig: %w\nOutput: %s", err, output)
}

fmt.Println("Installing Helm release...")
helm := []string{
"helm", "upgrade", "--install", "amazon-cloudwatch-observability",
filepath.Join("..", "..", "..", "terraform", "eks", "e2e", "helm-charts", "charts", "amazon-cloudwatch-observability"),
"--set", fmt.Sprintf("clusterName=%s", env.EKSClusterName),
"--set", fmt.Sprintf("region=%s", env.Region),
"--set", fmt.Sprintf("agent.image.repository=%s", env.CloudwatchAgentRepository),
"--set", fmt.Sprintf("agent.image.tag=%s", env.CloudwatchAgentTag),
"--set", fmt.Sprintf("agent.image.repositoryDomainMap.public=%s", env.CloudwatchAgentRepositoryURL),
"--set", fmt.Sprintf("manager.image.repository=%s", env.CloudwatchAgentOperatorRepository),
"--set", fmt.Sprintf("manager.image.tag=%s", env.CloudwatchAgentOperatorTag),
"--set", fmt.Sprintf("manager.image.repositoryDomainMap.public=%s", env.CloudwatchAgentOperatorRepositoryURL),
"--namespace", "amazon-cloudwatch",
"--create-namespace",
}

if env.AgentConfig != "" {
agentConfigContent, err := os.ReadFile(env.AgentConfig)
if err != nil {
return fmt.Errorf("failed to read agent config file: %w", err)
}
helm = append(helm, "--set-json", fmt.Sprintf("agent.config=%s", string(agentConfigContent)))
}

helmUpgrade := exec.Command(helm[0], helm[1:]...)
helmUpgrade.Stdout = os.Stdout
helmUpgrade.Stderr = os.Stderr
if err := helmUpgrade.Run(); err != nil {
return fmt.Errorf("failed to install Helm release: %w", err)
}

fmt.Println("Waiting for CloudWatch Agent Operator to initialize...")
wait := exec.Command("kubectl", "wait", "--for=condition=available", "--timeout=60s", "deployment/amazon-cloudwatch-observability-controller-manager", "-n", "amazon-cloudwatch")
output, err = wait.CombinedOutput()
if err != nil {
return fmt.Errorf("failed to wait for operator deployment: %w\nOutput: %s", err, output)
}

deploymentName := strings.TrimSuffix(filepath.Base(env.SampleApp), ".yaml")

apply := exec.Command("kubectl", "apply", "-f", env.SampleApp)
output, err = apply.CombinedOutput()
if err != nil {
return fmt.Errorf("failed to apply sample app: %w\nOutput: %s", err, output)
}

fmt.Println("Waiting for Sample Application to initialize...")
wait = exec.Command("kubectl", "wait", "--for=condition=available", "--timeout=300s", fmt.Sprintf("deployment/%s", deploymentName), "-n", "test")
output, err = wait.CombinedOutput()
if err != nil {
return fmt.Errorf("failed to wait for deployment %s: %w\nOutput: %s", deploymentName, err, output)
}

return nil
}

func DestroyResources(env *environment.MetaData) error {
updateKubeconfig := exec.Command("aws", "eks", "update-kubeconfig", "--name", env.EKSClusterName)
output, err := updateKubeconfig.CombinedOutput()
if err != nil {
return fmt.Errorf("failed to update kubeconfig: %w\nOutput: %s", err, output)
}

var errors []error

fmt.Println("Deleting test namespace...")
deleteCmd := exec.Command("kubectl", "delete", "namespace", "test", "--timeout=60s")
output, err = deleteCmd.CombinedOutput()

// We don't want to consider not finding the namespace to be an error since that's the outcome we want
if err != nil && !strings.Contains(string(output), "not found") {
errors = append(errors, fmt.Errorf("failed to delete test namespace: %w\nOutput: %s", err, output))
}

fmt.Println("Uninstalling Helm release...")
helm := []string{
"helm", "uninstall", "amazon-cloudwatch-observability", "--namespace", "amazon-cloudwatch",
}

helmUninstall := exec.Command(helm[0], helm[1:]...)
helmUninstall.Stdout = os.Stdout
helmUninstall.Stderr = os.Stderr
if err := helmUninstall.Run(); err != nil {
errors = append(errors, fmt.Errorf("failed to uninstall Helm release: %w", err))
}

if len(errors) > 0 {
return fmt.Errorf("cleanup errors: %v", errors)
}

return nil
}
Loading
Loading