-
Notifications
You must be signed in to change notification settings - Fork 40
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add end-to-end behavioural tests for the python CLI (#1313)
* Add python e2e tests with behave Signed-off-by: Partho Sarthi <[email protected]> * Add e2e test for HDFS platform Signed-off-by: Partho Sarthi <[email protected]> * Updates tests, docs and fix styling Signed-off-by: Partho Sarthi <[email protected]> * Remove unused subprocess Signed-off-by: Partho Sarthi <[email protected]> * Remove separate tox env for github actions Signed-off-by: Partho Sarthi <[email protected]> * Update docs Signed-off-by: Partho Sarthi <[email protected]> * Refactor feature files Signed-off-by: Partho Sarthi <[email protected]> * Address review comments Signed-off-by: Partho Sarthi <[email protected]> * Improve current file path calculation Signed-off-by: Partho Sarthi <[email protected]> * Address review feedbacks - Add E2E_TOOLS prefix - Fail if HDFS already running - Add verbose output - Move util functions to class Signed-off-by: Partho Sarthi <[email protected]> * Update docs about verbose mode Signed-off-by: Partho Sarthi <[email protected]> * Fix logger Signed-off-by: Partho Sarthi <[email protected]> * Fix flake8 Signed-off-by: Partho Sarthi <[email protected]> * Address review comments Signed-off-by: Partho Sarthi <[email protected]> --------- Signed-off-by: Partho Sarthi <[email protected]>
- Loading branch information
Showing
22 changed files
with
1,285 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,172 @@ | ||
# Spark Rapids Tools End-to-End Behavior Tests | ||
|
||
This document outlines the end-to-end tests for Spark Rapids tools, designed to cover scenarios such as missing | ||
dependencies, handling different types of event logs, and interacting with HDFS. | ||
|
||
## Directory Structure | ||
```commandline | ||
user_tools/tests/spark_rapids_tools_e2e/ | ||
├── features # Contains test scenarios and environment setup. | ||
│ ├── environment.py # Setup and teardown procedures for the tests. | ||
│ ├── steps # Step definitions for the tests. | ||
│ └── *.feature # Feature files defining test scenarios. | ||
└── resources # Resources used in the tests. | ||
├── event_logs | ||
└── scripts # Scripts used in the tests. | ||
``` | ||
Configurations for `behave` tests are defined in `user_tools/tox.ini` file. | ||
|
||
## Setup | ||
|
||
From the `<repo_root>/user_tools` directory, run the following command to install the required dependencies: | ||
|
||
|
||
```sh | ||
pip install behave | ||
# or | ||
pip install .[test] | ||
``` | ||
|
||
|
||
## Running Tests | ||
Tests can be run using 'behave' cmd or using 'tox' cmd. | ||
|
||
**Basic Usage:** | ||
|
||
```sh | ||
behave <options> | ||
# or | ||
tox -e behave -- <options> | ||
``` | ||
|
||
**Run All Tests:** | ||
|
||
```sh | ||
behave | ||
# or | ||
tox -e behave | ||
``` | ||
|
||
### Common Options | ||
|
||
**Run Specific Tests by Tag** | ||
|
||
See the [Tags Format](#tags-format) section for more information on tags. | ||
|
||
```sh | ||
behave --tags <tag> | ||
# or | ||
tox -e behave -- --tags <tag> | ||
``` | ||
|
||
**Run Specific Tests by Name** | ||
|
||
```sh | ||
behave --name <scenario_name> | ||
# or | ||
tox -e behave -- --name <scenario_name> | ||
``` | ||
|
||
**Skip Tests by Tag** | ||
|
||
```sh | ||
behave --tags ~<tag> | ||
# or | ||
tox -e behave -- --tags ~<tag> | ||
``` | ||
|
||
**Custom Arguments** | ||
- Custom arguments can be passed to the behave tests using the `-D` flag. | ||
- Example: Skip building the Tools jar during setup. | ||
|
||
```sh | ||
behave -D build_jar=false # Skip building the Tools jar during setup (default: true) | ||
# or | ||
tox -e behave -- -D build_jar=false | ||
``` | ||
|
||
**Verbose Mode** | ||
- When verbose mode is enabled, `STDOUT` and `STDERR` from all subprocesses executed during the test run are shown in the console. | ||
```sh | ||
behave -v | ||
# or | ||
tox -e behave -- -v | ||
``` | ||
|
||
## Notes | ||
|
||
### Tags Format | ||
Tags are used to uniquely identify test scenarios and are defined in the following format: `@test_id_<feature>_<00xx>`. | ||
- `<feature>`: Acronym for the feature file being tested. Examples: | ||
- `ELP` for `event_log_processing.feature` | ||
- `IC` for `installation_checks.feature` | ||
- `<00xx>`: Unique 4-digit identifier for the test scenario. Examples: `0001`, `0002`. | ||
|
||
Tags Example: `@test_id_ELP_0001`, `@test_id_IC_0002`. | ||
|
||
### Built-in Setup Steps | ||
|
||
The tests include the following setup steps: | ||
|
||
1. Build Spark Rapids Tools JAR: | ||
- By default, the JAR is built before running the tests. | ||
- To skip this step (e.g., if the JAR is already built), use the argument -D build_jar=false. | ||
2. Build the Python Package. | ||
|
||
The test warns the user that initial setup may take a few minutes. | ||
|
||
### Built-in HDFS Cluster Setup | ||
|
||
- Some of the tests include configuring a local HDFS cluster. Step: `HDFS is "{status}"` | ||
- This step downloads Hadoop binaries and sets up the cluster. | ||
- The download occurs only once per machine but cluster setup is done for each test run. | ||
- Download step may take a few minutes. | ||
- Tests involving HDFS are tagged with `@long_running` and can be skipped using `--tags ~@long_running` | ||
|
||
#### HDFS Configuration: | ||
- Replication factor: 1 | ||
- Disk Space Quota: 2GB | ||
- Temp Directory: `/tmp/spark_rapids_tools_e2e_tests` | ||
- Temp Directory can be changed using the argument `-D e2e_tests_tmp_dir=<dir_path>` during test run. | ||
|
||
#### Cleanup | ||
- Step `HDFS is "{status}"` sets an after scenario hook to stop up the HDFS cluster and remove the temporary directories. | ||
- It does not clean up the Hadoop binaries downloaded during the setup. | ||
- Cleanup can be done manually using the below script: | ||
```sh | ||
<repo_root>/user_tools/tests/spark_rapids_tools_e2e/resources/scripts/hdfs/cleanup_hdfs.sh | ||
``` | ||
|
||
## Debugging Tests in IDE: | ||
|
||
- Ensure the Python interpreter is set to the correct virtual environment and `JAVA_HOME` is set. | ||
|
||
**IntelliJ** | ||
- Add a Python run configuration with module name: `behave` and working directory: `<repo-root>/user_tools`. | ||
- Add required arguments in `Script parameters` field. | ||
|
||
Sample Run Configuration: | ||
![resources/debug-behave-intellij.png](resources/debug-behave-intellij.png) | ||
|
||
**VS Code** | ||
- Open or create the `.vscode/launch.json` file. Add the following configuration with required arguments: | ||
```json | ||
{ | ||
"configurations": [ | ||
{ | ||
"name": "Python: Spark Rapids Tools E2E Tests", | ||
"type": "debugpy", | ||
"request": "launch", | ||
"module": "behave", | ||
"args": [], | ||
"python": "${command:python.interpreterPath}", | ||
"cwd": "${workspaceFolder}/user_tools" | ||
} | ||
] | ||
} | ||
``` | ||
|
||
|
||
## Guidelines for Writing Tests | ||
|
||
TODO: Add guidelines and conventions for writing tests. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
119 changes: 119 additions & 0 deletions
119
user_tools/tests/spark_rapids_tools_e2e/features/environment.py
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,119 @@ | ||
# Copyright (c) 2024, NVIDIA CORPORATION. | ||
# | ||
# Licensed under the Apache License, Version 2.0 (the "License"); | ||
# you may not use this file except in compliance with the License. | ||
# You may obtain a copy of the License at | ||
# | ||
# http://www.apache.org/licenses/LICENSE-2.0 | ||
# | ||
# Unless required by applicable law or agreed to in writing, software | ||
# distributed under the License is distributed on an "AS IS" BASIS, | ||
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
# See the License for the specific language governing permissions and | ||
# limitations under the License. | ||
|
||
""" | ||
This module defines environment setup and teardown functions for the end-to-end tests using behave. | ||
""" | ||
|
||
import os | ||
import shutil | ||
import tempfile | ||
|
||
from spark_rapids_tools.utils import Utilities | ||
from steps.e2e_utils import E2ETestUtils | ||
|
||
""" Define behave hooks for the tests. These hooks are automatically called by behave. """ | ||
|
||
logger = E2ETestUtils.get_logger() | ||
|
||
|
||
def before_all(context) -> None: | ||
""" | ||
Set up the environment for the tests. This function is automatically called before all the tests. | ||
""" | ||
context.temp_dir = tempfile.mkdtemp() | ||
_set_environment_variables(context) | ||
_set_verbose_mode(context) | ||
_setup_env(context) | ||
|
||
|
||
def after_all(context) -> None: | ||
""" | ||
Clean up the environment after the tests. This function is automatically called after all the tests. | ||
""" | ||
_clear_environment_variables() | ||
shutil.rmtree(context.temp_dir) | ||
|
||
|
||
def before_scenario(context, scenario) -> None: | ||
if "skip" in scenario.effective_tags: | ||
scenario.skip("Marked with @skip") | ||
return | ||
|
||
|
||
def after_scenario(context, scenario) -> None: | ||
""" | ||
Clean up the environment after each scenario. This function is automatically called after each scenario. | ||
Steps must set the callback function using set_after_scenario_fn() to perform any cleanup. | ||
""" | ||
if hasattr(context, 'after_scenario_fn'): | ||
context.after_scenario_fn() | ||
|
||
|
||
def _set_verbose_mode(context) -> None: | ||
verbose_enabled = getattr(context.config, 'verbose', False) | ||
if verbose_enabled: | ||
context.config.stdout_capture = False | ||
context.config.stderr_capture = False | ||
os.environ['E2E_TEST_VERBOSE_MODE'] = str(verbose_enabled).lower() | ||
|
||
|
||
def _set_environment_variables(context) -> None: | ||
""" | ||
Set environment variables needed for the virtual environment setup. | ||
""" | ||
tools_version = Utilities.get_base_release() | ||
scala_version = context.config.userdata.get('scala_version') | ||
venv_name = context.config.userdata.get('venv_name') | ||
jar_filename = f'rapids-4-spark-tools_{scala_version}-{tools_version}-SNAPSHOT.jar' | ||
build_jar_value = context.config.userdata.get('build_jar') | ||
build_jar = build_jar_value.lower() in ['true', '1', 'yes'] | ||
|
||
os.environ['E2E_TEST_TOOLS_DIR'] = E2ETestUtils.get_tools_root_path() | ||
os.environ['E2E_TEST_SCRIPTS_DIR'] = os.path.join(E2ETestUtils.get_e2e_tests_resource_path(), 'scripts') | ||
os.environ['E2E_TEST_TOOLS_JAR_PATH'] = os.path.join(os.environ['E2E_TEST_TOOLS_DIR'], | ||
f'core/target/{jar_filename}') | ||
os.environ['E2E_TEST_VENV_DIR'] = os.path.join(context.temp_dir, venv_name) | ||
os.environ['E2E_TEST_BUILD_JAR'] = 'true' if build_jar else 'false' | ||
os.environ['E2E_TEST_SPARK_BUILD_VERSION'] = context.config.userdata.get('buildver') | ||
os.environ['E2E_TEST_HADOOP_VERSION'] = context.config.userdata.get('hadoop.version') | ||
os.environ['E2E_TEST_TMP_DIR'] = context.config.userdata.get('e2e_tests_tmp_dir') | ||
|
||
|
||
def _setup_env(context) -> None: | ||
""" | ||
Build the JAR and set up the virtual environment for the tests. | ||
""" | ||
script_file_name = context.config.userdata.get('setup_script_file') | ||
script = os.path.join(os.environ['E2E_TEST_SCRIPTS_DIR'], script_file_name) | ||
try: | ||
warning_msg = "Setting up the virtual environment for the tests. This may take a while." | ||
if os.environ.get('BUILD_JAR') == 'true': | ||
warning_msg = f'Building JAR and {warning_msg}' | ||
logger.warning(warning_msg) | ||
result = E2ETestUtils.run_sys_cmd([script]) | ||
E2ETestUtils.assert_sys_cmd_return_code(result, | ||
exp_return_code=0, | ||
error_msg="Failed to create virtual environment") | ||
except Exception as e: # pylint: disable=broad-except | ||
raise RuntimeError(f"Failed to create virtual environment. Reason: {str(e)}") from e | ||
|
||
|
||
def _clear_environment_variables() -> None: | ||
""" | ||
Clear environment variables set for the virtual environment setup. | ||
""" | ||
env_vars = ['SCRIPTS_DIR', 'VENV_DIR', 'TOOLS_JAR_PATH'] | ||
for key in env_vars: | ||
os.environ.pop(key, None) |
43 changes: 43 additions & 0 deletions
43
user_tools/tests/spark_rapids_tools_e2e/features/event_log_processing.feature
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,43 @@ | ||
# Copyright (c) 2024, NVIDIA CORPORATION. | ||
# | ||
# Licensed under the Apache License, Version 2.0 (the "License"); | ||
# you may not use this file except in compliance with the License. | ||
# You may obtain a copy of the License at | ||
# | ||
# http://www.apache.org/licenses/LICENSE-2.0 | ||
# | ||
# Unless required by applicable law or agreed to in writing, software | ||
# distributed under the License is distributed on an "AS IS" BASIS, | ||
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
# See the License for the specific language governing permissions and | ||
# limitations under the License. | ||
|
||
Feature: Event Log Processing | ||
|
||
@test_id_ELP_0001 | ||
Scenario Outline: Tool spark_rapids runs with different types of event logs | ||
When spark-rapids tool is executed with "<event_logs>" eventlogs | ||
Then stderr contains the following | ||
""" | ||
<expected_stderr> | ||
""" | ||
And processed applications is "0" | ||
And return code is "0" | ||
|
||
Examples: | ||
| event_logs | expected_stderr | | ||
| invalid_path_eventlog | process.failure.count = 1;invalid_path_eventlog not found, skipping! | | ||
| gpu_eventlog.zstd | process.skipped.count = 1;GpuEventLogException: Cannot parse event logs from GPU run: skipping this file | | ||
| photon_eventlog.zstd | process.skipped.count = 1;PhotonEventLogException: Encountered Databricks Photon event log: skipping this file! | | ||
| streaming_eventlog.zstd | process.skipped.count = 1;StreamingEventLogException: Encountered Spark Structured Streaming Job: skipping this file! | | ||
| incorrect_app_status_eventlog.zstd | process.NA.count = 1;IncorrectAppStatusException: Application status is incorrect. Missing AppInfo | | ||
|
||
@test_id_ELP_0002 | ||
Scenario: Qualification tool JAR crashes | ||
Given thread to crash qualification tool has started | ||
When spark-rapids tool is executed with "join_agg_on_yarn_eventlog.zstd" eventlogs | ||
Then stderr contains the following | ||
""" | ||
Qualification. Raised an error in phase [Execution] | ||
""" | ||
And return code is "1" |
Oops, something went wrong.