Write end-to-end test suite #45

JackCollins91 · 2024-11-23T15:49:22Z

Closes Issue #11
Replaces PR #39 (which I will close unmerged after creating this).

This PR introduces two unit tests that provide end-to-end coverage for the RSS and SEC search functionality. Each test runs a simple CLI command and verifies that the output file:

- Is created successfully.
- Matches the expected data structure.

To maintain tidiness, the patch decorator is used to prevent actual output files from being created during testing. These tests ensure that any application relying on edgar-tool's data structure will remain unaffected by internal changes to the edgar-tool logic.
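
As a rough illustration of the patching approach described above, the sketch below swaps a file-writing function for a mock, so the test can inspect what would have been written without creating a file. `Exporter` and `run_search` are hypothetical stand-ins defined in the snippet, not edgar-tool's real code; only the general `unittest.mock.patch` technique is taken from this PR.

```python
from unittest.mock import patch


class Exporter:
    """Hypothetical stand-in for the module that writes results to disk."""

    @staticmethod
    def write_results_to_file(rows, path):
        with open(path, "w", encoding="utf-8") as handle:
            for row in rows:
                handle.write(",".join(row.values()) + "\n")


def run_search(path):
    """Hypothetical entry point: builds rows, then hands them to the writer."""
    rows = [{"root_form": "10-K", "form_name": "Annual report"}]
    Exporter.write_results_to_file(rows, path)


@patch.object(Exporter, "write_results_to_file")
def test_search_passes_structured_rows_to_writer(mock_write):
    run_search("results.csv")

    # The mock records the call instead of creating a file.
    mock_write.assert_called_once()
    rows, path = mock_write.call_args.args
    assert path == "results.csv"
    assert set(rows[0]) == {"root_form", "form_name"}
```

One consequence of this design is that the real writer never runs, so the test checks the data handed to the writer rather than the bytes on disk.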

Notes

Test Warnings Instead of Failures
These tests may fail due to external factors, such as:
- Internet connectivity issues.
- Changes to the SEC’s data structure.
To address this, the tests issue warnings instead of outright failures. This prevents us from being blocked by unit test failures caused by changes outside the scope of the edgar-tool code.
Question: Is this approach too lenient?
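
For reference, a minimal sketch of the warn-instead-of-fail pattern under discussion. `fetch_feed` and `check_feed_structure` are hypothetical names, and the network failure is simulated so the snippet runs offline.

```python
import warnings


def fetch_feed():
    """Hypothetical network call; it always fails here to simulate an outage."""
    raise ConnectionError("simulated network outage")


def check_feed_structure():
    """Return True if the check passed, False if it was downgraded to a warning."""
    try:
        payload = fetch_feed()
        assert "entries" in payload
        return True
    except Exception as exc:
        # Downgrade any failure to a warning so the test run is not blocked.
        warnings.warn(
            f"An exception occurred: {exc}\n"
            "There might be an issue with accessing the SEC website.",
            UserWarning,
        )
        return False
```

Note that `except Exception` also catches `AssertionError`, so with this pattern a genuine structural regression would surface only as a warning as well.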
Asserting Actual Values vs. Structure
While we discussed asserting actual values in the returned data, this could result in overly rigid tests. Changes to the SEC’s backend could cause slight variations in return values, making strict assertions problematic.

Instead, I propose a middle ground: asserting only the correctness of the data structure. This approach is particularly necessary for the RSS feed, where data changes frequently.
For older SEC data, we might expect greater stability, but I am not confident enough to assert this. I'm open to revisiting this approach if desired.
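
A structure-only assertion could look roughly like the sketch below. The field names `root_form` and `form_name` appear in the review discussion of this PR; the assumption that they are strings, and the `assert_structure` helper itself, are illustrative only.

```python
# Assumed schema: field name -> expected Python type (illustrative only).
EXPECTED_FIELDS = {"root_form": str, "form_name": str}


def assert_structure(rows):
    """Check field names and types for every row, ignoring the actual values."""
    assert rows, "expected at least one row"
    for row in rows:
        missing = EXPECTED_FIELDS.keys() - row.keys()
        assert not missing, f"missing fields: {sorted(missing)}"
        for name, expected_type in EXPECTED_FIELDS.items():
            assert isinstance(row[name], expected_type), (
                f"{name} should be {expected_type.__name__}"
            )
```

Because only keys and types are checked, two result sets with entirely different values both pass, while a renamed or dropped field fails.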
Future Changes Related to #40 (Create new URL encoding function to align with SEC API)
The changes in #40 (Create new URL encoding function to align with SEC API) are currently in progress. While the URL generation code resides in edgar_tool\url_generator.py, it has not yet been integrated into edgar_tool\cli.py.

A future PR will cut over text-search functionality to use the new url_generator.
The tests introduced in this PR do not duplicate any functionality that would become redundant after the cutover. Instead, they ensure that the output data structure remains unchanged following this update.

JackCollins91 · 2024-11-23T15:51:20Z

Review appreciated from @jordan-gillard and @GalenReich whenever available :)

jordan-gillard

MAN. So so sorry for the incredibly late review. I promise to follow up much quicker in the future. Hope all is well & happy new year!

jordan-gillard · 2025-01-09T02:02:57Z

tests/test_cli.py

+    except Exception as e:
+        # Log a warning instead of failing the test
+        warnings.warn(
+            f"An exception occurred: {str(e)}\n"
+            "There might be an issue with accessing the SEC website or the SEC's return payload.",
+            UserWarning
+        )


How about we let it fail? We can also configure the test suite to run daily and not just on PRs. A warning log can easily go unnoticed.
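
To illustrate how a warning can slip by unnoticed, and one way to make it fail loudly, here is a small sketch using the standard `warnings` module. `flaky_check` and `run_strict` are hypothetical helpers, not part of this PR.

```python
import warnings


def flaky_check():
    """Hypothetical check that reports a problem as a warning only."""
    warnings.warn("payload structure changed", UserWarning)
    return "completed"


def run_strict(check):
    """Run a check with UserWarning promoted to an exception."""
    with warnings.catch_warnings():
        warnings.simplefilter("error", UserWarning)
        return check()
```

Called directly, `flaky_check()` returns normally even though something went wrong; under `run_strict(flaky_check)` the same warning is raised as a `UserWarning` exception. Simply letting the original exception propagate, as suggested above, achieves the same visibility with less machinery.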

jordan-gillard · 2025-01-09T02:12:36Z

tests/test_cli.py

@@ -27,3 +29,90 @@ def test_cli_should_return_help_string_when_passed_no_args():
    # THEN
    assert result.returncode == 0
    assert result.stdout.strip() == expected.strip()
+
+@patch("edgar_tool.text_search.write_results_to_file")


What would you think about using pytest's tmp_path fixture? It was made for this exact thing! 🪄

    def test_example(tmp_path):
        # GIVEN
        output_file = tmp_path / "results.csv"

        # WHEN
        SecEdgarScraperCli.text_search(
            "John Doe",
            output=str(output_file),
            start_date="2021-01-01",
            end_date="2021-01-31"
        )

        # THEN
        written_data = output_file.read_text()
        # And you can make your assertions directly against the content
        # of the file - it's truly end-to-end
        assert "'root_form', 'form_name'" in written_data
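
A self-contained variant of the same idea, runnable without edgar-tool, might look like the sketch below. `write_results` is a hypothetical stand-in for the CLI's file-writing step; only the `tmp_path` fixture is real pytest API.

```python
import csv
from pathlib import Path


def write_results(rows, output):
    """Hypothetical stand-in for the CLI's file-writing step."""
    with open(output, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=["root_form", "form_name"])
        writer.writeheader()
        writer.writerows(rows)


def test_results_file_has_expected_header(tmp_path: Path):
    # GIVEN a path inside pytest's per-test temporary directory
    output_file = tmp_path / "results.csv"

    # WHEN the (hypothetical) writer produces a real file
    write_results(
        [{"root_form": "10-K", "form_name": "Annual report"}],
        str(output_file),
    )

    # THEN the file on disk has the expected columns and one data row
    with open(output_file, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        assert reader.fieldnames == ["root_form", "form_name"]
        assert len(list(reader)) == 1
```

Since pytest creates `tmp_path` fresh for each test, the real writer can run without leaving files in the repository, which removes the need for the patch.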

jordan-gillard · 2025-01-09T02:13:15Z

tests/test_cli.py

+            f"An exception occurred: {str(e)}\n"
+            "There might be an issue with accessing the RSS feed or the return payload.",
+            UserWarning
+        )


Make sure to run those git-commit hooks 🙃

Suggested change

)

)

Update test_cli.py

47881f0

JackCollins91 mentioned this pull request Nov 23, 2024

Write-test-suite-#11 #39

Closed

jordan-gillard requested changes Jan 9, 2025
