-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GEN-1313] Export detailed columns for NAACCR codes #567
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code looks good so far! i would just adjust the tests so they pass
Close pr for now to rebuild docker image |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Great work! Just a couple of comments but going to pre-approve. Also just a heads up, i think you have to merge in changes from develop
genie/consortium_to_public.py
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a comment. Nothing actionable here. I think it's OK we don't have unit tests for these giant functions: stagingToCbio
, store_clinical_files
and consortiumToPublic
in the database_to_staging
and consortium_to_public
code just because these are better tested using integration tests and we already did that with our pipeline comparisons and our test runs on the test pipeline.
Doing this would be outside the scope of this ticket and is already part of our tech debt epic to refactor and add tests.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I agree that we add unit tests later for these functions for our tech debt epic.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The more we can spin out "unit functions" within these larger functions, the more they will be better served as integration tests.
If we think of these as ETL, it should tell the story of data processing.
Quality Gate passedIssues Measures |
Problem:
Need to export detailed columns for NAACCR codes
Solution:
Add the detailed columns when corresponding race, sex, and ethnicity columns are available.
Test:
Tested locally and ran through validation and processing steps successfully with expected results.