Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
check for new filenames before merge/subset
python logic written in separate library - looks for any filenames present in the current datatype sheet (embedded in script) - excludes any filenames which were in use before the introduction of updated filenames - ignore normal sample files (which are ignored by importer) - matches filenames in directory against wildcard patterns (up to a single asterisk) - all filenames of the form *_data_cna_hg18.seg will be allowed (hg18/hg19 and data/meta) - these will not prevent the merge/subset scripts from running (determining cancer_study_id in all cases was not feasible) perform check from subset-impact-data.sh and merge.py and exit on noticing new filename patterns present.
- Loading branch information