Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Check: Chromosomes in stats file need to be subset of those in genome (index) #15

Open
vinjana opened this issue Nov 15, 2018 · 1 comment

Comments

@vinjana
Copy link
Member

vinjana commented Nov 15, 2018

Chromosomose in the stats file (chromosome lengths ATCG) need to be checked to be a subset of those in the FASTA file/index. If they are not, then probably the workflows is misconfigured. A descriptive error message needs to be thrown. All tools taking the CHROM_SIZES_FILE should do this check. Similarly, alse the CHROMOSOME_INDICES variable of the WGBS workflow should get checked

Failure to check may result in partial statistics or statistics mislabeled, e.g. in case of xenograph alignments, without any error raised. In the worst case these remain unnoticed. Alternatively, an incorrect CHROM_SIZES_FILE may yield an unspecific message from the filter_readbins.pl script (that also occurs in other situations, e.g. if the input is truncated due to pipe-errors).

This should be checked early in the job scripts or earlier, e.g. in checkExecutability() in the Java code. If there is no BAM file yet, it should be checked against the BWA index directly to prevent mapping against the wrong INDEX. For the BWA index the .annfile can be used.

@vinjana
Copy link
Member Author

vinjana commented Nov 16, 2018

Check out #23

@vinjana vinjana changed the title Check: Chromosomes in stats file need to be subset of those in FASTA Check: Chromosomes in stats file need to be subset of those in genome (index) Jan 16, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant