-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve the error messages in case of a genotype data parsing failure #224
Comments
We can definitely include a size-check for the genotype data, which I'm happy to include into our validation pipeline. |
It has become clear that a size-check is not possible. But independent of that I was hoping for more. I hope we could get an error message that looks something like this:
Is this science fiction with our current implementation? |
OK, so going back to size checks: Since we do have the snpSet (1240K, HumanOrigins, Other) in the YAML file, we should actually be able to give a size-check warning after all, at least in cases where it's either 1240K or HumanOrigins. We can hardcode the expected number of SNPs for these categories and then use the number of of individuals to compute an expected byte size of the I'll work on that. |
I think it's not science fiction. My sequence-formats parsers can provide all that information, it's just a matter of having all the data ready to create that error message, which might involve some refactoring here and there. I'll look into it. |
At the moment an issue in the genotype data is always reduced to
That often does not help to identify and solve the underlying issue, because it omits in which package + SNP (+ individual?) the problem occurred. If such an error comes up in a big forge, debugging becomes a search for the needle in the haystack. The short snipped of the relevant chunk in the error message above can be pointless, when the genotype data is in a binary format.
I wonder if there is a way to include additional, crucial information in this error message.
The text was updated successfully, but these errors were encountered: