-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Advanced error model(s) #6
Comments
Alright let's get back on this. A few more TODOs!
I'm focusing on hiseq data atm, once we get the functions right, it should be easy to add models for different machines. |
For the indels:
|
Concerning the error model itself, the per read sequence quality looks nice, the per sequence mean quality still looks a bit fake-ish I might want to switch to a 2D KDE (taking the mean quality into account?), but that's more of a v1 issue. An alpha could be released with somewhat poor modelling of the average sequence quality. |
Closed (for now?) with 06b16f3 🚀 |
Re-opened for refining the error model:
|
nvm i fixed it |
closed with the last release |
Issue to track the progress on the Roadmap item "Add a more complicated error model"
This is where the fun begins.
The current plan is to:
MD tag andCIGAR string.(samtools calmd?)It worked well with CIGAR* It will be very important to track every step used to generate the profiles: genomes download, mapping options, ... I will keep everything in evernote for now but will dump it in the software doc later on.
** I'll need pysam as a dependency.
Hopefully it will not add difficulties to the install processPysam installed seamlessly*** something of the like (not sure yet, subject to change!):
The text was updated successfully, but these errors were encountered: