Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exclude loci or taxa from data export if they did not sequence/capture above a threshold #89

Open
djlohman opened this issue Feb 20, 2022 · 2 comments

Comments

@djlohman
Copy link
Collaborator

djlohman commented Feb 20, 2022

Export option to exclude loci not present in X% of taxa.
Export option to exclude taxa with fewer than X % of loci.
It should be possible to use both of the above selective criteria at once for any data export format.
When data are excluded there should be a report of what was left out.

@djlohman djlohman added this to the SequenceMatrix v2.0 milestone Feb 20, 2022
@gaurav
Copy link
Owner

gaurav commented Apr 25, 2022

I'm trying to figure out what the best user interface for this would be:

  1. Have each of these be a menu option, but I think it'd be nice for users to be able to see which loci and taxa they're exporting.
  2. An "Export dataset by criteria ..." menu item, which opens a dialog box that provides the following options:
    • Exclude loci not present in [100%] of taxa.
      • This will exclude the following loci: [List]
    • Exclude taxa not present in [0%] of loci.
      • This will exclude the following taxa: [List]
    • [A table, similar to the main table, showing what will be output]
    • Buttons that provide exports in the formats listed in the "Export ..." menu, as well as an additional button for "Open filtered dataset in a new window".
  3. A "Filter by criteria ..." menu item, which would open a dialog box identical to that above, but which would only have an "Open filtered dataset in a new window" button. Once you do that, you can then export the new dataset in whichever formats you like.

What do you think?

@gaurav
Copy link
Owner

gaurav commented Apr 28, 2022

As per https://twitter.com/RobLanfear/status/1519534783050952704, it might also be useful to filter taza/loci that have more or less than a certain threshold of gaps/unknowns.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants