You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
skani currently outputs the full sequence header in its results. When the header is long and contains spaces, this output becomes difficult to read and parse. Displaying only the sequence identifier (the portion before the first white space) would align skani's output with the conventions of most other tools.
Proposed Solution:
Introduce an optional parameter to enable this behavior, allowing users to toggle between displaying the full header and only the sequence identifier.
The text was updated successfully, but these errors were encountered:
Thanks for raising this. I think including an option to toggle only the first token of the header is a good idea and agree with you.
I personally like having the whole header for readability: for many annotated genomes, the organism name can be informative and is not in the sequence identifier, hence the default behavior. Personally, this helps my sleuthing a lot. But it can get unruly and long. And given there should be no tabs in the whole header, I think parsing is mostly OK.
I'll mull over how to include this option. Maybe a --seq-id-only option
Description:
skani
currently outputs the full sequence header in its results. When the header is long and contains spaces, this output becomes difficult to read and parse. Displaying only the sequence identifier (the portion before the first white space) would alignskani
's output with the conventions of most other tools.Proposed Solution:
Introduce an optional parameter to enable this behavior, allowing users to toggle between displaying the full header and only the sequence identifier.
The text was updated successfully, but these errors were encountered: