Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to estimate read count #19

Open
yejunbin opened this issue Oct 12, 2024 · 1 comment
Open

How to estimate read count #19

yejunbin opened this issue Oct 12, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@yejunbin
Copy link

Hi,

Is sylph possible to estimate read count for each genome or taxonomy, like metaphlan or kraken?

thanks

@bluenote-1577
Copy link
Owner

Maybe I will add an option for estimating the read count. Sylph does not classify reads directly, so only an estimate can be provided.

For now, you can estimate the read count for sylph by doing the following:

  1. Use the -u option. This multiplies the Sequence abundance column by the % of classified reads.

  2. Multiply the Sequence abundance of each row by the # of reads in your dataset. So if your fastq file has 3M reads and a genome has sequence abundance 5%, then it should have 150k reads assigned to it.

I'll probalby add a feature to do this in a new update.

@bluenote-1577 bluenote-1577 added the enhancement New feature or request label Oct 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants