Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Influenza: Number of Sequences over time graphs for multiple clades #475

Open
anna-parker opened this issue Jan 15, 2025 · 3 comments
Open

Comments

@anna-parker
Copy link
Contributor

I saw these lovely graphs in https://www.science.org/doi/10.1126/science.adq0072#abstract, they give a very clear and concise overview of the proportion of a specific variant/clade of all sequences at a point in time. I think that these graphs are easier to understand than our current Number of Sequences over Time graphs and extend better to multiple clades - I could see them being immediately useful for influenza. However, we might have to limit the number of clades/variants that can viewed at a specific time.

image

@chaoran-chen
Copy link
Member

Yes, agreed, that would be very useful! I wonder, should this be a new component or an extension of the prevalence-over-time?

If we extend prevalence-over-time, we could add an optional field like stratifyByField. The bar chart would then be stacked as shown in your example and this would also work if more than one dataset is provided. But I'm not sure how the line and bubble chart would work with multiple datasets (see storybook) – but maybe be can start with disabling the line and bubble charts for now if both stratifyByField and multiple datasets are provided?

@anna-parker
Copy link
Contributor Author

I was thinking we could extend the Number of Sequences over Time graph then we do not need to think about how to modify the confidence intervals that are in prevalence over time line graph, prevalence is always a ratio to the total and here the bars should add up to a total so it is more Number of Sequences over Time in my understanding

@chaoran-chen
Copy link
Member

Ah, yes, sounds good, we can start with Number of Sequences over Time which should be easier. The second row of the plot that you posted shows the proportions and I can see use cases for both but starting with the simpler version makes sense to me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants