Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When too many query/ref pairs are reported, final_stats uses too much of memory due to its naive Python implementation #185

Open
karel-brinda opened this issue Sep 23, 2022 · 0 comments
Labels
bug Something isn't working paper

Comments

@karel-brinda
Copy link
Owner

Happens with eg 1M queries

This is the problematic part:
https://github.com/karel-brinda/mof-search/blob/e79b0c842ed919f1787a3071a52065d2317c8f71/scripts/final_stats.py#L109

Probably should be possible optimize by that the output is sorted according to ref (so it's sufficient to keep just stats for the last ref in memory)

@karel-brinda karel-brinda added the bug Something isn't working label Sep 23, 2022
@karel-brinda karel-brinda self-assigned this Sep 23, 2022
@karel-brinda karel-brinda changed the title When many query/ref pairs are reported, final_stats uses to much of memory When many query/ref pairs are reported, final_stats uses too much of memory Sep 23, 2022
@karel-brinda karel-brinda removed their assignment Dec 6, 2022
@leoisl leoisl added the paper label Jan 10, 2023
@karel-brinda karel-brinda changed the title When many query/ref pairs are reported, final_stats uses too much of memory When too many query/ref pairs are reported, final_stats uses too much of memory due to its naive Python implementation May 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working paper
Projects
None yet
Development

No branches or pull requests

2 participants