Think about how to speed up semantic similarity #29

Open
BlancaCalvo opened this issue Jun 30, 2022 · 3 comments

Comments

BlancaCalvo commented Jun 30, 2022

  • Parallelize
  • Pre-filter Telegram messages with keywords (maybe Menéame too)
  • Pre-filter by length
  • Create windows to concatenate conversations
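
A rough sketch of what the pre-filtering and windowing ideas above could look like, assuming messages are plain strings in conversation order. The names `prefilter` and `make_windows`, the keyword list, and the word-count thresholds are hypothetical and not taken from the repository; the surviving texts would then go to the existing semantic similarity step.

```python
from typing import List, Sequence


def prefilter(messages: Sequence[str],
              keywords: Sequence[str],
              min_words: int = 3,
              max_words: int = 200) -> List[str]:
    """Keep messages within a word-count range that mention any keyword.

    Thresholds are illustrative assumptions, not tuned values.
    """
    lowered = [k.lower() for k in keywords]
    kept = []
    for text in messages:
        n_words = len(text.split())
        # Length filter: very short or very long messages are dropped first,
        # since this check is cheaper than the keyword scan.
        if not (min_words <= n_words <= max_words):
            continue
        low = text.lower()
        # Keyword filter: simple case-insensitive substring match.
        if any(k in low for k in lowered):
            kept.append(text)
    return kept


def make_windows(messages: Sequence[str],
                 size: int = 3,
                 stride: int = 1) -> List[str]:
    """Concatenate consecutive messages into sliding windows."""
    if size <= 0 or stride <= 0:
        raise ValueError("size and stride must be positive")
    if not messages:
        return []
    # If there are fewer messages than `size`, emit one shorter window.
    last_start = max(len(messages) - size, 0)
    return [" ".join(messages[i:i + size])
            for i in range(0, last_start + 1, stride)]
```

Filtering before encoding reduces the number of texts the model has to embed, which is where most of the time goes; windows trade more texts for more context per text, so the two ideas pull in opposite directions and would need to be balanced.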
cuquiwi commented Jun 30, 2022

These seem too complicated for just a small speed-up.
I will parallelize. In theory, the VM's current memory lets us use 2 threads without swapping.

cuquiwi commented Jul 1, 2022

In the end, parallelizing the encoding of the messages turned out to be a bad idea (it may not even be possible), and the encoding already uses multiprocessing.
So I will try Asier's method.
