Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Work on embedding tuning for medical/pharmacology data #6

Open
strayMat opened this issue Mar 8, 2018 · 0 comments
Open

Work on embedding tuning for medical/pharmacology data #6

strayMat opened this issue Mar 8, 2018 · 0 comments

Comments

@strayMat
Copy link
Collaborator

strayMat commented Mar 8, 2018

Find datasets of french medical/parmacology and train fasttext embeddings on them:

Use this notebook to construct a training corpus (as for now, the notebook contains only the two precedent sources but we can consider to add more).

Use the fasttext and there cheatsheet to train the embedding.

The fasttext command should be something like :

./fasttext skipgram -input data.txt -output model -pretrainedVectors pretrained -dim 300

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant