Train the aligner on Inuktitut #2
Put two small files into the training dir
$ ls inuktitut_train/
Train aligner
$ ./align.py -t inuktitut_train inuktitut_train/
Run aligner
$ ./align_ex.sh inuktitut_train/ConversationInuit9-11_extract.wav inuktitut_train/ConversationInuit9-11_extract.lab
How to adjust pruning
HTK p. 43 (phones0.mlf):
The -t option (of HERest) sets the pruning thresholds to be used during training. Pruning limits the range of state alignments that the forward-backward algorithm includes in its summation, and it can reduce the amount of computation required by an order of magnitude. For most training files a very tight pruning threshold can be set; however, some training files will provide poorer acoustic matching, and in consequence a wider pruning beam is needed. HERest deals with this by having an auto-incrementing pruning threshold. In the HTK example, pruning is normally 250.0. If re-estimation fails on any particular file, the threshold is increased by 150.0 and the file is reprocessed. This is repeated until either the file is successfully processed or the pruning limit of 1000.0 is exceeded. At this point it is safe to assume that there is a serious problem with the training file, and hence the fault should be fixed (typically it will be an incorrect transcription) or the training file should be discarded.
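To make the retry behaviour quoted above concrete, here is a minimal Python sketch of an auto-incrementing pruning threshold. It is not HERest's or align.py's actual code: `pruning_schedule`, `reestimate_with_pruning` and the `try_file` callback are hypothetical names used only for illustration, and the defaults (250.0, 150.0, 1000.0) are the values from the HTK passage.

```python
from typing import Callable, Iterator, Optional


def pruning_schedule(start: float = 250.0,
                     step: float = 150.0,
                     limit: float = 1000.0) -> Iterator[float]:
    """Yield the thresholds to try, from start up to and including limit."""
    threshold = start
    while threshold <= limit:
        yield threshold
        threshold += step


def reestimate_with_pruning(try_file: Callable[[float], bool],
                            start: float = 250.0,
                            step: float = 150.0,
                            limit: float = 1000.0) -> Optional[float]:
    """Retry one training file with a widening pruning beam.

    try_file is a hypothetical callback that returns True when
    re-estimation of the file succeeds at the given threshold.
    Returns the threshold that worked, or None if the limit was
    exceeded, in which case the file should be fixed or discarded.
    """
    for threshold in pruning_schedule(start, step, limit):
        if try_file(threshold):
            return threshold
    return None
```

With the default values the thresholds tried are 250, 400, 550, 700, 850 and 1000. A `None` result corresponds to the case the HTK text describes as a serious problem with the training file, typically an incorrect transcription.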
./align.py -t sample_english_data sample_english_data
./align.py -t inuktitut_data inuktitut_data