-
I was successful in rescoring the ASR output (provided by the NVIDIA STT_it_QuartzNet15x5 model) using Beam Search & KenLM and training the latter on a large Italian corpus (say 'it_corpus.txt'). Now, I would like to apply a Neural Rescorer, but I cannot find a way to do that given that there is no Nemo LM for Italian already trained and available. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
We do not support using HF pretrained models as neural rescorer currently but I think it should not be that hard to write such a script to load the HF model and rescore sentences. Just keep in mind that if you are referring to BERT-like models in HF, you can use them as neural rescorer for ASR models generally but they are not efficient computationally to be used as neural rescorers. It is better to use GPT style LM like regular Transformer LM. I have explained a little more here why: The other thing is that those pretrianed models are trained on different types/domains of data while ASR models are generally trained on lower case with no punctuation text. So it is likely that a model you train by yourself from scratch may give better results if you have enough large text corpus. We are going to add more documentation on training Transfromer LM models soon. In the meantime, you may try to train it as the following: 1-Train a yttm tokenizer with
|
Beta Was this translation helpful? Give feedback.
We do not support using HF pretrained models as neural rescorer currently but I think it should not be that hard to write such a script to load the HF model and rescore sentences. Just keep in mind that if you are referring to BERT-like models in HF, you can use them as neural rescorer for ASR models generally but they are not efficient computationally to be used as neural rescorers. It is better to use GPT style LM like regular Transformer LM. I have explained a little more here why:
#2313
The other thing is that those pretrianed models are trained on different types/domains of data while ASR models are generally trained on lower case with no punctuation text. So it is likely that a mode…