About Korean ASR #3648
Replies: 3 comments 15 replies
-
First, I share one of the models in the following. Conformer-Transducer-BPE-Small.nemo [Link]Model OverviewThis collection contains small size versions of Conformer-Transducer trained on ksponspeech which is an open-domain Korean dialog corpus. Model ArchitectureConformer-Transducer model is an autoregressive variant of Conformer model [1] for Automatic Speech Recognition which uses Transducer loss/decoding. You may find more info on the detail of this model here: [Conformer-Transducer Model]. TrainingThe NeMo toolkit [3] was used for training the models for over several hundred epochs. These model are trained with this [base config]. The tokenizers for these models were built using the text transcript. DatasetsAll the models in this collection are trained on Ksponspeech dataset [Download] PerformanceThe list of the available models in this collection is shown in the following. Performances of the ASR models are reported in terms of Word Error Rate (WER%) with mAES decoding.
|
Beta Was this translation helpful? Give feedback.
-
@eesungkim With the NeMo 1.8.1 release (soon™), we will support Huggingface Hub for external contributions (starting with ASR support). See #4030 for more details. If you would like, you can upload a public checkpoint for Korean ASR to HuggingFace and add the links here so that others may use it easily. When naming the model, please try to follow the current conversion for Conformer models- So for example, |
Beta Was this translation helpful? Give feedback.
-
@eesungkim It is now very easy to publish your model on Hugging Face Hub. @titu1994 prepared a great tutorial on how to do this #4333 I would encourage you publish your model (under your name/org) on HF Hub. |
Beta Was this translation helpful? Give feedback.
-
Hi guys,
Thank you for sharing a great tool for conversational AI.
I'm going to start a discussion on Korean ASR here. @okuchaiev
Beta Was this translation helpful? Give feedback.
All reactions