Automatic audio resampling on the fly #2360

agemagician · 2021-06-11T08:47:06Z


Is it possible to resample audio on the fly during training?

Assuming I have a dataset that mixes different sampling rates (32k, 16k, and 8k), can NeMo automatically resample the audio during training to the model-specific sample rate defined in the YAML file?


okuchaiev (Collaborator) · 2021-06-15T05:42:35Z

Yes, there is a sample_rate parameter you can pass to AudioToCharDataset: https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/asr/api.html?highlight=AudioToCharDataset#nemo.collections.asr.data.audio_to_text.AudioToCharDataset

However, I would strongly recommend against this. Pre-process your data to the right sampling rate before training instead; otherwise, resampling every file on the fly will really hurt training speed.
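
The offline pre-processing suggested above could be sketched roughly as follows. This is a minimal illustration, not something from the NeMo codebase: it assumes the dataset is stored as PCM WAV files, that ffmpeg is installed and on the PATH, and that the target rate is 16 kHz (substitute the value from your YAML config). The names TARGET_RATE, files_needing_resample, ffmpeg_command, and resample_dataset are hypothetical helpers introduced for this example.

```python
import subprocess
import wave
from pathlib import Path

# Assumed target rate; replace with the sample rate from your model's YAML config.
TARGET_RATE = 16000


def wav_sample_rate(path):
    """Read the sample rate from a PCM WAV header without decoding the audio."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getframerate()


def files_needing_resample(root, target_rate=TARGET_RATE):
    """Return (path, rate) pairs for WAV files under root whose rate differs from target_rate."""
    mismatched = []
    for path in sorted(Path(root).rglob("*.wav")):
        rate = wav_sample_rate(path)
        if rate != target_rate:
            mismatched.append((path, rate))
    return mismatched


def ffmpeg_command(src, dst, target_rate=TARGET_RATE):
    """Build an ffmpeg command that rewrites src to dst at target_rate (-ar sets the audio rate)."""
    return ["ffmpeg", "-y", "-i", str(src), "-ar", str(target_rate), str(dst)]


def resample_dataset(src_root, dst_root, target_rate=TARGET_RATE):
    """Resample mismatched files into dst_root, mirroring the layout of src_root.

    Assumes ffmpeg is available on PATH. Files already at target_rate are skipped,
    not copied.
    """
    for src, _rate in files_needing_resample(src_root, target_rate):
        dst = Path(dst_root) / src.relative_to(src_root)
        dst.parent.mkdir(parents=True, exist_ok=True)
        subprocess.run(ffmpeg_command(src, dst, target_rate), check=True)
```

Running files_needing_resample on the dataset root first gives a quick view of how many 8k and 32k files there are before converting anything. After resample_dataset finishes, the training manifest would need to point at the converted files, and the files that were already at the target rate would still need to be copied or referenced from their original location.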
