Audio input requirements #172

AndyLogi · 2023-08-22T09:57:14Z

Are there any specific requirements for audio files to make the results of DNSMOS valid?
I couldn't find any documentation in this repo or the original paper describing the audio requirements. I was hoping to use home-made recordings to evaluate the performance of speech enhancement algorithms. Can any audio be used and still give valid results?

I've been running DNSMOS on some local files and have found that the S-MOS and G-MOS scores don't always agree with my subjective impression when listening to the files. Is there anything I should be doing to make these files valid for use in DNSMOS?
For example, are there requirements/recommendations on:

total duration of audio file;
proportion of speech and non-speech in the file;
level requirements;
suggested SNR for evaluation files (before speech enhancement is applied).
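
To make the properties listed above concrete, here is a minimal sketch of how duration, overall level, and a rough speech/non-speech proportion could be measured for a recording. It is not part of DNSMOS: the function name `describe_audio`, the 20 ms frame size, and the -40 dBFS activity threshold are all assumptions chosen for illustration, and the energy threshold is only a crude stand-in for real voice activity detection. SNR is left out because estimating it needs a clean reference or a noise-only segment.

```python
import math


def describe_audio(samples, sample_rate, frame_ms=20, active_threshold_dbfs=-40.0):
    """Summarise a mono signal given as floats in [-1.0, 1.0].

    Hypothetical helper for illustration; the frame size and threshold
    are arbitrary assumptions, not values taken from DNSMOS.
    """
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    if not samples:
        raise ValueError("samples must not be empty")

    def rms_dbfs(block):
        # RMS level relative to full scale (1.0); -inf for digital silence.
        mean_square = sum(x * x for x in block) / len(block)
        if mean_square == 0.0:
            return float("-inf")
        return 10.0 * math.log10(mean_square)

    # Split the signal into short frames and count those above the threshold.
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    frames = [samples[i:i + frame_len] for i in range(0, len(samples), frame_len)]
    active = sum(1 for frame in frames if rms_dbfs(frame) > active_threshold_dbfs)

    return {
        "duration_s": len(samples) / sample_rate,
        "level_dbfs": rms_dbfs(samples),
        "active_fraction": active / len(frames),
    }


if __name__ == "__main__":
    # Synthetic example: 0.5 s of silence followed by 0.5 s of a 1 kHz tone.
    rate = 16000
    silence = [0.0] * (rate // 2)
    tone = [0.5 * math.sin(2 * math.pi * 1000 * n / rate) for n in range(rate // 2)]
    print(describe_audio(silence + tone, rate))
```

Reporting these numbers next to each DNSMOS score would make it easier to see whether the mismatches with listening cluster around very short, very quiet, or mostly silent files.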
