You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Are there any specific requirements for audio files to make the results of DNSMOS valid?
I couldn't find any documentation in this repo or the original paper describing the audio requirements, but I was hoping to use home-made recordings to evaluate the performance of speech enhancement algorithms. Can any audio be used and gives valid results?
I've been running DNSMOS on some local files and have found that the S-MOS and G-MOS scores don't always correlate with subjectively listening to the files. Is there anything I should be doing to make these files valid for use in DNSMOS?
For example, are there requirements/recommendations on:
total duration of audio file;
proportion of speech and non-speech in the file;
level requirements;
suggested SNR for evaluation files (before speech enhancement is applied).
The text was updated successfully, but these errors were encountered:
Are there any specific requirements for audio files to make the results of DNSMOS valid?
I couldn't find any documentation in this repo or the original paper describing the audio requirements, but I was hoping to use home-made recordings to evaluate the performance of speech enhancement algorithms. Can any audio be used and gives valid results?
I've been running DNSMOS on some local files and have found that the S-MOS and G-MOS scores don't always correlate with subjectively listening to the files. Is there anything I should be doing to make these files valid for use in DNSMOS?
For example, are there requirements/recommendations on:
The text was updated successfully, but these errors were encountered: