Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Audio input requirements #172

Open
AndyLogi opened this issue Aug 22, 2023 · 0 comments
Open

Audio input requirements #172

AndyLogi opened this issue Aug 22, 2023 · 0 comments

Comments

@AndyLogi
Copy link

Are there any specific requirements for audio files to make the results of DNSMOS valid?
I couldn't find any documentation in this repo or the original paper describing the audio requirements, but I was hoping to use home-made recordings to evaluate the performance of speech enhancement algorithms. Can any audio be used and gives valid results?

I've been running DNSMOS on some local files and have found that the S-MOS and G-MOS scores don't always correlate with subjectively listening to the files. Is there anything I should be doing to make these files valid for use in DNSMOS?
For example, are there requirements/recommendations on:

  • total duration of audio file;
  • proportion of speech and non-speech in the file;
  • level requirements;
  • suggested SNR for evaluation files (before speech enhancement is applied).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant