You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I read the C++ code of TensorflowPredictMusiCNN function and found that the input signal requirements are frameSize=512, hopSize=256, samplerate =16000.
Then I found that the input data requirement for mood_happy-msd-musicnn-1.pb is: batchsize * 187 *96.
According to this information, I calculated that the minimum audio duration required by this model is about 3 seconds, I am not sure whether the calculation is correct or not.
In addition, can 3 seconds of data accurately deduce the correct result?
Do I need to average enough results to make the results more accurate?
The text was updated successfully, but these errors were encountered:
feibuguocanghai
changed the title
how to run BeatTrackerDegara in c++ standard mode?
Does mood_happy-msd-musicnn work in real time?
Jan 17, 2025
Hi @feibuguocanghai, you are right, this model operates in windows of 3 seconds.
Individual estimations based on 3-second windows are expected to be noisy, and to obtain more accurate results you can average predictions over time.
If you want a real-time system, you could go for something like a moving average. You can have a look at our tutorial for real-time usage of the models.
Is it possible to use mood_happy-msd-musicnn to make accurate inferences on a 3-second audio clip?
With the following code:
I read the C++ code of TensorflowPredictMusiCNN function and found that the input signal requirements are frameSize=512, hopSize=256, samplerate =16000.
Then I found that the input data requirement for mood_happy-msd-musicnn-1.pb is: batchsize * 187 *96.
According to this information, I calculated that the minimum audio duration required by this model is about 3 seconds, I am not sure whether the calculation is correct or not.
In addition, can 3 seconds of data accurately deduce the correct result?
Do I need to average enough results to make the results more accurate?
The text was updated successfully, but these errors were encountered: