-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Loss doesnt goes down #32
Comments
were you able to solve this? I am also finetuning the model on 5k audios, and loss is stuck at 32 |
Did you guys make it lesser? I got the same problem |
Reducing the weight co-efficients works, but you need to train for a lot more epochs then mentioned, and depends on quality of your dataset. |
@omerarshad I'm training on 40 mins of good quality dataset how many epochs do I need to get clear audio. |
How long did it take you and what was your hardware setup? I'm wondering if this is possible on a midrange gaming laptop. |
@andergisomon I finetuned it on google colab pro, A100 with batch size of 64 for around 400+ epochs |
lowering the mel weight from 35 to 10 after training 2000 epochs on 1 hour of audio made the step loss go from 16 to 7, i need to finish training to see the results but looks promising |
nope, even when the lossrate dropped from 16 to 7 the quality is still bad, very robotic |
which weights? |
I am finetuning a catalan checkpoint from MMS, with a single speaker dataset of 5 hours, the loss doesnt goes bellow 22, all the characters of my the transcription from my datset exist inside the vocab of the model.
I dont know if i just need more training, how many epochs for 2500 audios is a good amount to get better results than the checkpoint?
I already did 400
The text was updated successfully, but these errors were encountered: