-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Colab for Synthesis #6
Comments
@athenasaurav Can I ask if I want to use anyone's voice, how to obtain TextGrid alignment? |
Hello @yiwei0730 You can use MFA to do this. Please read this blog |
Thanks for your reply, that means using MFA datasets_align.sh |
Yes, you need MFA, but you don't need alignment for full dataset, you can just run on files from your samples.
|
I'm interested in his audio-prompt-free automatic sound generation |
Yes you can do this, just zero out required prompts (do not provide any) and you will get random voices which you can later use as prompt.
|
Good idea, I never thought that the new voice could be directly used as a prompt hahaha. |
MFA is a proxy between text-phoneme pairs, since gpt takes text and generates phonemes and durations you will get all you need and pack it to the similar pt file.
|
@ex3ndr Did you mean the created file is this ? https://github.com/ex3ndr/supervoice/blob/master/generate_voices.py |
Yes |
Hello Everyone,
Here is the Colab for synthesis.
The text was updated successfully, but these errors were encountered: