Colab for Synthesis #6

athenasaurav · 2024-03-26T14:27:57Z

Hello Everyone,

Here is the Colab for synthesis.

yiwei0730 · 2024-03-29T08:45:24Z

@athenasaurav If I want to use an arbitrary speaker's voice, how do I obtain the TextGrid alignment for it?

athenasaurav · 2024-03-29T10:31:39Z

Hello @yiwei0730

You can use MFA (Montreal Forced Aligner) to do this. Please read this blog.

yiwei0730 · 2024-03-29T14:07:09Z

Thanks for your reply. So that means running MFA via datasets_align.sh,
following this (the same method I used before with FS2).

ex3ndr · 2024-04-01T18:20:59Z

Yes, you need MFA, but you don't need to align the full dataset; you can run it on just the files from your samples.
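
For anyone following along, here is a minimal sketch of what "run it on just your sample files" could look like. It copies a few wav files into a small corpus folder, writes a same-named txt transcript next to each one, and builds an `mfa align` command for that folder. The helper names (`build_mfa_corpus`, `mfa_align_command`) are hypothetical and not part of supervoice, and the `english_us_arpa` dictionary and acoustic model are assumptions; substitute the models that match your language and the phoneme set the project expects.

```python
import shutil
from pathlib import Path


def build_mfa_corpus(samples, corpus_dir):
    """Copy (wav_path, transcript) pairs into a flat corpus folder.

    MFA pairs each audio file with a transcript file that has the same
    stem, so sample.wav gets a sample.txt written next to it.
    """
    corpus = Path(corpus_dir)
    corpus.mkdir(parents=True, exist_ok=True)
    for wav_path, transcript in samples:
        wav_path = Path(wav_path)
        target = corpus / wav_path.name
        shutil.copy(wav_path, target)
        target.with_suffix(".txt").write_text(transcript.strip() + "\n", encoding="utf-8")
    return corpus


def mfa_align_command(corpus_dir, output_dir,
                      dictionary="english_us_arpa",       # assumption: pick your own
                      acoustic_model="english_us_arpa"):  # assumption: pick your own
    """Build the command line; the argument order is corpus, dictionary, acoustic model, output."""
    return ["mfa", "align", str(corpus_dir), dictionary, acoustic_model, str(output_dir)]
```

The returned list can be passed to `subprocess.run(cmd, check=True)` once MFA is installed and the chosen models have been downloaded; the TextGrid files then appear in the output directory.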

yiwei0730 · 2024-04-02T01:18:56Z

I'm interested in the model's audio-prompt-free voice generation.
Where does it produce or sample the unique voice features when I don't provide an audio prompt? Can you point it out to me?
Also, if I like a generated voice, can I extract those features and reuse them to synthesize another sentence in the same voice?

ex3ndr · 2024-04-02T07:15:53Z

Yes, you can do this: just zero out the prompts (do not provide any) and you will get random voices, which you can later use as prompts.

yiwei0730 · 2024-04-02T07:35:31Z

Good idea, I never thought that a newly generated voice could be used directly as a prompt, hahaha.
I noticed that each voice needs four files: TextGrid, pt, txt, and wav.
The TextGrid is generated from the txt through MFA, and the txt can be generated by speech recognition. How should the pt file be generated?

ex3ndr · 2024-04-02T07:42:35Z

MFA acts as a proxy that provides text-phoneme pairs. Since the GPT model takes text and generates phonemes and durations, you get everything you need from it and can pack that into a similar pt file.
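
To make the "phonemes and durations" part concrete, here is a rough sketch that reads the phone tier of a long-format TextGrid and returns (phoneme, duration) pairs. It assumes the tier is named `phones`, which is how MFA usually labels it, and the helper names are hypothetical. It deliberately stops before writing the pt file, because this thread does not describe that file's layout; generate_voices.py in the repository is the place to check.

```python
import re

# One interval in Praat's long TextGrid format: xmin, xmax, then quoted text.
INTERVAL_RE = re.compile(
    r'intervals \[\d+\]:\s*xmin = ([\d.eE+-]+)\s*xmax = ([\d.eE+-]+)\s*text = "((?:[^"]|"")*)"'
)


def read_tier(textgrid_text, tier_name):
    """Return [(label, start, end), ...] for the named interval tier."""
    # Split on tier headers such as "item [1]:"; the bare "item []:" line is not matched.
    for block in re.split(r"item \[\d+\]:", textgrid_text)[1:]:
        name = re.search(r'name = "([^"]*)"', block)
        if name and name.group(1) == tier_name:
            return [
                (label.replace('""', '"'), float(xmin), float(xmax))
                for xmin, xmax, label in INTERVAL_RE.findall(block)
            ]
    raise KeyError(f"tier not found: {tier_name}")


def phoneme_durations(textgrid_text, tier_name="phones"):
    """Return [(phoneme, duration_in_seconds), ...]; empty labels mark silence."""
    return [(label, end - start) for label, start, end in read_tier(textgrid_text, tier_name)]
```

Durations here are in seconds; converting them to frame counts depends on the hop size and sample rate used by the model, which are not stated in this thread.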

yiwei0730 · 2024-04-02T07:59:57Z

@ex3ndr Do you mean the file is created by this script? https://github.com/ex3ndr/supervoice/blob/master/generate_voices.py

ex3ndr · 2024-04-02T15:41:54Z

Yes

athenasaurav closed this as completed Mar 27, 2024

athenasaurav reopened this Mar 29, 2024
