The V2 model is very good, especially since it supports laughs and other non-word sounds, but it's still a little bit behind the best paid models. Are there any plans for a V3 release? If so, can you provide an ETA for its release and what can we expect from it? For example, will it add support for other languages or significantly increase the training data?
Thank you very much for the project.
Data: about 8k hours of Chinese data and 7k hours of data in other languages.
Better zero-shot TTS timbre similarity (speaker-verification distance) and better audio quality (MOS).
Richer emotional expression, with stability (WER) consistent with V2. (A sketch of how these two metrics are typically computed follows below.)
The experiments have been successful, and I am expanding the training data.
Inference time: maybe slightly slower.
The new release is expected around January.
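Since the reply leans on speaker-verification distance and WER as the headline metrics, here is a minimal sketch of how these are typically computed. This is not the project's own evaluation code: the speaker embeddings are assumed to come from some external speaker-verification model, and the function names are hypothetical.

```python
import numpy as np

def timbre_similarity(emb_ref: np.ndarray, emb_syn: np.ndarray) -> float:
    """Cosine similarity between a reference speaker embedding and the
    embedding of the synthesized audio (hypothetical inputs; in practice
    both would come from a speaker-verification model). Higher is more
    similar; the "distance" mentioned above is typically 1 - similarity."""
    a = emb_ref / np.linalg.norm(emb_ref)
    b = emb_syn / np.linalg.norm(emb_syn)
    return float(np.dot(a, b))

def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance between an ASR
    transcript of the synthesized audio and the input text, divided by
    the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i, j] = edits needed to turn the first i reference words
    # into the first j hypothesis words
    dp = np.zeros((len(ref) + 1, len(hyp) + 1), dtype=int)
    dp[:, 0] = np.arange(len(ref) + 1)
    dp[0, :] = np.arange(len(hyp) + 1)
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            substitution = dp[i - 1, j - 1] + (ref[i - 1] != hyp[j - 1])
            dp[i, j] = min(substitution, dp[i - 1, j] + 1, dp[i, j - 1] + 1)
    return dp[len(ref), len(hyp)] / max(len(ref), 1)
```

For Chinese text, the same edit-distance calculation is usually run over characters (CER) rather than whitespace-split words.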