-
-
Notifications
You must be signed in to change notification settings - Fork 153
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Polish TTS pronunciation #474
Comments
Technically speaking Applio is not a TTS engine in and of itself, just like AllTalk is not a TTS engine in and of itself. They would be classed as competing products, like saying Windows, Linux and Mac OS. So your question is like asking if I can put Windows in Mac OS. If there is a specific feature of it you are wanting/asking for? Here is what is currently planned #74 and the next release will be having RVC Training and also Opensea and also openvoice. Outside of those if you can tell me what it is you are asking for, then perhaps I can look at it at some point. |
@erew123 Oh, I thought Applio was just another model for creating TTS and would be able to implement it into alltalktts just like piper, vits, etc. The problem I have with all models is that in Polish, English words like names of people, names of places and any words eng says as it is written and not as it is pronounced. |
Yes, I checked every engine and in all of them is the same problem. |
With Piper and VITS, because the models/voices you load (of those models shown above) they should automatically shift into polish (aka, you cannot specify the language for them, they just are the language you load in). With XTTS, you would need to specify Appilio uses Edge TTS. which you can test the voices here https://huggingface.co/spaces/innoai/Edge-TTS-Text-to-Speech Does that TTS engine do what you want? |
@xdax1 just to be clear, reason I am asking if it does what you want, is I may be able to add that engine quite easily (when I get chance) |
Yes! I checked this edge and most of the words eng says correctly. And will I be able to add other voices there? |
So edge TTS, the actual TTS engine I think is hard coded voices. However, beyond that you can use RVC to alter the pitch to make it sound like someone specific and train a voice into that, though its not going to change the pronunciation of words, thats down to the underlying TTS engine. RVC is a voice changer/transformer so makes speech or TTS sound like another person. I have working code to train your own RVC voice, but I havnt had time to finish it just yet as its been a large code base update. I will also be adding Openvoice, which is the same as RVC but requires no pre-training and will just use a wav/audio file, like XTTS does. Again though the underlying TTS engine will need to pronounce things the way you want them spoken for your language, hence me asking is Edge is doing what you want with polish. |
Yes, Edge works. |
@erew123 Any ETA on the next update? RVC model training and Openvoice are 2 incredibly amazing features, wow! Can't wait to try them. And RVC training would actually come in handy right now, otherwise I would have to do that in Applio for the time being. Oh, and thank you so much for your incredible work! Alltalk is unbelievably useful and polished! |
@S-T-K No hard date yet. Hopefully before the end of the month, but had to leave coding pre Christmas to deal with a house break-in. Didnt get back until a couple of days ago, Im trying to catch up with PR's & support issues and then I have to try get my head back into where I was in the code pre christmas, with one additional problem that I have to travel again on the 10th and that will be at least 7 days away. The code is 80-85% done, but I cant recall exactly where Im at with it and that final 10-15% can always be a sod to complete when something goes wrong. So by the end of the month (hopefully). |
Wow that's soon! Guess I'll shuffle around some task to postpone the RVC part of my project till then, good to know. |
Hello, is it possible to add Applio TTS to alltalktts?
I care most about the API, which applio unfortunately does not have and I do not know how to create one.
The text was updated successfully, but these errors were encountered: