This issue is not a priority until we change the API.
If a sentence longer than 100 characters contains no ". " (period followed by a space) token, the TTS API introduces stops. I add ". " when concatenating the text blocks to reduce the chance of unwanted stops.
But I have no control over the text sent by the API, e.g. when an article description is 100 chars long without a ". ".
Because of this, there is sometimes a stop in the middle of a sentence. For example:
In "bla bla X has been interviewing Z bla bla" there might be a stop after "been", and it sounds strange...
See the similar issue in the API library, pndurette/gTTS#119, and also https://gtts.readthedocs.io/en/latest/tokenizer.html#tokenizing for the accepted tokens.
Solutions I can think of are:
1- Iterate through the text and, if a group of 100 chars contains no ", " or ". ", add one near the keywords "and" or "or" (if they exist).
But this increases complexity, since we have to go through the text of every article, and such keywords might not exist...
2- Trim the article text to less than 100 chars. This would help with the strange stops but would reduce the article data.
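To make option 1 concrete, here is a minimal sketch of how it could look. The function names, the keyword list and the choice to insert only a comma are my assumptions, not anything gTTS provides, and I have not checked how the inserted commas sound in the audio.

```python
import re

MAX_LEN = 100  # chunk size after which stops are reported to appear
# Runs are delimited by '.' or ',' followed by whitespace.
PUNCT_SPLIT = re.compile(r"(?<=[.,])\s+")
# Hypothetical keyword list: standalone "and" / "or".
KEYWORD = re.compile(r"\s+(?:and|or)\s+", re.IGNORECASE)


def break_run(run: str, max_len: int = MAX_LEN) -> str:
    """Insert ', ' before a keyword in a punctuation-free run longer than max_len."""
    if len(run) <= max_len:
        return run
    matches = [m for m in KEYWORD.finditer(run) if m.start() > 0]
    if not matches:
        return run  # no keyword to hook onto, nothing we can do here
    # Prefer the last keyword inside the first max_len chars, else the first one after.
    in_window = [m for m in matches if m.start() < max_len]
    cut = (in_window[-1] if in_window else matches[0]).start()
    head, tail = run[:cut], run[cut:].lstrip()
    return head + ", " + break_run(tail, max_len)


def add_soft_breaks(text: str, max_len: int = MAX_LEN) -> str:
    """Apply break_run to every run between existing '. ' / ', ' tokens.

    Note: whitespace after existing punctuation is normalised to one space.
    """
    return " ".join(break_run(run, max_len) for run in PUNCT_SPLIT.split(text))
```

This only runs the regex over runs that are already too long, so the extra cost per article should stay small, but as noted it does nothing when no keyword exists.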
I'd like to hear other solutions.
Details for the curious: the Google TTS API actually introduces a stop every 100 chars, but the gTTS library avoids this behaviour if the sentence contains ". " or ", ". If it contains neither, gTTS can do nothing to eliminate the stops.
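For anyone who wants to reproduce where the stops land, the behaviour described above can be approximated with a rough model like the one below. It is a simplified sketch written for this issue, not gTTS's actual tokenizer; `chunk_for_tts` is a made-up name and the real library handles more cases.

```python
import re


def chunk_for_tts(text: str, max_len: int = 100) -> list[str]:
    """Rough model: split on '. ' / ', ' first, then cut long pieces at a space."""
    chunks = []
    for piece in re.split(r"(?<=[.,])\s+", text):
        # A piece with no punctuation can still exceed max_len;
        # it then gets cut at the last space before the limit.
        while len(piece) > max_len:
            cut = piece.rfind(" ", 0, max_len)
            if cut <= 0:
                cut = max_len  # no space at all: hard cut
            chunks.append(piece[:cut])
            piece = piece[cut:].lstrip()
        if piece:
            chunks.append(piece)
    return chunks
```

Each boundary between two returned chunks is a place where a stop may be heard, so printing the chunks for a problematic article description shows which word the stop follows.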