Google speech to syllables

A script that transcribes audio files into syllables using the Google Cloud Speech-to-Text API and the Pyphen library. See the Google Cloud Python tutorial and the Pyphen documentation.

If you use this method in your work, please cite the following article:

Illner, V., Tykalová, T., Novotný, M., Klempíř, J., Dušek, P., & Rusz, J. Toward Automated Articulation Rate Analysis via Connected Speech in Dysarthrias. Journal of Speech, Language, and Hearing Research, 1-16. https://doi.org/10.1044/2021_JSLHR-21-00549

Developed at FEE CTU in Prague.

Use

Processes speech audio files and outputs their syllables with the corresponding timestamps.

python speech-to-words-to-syllables.py
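To illustrate the kind of output involved, the sketch below splits the time span of one recognized word evenly across its syllables. The even split is an assumption made for illustration and is not necessarily how the script assigns timestamps; the function name `syllable_timestamps` is hypothetical. The word is passed in already hyphenated, in the form returned by Pyphen's `inserted()` method.

```python
def syllable_timestamps(hyphenated_word, start, end, hyphen="-"):
    """Split a word's [start, end] interval evenly across its syllables.

    Assumption: every syllable gets the same share of the word duration.
    """
    syllables = [s for s in hyphenated_word.split(hyphen) if s]
    if not syllables:
        return []
    step = (end - start) / len(syllables)
    return [
        (syllable, start + i * step, start + (i + 1) * step)
        for i, syllable in enumerate(syllables)
    ]


# "sla-bi-ka" is a hand-hyphenated stand-in for the result of dic.inserted(word);
# the start and end times (in seconds) are made-up example values
for syllable, t0, t1 in syllable_timestamps("sla-bi-ka", 1.0, 1.6):
    print(f"{syllable}\t{t0:.2f}\t{t1:.2f}")
```

In practice the start and end times would come from the word time offsets returned by the Google API, which the script requests with `enable_word_time_offsets=True`.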

Setup

Create a Google Cloud account
Create a credentials file for server authentication (probably the hardest step)
Create a new bucket (folder) in Cloud Storage
Upload the audio files of interest to that bucket
Supply the following arguments within the script:
- path to the credentials file
- path to the output folder
- name of the bucket on Google Cloud

import os

# set the credentials environment variable
os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = 'C:\\Users\\username\\google_credentials_file.json'
# folder where the results are stored
output_file_path = 'C:/Users/username/python_project/data_output/'
# name of the bucket that holds the audio files
bucket_name = "bucket_name"
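Because a wrong path is an easy mistake at this step, it can help to validate the three values before any request is sent. The following is a minimal sketch using only the standard library; the helper names `prepare_paths` and `gcs_uri` and the file name `recording_01.wav` are hypothetical and do not come from the script.

```python
import os
from pathlib import Path


def prepare_paths(credentials_path, output_dir):
    """Fail early on a missing credentials file and create the output folder."""
    credentials = Path(credentials_path)
    if not credentials.is_file():
        raise FileNotFoundError(f"Credentials file not found: {credentials}")
    # the Google client libraries read the key location from this variable
    os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = str(credentials)
    output = Path(output_dir)
    output.mkdir(parents=True, exist_ok=True)
    return output


def gcs_uri(bucket_name, blob_name):
    """Build the gs:// URI of an object stored in a bucket."""
    return f"gs://{bucket_name}/{blob_name.lstrip('/')}"


# hypothetical file name, shown only to illustrate the URI format
print(gcs_uri("bucket_name", "recording_01.wav"))
```

Calling `prepare_paths` with the credentials path and the output folder raises a `FileNotFoundError` when the credentials file is missing, instead of failing later with an authentication error.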

Modify the config variables to reflect your data

Set the audio sampling rate and the language to match your recordings. For the Google Speech-to-Text API:

config = speech.RecognitionConfig(
    sample_rate_hertz=48000,
    language_code="cs-CZ",
    enable_word_time_offsets=True,
)
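If the sampling rate of a recording is not known, it can be read from the file header instead of being guessed. The sketch below assumes the recordings are WAV files (the audio format is not stated here) and uses the standard-library `wave` module; the helper name `wav_sample_rate` is hypothetical. The returned value is what would go into `sample_rate_hertz`.

```python
import os
import tempfile
import wave


def wav_sample_rate(path):
    """Return the sampling rate (Hz) stored in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


# demo: write 10 ms of 16-bit mono silence at 48 kHz, then read the rate back
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo.wav")
    with wave.open(demo_path, "wb") as out:
        out.setnchannels(1)
        out.setsampwidth(2)
        out.setframerate(48000)
        out.writeframes(b"\x00\x00" * 480)
    print(wav_sample_rate(demo_path))
```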

For Pyphen:

pyphen.language_fallback("cs_CZ")
dic = pyphen.Pyphen(lang='cs_CZ')

Run the script speech-to-words-to-syllables.py in a terminal
