What:

A convolutional neural network for recognizing the language being spoken in short (about 3-10 second) wav recordings by first converting the raw audio to Mel-Frequency Cepstral Coefficient images.

How:

To train: Run train.py --modeldir=[directory_to_save_model] <-v[erbose_logging]>

To test: Drop any desired test audio into the test-audio folder, add the filename and label to the files.csv file there, and run `predict.py --modeldir=[directory_of_saved_model] <-v[erbose_logging]>

Requirements:

argparse tempfile tensorflow >1.8.0 numpy pandas sklearn

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
test-audio		test-audio
voxforge		voxforge
README.md		README.md
fcbh_file_processing.py		fcbh_file_processing.py
fcbh_trainingData.csv		fcbh_trainingData.csv
lingua_franca_config.py		lingua_franca_config.py
mfcc_check.ipynb		mfcc_check.ipynb
predict.py		predict.py
train.py		train.py
voxforge.py		voxforge.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What:

How:

Requirements:

About

Releases

Packages

Languages

joecomerisnotavailable/Lingua_Franca

Folders and files

Latest commit

History

Repository files navigation

What:

How:

Requirements:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages