Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to transcribe audio in Indian English #1246

Open
ranjitsingha opened this issue Feb 14, 2025 · 1 comment
Open

How to transcribe audio in Indian English #1246

ranjitsingha opened this issue Feb 14, 2025 · 1 comment

Comments

@ranjitsingha
Copy link

ranjitsingha commented Feb 14, 2025

Hey there i want to know if is it possible to convert an audio who's source language is hindi an then transcribe to Indian English also known as Hinglish.

When i use model large-v3-turbo with language

model.transcribe(audio='audio.mp3', language="hi", word_timestamps=True)

I get output as:

00:00:00,000 --> 00:00:00,320
पता
00:00:00,320 --> 00:00:00,440
है
00:00:00,440 --> 00:00:01,080
सबसे
00:00:01,080 --> 00:00:01,540
डरावनी

I want it as Indian English (Example) :

00:00:00,000 --> 00:00:00,320
Pata
00:00:00,320 --> 00:00:00,440
hai
00:00:00,440 --> 00:00:01,080
sabse
00:00:01,080 --> 00:00:01,540
darawani

But if i set the language to "en" i get output as:

00:00:00,000 --> 00:00:00,320
Do
00:00:00,320 --> 00:00:00,440
you
00:00:00,440 --> 00:00:01,080
know
00:00:01,080 --> 00:00:01,540
that

@emcodem
Copy link

emcodem commented Mar 6, 2025

No, "Hinglish" is not a supported language in the whisper models.
You can try to set lang to hindi and work with a perpetual hinglish prompt (e.g. 2 short sentences that always stay on the left side in prompt), but i doubt that the results will be stable/satisfying.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants