You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First large-v2 does not support the "yue" language token but large-v3 does.
However when I used faster-whisper to load large-v2 and then transcribed sentences with the param language="yue", there was no bug. It should not work.
faster_whisper
large-v2
None is returned when use large-v2's tokenizer to encode "yue".
>>> from faster_whisper import WhisperModel
>>> model = WhisperModel("large-v2",
... device=DEVICE,
... )
>>> print(model.hf_tokenizer.token_to_id("<|%s|>"%"yue"))
None
>>> print(model.hf_tokenizer.token_to_id("<|%s|>"%"zh"))
xxx
The result is returned normally and the language of TranscriptionInfo is "yue".
importwhispermodel=whisper.load_model("large-v2",)
input_language='yue'result=model.transcribe(audio, language=input_language)
"""--> [154] sot_sequence.append(sot + 1 + langs.index(self.language))ValueError: tuple.index(x): x not in tuple"""
Language-token is put the start of encoded_input in openai/whisper. However, the language tokens don't seem to be parsed in the way we expect.
The text was updated successfully, but these errors were encountered:
Coconut3223
changed the title
large-v2 does not support the "yue" but no bug in faster-whisperlarge-v2 does not support "yue" but no bug in faster-whisper
Jan 20, 2025
First
large-v2
does not support the "yue" language token butlarge-v3
does.However when I used
faster-whisper
to loadlarge-v2
and then transcribed sentences with the paramlanguage="yue"
, there was no bug. It should not work.faster_whisper
large-v2
None is returned when use
large-v2
's tokenizer to encode "yue".The result is returned normally and the language of
TranscriptionInfo
is "yue".large-V3
openai/whisper
large-V2
large-V3
Question:
Language-token is put the start of encoded_input in
openai/whisper
. However, the language tokens don't seem to be parsed in the way we expect.The text was updated successfully, but these errors were encountered: