You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I love how accurate this implementation of whisper but I am shocked at how slow it is to process one episode of tv show
I am using an RTX 3070
and it takes hours to finish a 150-minute video. I am getting an average of 99.53frames/s
whisper_timestamped 'video.mp4' --language ko --model large-v2 --task transcribe --output_format vtt --output_dir . --recompute_all_timestamps True --temperature 0.3 --best_of 2 --device cuda
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I love how accurate this implementation of whisper but I am shocked at how slow it is to process one episode of tv show
I am using an RTX 3070
and it takes hours to finish a 150-minute video. I am getting an average of 99.53frames/s
whisper_timestamped 'video.mp4' --language ko --model large-v2 --task transcribe --output_format vtt --output_dir . --recompute_all_timestamps True --temperature 0.3 --best_of 2 --device cuda
can I improve it ?
Beta Was this translation helpful? Give feedback.
All reactions