Offline, privacy-respecting speech to text #34
Labels
feature request
Issue is about a new feature in the app
needs triage
Issue is not yet ready for PR authors to take up
Checklist
Feature description
Speech-to-text transcription of audios that recognises multiple speakers. Able to see text of any audio by dropdown, or search bar, and exporting of all trascribed text as well.
Why do you want this feature?
would also be able to allow for a transcript so you could have a search bar and go through your voice recordings and you could click through the exact moment that word was said in the voice recordings. so if i typed 'adam' it may find 4 hits from the past 4 months:
file191: 00:07
file179: 12:23, 16:30
file73: 06:42
you could then click on those moments to find the one youre looking for.
this could also be used for tagging, for example, if im working on a project called 'block runner' i could search for all mentions and tag them all easily
Additional information
Futo has partially delivered on this with an excellent FOSS solution:
https://gitlab.futo.org/alex/voiceinput
https://voiceinput.futo.org/
But the Futo solution currently works within other apps only and is not integrated directly into a voice recorder app. Adding Futo's speech-to-text capabilities to Simple Voice Recorder would make a voice recorded easily on par with Google's proprietary app.
The text was updated successfully, but these errors were encountered: