Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Offline, privacy-respecting speech to text #34

Open
6 of 7 tasks
RustoMCSpit opened this issue Feb 26, 2024 · 4 comments
Open
6 of 7 tasks

Offline, privacy-respecting speech to text #34

RustoMCSpit opened this issue Feb 26, 2024 · 4 comments
Labels
feature request Issue is about a new feature in the app needs triage Issue is not yet ready for PR authors to take up

Comments

@RustoMCSpit
Copy link

Checklist

  • I made sure that there are no existing issues - open or closed - to which I could contribute my information.
  • I made sure that there are no existing discussions - open or closed - to which I could contribute my information.
  • I have read the FAQs inside the app (Menu -> About -> FAQs) and my problem isn't listed.
  • I have taken the time to fill in all the required details. I understand that the bug report will be dismissed otherwise.
  • This issue contains only one feature request.
  • I have read and understood the contribution guidelines.
  • I optionally donated to support the Fossify mission.

Feature description

Speech-to-text transcription of audios that recognises multiple speakers. Able to see text of any audio by dropdown, or search bar, and exporting of all trascribed text as well.

Why do you want this feature?

would also be able to allow for a transcript so you could have a search bar and go through your voice recordings and you could click through the exact moment that word was said in the voice recordings. so if i typed 'adam' it may find 4 hits from the past 4 months:
file191: 00:07
file179: 12:23, 16:30
file73: 06:42

you could then click on those moments to find the one youre looking for.

this could also be used for tagging, for example, if im working on a project called 'block runner' i could search for all mentions and tag them all easily

Additional information

Futo has partially delivered on this with an excellent FOSS solution:
https://gitlab.futo.org/alex/voiceinput
https://voiceinput.futo.org/

But the Futo solution currently works within other apps only and is not integrated directly into a voice recorder app. Adding Futo's speech-to-text capabilities to Simple Voice Recorder would make a voice recorded easily on par with Google's proprietary app.

@RustoMCSpit RustoMCSpit added feature request Issue is about a new feature in the app needs triage Issue is not yet ready for PR authors to take up labels Feb 26, 2024
@RustoMCSpit
Copy link
Author

#17

@Warden20
Copy link

+1

@satvikpendem
Copy link

Also looking for something like this. Lots of proprietary apps but no FLOSS ones.

@endingisnight
Copy link

endingisnight commented Jan 25, 2025

Though it will land an AF on fdroid, it might be easy to copy from Whisper.
https://github.com/woheller69/whisperIME

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request Issue is about a new feature in the app needs triage Issue is not yet ready for PR authors to take up
Projects
None yet
Development

No branches or pull requests

4 participants