Offline, privacy-respecting speech to text #34

RustoMCSpit · 2024-02-26T16:00:39Z

Checklist

I made sure that there are no existing issues - open or closed - to which I could contribute my information.
I made sure that there are no existing discussions - open or closed - to which I could contribute my information.
I have read the FAQs inside the app (Menu -> About -> FAQs) and my problem isn't listed.
I have taken the time to fill in all the required details. I understand that the bug report will be dismissed otherwise.
This issue contains only one feature request.
I have read and understood the contribution guidelines.
I optionally donated to support the Fossify mission.

Feature description

Speech-to-text transcription of recordings that recognises multiple speakers. The text of any recording should be viewable via a dropdown or findable through a search bar, and it should be possible to export all transcribed text as well.

Why do you want this feature?

A transcript would also allow for a search bar: you could search through your voice recordings and click through to the exact moment a word was said. So if I typed 'adam', it might find 4 hits from the past 4 months:
file191: 00:07
file179: 12:23, 16:30
file73: 06:42

You could then click on those moments to find the one you're looking for.

This could also be used for tagging. For example, if I'm working on a project called 'block runner', I could search for all mentions and tag them all easily.
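
A rough sketch of how that search could work, assuming the transcriber can output a start time for every word. All names here (`Word`, `build_index`, `search`) are made up for illustration and are not taken from Futo's code or from the app, and the sample data is just the example above turned into seconds:

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    text: str      # one transcribed word
    start_s: int   # offset into the recording, in seconds


def build_index(transcripts):
    """Map each lowercased word to {file name: [start offsets in seconds]}."""
    index = defaultdict(lambda: defaultdict(list))
    for file_name, words in transcripts.items():
        for word in words:
            index[word.text.lower()][file_name].append(word.start_s)
    return index


def format_offset(seconds):
    """Render seconds as MM:SS, e.g. 743 becomes 12:23."""
    minutes, secs = divmod(seconds, 60)
    return f"{minutes:02d}:{secs:02d}"


def search(index, query):
    """Return one 'file: MM:SS, MM:SS' line per recording that contains the word."""
    hits = index.get(query.lower(), {})
    return [
        f"{name}: {', '.join(format_offset(s) for s in offsets)}"
        for name, offsets in hits.items()
    ]


# Hypothetical transcripts, keyed by recording name.
transcripts = {
    "file191": [Word("hi", 5), Word("Adam", 7)],
    "file179": [Word("adam", 743), Word("said", 744), Word("Adam", 990)],
    "file73": [Word("ask", 400), Word("Adam", 402)],
}

index = build_index(transcripts)
for line in search(index, "adam"):
    print(line)
```

Tagging could reuse the same lookup: every recording returned for 'block runner' would get the tag. A phrase search like that would need matching on consecutive words, which this single-word sketch does not do.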

Additional information

Futo has partially delivered on this with an excellent FOSS solution:
https://gitlab.futo.org/alex/voiceinput
https://voiceinput.futo.org/

But the Futo solution currently works only within other apps and is not integrated directly into a voice recorder app. Adding Futo's speech-to-text capabilities to Simple Voice Recorder would easily put the voice recorder on par with Google's proprietary app.

RustoMCSpit · 2024-02-26T16:01:02Z

#17

Warden20 · 2024-05-17T05:19:07Z

+1

satvikpendem · 2025-01-20T09:04:08Z

Also looking for something like this. Lots of proprietary apps but no FLOSS ones.

endingisnight · 2025-01-25T02:42:20Z

Though it would land the app an Anti-Feature (AF) flag on F-Droid, it might be easy to copy from Whisper.
https://github.com/woheller69/whisperIME

RustoMCSpit added the "feature request" (Issue is about a new feature in the app) and "needs triage" (Issue is not yet ready for PR authors to take up) labels on Feb 26, 2024

RustoMCSpit mentioned this issue on Nov 23, 2024 in "Offline, privacy-respecting speech to text" Leonidius20/RecordingStudio#22 (Open)
