Integrate Watson API for speech-to-text #47

jeffpaul · 2019-04-08T20:09:17Z

Splitting this out from #2

Sidsector9 · 2022-07-17T12:46:19Z

Additional info:

The IBM Watson Speech to Text provides 3 interfaces for speech recognition:

See more on Advantages of the WebSocket interface.

@jeffpaul This service provides a lot of features, few of them such as: Speaker labels, Profanity filtering and Background audio suppression, etc.

Can you expand on the use case of this feature? That way we can list out the features that can go with the implementation.

jeffpaul · 2022-07-27T02:13:30Z

@Sidsector9 I believe the original thought on this enhancements was taking live speech and generating text from that for captions, so the speaker labels bit you highlighted probably most applies here.

jeffpaul · 2022-09-26T21:33:30Z

Could similarly look at using OpenAI's Whisper for this.

jeffpaul added the enhancement label Apr 8, 2019

jeffpaul added this to the Future Release milestone Apr 8, 2019

jeffpaul mentioned this issue Apr 8, 2019

Integrate Watson API for image recognition #2

Open

jeffpaul removed the type:enhancement label Jan 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate Watson API for speech-to-text #47

Integrate Watson API for speech-to-text #47

jeffpaul commented Apr 8, 2019

Sidsector9 commented Jul 17, 2022

jeffpaul commented Jul 27, 2022

jeffpaul commented Sep 26, 2022

Integrate Watson API for speech-to-text #47

Integrate Watson API for speech-to-text #47

Comments

jeffpaul commented Apr 8, 2019

Sidsector9 commented Jul 17, 2022

Additional info:

jeffpaul commented Jul 27, 2022

jeffpaul commented Sep 26, 2022