Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs/update old website references #302

Merged
merged 2 commits into from
Jan 31, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 3 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,13 +7,13 @@

Try it out on the [HuggingFace Space](https://huggingface.co/spaces/speaches-ai/speaches)

See the documentation for installation instructions and usage: [https://speaches-ai.github.io/speaches/](https://speaches-ai.github.io/speaches/)
See the documentation for installation instructions and usage: [speaches.ai](https://speaches.ai/)

## Features:

- GPU and CPU support.
- [Deployable via Docker Compose / Docker](https://speaches-ai.github.io/speaches/installation/)
- [Highly configurable](https://speaches-ai.github.io/speaches/configuration/)
- [Deployable via Docker Compose / Docker](https://speaches.ai/installation/)
- [Highly configurable](https://speaches.ai/configuration/)
- OpenAI API compatible. All tools and SDKs that work with OpenAI's API should work with `speaches`.
- Streaming support (transcription is sent via SSE as the audio is transcribed. You don't need to wait for the audio to fully be transcribed before receiving it).

Expand All @@ -40,7 +40,6 @@ TODO

https://github.com/user-attachments/assets/0021acd9-f480-4bc3-904d-831f54c4d45b


### Live Transcription (using WebSockets)

https://github.com/fedirz/faster-whisper-server/assets/76551385/e334c124-af61-41d4-839c-874be150598f
4 changes: 2 additions & 2 deletions docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,8 +23,8 @@
- Dynamic model loading / offloading. Just specify which model you want to use in the request and it will be loaded automatically. It will then be unloaded after a period of inactivity.
- Text-to-Speech via `kokoro`(Ranked #1 in the [TTS Arena](https://huggingface.co/spaces/Pendrokar/TTS-Spaces-Arena)) and `piper` models.
- GPU and CPU support.
- [Deployable via Docker Compose / Docker](https://speaches-ai.github.io/speaches/installation/)
- [Highly configurable](https://speaches-ai.github.io/speaches/configuration/)
- [Deployable via Docker Compose / Docker](https://speaches.ai/installation/)
- [Highly configurable](https://speaches.ai/configuration/)
- [Coming soon](https://github.com/speaches-ai/speaches/issues/115): Realtime API | [OpenAI Documentation](https://platform.openai.com/docs/guides/realtime)

Please create an issue if you find a bug, have a question, or a feature suggestion.
Expand Down
10 changes: 5 additions & 5 deletions docs/usage/voice-chat.md
Original file line number Diff line number Diff line change
@@ -1,14 +1,14 @@
!!! note

Before proceeding, you should be familiar with [OpenAI Audio Generation Guide](https://platform.openai.com/docs/guides/audio). The guide explains how the API works and provides examples on how to use. Unless stated otherwise in [Limitations](#limitations) if a feature is supported by OpenAI, it should be supported by this project as well.
Before proceeding, you should be familiar with [OpenAI Audio Generation Guide](https://platform.openai.com/docs/guides/audio). The guide explains how the API works and provides examples on how to use. Unless stated otherwise in [limitations](#limitations) if a feature is supported by OpenAI, it should be supported by this project as well.

## Prerequisites

Follow the prerequisites in the [Text-to-Speech](./text-to-speech.md) guide. And set the following environmental variables:

- `CHAT_COMPLETION_BASE_URL` to the base url of an OpenAI API compatible endpoint | [Config](../configuration.md)
- `CHAT_COMPLETION_MODEL` to the name of the model you'd like to use. | [Config](../configuration.md)
- `CHAT_COMPLETION_API_KEY` if the API you are using requires authentication | [Config](../configuration.md)
- `CHAT_COMPLETION_BASE_URL` to the base URL of an OpenAI API compatible endpoint | [Config](../configuration.md#speaches.config.Config.chat_completion_base_url)
- `CHAT_COMPLETION_MODEL` to the name of the model you'd like to use. | [Config](../configuration.md#speaches.config.Config.chat_completion_model)
- `CHAT_COMPLETION_API_KEY` if the API you are using requires authentication | [Config](../configuration.md#speaches.config.Config.chat_completion_api_key)

Ollama example:

Expand Down Expand Up @@ -143,4 +143,4 @@ openai_client.chat.completions.create(
- User's input audio message are not cached. That means the user's input audio message will be transcribed each time it sent. This can be a performance issue when doing long multi-turn conversations.
- Multiple choices (`n` > 1) are not supported

This features utilizes [./text-to-speech.md](Text-to-Speech) and [./speech-to-text.md](Speech-to-Text) features. Therefore, the limitations of those features apply here as well.
This features utilizes [Text-to-Speech](./text-to-speech.md) and [Speech-to-Text](./speech-to-text.md) features. Therefore, the limitations of those features apply here as well.
4 changes: 2 additions & 2 deletions src/speaches/ui/app.py
Original file line number Diff line number Diff line change
Expand Up @@ -14,9 +14,9 @@ def create_gradio_demo(config: Config) -> gr.Blocks:
gr.Markdown(
"### Consider supporting the project by starring the [speaches-ai/speaches repository on GitHub](https://github.com/speaches-ai/speaches)."
)
gr.Markdown("### Documentation Website: https://speaches-ai.github.io/speaches")
gr.Markdown("### Documentation Website: https://speaches.ai")
gr.Markdown(
"### For additional details regarding the parameters, see the [API Documentation](https://speaches-ai.github.io/speaches/api)"
"### For additional details regarding the parameters, see the [API Documentation](https://speaches.ai/api)"
)

create_audio_chat_tab(config)
Expand Down