This repository contains STTS utilities which I use primarily for VRChat. By using the scripts available in this repository, you can use your own microphone to speak, but the output will be an AI-generated voice instead of your own voice.
Before using the STTS utilities, make sure you have the following prerequisites installed and set up:
-
VB-Audio CABLE: Install VB-Audio CABLE, as it is required to forward the audio properly to your games.
-
Docker: Install Docker, as it is required for the Voicevox engine.
-
Python: Ensure you have at least Python 3.10.6 installed. You can download it from the Python website.
-
API Access: Obtain API access to DeepL and OpenAI. You will need the API keys for these services.
Follow the steps below to use the STTS utilities:
-
If you intend to use the Japanese STTS, install Voicevox by running
start.bat
. If you're not planning to use it, you can skip this step. -
Install the Python dependencies by running
install_dependencies.bat
. -
If you're using the Japanese voice, start the Voicevox engine by running
start.bat
. If you're not using it, you can skip this step. -
Configure your API keys and audio devices by modifying the
.env.sample
file. Save the changes and rename the file to.env
. -
Start your preferred STTS script, either the English or Japanese version.
-
To record yourself, press the 'V' key while speaking. Release the key when you're done speaking. The audio will be transcribed, converted into an AI voice, and played through your virtual microphone.
Note: In your game settings, make sure to set VB-Audio CABLE as your microphone so that others can hear you. License
This repository is licensed under the MIT license.
Contributions are always welcome, if you got anything to improve, feel free to make a PR.