Run this project on Ubuntu 24.04 with cuda 12. #335

CyberT33N · 2024-11-26T23:22:53Z

I can not run this project the this because I get:

RuntimeError: Library libcublas.so.11 is not found or cannot be loaded

I use the code of demo_part3.ipynb and installed everything as explained in USAGE.md

Related to:

libcudnn error. #225

Any ideas?

However, I manged to get get it working with CPU as workaround if somebody has the same problems

Install:

cd ~/Projects/ai
git clone https://github.com/myshell-ai/OpenVoice

pyenv local 3.9

python3 -m venv venv
source venv/bin/activate

pip install -e .

wget https://myshell-public-repo-host.s3.amazonaws.com/openvoice/checkpoints_v2_0417.zip
unzip checkpoints_v2_0417.zip

pip install git+https://github.com/myshell-ai/MeloTTS.git
python -m unidic download

python
# then enter in python shell
import nltk
nltk.download('averaged_perceptron_tagger')
exit()

Edit /home/userName/Projects/ai/OpenVoice/openvoice/se_extractor.py and change line to:

model = WhisperModel(model_size, device="cpu", compute_type="float32")

Start script:

python main.py

main.py:

import os
import torch
print(torch.__version__)            # Sollte 2.1.0 oder höher sein
print(torch.cuda.is_available())    # Sollte True ausgeben
print(torch.cuda.get_device_name(0))  # Zeigt den Namen deiner GPU an

from openvoice import se_extractor
from openvoice.api import ToneColorConverter

ckpt_converter = 'checkpoints_v2/converter'
device = "cuda:0" if torch.cuda.is_available() else "cpu"
output_dir = 'outputs_v2'

tone_color_converter = ToneColorConverter(f'{ckpt_converter}/config.json', device=device)
tone_color_converter.load_ckpt(f'{ckpt_converter}/checkpoint.pth')

os.makedirs(output_dir, exist_ok=True)

reference_speaker = 'resources/example_reference.mp3' # This is the voice you want to clone
target_se, audio_name = se_extractor.get_se(reference_speaker, tone_color_converter, vad=False)

from melo.api import TTS

texts = {
    'EN_NEWEST': "Did you ever hear a folk tale about a giant turtle?",  # The newest English base speaker model
    'EN': "Did you ever hear a folk tale about a giant turtle?",
    'ES': "El resplandor del sol acaricia las olas, pintando el cielo con una paleta deslumbrante.",
    'FR': "La lueur dorée du soleil caresse les vagues, peignant le ciel d'une palette éblouissante.",
    'ZH': "在这次vacation中，我们计划去Paris欣赏埃菲尔铁塔和卢浮宫的美景。",
    'JP': "彼は毎朝ジョギングをして体を健康に保っています。",
    'KR': "안녕하세요! 오늘은 날씨가 정말 좋네요.",
}


src_path = f'{output_dir}/tmp.wav'

# Speed is adjustable
speed = 1.0

for language, text in texts.items():
    model = TTS(language=language, device=device)
    speaker_ids = model.hps.data.spk2id
    
    for speaker_key in speaker_ids.keys():
        speaker_id = speaker_ids[speaker_key]
        speaker_key = speaker_key.lower().replace('_', '-')
        
        source_se = torch.load(f'checkpoints_v2/base_speakers/ses/{speaker_key}.pth', map_location=device)
        model.tts_to_file(text, speaker_id, src_path, speed=speed)
        save_path = f'{output_dir}/output_v2_{speaker_key}.wav'

        # Run the tone color converter
        encode_message = "@MyShell"
        tone_color_converter.convert(
            audio_src_path=src_path, 
            src_se=source_se, 
            tgt_se=target_se, 
            output_path=save_path,
            message=encode_message)

The text was updated successfully, but these errors were encountered:

lukaLLM · 2024-12-11T09:56:53Z

I have same issue

hongleng · 2024-12-23T11:51:51Z

Same issue, too. v1 works fine.

ground-creative · 2025-01-02T06:13:25Z

Try with a docker image maybe.

https://github.com/ground-creative/openvoice-docker

ER404R · 2025-01-07T08:43:02Z

Try with a docker image maybe.

https://github.com/ground-creative/openvoice-docker

Did this work for anyone yet ?

dosubot bot added the bug Something isn't working label Nov 26, 2024

CyberT33N changed the title ~~RUn this project on Ubuntu 24.04 with cuda 12.~~ Run this project on Ubuntu 24.04 with cuda 12. Nov 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run this project on Ubuntu 24.04 with cuda 12. #335

Run this project on Ubuntu 24.04 with cuda 12. #335

CyberT33N commented Nov 26, 2024 •

edited

Loading

lukaLLM commented Dec 11, 2024

hongleng commented Dec 23, 2024

ground-creative commented Jan 2, 2025 •

edited

Loading

ER404R commented Jan 7, 2025

Run this project on Ubuntu 24.04 with cuda 12. #335

Run this project on Ubuntu 24.04 with cuda 12. #335

Comments

CyberT33N commented Nov 26, 2024 • edited Loading

lukaLLM commented Dec 11, 2024

hongleng commented Dec 23, 2024

ground-creative commented Jan 2, 2025 • edited Loading

ER404R commented Jan 7, 2025

CyberT33N commented Nov 26, 2024 •

edited

Loading

ground-creative commented Jan 2, 2025 •

edited

Loading