Skip to content

Commit

Permalink
Update docs for vits list (#20)
Browse files Browse the repository at this point in the history
* add vits filelist support

* add vits filelist support

* Update create_train_split.py

* Update create_train_split.py

* Add files via upload

* Add files via upload

* Add files via upload

* Add files via upload

* fix list not subscriptable

* fix list not subscriptable

* fix path lib

* Add files via upload

* Add files via upload

* fix parent

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add files via upload

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add files via upload

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Add files via upload

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add vits format list docs

* Update finetune.md

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Leng Yue <[email protected]>
  • Loading branch information
3 people authored Dec 20, 2023
1 parent a2e3f6e commit 744622d
Showing 1 changed file with 12 additions and 2 deletions.
14 changes: 12 additions & 2 deletions docs/zh/finetune.md
Original file line number Diff line number Diff line change
Expand Up @@ -38,7 +38,12 @@
python tools/vqgan/create_train_split.py data/demo
```

该命令会在 `data/demo` 目录下创建 `data/demo/vq_train_filelist.txt``data/demo/vq_val_filelist.txt` 文件, 分别用于训练和验证.

该命令会在 `data/demo` 目录下创建 `data/demo/vq_train_filelist.txt``data/demo/vq_val_filelist.txt` 文件, 分别用于训练和验证.

!!!info
对于 VITS 格式, 你可以使用 `--filelist xxx.list` 来指定文件列表.
请注意, `filelist` 所指向的音频文件必须也位于 `data/demo` 文件夹下.

### 3. 启动训练

Expand Down Expand Up @@ -109,7 +114,8 @@ python tools/vqgan/extract_vq.py data/demo \
```

!!! note
你可以调整 `--num-workers``--batch-size` 来提高提取速度, 但是请注意不要超过你的显存限制.
你可以调整 `--num-workers``--batch-size` 来提高提取速度, 但是请注意不要超过你的显存限制.
对于 VITS 格式, 你可以使用 `--filelist xxx.list` 来指定文件列表.

该命令会在 `data/demo` 目录下创建 `.npy` 文件, 如下所示:

Expand Down Expand Up @@ -137,10 +143,14 @@ python tools/vqgan/extract_vq.py data/demo \
python tools/llama/build_dataset.py \
--config "fish_speech/configs/data/finetune.yaml" \
--output "data/quantized-dataset-ft.protos"
--num_workers 16
```

命令执行完毕后, 你应该能在 `data` 目录下看到 `quantized-dataset-ft.protos` 文件.

!!! note
对于 VITS 格式, 你可以使用 `--filelist xxx.list` 来指定文件列表.

### 4. 启动 Rust 数据服务器

由于加载和打乱数据集非常缓慢且占用内存, 因此我们使用 rust 服务器来加载和打乱数据. 该服务器基于 GRPC, 可以通过以下方式安装:
Expand Down

0 comments on commit 744622d

Please sign in to comment.