GitHub - liuhuang31/HiFTNet-sr: HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz

Pre-requisites

Python == 3.8
Clone this repository.
Install python requirements. Please refer requirements.txt
Download a 48k dataset, such as genshin or VCTK.

Training

python train.py \
--config config_v1_16k_to_48k.json \
--input_wavs_dir VCTK-Corpus/wav48/,genshin --checkpoint_path exp/v1_16k_to_48k/

To train 24k_to_48k, replace config_v1_16k_to_48k.json with config_v1_24k_to_48k.json.
Checkpoints and copy of the configuration file are saved in checkpoint_path directory by default.
You can change the path by adding --checkpoint_path option.

SR model sample theory

The hifigan means hiftnet here.

SR results

Dir of gen_from_wav is the generated wavs, which sound good, and better than hifigan-sr.

origin 16k mel-spectrum
generated 48k mel-spectrum

Pretrained Model

The pretrained models provided is in "exp/v1_16k_to_48k/g_bst", trained with StarRail_Datasets and VCTK.
For i don't have GPU resources, a kind person(@Lucy) train config_v1_16k_to_48k version and trained stop at 300k, maybe training to 800k is better.

Inference from wav file

Make test_files directory and copy wav files into the directory.

Run the following command.

# in inference.py, change the 'cp_path' param to your checkpoint dir.
python inference.py

Generated wav files are saved in generated_files by default.
You can change the path by adding --output_dir option.

Reference

Our repository is heavily based on yl4579 's HiFTNet.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LJSpeech-BZNSYP-16k		LJSpeech-BZNSYP-16k
Utils		Utils
exp/v1_16k_to_48k		exp/v1_16k_to_48k
gen_from_wav		gen_from_wav
images		images
LICENSE		LICENSE
README.md		README.md
config_v1.json		config_v1.json
config_v1_16k_to_48k.json		config_v1_16k_to_48k.json
config_v1_24k.json		config_v1_24k.json
config_v1_24k_to_48k.json		config_v1_24k_to_48k.json
env.py		env.py
inference.ipynb		inference.ipynb
inference.py		inference.py
meldataset.py		meldataset.py
models.py		models.py
requirements.txt		requirements.txt
stft.py		stft.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pre-requisites

Training

SR model sample theory

SR results

Pretrained Model

Inference from wav file

Reference

About

Releases

Packages

Languages

License

liuhuang31/HiFTNet-sr

Folders and files

Latest commit

History

Repository files navigation

Pre-requisites

Training

SR model sample theory

SR results

Pretrained Model

Inference from wav file

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages