1. Audio Super Resolution with Deep Learning models

This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.

The original papers can be found here:

2. Audio Super Resolution via ViT

This model is located in the "ViT-SR" folder. The "GAN" folder contains the code for the original paper; here the ViT is used in a Generative Adversarial Network, while the "Autoencoder" folder implements the Bandwidth Extension using an autoencoder-like architecture where the ViT is part of the encoder. For the autoencoder a checkpoint is available with the weights of the model trained on the FMA dataset. Inside the "out" directory there are some examples of the results obtained with the model.

Better results can be obtaines by using the GAN or by extending the training time for the autoencoder.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
CNN-SR		CNN-SR
ViT-SR		ViT-SR
.gitignore		.gitignore
LICENSE		LICENSE
Readme.md		Readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

1. Audio Super Resolution with Deep Learning models

2. Audio Super Resolution via ViT

About

Releases

Packages

Languages

License

teo-sl/Audio-Super-Resolution-ViT

Folders and files

Latest commit

History

Repository files navigation

1. Audio Super Resolution with Deep Learning models

2. Audio Super Resolution via ViT

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages