Merge remote-tracking branch 'andabi/master'
andabi committed May 24, 2018
2 parents 7338454 + 55cbeaa commit d482908
Showing 1 changed file with 3 additions and 15 deletions.
18 changes: 3 additions & 15 deletions README.md
@@ -1,11 +1,9 @@
# Voice Conversion with Non-Parallel Data
## Subtitle: Speaking like Kate Winslet
>* This is the first draft.
>* Authors: Dabi Ahn([email protected]), [Kyubyong Park](https://github.com/Kyubyong)([email protected])
>* We always welcome any questions, new ideas, or contributions.
> Authors: Dabi Ahn([email protected]), [Kyubyong Park](https://github.com/Kyubyong)([email protected])
## Samples
It's not perfect yet, but listen to [them](https://soundcloud.com/andabi/sets/voice-style-transfer-to-kate-winslet-with-deep-neural-networks).
https://soundcloud.com/andabi/sets/voice-style-transfer-to-kate-winslet-with-deep-neural-networks

## Intro
What if you could imitate a famous celebrity's voice or sing like a famous singer?
@@ -90,16 +88,6 @@ Net2 contains Net1 as a sub-network.
* IMHO, the accuracy of Net1(phoneme classification) does not need to be so perfect.
* Net2 can reach to near optimal when Net1 accuracy is correct to some extent.

## Future Works
* Adversarial training
* Expecting to generate sharper and cleaner voice.
* Cross lingual

## Ultimate Goals
* Many-to-Many(Multi target speaker) voice conversion system
* VC without training set of target voice, but only small set of target voice (1 min)
* (On going)

## References
* ["Phonetic posteriorgrams for many-to-one voice conversion without parallel data training"](https://www.researchgate.net/publication/307434911_Phonetic_posteriorgrams_for_many-to-one_voice_conversion_without_parallel_data_training), 2016 IEEE International Conference on Multimedia and Expo (ICME)
* ["TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS"](https://arxiv.org/abs/1703.10135), Submitted to Interspeech 2017
* ["TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS"](https://arxiv.org/abs/1703.10135), Submitted to Interspeech 2017