Skip to content

Latest commit

 

History

History
11 lines (9 loc) · 994 Bytes

README.md

File metadata and controls

11 lines (9 loc) · 994 Bytes

Audio samples from "Exploring the limits of neural voice cloning: A case study on two well-known personalities"

Authors: Ander González-Docasal, Aitor Álvarez, Haritz Arzelus

Abstract: This work describes one successful and one failed Voice Cloning processes of two famous personalities in order to be broadcast in a high-impact podcast and in a Spanish public television program. Whilst a good quality synthesised voice could be generated for the first public figure, the second one was not adequate enough for its broadcast on television given its low speech quality. In this study, we explore the limits of the neural voice cloning considering the different conditions of the training material employed in each case and, based on several objective measures (volume of data, phoneme occurrence, SNR, PESQ and MCD), we analysed the main features to be considered for a high-quality synthetic voice generation.