Viewing a single comment thread. View all comments

Raaaaaav t1_j9jfmc7 wrote

I don't know if it is the current best AI to clone voices, but there is a zero-shot model named YOUR-TTS, it has pre-trained weights available and you only need around 1 min of your voice to make it sound quite similar. But you can always retrain it with more samples of your voice to improve the performance even more.

https://github.com/Edresson/YourTTS

I think it was also added to the Coqui-TTS toolkit.

https://github.com/coqui-ai/TTS

However I only played around with the demos in the original repo, and therefore don't know how to use it if you are serious about voice cloning.

1