Submitted by CeFurkan t3_1091e54 in MachineLearning
Atom_101 t1_j3voh2m wrote
Checkout tortoise-tts
CeFurkan OP t1_j3vro9v wrote
>ortoise-tts
thanks i will check out
it can produce my voice?
Atom_101 t1_j3vwoid wrote
Yeah. It supports zero shot voice cloning using a reference clip.
Elleo t1_j3w67mx wrote
It's worth noting that it's still heavily influenced by whatever the initial training data is. I had a play with the model here: https://replicate.com/afiaka87/tortoise-tts and everything comes out with an American accent.
CeFurkan OP t1_j3w9hu6 wrote
Are you able to generate speech based on given timings like providing a str, vtt file or convert speech audio into equivalent timed speech?
​
ty so much for answers.
Cultural_Phone4060 t1_j3yqpaz wrote
We put up a free, open source API for tortoise recently, will be adding improvements to this over time & appreciate contributions: https://github.com/metavoicexyz/tortoise-tts-modal-api
Currently it can't hit timings of an srt file, what are you looking to achieve exactly... I can probably help out?
Viewing a single comment thread. View all comments