Atom_101 t1_j3voh2m wrote on January 11, 2023 at 11:29 AM

Check out tortoise-tts

CeFurkan OP t1_j3vro9v wrote on January 11, 2023 at 12:06 PM

>tortoise-tts

Thanks, I will check it out.

Can it produce my voice?

Atom_101 t1_j3vwoid wrote on January 11, 2023 at 12:55 PM

Yeah. It supports zero-shot voice cloning using a reference clip.

Elleo t1_j3w67mx wrote on January 11, 2023 at 2:13 PM

It's worth noting that it's still heavily influenced by whatever the initial training data is. I had a play with the model here: https://replicate.com/afiaka87/tortoise-tts and everything comes out with an American accent.

CeFurkan OP t1_j3w9hu6 wrote on January 11, 2023 at 2:36 PM

Are you able to generate speech based on given timings, for example by providing an srt or vtt file, or to convert speech audio into equivalent timed speech?

Thank you so much for the answers.

Cultural_Phone4060 t1_j3yqpaz wrote on January 11, 2023 at 11:53 PM

We recently put up a free, open-source API for tortoise. We will be adding improvements to it over time and appreciate contributions: https://github.com/metavoicexyz/tortoise-tts-modal-api

Currently it can't hit the timings of an srt file. What are you looking to achieve exactly? I can probably help out.
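
For anyone wondering what "hitting the timings of an srt file" would involve, here is a rough sketch of the first step only: reading the cue timings out of the file. It uses just the Python standard library and is not part of tortoise-tts or the API linked above. The names `Cue` and `parse_srt` are made up for this example, and it assumes the usual srt layout (index line, `HH:MM:SS,mmm --> HH:MM:SS,mmm` line, then one or more text lines, with a blank line between cues).

```python
import re
from typing import List, NamedTuple


class Cue(NamedTuple):
    """One subtitle cue (hypothetical helper type for this sketch)."""
    start: float  # seconds from the start of the audio
    end: float    # seconds from the start of the audio
    text: str


# Matches a timing line such as "00:00:01,000 --> 00:00:02,500".
_TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2})[,.](\d{3})"
)


def _seconds(h: str, m: str, s: str, ms: str) -> float:
    """Convert hour/minute/second/millisecond fields to seconds."""
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000


def parse_srt(content: str) -> List[Cue]:
    """Parse srt text into a list of cues with start/end times in seconds."""
    cues = []
    normalized = content.replace("\r\n", "\n").strip()
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        for i, line in enumerate(lines):
            match = _TIMING.search(line)
            if match:
                g = match.groups()
                # Everything after the timing line is the spoken text.
                text = " ".join(l.strip() for l in lines[i + 1:] if l.strip())
                cues.append(Cue(_seconds(*g[:4]), _seconds(*g[4:]), text))
                break
    return cues
```

Each cue's text could then be synthesized on its own and the resulting clip placed at `cue.start`. Making a clip fit inside `cue.end - cue.start` (speeding it up, trimming, or padding with silence) is the hard part and is not covered here.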