Comments

You must log in or register to comment.

Atom_101 t1_j3voh2m wrote

Checkout tortoise-tts

10

CeFurkan OP t1_j3vro9v wrote

>ortoise-tts

thanks i will check out

it can produce my voice?

1

Atom_101 t1_j3vwoid wrote

Yeah. It supports zero shot voice cloning using a reference clip.

5

Elleo t1_j3w67mx wrote

It's worth noting that it's still heavily influenced by whatever the initial training data is. I had a play with the model here: https://replicate.com/afiaka87/tortoise-tts and everything comes out with an American accent.

3

CeFurkan OP t1_j3w9hu6 wrote

Are you able to generate speech based on given timings like providing a str, vtt file or convert speech audio into equivalent timed speech?

​

ty so much for answers.

1

mamafied t1_j3w66d9 wrote

check coqui TTS they have all kinds of models and they own yourtts compared in the paper. It is also way faster than tortoisetts

3

CeFurkan OP t1_j3w9pp0 wrote

>coqui TTS

thanks a lot i should test

1

YouDamnHotdog t1_j4vbbf4 wrote

Man, that doesn't work at aaaaaall. Sounds like the worst robot and nothing like me

1

sayoonarachu t1_j3z22kh wrote

Other than tortoise tts as mentioned above, probably best to watch the Microsoft github page. They have a section for vall-e and they do tend to release some of their source codes for their other models.

Might take a while as the paper was just publish like a week and and still says, "work in progress."

https://github.com/microsoft/unilm/blob/master/valle/README.md

3

CeFurkan OP t1_j40lohp wrote

I hope they release model. without model source code useless since i don't have gpu power or dataset to train :(

1