Elleo t1_j3w67mx wrote on January 11, 2023 at 2:13 PM

Reply to comment by Atom_101 in [D] Any model like VALL-E available currently? by CeFurkan

It's worth noting that it's still heavily influenced by whatever the initial training data is. I had a play with the model here: https://replicate.com/afiaka87/tortoise-tts and everything comes out with an American accent.

CeFurkan OP t1_j3w9hu6 wrote on January 11, 2023 at 2:36 PM

Are you able to generate speech based on given timings like providing a str, vtt file or convert speech audio into equivalent timed speech?

ty so much for answers.