Submitted by CeFurkan t3_1091e54 in MachineLearning
Elleo t1_j3w67mx wrote
Reply to comment by Atom_101 in [D] Any model like VALL-E available currently? by CeFurkan
It's worth noting that it's still heavily influenced by whatever the initial training data is. I had a play with the model here: https://replicate.com/afiaka87/tortoise-tts and everything comes out with an American accent.
CeFurkan OP t1_j3w9hu6 wrote
Are you able to generate speech based on given timings like providing a str, vtt file or convert speech audio into equivalent timed speech?
​
ty so much for answers.
Viewing a single comment thread. View all comments