Submitted by Intelligent_Rough_21 t3_1056c2e in MachineLearning
Out-of-copyright books only, of course.
Hi, I was wondering if I could fine-tune a GPT-3 model to take a book, likely in HTML, Markdown, or plain text, and convert it to SSML. To do that, I would need a bunch of hand-made SSML files to fine-tune the model on. I've already got some tooling to split the book up and do the formatting (pandoc, csplit), and then I could use AWS Polly or one of the other services to do really good text-to-speech.
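For context, here is a rough sketch of the kind of baseline conversion a fine-tuned model would have to beat. It is not the model-based approach described above, just a rule-based fallback: it wraps each blank-line-separated paragraph in `<p>` and each sentence in `<s>`, inside a `<speak>` root. The function name `text_to_ssml` and the naive sentence-splitting rule are my own assumptions for illustration.

```python
import re
from xml.sax.saxutils import escape


def text_to_ssml(text: str) -> str:
    """Wrap plain text in minimal SSML (hypothetical helper, not a library API).

    One <p> per blank-line-separated paragraph, one <s> per sentence.
    """
    # Paragraphs are separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    parts = ["<speak>"]
    for para in paragraphs:
        # Collapse hard line wraps inside a paragraph into single spaces.
        para = " ".join(para.split())
        # Naive sentence split: break after ., ! or ? followed by whitespace.
        # This will mis-split abbreviations such as "Mr. Smith".
        sentences = re.split(r"(?<=[.!?])\s+", para)
        # Escape &, < and > so the result stays well-formed XML.
        body = "".join(f"<s>{escape(s)}</s>" for s in sentences if s)
        parts.append(f"<p>{body}</p>")
    parts.append("</speak>")
    return "".join(parts)


if __name__ == "__main__":
    print(text_to_ssml("Call me Ishmael. Some years ago.\n\nTom & Jerry"))
```

Anything smarter than this (emphasis, pauses, character voices) is where hand-made SSML examples would actually matter.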
Anyone have a dataset?
References:
geneing t1_j3e1573 wrote
I looked for it once years ago, but couldn't find any. I don't think it's needed anymore. Current TTS systems based on neural networks are really good at producing speech with the right intonation from just the text.