express_mode_420 t1_j7w3mrm wrote on February 9, 2023 at 8:39 PM

Could you speech-to-text your lecture, collecting timestamps, do the same with TTS and automagically sync that way?

CeFurkan OP t1_j7whf7d wrote on February 9, 2023 at 10:05 PM

i have vtt file you know the subtitles we use for movies

but i haven't found and text to speech that can generate speech with that timing

do you know any?

about your suggested approach, any way to automatically do it? i mean we generate speech then we sync but how?

I'm not sure how I'd go about syncing it, but would this be an adequate workaround:

break apart your script in small chunks by time stamp
generate different tts recordings off of each time stamp
generate an audio file that inserts each of the produced recordings at their respective time-stamped location
replace the audio of the recording with your newly produced recording

so it is a logical layout

any software that can do it?

I think this is more likely a task for Python. I haven't done anything like this myself, it's just the approach I would start with.

if only i were not a c# programmer but a python programmer :/

Check out murf.ai, that service works similarly to what i described

tested looks awesome but i have to purchase yearly plan which is 3500$ lol :D