express_mode_420 t1_j7w3mrm wrote
Reply to comment by CeFurkan in [D] Are there any AI model that I can use to improve very bad quality sound recording? Removing noise and improving overall quality by CeFurkan
Could you speech-to-text your lecture, collecting timestamps, do the same with TTS and automagically sync that way?
CeFurkan OP t1_j7whf7d wrote
i have vtt file you know the subtitles we use for movies
but i haven't found and text to speech that can generate speech with that timing
do you know any?
​
about your suggested approach, any way to automatically do it? i mean we generate speech then we sync but how?
express_mode_420 t1_j7wizoa wrote
I'm not sure how I'd go about syncing it, but would this be an adequate workaround:
- break apart your script in small chunks by time stamp
- generate different tts recordings off of each time stamp
- generate an audio file that inserts each of the produced recordings at their respective time-stamped location
- replace the audio of the recording with your newly produced recording
CeFurkan OP t1_j7wsy5f wrote
so it is a logical layout
any software that can do it?
express_mode_420 t1_j7wya6a wrote
I think this is more likely a task for Python. I haven't done anything like this myself, it's just the approach I would start with.
CeFurkan OP t1_j7yjgw6 wrote
if only i were not a c# programmer but a python programmer :/
express_mode_420 t1_j7z394g wrote
Check out murf.ai, that service works similarly to what i described
CeFurkan OP t1_j81neng wrote
tested looks awesome but i have to purchase yearly plan which is 3500$ lol :D
Viewing a single comment thread. View all comments