CeFurkan OP t1_j7tnxp9 wrote on February 9, 2023 at 9:12 AM

yep not confidental

how can I reach you? here my email : monstermmorpg@gmail.com

CeFurkan OP t1_j7tob1u wrote on February 9, 2023 at 9:17 AM

>Nvidia RTX voice

example link that you can download extract audio quickly if you wish : https://youtu.be/2zY1dQDGl3o

also here 5 min example speech : https://sndup.net/stjs/

logsinh t1_j7tqjmm wrote on February 9, 2023 at 9:50 AM

The audio is a bit distorted possibly due to noise gating. I don't see too much noise, so maybe noise reduction is not what you need. The audio has 8 kHz bandwidth (16 kHz sample rate), maybe you may try to use an audio super-resolution network such as https://github.com/mindslab-ai/nuwave2 to increase the audio bandwidth.

CeFurkan OP t1_j7tr5at wrote on February 9, 2023 at 9:58 AM

yes i had tried some options obs back in time. it was probably noise gate. even i forgotten it.

thank you so much for reply gonna test that repo now

CeFurkan OP t1_j7trlei wrote on February 9, 2023 at 10:05 AM

their example really good improvement but do i need training for that?

opened an issue thread but not much hope : https://github.com/mindslab-ai/nuwave2/issues/11

logsinh t1_j7tu0x1 wrote on February 9, 2023 at 10:39 AM

Just download the checkpoint and use the command at Inference session. sr should be 16000

CeFurkan OP t1_j7tvbny wrote on February 9, 2023 at 10:57 AM

thanks i made it work

however i got out of memory error on RTX 3060 - 12 GB vram

it is like a joke :/

https://i.imgur.com/KslqNBg.png

logsinh t1_j7uw4wb wrote on February 9, 2023 at 4:10 PM

Process with a sliding window would solve your problem, see e.g. https://colab.research.google.com/github/asteroid-team/asteroid/blob/master/notebooks/04_ProcessLargeAudioFiles.ipynb

CeFurkan OP t1_j7vouon wrote on February 9, 2023 at 7:10 PM

thanks

no idea where to put this code in nuwave2

logsinh t1_j7tsvku wrote on February 9, 2023 at 10:23 AM

Anyway, here is the denoised audio of your example speech: https://www.sndup.net/pbxf/. There is no improvement, your best bet is audio super-resolution.

Input: Speech MOS: 4.259 Noise MOS: 4.369 Overall MOS: 3.927

Output: Speech MOS: 4.263 Noise MOS: 4.403 Overall MOS: 3.947

CeFurkan OP t1_j7ttvbm wrote on February 9, 2023 at 10:37 AM

>audio super-resolution

thank you so much for answers and testing

any idea to get super resolution ? or my only option is mindslab-ai/nuwave2 ?

[deleted] t1_j7vl6xh wrote on February 9, 2023 at 6:47 PM

[removed]

No_Network_3714 t1_j87lp29 wrote on February 12, 2023 at 6:05 AM

I am also interested in having you process two recordings. They both a little over 40 minutes in length. If you feel you can do this, please contact me at (email address removed). Thanks.

No_Network_3714 t1_j89zlcu wrote on February 12, 2023 at 7:38 PM

Thought I had previously replied. I am also interested in letting you to try and clean up my two audio files, or know when it goes public. The are both over 40 minutes, were recorded in a car and the microphone was held too close

logsinh t1_j8cnqzr wrote on February 13, 2023 at 9:01 AM

Pls upload it somewhere, preferably, wav format. I will do it when I have time.

No_Network_3714 t1_j92iku0 wrote on February 18, 2023 at 7:09 PM

Thank you. I have uploaded the two audio files in a wav format to Google docs but will need an email address in order to share this with you. How do you suggest you get that information to me?

[D] Are there any AI model that I can use to improve very bad quality sound recording? Removing noise and improving overall quality

logsinh t1_j7t1cna wrote on February 9, 2023 at 4:48 AM