Comments
[deleted] OP t1_jcuny15 wrote
There’s a behind the scenes video, but what they did was they had an AI program watch tons of Nixon speeches to determine what his voice sounded like, had an actor deliver the speech, then had the bot change Nixon’s mouth movements (the visual they used was actually his resignation speech) and change the actor’s voice to Nixon’s.
It was done to raise awareness about deepfakes by creating what would have been a very memorable moment for the entire world that actually never happened.
brkh47 t1_jcup13r wrote
Designer-Edge-6505 t1_jcupfn7 wrote
Well, I'm pretty sure if we could do that, we would have gotten Morgan Freeman to narrate our lives by now.
otclogic t1_jcupgmk wrote
It needs a massive sample for it to sound convincing. There’s tons of audio deepfakes of Obama, Trump, Biden, and Ben Shapiro playing video games together. Between unserious performances by whoever is speaking their lines and the fact that even with hundreds or thousands of hours of publicly available dialogue it’s still difficult to get publicly available deepfake models to portray those people smoothly and convincingly.
SendMeNudesThough t1_jcurebu wrote
Have you missed the entire ElevenLabs thing and the voice cloning drama?
David Attenburough's voice narrating Fallout creatures
Emma Watson's voice reading Mein Kampf by Adolf Hitler
Donald Trump and President Biden debating Clone Troopers from Star Wars
George Lucas talking about Jar Jar Binks
Rod Serling narrating a NSFW Twilight Zone episode
With clear enough samples, ElevenLabs can pretty convincingly clone a voice and you can make them say anything.
Ok_Copy5217 t1_jcurs45 wrote
like can I clone the voice of my group member and have him "record" a presentation for class since he is absent?
SendMeNudesThough t1_jcus149 wrote
Sure, give the AI enough clear voice samples and it can clone anyone's voice
GhettoChemist t1_jcutz1r wrote
Holy shit i never knew about this disaster!
[deleted] OP t1_jcuup18 wrote
[deleted]
TheManInTheShack t1_jcuwxc7 wrote
Didn’t sound that much like Nixon to me.
GonWithTheNen t1_jcuxjtt wrote
Concerning Nixon's voice: 7 years ago, Adobe showcased a program called Audio Manipulator that could emulate anyone's voice and speech patterns. It only required a 20 minute audio sample of that person's speech.
It's no surprise that the development of this kind of audio replication was going to stick around, and now it's being added to deep fake videos. Interesting tech... and scary at the same time.
PM_ur_Rump t1_jcuyshe wrote
Don't forget Tucker Carlson talking about Vaporeon being the most fuckable Pokemon.
Iz-kan-reddit t1_jcuz2a9 wrote
You can't just say that and not post a link.
SendMeNudesThough t1_jcuzcr8 wrote
Well in the case of that one there's just no way to tell whether that's real or AI
AnthillOmbudsman t1_jcuze6v wrote
OK that video is pretty annoying, it's about 90% the dude making the production. I have no idea how well the AI worked.
PM_ur_Rump t1_jcuzffm wrote
The one I had is gone, so trying to find it now.
Iz-kan-reddit t1_jcv0nh4 wrote
Doing the Lord's work!
[deleted] OP t1_jcv5vbt wrote
Because it never happened. This video was made to illustrate the dangers of deepfakes by creating a record of an event that didn’t occur.
sincle354 t1_jcv7v93 wrote
Elevenlabs' AI voice only needs 5 minutes for maximum effect now. Hell, someone took a very minor character with 23 seconds of dialogue (Morshu) and created a convincing meme with it.
GonWithTheNen t1_jcvaxrq wrote
Interesting info, thanks! I figured back then that the required length of audio samples would be greatly reduced over time. Honestly, with tech's fast advancements, it's less of a surprise and more of what's expected.
SubstantialPressure3 t1_jcvj5cw wrote
Never heard about it. Probably all it did was feed the "American moon landing was fake" conspiracy.
Lady-Maya t1_jcvs0gf wrote
Prowland12 t1_jcw0z6q wrote
Good thing they had a lot of audio samples of Nixon talking for some reason.
aitchnyu t1_jcwjhav wrote
They should have used all the speeches as revealed by xkcd, "when astronauts abscond with craft", "when they encounter aliens" etc
Cardioth t1_jcwnqke wrote
1/10 bad fake
KippieDaoud t1_jcwvtd5 wrote
wow besides the weirdly blurred mouth and that this speech made more sense than his usual bs, it is really hard to spot it as fake
FyreWulff t1_jcwxx2m wrote
If they could get about 17-20 minutes more audio though, they could get it absolutely perfect.
p_a_schal t1_jcx1u67 wrote
Thank you so much
notsocoolnow t1_jcximnv wrote
Right now people are using Elevenlabs to emulate the voice of Dagoth Ur, a character in the Elder Scrolls IV Morrowind with only a handful of voice lines, to make funny meme videos.
The capability of the AI tool is quite impressive even with small samples, though the small number of available samples does mean people are also depending on that small sample to get an impression of what the character sounds like.
I imagine it would be different for a character with a large number of samples (which viewers are familiar with) that you only fed a handful of to the AI.
theb0tman t1_jcydjzh wrote
Underappreciated comment
GonWithTheNen t1_jd05kbs wrote
Being able to do voice samples "even with small samples" is mind-blowing. Frightening in terms of the possibilities, but mind-blowing.
What I hope all these emulators include is some kind of digital signature (like what Adobe said they'd use) that would detect manipulated audio and the real person's voice.
I mean, emulation seems benign for game-related memes and similar media where any player can open their copy and verify what a voice actor actually said. Other scenarios? Maybe not so much...
Ok_Copy5217 t1_jcunhyo wrote
how did they get Nixon's voice to say new sentences? so does this mean I can upload an audio file of anyone and get it to say whatever I want?