Comments

You must log in or register to comment.

AnthillOmbudsman t1_jcuze6v wrote

OK that video is pretty annoying, it's about 90% the dude making the production. I have no idea how well the AI worked.

73

Lady-Maya t1_jcvs0gf wrote

Finished video for anyone wondering:

Link

4:25 for the actual speech.

29

GonWithTheNen t1_jcuxjtt wrote

Concerning Nixon's voice: 7 years ago, Adobe showcased a program called Audio Manipulator that could emulate anyone's voice and speech patterns. It only required a 20 minute audio sample of that person's speech.

It's no surprise that the development of this kind of audio replication was going to stick around, and now it's being added to deep fake videos. Interesting tech... and scary at the same time.

45

sincle354 t1_jcv7v93 wrote

Elevenlabs' AI voice only needs 5 minutes for maximum effect now. Hell, someone took a very minor character with 23 seconds of dialogue (Morshu) and created a convincing meme with it.

18

GonWithTheNen t1_jcvaxrq wrote

Interesting info, thanks! I figured back then that the required length of audio samples would be greatly reduced over time. Honestly, with tech's fast advancements, it's less of a surprise and more of what's expected.

2

notsocoolnow t1_jcximnv wrote

Right now people are using Elevenlabs to emulate the voice of Dagoth Ur, a character in the Elder Scrolls IV Morrowind with only a handful of voice lines, to make funny meme videos.

The capability of the AI tool is quite impressive even with small samples, though the small number of available samples does mean people are also depending on that small sample to get an impression of what the character sounds like.

I imagine it would be different for a character with a large number of samples (which viewers are familiar with) that you only fed a handful of to the AI.

3

GonWithTheNen t1_jd05kbs wrote

Being able to do voice samples "even with small samples" is mind-blowing. Frightening in terms of the possibilities, but mind-blowing.

What I hope all these emulators include is some kind of digital signature (like what Adobe said they'd use) that would detect manipulated audio and the real person's voice.

I mean, emulation seems benign for game-related memes and similar media where any player can open their copy and verify what a voice actor actually said. Other scenarios? Maybe not so much...

1

Prowland12 t1_jcw0z6q wrote

Good thing they had a lot of audio samples of Nixon talking for some reason.

8

FyreWulff t1_jcwxx2m wrote

If they could get about 17-20 minutes more audio though, they could get it absolutely perfect.

4

Ok_Copy5217 t1_jcunhyo wrote

how did they get Nixon's voice to say new sentences? so does this mean I can upload an audio file of anyone and get it to say whatever I want?

7

[deleted] OP t1_jcuny15 wrote

There’s a behind the scenes video, but what they did was they had an AI program watch tons of Nixon speeches to determine what his voice sounded like, had an actor deliver the speech, then had the bot change Nixon’s mouth movements (the visual they used was actually his resignation speech) and change the actor’s voice to Nixon’s.

It was done to raise awareness about deepfakes by creating what would have been a very memorable moment for the entire world that actually never happened.

34

SendMeNudesThough t1_jcurebu wrote

Have you missed the entire ElevenLabs thing and the voice cloning drama?

David Attenburough's voice narrating Fallout creatures

Emma Watson's voice reading Mein Kampf by Adolf Hitler

Donald Trump and President Biden debating Clone Troopers from Star Wars

George Lucas talking about Jar Jar Binks

Rod Serling narrating a NSFW Twilight Zone episode

With clear enough samples, ElevenLabs can pretty convincingly clone a voice and you can make them say anything.

20

PM_ur_Rump t1_jcuyshe wrote

Don't forget Tucker Carlson talking about Vaporeon being the most fuckable Pokemon.

Edit: https://youtu.be/I1wHDY4DGRI

21

SendMeNudesThough t1_jcuzcr8 wrote

Well in the case of that one there's just no way to tell whether that's real or AI

8

KippieDaoud t1_jcwvtd5 wrote

wow besides the weirdly blurred mouth and that this speech made more sense than his usual bs, it is really hard to spot it as fake

2

Ok_Copy5217 t1_jcurs45 wrote

like can I clone the voice of my group member and have him "record" a presentation for class since he is absent?

2

SendMeNudesThough t1_jcus149 wrote

Sure, give the AI enough clear voice samples and it can clone anyone's voice

3

otclogic t1_jcupgmk wrote

It needs a massive sample for it to sound convincing. There’s tons of audio deepfakes of Obama, Trump, Biden, and Ben Shapiro playing video games together. Between unserious performances by whoever is speaking their lines and the fact that even with hundreds or thousands of hours of publicly available dialogue it’s still difficult to get publicly available deepfake models to portray those people smoothly and convincingly.

11

Designer-Edge-6505 t1_jcupfn7 wrote

Well, I'm pretty sure if we could do that, we would have gotten Morgan Freeman to narrate our lives by now.

3

aitchnyu t1_jcwjhav wrote

They should have used all the speeches as revealed by xkcd, "when astronauts abscond with craft", "when they encounter aliens" etc

2

SubstantialPressure3 t1_jcvj5cw wrote

Never heard about it. Probably all it did was feed the "American moon landing was fake" conspiracy.

−1

GhettoChemist t1_jcutz1r wrote

Holy shit i never knew about this disaster!

−7

[deleted] OP t1_jcv5vbt wrote

Because it never happened. This video was made to illustrate the dangers of deepfakes by creating a record of an event that didn’t occur.

9