WEBVTT - The Story: Liar, Liar, Deepfakes on Fire w/ Hany Farid

0:00:00.040 --> 0:00:02.640
<v Speaker 1>Thanks for tuning in to Tech Stuff. If you don't recognize

0:00:02.680 --> 0:00:05.440
<v Speaker 1>my voice, my name is Oz Woloshyn, and I'm here

0:00:05.480 --> 0:00:08.600
<v Speaker 1>because the inimitable Jonathan Strickland has passed the baton to

0:00:08.720 --> 0:00:12.160
<v Speaker 1>Karah Preiss and myself to host Tech Stuff. The show

0:00:12.160 --> 0:00:15.040
<v Speaker 1>will remain your home for all things tech, and all

0:00:15.080 --> 0:00:20.400
<v Speaker 1>the old episodes will remain available in this feed. Welcome

0:00:20.400 --> 0:00:24.200
<v Speaker 1>to Tech Stuff. This is The Story. Every Wednesday, we

0:00:24.280 --> 0:00:26.880
<v Speaker 1>bring you an in-depth interview with someone at the

0:00:26.920 --> 0:00:30.600
<v Speaker 1>forefront of technology or someone who can unlock a world

0:00:30.640 --> 0:00:34.000
<v Speaker 1>where tech is at its most fascinating. This week it's

0:00:34.040 --> 0:00:40.640
<v Speaker 1>Hany Farid. He's a professor of electrical engineering and

0:00:40.680 --> 0:00:45.040
<v Speaker 1>computer science at the University of California, Berkeley, with a

0:00:45.120 --> 0:00:51.640
<v Speaker 1>CSI-sounding specialization: digital forensics. His focus is on image

0:00:51.640 --> 0:00:55.480
<v Speaker 1>analysis and human perception, so he's the guy you call

0:00:55.920 --> 0:00:58.000
<v Speaker 1>when you need to know whether or not you're confronting

0:00:58.040 --> 0:01:01.720
<v Speaker 1>a deepfake, and many do. He's constantly talking to

0:01:01.840 --> 0:01:05.760
<v Speaker 1>journalists to help them determine what's real and what's fake online.

0:01:06.440 --> 0:01:08.959
<v Speaker 1>In his lab at UC Berkeley, he and his students

0:01:09.000 --> 0:01:13.319
<v Speaker 1>study the various ways misinformation is created and spread and

0:01:13.360 --> 0:01:15.480
<v Speaker 1>how it erodes trust in our institutions.

0:01:16.200 --> 0:01:16.880
<v Speaker 1>And one more thing.

0:01:17.440 --> 0:01:20.200
<v Speaker 1>Farid is the founder and chief science officer of

0:01:20.240 --> 0:01:24.320
<v Speaker 1>GetReal Labs, where he consults with businesses, news organizations,

0:01:24.360 --> 0:01:28.760
<v Speaker 1>and law enforcement to authenticate digital content. You might be

0:01:28.760 --> 0:01:31.600
<v Speaker 1>wondering how Farid got into this field. If so,

0:01:32.080 --> 0:01:32.840
<v Speaker 1>you're not alone.

0:01:33.120 --> 0:01:34.800
<v Speaker 2>Somebody said to me the other day, Oh, you were

0:01:34.800 --> 0:01:37.120
<v Speaker 2>so prescient. I'm like, no, we weren't. We were just

0:01:37.160 --> 0:01:37.960
<v Speaker 2>screwing around.

0:01:38.520 --> 0:01:42.920
<v Speaker 1>Farid first started pondering the implications of digital images back

0:01:42.920 --> 0:01:43.960
<v Speaker 1>in nineteen ninety seven.

0:01:44.280 --> 0:01:47.680
<v Speaker 2>This is really pre-digital, almost. Film was still the

0:01:47.720 --> 0:01:50.080
<v Speaker 2>dominant source of media that we took photographs on. The

0:01:50.120 --> 0:01:53.280
<v Speaker 2>Internet was nothing, right? There was no social media, and

0:01:53.360 --> 0:01:56.440
<v Speaker 2>everything was very nascent. You could see the trends, you

0:01:56.520 --> 0:01:59.240
<v Speaker 2>knew things. Something was bubbling up with the Internet and

0:01:59.240 --> 0:02:02.400
<v Speaker 2>with digital technology.</v> <v Speaker 1>Farid was a postdoc at the time.</v>

0:02:02.720 --> 0:02:05.840
<v Speaker 2>I was at the library getting a book, which now

0:02:05.960 --> 0:02:09.720
<v Speaker 2>just seems quaint, and I was waiting in line, and

0:02:09.760 --> 0:02:11.679
<v Speaker 2>there was a return cart, and on the return cart

0:02:11.800 --> 0:02:14.320
<v Speaker 2>was a big book called the Federal Rules of Evidence.

0:02:14.919 --> 0:02:17.200
<v Speaker 2>I'm not a legal scholar, I'm not a lawyer, but

0:02:17.240 --> 0:02:19.040
<v Speaker 2>I was bored and I flipped it open to a

0:02:19.120 --> 0:02:23.160
<v Speaker 2>random page and it was titled Introducing Photographs into Evidence

0:02:23.160 --> 0:02:25.760
<v Speaker 2>in a Court of Law. And I liked taking photographs.

0:02:25.760 --> 0:02:27.960
<v Speaker 2>I was working with digital images, but nothing to do

0:02:28.000 --> 0:02:29.679
<v Speaker 2>with this topic, and I thought, I wonder what the

0:02:29.760 --> 0:02:32.040
<v Speaker 2>rules are, and so I read it and there was

0:02:32.040 --> 0:02:35.640
<v Speaker 2>almost a footnote that said, there's this digital format and

0:02:35.720 --> 0:02:39.040
<v Speaker 2>we're going to treat digital the same way we treat analog.

0:02:40.080 --> 0:02:42.400
<v Speaker 2>And I just remember thinking, I don't know anything, but

0:02:42.520 --> 0:02:43.919
<v Speaker 2>that seems like a bad idea.

0:02:44.600 --> 0:02:47.560
<v Speaker 1>This passage really stuck with him, and for years he

0:02:47.600 --> 0:02:51.040
<v Speaker 1>couldn't stop thinking about the implications of a digital world,

0:02:51.760 --> 0:02:54.880
<v Speaker 1>the fact that digital manipulation would change our perception of

0:02:54.880 --> 0:03:00.320
<v Speaker 1>what's real because the photographic medium had fundamentally shifted. What

0:03:00.440 --> 0:03:03.600
<v Speaker 1>surprised him was that few others were taking note.

0:03:04.000 --> 0:03:07.040
<v Speaker 2>It's really unusual in an academic life where you start

0:03:07.080 --> 0:03:09.080
<v Speaker 2>thinking about a problem and you go into the academic

0:03:09.160 --> 0:03:13.000
<v Speaker 2>literature and there is nothing. It was just crickets, because

0:03:13.000 --> 0:03:15.440
<v Speaker 2>there was no reason to be thinking about the problem.

0:03:15.960 --> 0:03:19.160
<v Speaker 1>Two years later, as a professor of computer science at Dartmouth,

0:03:19.600 --> 0:03:22.799
<v Speaker 1>he was playing around in Photoshop, creating a comic image

0:03:22.800 --> 0:03:25.440
<v Speaker 1>of his friend when he had an epiphany.

0:03:26.040 --> 0:03:31.079
<v Speaker 2>Mathematically, I just did something very interesting. I introduced pixels

0:03:31.320 --> 0:03:35.240
<v Speaker 2>that have been synthesized by Photoshop to make the image bigger, right,

0:03:35.280 --> 0:03:37.920
<v Speaker 2>because they didn't exist, and I remember thinking, oh, I

0:03:37.920 --> 0:03:39.160
<v Speaker 2>should be able to detect that.

0:03:40.240 --> 0:03:43.880
<v Speaker 1>In that moment, he started writing code and actually developed

0:03:43.920 --> 0:03:48.120
<v Speaker 1>programs to detect digital manipulation. The world woke up to

0:03:48.120 --> 0:03:50.240
<v Speaker 1>the importance of this work, and he started getting asked

0:03:50.280 --> 0:03:53.120
<v Speaker 1>to chime in on serious cases for the Associated Press,

0:03:53.440 --> 0:03:55.800
<v Speaker 1>for law enforcement, for national security.

0:03:56.240 --> 0:04:00.400
<v Speaker 2>And then twenty fifteen, sixteen, seventeen, AI hit and the

0:04:00.400 --> 0:04:03.920
<v Speaker 2>world exploded. But it exploded for a few reasons, because, one,

0:04:04.320 --> 0:04:06.680
<v Speaker 2>at least with Photoshop, there was a barrier to entry.

0:04:06.920 --> 0:04:09.160
<v Speaker 2>You had to actually know how to use Photoshop. But

0:04:09.200 --> 0:04:11.400
<v Speaker 2>then when AI came around, you just go to ChatGPT

0:04:11.480 --> 0:04:13.800
<v Speaker 2>and type, give me an image of X, right, and

0:04:13.840 --> 0:04:15.520
<v Speaker 2>give me an image of Y, give me a video

0:04:15.520 --> 0:04:16.960
<v Speaker 2>of this, give me an audio of this. And so

0:04:17.000 --> 0:04:21.080
<v Speaker 2>suddenly there's no barrier to entry. But more importantly, social

0:04:21.120 --> 0:04:24.600
<v Speaker 2>media dominates the landscape. We went from a few million

0:04:24.720 --> 0:04:28.000
<v Speaker 2>users to a few billion users, and so now not

0:04:28.080 --> 0:04:31.719
<v Speaker 2>only could people easily with no barrier to entry, create

0:04:32.120 --> 0:04:36.400
<v Speaker 2>fake content, they could distribute it to the masses and

0:04:37.160 --> 0:04:42.080
<v Speaker 2>it gets amplified because the algorithms amplify the most outrageous things.

0:04:42.440 --> 0:04:46.280
<v Speaker 2>People want things that conform to their worldview. We are hyperpartisan,

0:04:46.400 --> 0:04:51.480
<v Speaker 2>both here and abroad, and that was the perfect storm: create, distribute, amplify,

0:04:51.839 --> 0:04:56.040
<v Speaker 2>rinse and repeat. And so now through the AI revolution,

0:04:56.680 --> 0:04:58.560
<v Speaker 2>it's bizarre what's happening.

0:04:59.120 --> 0:05:01.520
<v Speaker 1>We'll dive into the work Farid does on deepfakes

0:05:01.520 --> 0:05:03.960
<v Speaker 1>in a bit, but first I had to ask him

0:05:04.080 --> 0:05:10.960
<v Speaker 1>about something seemingly completely unrelated: deathbots. So you were

0:05:11.040 --> 0:05:14.359
<v Speaker 1>quoted in this Atlantic article about deathbots with the

0:05:14.400 --> 0:05:18.440
<v Speaker 1>headline "No One Is Ready for Digital Immortality." So

0:05:18.560 --> 0:05:20.560
<v Speaker 1>it'll be good to define our terms. Like, what do

0:05:20.600 --> 0:05:23.080
<v Speaker 1>we mean by this idea of digital immortality?

0:05:23.720 --> 0:05:26.080
<v Speaker 2>Yeah, I don't know that it's a well established term.

0:05:26.120 --> 0:05:30.320
<v Speaker 2>But here's my definition: that your likeness, the way

0:05:30.360 --> 0:05:32.719
<v Speaker 2>you think, the way you talk, the way you look,

0:05:33.200 --> 0:05:37.000
<v Speaker 2>lives on in eternity in a digital form through a

0:05:37.120 --> 0:05:41.600
<v Speaker 2>version of AI that embodies how I write, how I think,

0:05:41.720 --> 0:05:44.800
<v Speaker 2>how I talk in order to interact with other people.

0:05:45.000 --> 0:05:47.359
<v Speaker 2>It's interactive, that's the key, but it's dynamic.

0:05:47.680 --> 0:05:50.120
<v Speaker 1>What got you interested in this topic and why did

0:05:50.160 --> 0:05:51.520
<v Speaker 1>you agree to be a source in the story?

0:05:51.960 --> 0:05:55.359
<v Speaker 2>So this is almost a philosophical and legal question, and

0:05:55.440 --> 0:05:58.320
<v Speaker 2>I'm neither of those things. But I got to say

0:05:58.440 --> 0:06:03.120
<v Speaker 2>I've been thinking a lot about it, technically, personally, philosophically.

0:06:03.880 --> 0:06:06.600
<v Speaker 2>Here's why I've been thinking about it. So one is,

0:06:06.640 --> 0:06:08.840
<v Speaker 2>I'm a professor. I've been a professor for twenty five years.

0:06:08.880 --> 0:06:11.480
<v Speaker 2>I love teaching. I love my students. I hate them

0:06:11.480 --> 0:06:15.000
<v Speaker 2>some days, but I usually love them. They're amazing and

0:06:15.040 --> 0:06:18.200
<v Speaker 2>weird and wonderful in many ways. So is there a

0:06:18.279 --> 0:06:21.880
<v Speaker 2>story here where I can keep teaching after I die?

0:06:22.560 --> 0:06:25.640
<v Speaker 2>Like there's something sort of magical about that. I think

0:06:25.680 --> 0:06:27.480
<v Speaker 2>about it for my parents. Both my parents are now

0:06:27.480 --> 0:06:29.560
<v Speaker 2>in their late eighties. One of them will die first,

0:06:29.600 --> 0:06:31.599
<v Speaker 2>almost certainly, And what does it mean for the one

0:06:31.760 --> 0:06:35.360
<v Speaker 2>who... They've been together for fifty years. So there's parts

0:06:35.400 --> 0:06:38.039
<v Speaker 2>of it where I think this is wonderful, this idea that

0:06:38.200 --> 0:06:40.240
<v Speaker 2>one of my parents can wake up and open up

0:06:40.240 --> 0:06:42.479
<v Speaker 2>their iPad and have a conversation with the person that

0:06:42.480 --> 0:06:45.039
<v Speaker 2>they spent fifty years of their lives with. On the

0:06:45.080 --> 0:06:47.680
<v Speaker 2>other hand, if that happens early in life, is that

0:06:47.720 --> 0:06:50.479
<v Speaker 2>healthy for somebody? If a thirty year old loses their spouse,

0:06:50.560 --> 0:06:53.359
<v Speaker 2>is that good, that they never sort of physically move on?

0:06:53.880 --> 0:06:55.919
<v Speaker 2>I also think about it from a technical perspective, what

0:06:55.960 --> 0:06:58.520
<v Speaker 2>would that look like for somebody who's famous where there's

0:06:58.520 --> 0:07:01.000
<v Speaker 2>a big digital footprint. I think we have all the

0:07:01.040 --> 0:07:04.039
<v Speaker 2>pieces to do that. We have the large language models,

0:07:04.480 --> 0:07:07.919
<v Speaker 2>we have voice, we have likeness, we have video, and

0:07:07.920 --> 0:07:11.040
<v Speaker 2>you're already seeing people do this, creating digital avatars of

0:07:11.160 --> 0:07:13.640
<v Speaker 2>both people who are with us and not with us,

0:07:13.960 --> 0:07:16.120
<v Speaker 2>so that you can interact with them. I can go

0:07:16.240 --> 0:07:18.440
<v Speaker 2>scrape every single piece of writing that Martin Luther King

0:07:18.520 --> 0:07:20.600
<v Speaker 2>Junior wrote. I can grab his speeches, I can grab

0:07:20.640 --> 0:07:22.200
<v Speaker 2>his likeness, I can grab his voice, and I could

0:07:22.240 --> 0:07:24.520
<v Speaker 2>create an avatar of him that I could interact with.

0:07:24.760 --> 0:07:27.200
<v Speaker 1>Well, it reminds me of your work on deepfakes

0:07:27.240 --> 0:07:29.840
<v Speaker 1>in some sense, because, as you said, exactly, all the

0:07:29.840 --> 0:07:33.800
<v Speaker 1>pieces are there technically and otherwise. Yeah, but society's clearly

0:07:33.800 --> 0:07:34.239
<v Speaker 1>not ready.

0:07:34.880 --> 0:07:37.040
<v Speaker 2>I don't think we're ready. But look, a lot of things,

0:07:37.320 --> 0:07:39.720
<v Speaker 2>if you look at the last two, three, four or

0:07:39.760 --> 0:07:42.880
<v Speaker 2>five decades of technology, we weren't ready for, and we

0:07:42.920 --> 0:07:43.720
<v Speaker 2>became ready for it.

0:07:43.800 --> 0:07:43.960
<v Speaker 1>Right.

0:07:44.320 --> 0:07:46.400
<v Speaker 2>Look, you can go back to in vitro fertilization. When

0:07:46.400 --> 0:07:49.840
<v Speaker 2>it first started, people were freaked out by that. Completely

0:07:50.000 --> 0:07:52.160
<v Speaker 2>normal now, right? And by the way, this could also

0:07:52.200 --> 0:07:55.200
<v Speaker 2>be generational. I can imagine some of my students here

0:07:55.200 --> 0:07:59.400
<v Speaker 2>at UC Berkeley think, sure, who cares, right? And I'm

0:07:59.440 --> 0:08:01.000
<v Speaker 2>an older guy and I'm like, ah, that seems a

0:08:01.000 --> 0:08:03.560
<v Speaker 2>little weird. So this may just go away generationally, which

0:08:03.600 --> 0:08:04.600
<v Speaker 2>is usually how this happens.

0:08:04.640 --> 0:08:07.160
<v Speaker 1>By the way, do you think we'll see a fundamental

0:08:07.400 --> 0:08:09.440
<v Speaker 1>shift in our society in this case in terms of

0:08:10.400 --> 0:08:11.640
<v Speaker 1>how we think about death?

0:08:12.560 --> 0:08:16.920
<v Speaker 2>I think this idea of a digital immortality is really profound.

0:08:17.600 --> 0:08:21.200
<v Speaker 2>And look, I don't know where this AI revolution is

0:08:21.200 --> 0:08:24.120
<v Speaker 2>going right now. I don't think anybody really does. But

0:08:24.200 --> 0:08:27.800
<v Speaker 2>something is happening. There is something here that is quite dramatic.

0:08:28.240 --> 0:08:30.320
<v Speaker 2>I think it's going to reshape society. I think it's

0:08:30.360 --> 0:08:32.520
<v Speaker 2>going to reshape education. I think it's going to reshape

0:08:32.520 --> 0:08:34.800
<v Speaker 2>the workforce. I think it's going to reshape a lot

0:08:34.800 --> 0:08:37.920
<v Speaker 2>of things. And I do think your likeness or your

0:08:37.920 --> 0:08:40.080
<v Speaker 2>being or your essence or whatever you want to call

0:08:40.120 --> 0:08:42.800
<v Speaker 2>that can live on and you can interact with people.

0:08:43.000 --> 0:08:45.880
<v Speaker 2>You can continue to have a podcast after you die,

0:08:46.320 --> 0:08:47.840
<v Speaker 2>you can keep interviewing people.

0:08:49.320 --> 0:08:53.280
<v Speaker 1>When we come back: how deepfakes impact everyone, even

0:08:53.320 --> 0:09:02.000
<v Speaker 1>if you don't know it. There's an interesting point of

0:09:02.040 --> 0:09:06.280
<v Speaker 1>intersection between deathbots and your more core field

0:09:06.280 --> 0:09:10.240
<v Speaker 1>of study, and that's this Indian politician, a parliamentary candidate,

0:09:10.280 --> 0:09:14.760
<v Speaker 1>who created a video of his deceased father endorsing him

0:09:14.760 --> 0:09:15.760
<v Speaker 1>as his rightful heir.

0:09:16.080 --> 0:09:17.440
<v Speaker 2>Yeah.

0:09:17.520 --> 0:09:19.240
<v Speaker 1>I mean, this is kind of a worlds-collide moment

0:09:19.280 --> 0:09:22.599
<v Speaker 1>between misinformation, deepfakes, and digital immortality.

0:09:22.840 --> 0:09:25.640
<v Speaker 2>Yeah. Yeah, So for people who didn't see it, India

0:09:25.679 --> 0:09:28.520
<v Speaker 2>had an election this year, big one and you know,

0:09:29.200 --> 0:09:33.480
<v Speaker 2>billion plus people voting. It was chaotic, and a politician

0:09:33.480 --> 0:09:36.320
<v Speaker 2>did exactly this. His father was a well known politician,

0:09:36.360 --> 0:09:38.600
<v Speaker 2>and he created a digital recreation with his voice and

0:09:38.600 --> 0:09:40.480
<v Speaker 2>his likeness and he was talking and endorsing his son.

0:09:41.040 --> 0:09:43.520
<v Speaker 2>So I have a couple of thoughts on that. Right now,

0:09:43.559 --> 0:09:46.559
<v Speaker 2>in this particular moment, as we're still grappling, I think

0:09:46.559 --> 0:09:50.080
<v Speaker 2>there should be two rules, which are consent and disclosure.

0:09:51.240 --> 0:09:52.920
<v Speaker 2>And it's really simple, like, if you're going to use

0:09:52.920 --> 0:09:55.840
<v Speaker 2>somebody's likeness, you should have consent, and if you're going

0:09:55.880 --> 0:09:59.000
<v Speaker 2>to distribute it, you should have disclosure. Now, consent is

0:09:59.000 --> 0:10:01.520
<v Speaker 2>difficult when somebody is dead. But if I want to

0:10:01.520 --> 0:10:04.880
<v Speaker 2>get an endorsement from somebody who's living, I have to

0:10:04.880 --> 0:10:08.280
<v Speaker 2>get their consent. Yep. And if I distribute that, it

0:10:08.320 --> 0:10:11.880
<v Speaker 2>has to be very clearly labeled and disclosed as this

0:10:11.920 --> 0:10:14.640
<v Speaker 2>is AI generated. I'll give you a really nice example

0:10:14.679 --> 0:10:15.880
<v Speaker 2>of this where it was sort of cool. It was

0:10:15.960 --> 0:10:19.800
<v Speaker 2>during the Olympics. One of the newscasters, well known and

0:10:20.000 --> 0:10:22.559
<v Speaker 2>I'm just blanking on his name right now, was creating

0:10:22.600 --> 0:10:26.080
<v Speaker 2>AI generated personalized summaries. So my wife was watching the

0:10:26.080 --> 0:10:28.880
<v Speaker 2>Olympics and she would get these personalized summaries from the broadcaster.

0:10:29.600 --> 0:10:32.240
<v Speaker 2>So the content was personalized to her based on what

0:10:32.320 --> 0:10:36.079
<v Speaker 2>she was watching. And then the voice being generated was his,

0:10:36.920 --> 0:10:39.760
<v Speaker 2>and the script was being AI generated. Everything was with

0:10:39.840 --> 0:10:41.920
<v Speaker 2>his permission, and it was disclosed to her that it

0:10:41.960 --> 0:10:44.640
<v Speaker 2>was AI generated and summarized. And I think that was

0:10:44.679 --> 0:10:47.960
<v Speaker 2>really well done in terms of the things that were

0:10:48.000 --> 0:10:50.240
<v Speaker 2>made clear of what you were getting and how it

0:10:50.280 --> 0:10:51.200
<v Speaker 2>was being delivered to you.

0:10:52.120 --> 0:10:54.840
<v Speaker 1>That's sort of a high-water mark for how this stuff works

0:10:54.840 --> 0:10:57.040
<v Speaker 1>when it works well. Yeah. Do you think as a

0:10:57.040 --> 0:11:00.920
<v Speaker 1>society we're more likely to move toward that high water

0:11:01.040 --> 0:11:06.079
<v Speaker 1>mark through collective demand or through regulation or through some

0:11:06.120 --> 0:11:08.600
<v Speaker 1>decision from the tech overlords? Like, what gets us there?

0:11:08.679 --> 0:11:12.000
<v Speaker 2>More broadly, yeah. I mean, there's nothing in the last

0:11:12.040 --> 0:11:14.880
<v Speaker 2>twenty years or twenty five years that gives me confidence

0:11:14.960 --> 0:11:16.800
<v Speaker 2>that our tech overlords are going to do the right thing.

0:11:17.520 --> 0:11:20.120
<v Speaker 2>They're going to do the thing that maximizes their profits.

0:11:20.120 --> 0:11:23.040
<v Speaker 2>And we know this. Let's stop pretending otherwise that Silicon

0:11:23.120 --> 0:11:24.880
<v Speaker 2>Valley is anything other than what it is. It's a modern

0:11:24.960 --> 0:11:27.480
<v Speaker 2>day Wall Street in some ways, by the way, even

0:11:27.559 --> 0:11:31.480
<v Speaker 2>more powerful, right, because they control information, not just money,

0:11:31.559 --> 0:11:34.800
<v Speaker 2>and that arguably is much more powerful. I don't think

0:11:34.840 --> 0:11:38.319
<v Speaker 2>this comes from consumers, because we're not customers, we're the product.

0:11:39.160 --> 0:11:42.480
<v Speaker 2>We as users, I should say, have almost no power

0:11:42.520 --> 0:11:45.920
<v Speaker 2>at all. And so the media we tried, right, we

0:11:45.960 --> 0:11:49.040
<v Speaker 2>tried criticizing and embarrassing, and we tried dragging them in

0:11:49.040 --> 0:11:53.840
<v Speaker 2>front of Congress. Nothing effects change. So what does? Good regulation.

0:11:54.320 --> 0:11:56.560
<v Speaker 2>We've got to put guardrails on this. And look, there's

0:11:56.720 --> 0:12:00.120
<v Speaker 2>nothing, there is nothing in our physical world that

0:12:00.160 --> 0:12:04.400
<v Speaker 2>is not subject to regulation to make products safe and reasonable.

0:12:04.679 --> 0:12:07.240
<v Speaker 2>But somehow we've abandoned that for the last twenty five

0:12:07.320 --> 0:12:09.920
<v Speaker 2>years because it's the Internet. So I do think it's

0:12:09.960 --> 0:12:11.400
<v Speaker 2>going to have to come... I don't think it's going

0:12:11.480 --> 0:12:14.040
<v Speaker 2>to come from the US. It is coming from the UK.

0:12:14.200 --> 0:12:16.720
<v Speaker 2>It is coming from the EU, it is coming from Australia,

0:12:17.120 --> 0:12:19.160
<v Speaker 2>and I think those are going to be the leaders

0:12:19.160 --> 0:12:21.160
<v Speaker 2>in this space. And you saw this with GDPR with

0:12:21.200 --> 0:12:24.040
<v Speaker 2>the privacy rules, in many ways. I don't think

0:12:24.040 --> 0:12:26.160
<v Speaker 2>it solved the privacy problem around the world, but it

0:12:26.240 --> 0:12:29.160
<v Speaker 2>moved the needle on the problem. And the EU and

0:12:29.200 --> 0:12:32.280
<v Speaker 2>the UK have moved very aggressively on AI safety, on

0:12:32.360 --> 0:12:36.439
<v Speaker 2>digital safety, and on misuse of monopolies, and I think

0:12:36.440 --> 0:12:38.280
<v Speaker 2>it's going to have to come at that level.

0:12:38.840 --> 0:12:40.880
<v Speaker 1>I want to talk about some of the more personal

0:12:40.960 --> 0:12:44.040
<v Speaker 1>ways in which we can experience deepfakes. I think

0:12:44.040 --> 0:12:47.040
<v Speaker 1>a lot of people think maybe it only touches politicians

0:12:47.120 --> 0:12:50.120
<v Speaker 1>or celebrities. But there was an NPR story about a

0:12:50.160 --> 0:12:54.280
<v Speaker 1>case you worked on that involved a Baltimore teacher. Can

0:12:54.320 --> 0:12:55.440
<v Speaker 1>you talk about what happened there?

0:12:55.960 --> 0:12:58.640
<v Speaker 2>This case I'm fascinated by, and I still don't

0:12:58.640 --> 0:13:00.199
<v Speaker 2>think we've gotten to the end of it. Tell you,

0:13:00.240 --> 0:13:03.319
<v Speaker 2>first of all, your listeners, what the case is. Baltimore

0:13:03.320 --> 0:13:06.520
<v Speaker 2>public school, audio of the principal saying things that were

0:13:06.640 --> 0:13:10.920
<v Speaker 2>racist was leaked, and it was leaked to some news outlet,

0:13:11.400 --> 0:13:13.360
<v Speaker 2>and it was bad and it was if you listen

0:13:13.440 --> 0:13:17.040
<v Speaker 2>to it, it's pretty bad, and the principal said, this

0:13:17.160 --> 0:13:21.960
<v Speaker 2>isn't me, this is AI generated. And we analyzed the audio.

0:13:22.120 --> 0:13:26.679
<v Speaker 2>Several labs analyzed the audio. There is alteration to the audio.

0:13:26.840 --> 0:13:29.800
<v Speaker 2>That is, we can hear and see that it's been

0:13:29.880 --> 0:13:34.440
<v Speaker 2>spliced together five or six segments, but when we analyze

0:13:34.440 --> 0:13:37.560
<v Speaker 2>the individual segments, it is not one hundred percent clear

0:13:37.600 --> 0:13:39.760
<v Speaker 2>to us that it is AI generated. It could be

0:13:40.600 --> 0:13:42.640
<v Speaker 2>that he said these things, but they were sort of

0:13:42.720 --> 0:13:45.160
<v Speaker 2>stitched together in a way that put them out of context,

0:13:45.440 --> 0:13:48.000
<v Speaker 2>which would be deceptive. It could be that it's AI

0:13:48.040 --> 0:13:50.800
<v Speaker 2>generated and our tools simply didn't detect it. It could

0:13:50.840 --> 0:13:52.640
<v Speaker 2>be that this is a case of the liar's dividend,

0:13:52.640 --> 0:13:54.760
<v Speaker 2>where the principal really did say this, but he's claiming

0:13:54.800 --> 0:13:55.520
<v Speaker 2>he didn't say it.

0:13:55.640 --> 0:13:58.520
<v Speaker 1>Hany, can you explain exactly what the liar's dividend is?

0:13:58.960 --> 0:14:01.480
<v Speaker 2>The liar's dividend goes something like this. It says, when

0:14:01.520 --> 0:14:04.840
<v Speaker 2>you live in a world where anything can be manipulated,

0:14:04.880 --> 0:14:06.839
<v Speaker 2>Any image can be fake, any audio can be fake,

0:14:06.880 --> 0:14:09.520
<v Speaker 2>any video can be fake. Nothing has to be real.

0:14:09.840 --> 0:14:13.240
<v Speaker 2>I get to use the fact that fake things exist

0:14:13.280 --> 0:14:16.520
<v Speaker 2>as an excuse for what I've done. But this case

0:14:16.559 --> 0:14:21.960
<v Speaker 2>is a really good example of how dangerous this technology

0:14:22.040 --> 0:14:25.160
<v Speaker 2>is for two reasons. One is, with twenty to thirty

0:14:25.160 --> 0:14:28.920
<v Speaker 2>seconds of your voice, I don't need hours. I can

0:14:28.960 --> 0:14:31.880
<v Speaker 2>clone your voice. I can upload it to an AI

0:14:32.040 --> 0:14:34.200
<v Speaker 2>tool that I use, and then I can type and

0:14:34.240 --> 0:14:37.680
<v Speaker 2>have you say anything I want. That means anybody with

0:14:37.800 --> 0:14:41.480
<v Speaker 2>twenty seconds of their voice available has a vulnerability. So

0:14:41.520 --> 0:14:45.200
<v Speaker 2>this is not for movie stars and podcasters. This is everybody.

0:14:45.360 --> 0:14:49.840
<v Speaker 2>Number one. Number two is anybody who's caught saying or

0:14:49.880 --> 0:14:53.440
<v Speaker 2>doing something that they don't want to take ownership of

0:14:53.560 --> 0:14:56.760
<v Speaker 2>can say it's fake. Yep, the dog ate my homework,

0:14:57.040 --> 0:14:59.840
<v Speaker 2>all right, this is easy. And so both of those

0:14:59.880 --> 0:15:03.720
<v Speaker 2>are problematic because where's our shared sense of reality. It

0:15:03.800 --> 0:15:07.000
<v Speaker 2>used to be when you had images and video, despite

0:15:07.000 --> 0:15:09.280
<v Speaker 2>the fact that there was photoshop, despite the fact that

0:15:09.520 --> 0:15:13.240
<v Speaker 2>Hollywood could manipulate videos, we had a pretty reasonable

0:15:13.240 --> 0:15:16.040
<v Speaker 2>confidence in what we read and saw and heard. And

0:15:16.080 --> 0:15:18.880
<v Speaker 2>you can't say that anymore. This is why I spend

0:15:18.960 --> 0:15:21.640
<v Speaker 2>so much time talking to journalists and fact checkers and

0:15:21.760 --> 0:15:25.560
<v Speaker 2>lawyers and law enforcement. So on this particular case, it

0:15:25.640 --> 0:15:28.640
<v Speaker 2>really showed how this has trickled all the way down

0:15:28.920 --> 0:15:30.359
<v Speaker 2>to high school teachers.

0:15:30.600 --> 0:15:33.800
<v Speaker 1>Zooming out from the individual to the collective, one of

0:15:33.840 --> 0:15:37.480
<v Speaker 1>the interesting things that happens is whenever there's like a

0:15:37.520 --> 0:15:40.720
<v Speaker 1>world event that everyone's paying attention to, you get this

0:15:40.840 --> 0:15:44.280
<v Speaker 1>fire hose of fake images. I remember in the early

0:15:44.400 --> 0:15:47.720
<v Speaker 1>days of the conflict in Gaza, there was this aerial

0:15:47.840 --> 0:15:52.160
<v Speaker 1>image with what was supposed to be Palestinian tents making

0:15:52.200 --> 0:15:55.520
<v Speaker 1>the words "help us," or, you know, right after

0:15:55.560 --> 0:15:58.280
<v Speaker 1>the LA fires began, there were these images of the

0:15:58.320 --> 0:16:01.280
<v Speaker 1>Hollywood Sign on fire. I don't know how many people

0:16:01.320 --> 0:16:04.880
<v Speaker 1>believe these images were actually true or in some ways,

0:16:04.880 --> 0:16:07.720
<v Speaker 1>what the harm is if they did. But what's going

0:16:07.760 --> 0:16:08.160
<v Speaker 1>on here?

0:16:08.880 --> 0:16:10.960
<v Speaker 2>So let's start with the LA fires. First of all,

0:16:11.200 --> 0:16:15.480
<v Speaker 2>many images coming out of those fires were fake. What's

0:16:15.520 --> 0:16:18.200
<v Speaker 2>the harm? Well, this one's easy. If people believe there's

0:16:18.240 --> 0:16:21.600
<v Speaker 2>fire in this neighborhood, that is very bad. Fire departments

0:16:21.600 --> 0:16:23.440
<v Speaker 2>are going to get distracted. First responders are going to

0:16:23.440 --> 0:16:26.000
<v Speaker 2>get distracted. People are scared that their neighborhood is on fire.

0:16:26.000 --> 0:16:27.920
<v Speaker 2>They're going to get distracted. So I do think there

0:16:27.960 --> 0:16:30.480
<v Speaker 2>is real harm. I think in the Gaza images also,

0:16:30.920 --> 0:16:34.360
<v Speaker 2>this is a complicated conflict, and we are all trying

0:16:34.360 --> 0:16:36.680
<v Speaker 2>to get our heads around this thing and figure it out,

0:16:36.920 --> 0:16:40.880
<v Speaker 2>and meanwhile people are fanning the flames, trying to push

0:16:40.880 --> 0:16:43.240
<v Speaker 2>a particular narrative on either side, and I don't think

0:16:43.280 --> 0:16:47.960
<v Speaker 2>that's healthy. Look, we can have serious discussions about how

0:16:48.000 --> 0:16:50.680
<v Speaker 2>to combat climate change, we can have serious discussions about

0:16:50.720 --> 0:16:53.640
<v Speaker 2>how to resolve the Israeli Palestinian conflict. We can have

0:16:53.680 --> 0:16:56.640
<v Speaker 2>serious discussions about a lot of things, but we've got

0:16:56.680 --> 0:16:59.080
<v Speaker 2>to start with a set of facts. And when you

0:16:59.120 --> 0:17:03.160
<v Speaker 2>pollute the entire information ecosystem, we are at a loss.

0:17:03.560 --> 0:17:05.920
<v Speaker 2>You could say, okay, well, somebody believed the fake image

0:17:05.920 --> 0:17:08.960
<v Speaker 2>of the tents. Okay, who cares? But here's why you care.

0:17:09.440 --> 0:17:11.840
<v Speaker 2>Because then when the real images come out showing human

0:17:11.920 --> 0:17:16.360
<v Speaker 2>rights violations, showing people being killed, people being bombed, how

0:17:16.359 --> 0:17:19.840
<v Speaker 2>do I believe it? When you pollute the information ecosystem,

0:17:20.119 --> 0:17:23.479
<v Speaker 2>everything is in doubt. And suddenly you have people who

0:17:23.520 --> 0:17:26.560
<v Speaker 2>are denying that anybody's died, you have people denying that

0:17:26.600 --> 0:17:28.960
<v Speaker 2>the fires exist, you have people denying that people are

0:17:29.040 --> 0:17:32.480
<v Speaker 2>dying from COVID. Because this is how untrusting we have become,

0:17:33.040 --> 0:17:35.280
<v Speaker 2>and that I have a real problem with, because look,

0:17:35.840 --> 0:17:39.359
<v Speaker 2>no matter what side of the political or ideological aisle

0:17:39.440 --> 0:17:42.360
<v Speaker 2>you are on, can we at least agree that if

0:17:42.359 --> 0:17:45.720
<v Speaker 2>we don't have a shared factual system, a shared sense

0:17:45.720 --> 0:17:49.720
<v Speaker 2>of reality, we do not have a society or democracy?

0:17:49.760 --> 0:17:52.160
<v Speaker 2>We can't be arguing about whether one plus one is two.

0:17:52.480 --> 0:17:54.960
<v Speaker 2>And I would argue that this problem started well before

0:17:55.080 --> 0:17:58.440
<v Speaker 2>deep fakes. Social media is the one that is amplifying

0:17:58.920 --> 0:18:04.160
<v Speaker 2>and encouraging this type of behavior because it engages users,

0:18:04.520 --> 0:18:08.400
<v Speaker 2>drives ads, drives attention, drives profits. The problem is not

0:18:08.480 --> 0:18:11.800
<v Speaker 2>just the creation side, it's the distribution side, and that,

0:18:12.040 --> 0:18:14.359
<v Speaker 2>I would argue, is the bigger problem here than the

0:18:14.400 --> 0:18:14.880
<v Speaker 2>deep fake.

0:18:16.920 --> 0:18:19.679
<v Speaker 1>Coming up, Hany Farid on what it takes to

0:18:19.720 --> 0:18:29.639
<v Speaker 1>identify a deep fake. Stay with us. When we first spoke,

0:18:29.720 --> 0:18:33.240
<v Speaker 1>it was just five years ago in twenty nineteen. The

0:18:33.280 --> 0:18:36.760
<v Speaker 1>big question at the time was is there going to

0:18:36.760 --> 0:18:42.919
<v Speaker 1>be a causal piece of fake media that measurably sways

0:18:43.040 --> 0:18:46.399
<v Speaker 1>the outcome of an election? And some people say the

0:18:46.400 --> 0:18:47.920
<v Speaker 1>answer to that is no. I mean, the New Yorker did

0:18:47.920 --> 0:18:51.399
<v Speaker 1>a piece in twenty twenty three saying basically, you know, that

0:18:51.440 --> 0:18:55.199
<v Speaker 1>the deep fakes haven't had that cataclysmic effect that some

0:18:55.240 --> 0:18:58.920
<v Speaker 1>people thought they would. The Atlantic ran a story recently

0:18:59.000 --> 0:19:01.919
<v Speaker 1>under the headline AI's fingerprints were all over the election,

0:19:02.040 --> 0:19:05.480
<v Speaker 1>but deep fakes and misinformation weren't the main issue, and

0:19:05.720 --> 0:19:08.359
<v Speaker 1>kind of the point of both pieces was that

0:19:08.840 --> 0:19:10.919
<v Speaker 1>what deep fakes are really being used for is to

0:19:10.960 --> 0:19:14.439
<v Speaker 1>create memes and satire rather than to directly trick people.

0:19:14.960 --> 0:19:17.600
<v Speaker 1>And the second point was, quote, to growing numbers of people,

0:19:18.200 --> 0:19:21.040
<v Speaker 1>everything is fake now except what they know or rather feel.

0:19:21.600 --> 0:19:21.959
<v Speaker 2>Yeah.

0:19:22.000 --> 0:19:27.600
<v Speaker 1>So has this been less explosively destructive than people thought

0:19:27.640 --> 0:19:29.560
<v Speaker 1>it would be? Or are the New Yorker and the

0:19:29.560 --> 0:19:31.520
<v Speaker 1>Atlantic slightly missing the point in your view?

0:19:32.359 --> 0:19:35.720
<v Speaker 2>I agree and disagree with them. I agree that there

0:19:35.760 --> 0:19:39.760
<v Speaker 2>was no single atomic bomb that got dropped, and that

0:19:39.800 --> 0:19:41.560
<v Speaker 2>you can draw a line from A to B saying

0:19:41.560 --> 0:19:44.400
<v Speaker 2>this changed an election. But nobody thought that was going

0:19:44.480 --> 0:19:45.879
<v Speaker 2>to be the case. So I think that's a little

0:19:45.880 --> 0:19:47.320
<v Speaker 2>bit of a straw man argument.

0:19:47.640 --> 0:19:47.800
<v Speaker 1>Right.

0:19:47.880 --> 0:19:50.680
<v Speaker 2>Okay, here's the other reason I disagree. Go talk to

0:19:50.720 --> 0:19:53.480
<v Speaker 2>the people in Slovakia, because what they will tell you

0:19:54.320 --> 0:19:58.320
<v Speaker 2>is that forty eight hours before the election, there were two candidates,

0:19:58.359 --> 0:20:00.600
<v Speaker 2>a pro NATO and a pro Putin candidate, and the pro

0:20:00.720 --> 0:20:04.199
<v Speaker 2>NATO candidate was up four points. A deep fake of

0:20:04.240 --> 0:20:06.320
<v Speaker 2>the pro NATO candidate was released saying we're going to

0:20:06.400 --> 0:20:09.560
<v Speaker 2>rig the election, and two days later the pro Putin

0:20:09.640 --> 0:20:12.800
<v Speaker 2>candidate won by four points. There was an eight point

0:20:12.920 --> 0:20:16.400
<v Speaker 2>swing in the polls in forty eight hours. Now, were

0:20:16.440 --> 0:20:19.119
<v Speaker 2>the polls wrong? Possibly. Did it have anything to do

0:20:19.200 --> 0:20:22.480
<v Speaker 2>with the deep fake? Don't know, but this could have

0:20:22.600 --> 0:20:25.160
<v Speaker 2>been the first example, just a couple of years ago,

0:20:25.600 --> 0:20:28.520
<v Speaker 2>of where a deep fake was a tipping point. So

0:20:29.000 --> 0:20:31.560
<v Speaker 2>I'm not sure I'd buy that story. I think this

0:20:31.640 --> 0:20:34.880
<v Speaker 2>is more about death by a thousand cuts than by

0:20:35.119 --> 0:20:38.800
<v Speaker 2>dropping an atomic bomb. I think that when you keep

0:20:38.840 --> 0:20:44.040
<v Speaker 2>polluting the information ecosystem, everybody loses trust because you don't

0:20:44.119 --> 0:20:46.120
<v Speaker 2>trust NPR, you don't trust the New York Times, you don't

0:20:46.119 --> 0:20:48.320
<v Speaker 2>trust CNN. Who do you trust? Well, you trust

0:20:48.320 --> 0:20:50.879
<v Speaker 2>the guy who's yelling at you telling you what to believe, right,

0:20:50.880 --> 0:20:53.680
<v Speaker 2>because you've sort of given up. Yeah. And I would

0:20:53.720 --> 0:20:56.720
<v Speaker 2>say that, you know, fundamentally, is that a deep

0:20:56.720 --> 0:20:59.280
<v Speaker 2>fake problem? No, I think that's a social media problem.

0:20:59.560 --> 0:21:01.840
<v Speaker 2>I think that's a traditional media problem. I think that's a

0:21:01.880 --> 0:21:05.240
<v Speaker 2>polarization problem. I think it's the nature of politics today,

0:21:05.240 --> 0:21:07.600
<v Speaker 2>both here and abroad, because we have politicians who are

0:21:07.600 --> 0:21:10.480
<v Speaker 2>just outright lying to us now. So I do think

0:21:10.520 --> 0:21:14.000
<v Speaker 2>that can you point specifically to deep fakes? No, but

0:21:14.200 --> 0:21:16.199
<v Speaker 2>I do think it was an accelerant. I do think

0:21:16.240 --> 0:21:20.840
<v Speaker 2>it contributed to our general distrust and then our inability

0:21:21.240 --> 0:21:24.800
<v Speaker 2>to hear things that go against our worldview, and I

0:21:24.840 --> 0:21:28.040
<v Speaker 2>do think that that effected change. I do think you

0:21:28.200 --> 0:21:31.920
<v Speaker 2>can't look at the landscape of what Facebook and Twitter

0:21:32.080 --> 0:21:35.200
<v Speaker 2>and YouTube and TikTok, how they control the information ecosystem

0:21:35.240 --> 0:21:38.359
<v Speaker 2>for the vast majority of Americans, how they have promoted

0:21:38.520 --> 0:21:42.479
<v Speaker 2>false information, both traditionally false and deep fake false. You

0:21:42.520 --> 0:21:44.040
<v Speaker 2>can't look at that and say that has had no

0:21:44.160 --> 0:21:47.520
<v Speaker 2>impact on the way we think. I think that's probably wrong.

0:21:47.800 --> 0:21:50.320
<v Speaker 1>So you mentioned you've been at this for some time

0:21:50.480 --> 0:21:55.760
<v Speaker 1>since opening that legal textbook all those years ago. Could

0:21:55.800 --> 0:22:00.399
<v Speaker 1>you have imagined how much trust in society has eroded?

0:22:00.520 --> 0:22:02.440
<v Speaker 1>And where did you see it kind of happening all

0:22:02.440 --> 0:22:05.400
<v Speaker 1>the way?</v> <v Speaker 2>So the answer is no, I didn't see</v>

0:22:05.400 --> 0:22:08.040
<v Speaker 2>this coming. And in the early days, the liar's dividend,

0:22:08.119 --> 0:22:10.840
<v Speaker 2>it didn't exist. When there was film

0:22:10.960 --> 0:22:13.680
<v Speaker 2>and audio of you saying and doing something, nobody said

0:22:13.680 --> 0:22:15.000
<v Speaker 2>it was fake. And by the way, here's how you

0:22:15.040 --> 0:22:18.840
<v Speaker 2>know I'm right. Go back to twenty sixteen. Then

0:22:18.920 --> 0:22:21.960
<v Speaker 2>candidate Trump got caught on the Access Hollywood tape

0:22:22.400 --> 0:22:25.439
<v Speaker 2>saying, what, that he grabs women in places that I

0:22:25.480 --> 0:22:28.200
<v Speaker 2>won't mention on this podcast. And when he got called

0:22:28.240 --> 0:22:31.080
<v Speaker 2>on it, he didn't say it was fake. He apologized.

0:22:31.480 --> 0:22:33.879
<v Speaker 2>Three months later, when he was now in office, he

0:22:34.000 --> 0:22:36.520
<v Speaker 2>said it was fake. That was the moment when I

0:22:36.520 --> 0:22:39.479
<v Speaker 2>realized this was a real thing. So it was actually

0:22:39.520 --> 0:22:42.400
<v Speaker 2>fairly recently, because up until then the tech wasn't good enough,

0:22:42.440 --> 0:22:45.439
<v Speaker 2>and frankly, nobody had thought about it. But once Trump

0:22:45.560 --> 0:22:49.200
<v Speaker 2>normalized that if you don't like information, call it fake news,

0:22:49.680 --> 0:22:52.800
<v Speaker 2>suddenly this became the mantra. AI was still pretty nascent,

0:22:53.280 --> 0:22:57.000
<v Speaker 2>but now it's actually plausible deniability. Now it's actually

0:22:57.040 --> 0:22:58.800
<v Speaker 2>not an unreasonable thing. And if you go back and

0:22:58.840 --> 0:23:01.679
<v Speaker 2>look at that Access Hollywood tape, you never see him talking.

0:23:01.920 --> 0:23:06.440
<v Speaker 2>It's just audio. And so if that was released today, yeah,

0:23:06.600 --> 0:23:08.440
<v Speaker 2>we'd have to think pretty carefully about whether it was

0:23:08.480 --> 0:23:08.960
<v Speaker 2>real or not.

0:23:09.520 --> 0:23:11.879
<v Speaker 1>Your vocation is in some ways to talk about this and

0:23:11.880 --> 0:23:14.320
<v Speaker 1>bring attention to it in the media. But your business

0:23:14.359 --> 0:23:19.600
<v Speaker 1>is also to bring some technological solutions to the detection problem.

0:23:19.640 --> 0:23:21.840
<v Speaker 1>Is that right?</v> <v Speaker 2>Yeah, yeah. So I will tell you,</v>

0:23:22.480 --> 0:23:25.000
<v Speaker 2>I say this only half jokingly. I started the company

0:23:25.000 --> 0:23:27.159
<v Speaker 2>just because I couldn't keep up with the demand. I

0:23:27.240 --> 0:23:28.800
<v Speaker 2>just needed people to help me do this.

0:23:28.760 --> 0:23:31.720
<v Speaker 1>Because the best way to start a company, I think.

0:23:31.880 --> 0:23:34.479
<v Speaker 2>Yeah, I'm like, guys, I used to get one call

0:23:34.520 --> 0:23:35.960
<v Speaker 2>a week, and then it was one a day, and now

0:23:35.960 --> 0:23:37.920
<v Speaker 2>it's ten a day, and pretty soon it's gonna be

0:23:37.920 --> 0:23:40.040
<v Speaker 2>one hundred a day. I can't. I honestly can't keep up. But

0:23:40.119 --> 0:23:44.400
<v Speaker 2>less snarky, if you will, you know, we

0:23:44.440 --> 0:23:46.560
<v Speaker 2>really need to get a handle on this problem. And

0:23:46.600 --> 0:23:48.159
<v Speaker 2>I think there's a couple of places we want to

0:23:48.200 --> 0:23:51.080
<v Speaker 2>help organizations get a handle on it. So clearly, media outlets,

0:23:51.160 --> 0:23:54.280
<v Speaker 2>clearly we have to help the big news wires and

0:23:54.320 --> 0:23:57.439
<v Speaker 2>the major news agencies when they are dealing with breaking

0:23:57.480 --> 0:24:01.080
<v Speaker 2>news of LA fires and Gaza and inauguration and whatever.

0:24:01.440 --> 0:24:03.439
<v Speaker 2>They've got to know what the hell's going on. We

0:24:03.520 --> 0:24:05.639
<v Speaker 2>have to help them. We clearly have to help law

0:24:05.720 --> 0:24:09.680
<v Speaker 2>enforcement and national security agencies reason about a very complicated world,

0:24:09.720 --> 0:24:12.520
<v Speaker 2>from evidence in a court of law to things with

0:24:12.560 --> 0:24:16.919
<v Speaker 2>geopolitical implications. We have to help organizations. We are seeing

0:24:17.119 --> 0:24:20.720
<v Speaker 2>massive frauds being perpetrated on Fortune five hundred companies. We

0:24:20.800 --> 0:24:24.760
<v Speaker 2>are seeing imposter hiring. We are seeing people attack companies

0:24:24.800 --> 0:24:27.959
<v Speaker 2>with fake audio and video of CEOs, to damage their

0:24:27.960 --> 0:24:31.439
<v Speaker 2>stock price. We want to help individuals, right, deal with

0:24:31.480 --> 0:24:35.120
<v Speaker 2>this stuff. When they are getting information, how do they trust it?

0:24:35.600 --> 0:24:38.840
<v Speaker 2>And so we are developing a suite of tools that

0:24:38.920 --> 0:24:43.280
<v Speaker 2>would authenticate content, images, audio, and video to help people

0:24:43.359 --> 0:24:46.280
<v Speaker 2>make decisions. And it's not a value judgment. We're not

0:24:46.320 --> 0:24:48.280
<v Speaker 2>saying this is good or bad or neither. In fact,

0:24:48.320 --> 0:24:50.680
<v Speaker 2>we're not even saying if it's true or false. We

0:24:50.760 --> 0:24:53.600
<v Speaker 2>are simply saying is this an authentic photo, image or

0:24:53.680 --> 0:24:56.600
<v Speaker 2>video or is it not. It's a pretty simple question

0:24:56.680 --> 0:24:59.679
<v Speaker 2>with a very very complicated and difficult answer. And by

0:24:59.720 --> 0:25:02.200
<v Speaker 2>the way, it's not an if, it's a when.

0:25:02.960 --> 0:25:04.680
<v Speaker 2>It's when that happens that you have to start

0:25:04.720 --> 0:25:07.920
<v Speaker 2>thinking about this because it will happen, right, because anybody

0:25:07.920 --> 0:25:11.159
<v Speaker 2>can create these fakes now. Somebody doesn't like their

0:25:11.200 --> 0:25:13.200
<v Speaker 2>seat on an airline, they're going to go off and

0:25:13.240 --> 0:25:15.280
<v Speaker 2>attack your company by creating a fake image or a

0:25:15.320 --> 0:25:16.720
<v Speaker 2>video or an audio and they're going to try to

0:25:16.760 --> 0:25:19.000
<v Speaker 2>hurt you. And it's frankly not that hard to do.

0:25:19.560 --> 0:25:21.720
<v Speaker 1>And the product elements of what you're working on, what

0:25:21.880 --> 0:25:23.560
<v Speaker 1>is the technology that enables it.

0:25:24.160 --> 0:25:25.720
<v Speaker 2>Yeah, I'm going to tell you a little bit about it,

0:25:25.760 --> 0:25:27.679
<v Speaker 2>but not all of it, because you know, in the

0:25:27.720 --> 0:25:31.199
<v Speaker 2>cybersecurity world you have to be a little careful. But

0:25:31.400 --> 0:25:34.160
<v Speaker 2>underneath it is I've been doing this for twenty five years.

0:25:34.160 --> 0:25:37.040
<v Speaker 2>We have developed a suite of different technologies that look

0:25:37.119 --> 0:25:40.520
<v Speaker 2>at content from many different perspectives. We think about the

0:25:40.680 --> 0:25:44.000
<v Speaker 2>entire content creation process. So let's take an image for

0:25:44.040 --> 0:25:46.520
<v Speaker 2>an example. What happens with an image. You start out

0:25:46.600 --> 0:25:50.199
<v Speaker 2>here in the physical three dimensional world. Light moves and

0:25:50.280 --> 0:25:52.879
<v Speaker 2>hits the front of a lens. It passes through an

0:25:52.880 --> 0:25:55.720
<v Speaker 2>optical train, It hits an electronic sensor where it gets

0:25:55.760 --> 0:26:00.080
<v Speaker 2>converted from light photons, analog, to digital. It goes through

0:26:00.119 --> 0:26:02.880
<v Speaker 2>a series of post processing steps. It gets compressed into

0:26:02.880 --> 0:26:06.800
<v Speaker 2>a file, It gets uploaded to social media, it gets

0:26:06.840 --> 0:26:10.320
<v Speaker 2>downloaded onto my desk, and then my job begins. And

0:26:10.400 --> 0:26:13.880
<v Speaker 2>what we do is we insert ourselves into every part

0:26:13.920 --> 0:26:17.160
<v Speaker 2>of that process, the physical world, the optics, the electronic sensor,

0:26:17.240 --> 0:26:21.480
<v Speaker 2>the post processing, the packaging, and we build mathematical models

0:26:21.720 --> 0:26:24.760
<v Speaker 2>that we can say this is physically plausible, this is

0:26:24.760 --> 0:26:27.560
<v Speaker 2>physically implausible, this is consistent with a natural image, this

0:26:27.640 --> 0:26:30.199
<v Speaker 2>is consistent with an AI generated image. And we have

0:26:30.240 --> 0:26:33.800
<v Speaker 2>this suite of tools and then collectively, those come together

0:26:33.880 --> 0:26:36.720
<v Speaker 2>to tell a story about our belief that that piece

0:26:36.720 --> 0:26:38.040
<v Speaker 2>of content is authentic or not.

0:26:38.560 --> 0:26:41.639
<v Speaker 1>What degree of conviction do you have on any given

0:26:41.640 --> 0:26:43.600
<v Speaker 1>piece of content that you can verify whether or not it

0:26:43.640 --> 0:26:43.960
<v Speaker 1>is real?

0:26:44.320 --> 0:26:46.920
<v Speaker 2>First of all, great question, and I don't think it's

0:26:46.960 --> 0:26:49.239
<v Speaker 2>going to surprise you that the answer is complicated. I mean,

0:26:49.240 --> 0:26:50.760
<v Speaker 2>I'd like to be able to tell you ninety nine

0:26:50.760 --> 0:26:52.880
<v Speaker 2>point seven percent. And by the way, anybody who tells

0:26:52.920 --> 0:26:54.840
<v Speaker 2>you ninety nine point seven doesn't know what they're talking about.

0:26:54.880 --> 0:26:58.159
<v Speaker 2>And here's why it depends. So for example, if you

0:26:58.280 --> 0:27:02.880
<v Speaker 2>give me a high resolution twelve megapixel image that's

0:27:02.960 --> 0:27:06.280
<v Speaker 2>high quality, we can say a lot. If you give

0:27:06.320 --> 0:27:08.560
<v Speaker 2>me an image that's three hundred by three hundred pixels

0:27:08.600 --> 0:27:11.560
<v Speaker 2>and has gone through five levels of compression and resizing

0:27:11.600 --> 0:27:15.439
<v Speaker 2>and uploaded and downloaded, it's really really hard. So it

0:27:15.480 --> 0:27:18.400
<v Speaker 2>depends on the content. So there's a number of factors

0:27:18.440 --> 0:27:21.040
<v Speaker 2>that play in, but the obvious ones are this. If

0:27:21.040 --> 0:27:23.719
<v Speaker 2>you have a high quality, high resolution piece of content,

0:27:24.200 --> 0:27:27.240
<v Speaker 2>we're pretty good at this, and that level of confidence

0:27:27.280 --> 0:27:30.639
<v Speaker 2>and ability goes down as the quality of the content degrades.

0:27:30.720 --> 0:27:33.200
<v Speaker 2>It's like a physical DNA sample. You find a pile

0:27:33.240 --> 0:27:36.600
<v Speaker 2>of blood. Your DNA sample is good, You find a

0:27:36.640 --> 0:27:40.400
<v Speaker 2>tiny little half a drop of blood not so good. Look,

0:27:40.480 --> 0:27:42.520
<v Speaker 2>anybody who knows anything about the space knows there are

0:27:42.600 --> 0:27:45.160
<v Speaker 2>days where you say I don't know. I would much

0:27:45.240 --> 0:27:48.879
<v Speaker 2>much rather say I don't know than get it wrong.

0:27:49.320 --> 0:27:52.200
<v Speaker 1>So we've talked about a regulation solution, you're working

0:27:52.240 --> 0:27:56.000
<v Speaker 1>on a product solution. What about the average person who

0:27:56.000 --> 0:27:59.439
<v Speaker 1>is listening to this podcast. What is the way to

0:27:59.520 --> 0:28:01.920
<v Speaker 1>protect yourself in this changing environment?

0:28:02.160 --> 0:28:04.679
<v Speaker 2>This is easy. I really like this question because the

0:28:04.680 --> 0:28:07.000
<v Speaker 2>answer to everything is hard. This one's easy.

0:28:07.200 --> 0:28:10.000
<v Speaker 2>Get off of social media. Stop getting your news from

0:28:10.040 --> 0:28:12.639
<v Speaker 2>social media. That's it. You're not going to become an

0:28:12.720 --> 0:28:15.200
<v Speaker 2>armchair analyst. You're not going to become a digital forensic expert.

0:28:15.240 --> 0:28:17.600
<v Speaker 2>You're not going to become a misinformation expert. You can't

0:28:17.600 --> 0:28:19.040
<v Speaker 2>do that, you can't do it at scale. But here's

0:28:19.040 --> 0:28:21.400
<v Speaker 2>what you can do. Stop getting your goddamn news from

0:28:21.400 --> 0:28:26.119
<v Speaker 2>social media.</v> <v Speaker 1>Hany, thank you.</v> <v Speaker 2>Great talking to you guys.</v>

0:28:26.160 --> 0:28:28.800
<v Speaker 2>I can't believe it's been five years. Okay, let's do

0:28:28.880 --> 0:28:30.840
<v Speaker 2>this again in five years and see where

0:28:30.840 --> 0:28:32.800
<v Speaker 2>we are, and maybe it'll be my avatar that'll be

0:28:32.800 --> 0:28:33.280
<v Speaker 2>talking with you then.

0:28:33.359 --> 0:28:38.440
<v Speaker 1>That's it for this week for Tech

0:28:38.440 --> 0:28:42.520
<v Speaker 1>Stuff. I'm Oz Woloshyn. This episode was produced by Eliza Dennis,

0:28:42.600 --> 0:28:46.880
<v Speaker 1>Victoria Dominguez, and Lizzie Jacobs. It was executive produced by me,

0:28:47.360 --> 0:28:50.800
<v Speaker 1>Karah Preiss, and Kate Osborn for Kaleidoscope, and Katrina

0:28:50.880 --> 0:28:54.720
<v Speaker 1>Norvell for iHeart Podcasts. Jack Insley mixed this episode and

0:28:54.800 --> 0:28:57.960
<v Speaker 1>Kyle Murdoch wrote our theme song. Join us on Friday for a

0:28:58.000 --> 0:29:02.120
<v Speaker 1>special crossover episode with the podcast Part-Time Genius. We'll

0:29:02.160 --> 0:29:05.200
<v Speaker 1>be talking to Brian Merchant, author of Blood in the Machine,

0:29:05.520 --> 0:29:09.320
<v Speaker 1>about being a Luddite. Please rate, review, and reach

0:29:09.360 --> 0:29:12.600
<v Speaker 1>out to us at Tech Stuff podcast at gmail dot com.

0:29:12.720 --> 0:29:13.800
<v Speaker 1>We're excited to hear from you.