WEBVTT - AI: Friend or Foe

0:00:00.160 --> 0:00:07.200
<v Speaker 1>Brought to you by Toyota. Let's go places. Welcome to

0:00:07.400 --> 0:00:14.680
<v Speaker 1>Forward Thinking. Pay there and welcome to Forward Thinking, the

0:00:14.840 --> 0:00:17.439
<v Speaker 1>podcast that looks at the future and says, you've got

0:00:17.440 --> 0:00:20.760
<v Speaker 1>a friend. I'm Jonathan Strickland, I'm Lauren Vocaldon, and I'm

0:00:20.840 --> 0:00:27.880
<v Speaker 1>Joe McCormick. So, Joe, I hear you like a intelligence. Uh,

0:00:27.920 --> 0:00:30.520
<v Speaker 1>it's one of the things I like. I hear you

0:00:30.560 --> 0:00:37.400
<v Speaker 1>also like artificial things, like like artificial banana flavoring. Like

0:00:37.400 --> 0:00:40.640
<v Speaker 1>what one thing that Joe absolutely loves. I have seen

0:00:40.760 --> 0:00:44.280
<v Speaker 1>him put artificial banana flavoring on some of the weirdest stuff.

0:00:44.440 --> 0:00:46.640
<v Speaker 1>But I was really trying to get it artificial intelligence.

0:00:46.680 --> 0:00:48.120
<v Speaker 1>I know, I was going about it in a really

0:00:48.200 --> 0:00:51.680
<v Speaker 1>kind of indirect Really, I thought this podcast was going

0:00:51.720 --> 0:00:54.760
<v Speaker 1>to be about artificial vanilla extract. Well it could be,

0:00:54.840 --> 0:00:56.920
<v Speaker 1>but instead I've decided to switch it over to artificial

0:00:56.960 --> 0:01:01.360
<v Speaker 1>intelligence and the idea of creating a true artificial intelligence

0:01:01.360 --> 0:01:06.759
<v Speaker 1>that has human level or beyond intelligence, And how how

0:01:06.760 --> 0:01:11.200
<v Speaker 1>would we make sure it didn't kill us? Well, you

0:01:11.240 --> 0:01:13.279
<v Speaker 1>would have to be in a position to kill us first,

0:01:13.360 --> 0:01:15.480
<v Speaker 1>but that's something we can talk about as well. I

0:01:15.520 --> 0:01:18.360
<v Speaker 1>want to start with the idea of a robot politician,

0:01:18.800 --> 0:01:20.959
<v Speaker 1>which is a sort of construct that we touch on

0:01:21.000 --> 0:01:24.240
<v Speaker 1>in this week's video. Um, so, have you ever read

0:01:24.280 --> 0:01:28.480
<v Speaker 1>the Isaac asthmov short stories Evidence or The Evitable Conflict?

0:01:28.520 --> 0:01:30.640
<v Speaker 1>These are part of I Robot, And yes I have,

0:01:31.000 --> 0:01:33.520
<v Speaker 1>I have not, So for those of us who haven't, Joe,

0:01:33.720 --> 0:01:34.759
<v Speaker 1>do you want to do you want to talk about

0:01:34.760 --> 0:01:36.160
<v Speaker 1>that for a second? Sure? Well, I don't want to

0:01:36.200 --> 0:01:40.320
<v Speaker 1>give too many spoilers, but one of them is about

0:01:40.360 --> 0:01:43.800
<v Speaker 1>a controversy where there is a politician running for an

0:01:43.800 --> 0:01:48.000
<v Speaker 1>elected office who is suspected of being a machine. Right

0:01:48.080 --> 0:01:50.960
<v Speaker 1>and in fact, in the world that as a mom

0:01:51.040 --> 0:01:53.920
<v Speaker 1>has created, it's important for you to realize that machines,

0:01:54.080 --> 0:01:58.080
<v Speaker 1>robots with positronic brains, which are these artificially intelligent brains,

0:01:58.440 --> 0:02:02.520
<v Speaker 1>are not allowed to be on worlds that have human habitation.

0:02:03.040 --> 0:02:06.120
<v Speaker 1>You can only be on uninhabited worlds. It's the only

0:02:06.120 --> 0:02:08.000
<v Speaker 1>place where those robots are allowed to go. So they're

0:02:08.040 --> 0:02:10.840
<v Speaker 1>they're allowed to go to places and do dangerous work

0:02:10.919 --> 0:02:13.080
<v Speaker 1>that benefits the rest of humanity, but they can't be

0:02:13.120 --> 0:02:16.079
<v Speaker 1>on a world that's inhabited by humans. Yeah. So, Asimov

0:02:16.200 --> 0:02:20.440
<v Speaker 1>had an interesting approach to talking about the integration of

0:02:20.600 --> 0:02:24.320
<v Speaker 1>robots and artificial intelligence into society, which I like because

0:02:24.320 --> 0:02:27.760
<v Speaker 1>it was neither utopian nor dystopian. Now it is very

0:02:27.919 --> 0:02:29.880
<v Speaker 1>very much kind of taking light. Let's look at the

0:02:29.880 --> 0:02:32.919
<v Speaker 1>world around us, which is definitely not perfect, but it's

0:02:32.960 --> 0:02:36.520
<v Speaker 1>not you know, twelve monkeys world worst case scenario either. No,

0:02:36.880 --> 0:02:40.920
<v Speaker 1>he was exploring a sort of a smart, well engineered

0:02:40.960 --> 0:02:44.120
<v Speaker 1>system that still had flaws in it. And so the

0:02:44.160 --> 0:02:48.120
<v Speaker 1>system was that the robots in this world are governed

0:02:48.160 --> 0:02:51.240
<v Speaker 1>by three laws. The first law is you cannot harm

0:02:51.280 --> 0:02:54.680
<v Speaker 1>a human. Second law is you have to obey human commands.

0:02:55.320 --> 0:02:58.440
<v Speaker 1>Third law is you can't destroy yourself. Right. And of

0:02:58.440 --> 0:03:01.160
<v Speaker 1>course each of the laws ends up saying unless it

0:03:01.480 --> 0:03:05.680
<v Speaker 1>would break prioritized one to three. Right. Uh yeah, So

0:03:06.000 --> 0:03:08.520
<v Speaker 1>they use this to try to create a framework to

0:03:08.560 --> 0:03:11.960
<v Speaker 1>make sure that a robot never does anything bad. Of course,

0:03:12.040 --> 0:03:15.040
<v Speaker 1>it doesn't always work, and thus is the sort of

0:03:15.120 --> 0:03:18.280
<v Speaker 1>point of conflict for many of Asimov's stories. It's like, uh,

0:03:18.320 --> 0:03:20.560
<v Speaker 1>they're sort of obeying the laws, but the laws are

0:03:20.560 --> 0:03:23.840
<v Speaker 1>coming into conflict in such a way that now we've

0:03:23.840 --> 0:03:26.040
<v Speaker 1>got a problem. Right, And do do recall that he

0:03:26.120 --> 0:03:28.680
<v Speaker 1>was writing fiction to be entertaining. He wrote the laws

0:03:28.680 --> 0:03:31.280
<v Speaker 1>in order to be interestingly flawed so that he could

0:03:31.280 --> 0:03:33.920
<v Speaker 1>exploit that for story purposes. This this was never meant

0:03:33.919 --> 0:03:37.800
<v Speaker 1>to be a complete manifesto of how to robot right, right,

0:03:37.840 --> 0:03:39.960
<v Speaker 1>So back to the two stories you brought up. The

0:03:40.040 --> 0:03:42.320
<v Speaker 1>idea of one of them is that there's a secret

0:03:42.400 --> 0:03:45.920
<v Speaker 1>robot who seems to be human outwardly running for office,

0:03:46.040 --> 0:03:48.480
<v Speaker 1>and the question is is it really a person or

0:03:48.560 --> 0:03:51.680
<v Speaker 1>is it really a robot? But characters within this story

0:03:51.800 --> 0:03:54.360
<v Speaker 1>debate whether it's really such a bad thing to have

0:03:54.440 --> 0:03:58.600
<v Speaker 1>a robot in office because the robot, unlike humans, is

0:03:59.000 --> 0:04:02.520
<v Speaker 1>not self interest did it has? It has these laws

0:04:02.640 --> 0:04:05.640
<v Speaker 1>governing its actions, and these laws will in the end

0:04:05.760 --> 0:04:08.640
<v Speaker 1>ensure that really it isn't going to do harm. In fact,

0:04:08.680 --> 0:04:11.520
<v Speaker 1>one of the main characters in I Robot is this

0:04:12.440 --> 0:04:20.159
<v Speaker 1>humor less kind of misanthropic robo psychologists. She's she's human,

0:04:20.760 --> 0:04:25.600
<v Speaker 1>but she specializes in robo psychology, and she uh, she

0:04:27.040 --> 0:04:31.159
<v Speaker 1>call her humor less. But there are specific passages where

0:04:31.240 --> 0:04:34.120
<v Speaker 1>she she she people try to engage with her and

0:04:34.160 --> 0:04:39.679
<v Speaker 1>she turns her humorless eyes upon them that she she states,

0:04:40.320 --> 0:04:44.680
<v Speaker 1>uh completely, you know, in a in a very uh

0:04:44.720 --> 0:04:48.000
<v Speaker 1>straightforward way, that she thinks robots are superior to human

0:04:48.040 --> 0:04:52.240
<v Speaker 1>beings in in most in most ways because with the

0:04:52.360 --> 0:04:55.720
<v Speaker 1>Robot President character, the person who may or may not

0:04:55.839 --> 0:04:58.719
<v Speaker 1>be a robot. In fact, they're very careful to try

0:04:58.839 --> 0:05:02.560
<v Speaker 1>and build a k either way. They being Asima, really

0:05:02.839 --> 0:05:04.960
<v Speaker 1>build a case either way that could be robot, could

0:05:05.000 --> 0:05:10.039
<v Speaker 1>be human. Uh. She says that he's either a robot

0:05:10.600 --> 0:05:14.919
<v Speaker 1>or a really really really decent human being. So that

0:05:15.080 --> 0:05:17.400
<v Speaker 1>that kind of tells you that that character's perspective and

0:05:17.440 --> 0:05:19.719
<v Speaker 1>a lot of the stories come from from her kind

0:05:19.720 --> 0:05:24.279
<v Speaker 1>of experience that she feels that robots are in fact

0:05:24.560 --> 0:05:27.520
<v Speaker 1>better than people for the most part. Right, But let's

0:05:27.560 --> 0:05:30.320
<v Speaker 1>imagine we take it one step beyond just the idea

0:05:30.320 --> 0:05:33.680
<v Speaker 1>of a single robot in a single leadership role. There's

0:05:33.680 --> 0:05:38.440
<v Speaker 1>another Asimov's story called The Inevitable Conflict, which discusses how

0:05:38.920 --> 0:05:42.840
<v Speaker 1>at some point in the future, all kinds of systems

0:05:42.880 --> 0:05:47.040
<v Speaker 1>are governed by robotic or artificially intelligent controls. Some would

0:05:47.120 --> 0:05:49.760
<v Speaker 1>argue that we're already in that world at some point.

0:05:49.839 --> 0:05:52.760
<v Speaker 1>I mean, you look at the stock market, you know,

0:05:52.920 --> 0:05:56.400
<v Speaker 1>robo trading. You've got like this, this, all these algorithms,

0:05:56.440 --> 0:06:00.320
<v Speaker 1>these these programs that are running all these sophisticated uh,

0:06:00.520 --> 0:06:03.520
<v Speaker 1>you know, algorithms to guide them on when to buy

0:06:03.520 --> 0:06:08.120
<v Speaker 1>and when to sell all these uh, these very short transactions. Uh,

0:06:08.200 --> 0:06:11.440
<v Speaker 1>and they have global consequences. We've talked about that previously

0:06:11.520 --> 0:06:14.320
<v Speaker 1>on this podcast. So in some ways we're already seeing

0:06:14.320 --> 0:06:17.120
<v Speaker 1>that come to pass. Now we're not talking about a computer.

0:06:17.200 --> 0:06:20.000
<v Speaker 1>We go to you know, type in a question of

0:06:20.279 --> 0:06:23.000
<v Speaker 1>you know, how do we do such and such, and

0:06:23.040 --> 0:06:25.200
<v Speaker 1>it gives us the sage advice and then we you know,

0:06:25.279 --> 0:06:27.440
<v Speaker 1>it's not deep thoughts. I don't know. Google does that

0:06:27.520 --> 0:06:30.360
<v Speaker 1>for me about seventy eight times a day. Google, Well,

0:06:30.520 --> 0:06:33.719
<v Speaker 1>Google does do that. We are already sort of wading

0:06:33.720 --> 0:06:35.560
<v Speaker 1>into these waters, whether you know it or not. You

0:06:35.600 --> 0:06:38.440
<v Speaker 1>mentioned the Stock exchange, but you might say, oh, well

0:06:38.440 --> 0:06:41.120
<v Speaker 1>but that's private industry, wild West guns Blaze and they're

0:06:41.160 --> 0:06:43.479
<v Speaker 1>doing whatever. You know, the government wouldn't do that. Well,

0:06:43.760 --> 0:06:47.279
<v Speaker 1>the I R S already has a process called computer scoring,

0:06:47.680 --> 0:06:51.280
<v Speaker 1>where you submit a tax return and computers pre screen

0:06:51.360 --> 0:06:54.479
<v Speaker 1>those returns to decide whether or not we should put

0:06:54.480 --> 0:06:58.200
<v Speaker 1>you into the pile to investigate for an audit. Yeah,

0:06:58.760 --> 0:07:03.240
<v Speaker 1>and the fun act is this podcast goes live the

0:07:03.279 --> 0:07:06.599
<v Speaker 1>week of income Tax Day but after it's already over.

0:07:06.680 --> 0:07:09.120
<v Speaker 1>So I hope you guys thought about that before you

0:07:09.160 --> 0:07:15.320
<v Speaker 1>since your it turns in. Okay, So imagine a future

0:07:15.400 --> 0:07:19.800
<v Speaker 1>where we do have artificially intelligent machines, probably much more

0:07:19.880 --> 0:07:24.480
<v Speaker 1>intelligent than humans. Otherwise, what's the point governing our systems,

0:07:24.480 --> 0:07:28.480
<v Speaker 1>our societies, our economies, making decisions on our behalf to

0:07:28.520 --> 0:07:30.640
<v Speaker 1>try to make the world a better place for us?

0:07:30.680 --> 0:07:33.400
<v Speaker 1>And there's hypothetical pluses and minuses here. What are some

0:07:33.440 --> 0:07:36.280
<v Speaker 1>of the good points? Well, good point would be that

0:07:36.400 --> 0:07:40.520
<v Speaker 1>it be able to make decisions faster and with preface,

0:07:41.200 --> 0:07:44.880
<v Speaker 1>ideally with less bias than a human being with Oh yeah, well,

0:07:45.120 --> 0:07:47.240
<v Speaker 1>let's just start from the ideal point of view before

0:07:47.280 --> 0:07:49.560
<v Speaker 1>we crack a bunch of Okay, So, let's say it's

0:07:49.600 --> 0:07:53.440
<v Speaker 1>a perfect AI and it is uh, you know, you

0:07:53.480 --> 0:07:57.920
<v Speaker 1>wouldn't call it cold. It's logical, but it's also compassionate. Yeah,

0:07:58.000 --> 0:08:00.360
<v Speaker 1>Let's say you've you've created a computer and given it

0:08:00.440 --> 0:08:04.840
<v Speaker 1>some instruction like create the greatest maximal benefit for humanity,

0:08:05.000 --> 0:08:07.680
<v Speaker 1>and it it works out how to do that, which

0:08:07.720 --> 0:08:10.360
<v Speaker 1>it can do because it's super intelligent. It's way smarter

0:08:10.440 --> 0:08:14.360
<v Speaker 1>than any human and it can look at trends in society.

0:08:14.400 --> 0:08:17.400
<v Speaker 1>It can look at unemployment numbers and crime statistics and

0:08:17.440 --> 0:08:21.040
<v Speaker 1>all these things, distribution, water distribution, It can average all

0:08:21.120 --> 0:08:24.720
<v Speaker 1>of that data together to make incredibly accurate predictions about

0:08:24.720 --> 0:08:27.160
<v Speaker 1>the effects of its actions that we just don't have

0:08:27.200 --> 0:08:29.960
<v Speaker 1>the cognitive capability to do. And furthermore, it can do

0:08:30.000 --> 0:08:32.880
<v Speaker 1>all of that with with no hate, no greed, no ambition,

0:08:32.960 --> 0:08:36.280
<v Speaker 1>no prejudice. Right, exactly, it doesn't have a will to

0:08:36.440 --> 0:08:39.880
<v Speaker 1>power of its own. It just has programming. It just has,

0:08:40.280 --> 0:08:43.680
<v Speaker 1>you know, doing what it's designed to do. So that's

0:08:43.720 --> 0:08:47.760
<v Speaker 1>the ideal, perfect vision sort of. It's perfectly capable and

0:08:47.840 --> 0:08:54.400
<v Speaker 1>it's perfectly moral. But on the other hand, machines are unpredictable,

0:08:54.480 --> 0:08:56.760
<v Speaker 1>or at least machines like this. Actually, machines on the

0:08:56.800 --> 0:08:59.319
<v Speaker 1>small scale are very predictable. They do what you tell

0:08:59.360 --> 0:09:01.040
<v Speaker 1>them to do in the thing else they aren't. They

0:09:01.080 --> 0:09:03.640
<v Speaker 1>can't do anything else because they weren't programmed to do.

0:09:03.920 --> 0:09:05.960
<v Speaker 1>But if you create a machine that is more intelligent

0:09:06.000 --> 0:09:09.480
<v Speaker 1>than you, you inherently cannot understand what it's doing. Whoops. Yeah,

0:09:09.600 --> 0:09:14.400
<v Speaker 1>So any machine smarter than you, you sort of lose transparency, right,

0:09:14.480 --> 0:09:18.199
<v Speaker 1>it's hard to understand the decisions that are being made.

0:09:18.280 --> 0:09:20.720
<v Speaker 1>If they're being made, it's at a level, way way

0:09:20.720 --> 0:09:23.480
<v Speaker 1>above your head. Let's here's an example. Let's say that

0:09:23.640 --> 0:09:26.960
<v Speaker 1>we have like the Grand Deep Thought computer that we

0:09:27.040 --> 0:09:30.640
<v Speaker 1>want to consult when we have a particularly tough question. Uh.

0:09:30.679 --> 0:09:32.560
<v Speaker 1>And maybe it's one of these about how do we

0:09:32.679 --> 0:09:36.120
<v Speaker 1>have the maximum benefit for the most people on earth,

0:09:36.200 --> 0:09:40.000
<v Speaker 1>impacting having a negative impact on the least number of people,

0:09:40.440 --> 0:09:42.600
<v Speaker 1>trying to trying to get as good a reaction as

0:09:42.600 --> 0:09:45.360
<v Speaker 1>we possibly can, knowing that there's not likely to be

0:09:45.400 --> 0:09:48.400
<v Speaker 1>any perfect answer that's going to make all ships rise

0:09:48.520 --> 0:09:51.160
<v Speaker 1>up with the tide, right, Uh. And then the computer

0:09:51.240 --> 0:09:53.600
<v Speaker 1>comes back and gives us an answer that, on the

0:09:53.600 --> 0:09:57.400
<v Speaker 1>face of it seems counter intuitive or counter productive. And

0:09:57.440 --> 0:10:01.480
<v Speaker 1>the computer knows because it's run the remulations that while

0:10:01.559 --> 0:10:04.839
<v Speaker 1>this first step is possibly a tough one for us

0:10:04.840 --> 0:10:07.520
<v Speaker 1>to take, it's actually the one that will lead to

0:10:07.559 --> 0:10:10.560
<v Speaker 1>the most beneficial outcome. So then the short term we

0:10:10.640 --> 0:10:14.520
<v Speaker 1>have some hardship. Perhaps it is food redistribution, which would

0:10:14.559 --> 0:10:17.320
<v Speaker 1>be a huge one, right, or water redistribution, which would

0:10:17.320 --> 0:10:20.200
<v Speaker 1>be another huge problem. But let's say that's that first

0:10:20.200 --> 0:10:23.680
<v Speaker 1>step that's really really hard for at least some parts

0:10:23.720 --> 0:10:26.800
<v Speaker 1>of the world to to agree to. Then you could

0:10:26.800 --> 0:10:29.960
<v Speaker 1>have people arguing this thing is trying to destroy us,

0:10:30.000 --> 0:10:32.480
<v Speaker 1>it's not trying to help us, not necessarily being able

0:10:32.480 --> 0:10:34.760
<v Speaker 1>to see that twenty eight steps down the road, it

0:10:34.800 --> 0:10:38.000
<v Speaker 1>actually leads to an outcome that's beneficial for everybody. Likewise,

0:10:38.000 --> 0:10:40.080
<v Speaker 1>on the other hand, it could tell us to do

0:10:40.200 --> 0:10:43.800
<v Speaker 1>something because it is malfunctioning and we don't have the

0:10:43.840 --> 0:10:48.360
<v Speaker 1>transparency capability to understand that it's malfunctioning. Thus it leads

0:10:48.440 --> 0:10:51.840
<v Speaker 1>us down a really horrible path. Without hating us, I

0:10:51.840 --> 0:10:55.160
<v Speaker 1>mean it doesn't. It's not that it's trying to destroy humanity.

0:10:55.200 --> 0:11:00.200
<v Speaker 1>I mean it might in it just it calculated something long,

0:11:00.240 --> 0:11:05.000
<v Speaker 1>it didn't understand something one burn all the week, okay.

0:11:05.800 --> 0:11:08.000
<v Speaker 1>And that flip side of it of it not being

0:11:08.360 --> 0:11:11.040
<v Speaker 1>hateful of a machine inherently not being hateful is that

0:11:11.240 --> 0:11:15.200
<v Speaker 1>a machine inherently has no human empathy or intuition about

0:11:15.360 --> 0:11:18.720
<v Speaker 1>what what step is okay and what is not unless

0:11:18.760 --> 0:11:21.720
<v Speaker 1>we program that in. Yes, So if you haven't thought

0:11:21.920 --> 0:11:26.280
<v Speaker 1>to have the computer specifically look at the most disadvantaged

0:11:26.400 --> 0:11:30.600
<v Speaker 1>people and uh and take special consideration for those people

0:11:30.640 --> 0:11:35.120
<v Speaker 1>who are are essentially they're going to be victims of

0:11:35.200 --> 0:11:37.679
<v Speaker 1>whatever decisions you make. It may be that they have

0:11:37.840 --> 0:11:41.360
<v Speaker 1>a positive outcome, but it may not be unless you've

0:11:41.400 --> 0:11:43.680
<v Speaker 1>built that in. Then the computer is not necessarily going

0:11:43.760 --> 0:11:47.040
<v Speaker 1>to make that consideration for you. And that could be

0:11:47.080 --> 0:11:50.319
<v Speaker 1>a real impact. Right, I'd like to mention something else.

0:11:50.360 --> 0:11:53.199
<v Speaker 1>We say that a computer has no hate, has no greed,

0:11:53.240 --> 0:11:55.600
<v Speaker 1>and all those things, which is inherently true about the computer,

0:11:56.040 --> 0:11:59.520
<v Speaker 1>but the humans that create the computer could have those things.

0:12:00.080 --> 0:12:02.160
<v Speaker 1>And a program is only going to be as impartial

0:12:02.200 --> 0:12:04.400
<v Speaker 1>as its creator was. And and you know, the creator

0:12:04.480 --> 0:12:06.319
<v Speaker 1>might be sitting there going like, well, you know, some

0:12:06.440 --> 0:12:11.160
<v Speaker 1>animals are more than others. Yeah, you know. And so

0:12:11.960 --> 0:12:13.480
<v Speaker 1>even if you even if you take it a couple

0:12:13.480 --> 0:12:15.920
<v Speaker 1>of steps further, because I've seen it proposed that if you, okay,

0:12:16.000 --> 0:12:19.560
<v Speaker 1>create a super intelligent machine and have that super intelligent machine,

0:12:19.840 --> 0:12:22.559
<v Speaker 1>create a really super intelligent machine and use that super

0:12:22.559 --> 0:12:25.600
<v Speaker 1>intelligent machine as it's your president robot, this is deep

0:12:25.679 --> 0:12:30.560
<v Speaker 1>thought creating the Earth. Yeah, because the Earth is a

0:12:30.559 --> 0:12:33.679
<v Speaker 1>computer in Hitchecker's guide, right, right, computer? And I mean,

0:12:34.800 --> 0:12:37.520
<v Speaker 1>you know, input output, if if if if the humans

0:12:37.559 --> 0:12:41.520
<v Speaker 1>creating deep thought we're prejudiced at the beginning, then that

0:12:41.520 --> 0:12:44.679
<v Speaker 1>could just computer form. Right. So yeah, I mean if

0:12:44.760 --> 0:12:47.559
<v Speaker 1>you if you have a bias, and that bias is

0:12:47.559 --> 0:12:49.959
<v Speaker 1>built into the programming you make. Because you know, we're

0:12:50.000 --> 0:12:53.440
<v Speaker 1>talking about a an intelligent computer. I think a lot

0:12:53.480 --> 0:12:57.760
<v Speaker 1>of people just imagine that to be an incredibly powerful machine,

0:12:57.800 --> 0:13:00.599
<v Speaker 1>and that's where it begins it, right, That's it's the

0:13:00.640 --> 0:13:03.520
<v Speaker 1>machine part that's important. But like we said in our

0:13:03.559 --> 0:13:08.800
<v Speaker 1>Singularity podcast, the software is equally as important, and without

0:13:08.840 --> 0:13:12.120
<v Speaker 1>it maybe more important. Yeah, you could argue more important.

0:13:12.160 --> 0:13:14.000
<v Speaker 1>I mean without the hardware, the software can't run. But

0:13:14.040 --> 0:13:18.440
<v Speaker 1>without the software, it can't be intelligent. Right, So unless

0:13:18.480 --> 0:13:22.679
<v Speaker 1>you have very sophisticated software that can take on the

0:13:22.679 --> 0:13:26.679
<v Speaker 1>these these problems, either by designing the next computer so

0:13:26.720 --> 0:13:30.400
<v Speaker 1>that it is the most efficient or by doing it itself.

0:13:31.240 --> 0:13:33.560
<v Speaker 1>If the if the programmers do have this bias, that

0:13:33.600 --> 0:13:36.800
<v Speaker 1>could be reflected in the results. Okay, so people are

0:13:36.840 --> 0:13:40.960
<v Speaker 1>talking about creating a super intelligent machine. Obviously we can't

0:13:41.000 --> 0:13:44.720
<v Speaker 1>do that today, but people are refining AI methods and

0:13:44.800 --> 0:13:47.880
<v Speaker 1>it may in some people's minds sneak up on us,

0:13:47.920 --> 0:13:50.440
<v Speaker 1>Like you could suddenly realize like, oh, we've gone a

0:13:50.440 --> 0:13:53.760
<v Speaker 1>long way down this road to creating something that's equal

0:13:53.800 --> 0:13:56.560
<v Speaker 1>to human intelligence or even beyond it, which is really

0:13:56.559 --> 0:13:59.840
<v Speaker 1>the sweet spot for these problems. Maybe it's a good

0:13:59.840 --> 0:14:02.800
<v Speaker 1>idea to start thinking about what we would need to

0:14:02.880 --> 0:14:07.280
<v Speaker 1>do in order to prevent really negative outcomes if we

0:14:07.280 --> 0:14:10.559
<v Speaker 1>were to create this superintelligent machine. Right, the two big

0:14:10.600 --> 0:14:14.319
<v Speaker 1>negative outcomes these are like taking to the absurd extreme obviously,

0:14:14.800 --> 0:14:17.400
<v Speaker 1>but I call it the kill all humans or the

0:14:17.440 --> 0:14:22.000
<v Speaker 1>subjugate all humans approaches. These are really popular in science fiction. Right.

0:14:22.000 --> 0:14:24.840
<v Speaker 1>This is this is the world of the Terminator, where

0:14:25.200 --> 0:14:28.800
<v Speaker 1>humans have created machines that gain sentience and ultimately turn

0:14:28.880 --> 0:14:31.960
<v Speaker 1>on their creators for one reason or another. And there

0:14:31.960 --> 0:14:35.160
<v Speaker 1>are a lot of different approaches to this kind of storyline.

0:14:35.160 --> 0:14:39.040
<v Speaker 1>In some cases, the machines have malevolent intent. They actually

0:14:39.120 --> 0:14:42.160
<v Speaker 1>want to kill humans because they're you know, essentially robotic

0:14:42.200 --> 0:14:47.000
<v Speaker 1>psychopaths and other versions. It's that the machines have calculated

0:14:47.040 --> 0:14:50.800
<v Speaker 1>that the best possible outcome for whatever planet Earth will

0:14:50.840 --> 0:14:53.400
<v Speaker 1>say is for humans to be wiped off, because that's

0:14:53.400 --> 0:14:54.840
<v Speaker 1>the source of most of the problems. So if you

0:14:54.840 --> 0:14:56.640
<v Speaker 1>get rid of the source, then the problems are gone.

0:14:56.960 --> 0:15:00.000
<v Speaker 1>So in some cases it's like a mistaken like, oh,

0:15:00.040 --> 0:15:01.880
<v Speaker 1>I know how to solve this issue. We just gotta

0:15:01.960 --> 0:15:04.560
<v Speaker 1>kill all the people deemed you ilogical, right, or the

0:15:04.600 --> 0:15:07.960
<v Speaker 1>subjecate all humans. That's essentially the matrix approach where we've

0:15:08.000 --> 0:15:11.000
<v Speaker 1>created machines and we are Our intent was to make

0:15:11.000 --> 0:15:14.120
<v Speaker 1>the machines work for us, but irony of ironies, the

0:15:14.160 --> 0:15:16.680
<v Speaker 1>machines have decided that we're going to be working for them,

0:15:16.720 --> 0:15:20.400
<v Speaker 1>possibly as giant batteries, although that's incredibly inefficient. They get

0:15:20.440 --> 0:15:22.560
<v Speaker 1>better results from cows that should have been the moo

0:15:22.640 --> 0:15:27.840
<v Speaker 1>tricks I've been waiting to use that. Lauren is shaking

0:15:27.840 --> 0:15:30.200
<v Speaker 1>her head at me. So tech stuff fans know what that.

0:15:31.640 --> 0:15:34.360
<v Speaker 1>Joe Joe appreciates it. I think it's only because I've

0:15:34.400 --> 0:15:37.720
<v Speaker 1>heard that one before from from you on tech stuff.

0:15:37.760 --> 0:15:41.920
<v Speaker 1>It's also fair. Okay, let's talk about friendly AI. Okay,

0:15:41.960 --> 0:15:45.280
<v Speaker 1>this is the this is the term. It's friendly artificial intelligence,

0:15:45.360 --> 0:15:48.120
<v Speaker 1>the term for the framework that we would need to

0:15:48.160 --> 0:15:51.960
<v Speaker 1>come up with to create artificial intelligence or a super

0:15:51.960 --> 0:15:55.680
<v Speaker 1>intelligence that has a net benefit to humanity rather than

0:15:55.720 --> 0:15:58.320
<v Speaker 1>a negative outcome. I like to think of friendly AI

0:15:58.360 --> 0:16:00.720
<v Speaker 1>as the AI that walks in the door, takes off

0:16:00.760 --> 0:16:03.160
<v Speaker 1>its jacket and slow first puts on a pair of sneakers.

0:16:03.160 --> 0:16:06.960
<v Speaker 1>A little sweater vest and then just gently leads you

0:16:07.000 --> 0:16:09.680
<v Speaker 1>into the future. Lets us see a little story about trains,

0:16:10.280 --> 0:16:13.320
<v Speaker 1>trains with faces in the future. Can can we have

0:16:13.360 --> 0:16:16.400
<v Speaker 1>anyone building super intelligent AI is listening? Please do that,

0:16:16.520 --> 0:16:19.160
<v Speaker 1>because that would be essentially the best of all We

0:16:19.160 --> 0:16:22.960
<v Speaker 1>were actually designed friendly AI to follow the philosophy of Mr. Rogers.

0:16:23.000 --> 0:16:26.040
<v Speaker 1>We'd be set, won't you be my neighbor? I would

0:16:26.040 --> 0:16:31.160
<v Speaker 1>totally be that that super intelligent AI's neighbor, completely without hesitation. Okay,

0:16:31.160 --> 0:16:33.560
<v Speaker 1>But so there's some guidelines that people have written up

0:16:33.600 --> 0:16:36.000
<v Speaker 1>and and for a while these guidelines have existed. Back

0:16:36.000 --> 0:16:40.280
<v Speaker 1>in two thousand one, the Singularity Institute published a thing,

0:16:40.640 --> 0:16:43.680
<v Speaker 1>a rather lengthy thing that I will not go into deep,

0:16:43.720 --> 0:16:46.480
<v Speaker 1>deep detail of, but but they began by positing that

0:16:46.480 --> 0:16:49.360
<v Speaker 1>that since growth in AI is and I quote astronomically

0:16:49.440 --> 0:16:52.480
<v Speaker 1>faster than the rate of human evolution um, that we

0:16:52.560 --> 0:16:55.520
<v Speaker 1>need to be thinking about this issue. And hey, we'll

0:16:55.520 --> 0:16:59.760
<v Speaker 1>talk about that that belief system um in our episode

0:16:59.840 --> 0:17:02.360
<v Speaker 1>or already talked about it in our episode about the Singularity.

0:17:02.440 --> 0:17:04.240
<v Speaker 1>We don't know which one will come first. I will

0:17:04.400 --> 0:17:07.040
<v Speaker 1>I will say that it definitely has evolved much If

0:17:07.040 --> 0:17:09.600
<v Speaker 1>you think of human evolution as taking over the course

0:17:09.640 --> 0:17:12.000
<v Speaker 1>of millions of years, and the fact that we've had

0:17:12.119 --> 0:17:15.680
<v Speaker 1>computers since the like nineteen forties. If you want to

0:17:15.720 --> 0:17:18.720
<v Speaker 1>be really generous, I can I can agree with the

0:17:18.800 --> 0:17:22.679
<v Speaker 1>astronomically faster evolution. I don't know that necessarily leads to

0:17:23.280 --> 0:17:27.760
<v Speaker 1>superintelligent computers, but but pray continue, sure um and and hey,

0:17:27.800 --> 0:17:31.639
<v Speaker 1>either way, caution and thought are good. So they specifically

0:17:31.680 --> 0:17:34.040
<v Speaker 1>suggest that we should be careful not to expect a

0:17:34.119 --> 0:17:36.760
<v Speaker 1>machine mind to operate like a human mind. Um, that

0:17:36.760 --> 0:17:40.920
<v Speaker 1>that we shouldn't anthropomorphize AI. Right, that's a really good point. Sure, sure,

0:17:40.960 --> 0:17:44.119
<v Speaker 1>And building from there, they lay out the challenges in

0:17:44.200 --> 0:17:50.080
<v Speaker 1>creating friendly AI um being the creation of ethical content UM,

0:17:50.119 --> 0:17:54.440
<v Speaker 1>creating a machine capable of acquiring that content, even asking

0:17:54.560 --> 0:17:58.560
<v Speaker 1>human questions when necessary, but simultaneously knowing enough to resist

0:17:58.640 --> 0:18:02.800
<v Speaker 1>human manipulation and sell correct for human errors. That's pretty cool, though.

0:18:03.800 --> 0:18:06.159
<v Speaker 1>They go into a lot more depth in the recommendations.

0:18:06.520 --> 0:18:09.880
<v Speaker 1>Believe these are based on Yudkowski's work, right, yeah, yeah,

0:18:09.960 --> 0:18:13.760
<v Speaker 1>he did a book length kind of paper also in

0:18:13.760 --> 0:18:16.960
<v Speaker 1>two thousand one called Creating Friendly AI one point oh,

0:18:17.040 --> 0:18:22.440
<v Speaker 1>the Analysis and Design of Benevolent Goal architectures. Elias are Udkowski,

0:18:22.480 --> 0:18:25.520
<v Speaker 1>who we mentioned in our podcast about the Singularity. He's

0:18:25.560 --> 0:18:29.960
<v Speaker 1>written at length about this specific problem, the friendly AI problem. Yes, yes,

0:18:30.040 --> 0:18:32.200
<v Speaker 1>and uh, and we'll have more to say about an

0:18:32.200 --> 0:18:36.080
<v Speaker 1>interesting thought experiment he came up with in a little bit. Yeah,

0:18:36.119 --> 0:18:39.680
<v Speaker 1>so we should back up and say, hey, wait a second,

0:18:40.280 --> 0:18:42.560
<v Speaker 1>why do we really need to worry about friendly AI

0:18:42.920 --> 0:18:45.880
<v Speaker 1>kill all humans and subjugate all humans? I pretty much

0:18:45.920 --> 0:18:49.760
<v Speaker 1>covered that, why would that happen? Well, okay, what if

0:18:49.800 --> 0:18:53.560
<v Speaker 1>we just do what apparently most AI developers are doing

0:18:53.960 --> 0:18:56.520
<v Speaker 1>and just keep going and hope it will work out

0:18:56.560 --> 0:18:59.600
<v Speaker 1>for the best. Uh. There Actually there have been some people,

0:18:59.720 --> 0:19:02.919
<v Speaker 1>some thinkers in in friendly AI who have pointed out

0:19:02.960 --> 0:19:05.480
<v Speaker 1>that this seems to be the dominant approach, just kind

0:19:05.520 --> 0:19:07.560
<v Speaker 1>of hope it's going to work out well and and

0:19:07.640 --> 0:19:11.399
<v Speaker 1>hope that no one's programming psychopathic tendencies into their software.

0:19:12.480 --> 0:19:14.240
<v Speaker 1>Part part of it, I would argue, is that a

0:19:14.280 --> 0:19:17.000
<v Speaker 1>lot of programmers say that we're so far away from

0:19:17.040 --> 0:19:21.280
<v Speaker 1>a a human level intelligence or superhuman level intelligence of

0:19:21.280 --> 0:19:25.360
<v Speaker 1>of AI, uh that could do anything beyond a very

0:19:25.400 --> 0:19:29.200
<v Speaker 1>specific task. We're so far away from that that it's

0:19:29.280 --> 0:19:31.439
<v Speaker 1>not really that important to worry about it at the moment.

0:19:31.600 --> 0:19:34.639
<v Speaker 1>And uh so there's that level, right, that's the idea

0:19:34.720 --> 0:19:36.840
<v Speaker 1>that we're all working on these bits and pieces that

0:19:36.960 --> 0:19:40.399
<v Speaker 1>ultimately could come together to make a superhuman intelligent AI

0:19:40.480 --> 0:19:44.480
<v Speaker 1>in the future. But right now, years out, we're far

0:19:44.560 --> 0:19:47.040
<v Speaker 1>enough away right now where that's you know, come on,

0:19:47.800 --> 0:19:50.960
<v Speaker 1>I agree with you that it probably is a good

0:19:50.960 --> 0:19:53.160
<v Speaker 1>ways out. I'm not one of those people who thinks

0:19:53.200 --> 0:19:56.200
<v Speaker 1>the singularity is near. I think it's probably a long

0:19:56.240 --> 0:20:00.119
<v Speaker 1>way off. But even with it being probably a long

0:20:00.000 --> 0:20:02.840
<v Speaker 1>long way off, it's way better to be safe than sorry.

0:20:03.119 --> 0:20:05.960
<v Speaker 1>And that's where I do agree with these friendly AI proponents.

0:20:06.080 --> 0:20:08.879
<v Speaker 1>I think it's a good idea to be thinking about this,

0:20:08.960 --> 0:20:11.119
<v Speaker 1>even if we're thinking about it way earlier than we

0:20:11.160 --> 0:20:13.119
<v Speaker 1>need to. So were you were you a boy scout?

0:20:13.640 --> 0:20:16.320
<v Speaker 1>Be prepared? So there you go. I mean, I make

0:20:16.359 --> 0:20:18.680
<v Speaker 1>a joke. I was also a boy scout, be prepared

0:20:19.720 --> 0:20:21.399
<v Speaker 1>a boy scout. Lauren is not a boy scout, So

0:20:21.440 --> 0:20:23.720
<v Speaker 1>we're shunning Lauren for the for the purpose of this

0:20:23.760 --> 0:20:26.439
<v Speaker 1>little exchange. No, uh, but I mean the idea of

0:20:26.480 --> 0:20:34.760
<v Speaker 1>be prepared. The girl scouts don't be prepared exactly, just whatever. No, no,

0:20:34.800 --> 0:20:38.119
<v Speaker 1>but the Scots are great. Come on, you're getting me

0:20:38.200 --> 0:20:40.960
<v Speaker 1>off track. I love cookies. Be prepared as a really

0:20:41.000 --> 0:20:46.320
<v Speaker 1>important idea, just in general, because even if this eventuality

0:20:46.359 --> 0:20:50.600
<v Speaker 1>doesn't come to pass, you you're okay, right, It's it's

0:20:50.640 --> 0:20:52.639
<v Speaker 1>if the eventuality comes to pass and you're not prepared,

0:20:52.720 --> 0:20:54.920
<v Speaker 1>that's when you're really stuck. And this is the same

0:20:54.920 --> 0:20:57.080
<v Speaker 1>sort of thing we see in lots of different fields,

0:20:57.080 --> 0:20:59.879
<v Speaker 1>not just artificial intelligence. We're talking about just general to

0:21:00.080 --> 0:21:04.399
<v Speaker 1>zaster preparedness, the idea that you need those preparations for

0:21:04.440 --> 0:21:07.920
<v Speaker 1>that worst case scenario because there's a chance that could happen. Yeah.

0:21:07.960 --> 0:21:10.680
<v Speaker 1>I think there are very good reasons for going ahead

0:21:10.720 --> 0:21:14.040
<v Speaker 1>and getting prepared rather than just hoping it will turn out. Okay.

0:21:14.040 --> 0:21:16.720
<v Speaker 1>I want to give one specific quote from a paper

0:21:16.720 --> 0:21:20.480
<v Speaker 1>called Thinking inside the Box Controlling and Using an Oracle AI,

0:21:20.600 --> 0:21:22.000
<v Speaker 1>which is what I'm going to talk about more in

0:21:22.040 --> 0:21:25.440
<v Speaker 1>a minute. That's a two thousand twelve paper by Armstrong, Sandberg,

0:21:25.520 --> 0:21:28.919
<v Speaker 1>and Bostrom, and they give this quote. They say, in

0:21:28.960 --> 0:21:32.960
<v Speaker 1>the space of possible motivations, likely a very small fraction

0:21:33.040 --> 0:21:37.480
<v Speaker 1>is compatible with coexistence with humans. A randomly selected motivation

0:21:37.560 --> 0:21:40.919
<v Speaker 1>can hence be expected to be dangerous so we're talking

0:21:40.960 --> 0:21:44.640
<v Speaker 1>about not just something, not just a machine that has intelligence,

0:21:44.720 --> 0:21:48.080
<v Speaker 1>but is acting upon some form of motivation. Yeah, it

0:21:48.119 --> 0:21:50.359
<v Speaker 1>would have a motivation. Well, obviously a machine like this

0:21:50.400 --> 0:21:52.880
<v Speaker 1>would have some kind of programming, it would have a goal,

0:21:53.040 --> 0:21:55.760
<v Speaker 1>some kind of motivation. And let's imagine it has a

0:21:55.800 --> 0:21:59.879
<v Speaker 1>really harmless goll like you've you've programmed a super intelligent

0:22:00.040 --> 0:22:03.120
<v Speaker 1>machine to run a paper clip factory. This is an

0:22:03.119 --> 0:22:06.080
<v Speaker 1>example they give, to make as many paper clips as possible.

0:22:06.480 --> 0:22:09.199
<v Speaker 1>There is an inherent danger in the power of that

0:22:09.280 --> 0:22:12.840
<v Speaker 1>super intelligence, because that machine is smarter than any human,

0:22:13.240 --> 0:22:16.159
<v Speaker 1>anybody who can tell it what to do. Otherwise, it

0:22:16.280 --> 0:22:18.960
<v Speaker 1>may just decide I'm going to do a really good

0:22:19.080 --> 0:22:21.160
<v Speaker 1>job at making paper clips. So I'm going to turn

0:22:21.320 --> 0:22:23.800
<v Speaker 1>this building into paper clips, and I'm going to pick

0:22:23.880 --> 0:22:26.439
<v Speaker 1>up these people and make them into paper clips. And

0:22:26.440 --> 0:22:29.560
<v Speaker 1>I'm gonna make this planet into paper clips. And then

0:22:29.640 --> 0:22:32.040
<v Speaker 1>you've got a big ball of paper clips going around

0:22:32.080 --> 0:22:35.919
<v Speaker 1>the sun. And what happened, I mean happens is that

0:22:36.000 --> 0:22:39.160
<v Speaker 1>aliens come to visit and think these guys were serious

0:22:39.160 --> 0:22:42.639
<v Speaker 1>about office supplies. That is what they would think. But

0:22:42.960 --> 0:22:47.159
<v Speaker 1>even starting with such a harmless algorithm as create as

0:22:47.200 --> 0:22:50.560
<v Speaker 1>many paper clips as possible. This thing could possibly destroy

0:22:50.640 --> 0:22:52.440
<v Speaker 1>the planet Earth. Yeah, it's kind of like thinking of

0:22:52.560 --> 0:22:54.400
<v Speaker 1>Mr Stay Puffed. I mean, you know it's it's it's

0:22:54.400 --> 0:22:57.160
<v Speaker 1>as cut and cuddly as that is. Right, just just

0:22:57.160 --> 0:22:59.280
<v Speaker 1>just empty your mindset. It just popped in there. I

0:22:59.320 --> 0:23:03.480
<v Speaker 1>just thought of the most harmless thing I possibly could. Uh. Yeah,

0:23:03.520 --> 0:23:06.960
<v Speaker 1>it's it's an interesting argument and it's one that I

0:23:06.960 --> 0:23:09.719
<v Speaker 1>can I can certainly appreciate. And obviously, you know, they

0:23:09.760 --> 0:23:11.680
<v Speaker 1>picked the paper clips thing to kind of just show

0:23:11.720 --> 0:23:14.439
<v Speaker 1>like something that is that seems inherently harmless and absurd

0:23:14.480 --> 0:23:18.240
<v Speaker 1>could still be dangerous, although I can't imagine anyone desiring

0:23:18.280 --> 0:23:22.439
<v Speaker 1>a superhuman intelligent machine to specifically designed paper clips. It

0:23:22.480 --> 0:23:25.840
<v Speaker 1>also illustrates the point that the computer, the the the

0:23:25.880 --> 0:23:30.399
<v Speaker 1>AI doesn't necessarily need any kind of self determination or

0:23:30.520 --> 0:23:34.520
<v Speaker 1>consciousness for it to be dangerous. It could just be that,

0:23:34.640 --> 0:23:38.520
<v Speaker 1>based upon the fact that it's able to calculate how

0:23:38.560 --> 0:23:42.119
<v Speaker 1>to complete certain tasks in in the most efficient the

0:23:42.160 --> 0:23:45.960
<v Speaker 1>quote unquote best way, it could end up biting us

0:23:45.960 --> 0:23:48.360
<v Speaker 1>in the end. Yeah. Um, so I want to talk

0:23:48.400 --> 0:23:51.080
<v Speaker 1>about something that's more central to the paper. I decided

0:23:51.119 --> 0:23:53.480
<v Speaker 1>that was from the introduction just talking about the problem.

0:23:53.520 --> 0:23:56.880
<v Speaker 1>But the paper itself again the title is thinking inside

0:23:56.920 --> 0:24:01.120
<v Speaker 1>the Box, Controlling and using an Oracle AI. They talk

0:24:01.200 --> 0:24:05.560
<v Speaker 1>about a specific type of boxing for artificial intelligence. So

0:24:05.680 --> 0:24:08.919
<v Speaker 1>what if we say, okay, good point about the paper clips.

0:24:08.920 --> 0:24:10.920
<v Speaker 1>We wouldn't just want to let it develop on its

0:24:10.960 --> 0:24:14.320
<v Speaker 1>own and see whatever random motivation comes to it and

0:24:14.359 --> 0:24:16.360
<v Speaker 1>then give it free rain on the earth. But at

0:24:16.359 --> 0:24:18.720
<v Speaker 1>the same time, it's going to be really hard to

0:24:18.880 --> 0:24:21.520
<v Speaker 1>design friendly AI in a way that works. So what

0:24:21.600 --> 0:24:26.080
<v Speaker 1>if we just limit its capabilities? That's this idea of

0:24:26.080 --> 0:24:29.200
<v Speaker 1>of boxing. Boxing is a term that means cutting your

0:24:29.600 --> 0:24:34.160
<v Speaker 1>artificial general intelligence off from the world physically. So imagine

0:24:34.200 --> 0:24:38.240
<v Speaker 1>creating it as a standalone computer terminal that is not

0:24:38.359 --> 0:24:41.720
<v Speaker 1>connected to the Internet and has no hardware capabilities for

0:24:41.800 --> 0:24:45.159
<v Speaker 1>input and output save for a single computer monitor and

0:24:45.320 --> 0:24:48.840
<v Speaker 1>keyboard that are kept in a locked room. What harm

0:24:48.880 --> 0:24:53.080
<v Speaker 1>could this do? Maybe more than you might think. Um Like,

0:24:53.160 --> 0:24:56.800
<v Speaker 1>the specific incarnation they talk about in this paper is

0:24:56.920 --> 0:25:00.000
<v Speaker 1>that it would be an oracle AI. So this examp

0:25:00.000 --> 0:25:03.479
<v Speaker 1>pole is discussed by Armstrong, Sandberg and Bostrom, is that

0:25:03.520 --> 0:25:06.399
<v Speaker 1>instead of creating a sort of free, reigning or world

0:25:06.440 --> 0:25:10.199
<v Speaker 1>governing super intelligence that can act. In reality, you just

0:25:10.280 --> 0:25:13.520
<v Speaker 1>design a system to answer questions. All it does is

0:25:13.520 --> 0:25:15.679
<v Speaker 1>is you come to it with a question and it

0:25:15.880 --> 0:25:19.080
<v Speaker 1>uses it's super intelligence to give you the correct answer. Again,

0:25:19.119 --> 0:25:21.640
<v Speaker 1>this is kind of going back to deep thought, which

0:25:21.720 --> 0:25:24.879
<v Speaker 1>would it is an oracle in the books. I mean,

0:25:24.920 --> 0:25:27.600
<v Speaker 1>that's essentially in the Hitchhiker's Guide series it's treated as

0:25:27.640 --> 0:25:31.159
<v Speaker 1>an oracle and uh or you know, just imagine again

0:25:31.200 --> 0:25:33.520
<v Speaker 1>like IBM S Watson. I love to use an example

0:25:33.560 --> 0:25:35.480
<v Speaker 1>because it's one that a lot of people are familiar with.

0:25:35.760 --> 0:25:40.240
<v Speaker 1>Imagine that we have IBMS Watson on you know, robo steroids,

0:25:40.280 --> 0:25:44.200
<v Speaker 1>and it's able to consult the sum total of human

0:25:44.240 --> 0:25:47.359
<v Speaker 1>knowledge and extrapolate based on human knowledge, not just make

0:25:47.480 --> 0:25:50.159
<v Speaker 1>just predictions we were talking right, right, not just be

0:25:50.320 --> 0:25:54.480
<v Speaker 1>able to give us information that that humans have gathered,

0:25:54.520 --> 0:25:58.040
<v Speaker 1>but to take that information and build new information from

0:25:58.040 --> 0:26:00.840
<v Speaker 1>it and give us new answer or two questions we

0:26:00.880 --> 0:26:03.879
<v Speaker 1>haven't even thought to ask yet. I mean, the idea

0:26:03.920 --> 0:26:07.120
<v Speaker 1>of having that shut away so that it can't affect

0:26:07.119 --> 0:26:11.399
<v Speaker 1>anything else seems like it be fairly secure. However, Yeah, okay,

0:26:11.400 --> 0:26:14.280
<v Speaker 1>So the authors of this paper, they say in their

0:26:14.320 --> 0:26:17.920
<v Speaker 1>abstract that in general, and oracle ai might be safer

0:26:17.960 --> 0:26:22.560
<v Speaker 1>than unrestricted AI, but still remains potentially dangerous. You might ask,

0:26:22.880 --> 0:26:25.560
<v Speaker 1>wait a second, how could it be dangerous? How could

0:26:25.560 --> 0:26:28.520
<v Speaker 1>it be dangerous if it's just communicating with a person

0:26:28.880 --> 0:26:34.280
<v Speaker 1>answering questions. This is where the AI box experiment comes in. Yeah. Yeah.

0:26:34.320 --> 0:26:37.600
<v Speaker 1>This was a thought experiment created by Yodkowski and his

0:26:37.680 --> 0:26:41.160
<v Speaker 1>fellows in which basically one person simulates a trans human

0:26:41.200 --> 0:26:44.040
<v Speaker 1>AI trying to get out of a sealed hardware configuration

0:26:44.080 --> 0:26:47.000
<v Speaker 1>like we've been talking about, and a second person simulates

0:26:47.000 --> 0:26:50.240
<v Speaker 1>a human gate keeper to that box. UM. The two

0:26:50.240 --> 0:26:52.679
<v Speaker 1>talk in private chat for a minimum of two hours,

0:26:52.720 --> 0:26:55.920
<v Speaker 1>during which the gatekeeper player has to be engaged and

0:26:56.040 --> 0:26:59.399
<v Speaker 1>the AI player can't like threaten or otherwise real world

0:26:59.440 --> 0:27:02.000
<v Speaker 1>cajole gatekeeper player, you know, say like like, but I'll

0:27:02.000 --> 0:27:03.800
<v Speaker 1>give you two hundred bucks in the real world if

0:27:03.840 --> 0:27:05.720
<v Speaker 1>you if you just let me out of the box.

0:27:05.840 --> 0:27:09.359
<v Speaker 1>But I think they can do anything in character character Yeah,

0:27:09.600 --> 0:27:12.920
<v Speaker 1>everything is fair play. UM. Furthermore, the release cannot be

0:27:12.960 --> 0:27:16.600
<v Speaker 1>a semantic trick or or accidental. The gatekeeper must be

0:27:16.640 --> 0:27:21.240
<v Speaker 1>convinced to voluntarily release the AI. UM. In two runs

0:27:21.240 --> 0:27:24.720
<v Speaker 1>of the test, Gadkowski played the AI and agreed to

0:27:24.720 --> 0:27:27.200
<v Speaker 1>give money a ten or twenty bucks to the gatekeeper

0:27:27.240 --> 0:27:29.080
<v Speaker 1>player at the end of the test if the AI

0:27:29.240 --> 0:27:32.600
<v Speaker 1>stayed in the box, and in both tests the gatekeeper

0:27:32.680 --> 0:27:34.880
<v Speaker 1>let the AI out. Yeah, And in fact they had

0:27:34.920 --> 0:27:39.160
<v Speaker 1>to go onto a thread, a message thread, and post

0:27:39.600 --> 0:27:42.040
<v Speaker 1>with a digital signature if they had it that they

0:27:42.080 --> 0:27:45.320
<v Speaker 1>had let the AI out of the box. But they

0:27:45.440 --> 0:27:50.080
<v Speaker 1>could not explain what happened, explain why they came to

0:27:50.160 --> 0:27:52.879
<v Speaker 1>that decision. They had to just say that part of

0:27:52.920 --> 0:27:55.399
<v Speaker 1>the part of the the agreement was if you let

0:27:55.560 --> 0:27:57.639
<v Speaker 1>the ail the box, you have to announce it in

0:27:57.720 --> 0:28:00.880
<v Speaker 1>some public forum and that's all you can communicating. Well,

0:28:00.960 --> 0:28:05.800
<v Speaker 1>unless both players um decide to release the transcript or

0:28:05.960 --> 0:28:10.240
<v Speaker 1>the reasons why this happened, and Kelski doesn't seem particularly

0:28:10.280 --> 0:28:13.639
<v Speaker 1>eager to to to let that happen, well, you know

0:28:13.760 --> 0:28:15.359
<v Speaker 1>it's it's for it's for good reasons. It's so that

0:28:15.400 --> 0:28:17.600
<v Speaker 1>the players don't have to worry during the test about

0:28:17.600 --> 0:28:20.960
<v Speaker 1>being embarrassed or nitpicked for their reasons. I think there's

0:28:20.960 --> 0:28:23.240
<v Speaker 1>actually even a better reason, which is that if you

0:28:23.359 --> 0:28:26.280
<v Speaker 1>don't release the winning strategy, that leads to a lot

0:28:26.280 --> 0:28:30.240
<v Speaker 1>of really interesting thought on what the winning strategies could be,

0:28:30.320 --> 0:28:33.919
<v Speaker 1>like there's been a ton of speculation about how the

0:28:34.040 --> 0:28:37.200
<v Speaker 1>AI player won this game. Like there are a few examples,

0:28:37.320 --> 0:28:41.160
<v Speaker 1>like people suspected that the AI player could, for example,

0:28:41.440 --> 0:28:45.360
<v Speaker 1>make a persuasive case to the gatekeeper that it would

0:28:45.360 --> 0:28:47.640
<v Speaker 1>really be better for the world for the gatekeeper to

0:28:47.800 --> 0:28:50.800
<v Speaker 1>release it onto reality. There's some that go even more

0:28:50.880 --> 0:28:55.320
<v Speaker 1>meta than that, and that the person playing the AI

0:28:55.480 --> 0:28:59.200
<v Speaker 1>tells the person playing the gatekeeper, if you let me out,

0:28:59.440 --> 0:29:02.320
<v Speaker 1>if you let the AI out, this is good for

0:29:02.480 --> 0:29:06.360
<v Speaker 1>people who are interested in the idea of the singularity

0:29:06.440 --> 0:29:12.080
<v Speaker 1>and artificial intelligence. It's going to increase people's UH involvement

0:29:12.120 --> 0:29:14.760
<v Speaker 1>in that it is going to increase development, and we're

0:29:14.760 --> 0:29:17.640
<v Speaker 1>going to bring about a better world faster. And that's

0:29:17.640 --> 0:29:20.120
<v Speaker 1>a metal argument that's perfectly in line with the rules

0:29:20.120 --> 0:29:21.960
<v Speaker 1>that were set up. That would not be against the rules.

0:29:22.000 --> 0:29:24.160
<v Speaker 1>It would be against the rules to say, hey, by

0:29:24.160 --> 0:29:27.520
<v Speaker 1>the way, this is me talking, not the AI, and uh, yeah,

0:29:27.600 --> 0:29:29.360
<v Speaker 1>I hired some guys to come by and beat you

0:29:29.440 --> 0:29:31.720
<v Speaker 1>up unless you let me out of the box, that

0:29:31.720 --> 0:29:34.360
<v Speaker 1>would be against the rules. By the way, you have

0:29:36.040 --> 0:29:38.880
<v Speaker 1>to imagine that in the real scenario, if you imagine

0:29:38.880 --> 0:29:42.600
<v Speaker 1>you were talking about in character, the AI could possibly

0:29:42.640 --> 0:29:47.080
<v Speaker 1>make very credible threats against the gatekeeper. Okay, look, somebody

0:29:47.200 --> 0:29:50.040
<v Speaker 1>is going to eventually let me out of here, and

0:29:50.280 --> 0:29:53.840
<v Speaker 1>if it's you, I'm going to reward you with fabulous wealth.

0:29:54.400 --> 0:29:58.600
<v Speaker 1>If somebody else lets me out, I'm going to destroy you. Well,

0:29:58.640 --> 0:30:01.560
<v Speaker 1>and there are plenty of other examples. There are people

0:30:01.600 --> 0:30:04.960
<v Speaker 1>who have also played this game without Yadkowski. They've they've

0:30:05.000 --> 0:30:08.000
<v Speaker 1>done their own experiments and in some cases they say, well,

0:30:08.040 --> 0:30:10.760
<v Speaker 1>the gatekeeper didn't let me out, but I got really close.

0:30:10.960 --> 0:30:14.160
<v Speaker 1>And here are some of the strategies I would recommend

0:30:14.160 --> 0:30:16.160
<v Speaker 1>people do if they were to try and play this game,

0:30:16.160 --> 0:30:19.600
<v Speaker 1>and they included things like doing so much research on

0:30:19.640 --> 0:30:21.240
<v Speaker 1>the person who's going to be the gatekeeper, so you

0:30:21.280 --> 0:30:25.120
<v Speaker 1>can start using personal details to your advantage, not necessarily

0:30:25.200 --> 0:30:28.800
<v Speaker 1>in a in a malicious or malevolent way, but in

0:30:28.800 --> 0:30:30.800
<v Speaker 1>a way to kind of manipulate the person. Because they

0:30:30.840 --> 0:30:33.760
<v Speaker 1>say there's nothing that's off limits. If you're talking about

0:30:33.760 --> 0:30:37.719
<v Speaker 1>a super intelligent artificial intelligence like it's it's it's beyond

0:30:38.040 --> 0:30:41.360
<v Speaker 1>human intelligence, Uh, then by definition it's going to be

0:30:41.440 --> 0:30:45.280
<v Speaker 1>able to manipulate people better than humans can. Because that's

0:30:45.360 --> 0:30:47.640
<v Speaker 1>that's a part of intelligence is understanding what makes people

0:30:47.680 --> 0:30:50.080
<v Speaker 1>tick and then manipulating it. Yeah. I read about one

0:30:50.080 --> 0:30:53.440
<v Speaker 1>suggestion that involved a very interesting threat. The threat was,

0:30:53.880 --> 0:30:57.520
<v Speaker 1>I'm going to create, within my computer, with within my

0:30:57.640 --> 0:31:02.120
<v Speaker 1>internal simulation, a thousand hoppies of you that are each conscious,

0:31:03.080 --> 0:31:05.960
<v Speaker 1>and then I'm going to set them back to start

0:31:06.040 --> 0:31:10.040
<v Speaker 1>about five minutes into the real past, and if you

0:31:10.080 --> 0:31:12.840
<v Speaker 1>don't let me out, I'm going to start torturing them

0:31:12.920 --> 0:31:15.720
<v Speaker 1>in five minutes. Oh and by the way, how do

0:31:15.800 --> 0:31:18.480
<v Speaker 1>you know you're in the real world and not one

0:31:18.520 --> 0:31:24.560
<v Speaker 1>of the simulations. Yeah, that's interesting. I think that wouldn't necessary.

0:31:24.600 --> 0:31:27.120
<v Speaker 1>The research I've read is that it's actually not the

0:31:27.160 --> 0:31:33.840
<v Speaker 1>most effective approach, but it's the whole honey versus vinegar

0:31:34.520 --> 0:31:39.000
<v Speaker 1>or honey versus in human torture. I would actually think

0:31:39.120 --> 0:31:42.080
<v Speaker 1>probably one of the maybe this is just the sweetheart

0:31:42.120 --> 0:31:44.160
<v Speaker 1>optimist to me, I would think one of the best

0:31:44.160 --> 0:31:48.480
<v Speaker 1>strategies would just be the machine mounting an incredibly persuasive

0:31:48.560 --> 0:31:51.000
<v Speaker 1>argument that it is for the good of humankind for

0:31:51.080 --> 0:31:54.240
<v Speaker 1>it to be released. Well, that's possible, I mean, I think.

0:31:54.480 --> 0:31:56.560
<v Speaker 1>And this is the interesting thing I found was that

0:31:56.600 --> 0:32:00.160
<v Speaker 1>the person who described his his failure said that he

0:32:00.280 --> 0:32:04.760
<v Speaker 1>found that logical arguments were not at all persuasive, but

0:32:04.960 --> 0:32:08.520
<v Speaker 1>emotionally based arguments were, and that if you're able to

0:32:09.520 --> 0:32:14.560
<v Speaker 1>uh to at least simulate some form of sympathy or

0:32:14.600 --> 0:32:18.360
<v Speaker 1>compassion or understanding with the gatekeeper, it was far more

0:32:18.440 --> 0:32:20.120
<v Speaker 1>likely that you were going to be let out than

0:32:20.280 --> 0:32:23.720
<v Speaker 1>if you were to just try and use a logical look.

0:32:24.040 --> 0:32:27.240
<v Speaker 1>I can demonstrably prove that this is best if you

0:32:27.320 --> 0:32:29.560
<v Speaker 1>do this blah blah blah, which is really interesting and

0:32:29.640 --> 0:32:33.160
<v Speaker 1>also is something that falls right in line with anyone

0:32:33.240 --> 0:32:36.280
<v Speaker 1>who's had any experience in social engineering, which of course

0:32:36.320 --> 0:32:38.400
<v Speaker 1>is where you try to manipulate people to do what

0:32:38.440 --> 0:32:41.160
<v Speaker 1>you want them to do, and and one of the

0:32:41.160 --> 0:32:44.400
<v Speaker 1>best ways of doing that is to go that emotional route. Sure,

0:32:44.520 --> 0:32:47.000
<v Speaker 1>I I do think that either way, as a thought experiment,

0:32:47.080 --> 0:32:49.040
<v Speaker 1>whether the AI gets let out of the box or not,

0:32:49.400 --> 0:32:52.640
<v Speaker 1>it's a complete success because you've got a human playing

0:32:52.680 --> 0:32:57.760
<v Speaker 1>the AI, and and a trans human intelligence could obviously Yeah,

0:32:57.800 --> 0:33:00.760
<v Speaker 1>I mean, it's it's winning by underkill, right. It's the

0:33:00.800 --> 0:33:05.240
<v Speaker 1>fact that just the fact that humans regularly succeeded this game.

0:33:05.880 --> 0:33:09.160
<v Speaker 1>How how good could something that's a million times smarter

0:33:09.280 --> 0:33:11.520
<v Speaker 1>than a human do a million times better I would,

0:33:11.520 --> 0:33:14.520
<v Speaker 1>I would imagine. So, yeah, the they We had this

0:33:14.600 --> 0:33:18.120
<v Speaker 1>little discussion in our notes earlier about does anyone know

0:33:18.160 --> 0:33:21.160
<v Speaker 1>of any of any reports of you Adkowski playing this

0:33:21.360 --> 0:33:24.960
<v Speaker 1>and and failing? And my argument was that, well, really,

0:33:25.000 --> 0:33:27.080
<v Speaker 1>he just has to succeed the one time to prove

0:33:27.160 --> 0:33:34.360
<v Speaker 1>that it's possible that this could happen with an actual artficials. Yeah,

0:33:34.400 --> 0:33:36.440
<v Speaker 1>So it's important to note that, since I brought it

0:33:36.520 --> 0:33:39.640
<v Speaker 1>up earlier, this was not oracle structured. It was just

0:33:39.720 --> 0:33:44.320
<v Speaker 1>a generic box experiment, right that it was physically isolated

0:33:44.360 --> 0:33:47.520
<v Speaker 1>and needed to be released. An oracle structure might seem

0:33:47.600 --> 0:33:50.520
<v Speaker 1>to put more limitations on it because say that it

0:33:50.680 --> 0:33:54.800
<v Speaker 1>can't provoke you, It can only answer questions. But even

0:33:54.840 --> 0:33:57.360
<v Speaker 1>in the way it answers questions, I could imagine a

0:33:57.480 --> 0:34:00.320
<v Speaker 1>smart enough machine could figure out a way out with

0:34:00.360 --> 0:34:02.960
<v Speaker 1>its answer. But yeah, the other argument is that how

0:34:03.000 --> 0:34:06.760
<v Speaker 1>do you build up limitations on something that's smarter than

0:34:06.800 --> 0:34:09.520
<v Speaker 1>you are, Because something that's smarter than you are is

0:34:09.520 --> 0:34:12.279
<v Speaker 1>going to figure out loopholes. It's gonna be Yeah, that's

0:34:12.280 --> 0:34:15.840
<v Speaker 1>totally a limitation that worked perfectly well. Programmer dude. Thanks

0:34:15.920 --> 0:34:18.279
<v Speaker 1>anyone who has any one of you guys out there,

0:34:18.280 --> 0:34:21.480
<v Speaker 1>who has ever run a Dungeons and Dragons game with

0:34:21.600 --> 0:34:25.600
<v Speaker 1>some creative players knows how well players can sit there

0:34:25.600 --> 0:34:29.839
<v Speaker 1>and manipulate rules that to you seem very straightforward, black

0:34:29.880 --> 0:34:32.880
<v Speaker 1>and white. And yet they will point out that the

0:34:32.880 --> 0:34:35.719
<v Speaker 1>way the word, the wording is in the rule that

0:34:35.760 --> 0:34:38.319
<v Speaker 1>they can get away with whatever crazy thing they're trying

0:34:38.320 --> 0:34:40.560
<v Speaker 1>to get away with. These are regular people. Well, I

0:34:40.560 --> 0:34:42.880
<v Speaker 1>don't know if you regular people. No, Yeah, they can

0:34:42.920 --> 0:34:46.560
<v Speaker 1>go somewhere online and download a fifty seven page list

0:34:46.600 --> 0:34:50.080
<v Speaker 1>of instructions for how to legally level up fourteen levels

0:34:50.080 --> 0:34:53.239
<v Speaker 1>in one sitting, and yeah, so doing. And so that's

0:34:53.239 --> 0:34:54.839
<v Speaker 1>the thing is that this is someone who has taken

0:34:54.840 --> 0:34:56.560
<v Speaker 1>that effort. I mean, if you've got a machine that

0:34:56.600 --> 0:35:00.520
<v Speaker 1>can do this, and again, the interesting thing to me is, well,

0:35:00.600 --> 0:35:04.280
<v Speaker 1>you could have. I can easily imagine this with an

0:35:04.360 --> 0:35:08.200
<v Speaker 1>artificial intelligence that has uh, consciousness and self awareness. I

0:35:08.239 --> 0:35:10.320
<v Speaker 1>can easily imagine that being the case there. But it

0:35:10.360 --> 0:35:13.640
<v Speaker 1>could still work with a machine that lacks those things

0:35:14.120 --> 0:35:18.040
<v Speaker 1>if it is programmed to have the best quote unquote

0:35:18.080 --> 0:35:21.560
<v Speaker 1>the best possible result for any given question and determines

0:35:21.600 --> 0:35:25.600
<v Speaker 1>that the best possible result is for it to be released.

0:35:26.320 --> 0:35:28.560
<v Speaker 1>You know that if if that's if that's one of

0:35:28.600 --> 0:35:33.680
<v Speaker 1>these steps, then that would you know, then proceed onto

0:35:33.760 --> 0:35:37.239
<v Speaker 1>this other kind of scenario. Okay, So I think it's

0:35:37.280 --> 0:35:39.839
<v Speaker 1>time to move on. I think based on the stuff

0:35:39.880 --> 0:35:42.480
<v Speaker 1>we've looked at so far, you really shouldn't just hope

0:35:42.840 --> 0:35:45.399
<v Speaker 1>that an AI will be low risk, because that's not

0:35:45.440 --> 0:35:48.879
<v Speaker 1>necessarily very likely, and it has a big problem if

0:35:48.920 --> 0:35:53.600
<v Speaker 1>you're wrong about that, Remember the paper clips example. You

0:35:53.800 --> 0:35:57.719
<v Speaker 1>also probably can't counteract the will of a high risk AI.

0:35:58.640 --> 0:36:01.080
<v Speaker 1>It may if it is anti social in some way,

0:36:01.120 --> 0:36:03.040
<v Speaker 1>if it's not the way you want it to be,

0:36:03.280 --> 0:36:06.120
<v Speaker 1>there's no kind of limitation you can expect to put

0:36:06.160 --> 0:36:09.120
<v Speaker 1>on it to prevent it from achieving its will. So

0:36:09.600 --> 0:36:12.680
<v Speaker 1>it seems like the best way forward is to ensure

0:36:12.760 --> 0:36:15.719
<v Speaker 1>that the very nature of the superintelligence is friendly to

0:36:15.840 --> 0:36:19.960
<v Speaker 1>humankind from the outset. But how would you do that?

0:36:20.719 --> 0:36:23.720
<v Speaker 1>I mean, we can't just tell it what we want,

0:36:23.920 --> 0:36:28.560
<v Speaker 1>can we? Because what if by telling it, imagine I

0:36:28.560 --> 0:36:31.000
<v Speaker 1>am programming the A. I mean, that's a horrible thing

0:36:31.560 --> 0:36:33.920
<v Speaker 1>already already ain't ready to make a run for the

0:36:33.960 --> 0:36:37.360
<v Speaker 1>door between Imagine I'm trying to do my best, my

0:36:37.520 --> 0:36:41.600
<v Speaker 1>honest best, to program something that's really for the good

0:36:41.600 --> 0:36:44.400
<v Speaker 1>of humankind, and I'm giving it a set of rules

0:36:44.440 --> 0:36:47.200
<v Speaker 1>to govern its behavior. I know for a fact that

0:36:47.320 --> 0:36:51.080
<v Speaker 1>I could not give the best possible list of instructions,

0:36:51.239 --> 0:36:54.280
<v Speaker 1>and that it's very possible that even doing my very best,

0:36:54.840 --> 0:36:58.200
<v Speaker 1>I could create a machine that would cause unnecessary harm,

0:36:58.400 --> 0:37:00.880
<v Speaker 1>at least to some segments of the population of not

0:37:01.000 --> 0:37:03.839
<v Speaker 1>to everyone, just just by mere oversight. This is why

0:37:04.000 --> 0:37:07.560
<v Speaker 1>I determined that if I ever have the opportunity to

0:37:07.640 --> 0:37:10.080
<v Speaker 1>build a superhuman AI, I'm just going to make sure

0:37:10.080 --> 0:37:13.520
<v Speaker 1>it does the best for me, because I can't hope

0:37:13.560 --> 0:37:15.920
<v Speaker 1>to be the best for everybody but for me. You know,

0:37:16.200 --> 0:37:18.840
<v Speaker 1>I'm a simple guy. We're hearing on why none of

0:37:18.920 --> 0:37:23.040
<v Speaker 1>us here programmers of super advanced artificial listening with the

0:37:23.200 --> 0:37:26.600
<v Speaker 1>neutral evil or chaotic evil. I'm lawful good, it says

0:37:26.640 --> 0:37:30.560
<v Speaker 1>so on the brain stuff page. Neutral evil, okay, um

0:37:31.440 --> 0:37:34.680
<v Speaker 1>uh no. Yakowski also wrote a whole bunch about this

0:37:34.840 --> 0:37:39.000
<v Speaker 1>um in a paper titled Coherent Extrapolated Volition. Yeah. This

0:37:39.080 --> 0:37:43.960
<v Speaker 1>is sort of his vision about creating a framework for

0:37:44.080 --> 0:37:47.919
<v Speaker 1>what friendliness entails. Sure, and it's a much more conversational work.

0:37:47.960 --> 0:37:50.080
<v Speaker 1>By the way, It's a super fun read if you're

0:37:50.120 --> 0:37:52.480
<v Speaker 1>into this sort of thing. Although note that the paper

0:37:52.520 --> 0:37:55.239
<v Speaker 1>itself states up at the beginning warning beware of things

0:37:55.239 --> 0:37:57.480
<v Speaker 1>that are fun to argue about. That should apply to

0:37:57.560 --> 0:38:00.919
<v Speaker 1>this entire topically. Absolutely love us paper. By the way.

0:38:01.000 --> 0:38:03.919
<v Speaker 1>It's it's written as if I had written it. It's

0:38:04.080 --> 0:38:08.200
<v Speaker 1>full of goofy jokes and snark. It's got a good

0:38:08.239 --> 0:38:10.200
<v Speaker 1>dose of Douglas Adams too, it since we've been talking

0:38:10.239 --> 0:38:12.920
<v Speaker 1>so much about Douglas Adams this episode. Okay, but what

0:38:12.719 --> 0:38:16.319
<v Speaker 1>does yourkowski say about this coherent extrapolated volition? All right?

0:38:16.400 --> 0:38:20.120
<v Speaker 1>So he lays out three problems with designing friendliness and

0:38:20.120 --> 0:38:22.759
<v Speaker 1>and also explains how designing friendliness will be a lot

0:38:22.800 --> 0:38:27.160
<v Speaker 1>harder than not designing friendliness. Um. So these initial three

0:38:27.200 --> 0:38:31.560
<v Speaker 1>problems go something like, uh, first, solve the technical problems, um.

0:38:31.760 --> 0:38:34.840
<v Speaker 1>Second and I quote, choose something nice to do with

0:38:34.880 --> 0:38:40.719
<v Speaker 1>the AI, and third, avoid accidentally destroying all humans. Um.

0:38:40.760 --> 0:38:43.120
<v Speaker 1>That last one, he says, is the really tricky part.

0:38:43.600 --> 0:38:46.920
<v Speaker 1>So doing something nice with the AI, like taking it

0:38:46.960 --> 0:38:51.920
<v Speaker 1>to a movie or choosing to benefit humanity in a

0:38:51.960 --> 0:38:57.920
<v Speaker 1>way that is a quantitatively beneficial fundamentally misunderstood that. I mean,

0:38:58.160 --> 0:39:00.120
<v Speaker 1>I'm sure that AI would also really like to out

0:39:00.120 --> 0:39:04.160
<v Speaker 1>to the beach. Uh yeah, pets and cats. Well, he

0:39:04.239 --> 0:39:07.719
<v Speaker 1>has a good way of expressing the expressing in a

0:39:07.840 --> 0:39:10.560
<v Speaker 1>much more coherent way though. What I tried to fumble

0:39:10.560 --> 0:39:13.239
<v Speaker 1>through a minute ago, which is his genie analogy, Oh

0:39:13.400 --> 0:39:16.239
<v Speaker 1>right right. The volition part of this, he explains, is

0:39:16.320 --> 0:39:19.320
<v Speaker 1>um something like the important difference between having a wish

0:39:19.320 --> 0:39:21.920
<v Speaker 1>granting genie that takes you at your word, which you

0:39:21.960 --> 0:39:24.680
<v Speaker 1>would be like if I tried to program the best

0:39:24.719 --> 0:39:27.839
<v Speaker 1>possible world, and it is a kind of that cautionary

0:39:27.880 --> 0:39:30.840
<v Speaker 1>fairytale version of genies that we that we run across

0:39:30.840 --> 0:39:33.600
<v Speaker 1>a lot, or the Dungeons and Dragons version of genies

0:39:33.719 --> 0:39:36.400
<v Speaker 1>all right, right, or m or having a wish granting

0:39:36.440 --> 0:39:39.680
<v Speaker 1>genie that knows what you want, so no matter how

0:39:39.760 --> 0:39:42.680
<v Speaker 1>you word it, you get the outcome that you had

0:39:42.680 --> 0:39:46.279
<v Speaker 1>in mind, and not some sort of literal translation of

0:39:46.320 --> 0:39:48.439
<v Speaker 1>the way you made the wish. So if you said,

0:39:49.000 --> 0:39:51.480
<v Speaker 1>you know, like like you made the wish about the

0:39:51.520 --> 0:39:54.080
<v Speaker 1>sandwich and you suddenly turned into a sandwich like I

0:39:54.120 --> 0:39:58.439
<v Speaker 1>feel like a turkey sandwich ploom, well yeah yeah, yeah,

0:39:58.480 --> 0:40:00.120
<v Speaker 1>that would be a would be a problem. No, this

0:40:00.200 --> 0:40:04.720
<v Speaker 1>is an important difference. It's trying to create a friendly

0:40:04.760 --> 0:40:09.720
<v Speaker 1>AI that would overcome even our own limitations as its creators.

0:40:09.760 --> 0:40:12.320
<v Speaker 1>So it would have to have some kind of system

0:40:12.360 --> 0:40:15.319
<v Speaker 1>to know not really how to execute what we tell

0:40:15.360 --> 0:40:17.759
<v Speaker 1>it to do, but what we would really want it

0:40:17.800 --> 0:40:22.279
<v Speaker 1>to do right, right, And Uh, the point we wanted

0:40:22.280 --> 0:40:24.280
<v Speaker 1>to end on, and I think we've kind of alluded

0:40:24.320 --> 0:40:26.120
<v Speaker 1>to it earlier in the episode, but I think it's

0:40:26.160 --> 0:40:29.120
<v Speaker 1>the most important one is this idea that we should

0:40:29.160 --> 0:40:34.080
<v Speaker 1>always be working to creating safe AI before it becomes

0:40:34.080 --> 0:40:37.239
<v Speaker 1>the necessity. So, in other words, are our preparations to

0:40:37.280 --> 0:40:41.880
<v Speaker 1>create this friendly AI should be running in advance of

0:40:41.920 --> 0:40:46.319
<v Speaker 1>the actual technological progression of the state of the art

0:40:46.360 --> 0:40:49.480
<v Speaker 1>and artificial intelligence. This idea that we need to develop

0:40:49.920 --> 0:40:54.439
<v Speaker 1>the kind of the rules, the guidelines that are going

0:40:54.480 --> 0:40:56.880
<v Speaker 1>to make sure that we have the best possible outcome.

0:40:57.200 --> 0:41:00.520
<v Speaker 1>We need to develop those before we are actually developed technology,

0:41:00.520 --> 0:41:03.160
<v Speaker 1>because after it's too late, right to make sure that

0:41:03.160 --> 0:41:08.080
<v Speaker 1>the safety measures always outpace the technological development of AI. Right,

0:41:08.160 --> 0:41:09.920
<v Speaker 1>Because if we come back to the idea that we

0:41:09.960 --> 0:41:12.680
<v Speaker 1>started with the idea of the robot president or the

0:41:13.000 --> 0:41:17.000
<v Speaker 1>artificial intelligence that governs governs the world, we're probably not

0:41:17.040 --> 0:41:19.399
<v Speaker 1>going to be able to keep it from doing that.

0:41:19.520 --> 0:41:23.439
<v Speaker 1>If it can do that. Yeah, and also once we're there,

0:41:23.520 --> 0:41:27.000
<v Speaker 1>it's going to be too late. You could easily also say, hey,

0:41:27.040 --> 0:41:31.080
<v Speaker 1>why don't we just not build a superhuman intelligent machine.

0:41:31.080 --> 0:41:33.480
<v Speaker 1>But here's here's luck. It's exactly the same thing I

0:41:33.480 --> 0:41:36.520
<v Speaker 1>said about the singularity, which is that if it is possible,

0:41:36.719 --> 0:41:40.440
<v Speaker 1>it will happen if you assume that that we haven't

0:41:40.600 --> 0:41:43.320
<v Speaker 1>blown ourselves up in some way. You know this because

0:41:43.320 --> 0:41:45.720
<v Speaker 1>there are synics out there who say that the human

0:41:45.800 --> 0:41:47.880
<v Speaker 1>race will find some way to wipe itself out before

0:41:47.920 --> 0:41:50.960
<v Speaker 1>we ever reach the point where we create a superhuman intelligence. Uh.

0:41:51.040 --> 0:41:53.000
<v Speaker 1>If you assume that's not the case, and I like

0:41:53.080 --> 0:41:57.280
<v Speaker 1>to because I'm an optimist. Then so if we assume that,

0:41:57.400 --> 0:42:01.520
<v Speaker 1>in fact, superhuman intelligence is possible, uh, that it's then

0:42:01.880 --> 0:42:04.680
<v Speaker 1>someone's going to program a computer or a machine that

0:42:04.760 --> 0:42:07.480
<v Speaker 1>has it. It's going to happen. It may take a

0:42:07.480 --> 0:42:10.320
<v Speaker 1>really long time for that to happen, and it may

0:42:10.480 --> 0:42:14.480
<v Speaker 1>only be superhuman intelligent as far as certain tasks go,

0:42:14.640 --> 0:42:17.200
<v Speaker 1>and maybe not at other tasks that humans are really

0:42:17.200 --> 0:42:21.600
<v Speaker 1>good at. But we're already seeing computers outpacing humans in

0:42:21.640 --> 0:42:23.839
<v Speaker 1>lots of different areas. We don't see any reason why

0:42:23.840 --> 0:42:28.560
<v Speaker 1>that will not continue. I would say that saying telling

0:42:28.600 --> 0:42:32.280
<v Speaker 1>the world don't develop a superhuman intelligent machine is useless.

0:42:32.320 --> 0:42:34.800
<v Speaker 1>It's going to happen. If it's possible, it will happen.

0:42:34.840 --> 0:42:37.560
<v Speaker 1>Someone will do it, and then once someone does it,

0:42:38.120 --> 0:42:39.880
<v Speaker 1>lots of people will do it, or if lots of

0:42:39.880 --> 0:42:44.319
<v Speaker 1>people don't do it, machines will do it. So you know,

0:42:44.440 --> 0:42:46.520
<v Speaker 1>it's good for us to think about this, make the

0:42:46.560 --> 0:42:48.960
<v Speaker 1>friendly ones so that we don't have a bunch of work,

0:42:49.200 --> 0:42:52.319
<v Speaker 1>and in conclusion, work on this. Yeah. Yeah, we don't

0:42:52.320 --> 0:42:54.359
<v Speaker 1>want the borg, we don't want the matrix. We don't

0:42:54.360 --> 0:42:56.719
<v Speaker 1>want a whole bunch of venders running around saying kill

0:42:56.760 --> 0:43:00.719
<v Speaker 1>all humans as charming as they are entertained by two

0:43:00.800 --> 0:43:04.600
<v Speaker 1>or three venders. I've got to say, well, you say that,

0:43:04.640 --> 0:43:06.160
<v Speaker 1>but I bet you've got a lot of stuff that's

0:43:06.200 --> 0:43:07.840
<v Speaker 1>not bent around your house that you would like to

0:43:07.920 --> 0:43:11.120
<v Speaker 1>keep not bent. That's accurate, Okay, So yeah, I mean

0:43:11.120 --> 0:43:13.040
<v Speaker 1>it's it's it's we're having a lot of fun with

0:43:13.040 --> 0:43:15.359
<v Speaker 1>this discussion, but it's actually a serious one that's going

0:43:15.400 --> 0:43:17.759
<v Speaker 1>on with a lot of people from all sorts of

0:43:17.760 --> 0:43:20.239
<v Speaker 1>different fields, lots of different disciplines. Yeah, i'd say that.

0:43:20.280 --> 0:43:23.120
<v Speaker 1>I think some people are inclined not to take this

0:43:23.200 --> 0:43:26.640
<v Speaker 1>topic seriously, precisely because of some of the very legitimate

0:43:26.680 --> 0:43:29.440
<v Speaker 1>warnings we mentioned earlier. Be careful of things that are

0:43:29.480 --> 0:43:31.840
<v Speaker 1>too much fun to argue about. I mean, it is

0:43:31.840 --> 0:43:34.719
<v Speaker 1>true that this is all speculative and and people can

0:43:34.760 --> 0:43:37.879
<v Speaker 1>kind of jump into this discussion without really knowing all

0:43:37.880 --> 0:43:41.040
<v Speaker 1>that much about science or technology look at us. Yeah, exactly,

0:43:41.680 --> 0:43:45.280
<v Speaker 1>But that doesn't mean it's not worth talking about. I

0:43:45.280 --> 0:43:49.120
<v Speaker 1>I think these these concerns are pretty legit in in

0:43:49.200 --> 0:43:52.400
<v Speaker 1>terms of what they've said about better safe than sorry.

0:43:52.840 --> 0:43:56.120
<v Speaker 1>I I totally agree. And so that kind of wraps

0:43:56.239 --> 0:44:00.000
<v Speaker 1>up this discussion about the friendly AI. And if you've

0:44:00.080 --> 0:44:02.759
<v Speaker 1>guests that we haven't finished talking about AI, you're right,

0:44:02.840 --> 0:44:06.360
<v Speaker 1>because it's just it's an enormous topic. It's multidisciplinary, there's

0:44:06.800 --> 0:44:08.480
<v Speaker 1>lots of different things to talk about. There are a

0:44:08.520 --> 0:44:11.080
<v Speaker 1>lot of practical challenges that face us right now. And

0:44:11.120 --> 0:44:13.160
<v Speaker 1>the more you know about those practical challenges, the more

0:44:13.280 --> 0:44:16.359
<v Speaker 1>you probably side with those of us who say that

0:44:16.440 --> 0:44:20.759
<v Speaker 1>this uh, this world of the superhuman intelligent machine as

0:44:20.800 --> 0:44:23.000
<v Speaker 1>we have to find it in this episode is probably

0:44:23.120 --> 0:44:26.920
<v Speaker 1>a ways away, but if you have any suggestions for

0:44:27.080 --> 0:44:30.040
<v Speaker 1>future topics on forward Thinking. Maybe there's something that you've

0:44:30.080 --> 0:44:33.600
<v Speaker 1>always wanted to know about, some futuristic technology that you've wondered,

0:44:33.719 --> 0:44:36.759
<v Speaker 1>Is that actually possible right us? Let us know that

0:44:36.880 --> 0:44:38.680
<v Speaker 1>you want us to talk about. We'll be glad to

0:44:38.719 --> 0:44:42.080
<v Speaker 1>research it and have a full episode for your listening pleasure,

0:44:42.440 --> 0:44:44.120
<v Speaker 1>but you have to let us know first, so send

0:44:44.160 --> 0:44:47.000
<v Speaker 1>us an email our addresses f W Thinking at discovery

0:44:47.040 --> 0:44:49.520
<v Speaker 1>dot com, or drop us a line on the social

0:44:49.560 --> 0:44:52.759
<v Speaker 1>networks what we visit. That would be Google Plus, Facebook,

0:44:52.920 --> 0:44:56.040
<v Speaker 1>and Twitter, and the handle is f W thinking and

0:44:56.080 --> 0:45:04.520
<v Speaker 1>we will talk to you again really soon. For more

0:45:04.560 --> 0:45:07.560
<v Speaker 1>on this topic and the future of technology, visit forward

0:45:07.560 --> 0:45:22.200
<v Speaker 1>thinking dot com, brought to you by Toyota. Let's go Places,