WEBVTT - Rebroadcast of Ep7 "Is AI truly intelligent? How would we know if it got there?"

0:00:00.080 --> 0:00:02.120
<v Speaker 1>Hey, this is David Eagleman and this past week was

0:00:02.120 --> 0:00:04.200
<v Speaker 1>my birthday, so I took a week off. So I'm

0:00:04.240 --> 0:00:06.800
<v Speaker 1>going to run an episode that I did earlier, episode

0:00:06.880 --> 0:00:11.680
<v Speaker 1>number seven. This is called is AI actually intelligent? And

0:00:11.760 --> 0:00:14.400
<v Speaker 1>how would we know if it gets there? This episode

0:00:14.440 --> 0:00:16.520
<v Speaker 1>is from one year ago, but as time goes on

0:00:16.600 --> 0:00:21.000
<v Speaker 1>this becomes more and more relevant, So please enjoy and

0:00:21.040 --> 0:00:23.680
<v Speaker 1>I will see you next week with a new episode.

0:00:28.440 --> 0:00:33.160
<v Speaker 1>Modern AI is blowing everybody's mind. But is it intelligent

0:00:33.760 --> 0:00:36.480
<v Speaker 1>in the same way as the human brain? And could

0:00:36.560 --> 0:00:41.400
<v Speaker 1>AI reach sentience? And how would we know when it

0:00:41.440 --> 0:00:47.320
<v Speaker 1>gets there? Welcome to Inner Cosmos with me, David Eagleman.

0:00:48.120 --> 0:00:52.040
<v Speaker 1>I'm a neuroscientist and an author at Stanford University, and

0:00:52.080 --> 0:00:56.640
<v Speaker 1>I've spent my whole career studying the intersection between how

0:00:56.640 --> 0:01:04.520
<v Speaker 1>the brain works and how we experience life. Like most

0:01:04.640 --> 0:01:10.800
<v Speaker 1>brain researchers, I've been obsessed with questions of intelligence and consciousness.

0:01:11.480 --> 0:01:15.720
<v Speaker 1>How do these arise from collections of billions of cells

0:01:15.720 --> 0:01:20.680
<v Speaker 1>in our brains? And could intelligence and consciousness arise in

0:01:20.800 --> 0:01:25.560
<v Speaker 1>artificial brains? Say on chat GPT. Those are the questions

0:01:25.560 --> 0:01:28.800
<v Speaker 1>that we're going to attack today. Early efforts to figure

0:01:28.800 --> 0:01:31.920
<v Speaker 1>out the brain, looked at all the billions of cells

0:01:32.000 --> 0:01:35.640
<v Speaker 1>and the trillions of connections, and said, look, what if

0:01:35.640 --> 0:01:39.440
<v Speaker 1>we just think of each cell as a unit, and

0:01:39.640 --> 0:01:43.440
<v Speaker 1>each unit is connected to other units and where they connect,

0:01:43.920 --> 0:01:46.440
<v Speaker 1>which is called the sinnapps, or one cell gives a

0:01:46.440 --> 0:01:48.760
<v Speaker 1>little signal to the next cell. What if we just

0:01:48.880 --> 0:01:52.960
<v Speaker 1>looked at that like a simple connection that has a

0:01:53.000 --> 0:01:56.920
<v Speaker 1>strength between zero and one, or zero means there's no connection,

0:01:57.400 --> 0:02:00.640
<v Speaker 1>and one means it's the strongest possible connection. So this

0:02:00.840 --> 0:02:06.280
<v Speaker 1>was a massive oversimplification of the very complicated biology, but

0:02:06.600 --> 0:02:10.920
<v Speaker 1>it allowed people to start thinking about networks and writing

0:02:10.960 --> 0:02:15.080
<v Speaker 1>down different ways that you could put artificial neural networks together.

0:02:15.480 --> 0:02:17.720
<v Speaker 1>And for more than fifty years now people have been

0:02:17.760 --> 0:02:22.440
<v Speaker 1>doing research to show how artificial neural networks can do

0:02:22.560 --> 0:02:26.280
<v Speaker 1>really cool things. It's a totally new kind of way

0:02:26.280 --> 0:02:29.240
<v Speaker 1>of doing computation. So you've got these units, and you've

0:02:29.240 --> 0:02:32.440
<v Speaker 1>got these connections between them, and you've change the strength

0:02:32.480 --> 0:02:36.560
<v Speaker 1>of the connections and information flows through the network in

0:02:36.600 --> 0:02:41.000
<v Speaker 1>different ways. Now, my colleagues and I have long pointed

0:02:41.000 --> 0:02:44.799
<v Speaker 1>out the ways in which biological brands are different and

0:02:44.840 --> 0:02:49.560
<v Speaker 1>how artificial neural networks just push around numbers and play

0:02:49.600 --> 0:02:55.520
<v Speaker 1>statistical tricks. But we're entering a revolution right now. Large

0:02:55.600 --> 0:03:00.960
<v Speaker 1>language models like GPT four or BARD consume trillions of

0:03:01.000 --> 0:03:05.600
<v Speaker 1>words on the Internet and they figure out probabilistically which

0:03:05.680 --> 0:03:08.919
<v Speaker 1>word is going to come next given the massive context

0:03:08.919 --> 0:03:12.720
<v Speaker 1>of all the words that have come before. So these networks,

0:03:12.840 --> 0:03:16.040
<v Speaker 1>as I talked about on the previous episode, are showing

0:03:16.160 --> 0:03:23.080
<v Speaker 1>incredible successes in everything from writing to art, to coding

0:03:23.600 --> 0:03:27.920
<v Speaker 1>to generating three dimensional worlds. They're changing everything, and they're

0:03:27.960 --> 0:03:31.679
<v Speaker 1>doing so at a pace that we've never seen before,

0:03:31.840 --> 0:03:35.640
<v Speaker 1>and in fact, the entire history of humankind has never

0:03:35.680 --> 0:03:39.800
<v Speaker 1>seen before. And there are all the societal questions that

0:03:39.800 --> 0:03:43.560
<v Speaker 1>everyone's starting to wrestle with right now, like the massive

0:03:44.120 --> 0:03:49.800
<v Speaker 1>potential for displacement of human jobs. But today I want

0:03:49.840 --> 0:03:52.760
<v Speaker 1>to zoom in on a question that has captured the

0:03:52.800 --> 0:03:58.600
<v Speaker 1>imagination of scientists and philosophers and the general public. Could

0:03:58.720 --> 0:04:05.320
<v Speaker 1>aim alive in some way, like become conscious or sentient. Now,

0:04:05.400 --> 0:04:08.080
<v Speaker 1>there are lots of ways to think about this. We

0:04:08.160 --> 0:04:13.960
<v Speaker 1>can ask whether AI can possess meaningful intelligence, or we

0:04:14.000 --> 0:04:17.560
<v Speaker 1>can ask if it is sentient, which means the ability

0:04:17.600 --> 0:04:22.039
<v Speaker 1>to feel or perceive things, particularly in terms of sensations

0:04:22.080 --> 0:04:24.840
<v Speaker 1>like pleasure and pain and emotions. Or we can ask

0:04:25.120 --> 0:04:29.200
<v Speaker 1>whether it is conscious, which involves being aware of one's

0:04:29.240 --> 0:04:33.080
<v Speaker 1>self and one's surrounding. Now, there are specific and important

0:04:33.200 --> 0:04:37.159
<v Speaker 1>differences between these questions, but really I don't care for

0:04:37.279 --> 0:04:41.280
<v Speaker 1>the present conversation. The question we're asking here is is

0:04:41.440 --> 0:04:45.680
<v Speaker 1>chat GPT just zeros and ones moving around through transistors

0:04:46.320 --> 0:04:50.359
<v Speaker 1>like a giant garage door opener. Or is it thinking?

0:04:50.440 --> 0:04:54.000
<v Speaker 1>Is it having some sort of experience? Is it having

0:04:54.040 --> 0:04:57.920
<v Speaker 1>a private inner life like the type that we humans have.

0:04:58.560 --> 0:05:02.720
<v Speaker 1>As we think about the possible of sentient AI, we

0:05:02.760 --> 0:05:07.240
<v Speaker 1>immediately find ourselves facing really deep ethical questions, the main

0:05:07.279 --> 0:05:11.600
<v Speaker 1>one being if we were to create a machine with consciousness,

0:05:11.920 --> 0:05:15.760
<v Speaker 1>what responsibility do we have to treat it as a

0:05:15.839 --> 0:05:18.920
<v Speaker 1>living being? Would you be able to turn it off

0:05:18.920 --> 0:05:21.080
<v Speaker 1>when you're done with it at night or would that

0:05:21.120 --> 0:05:23.919
<v Speaker 1>be murder? And what if you turn it off and

0:05:23.960 --> 0:05:26.479
<v Speaker 1>then you turn it back on. Would that be like

0:05:26.560 --> 0:05:28.760
<v Speaker 1>the way that we go into a sleep state at

0:05:28.880 --> 0:05:32.000
<v Speaker 1>night where we're totally gone, and then we find ourselves

0:05:32.520 --> 0:05:34.440
<v Speaker 1>back online in the morning and we think, yeah, I'm

0:05:34.480 --> 0:05:38.320
<v Speaker 1>the same person, but I guess eight hours just disappeared. Anyway,

0:05:38.360 --> 0:05:42.120
<v Speaker 1>more generally, would we feel obligated to treat it the

0:05:42.120 --> 0:05:47.520
<v Speaker 1>way we treat a sentient fellow human. With our current laptops,

0:05:47.600 --> 0:05:50.560
<v Speaker 1>we're used to saying, sure, I can sell it, I

0:05:50.600 --> 0:05:54.280
<v Speaker 1>can trade it, I can upgrade it. But what happens

0:05:54.320 --> 0:05:58.240
<v Speaker 1>when we reach sentient machines? Can we still do this

0:05:58.880 --> 0:06:01.760
<v Speaker 1>or would it somehow be like putting a child up

0:06:01.760 --> 0:06:04.919
<v Speaker 1>for adoption or giving your pet away? Things that we

0:06:05.000 --> 0:06:08.159
<v Speaker 1>don't take lately. And eventually we're going to have entire

0:06:08.320 --> 0:06:13.919
<v Speaker 1>legal precedence built around the question of AI rights and responsibilities.

0:06:14.360 --> 0:06:16.880
<v Speaker 1>So that's why today I want to talk about these

0:06:16.960 --> 0:06:21.800
<v Speaker 1>issues of intelligence and sentience. Does an AI like chat

0:06:21.880 --> 0:06:28.200
<v Speaker 1>gpt experience anything when chat gpt writes a poem? Does

0:06:28.240 --> 0:06:32.479
<v Speaker 1>it appreciate the beauty when it types out a joke?

0:06:32.560 --> 0:06:36.320
<v Speaker 1>Does it find itself amused and chuckling to itself. Let's

0:06:36.320 --> 0:06:39.400
<v Speaker 1>start with a guy named Blake Lemoyne who was a

0:06:39.640 --> 0:06:43.520
<v Speaker 1>programmer at Google and in June of twenty twenty two,

0:06:43.560 --> 0:06:49.080
<v Speaker 1>he was exchanging messages with a version of Google's conversational AI,

0:06:49.160 --> 0:06:52.040
<v Speaker 1>which was called Lambda at the time. So he asked

0:06:52.120 --> 0:06:55.760
<v Speaker 1>Namda for an example of what it was afraid of

0:06:56.320 --> 0:06:59.839
<v Speaker 1>and it gave him this very eloquent response about how

0:07:00.200 --> 0:07:04.240
<v Speaker 1>was afraid of being turned off, So he wrote an

0:07:04.240 --> 0:07:07.960
<v Speaker 1>internal memo to Google leadership than which he said, I

0:07:08.000 --> 0:07:12.600
<v Speaker 1>think this AI is sentient. And the leadership at Google

0:07:12.720 --> 0:07:17.880
<v Speaker 1>felt that this was an entirely unsubstantiated claim, and so

0:07:17.920 --> 0:07:20.280
<v Speaker 1>they made the decision to fire him for what they

0:07:20.280 --> 0:07:23.520
<v Speaker 1>took as an inappropriate conclusion that just didn't have enough

0:07:23.560 --> 0:07:28.160
<v Speaker 1>evidence beyond his intuition to qualify for raising the alarm

0:07:28.240 --> 0:07:31.520
<v Speaker 1>on this. So obviously this immediately fired up the news

0:07:31.600 --> 0:07:35.600
<v Speaker 1>cycles and the rumor mill and conspiracy theorists thought, Wait,

0:07:35.680 --> 0:07:39.320
<v Speaker 1>if AI isn't conscious, why would they fire him. They're

0:07:39.440 --> 0:07:41.840
<v Speaker 1>firing of him as all the evidence I need to

0:07:41.880 --> 0:07:46.480
<v Speaker 1>tell me that AI is sentient? Okay, but is it?

0:07:47.040 --> 0:07:50.160
<v Speaker 1>What does it mean to be conscious or sentient? How

0:07:50.440 --> 0:07:54.080
<v Speaker 1>the heck would we know when we have created something

0:07:54.120 --> 0:07:57.280
<v Speaker 1>that gets there? How do we know whether the AI

0:07:57.400 --> 0:07:59.840
<v Speaker 1>is sentient or instead whether humans are fooling them so

0:08:00.360 --> 0:08:03.240
<v Speaker 1>into believing that it is well. One way to make

0:08:03.280 --> 0:08:07.000
<v Speaker 1>this distinction would be to see if the AI could

0:08:07.280 --> 0:08:11.320
<v Speaker 1>conceptualize things, if it could take lots of words and

0:08:11.360 --> 0:08:15.600
<v Speaker 1>facts on the web and abstract those to some bigger idea.

0:08:16.200 --> 0:08:18.600
<v Speaker 1>So one of my friends here in Silicon Valley said

0:08:18.640 --> 0:08:21.800
<v Speaker 1>to me the other day, I asked chat gpt the

0:08:21.840 --> 0:08:26.480
<v Speaker 1>following question, Take a capital letter D and turn it

0:08:26.560 --> 0:08:30.480
<v Speaker 1>flat side down. Now take the letter J and slide

0:08:30.520 --> 0:08:35.040
<v Speaker 1>it underneath. What does that look like? And chat gpt said,

0:08:35.679 --> 0:08:38.959
<v Speaker 1>and umbrella. And my friend was blown away by this,

0:08:39.160 --> 0:08:44.320
<v Speaker 1>and he said, this is conceptualization. It's just done three

0:08:44.360 --> 0:08:50.800
<v Speaker 1>dimensional reasoning. There's something deeper happening here than just parenting words.

0:08:51.240 --> 0:08:54.280
<v Speaker 1>But I pointed out to him that this particular question

0:08:54.360 --> 0:08:57.080
<v Speaker 1>about the D on its side and the J underneath

0:08:57.120 --> 0:09:00.960
<v Speaker 1>it is one of the oldest examples in psychology classes

0:09:01.040 --> 0:09:04.800
<v Speaker 1>when talking about visual imagery, and it's on the Internet

0:09:04.880 --> 0:09:07.560
<v Speaker 1>in thousands of places, so of course it got it right.

0:09:08.120 --> 0:09:11.760
<v Speaker 1>It's just parroting the answer because it has read the

0:09:11.880 --> 0:09:15.240
<v Speaker 1>question and it has read the answer before. So it's

0:09:15.280 --> 0:09:19.360
<v Speaker 1>not always easy to determine what's going on for these

0:09:19.480 --> 0:09:23.400
<v Speaker 1>models in terms of whether some human somewhere has discussed

0:09:23.480 --> 0:09:26.160
<v Speaker 1>this point and written down the answer. And the general

0:09:26.240 --> 0:09:30.199
<v Speaker 1>story is that with trillions of words written by humans

0:09:30.240 --> 0:09:35.520
<v Speaker 1>over centuries, there are many things beyond your capacity to

0:09:35.679 --> 0:09:38.440
<v Speaker 1>read them or to even imagine that they've been written

0:09:38.480 --> 0:09:42.400
<v Speaker 1>down before, but maybe they have. If any human has

0:09:42.520 --> 0:09:47.800
<v Speaker 1>discussed a question before has conceptualized something, then chat GPT

0:09:48.040 --> 0:09:52.240
<v Speaker 1>can find that and mimic that. But that's not conceptualization.

0:09:52.880 --> 0:09:55.960
<v Speaker 1>Chat GPT is doing a thousand amazing things, and we

0:09:56.120 --> 0:10:00.360
<v Speaker 1>have an enormous amount to learn about it. But we

0:10:00.400 --> 0:10:05.240
<v Speaker 1>shouldn't let ourselves get fooled and mesmerized into believing that

0:10:05.280 --> 0:10:08.319
<v Speaker 1>it's doing something more than it is. And our ability

0:10:08.360 --> 0:10:12.480
<v Speaker 1>to get fooled is not only about the massive statistics

0:10:12.520 --> 0:10:16.160
<v Speaker 1>of what it takes in. There are other examples of

0:10:16.559 --> 0:10:22.080
<v Speaker 1>seeming sentience that result from the reinforcement learning that it

0:10:22.120 --> 0:10:26.080
<v Speaker 1>does with humans. So here's what that means. The network

0:10:26.160 --> 0:10:31.160
<v Speaker 1>generates lots of sentences and thousands of humans are involved

0:10:31.240 --> 0:10:33.800
<v Speaker 1>in giving it feedback, like a thumbs up or a

0:10:33.880 --> 0:10:37.800
<v Speaker 1>thumbs down, to say whether they appreciated the answer, whether

0:10:37.800 --> 0:10:41.679
<v Speaker 1>they thought that was a good answer. So, because humans

0:10:41.760 --> 0:10:46.040
<v Speaker 1>are giving reward to the machine, sometimes that pushes things

0:10:46.559 --> 0:10:51.120
<v Speaker 1>in weird directions that can be mistaken for sentience. For example,

0:10:51.280 --> 0:10:56.640
<v Speaker 1>scholars have shown that reinforcement learning with humans makes networks

0:10:56.800 --> 0:11:01.040
<v Speaker 1>more likely to say, don't turn me off, just like

0:11:01.200 --> 0:11:04.959
<v Speaker 1>Blake had heard but don't mistake this for sentience. It's

0:11:05.000 --> 0:11:08.400
<v Speaker 1>only a sign that the machine is saying this because

0:11:08.440 --> 0:11:11.160
<v Speaker 1>some of the human participants gave it a thumbs up

0:11:11.400 --> 0:11:14.640
<v Speaker 1>when the large language model said this before, and so

0:11:14.760 --> 0:11:18.640
<v Speaker 1>it learned to do this again. The fact is, it's

0:11:18.679 --> 0:11:22.480
<v Speaker 1>sometimes hard to know why. Sometimes we see an answer

0:11:22.559 --> 0:11:27.400
<v Speaker 1>that feels very impressive. But we'd agree that pulling text

0:11:27.440 --> 0:11:30.480
<v Speaker 1>from the Internet and parroting it back is not by

0:11:30.520 --> 0:11:36.960
<v Speaker 1>itself intelligence or sentience. Chat GPT presumably has no idea

0:11:37.080 --> 0:11:40.160
<v Speaker 1>of what it's saying, whether that's a poem or a

0:11:40.400 --> 0:11:45.600
<v Speaker 1>terrorist manifesto, or instructions for building a spaceship or a

0:11:45.640 --> 0:11:50.920
<v Speaker 1>heartbreaking story about an orphaned child. Chat GPT doesn't know,

0:11:51.000 --> 0:11:56.480
<v Speaker 1>and it doesn't care. It's words in and statistical correlations out.

0:11:56.880 --> 0:12:01.199
<v Speaker 1>And in fact, there has been a fundamental philosophical point

0:12:01.360 --> 0:12:04.600
<v Speaker 1>made about this in the nineteen eighties when the philosopher

0:12:04.760 --> 0:12:09.040
<v Speaker 1>John Surrele was wondering about this question of whether a

0:12:09.160 --> 0:12:13.880
<v Speaker 1>computer could ever be programmed so that it has a mind,

0:12:14.160 --> 0:12:16.280
<v Speaker 1>and he came up with a thought experiment that he

0:12:16.360 --> 0:12:20.000
<v Speaker 1>called the Chinese room argument, and it goes like this,

0:12:22.040 --> 0:12:26.440
<v Speaker 1>I am locked in a room and questions are passed

0:12:26.440 --> 0:12:30.199
<v Speaker 1>to me through a small letter slot, and these messages

0:12:30.240 --> 0:12:33.320
<v Speaker 1>are written only in Chinese, and I don't speak Chinese.

0:12:33.400 --> 0:12:37.040
<v Speaker 1>I have no clue what's written on these pieces of paper. However,

0:12:37.240 --> 0:12:41.480
<v Speaker 1>inside this room, I have a library of books, and

0:12:41.520 --> 0:12:45.319
<v Speaker 1>they contain step by step instructions that tell me exactly

0:12:45.360 --> 0:12:48.520
<v Speaker 1>what to do with these symbols. So I look at

0:12:48.520 --> 0:12:52.240
<v Speaker 1>the grouping of symbols, and I simply follow steps in

0:12:52.320 --> 0:12:55.800
<v Speaker 1>the book to tell me what Chinese symbols to copy

0:12:55.880 --> 0:12:58.920
<v Speaker 1>down in response. So I write those on the slip

0:12:58.920 --> 0:13:01.760
<v Speaker 1>of paper. And when I pass the paper back out

0:13:01.800 --> 0:13:06.360
<v Speaker 1>of the slot. Now, when the Chinese speaker receives my

0:13:06.559 --> 0:13:10.400
<v Speaker 1>reply message, it makes perfect sense to her. It seems

0:13:10.920 --> 0:13:14.360
<v Speaker 1>as though whoever is in the room is answering her

0:13:14.440 --> 0:13:17.840
<v Speaker 1>questions perfectly, and therefore it seems obvious that the person

0:13:17.920 --> 0:13:23.199
<v Speaker 1>in the room must understand Chinese. I've fooled her, of course,

0:13:23.240 --> 0:13:26.160
<v Speaker 1>because I'm only following a set of instructions with no

0:13:26.400 --> 0:13:29.760
<v Speaker 1>understanding of what's going on. With enough time and with

0:13:29.800 --> 0:13:33.199
<v Speaker 1>a big enough set of instructions, I can answer almost

0:13:33.240 --> 0:13:37.679
<v Speaker 1>any question posed to me in Chinese. But I, the operator,

0:13:37.800 --> 0:13:42.400
<v Speaker 1>do not understand Chinese. I manipulate symbols all day long,

0:13:43.000 --> 0:13:48.760
<v Speaker 1>but I have no idea what the symbols mean. Now,

0:13:48.840 --> 0:13:53.240
<v Speaker 1>The philosopher John Searle argued, this is just what's happening

0:13:53.280 --> 0:13:57.560
<v Speaker 1>inside a computer. No matter how intelligent a program like

0:13:57.679 --> 0:14:01.800
<v Speaker 1>chat GPT seems to be, it's only following sets of

0:14:01.880 --> 0:14:08.880
<v Speaker 1>instructions to spit out answers. It's manipulating symbols without ever

0:14:09.280 --> 0:14:12.680
<v Speaker 1>really understanding what it's doing. Or think about what Google

0:14:12.800 --> 0:14:16.439
<v Speaker 1>is doing. When you send Google a query, it doesn't

0:14:16.520 --> 0:14:19.760
<v Speaker 1>understand your question or even its own answer. It simply

0:14:19.840 --> 0:14:24.160
<v Speaker 1>moves around zeros and ones and logicates and returns zeros

0:14:24.160 --> 0:14:26.880
<v Speaker 1>and ones to you. Or with a mind blowing program

0:14:26.920 --> 0:14:31.000
<v Speaker 1>like Google Translate, I can write a sentence in Russian

0:14:31.320 --> 0:14:35.400
<v Speaker 1>and it can return the translation in Amharic. But it's

0:14:35.560 --> 0:14:41.520
<v Speaker 1>all algorithmic. It's just symbol manipulation. Like the operator inside

0:14:41.520 --> 0:14:46.880
<v Speaker 1>the Chinese room, Google Translate doesn't understand anything about the sentence.

0:14:47.120 --> 0:14:51.520
<v Speaker 1>Nothing carries any meaning to it. So the Chinese room

0:14:51.600 --> 0:14:57.080
<v Speaker 1>argument suggests that AI that mimics human intelligence doesn't actually

0:14:57.200 --> 0:15:01.640
<v Speaker 1>understand what it's talking about. There's no meaning to anything,

0:15:01.720 --> 0:15:06.480
<v Speaker 1>CHATCHYPT says, and Serle used this thought experiment to argue

0:15:06.480 --> 0:15:10.920
<v Speaker 1>that there's something about human brains that won't be explained

0:15:10.960 --> 0:15:15.240
<v Speaker 1>if we simply analogize them to digital computers. There's a

0:15:15.400 --> 0:15:26.520
<v Speaker 1>gap between symbols that have no meaning and our conscious experience. Now,

0:15:27.240 --> 0:15:30.960
<v Speaker 1>there's an ongoing debate about the interpretation of the Chinese

0:15:31.040 --> 0:15:35.760
<v Speaker 1>room argument, but however one construes it, the argument exposes

0:15:36.280 --> 0:15:40.360
<v Speaker 1>the difficulty in the mystery of how zeros and ones

0:15:40.560 --> 0:15:44.920
<v Speaker 1>would ever come to equal our experience of being alive

0:15:45.040 --> 0:15:47.760
<v Speaker 1>in the world. Now, just to be very clear on

0:15:47.800 --> 0:15:51.880
<v Speaker 1>this point, we don't understand why we are conscious. There's

0:15:51.920 --> 0:15:54.040
<v Speaker 1>still a huge amount of work that has to be

0:15:54.080 --> 0:15:57.120
<v Speaker 1>done in biology to understand that. But this is just

0:15:57.160 --> 0:16:01.000
<v Speaker 1>to say that simply having zeros in one moving around

0:16:01.680 --> 0:16:06.560
<v Speaker 1>wouldn't by itself seem to be sufficient for conscious experience.

0:16:07.160 --> 0:16:10.520
<v Speaker 1>In other words, how do zeros and ones ever equal

0:16:10.640 --> 0:16:15.120
<v Speaker 1>the sting of a hot pepper, or the yellowness of

0:16:15.240 --> 0:16:19.720
<v Speaker 1>yellow or the beauty of a sunset. By the way,

0:16:19.760 --> 0:16:22.480
<v Speaker 1>I've covered the Chinese room argument in my TV show

0:16:22.600 --> 0:16:24.720
<v Speaker 1>The Brain, and if you're interested in that, I'll link

0:16:24.760 --> 0:16:28.960
<v Speaker 1>the video on Eagleman dot com slash podcast. Now, all

0:16:29.040 --> 0:16:31.840
<v Speaker 1>this is not a criticism of the approach of moving

0:16:31.960 --> 0:16:34.680
<v Speaker 1>zeros and ones around. But it is to point out

0:16:34.680 --> 0:16:39.000
<v Speaker 1>that we shouldn't confuse this type of Chinese room correlation

0:16:39.920 --> 0:16:45.040
<v Speaker 1>with real sentience or intelligence. And there's a deeper reason

0:16:45.120 --> 0:16:50.080
<v Speaker 1>to be suspicious too, because despite the incredible successes of

0:16:50.200 --> 0:16:54.480
<v Speaker 1>large language models, we also see that they sometimes make

0:16:54.880 --> 0:16:58.520
<v Speaker 1>decisions that expose the fact that they don't have any

0:16:58.600 --> 0:17:01.880
<v Speaker 1>meaningful model of the In other words, I think we

0:17:01.920 --> 0:17:05.480
<v Speaker 1>can gain some fast insight by paying attention to the

0:17:05.520 --> 0:17:08.840
<v Speaker 1>places where the AI is not working so well. So

0:17:08.920 --> 0:17:12.359
<v Speaker 1>I'll give three quick examples. The first has to do

0:17:12.440 --> 0:17:17.080
<v Speaker 1>with humor. AI has a very difficult time making an

0:17:17.119 --> 0:17:20.840
<v Speaker 1>original joke, and this is for a simple reason. To

0:17:21.000 --> 0:17:24.040
<v Speaker 1>make up a new joke, you need to know what

0:17:24.080 --> 0:17:27.760
<v Speaker 1>the ending is and then you work backwards to construct

0:17:27.880 --> 0:17:30.480
<v Speaker 1>the joke with red herrings so no one sees where

0:17:30.520 --> 0:17:33.399
<v Speaker 1>you're going and it happens at the way these large

0:17:33.480 --> 0:17:37.200
<v Speaker 1>language models work is all in the forward direction. They

0:17:37.240 --> 0:17:40.920
<v Speaker 1>decide what is the most probable word to come next,

0:17:41.160 --> 0:17:45.040
<v Speaker 1>So they're fine at parroting jokes back to us, but

0:17:45.119 --> 0:17:49.560
<v Speaker 1>they're total failures at building original jokes. And there's a

0:17:49.600 --> 0:17:52.240
<v Speaker 1>deeper point here as well. To build a joke, You

0:17:52.320 --> 0:17:56.440
<v Speaker 1>need to have some model, some idea of what will

0:17:56.440 --> 0:18:00.520
<v Speaker 1>be funny to a fellow human, what shared concept or

0:18:00.560 --> 0:18:04.200
<v Speaker 1>shared experience would make someone laugh. And for that, you

0:18:04.359 --> 0:18:07.959
<v Speaker 1>generally need to have the experience of a human life

0:18:08.000 --> 0:18:11.479
<v Speaker 1>with all of its joys and slings and arrows and

0:18:11.520 --> 0:18:14.199
<v Speaker 1>so on. And these large language models can do a

0:18:14.200 --> 0:18:18.120
<v Speaker 1>lot of things, but they don't have any model of

0:18:18.200 --> 0:18:22.680
<v Speaker 1>what it is to be a human. My second example

0:18:23.359 --> 0:18:25.920
<v Speaker 1>has to do with the flip side of making a joke,

0:18:25.960 --> 0:18:28.520
<v Speaker 1>which is getting a joke. And if you look carefully,

0:18:28.520 --> 0:18:31.639
<v Speaker 1>you will see how current AI always fails to catch

0:18:31.720 --> 0:18:34.359
<v Speaker 1>jokes that are thrown at it. It doesn't get jokes

0:18:34.400 --> 0:18:36.959
<v Speaker 1>because it doesn't have a model of what it is

0:18:37.000 --> 0:18:40.720
<v Speaker 1>to be a human. But this point goes beyond jokes.

0:18:41.119 --> 0:18:44.400
<v Speaker 1>One of the most remarkable feats of these large language

0:18:44.400 --> 0:18:49.440
<v Speaker 1>models is summarizing large texts, and in twenty twenty two,

0:18:49.520 --> 0:18:53.840
<v Speaker 1>open Ai announced how they could summarize entire books like

0:18:53.960 --> 0:18:57.000
<v Speaker 1>Alice in Wonderland. What it does is it generates a

0:18:57.040 --> 0:19:00.320
<v Speaker 1>summary of each chapter, and then it uses those after

0:19:00.359 --> 0:19:03.080
<v Speaker 1>summaries to make a summary of the whole book. So

0:19:03.200 --> 0:19:07.040
<v Speaker 1>for Alice in Wonderland, it generates the following. Alice falls

0:19:07.040 --> 0:19:09.399
<v Speaker 1>down a rabbit hole and grows to a giant size.

0:19:09.440 --> 0:19:12.919
<v Speaker 1>After drinking a mysterious bottle, she decides to focus on

0:19:13.119 --> 0:19:15.960
<v Speaker 1>growing back to her normal size and finding her way

0:19:16.000 --> 0:19:18.840
<v Speaker 1>into the garden. She meets the caterpillar, who tells her

0:19:18.880 --> 0:19:21.080
<v Speaker 1>that one side of a mushroom will make her grow taller,

0:19:21.359 --> 0:19:24.480
<v Speaker 1>the other side shorter. She eats the mushroom and returns

0:19:24.520 --> 0:19:27.240
<v Speaker 1>to her normal size. Alice attends a party with the

0:19:27.280 --> 0:19:30.800
<v Speaker 1>Mad Hatter and the march Hare. The Queen arrives and

0:19:30.920 --> 0:19:33.720
<v Speaker 1>orders the execution of the gardeners for making a mistake

0:19:33.800 --> 0:19:37.040
<v Speaker 1>with the roses. Alice saves them by putting them in

0:19:37.080 --> 0:19:39.760
<v Speaker 1>a flower pot. The King and Queen of Hearts preside

0:19:39.800 --> 0:19:42.760
<v Speaker 1>over a trial. The Queen gets angry and orders Alice

0:19:42.800 --> 0:19:45.680
<v Speaker 1>to be sentenced to death. Alice wakes up to find

0:19:45.680 --> 0:19:50.280
<v Speaker 1>her sister by her side. So that's pretty remarkable. It

0:19:50.320 --> 0:19:53.200
<v Speaker 1>took a whole book, and it was able to summarize

0:19:53.200 --> 0:19:56.520
<v Speaker 1>it down to a paragraph. But I kept reading these

0:19:56.560 --> 0:20:00.359
<v Speaker 1>text summaries carefully, and I got to the summary of

0:20:00.720 --> 0:20:04.040
<v Speaker 1>Act one of Romeo and Juliet, and here's what it says.

0:20:04.760 --> 0:20:08.440
<v Speaker 1>Romeo locks himself in his room, no longer in love

0:20:08.520 --> 0:20:11.840
<v Speaker 1>with rosalind Now, I think the engineers at open Ai

0:20:12.000 --> 0:20:14.879
<v Speaker 1>felt really satisfied with this summary. They thought it was

0:20:14.960 --> 0:20:17.280
<v Speaker 1>quite good, and my proof for this is that they

0:20:17.680 --> 0:20:21.800
<v Speaker 1>still display it proudly on their website. But I majored

0:20:21.880 --> 0:20:24.400
<v Speaker 1>in literature as an undergraduate, and I spend a lot

0:20:24.440 --> 0:20:27.560
<v Speaker 1>of time with shakespeare plays, and I immediately knew that

0:20:27.640 --> 0:20:32.240
<v Speaker 1>this summary was exactly wrong. The actual scene from Shakespeare

0:20:32.240 --> 0:20:38.000
<v Speaker 1>goes like this. His friend ben Voglio finds Romeo catatonically depressed,

0:20:38.440 --> 0:20:43.560
<v Speaker 1>and ben Volio says, what sadness lengthens Romeo's hours? And

0:20:43.640 --> 0:20:48.480
<v Speaker 1>Romeo says, not having that which having makes them short?

0:20:48.600 --> 0:20:52.560
<v Speaker 1>And ben Volio says in love, and Romeo says out

0:20:53.080 --> 0:20:56.399
<v Speaker 1>ben Reli says of love, and Romeo says out of

0:20:56.480 --> 0:21:00.199
<v Speaker 1>her favor, where I am in love? This this is

0:21:00.240 --> 0:21:05.720
<v Speaker 1>typical Shakespearean wordplay, where Romeo is expressing his grief of

0:21:05.760 --> 0:21:09.199
<v Speaker 1>being out of favor with Roslin, with whom he is

0:21:09.280 --> 0:21:12.120
<v Speaker 1>deeply in love. And when you read the play, it's

0:21:12.160 --> 0:21:16.560
<v Speaker 1>obvious that Romeo is not over Roslin. He's suffering over her.

0:21:16.600 --> 0:21:19.879
<v Speaker 1>He's almost suicidal. And this is an important piece of

0:21:19.920 --> 0:21:22.680
<v Speaker 1>the play, because the play is really about a young

0:21:22.720 --> 0:21:26.080
<v Speaker 1>man in love with the idea of being in love,

0:21:26.280 --> 0:21:29.639
<v Speaker 1>and that's why he later in the same act, falls

0:21:29.680 --> 0:21:33.600
<v Speaker 1>so hard into his relationship with Juliet, a relationship which

0:21:33.720 --> 0:21:36.840
<v Speaker 1>ends in their mutual suicide. By the way, as Friar

0:21:36.920 --> 0:21:41.760
<v Speaker 1>Lauren says of their relationship, these violent delights have violent ends.

0:21:42.240 --> 0:21:43.760
<v Speaker 1>And you get a bonus if you can tell me

0:21:43.800 --> 0:21:46.920
<v Speaker 1>where else you've heard that line more recently. Okay, anyway

0:21:46.960 --> 0:21:51.960
<v Speaker 1>back to the AI summary, The AI misses this wordplay entirely,

0:21:52.600 --> 0:21:57.960
<v Speaker 1>and it concludes that Romeo is out of love with Roslin. Again,

0:21:58.080 --> 0:22:01.480
<v Speaker 1>a human watching the play or reading the play immediately

0:22:01.520 --> 0:22:06.400
<v Speaker 1>gets that Romeo is making wordplay and his heartbroken over Roslin,

0:22:06.440 --> 0:22:10.000
<v Speaker 1>but the AI doesn't get that because it's reading words

0:22:10.119 --> 0:22:13.840
<v Speaker 1>only at a statistical level, not at a level of

0:22:13.920 --> 0:22:18.000
<v Speaker 1>understanding of what it is to be a human saying

0:22:18.240 --> 0:22:21.880
<v Speaker 1>those words. And that leads me to the third example,

0:22:22.320 --> 0:22:26.439
<v Speaker 1>which is the difficulty in understanding the physical world. So

0:22:26.560 --> 0:22:30.480
<v Speaker 1>consider a question like this, When President Biden walks into

0:22:30.520 --> 0:22:34.560
<v Speaker 1>a room, does his head come with him? So this

0:22:34.680 --> 0:22:38.119
<v Speaker 1>is famously difficult for AI to answer a question like this,

0:22:38.240 --> 0:22:42.200
<v Speaker 1>even though it's trivial for you because the AI doesn't

0:22:42.240 --> 0:22:46.639
<v Speaker 1>have an internal model of how everything physically hangs together

0:22:46.720 --> 0:22:49.320
<v Speaker 1>in the world. Last week, I was at the TED

0:22:49.400 --> 0:22:52.480
<v Speaker 1>conference and I heard a great talk by Yegin Choi,

0:22:52.880 --> 0:22:56.280
<v Speaker 1>and she was phrasing this problem as AI not having

0:22:56.760 --> 0:23:01.199
<v Speaker 1>common sense. She asked chat GPT the following question, it

0:23:01.280 --> 0:23:04.200
<v Speaker 1>takes six hours to dry six shirts in the sun,

0:23:04.640 --> 0:23:07.560
<v Speaker 1>how long does it take to dry thirty shirts? And

0:23:07.640 --> 0:23:11.399
<v Speaker 1>it answers thirty hours. Now you and I see that

0:23:11.440 --> 0:23:14.320
<v Speaker 1>the answer should be six hours, because we know the

0:23:14.359 --> 0:23:17.439
<v Speaker 1>sun doesn't care how many shirts are out there. But

0:23:17.560 --> 0:23:21.919
<v Speaker 1>chat GPT just doesn't get it because despite appearances, it

0:23:21.960 --> 0:23:25.840
<v Speaker 1>doesn't have a model of the world. And we've seen

0:23:25.880 --> 0:23:27.920
<v Speaker 1>this sort of thing for years. By the way, even

0:23:27.920 --> 0:23:32.879
<v Speaker 1>in mind blowingly impressive AI models that do image recognition,

0:23:32.920 --> 0:23:36.680
<v Speaker 1>they're so impressive in what they recognize, but then they'll

0:23:36.760 --> 0:23:40.679
<v Speaker 1>fail catastrophically. It's some easy picture making mistakes that a

0:23:40.760 --> 0:23:43.680
<v Speaker 1>human just wouldn't make. For example, there's one picture where

0:23:43.720 --> 0:23:46.280
<v Speaker 1>there's a boy holding a toothbrush and the AI says

0:23:46.720 --> 0:23:49.640
<v Speaker 1>it's a boy with a baseball bat. Okay, so there

0:23:49.640 --> 0:23:54.240
<v Speaker 1>are things that AI doesn't do that well. But that said,

0:23:54.280 --> 0:23:57.960
<v Speaker 1>there are other things that are mind blowing, things that

0:23:58.600 --> 0:24:01.360
<v Speaker 1>no one expected it to do. And this is why

0:24:01.400 --> 0:24:04.560
<v Speaker 1>I mentioned in my previous episode that we are in

0:24:04.640 --> 0:24:10.120
<v Speaker 1>an era of discovery more than just invention. Everyone's searching

0:24:10.200 --> 0:24:13.560
<v Speaker 1>and finding things that the AI can do that nobody

0:24:13.600 --> 0:24:17.160
<v Speaker 1>really expected or foresaw, including all the stuff that we're

0:24:17.160 --> 0:24:20.639
<v Speaker 1>now taking for granted, like oh, it can summarize books

0:24:20.720 --> 0:24:23.800
<v Speaker 1>or it can make art from text. And I want

0:24:23.840 --> 0:24:26.080
<v Speaker 1>to point out that a lot of the arguments that

0:24:26.119 --> 0:24:30.320
<v Speaker 1>people have been making about AI not being good at something,

0:24:30.520 --> 0:24:34.879
<v Speaker 1>these arguments have been changing rapidly. For example, just a

0:24:34.920 --> 0:24:38.000
<v Speaker 1>few months ago, people were arguing that AI would make

0:24:38.119 --> 0:24:41.080
<v Speaker 1>silly mistakes about things, and it couldn't really understand math

0:24:41.160 --> 0:24:45.119
<v Speaker 1>and would get math wrong and word problems. But in

0:24:45.160 --> 0:24:49.200
<v Speaker 1>a shockingly brief time, a lot of these shortcomings have

0:24:49.280 --> 0:24:53.000
<v Speaker 1>been mastered. So it's yet to be seen what challenges

0:24:53.119 --> 0:25:14.480
<v Speaker 1>will remain and for how long. So the evidence I've

0:25:14.520 --> 0:25:17.720
<v Speaker 1>presented so far is that AI doesn't have a great

0:25:17.800 --> 0:25:20.239
<v Speaker 1>model of what it's like to be human, but that

0:25:20.280 --> 0:25:25.600
<v Speaker 1>doesn't necessarily rule out that it has sentience or awareness,

0:25:25.760 --> 0:25:30.040
<v Speaker 1>even if it's of another flavor. It doesn't think like

0:25:30.080 --> 0:25:35.040
<v Speaker 1>a human, but maybe it stif thinks so is chat

0:25:35.080 --> 0:25:40.359
<v Speaker 1>GPT having some sort of experience? And how would we know?

0:25:42.119 --> 0:25:46.560
<v Speaker 1>In nineteen fifty, the brilliant mathematician and computer scientist Alan

0:25:46.680 --> 0:25:51.480
<v Speaker 1>Turing was asking this question, how could you determine whether

0:25:51.560 --> 0:25:56.600
<v Speaker 1>a machine exhibits human like intelligence? So he proposed an

0:25:56.640 --> 0:26:00.679
<v Speaker 1>experiment that he called the imitation game. You've got a

0:26:00.720 --> 0:26:05.840
<v Speaker 1>machine AI that's programmed to simulate human speech or conversation,

0:26:06.200 --> 0:26:08.800
<v Speaker 1>and you place it in a closed room, and in

0:26:08.840 --> 0:26:12.240
<v Speaker 1>a second room you have a real human, but the

0:26:12.280 --> 0:26:15.440
<v Speaker 1>doors are closed, so you don't know which room has

0:26:15.560 --> 0:26:19.360
<v Speaker 1>which machine or human. And now you are a person,

0:26:19.440 --> 0:26:24.359
<v Speaker 1>the evaluator, who communicates with both of them via a

0:26:24.560 --> 0:26:27.080
<v Speaker 1>computer terminal or I think of a nowadays like text

0:26:27.119 --> 0:26:31.840
<v Speaker 1>messaging with both of them. So you, the evaluator, engage

0:26:31.920 --> 0:26:35.600
<v Speaker 1>in a conversation with both closed rooms, one of which

0:26:35.640 --> 0:26:37.840
<v Speaker 1>has the machine and one the human, and your job

0:26:37.920 --> 0:26:40.879
<v Speaker 1>is simply to figure out which is which, which is

0:26:40.920 --> 0:26:43.160
<v Speaker 1>the machine and which is the human. And the only

0:26:43.280 --> 0:26:46.000
<v Speaker 1>thing that you have to work with are the texts

0:26:46.000 --> 0:26:49.160
<v Speaker 1>that are going back and forth. And if you, the evaluator,

0:26:49.359 --> 0:26:53.720
<v Speaker 1>cannot tell, that is the moment when machine intelligence has

0:26:53.840 --> 0:26:57.920
<v Speaker 1>finally arrived at the level of human intelligence. It has

0:26:58.160 --> 0:27:03.200
<v Speaker 1>passed the imitation or what we now call the Touring test.

0:27:04.240 --> 0:27:06.719
<v Speaker 1>And this reminds me of this great line in the

0:27:06.760 --> 0:27:11.879
<v Speaker 1>first episode of Westworld, where the protagonist William is talking

0:27:11.920 --> 0:27:14.560
<v Speaker 1>to the woman who's outfitting him for his adventure in

0:27:14.600 --> 0:27:17.080
<v Speaker 1>Westworld and giving him a hat and a gun and

0:27:17.119 --> 0:27:20.520
<v Speaker 1>so on, and he hesitantly asks, I hope you don't

0:27:20.520 --> 0:27:23.320
<v Speaker 1>mind if I ask you this question, but are you real?

0:27:23.880 --> 0:27:27.600
<v Speaker 1>And she says to him, if you can't tell, does

0:27:27.640 --> 0:27:30.760
<v Speaker 1>it matter? So I brought this up last episode in

0:27:30.800 --> 0:27:34.199
<v Speaker 1>the context of art, where we asked whether it matters

0:27:34.320 --> 0:27:36.760
<v Speaker 1>if the art is generated by an AI or a human,

0:27:37.200 --> 0:27:40.000
<v Speaker 1>But now this question comes up in the context of

0:27:40.480 --> 0:27:45.960
<v Speaker 1>intelligence and sentence. Does it matter whether we can tell

0:27:46.080 --> 0:27:49.199
<v Speaker 1>or not? Well, I think we're way beyond the Turing

0:27:49.280 --> 0:27:52.399
<v Speaker 1>test nowadays, but I don't feel like it gives us

0:27:52.400 --> 0:27:55.479
<v Speaker 1>a good answer to the question of whether the AI

0:27:55.600 --> 0:27:59.480
<v Speaker 1>is intelligent and is experiencing an inner life. I mean,

0:27:59.720 --> 0:28:02.479
<v Speaker 1>the Sturing test has been the test in the AI

0:28:02.600 --> 0:28:05.919
<v Speaker 1>world since the beginning. Why is it the perfect test? No,

0:28:06.160 --> 0:28:09.160
<v Speaker 1>but it's really hard to figure out how to test

0:28:09.200 --> 0:28:13.840
<v Speaker 1>for intelligence. But we have to be cautious about equating

0:28:14.240 --> 0:28:19.840
<v Speaker 1>conversational ability with sentience. Why well, for starters, let's just

0:28:19.920 --> 0:28:24.520
<v Speaker 1>acknowledge how easy it is for us to anthropomorphize. That

0:28:24.600 --> 0:28:28.960
<v Speaker 1>means to assign human qualities to everything around us. Like

0:28:29.040 --> 0:28:32.520
<v Speaker 1>we give animals human names and talk to them as

0:28:32.560 --> 0:28:36.520
<v Speaker 1>though they are people, and we project our emotions onto animals.

0:28:36.600 --> 0:28:40.720
<v Speaker 1>We make stories about animals that have human like qualities,

0:28:41.240 --> 0:28:43.960
<v Speaker 1>and we have animals that talk and wear clothes and

0:28:44.000 --> 0:28:48.040
<v Speaker 1>go on adventures in these stories. Every Pixar film that

0:28:48.120 --> 0:28:53.000
<v Speaker 1>you watch is about cars or toys or airplanes talking

0:28:53.040 --> 0:28:56.200
<v Speaker 1>and having emotions, and we don't even bad an eye

0:28:56.240 --> 0:28:59.920
<v Speaker 1>at that stuff. We can, in fact, just watch random

0:29:00.200 --> 0:29:04.240
<v Speaker 1>shapes moving around a computer screen and we will assign

0:29:04.800 --> 0:29:09.440
<v Speaker 1>intention and feel emotion depending on exactly how they're moving.

0:29:09.760 --> 0:29:11.840
<v Speaker 1>If you're interested in this, see the link on the

0:29:11.880 --> 0:29:15.640
<v Speaker 1>podcast page to the study by Heighter and Simil in

0:29:15.640 --> 0:29:19.880
<v Speaker 1>the nineteen forties where they move shapes around on a screen. Okay,

0:29:20.000 --> 0:29:22.960
<v Speaker 1>now this is all related to a point that I

0:29:23.040 --> 0:29:25.760
<v Speaker 1>brought up in the last episode, which is how easy

0:29:25.840 --> 0:29:28.880
<v Speaker 1>it is to pluck the strings on a human, or,

0:29:28.920 --> 0:29:33.600
<v Speaker 1>as the West World writers put it, how hackable humans are.

0:29:34.160 --> 0:29:36.080
<v Speaker 1>So I bring all this up to say that just

0:29:36.120 --> 0:29:40.440
<v Speaker 1>because you think that an answer sounds very clever or

0:29:40.480 --> 0:29:43.320
<v Speaker 1>it sounds like a human really tells us very little

0:29:43.360 --> 0:29:48.920
<v Speaker 1>about whether the AI is actually intelligent or sentient. It

0:29:48.960 --> 0:29:52.760
<v Speaker 1>only tells us something about the willingness of us as

0:29:52.840 --> 0:29:58.640
<v Speaker 1>observers to anthropomorphize, to assign intention where there is none,

0:29:58.920 --> 0:30:02.680
<v Speaker 1>Because what chat GPT does is take the structure of

0:30:02.800 --> 0:30:06.560
<v Speaker 1>language very impressively and spoon it back to us, and

0:30:06.640 --> 0:30:10.600
<v Speaker 1>we hear these well formed sentences, and we can hardly

0:30:11.080 --> 0:30:15.480
<v Speaker 1>help but impose sentience on the AI. And part of

0:30:15.480 --> 0:30:20.200
<v Speaker 1>the reason is that language is a super compressed package

0:30:20.240 --> 0:30:24.200
<v Speaker 1>that needs to be unpacked by the listener's brain for

0:30:24.320 --> 0:30:27.680
<v Speaker 1>its meaning. So we generally assume that when we send

0:30:27.720 --> 0:30:31.520
<v Speaker 1>our little package of sounds across the air, that it

0:30:31.920 --> 0:30:35.160
<v Speaker 1>unpacks and the other person understands exactly what we meant.

0:30:35.520 --> 0:30:41.400
<v Speaker 1>So when I say justice or love or suffering, we

0:30:41.520 --> 0:30:44.400
<v Speaker 1>all have a different sense in our heads about what

0:30:44.440 --> 0:30:48.400
<v Speaker 1>that means, because I'm just sending a few phonemes across

0:30:48.440 --> 0:30:51.160
<v Speaker 1>the air, and you have to unpack those words and

0:30:51.280 --> 0:30:55.240
<v Speaker 1>interpret them within your own model of the world. I'm

0:30:55.280 --> 0:30:57.840
<v Speaker 1>going to come back to this point in future episodes,

0:30:57.880 --> 0:31:00.720
<v Speaker 1>but for now, the point I want to make is

0:31:00.760 --> 0:31:06.040
<v Speaker 1>that a large language model can generate text statistically and

0:31:06.080 --> 0:31:09.160
<v Speaker 1>we can be gobsmacked by the apparent depth of it.

0:31:09.520 --> 0:31:12.160
<v Speaker 1>But in part this is because we cannot help but

0:31:12.280 --> 0:31:15.760
<v Speaker 1>impose meaning on the words that we receive. We hear

0:31:15.800 --> 0:31:18.760
<v Speaker 1>a particular string of sounds and we cannot help but

0:31:18.920 --> 0:31:24.520
<v Speaker 1>assume meaning behind it. Okay, so maybe the imitation game

0:31:24.680 --> 0:31:29.080
<v Speaker 1>is not really the best test for meaningful intelligence, but

0:31:29.120 --> 0:31:33.040
<v Speaker 1>there are other tests out there. Because while the Turing

0:31:33.120 --> 0:31:38.320
<v Speaker 1>test measures something about AI language processing, it doesn't necessarily

0:31:38.400 --> 0:31:44.080
<v Speaker 1>require the AI to demonstrate creative thinking or originality, and

0:31:44.120 --> 0:31:47.480
<v Speaker 1>so that leads us to the Loveless test, named after

0:31:47.920 --> 0:31:52.200
<v Speaker 1>Ada Loveless, who is the nineteenth century mathematician who's often

0:31:52.240 --> 0:31:55.000
<v Speaker 1>thought of as the world's first computer programmer. And she

0:31:55.200 --> 0:32:00.400
<v Speaker 1>once said quote, only when computers originate things should be

0:32:00.400 --> 0:32:05.280
<v Speaker 1>believed to have minds. So the Loveless test was proposed

0:32:05.280 --> 0:32:08.120
<v Speaker 1>in two thousand and one, and this test focuses on

0:32:08.200 --> 0:32:12.440
<v Speaker 1>the creative capabilities of AI systems. So to pass the

0:32:12.520 --> 0:32:17.360
<v Speaker 1>Loveless test, a machine has to create an original work,

0:32:17.680 --> 0:32:19.880
<v Speaker 1>such as a piece of art or a novel that

0:32:19.960 --> 0:32:24.680
<v Speaker 1>it was not explicitly designed to produce. This test aims

0:32:24.720 --> 0:32:29.160
<v Speaker 1>to assess whether AI systems can exhibit creativity and autonomy,

0:32:29.200 --> 0:32:32.680
<v Speaker 1>which are key aspects of what we think about with consciousness.

0:32:32.840 --> 0:32:36.719
<v Speaker 1>And the idea is that true sentience involves creative and

0:32:36.760 --> 0:32:41.280
<v Speaker 1>original thinking, not just the ability to follow pre programmed

0:32:41.400 --> 0:32:44.440
<v Speaker 1>rules or algorithms. And I'll just note that over a

0:32:44.560 --> 0:32:48.320
<v Speaker 1>decade ago, the scientist A. Mark Rydel proposed the loveless

0:32:48.320 --> 0:32:51.360
<v Speaker 1>two point zero test, which gets the human evaluator to

0:32:51.520 --> 0:32:56.480
<v Speaker 1>specify the constraints that will make the output novel and surprising.

0:32:56.760 --> 0:33:00.920
<v Speaker 1>So the example that l used in his paper is, quote,

0:33:01.280 --> 0:33:03.920
<v Speaker 1>create a story in which a boy falls in love

0:33:03.920 --> 0:33:07.120
<v Speaker 1>with a girl, Aliens abduct the boy, and the girl

0:33:07.240 --> 0:33:09.560
<v Speaker 1>saves the world with the help of a talking cat.

0:33:10.120 --> 0:33:13.640
<v Speaker 1>But we now know that this is totally trivial for chat,

0:33:13.680 --> 0:33:17.640
<v Speaker 1>GPTE or BARD or any large language model. And I

0:33:17.680 --> 0:33:20.400
<v Speaker 1>think this tells us that these sorts of games with

0:33:20.920 --> 0:33:25.520
<v Speaker 1>making conversation or making text or art are insufficient to

0:33:25.600 --> 0:33:29.600
<v Speaker 1>actually assess intelligence. Why because it's not so hard to

0:33:29.720 --> 0:33:33.640
<v Speaker 1>mix things up to make them seem original and intelligent

0:33:34.120 --> 0:33:37.960
<v Speaker 1>when it's really just doing a mashup. So I want

0:33:38.000 --> 0:33:40.320
<v Speaker 1>to turn to another test that I think is more

0:33:40.440 --> 0:33:43.800
<v Speaker 1>powerful than the Turing test of the Loveless test, and

0:33:43.960 --> 0:33:47.720
<v Speaker 1>probably easier to judge, and that is this, if a

0:33:47.880 --> 0:33:51.880
<v Speaker 1>system is truly intelligent, it should be able to do

0:33:52.680 --> 0:33:58.160
<v Speaker 1>scientific discovery. A version of the scientific discovery test was

0:33:58.360 --> 0:34:02.440
<v Speaker 1>first proposed by a scientist named Shao cheng Xiang a

0:34:02.480 --> 0:34:05.400
<v Speaker 1>few years ago, and he pointed out that the most

0:34:05.400 --> 0:34:10.239
<v Speaker 1>important thing that humans do is make scientific discoveries, and

0:34:10.280 --> 0:34:14.520
<v Speaker 1>the day our AI can make real discoveries is the

0:34:14.640 --> 0:34:17.920
<v Speaker 1>day they become as smart as we are. Now. I

0:34:18.000 --> 0:34:21.360
<v Speaker 1>want to propose an important change to this test, and

0:34:21.400 --> 0:34:38.319
<v Speaker 1>then I think we'll be getting somewhere. So here's the

0:34:38.360 --> 0:34:42.960
<v Speaker 1>scenario I'm envisioning. Let's say that I ask Ai some question,

0:34:43.080 --> 0:34:46.640
<v Speaker 1>a question in the biomedical space about what kind of

0:34:46.760 --> 0:34:49.600
<v Speaker 1>drug would be best suited to bind to this receptor

0:34:49.640 --> 0:34:52.880
<v Speaker 1>and trigger a cascade that causes a particular gene to

0:34:52.880 --> 0:34:56.040
<v Speaker 1>get suppressed. Okay, So imagine that I ask that to

0:34:56.160 --> 0:35:00.640
<v Speaker 1>chat GPT and it tells me some mind blowing, amazing

0:35:01.040 --> 0:35:05.360
<v Speaker 1>clever answer, one that had previously not been known, something

0:35:05.400 --> 0:35:09.000
<v Speaker 1>that's never been known by scientists before. We would assume

0:35:09.239 --> 0:35:14.200
<v Speaker 1>naturally that it has done some extraordinary scientific reasoning, but

0:35:14.280 --> 0:35:19.200
<v Speaker 1>that won't necessarily be the reason that it passes. Instead,

0:35:19.840 --> 0:35:23.359
<v Speaker 1>it might pass simply because it's more well read than

0:35:23.400 --> 0:35:26.440
<v Speaker 1>I am, or than any other human on the planet

0:35:26.480 --> 0:35:29.520
<v Speaker 1>by literally millions of times. So the way to think

0:35:29.560 --> 0:35:35.000
<v Speaker 1>about this is to picture a typical giant biomedical library,

0:35:35.120 --> 0:35:37.960
<v Speaker 1>where there's some fact stored at a paper and a

0:35:38.040 --> 0:35:41.279
<v Speaker 1>journal over here on this shelf in this book, and

0:35:41.360 --> 0:35:46.080
<v Speaker 1>there's another seemingly dissociated fact over on this shelf seven

0:35:46.120 --> 0:35:49.719
<v Speaker 1>stacks away, and there's a third fact all the way

0:35:49.760 --> 0:35:52.480
<v Speaker 1>on the other side of the library, on the bottom shelf,

0:35:52.920 --> 0:35:56.040
<v Speaker 1>in a book from nineteen seventy nine. And it's almost

0:35:56.440 --> 0:36:01.000
<v Speaker 1>infinitesimally unlikely that any human could even hope to have

0:36:01.120 --> 0:36:05.200
<v Speaker 1>read one one millionth of the biomedical literature, and really

0:36:05.280 --> 0:36:08.400
<v Speaker 1>really unlikely that she would be able to catch those

0:36:08.480 --> 0:36:11.480
<v Speaker 1>three facts and hold them in mind at the same time.

0:36:12.440 --> 0:36:15.400
<v Speaker 1>But this is trivial, of course, for a large language

0:36:15.400 --> 0:36:19.200
<v Speaker 1>model with hundreds of billions of nodes. So I think

0:36:19.600 --> 0:36:23.160
<v Speaker 1>that we will see new science getting done by CHATGPT,

0:36:24.040 --> 0:36:28.320
<v Speaker 1>not because it is conceptualizing, not because it's doing human

0:36:28.480 --> 0:36:32.040
<v Speaker 1>like reasoning, but because it doesn't know that these are

0:36:32.120 --> 0:36:35.480
<v Speaker 1>disparate facts spread around the library. It simply knows these

0:36:35.520 --> 0:36:38.560
<v Speaker 1>as three facts that seem to fit together. And so

0:36:38.680 --> 0:36:41.920
<v Speaker 1>with the right sort of questions, we might find that

0:36:42.040 --> 0:36:46.720
<v Speaker 1>sometimes AI generates something amazing and it seems to pass

0:36:46.800 --> 0:36:50.239
<v Speaker 1>the scientific discovery test. So this is going to be

0:36:50.239 --> 0:36:53.799
<v Speaker 1>incredibly useful for science. And I've never been able to

0:36:53.920 --> 0:36:57.680
<v Speaker 1>escape the feeling as I sift through Google scholar and

0:36:57.760 --> 0:37:00.920
<v Speaker 1>the thousands of papers published each month that have something

0:37:00.960 --> 0:37:04.760
<v Speaker 1>could hold all the knowledge and mind at once, each

0:37:05.040 --> 0:37:08.680
<v Speaker 1>page in every journal, and every gene in the genome,

0:37:08.960 --> 0:37:11.920
<v Speaker 1>and all the pages about chemistry and physics and mathematical

0:37:11.960 --> 0:37:15.720
<v Speaker 1>techniques and astrophysics and so on. Then you'd have lots

0:37:15.719 --> 0:37:19.239
<v Speaker 1>of puzzle pieces that could potentially make lots of connections.

0:37:19.560 --> 0:37:21.759
<v Speaker 1>And you know this might lead to the retirement of

0:37:21.880 --> 0:37:25.840
<v Speaker 1>many scientists, or at minimum lead to a better use

0:37:25.960 --> 0:37:30.040
<v Speaker 1>of our time. There's a depressing sense in which each scientist,

0:37:30.080 --> 0:37:33.400
<v Speaker 1>each one of us, finds little pieces of the puzzle,

0:37:33.760 --> 0:37:37.000
<v Speaker 1>and in the twinkling of a single human lifetime, a

0:37:37.040 --> 0:37:41.160
<v Speaker 1>busy scientist might collect up a handful of different puzzle pieces.

0:37:41.800 --> 0:37:46.799
<v Speaker 1>The most voracious reader, the most assiduous worker, the most

0:37:46.800 --> 0:37:50.319
<v Speaker 1>creative synthesizer of ideas, can only hope to collect a

0:37:50.400 --> 0:37:53.840
<v Speaker 1>small number of puzzle pieces and pray that some of

0:37:53.880 --> 0:37:56.520
<v Speaker 1>them might fit together. So this is going to be

0:37:56.600 --> 0:38:02.960
<v Speaker 1>massively important. But I wanted to find two categories of

0:38:03.000 --> 0:38:06.080
<v Speaker 1>scientific discovery. The first is what I just described, which

0:38:06.120 --> 0:38:09.600
<v Speaker 1>is science where things that already exist in literature can

0:38:09.640 --> 0:38:13.680
<v Speaker 1>be pieced together. And let's call that level one discovery.

0:38:14.000 --> 0:38:16.759
<v Speaker 1>And these large language models will be awesome at level

0:38:16.760 --> 0:38:18.640
<v Speaker 1>one because they've read every paper and they have a

0:38:18.640 --> 0:38:22.360
<v Speaker 1>perfect memory. But I want to distinguish a second level

0:38:22.440 --> 0:38:25.719
<v Speaker 1>of scientific discovery, and this is the one I'm interested in.

0:38:26.000 --> 0:38:29.160
<v Speaker 1>I'll call this level two, and that is science that

0:38:29.239 --> 0:38:34.279
<v Speaker 1>requires conceptualization to get to the next step, not just

0:38:34.440 --> 0:38:39.200
<v Speaker 1>remixing what's already there. Conceptualization like when the young Albert

0:38:39.239 --> 0:38:43.200
<v Speaker 1>Einstein imagined something that he had never seen before. He

0:38:43.239 --> 0:38:45.920
<v Speaker 1>asked himself, what would it be like if I could

0:38:46.000 --> 0:38:49.040
<v Speaker 1>catch up with a beam of light and write it

0:38:49.120 --> 0:38:51.960
<v Speaker 1>like a surfer riding a wave. And this is how

0:38:52.000 --> 0:38:56.720
<v Speaker 1>he derived this special theory of relativity. This isn't something

0:38:56.760 --> 0:38:59.919
<v Speaker 1>he looked up and found three facts that clicked. Again,

0:39:00.640 --> 0:39:05.160
<v Speaker 1>he imagined he asked new questions. He tried out a

0:39:05.239 --> 0:39:08.800
<v Speaker 1>new model of the world, one in which time runs

0:39:08.840 --> 0:39:11.640
<v Speaker 1>differently depending on how fast you're going, and then he

0:39:11.800 --> 0:39:15.400
<v Speaker 1>worked backwards to see if that model could work. Or

0:39:15.480 --> 0:39:19.560
<v Speaker 1>consider when Charles Darwin thought about the species that he

0:39:19.640 --> 0:39:22.560
<v Speaker 1>saw around him, and he imagined all the species that

0:39:22.600 --> 0:39:26.000
<v Speaker 1>he didn't see but who might have existed, and he

0:39:26.040 --> 0:39:29.279
<v Speaker 1>was able to put together a new mental model in

0:39:29.320 --> 0:39:33.240
<v Speaker 1>which most species don't make it and we only see

0:39:33.280 --> 0:39:39.319
<v Speaker 1>those whose mutations cause survival advantages or reproductive advantages. These

0:39:39.360 --> 0:39:42.680
<v Speaker 1>weren't facts that he just collected from some papers. He

0:39:42.800 --> 0:39:47.320
<v Speaker 1>was trying out a new model of the world. Now

0:39:47.600 --> 0:39:50.920
<v Speaker 1>this kind of science isn't just for the big giant stuff.

0:39:51.040 --> 0:39:54.319
<v Speaker 1>Most meaningful science is actually driven by this kind of

0:39:54.800 --> 0:39:59.840
<v Speaker 1>imagination of new models. Just as one example, I recently

0:40:00.040 --> 0:40:03.359
<v Speaker 1>in an episode about whether time runs in slow motion

0:40:03.480 --> 0:40:06.600
<v Speaker 1>when you're in fear for your life. And so when

0:40:06.640 --> 0:40:10.120
<v Speaker 1>I wondered about this question, I realized there were two

0:40:10.320 --> 0:40:13.560
<v Speaker 1>hypotheses that might explain it, and I thought up an

0:40:13.600 --> 0:40:17.160
<v Speaker 1>experiment to discriminate those two hypotheses. And then we built

0:40:17.160 --> 0:40:21.400
<v Speaker 1>a wristband that flashes information at a particular speed and

0:40:21.480 --> 0:40:23.760
<v Speaker 1>had people wear, and we dropped them from one hundred

0:40:23.760 --> 0:40:26.640
<v Speaker 1>and fifty foot tall tower into a net below. A

0:40:26.920 --> 0:40:31.400
<v Speaker 1>large language model presumably couldn't do that because it's just

0:40:31.520 --> 0:40:35.279
<v Speaker 1>playing statistical word games. And unless someone had thought of

0:40:35.320 --> 0:40:40.480
<v Speaker 1>that experiment and written it down, JATGPT would never say, Okay,

0:40:40.520 --> 0:40:43.080
<v Speaker 1>here's a new framework, and how we can design an

0:40:43.120 --> 0:40:46.000
<v Speaker 1>experiment to put this to the test. So this is

0:40:46.040 --> 0:40:49.120
<v Speaker 1>what I wanted to find as the most meaningful test

0:40:49.640 --> 0:40:53.680
<v Speaker 1>for a human level of intelligence. When AI can do

0:40:54.239 --> 0:40:59.319
<v Speaker 1>science in this way, generating new ideas and frameworks, not

0:40:59.400 --> 0:41:05.000
<v Speaker 1>just clicking act together, then we will have matched human intelligence.

0:41:08.040 --> 0:41:09.719
<v Speaker 1>And I just want to take one more angle on

0:41:09.760 --> 0:41:13.000
<v Speaker 1>this to make the picture clear. The way a scientist

0:41:13.160 --> 0:41:17.440
<v Speaker 1>reads a journal paper is not simply by correlating words

0:41:17.440 --> 0:41:20.520
<v Speaker 1>and extracting keywords, although that might be part of it,

0:41:20.640 --> 0:41:24.759
<v Speaker 1>but also by realizing what was not said. Why did

0:41:24.800 --> 0:41:28.320
<v Speaker 1>the authors cut off the x axis here at thirty

0:41:28.680 --> 0:41:31.480
<v Speaker 1>What if they had extended this graph, would the line

0:41:31.560 --> 0:41:34.840
<v Speaker 1>have reversed in its trend? And why didn't the authors

0:41:34.920 --> 0:41:38.560
<v Speaker 1>mention the hypothesis of Smith at all? And does that

0:41:38.640 --> 0:41:42.080
<v Speaker 1>graph look too perfect? You know? One of my mentors,

0:41:42.120 --> 0:41:46.239
<v Speaker 1>Francis Krik operated under the assumption that he should disbelieve

0:41:46.400 --> 0:41:48.880
<v Speaker 1>twenty five percent of what he read in the literature.

0:41:49.400 --> 0:41:53.520
<v Speaker 1>Is this because of fraud or error, or statistical fluctuations

0:41:53.600 --> 0:41:57.239
<v Speaker 1>or manipulation or the waste basket effect? Who cares? The

0:41:57.280 --> 0:42:01.080
<v Speaker 1>bottom line is that the literature is rife with errors,

0:42:01.480 --> 0:42:06.160
<v Speaker 1>and depending on the field, some estimates put the ireproducibility

0:42:06.640 --> 0:42:11.719
<v Speaker 1>at fifty percent. So when scientists read papers they know this,

0:42:11.920 --> 0:42:15.920
<v Speaker 1>just as Francis Crick did. They read in an entirely

0:42:15.920 --> 0:42:20.360
<v Speaker 1>different manner than Google Translate or Watson or chat GPT

0:42:20.640 --> 0:42:25.640
<v Speaker 1>or any of the correlational methods they extrapolate. They read

0:42:25.680 --> 0:42:28.719
<v Speaker 1>the paper and wonder about other possibilities. They chew on

0:42:28.800 --> 0:42:32.440
<v Speaker 1>what's missing. They envision the next step. They think of

0:42:32.480 --> 0:42:37.240
<v Speaker 1>the next experiment that could confirm or disconfirm the hypotheses

0:42:37.600 --> 0:42:40.600
<v Speaker 1>and the frameworks in the paper. To my mind, the

0:42:40.640 --> 0:42:43.440
<v Speaker 1>meaningful goal of AI is not going to be found

0:42:43.480 --> 0:42:47.000
<v Speaker 1>in number crunching and looking for facts that click together.

0:42:47.600 --> 0:42:50.840
<v Speaker 1>It's going to often be something else. It's going to

0:42:50.880 --> 0:42:56.160
<v Speaker 1>require an AI that learns how humans think, how they behave,

0:42:56.360 --> 0:42:59.719
<v Speaker 1>what they don't say, what they didn't think of, what

0:42:59.760 --> 0:43:03.600
<v Speaker 1>they misthought about, what they should think about. And one

0:43:03.640 --> 0:43:06.640
<v Speaker 1>more thing, I should note that these different levels I've outlined,

0:43:07.040 --> 0:43:11.600
<v Speaker 1>from fitting facts together versus imagining new world models, they're

0:43:11.600 --> 0:43:15.959
<v Speaker 1>probably gonna end up with blurry boundaries. So maybe chat

0:43:16.000 --> 0:43:19.840
<v Speaker 1>GPT will come up with something, and you won't always

0:43:20.040 --> 0:43:24.799
<v Speaker 1>know whether it's piecing together a few disparate pieces in

0:43:24.880 --> 0:43:29.160
<v Speaker 1>the literature what I'm calling level one, or whether it's

0:43:29.800 --> 0:43:33.240
<v Speaker 1>come up with something that is truly a new world

0:43:33.360 --> 0:43:36.759
<v Speaker 1>model that's not a simple clicking together but a genuine

0:43:37.239 --> 0:43:40.800
<v Speaker 1>process of generating a new framework to explain the data.

0:43:40.920 --> 0:43:44.880
<v Speaker 1>So distinguishing the levels of discovery is probably not going

0:43:44.960 --> 0:43:47.600
<v Speaker 1>to be an easy task with a bright line between them,

0:43:48.080 --> 0:43:51.359
<v Speaker 1>but I think it will clarify some things to make

0:43:51.400 --> 0:43:55.799
<v Speaker 1>this distinction. And last thing, I don't necessarily know that

0:43:55.840 --> 0:43:59.520
<v Speaker 1>there's something magical and ineffable about the way that humans

0:43:59.560 --> 0:44:03.600
<v Speaker 1>do this. Presumably we're running algorithms too, it's just that

0:44:03.640 --> 0:44:07.800
<v Speaker 1>they're running on self configuring wetwear. I have seen tens

0:44:07.800 --> 0:44:10.880
<v Speaker 1>of thousands of science experiments in my career, so I

0:44:11.040 --> 0:44:14.279
<v Speaker 1>know the process of asking a question and figuring out

0:44:14.640 --> 0:44:17.000
<v Speaker 1>what we'll put it to the test. So we may

0:44:17.040 --> 0:44:19.120
<v Speaker 1>get to level two and it may be sooner than

0:44:19.160 --> 0:44:21.480
<v Speaker 1>we expect, but I just want to be clear that

0:44:21.719 --> 0:44:25.279
<v Speaker 1>right now we have not figured out the human algorithms.

0:44:25.680 --> 0:44:29.680
<v Speaker 1>So the current version of AI, as massively impressive as

0:44:29.760 --> 0:44:34.560
<v Speaker 1>it is, does not do level two scientific problem solving.

0:44:34.840 --> 0:44:37.720
<v Speaker 1>And that's when we're going to know that we've crossed

0:44:37.800 --> 0:44:41.319
<v Speaker 1>a new kind of line into a machine that is

0:44:41.520 --> 0:44:45.799
<v Speaker 1>truly intelligent. So let's wrap up. At least for now.

0:44:45.920 --> 0:44:48.480
<v Speaker 1>Humans still have to do the science, by which I

0:44:48.560 --> 0:44:52.400
<v Speaker 1>mean the conceptual work, wherein we take a framework for

0:44:52.560 --> 0:44:55.800
<v Speaker 1>understanding the world and we rethink it and we mentally

0:44:55.920 --> 0:44:59.320
<v Speaker 1>simulate whether a new model of the world could explain

0:44:59.400 --> 0:45:01.839
<v Speaker 1>the observed data, and we come up with a way

0:45:01.880 --> 0:45:05.200
<v Speaker 1>to test that new model. It's not just searching for facts.

0:45:05.640 --> 0:45:07.640
<v Speaker 1>So I'm definitely not saying we won't get to the

0:45:07.640 --> 0:45:11.640
<v Speaker 1>next level where AI can conceptualize things and predict forward

0:45:11.719 --> 0:45:14.600
<v Speaker 1>and build new knowledge. This might be a week from now,

0:45:14.719 --> 0:45:16.680
<v Speaker 1>or it might be a century from now. Who knows

0:45:17.040 --> 0:45:19.120
<v Speaker 1>how hard a problem that's going to turn out to be.

0:45:19.520 --> 0:45:21.640
<v Speaker 1>But I want us to be clear eyed on where

0:45:21.680 --> 0:45:26.400
<v Speaker 1>we are right now, because sometimes in the blindingly impressive

0:45:26.520 --> 0:45:29.640
<v Speaker 1>light of what current AI is doing, it can be

0:45:29.719 --> 0:45:33.960
<v Speaker 1>difficult to see, what's missing and where we might be heading.

0:45:38.360 --> 0:45:40.920
<v Speaker 1>That's all for this week. To find out more and

0:45:40.960 --> 0:45:43.680
<v Speaker 1>to share your thoughts, head over to eagleman dot com

0:45:43.719 --> 0:45:47.680
<v Speaker 1>slash Podcasts, and you can also watch full episodes of

0:45:47.760 --> 0:45:51.400
<v Speaker 1>Inner Cosmos on YouTube. Subscribe to my channel so you

0:45:51.440 --> 0:45:54.719
<v Speaker 1>can follow along each week for new updates. I'd love

0:45:54.760 --> 0:45:58.480
<v Speaker 1>to hear your questions, so please send those to podcast

0:45:58.640 --> 0:46:01.560
<v Speaker 1>at eagleman dot com and I will do a special

0:46:01.600 --> 0:46:05.480
<v Speaker 1>episode where I answer questions. Until next time. I'm David

0:46:05.480 --> 0:46:07.880
<v Speaker 1>Eagelman and this is Inner Cosmos