WEBVTT - Ep105 "What if AI is not actually intelligent?" (with Alison Gopnik)

0:00:05.160 --> 0:00:09.040
<v Speaker 1>Is AI an intelligent agent or is there a totally

0:00:09.039 --> 0:00:12.080
<v Speaker 1>different way that we should be thinking about this. Perhaps

0:00:12.080 --> 0:00:16.520
<v Speaker 1>it's more like a piece of cultural technology. What in

0:00:16.560 --> 0:00:21.360
<v Speaker 1>the world is cultural technology? And how would rethinking this

0:00:21.960 --> 0:00:25.160
<v Speaker 1>change the way we approach what to do next? And

0:00:25.200 --> 0:00:26.880
<v Speaker 1>what does any of this have to do with the

0:00:26.920 --> 0:00:31.479
<v Speaker 1>myth of the golom or Socrates, or the printing press

0:00:31.720 --> 0:00:39.000
<v Speaker 1>or Martin Luther or the story of stone soup. Welcome

0:00:39.000 --> 0:00:42.199
<v Speaker 1>to Intercosmos with me David Eagleman. I'm a neuroscientist and

0:00:42.240 --> 0:00:45.919
<v Speaker 1>an author at Stanford and in these episodes we examine

0:00:46.000 --> 0:00:49.720
<v Speaker 1>brains and the world around us to understand who we

0:00:49.840 --> 0:01:09.319
<v Speaker 1>are and where we're going. So let's start with the

0:01:09.520 --> 0:01:12.280
<v Speaker 1>appreciation that we are smack in the middle of the

0:01:12.280 --> 0:01:17.160
<v Speaker 1>most dramatic technological shift in human history. Every few weeks,

0:01:17.200 --> 0:01:21.119
<v Speaker 1>a new AI system is released that can answer questions

0:01:21.160 --> 0:01:25.479
<v Speaker 1>of increasing complexity. As we've all seen, it can write

0:01:25.520 --> 0:01:29.800
<v Speaker 1>beautiful prose. It can punch out incredibly good code for software.

0:01:30.319 --> 0:01:34.720
<v Speaker 1>It composes music, It mimics voices. It produces images so

0:01:34.880 --> 0:01:38.160
<v Speaker 1>realistic that we've long ago lost the ability to tell

0:01:38.200 --> 0:01:39.679
<v Speaker 1>if a photo is real or not.

0:01:40.160 --> 0:01:41.760
<v Speaker 2>And this increasingly applies.

0:01:41.440 --> 0:01:45.560
<v Speaker 1>To video as well, So with every leap forward, the

0:01:45.600 --> 0:01:51.440
<v Speaker 1>same questions become louder. Is this an intelligent agent? Is

0:01:51.480 --> 0:01:55.520
<v Speaker 1>it conscious? Will it one day surpass us? And if so,

0:01:55.600 --> 0:01:59.200
<v Speaker 1>what happens to us? Today's episode is about how to

0:01:59.280 --> 0:02:01.720
<v Speaker 1>think about this from an angle that.

0:02:01.640 --> 0:02:03.080
<v Speaker 2>Will probably surprise you.

0:02:03.440 --> 0:02:06.880
<v Speaker 1>So as it stands now, in the public conversation about AI,

0:02:07.200 --> 0:02:10.680
<v Speaker 1>we really have just one metaphor, which is that AI

0:02:10.960 --> 0:02:16.040
<v Speaker 1>is an intelligent agent and of increasing intelligence. We all

0:02:16.080 --> 0:02:20.320
<v Speaker 1>talk about these systems as digital minds that can reason

0:02:20.360 --> 0:02:24.240
<v Speaker 1>and plan and act and perhaps even at some point desire.

0:02:25.000 --> 0:02:28.320
<v Speaker 1>This narrative of an AI with its own mind has

0:02:28.360 --> 0:02:30.880
<v Speaker 1>always been with us in science fiction, of course, but

0:02:30.919 --> 0:02:34.440
<v Speaker 1>today we hear it constantly in policy conversations and in

0:02:34.520 --> 0:02:38.680
<v Speaker 1>media headlines. And whether the tone is optimistic or anxious,

0:02:39.040 --> 0:02:43.600
<v Speaker 1>the underlying premise is the same that these are minds

0:02:43.760 --> 0:02:46.079
<v Speaker 1>in the making, that we are witnessing the birth of

0:02:46.120 --> 0:02:50.799
<v Speaker 1>a new kind of intelligence. But what if that metaphor

0:02:51.400 --> 0:02:55.680
<v Speaker 1>is misleading so much so that it's sending our conversations,

0:02:56.000 --> 0:03:01.400
<v Speaker 1>our policy, our research priorities off course. So today's episode

0:03:01.639 --> 0:03:06.720
<v Speaker 1>is about reframing what large AI models really are and

0:03:06.760 --> 0:03:10.720
<v Speaker 1>what they aren't My guest today is Alison Gopnik. She's

0:03:10.760 --> 0:03:13.880
<v Speaker 1>a professor of psychology at Berkeley, very well known in

0:03:13.919 --> 0:03:18.040
<v Speaker 1>the areas of cognitive and language development. She studies infants

0:03:18.080 --> 0:03:21.760
<v Speaker 1>and young children to understand how learning takes place. And

0:03:21.800 --> 0:03:23.799
<v Speaker 1>she was just by the way, elected to the National

0:03:23.800 --> 0:03:26.919
<v Speaker 1>Academy of Sciences. But I'm talking with her today about

0:03:26.919 --> 0:03:29.560
<v Speaker 1>a new paper she co authored in the journal Science

0:03:29.840 --> 0:03:35.040
<v Speaker 1>about AI with colleagues Henry Ferrell, Cosmishalsi, and James Evans.

0:03:35.400 --> 0:03:37.120
<v Speaker 2>The paper argues that.

0:03:37.120 --> 0:03:40.920
<v Speaker 1>We should stop thinking of large models as intelligent agents

0:03:41.480 --> 0:03:45.480
<v Speaker 1>and instead see them as a new kind of cultural

0:03:45.560 --> 0:03:47.160
<v Speaker 1>and social technology.

0:03:47.520 --> 0:03:48.440
<v Speaker 2>Now what does that mean.

0:03:48.960 --> 0:03:50.560
<v Speaker 1>Well, I'll give you a quick preview and then we'll

0:03:50.600 --> 0:03:54.520
<v Speaker 1>jump into the interview. All throughout history, humans have built

0:03:54.520 --> 0:03:59.480
<v Speaker 1>tools to organize information and transmit it. Think of spoken

0:03:59.600 --> 0:04:06.480
<v Speaker 1>language and then writing, and then printing, libraries, television, the Internet.

0:04:06.720 --> 0:04:11.280
<v Speaker 1>Each of these systems reshaped human culture, not because they

0:04:11.280 --> 0:04:16.120
<v Speaker 1>were intelligent in themselves, but because they allowed information to

0:04:16.279 --> 0:04:21.279
<v Speaker 1>be shared and transformed and coordinated in new ways. Just

0:04:21.279 --> 0:04:25.280
<v Speaker 1>think of how the printing press amplified voices, or how

0:04:25.320 --> 0:04:30.440
<v Speaker 1>something like markets distill the messy complexity of economies into

0:04:30.760 --> 0:04:32.440
<v Speaker 1>a single price.

0:04:32.279 --> 0:04:34.760
<v Speaker 2>Signal, how much does this thing cost?

0:04:35.520 --> 0:04:39.400
<v Speaker 1>Or how bureaucracies take the chaos of signals and sort

0:04:39.440 --> 0:04:43.840
<v Speaker 1>it into categories. These are not minds, but they are

0:04:43.880 --> 0:04:48.880
<v Speaker 1>powerful technologies of culture, technologies that change how we all

0:04:49.000 --> 0:04:50.120
<v Speaker 1>think and how we.

0:04:50.040 --> 0:04:50.960
<v Speaker 2>Act and how we live.

0:04:51.240 --> 0:04:54.080
<v Speaker 1>So the argument we'll hear today is that large AI

0:04:54.200 --> 0:04:59.680
<v Speaker 1>models are best understood in this lineage. They don't think,

0:05:00.120 --> 0:05:05.160
<v Speaker 1>but they process the vast collective output of human thought.

0:05:05.760 --> 0:05:09.480
<v Speaker 1>They are trained on millions of texts and images and voices,

0:05:09.839 --> 0:05:13.839
<v Speaker 1>everything from Shakespeare to Reddit threads to government paperwork, and

0:05:13.880 --> 0:05:19.159
<v Speaker 1>they summarize and reorganize and remix that cultural data. And

0:05:19.200 --> 0:05:22.400
<v Speaker 1>they also surface patterns in the data that maybe hadn't

0:05:22.440 --> 0:05:25.479
<v Speaker 1>been seen before. And so when you're interacting with such

0:05:25.480 --> 0:05:28.320
<v Speaker 1>a piece of technology, let's say, asking it to write

0:05:28.360 --> 0:05:31.600
<v Speaker 1>you a poem or explain a concept, you're not talking

0:05:31.640 --> 0:05:36.520
<v Speaker 1>to a mind. You're participating with a kind of cultural

0:05:36.640 --> 0:05:40.520
<v Speaker 1>compression and recombination machine. So see what you think of

0:05:40.560 --> 0:05:44.000
<v Speaker 1>the perspective that you hear today, because it can change

0:05:44.080 --> 0:05:48.479
<v Speaker 1>our concerns and our eventual legislative approaches if we stop

0:05:48.560 --> 0:05:52.040
<v Speaker 1>assuming that these are minds and instead treat them as

0:05:52.600 --> 0:05:58.240
<v Speaker 1>cultural infrastructures like search engines or even democratic institutions, then

0:05:58.279 --> 0:06:00.839
<v Speaker 1>we can start asking the questions.

0:06:01.200 --> 0:06:03.240
<v Speaker 2>One quick thing before we jump into the interview.

0:06:03.760 --> 0:06:07.840
<v Speaker 1>You've heard of large language models llms, and more recently

0:06:08.000 --> 0:06:11.400
<v Speaker 1>large multimodal models that are trained on words and images

0:06:11.800 --> 0:06:14.880
<v Speaker 1>and increasingly other data as well. So nowadays we just

0:06:14.960 --> 0:06:18.640
<v Speaker 1>refer to these as large models. So here's my interview

0:06:18.640 --> 0:06:25.560
<v Speaker 1>with Alison Gothnik. So, Alison, before we get started talking

0:06:25.560 --> 0:06:29.120
<v Speaker 1>about AI, you have built a very wonderful career studying

0:06:29.720 --> 0:06:32.640
<v Speaker 1>scientists who are unusually small and spend most of their

0:06:32.680 --> 0:06:34.320
<v Speaker 1>time lying down, So tell us about that.

0:06:35.080 --> 0:06:39.000
<v Speaker 3>So what I've always been most interested in is how

0:06:39.080 --> 0:06:41.440
<v Speaker 3>is it that people can figure out the world around them?

0:06:41.480 --> 0:06:44.039
<v Speaker 3>How is it that human beings with just a bunch

0:06:44.120 --> 0:06:47.440
<v Speaker 3>of photons hitting our eyes and little disturbances of air

0:06:47.560 --> 0:06:50.000
<v Speaker 3>at our ears, nevertheless, we know about a world of

0:06:50.040 --> 0:06:54.360
<v Speaker 3>people and objects and ultimately quarks and distant planets. How

0:06:54.360 --> 0:06:56.000
<v Speaker 3>could we ever do that? How could we ever learn

0:06:56.040 --> 0:06:58.359
<v Speaker 3>so much from so little? And of course the people

0:06:58.360 --> 0:07:01.400
<v Speaker 3>who are doing that more than anyone else are little children.

0:07:01.760 --> 0:07:04.920
<v Speaker 3>So for the past forty years, what I've been doing

0:07:04.960 --> 0:07:07.240
<v Speaker 3>is trying to figure out how is it that even

0:07:07.279 --> 0:07:10.480
<v Speaker 3>little children can learn so much so quickly from such

0:07:10.920 --> 0:07:14.600
<v Speaker 3>little information. And one of the questions is what kinds

0:07:14.600 --> 0:07:17.080
<v Speaker 3>of computations, what's going on in their brains? What are

0:07:17.120 --> 0:07:21.400
<v Speaker 3>their brains and minds doing that lets them solve these

0:07:21.520 --> 0:07:25.119
<v Speaker 3>really deep problems so quickly and so effectively. And that's

0:07:25.160 --> 0:07:28.239
<v Speaker 3>been the central idea in my career. And it's turned

0:07:28.240 --> 0:07:31.679
<v Speaker 3>out that by looking at kids empirically, by actually studying

0:07:31.680 --> 0:07:34.960
<v Speaker 3>them as scientists, we've discovered that they both know more

0:07:35.000 --> 0:07:37.160
<v Speaker 3>and learn more than we ever would have thought before.

0:07:37.200 --> 0:07:38.480
<v Speaker 3>They're the best learners.

0:07:38.120 --> 0:07:39.320
<v Speaker 4>That we know of in the universe.

0:07:39.600 --> 0:07:40.080
<v Speaker 2>Amazing.

0:07:40.400 --> 0:07:44.520
<v Speaker 1>So we're in this quite remarkable time where for both

0:07:44.520 --> 0:07:48.360
<v Speaker 1>of us we've been doing, you know, the same research

0:07:48.440 --> 0:07:51.640
<v Speaker 1>that we've been doing for many decades, and one might

0:07:51.680 --> 0:07:54.200
<v Speaker 1>have thought five years ago, okay, we'll probably be doing

0:07:54.240 --> 0:07:57.080
<v Speaker 1>that in twenty twenty five, But suddenly the world has

0:07:57.120 --> 0:08:02.120
<v Speaker 1>really changed around us because of A and so you

0:08:02.160 --> 0:08:03.800
<v Speaker 1>and I both are spending a lot of our time

0:08:03.920 --> 0:08:07.080
<v Speaker 1>writing about that and thinking about how to position AI,

0:08:07.160 --> 0:08:09.680
<v Speaker 1>how to understand what it does and does not mean.

0:08:10.440 --> 0:08:13.040
<v Speaker 1>So a lot of people, of course, are concerned about

0:08:14.360 --> 0:08:17.560
<v Speaker 1>super intelligence and the alignment problem, and so on. But

0:08:17.640 --> 0:08:20.880
<v Speaker 1>you and your colleagues have a quite different take that

0:08:20.920 --> 0:08:24.520
<v Speaker 1>you just wrote up in the journal Science in March,

0:08:25.040 --> 0:08:27.200
<v Speaker 1>and I thought it was a really lovely paper. So

0:08:27.240 --> 0:08:29.480
<v Speaker 1>that's what I want to ask you about. So you

0:08:29.760 --> 0:08:33.000
<v Speaker 1>are talking about the right way to look at large

0:08:33.040 --> 0:08:36.480
<v Speaker 1>models is as a social and cultural technology.

0:08:36.559 --> 0:08:38.840
<v Speaker 2>So let's unpack that, right.

0:08:38.960 --> 0:08:41.400
<v Speaker 3>So, as I said, you know, my career has been

0:08:41.440 --> 0:08:43.040
<v Speaker 3>about how could we learn as much as we do?

0:08:43.160 --> 0:08:44.880
<v Speaker 3>And how do children learn as much as they do?

0:08:45.320 --> 0:08:48.400
<v Speaker 3>And part of that has always been if we wanted

0:08:48.400 --> 0:08:51.040
<v Speaker 3>to design a computer or design an artificial system that

0:08:51.040 --> 0:08:53.360
<v Speaker 3>could learn the way children do, what would that system

0:08:53.440 --> 0:08:55.720
<v Speaker 3>look like, what could we put in, what kinds of

0:08:55.760 --> 0:08:59.440
<v Speaker 3>things would it have to do? So for twenty years

0:08:59.520 --> 0:09:02.560
<v Speaker 3>I've been laborating with computer scientists about what would that

0:09:02.640 --> 0:09:05.360
<v Speaker 3>kind of artificial system look like? But as you say,

0:09:05.760 --> 0:09:07.920
<v Speaker 3>even though this has been a long project, in the

0:09:08.000 --> 0:09:11.880
<v Speaker 3>last five years or so, these advances in AI have

0:09:12.000 --> 0:09:14.480
<v Speaker 3>really made us think about that in a different way.

0:09:14.720 --> 0:09:17.480
<v Speaker 3>And one of the interesting things is the big advances

0:09:17.520 --> 0:09:20.559
<v Speaker 3>have been in machine learning. They've actually been in designing

0:09:20.640 --> 0:09:23.960
<v Speaker 3>systems that don't just know things, but can learn things.

0:09:24.040 --> 0:09:26.160
<v Speaker 3>And as I say, children are the best learners we

0:09:26.160 --> 0:09:28.480
<v Speaker 3>know of in the universe. So there's been a really

0:09:28.520 --> 0:09:31.120
<v Speaker 3>interesting development, which is a lot of the people in

0:09:31.160 --> 0:09:35.720
<v Speaker 3>AI have been turning to developmental psychologists like me to say, look,

0:09:35.760 --> 0:09:38.199
<v Speaker 3>could we get some clues from how children are learning

0:09:38.440 --> 0:09:41.640
<v Speaker 3>to design systems that could learn that could learn in

0:09:41.640 --> 0:09:45.400
<v Speaker 3>the same way. Now, the interesting thing is that what

0:09:45.679 --> 0:09:49.079
<v Speaker 3>actually has happened in AI, specifically in the last five

0:09:49.160 --> 0:09:51.760
<v Speaker 3>years or so are these large models, these large language

0:09:51.800 --> 0:09:55.120
<v Speaker 3>models and more recently large language and vision models, and

0:09:55.160 --> 0:09:57.880
<v Speaker 3>they are the things that have really revolutionized our everyday

0:09:57.920 --> 0:10:02.000
<v Speaker 3>interactions with AI. It's important to say those are really

0:10:02.160 --> 0:10:05.480
<v Speaker 3>really different from children, and in fact, they're different from humans.

0:10:05.520 --> 0:10:08.440
<v Speaker 3>They're doing something that I think is really really different

0:10:08.480 --> 0:10:15.280
<v Speaker 3>from human intelligence. And it's very natural for people to think, oh, okay,

0:10:15.320 --> 0:10:18.000
<v Speaker 3>look I talked to chatchept and I get an answer back,

0:10:18.320 --> 0:10:20.720
<v Speaker 3>it must have the same kind of intelligence that my

0:10:20.800 --> 0:10:23.320
<v Speaker 3>friend does or my child does. And it turns out

0:10:23.400 --> 0:10:26.480
<v Speaker 3>that that's not true. Those systems are really really different.

0:10:26.760 --> 0:10:28.600
<v Speaker 1>So before we go on, let's unpack that a little

0:10:28.640 --> 0:10:30.080
<v Speaker 1>bit in what ways are they different?

0:10:30.320 --> 0:10:34.920
<v Speaker 3>So a very common kind of model for how a

0:10:34.920 --> 0:10:37.640
<v Speaker 3>AI works is to think of it as if something

0:10:37.679 --> 0:10:40.880
<v Speaker 3>like chatcheept is an agent, an intelligent agent in the world,

0:10:41.000 --> 0:10:43.880
<v Speaker 3>like a person or even an animal that you know about.

0:10:45.160 --> 0:10:48.960
<v Speaker 3>But that's actually, I think, an illusion. That's what we've argued.

0:10:49.480 --> 0:10:51.720
<v Speaker 3>A better way to think about it is that as

0:10:51.800 --> 0:10:55.600
<v Speaker 3>long as we've been human, we've learned from other people,

0:10:55.800 --> 0:10:59.120
<v Speaker 3>and we've had great technological advances that have helped us

0:10:59.440 --> 0:11:02.400
<v Speaker 3>to learn more effectively for more and more people. So

0:11:02.440 --> 0:11:07.000
<v Speaker 3>if you think about language itself, or writing or print,

0:11:07.360 --> 0:11:11.000
<v Speaker 3>those are all examples of technological changes that made us

0:11:11.080 --> 0:11:13.880
<v Speaker 3>able to get more information from others. And what the

0:11:13.960 --> 0:11:16.600
<v Speaker 3>large models do is not go out into the world

0:11:16.600 --> 0:11:19.560
<v Speaker 3>and learn and think the way that babies do. What

0:11:19.600 --> 0:11:24.960
<v Speaker 3>they do is summarize information that human beings have actually

0:11:25.000 --> 0:11:28.000
<v Speaker 3>already discovered. So what they do is take all the

0:11:28.040 --> 0:11:30.440
<v Speaker 3>information and knowledge that human beings have put out on

0:11:30.840 --> 0:11:34.800
<v Speaker 3>the web essentially, and then summarize that in a way

0:11:34.840 --> 0:11:38.440
<v Speaker 3>that lets other people access it more efficiently. So it's

0:11:38.600 --> 0:11:42.520
<v Speaker 3>much more the technological development is much more like something

0:11:42.600 --> 0:11:45.400
<v Speaker 3>like writing that lets you find out what other people

0:11:45.400 --> 0:11:48.760
<v Speaker 3>are thinking than it is creating a system that could

0:11:48.800 --> 0:11:49.800
<v Speaker 3>learn and think itself.

0:11:50.160 --> 0:11:50.439
<v Speaker 2>Yeah.

0:11:50.720 --> 0:11:53.200
<v Speaker 1>In a previous episode of Intercosmos. They did a calculation

0:11:53.280 --> 0:11:56.920
<v Speaker 1>showing that the amount of information that one of these

0:11:57.000 --> 0:12:01.440
<v Speaker 1>dllms consumes would take you one thousand lifetimes.

0:12:00.880 --> 0:12:01.679
<v Speaker 2>For you to read.

0:12:02.600 --> 0:12:07.080
<v Speaker 1>And so it's consumed more than you could ever imagine.

0:12:07.280 --> 0:12:11.280
<v Speaker 1>And what it's doing fundamentally is when you ask it

0:12:11.320 --> 0:12:13.959
<v Speaker 1>a question, it's giving you an echo of the human

0:12:14.000 --> 0:12:16.200
<v Speaker 1>intelligence that's already in there. So I call this the

0:12:16.240 --> 0:12:19.320
<v Speaker 1>echo intelligence solution, where we feel like, wow, that thing's

0:12:19.360 --> 0:12:23.600
<v Speaker 1>really smart. But it's not smart. It's not smart in

0:12:23.640 --> 0:12:27.640
<v Speaker 1>the same way that a human is. It's taking advantage

0:12:27.679 --> 0:12:29.480
<v Speaker 1>of all the things that are already out there.

0:12:29.520 --> 0:12:31.760
<v Speaker 3>So here's here's a way I like to I like

0:12:31.840 --> 0:12:35.440
<v Speaker 3>to convey this. I think you know, storytelling, as you know,

0:12:35.600 --> 0:12:38.720
<v Speaker 3>is really important. So here's two stories you could tell

0:12:39.040 --> 0:12:42.840
<v Speaker 3>about how current ani work. So one story is sort

0:12:42.880 --> 0:12:45.800
<v Speaker 3>of the story of the Gollum, right, the Rabbi of

0:12:45.880 --> 0:12:49.319
<v Speaker 3>Progue and the Gollum. You create this artificial system and

0:12:49.960 --> 0:12:53.000
<v Speaker 3>it's magical and you put special magic in it, and

0:12:53.040 --> 0:12:55.080
<v Speaker 3>then it turns into something that's almost alive.

0:12:55.200 --> 0:12:57.839
<v Speaker 1>And it's interesting for anyone who doesn't know about the

0:12:57.840 --> 0:13:00.320
<v Speaker 1>story of the Gallum. That was a figure made of clay.

0:13:01.240 --> 0:13:04.959
<v Speaker 1>He was brought to life and defended the community.

0:13:05.760 --> 0:13:09.440
<v Speaker 3>But then, well, it turns out that these stories about

0:13:09.480 --> 0:13:12.319
<v Speaker 3>what would happen if you had something that wasn't human,

0:13:12.360 --> 0:13:15.120
<v Speaker 3>that was artificial that you brought to life are really

0:13:15.160 --> 0:13:18.760
<v Speaker 3>ancient there, way before even the Industrial Revolution. And I

0:13:18.760 --> 0:13:21.200
<v Speaker 3>can tell you right now it never ends well, the

0:13:21.320 --> 0:13:26.040
<v Speaker 3>story always variably. The end of the story is that

0:13:26.280 --> 0:13:29.719
<v Speaker 3>some terrible thing happens and the column goes mad and

0:13:30.640 --> 0:13:32.240
<v Speaker 3>causes trouble and chaos, and.

0:13:32.200 --> 0:13:36.200
<v Speaker 1>That inspired Frankenstein some hundreds of years later. They don't

0:13:36.200 --> 0:13:36.960
<v Speaker 1>have the same character.

0:13:37.200 --> 0:13:41.400
<v Speaker 3>So there's this very basic human fear about what would

0:13:41.440 --> 0:13:44.240
<v Speaker 3>it be like if there was something that wasn't actually

0:13:44.360 --> 0:13:46.920
<v Speaker 3>living that you treated as if it was living, as

0:13:47.000 --> 0:13:47.760
<v Speaker 3>you treated as.

0:13:47.640 --> 0:13:48.400
<v Speaker 4>If it was an agent.

0:13:48.440 --> 0:13:51.920
<v Speaker 3>And I think that basic picture, that's the sci fi picture,

0:13:52.080 --> 0:13:54.920
<v Speaker 3>that's the picture that a lot of people, including people

0:13:54.920 --> 0:13:57.680
<v Speaker 3>in the AI world themselves, have about what's happened in Ai.

0:13:58.320 --> 0:14:01.560
<v Speaker 3>Here's a really different story, also, a different ancient story.

0:14:01.600 --> 0:14:04.240
<v Speaker 3>This is the story of stone Soup. So what's the

0:14:04.280 --> 0:14:06.280
<v Speaker 3>story of stone soup. The story of stone Soup is

0:14:06.280 --> 0:14:09.960
<v Speaker 3>there's visitors who come to a village and they say,

0:14:10.000 --> 0:14:12.280
<v Speaker 3>we'd like some food and the villagers say, no, we

0:14:12.320 --> 0:14:14.360
<v Speaker 3>don't have any extra food. And they say, it's okay,

0:14:14.360 --> 0:14:16.319
<v Speaker 3>we're going to make stone soup. And they take out

0:14:16.320 --> 0:14:18.880
<v Speaker 3>a big pot. They put a couple of stones in it,

0:14:19.160 --> 0:14:21.360
<v Speaker 3>they put some water in, they start to boil it

0:14:21.480 --> 0:14:23.240
<v Speaker 3>up and they say, this will be delicious. We're going

0:14:23.280 --> 0:14:25.240
<v Speaker 3>to make stone soup just with these stones, and the

0:14:25.280 --> 0:14:28.760
<v Speaker 3>villagers say really. They say, yeah, it would be even

0:14:28.840 --> 0:14:30.960
<v Speaker 3>better if we had an onion and a carrot in it,

0:14:31.000 --> 0:14:32.360
<v Speaker 3>but if we don't, we don't.

0:14:32.400 --> 0:14:33.520
<v Speaker 4>And the villager says.

0:14:33.440 --> 0:14:35.360
<v Speaker 3>I think I have an onion and a carrot somewhere,

0:14:35.440 --> 0:14:38.360
<v Speaker 3>and they go and put it in and then they say,

0:14:38.480 --> 0:14:40.320
<v Speaker 3>you know, when we made this for the rich people,

0:14:40.400 --> 0:14:42.800
<v Speaker 3>we put barley and buttermilk in it, which makes it

0:14:42.840 --> 0:14:45.800
<v Speaker 3>even better. But it's okay, it'll still be good stone soup.

0:14:45.840 --> 0:14:48.360
<v Speaker 3>And another villager goes and gets the barley and buttermilk,

0:14:48.400 --> 0:14:49.360
<v Speaker 3>and you can imagine.

0:14:49.080 --> 0:14:49.640
<v Speaker 4>How this goes.

0:14:49.680 --> 0:14:52.320
<v Speaker 3>And they say, the king said that we should put

0:14:52.320 --> 0:14:54.480
<v Speaker 3>a chicken in it, which would make it really royal,

0:14:54.960 --> 0:14:57.760
<v Speaker 3>but we don't have any chicken. So another villager goes

0:14:57.760 --> 0:14:59.760
<v Speaker 3>and gets the chicken from the back, and by the

0:14:59.800 --> 0:15:02.480
<v Speaker 3>time they're done of course, they have this really wonderful

0:15:02.520 --> 0:15:05.440
<v Speaker 3>soup with all the contributions from all the villagers, and

0:15:05.600 --> 0:15:07.720
<v Speaker 3>they go to eat it, and the villagers say, this

0:15:07.880 --> 0:15:10.680
<v Speaker 3>is amazing. There's this wonderful soup and it was just

0:15:10.800 --> 0:15:13.600
<v Speaker 3>made from stones. Okay, here's the modern version of this.

0:15:14.000 --> 0:15:17.280
<v Speaker 3>There's a bunch of tech guys and they go to

0:15:17.320 --> 0:15:20.600
<v Speaker 3>the village of computer users and they say, we're going

0:15:20.680 --> 0:15:25.360
<v Speaker 3>to make artificial general intelligence just from gradient descent and

0:15:25.600 --> 0:15:30.280
<v Speaker 3>transformers and a few algorithms. And the computer used to say,

0:15:30.320 --> 0:15:33.120
<v Speaker 3>that sounds great. We're gonna have artificial general intelligence. And

0:15:33.160 --> 0:15:34.920
<v Speaker 3>they say, yeah, but it would be better if we

0:15:34.960 --> 0:15:37.000
<v Speaker 3>had more data. What we need for this, as you

0:15:37.120 --> 0:15:39.360
<v Speaker 3>just said, David, is lots and lots of data.

0:15:39.440 --> 0:15:41.520
<v Speaker 4>Could you guys put all of.

0:15:41.440 --> 0:15:44.120
<v Speaker 3>Your texts and pictures on the internet for us and

0:15:44.160 --> 0:15:45.840
<v Speaker 3>then let us use them to.

0:15:45.800 --> 0:15:48.240
<v Speaker 4>Train our systems, And the computer user to say, oh,

0:15:48.280 --> 0:15:49.000
<v Speaker 4>that sounds good.

0:15:49.080 --> 0:15:51.960
<v Speaker 3>We'll just keep putting more of our pictures and our

0:15:52.440 --> 0:15:56.680
<v Speaker 3>writings and our books on the internet, and I guess

0:15:56.720 --> 0:15:59.360
<v Speaker 3>you can just use them all for free, and then

0:15:59.480 --> 0:16:02.840
<v Speaker 3>the then the tech person, oh, this is really good.

0:16:02.840 --> 0:16:04.600
<v Speaker 3>This is getting to be more intelligent. But you know,

0:16:04.640 --> 0:16:06.840
<v Speaker 3>it still says really stupid things. A lot of the

0:16:06.840 --> 0:16:10.760
<v Speaker 3>time it says weird things. So what we could do

0:16:10.880 --> 0:16:13.360
<v Speaker 3>is reinforcement learning from human feedback, which is actually a

0:16:13.440 --> 0:16:16.360
<v Speaker 3>really important part of these systems. What we'll do is

0:16:16.400 --> 0:16:19.760
<v Speaker 3>we'll give them to humans and then people can say

0:16:19.800 --> 0:16:21.880
<v Speaker 3>whether what they're saying is good or not, and then

0:16:21.920 --> 0:16:23.280
<v Speaker 3>we'll use that for the training.

0:16:24.120 --> 0:16:27.000
<v Speaker 4>The computer used to say, oh, okay, we're happy to

0:16:27.080 --> 0:16:27.440
<v Speaker 4>do that.

0:16:27.560 --> 0:16:29.640
<v Speaker 3>We'll actually go out and say whether this is good

0:16:29.760 --> 0:16:30.960
<v Speaker 3>or not. There's a whole and this.

0:16:31.000 --> 0:16:31.840
<v Speaker 4>Is literally true.

0:16:31.880 --> 0:16:34.920
<v Speaker 3>There are whole villages in Kenya that we'll do this

0:16:35.080 --> 0:16:36.880
<v Speaker 3>for very small amounts of money.

0:16:37.840 --> 0:16:39.560
<v Speaker 4>And the tech pro said, oh.

0:16:39.440 --> 0:16:42.760
<v Speaker 3>Look see it's even smarter, but it's still saying really

0:16:42.760 --> 0:16:46.440
<v Speaker 3>stupid things sometimes. How about if you did prompt engineering.

0:16:46.520 --> 0:16:49.640
<v Speaker 3>So think really hard about exactly how to ask it

0:16:49.680 --> 0:16:52.720
<v Speaker 3>the right questions so that you can get the right answers,

0:16:52.720 --> 0:16:54.760
<v Speaker 3>because otherwise it's going to say stupid things. And the

0:16:54.880 --> 0:16:57.320
<v Speaker 3>users say, oh, okay, we'll do that. We'll sit down

0:16:57.320 --> 0:17:00.120
<v Speaker 3>and we'll figure out how to do prompt engineering. At

0:17:00.160 --> 0:17:04.160
<v Speaker 3>the end of this process, the tech bro say, seeing,

0:17:04.280 --> 0:17:07.320
<v Speaker 3>we told you we made artificial general intelligence and it

0:17:07.400 --> 0:17:10.439
<v Speaker 3>was just from a few algorithms and the computer you

0:17:10.560 --> 0:17:13.439
<v Speaker 3>to say that's amazing, that's amazing, We're going to have

0:17:13.560 --> 0:17:17.240
<v Speaker 3>artificial intelligence, and it's just you, brilliant tech guys who

0:17:17.400 --> 0:17:20.080
<v Speaker 3>invented it. So, of course, the point of this is

0:17:20.119 --> 0:17:23.040
<v Speaker 3>that it's a sort of debunking story, but it's also

0:17:23.400 --> 0:17:26.239
<v Speaker 3>in both versions, a positive story, because the point is

0:17:26.280 --> 0:17:30.159
<v Speaker 3>when you have a combination of lots and lots of contributions,

0:17:30.200 --> 0:17:34.159
<v Speaker 3>of lots and lots of intelligent people, lots of humans

0:17:34.160 --> 0:17:36.760
<v Speaker 3>who we know are intelligent, both in terms of the

0:17:36.840 --> 0:17:39.280
<v Speaker 3>data they provide and in terms of things like reinforcement

0:17:39.359 --> 0:17:42.840
<v Speaker 3>learning and from human feedback and prompt engineering, you've got

0:17:42.920 --> 0:17:46.600
<v Speaker 3>something that that's bigger than any individual human could have.

0:17:47.080 --> 0:17:49.520
<v Speaker 3>But it's not that what you've got is a gall

0:17:49.760 --> 0:17:51.600
<v Speaker 3>it's not that what you've got is an agent that's

0:17:51.640 --> 0:17:54.479
<v Speaker 3>gone out and been intelligent itself. It's really just a

0:17:54.520 --> 0:17:58.080
<v Speaker 3>system for putting together the thoughts of other agents.

0:17:58.320 --> 0:18:01.800
<v Speaker 1>So the lesson that surfaces here is that although we

0:18:01.960 --> 0:18:07.280
<v Speaker 1>humans love to anthromorphize things and love to make inanimate

0:18:07.320 --> 0:18:12.000
<v Speaker 1>objects into agents in our minds, it's probably not the

0:18:12.080 --> 0:18:15.199
<v Speaker 1>right way to think about these language models. Or let

0:18:15.240 --> 0:18:19.040
<v Speaker 1>me say, these large models, language, vision, multimodal. So what

0:18:19.160 --> 0:18:21.280
<v Speaker 1>is the right way to think about it? So let's

0:18:21.320 --> 0:18:24.080
<v Speaker 1>really unpack this issue about what a social or cultural

0:18:24.119 --> 0:18:25.080
<v Speaker 1>technology is.

0:18:25.400 --> 0:18:31.800
<v Speaker 3>Yeah, so, ever since we've been human, we've made progress

0:18:32.040 --> 0:18:36.040
<v Speaker 3>by learning from other humans, and a number of people

0:18:36.640 --> 0:18:39.320
<v Speaker 3>like Joseph Henrick, for example, and Brob Boyd have argued

0:18:39.359 --> 0:18:41.760
<v Speaker 3>that that's kind of our human our great human gift,

0:18:42.680 --> 0:18:45.520
<v Speaker 3>that's really our secret sauce is not so much that

0:18:45.560 --> 0:18:49.520
<v Speaker 3>we can individually learn things that other creatures can't, although

0:18:49.520 --> 0:18:51.400
<v Speaker 3>I think that's part of it, but that we can

0:18:51.440 --> 0:18:54.119
<v Speaker 3>take advantage of all the things that other humans have

0:18:54.240 --> 0:18:57.440
<v Speaker 3>done over many, many, many generations. I like to think

0:18:57.440 --> 0:19:00.960
<v Speaker 3>of this in terms of the postmenopausal grandmother. I think,

0:19:01.200 --> 0:19:03.040
<v Speaker 3>you know, one of the distinctive human things is that

0:19:03.040 --> 0:19:05.320
<v Speaker 3>we have these postmenopausal grandmothers, and a lot of what

0:19:05.359 --> 0:19:08.040
<v Speaker 3>they do is tell us about the things that they've

0:19:08.400 --> 0:19:12.440
<v Speaker 3>learned in their long, wise lives. And by taking advantage

0:19:12.480 --> 0:19:15.719
<v Speaker 3>of what granny says, you can make progress, even if

0:19:15.760 --> 0:19:18.000
<v Speaker 3>what you're doing is now finding new things that you

0:19:18.000 --> 0:19:22.560
<v Speaker 3>will tell your grandchildren. And that capacity is really the

0:19:22.600 --> 0:19:26.600
<v Speaker 3>capacity that makes us special. And we've had special technologies

0:19:26.680 --> 0:19:30.159
<v Speaker 3>ever since we evolved, that tuned up that capacity, that

0:19:30.200 --> 0:19:33.720
<v Speaker 3>made it more powerful. So language itself, of course, which

0:19:33.760 --> 0:19:36.520
<v Speaker 3>is one of our distinctive things about humans, lets us

0:19:36.600 --> 0:19:39.280
<v Speaker 3>learn from others. But even more if you think about

0:19:39.280 --> 0:19:42.679
<v Speaker 3>something like the invention of writing that enabled us to

0:19:42.760 --> 0:19:45.040
<v Speaker 3>learn from you know, not just our own granny, but

0:19:45.080 --> 0:19:47.679
<v Speaker 3>granny's who were far away in space and in time,

0:19:48.080 --> 0:19:52.080
<v Speaker 3>and it's fascinating. Socrates famously has a whole section about

0:19:52.080 --> 0:19:55.800
<v Speaker 3>why he thinks writing is a terrible idea because exactly

0:19:55.880 --> 0:19:58.480
<v Speaker 3>because he thinks, people will read something in a book

0:19:58.520 --> 0:20:00.879
<v Speaker 3>and they'll think it's actually a person. They'll think that

0:20:00.960 --> 0:20:04.520
<v Speaker 3>it's a person who said this, and it's not. It's

0:20:04.640 --> 0:20:06.600
<v Speaker 3>just something that's written in a book. You won't be

0:20:06.640 --> 0:20:09.560
<v Speaker 3>able to have Socratic dialogues with something that's written in

0:20:09.600 --> 0:20:13.120
<v Speaker 3>a book. It's not really a person, but because it's language,

0:20:13.440 --> 0:20:16.720
<v Speaker 3>will treat it as if it's a person. So writing

0:20:16.880 --> 0:20:20.000
<v Speaker 3>is a good example of something that even though the

0:20:20.040 --> 0:20:24.040
<v Speaker 3>books aren't intelligent, the books in some sense don't know things.

0:20:24.160 --> 0:20:28.000
<v Speaker 3>In another sense, we know things because of books a way.

0:20:28.040 --> 0:20:30.119
<v Speaker 3>I think if this, sometimes they suppose someone asks you,

0:20:30.760 --> 0:20:34.680
<v Speaker 3>who knows more me or the UC Berkeley library. Well,

0:20:35.000 --> 0:20:37.600
<v Speaker 3>the library has much more knowledge in it. It's got

0:20:37.800 --> 0:20:40.159
<v Speaker 3>vast amounts of knowledge that I could never actually have

0:20:40.280 --> 0:20:43.160
<v Speaker 3>in my head. But it's not the sort of thing

0:20:43.200 --> 0:20:45.639
<v Speaker 3>that knows. I'm the sort of person who knows and

0:20:45.880 --> 0:20:48.920
<v Speaker 3>I know things because I can do things like consult

0:20:48.960 --> 0:20:52.000
<v Speaker 3>the library. And then you have print, which is even

0:20:52.119 --> 0:20:55.680
<v Speaker 3>more powerful and has even more powerful effects. You have

0:20:56.800 --> 0:21:00.600
<v Speaker 3>video and film. You have pictures, which I think are

0:21:00.640 --> 0:21:03.560
<v Speaker 3>a really important medium that we don't pay enough attention to.

0:21:03.920 --> 0:21:07.040
<v Speaker 3>So when we talk about vision models, for example, they're

0:21:07.040 --> 0:21:10.000
<v Speaker 3>not actually using vision. What they're using is all the

0:21:10.000 --> 0:21:12.199
<v Speaker 3>pictures that we put on the Internet, and pictures are

0:21:12.240 --> 0:21:16.400
<v Speaker 3>a really important source of communication to So to say

0:21:16.440 --> 0:21:19.119
<v Speaker 3>that it's a cultural technology, to say it's one of

0:21:19.160 --> 0:21:22.199
<v Speaker 3>these technologies that lets humans learn from other humans is

0:21:22.240 --> 0:21:24.840
<v Speaker 3>not at all to dismiss it. Those cultural technologies are

0:21:24.840 --> 0:21:27.480
<v Speaker 3>the things that have led, for better or for worse,

0:21:27.600 --> 0:21:30.000
<v Speaker 3>to the world that we have now. But it's just

0:21:30.160 --> 0:21:32.920
<v Speaker 3>a really different thing. It's what philosophers would call a

0:21:32.960 --> 0:21:36.800
<v Speaker 3>category mistake to think that it's like an intelligent agent,

0:21:36.800 --> 0:21:38.359
<v Speaker 3>which is not to say that at some point in

0:21:38.400 --> 0:21:42.560
<v Speaker 3>the future AI might not develop intelligent agents, but that's

0:21:42.600 --> 0:21:44.600
<v Speaker 3>not what the large models are doing, and that's a

0:21:44.680 --> 0:21:47.880
<v Speaker 3>much much harder lift, something we're much further away from

0:21:48.000 --> 0:21:55.800
<v Speaker 3>than this fantastic, interesting, powerful new cultural technology that we've invented.

0:22:11.920 --> 0:22:14.280
<v Speaker 1>So when we're thinking about cultural technologies, those are all

0:22:14.440 --> 0:22:17.680
<v Speaker 1>amazing examples that you gave about the invention of writing

0:22:18.800 --> 0:22:21.680
<v Speaker 1>and the printing press and then the Internet and so on.

0:22:22.640 --> 0:22:25.480
<v Speaker 1>In the paper that you wrote with your colleagues, you

0:22:25.560 --> 0:22:31.000
<v Speaker 1>mentioned other things also, like markets and bureaucracies and representative democracies.

0:22:31.080 --> 0:22:33.560
<v Speaker 1>Just give us a sense of how those are our

0:22:33.640 --> 0:22:34.960
<v Speaker 1>cultural technologies as well.

0:22:35.119 --> 0:22:39.040
<v Speaker 3>Yeah, so this paper, it was a wonderful collaboration between

0:22:39.080 --> 0:22:42.600
<v Speaker 3>me and Henry Farrell, who's a really distinguished political scientist,

0:22:42.880 --> 0:22:46.680
<v Speaker 3>James Evans, who's a fantastic sociologist, and Cosmo Shalizi, who's

0:22:46.720 --> 0:22:48.840
<v Speaker 3>the statistician. So it was kind of like everybody in

0:22:48.880 --> 0:22:52.520
<v Speaker 3>the social sciences. And what Henry and James have pointed

0:22:52.560 --> 0:22:55.720
<v Speaker 3>out is that we have things like writing in print,

0:22:55.760 --> 0:22:58.399
<v Speaker 3>but if you think about things like a market, this

0:22:58.480 --> 0:23:02.399
<v Speaker 3>is an old observation in economics, what a market really

0:23:02.440 --> 0:23:06.399
<v Speaker 3>does is to summarize information from lots and lots of

0:23:06.400 --> 0:23:09.399
<v Speaker 3>individual people. Imagine that you're in a forager culture and

0:23:09.440 --> 0:23:12.000
<v Speaker 3>you want to exchange you know, I have a two

0:23:12.840 --> 0:23:15.600
<v Speaker 3>turnips and I want to exchange them for some beads. Right,

0:23:16.000 --> 0:23:18.880
<v Speaker 3>I have to work that out for each individual group

0:23:18.920 --> 0:23:20.919
<v Speaker 3>of people. And what a market does is let you

0:23:20.960 --> 0:23:23.399
<v Speaker 3>do that literally on a planetary scale, lets you do

0:23:23.440 --> 0:23:26.040
<v Speaker 3>it for billions of people. And the price is a

0:23:26.119 --> 0:23:29.600
<v Speaker 3>kind of summary of here's all the desires and all

0:23:29.600 --> 0:23:32.680
<v Speaker 3>the goals and all the preferences of all of these

0:23:32.680 --> 0:23:35.960
<v Speaker 3>people just summarized in this one, you know number that

0:23:36.000 --> 0:23:39.280
<v Speaker 3>you find when you look on an Amazon. Now, of

0:23:39.280 --> 0:23:43.159
<v Speaker 3>course that's interesting because it didn't even rely on you know,

0:23:43.280 --> 0:23:47.480
<v Speaker 3>markets start in the industrial age, but they don't rely

0:23:47.600 --> 0:23:50.760
<v Speaker 3>on computations. They they're way before we have computers. They're

0:23:50.800 --> 0:23:53.679
<v Speaker 3>way before we even have calculators. But the invention of

0:23:53.760 --> 0:23:56.879
<v Speaker 3>markets was a kind of information processing invention that let

0:23:56.960 --> 0:24:01.399
<v Speaker 3>us take individual desires and put them together. And democratic

0:24:01.440 --> 0:24:04.080
<v Speaker 3>elections are like that too, where we have all of

0:24:04.119 --> 0:24:08.320
<v Speaker 3>these people who have different preferences, and the democratic process

0:24:08.440 --> 0:24:11.600
<v Speaker 3>lets us figure out a way of combining them all.

0:24:11.640 --> 0:24:13.960
<v Speaker 3>And that's another thing that these large models can do,

0:24:14.000 --> 0:24:16.400
<v Speaker 3>that can take lots of information from lots of different people,

0:24:16.440 --> 0:24:18.480
<v Speaker 3>put it together in a single format. And again this

0:24:18.600 --> 0:24:20.280
<v Speaker 3>is for good or for ill, and maybe we could

0:24:20.320 --> 0:24:23.440
<v Speaker 3>talk about both of those sides in a bit.

0:24:23.680 --> 0:24:25.240
<v Speaker 2>Great. Well, actually let's go there now.

0:24:25.359 --> 0:24:28.840
<v Speaker 1>So all previous social and cultural technologies always come with

0:24:28.880 --> 0:24:29.400
<v Speaker 1>good and bad.

0:24:29.520 --> 0:24:31.560
<v Speaker 2>So what do you see.

0:24:31.280 --> 0:24:34.480
<v Speaker 1>As far as large models go with our current moment

0:24:34.520 --> 0:24:37.640
<v Speaker 1>of AI. What do you see as the potential good

0:24:37.680 --> 0:24:38.080
<v Speaker 1>and bad?

0:24:38.440 --> 0:24:38.720
<v Speaker 4>Yeah?

0:24:39.119 --> 0:24:41.680
<v Speaker 3>So, one thing that I think is worth pointing out

0:24:41.880 --> 0:24:45.359
<v Speaker 3>is that actually the big technological change was not the

0:24:45.560 --> 0:24:48.639
<v Speaker 3>change in large models. It was the fact that around

0:24:48.680 --> 0:24:52.000
<v Speaker 3>the year two thousand there was this remarkable thing happened

0:24:52.040 --> 0:24:54.840
<v Speaker 3>that nobody really noticed or paid attention to, which is

0:24:54.880 --> 0:24:58.280
<v Speaker 3>that all the previous media got turned into bits. So

0:24:58.920 --> 0:25:05.040
<v Speaker 3>it's fascinating. Around two thousand, you get the first computerized movies,

0:25:05.080 --> 0:25:08.880
<v Speaker 3>You get things like Toy Story and Pixar. You get PDFs.

0:25:09.000 --> 0:25:11.520
<v Speaker 3>PDFs are taking print and turning them into bits. You

0:25:11.560 --> 0:25:17.440
<v Speaker 3>get HDTV, so you get TV that's now digital, And suddenly,

0:25:17.880 --> 0:25:20.920
<v Speaker 3>in a very short space of time, the only analog

0:25:21.040 --> 0:25:21.720
<v Speaker 3>media are.

0:25:21.600 --> 0:25:23.080
<v Speaker 4>In museums in kindergartens.

0:25:23.080 --> 0:25:25.920
<v Speaker 3>You know, everything else is everything else is digital, which

0:25:25.960 --> 0:25:28.840
<v Speaker 3>means that now not only do you have information, but

0:25:28.920 --> 0:25:33.800
<v Speaker 3>it's infinitely reproducible, and it's instantaneously transmissible because it's in

0:25:33.840 --> 0:25:36.959
<v Speaker 3>the format of bits. And once that happened, then it

0:25:37.000 --> 0:25:39.080
<v Speaker 3>was just a matter of time before we found new

0:25:39.160 --> 0:25:44.040
<v Speaker 3>ways of accessing and summarizing and organizing that information. And

0:25:44.240 --> 0:25:47.760
<v Speaker 3>large models, I think are the are the result of that. Okay,

0:25:47.840 --> 0:25:51.960
<v Speaker 3>so what do we know about these past changes in technologies?

0:25:52.960 --> 0:25:53.919
<v Speaker 4>You go back to writing.

0:25:53.960 --> 0:25:57.320
<v Speaker 3>As I mentioned, Socrates was very dubious about whether writing

0:25:57.480 --> 0:26:00.280
<v Speaker 3>was going to be a good thing or not, because

0:26:00.320 --> 0:26:03.320
<v Speaker 3>he pointed out that you don't have Socratic dialogues when

0:26:03.320 --> 0:26:06.600
<v Speaker 3>you have books, when you have writing, and you don't

0:26:07.040 --> 0:26:09.560
<v Speaker 3>memorize all of Homer when you have books, and he

0:26:09.680 --> 0:26:12.560
<v Speaker 3>was right, we don't memorize all of Homer anymore. Well,

0:26:12.560 --> 0:26:14.359
<v Speaker 3>we do tend to think that things that are written

0:26:14.359 --> 0:26:16.320
<v Speaker 3>down are truer than they actually are.

0:26:16.600 --> 0:26:17.199
<v Speaker 2>And by the way, in.

0:26:17.200 --> 0:26:19.560
<v Speaker 1>The fifteenth century, once the printing press was invented, there

0:26:19.600 --> 0:26:21.960
<v Speaker 1>were a lot of these same complaints that surfaced, where

0:26:21.960 --> 0:26:25.080
<v Speaker 1>people said, look, you know, if you ask a kid

0:26:25.080 --> 0:26:26.600
<v Speaker 1>a question, now they're just going to go to the

0:26:26.640 --> 0:26:29.119
<v Speaker 1>shelf and pull it right off and there's the answer.

0:26:29.160 --> 0:26:30.159
<v Speaker 2>They don't have to think about it.

0:26:30.280 --> 0:26:35.280
<v Speaker 3>That's exactly right, and so there was misinformation. You know,

0:26:35.320 --> 0:26:37.600
<v Speaker 3>when you can pass on information, one of the first

0:26:37.640 --> 0:26:40.480
<v Speaker 3>things that happens is you can also pass on misinformation.

0:26:41.640 --> 0:26:44.760
<v Speaker 3>An example that I really like to give is, so

0:26:44.800 --> 0:26:48.400
<v Speaker 3>you were mentioning about the fifteenth century when printing starts.

0:26:49.000 --> 0:26:50.880
<v Speaker 3>Something that I just found out recently that I think

0:26:50.960 --> 0:26:53.359
<v Speaker 3>is really fascinating is, you know how we all have

0:26:53.480 --> 0:26:57.920
<v Speaker 3>this mythology about Luther nailed the articles to the door,

0:26:58.320 --> 0:27:00.440
<v Speaker 3>and that was, you know, the big defiance thing. Well,

0:27:00.480 --> 0:27:02.760
<v Speaker 3>it turns out nailing things to the door was like

0:27:02.840 --> 0:27:05.400
<v Speaker 3>having post it notes, Like everybody was always nailing things

0:27:05.440 --> 0:27:06.960
<v Speaker 3>to the door. That was just like the way that

0:27:07.000 --> 0:27:11.040
<v Speaker 3>you distributed it. What Luther did was print his ideas,

0:27:11.520 --> 0:27:15.160
<v Speaker 3>and when they were printed, then they could be distributed

0:27:15.200 --> 0:27:19.399
<v Speaker 3>to everybody, including like the common people. And that was

0:27:19.440 --> 0:27:24.560
<v Speaker 3>the revolutionary thing. That was the disruptive That was the disruptive.

0:27:24.080 --> 0:27:27.439
<v Speaker 1>Thing, analogous to let's say, off Hillary using radio in

0:27:27.480 --> 0:27:30.320
<v Speaker 1>the thirties, reaching a much wider audience.

0:27:30.560 --> 0:27:34.080
<v Speaker 3>But my favorite example is in the eighteenth century, the

0:27:34.240 --> 0:27:37.840
<v Speaker 3>late eighteenth century, there were further technological changes in printing,

0:27:37.880 --> 0:27:41.520
<v Speaker 3>which meant that it became extremely I mean Essentially, anybody

0:27:41.520 --> 0:27:43.879
<v Speaker 3>could go and find a printing press and print a

0:27:43.920 --> 0:27:45.399
<v Speaker 3>pamphlet and distributed.

0:27:45.920 --> 0:27:47.760
<v Speaker 4>And there's a very good.

0:27:47.640 --> 0:27:51.359
<v Speaker 3>Argument that this was responsible for the Enlightenment, that you know,

0:27:51.440 --> 0:27:55.520
<v Speaker 3>something like it's not a coincidence that Ben Franklin was

0:27:55.560 --> 0:27:58.720
<v Speaker 3>a printer. A lot of the source of the American

0:27:58.720 --> 0:28:03.879
<v Speaker 3>Revolution was these pamphlets that were spreading new ideas from

0:28:03.920 --> 0:28:07.399
<v Speaker 3>the Enlightenment, things like Tom Payne's common Sense, the idea

0:28:07.480 --> 0:28:11.240
<v Speaker 3>that people could have a democratic state. Those were all

0:28:11.280 --> 0:28:14.040
<v Speaker 3>things that were distributed through printing. So we all think

0:28:14.480 --> 0:28:18.080
<v Speaker 3>that's great, like, that's fantastic, we could get things really quickly.

0:28:18.600 --> 0:28:22.960
<v Speaker 3>But at the same time, the scholar Robert Darten pointed

0:28:23.000 --> 0:28:25.640
<v Speaker 3>this out a long time ago. If you actually look

0:28:25.720 --> 0:28:29.360
<v Speaker 3>and read all of the pamphlets that were produced in France,

0:28:29.480 --> 0:28:31.520
<v Speaker 3>so they were happening in America, they were also happening

0:28:31.560 --> 0:28:32.040
<v Speaker 3>in France.

0:28:32.760 --> 0:28:34.399
<v Speaker 4>You will be amazed to hear this, David.

0:28:34.480 --> 0:28:39.520
<v Speaker 3>But most of them were libel lies, misinformation, and a

0:28:39.560 --> 0:28:42.760
<v Speaker 3>lot of soft core porn. That's the first thing when

0:28:42.920 --> 0:28:45.600
<v Speaker 3>people have a new cultural technology, that's the first thing

0:28:45.640 --> 0:28:48.000
<v Speaker 3>they use it for. And a lot of the French

0:28:48.040 --> 0:28:52.240
<v Speaker 3>Revolution came because, for example, Marie Antoinette saying let them

0:28:52.280 --> 0:28:52.680
<v Speaker 3>eat cake.

0:28:53.040 --> 0:28:55.560
<v Speaker 4>That was a meme. That wasn't something that she said.

0:28:55.640 --> 0:28:58.880
<v Speaker 3>That was something that came out of this underground of

0:28:59.360 --> 0:29:03.200
<v Speaker 3>people just beating pamphlets. So you get the same benefits

0:29:03.200 --> 0:29:07.240
<v Speaker 3>and drawbacks. You get information being distributed really quickly to

0:29:07.320 --> 0:29:09.480
<v Speaker 3>many more people in a way that means that new

0:29:09.520 --> 0:29:13.360
<v Speaker 3>ideas can spread. But it also means that misinformation and

0:29:15.680 --> 0:29:18.719
<v Speaker 3>libel and other kinds of things you don't want can spread.

0:29:18.760 --> 0:29:21.240
<v Speaker 3>And I think we see this happening with the current

0:29:21.600 --> 0:29:26.640
<v Speaker 3>systems as well. So lms are representing the humans you know,

0:29:26.680 --> 0:29:29.959
<v Speaker 3>who are putting You know that the soup just tastes

0:29:30.080 --> 0:29:32.040
<v Speaker 3>like what all the people have put in the soup,

0:29:32.560 --> 0:29:35.280
<v Speaker 3>And that means that if people are putting in things

0:29:35.320 --> 0:29:42.400
<v Speaker 3>that are racist or sexist, or just wrong or misinformation

0:29:42.840 --> 0:29:44.880
<v Speaker 3>or outrage, that's.

0:29:44.680 --> 0:29:47.040
<v Speaker 4>What's going to show up from the llms.

0:29:47.160 --> 0:29:50.160
<v Speaker 1>And of course that's no different than the Berkeley Library

0:29:50.200 --> 0:29:52.200
<v Speaker 1>in the sense that if people are writing books and

0:29:52.240 --> 0:29:56.600
<v Speaker 1>they understand something about planetary motion or dark energy or whatever,

0:29:56.640 --> 0:29:58.640
<v Speaker 1>and they write books on it, that's that's all we

0:29:58.720 --> 0:30:01.000
<v Speaker 1>have to draw from R two things.

0:30:01.080 --> 0:30:02.920
<v Speaker 4>What is it that they can't do well? What they

0:30:03.000 --> 0:30:05.320
<v Speaker 4>can't do is go out and find something new.

0:30:05.800 --> 0:30:07.920
<v Speaker 3>And that's the great thing that human beings can do,

0:30:07.960 --> 0:30:11.040
<v Speaker 3>and especially actually human children can do, is go out

0:30:11.080 --> 0:30:13.959
<v Speaker 3>into the world say this is what everyone's told me

0:30:14.080 --> 0:30:16.400
<v Speaker 3>is true, but you know what, I'm not sure it

0:30:16.440 --> 0:30:18.760
<v Speaker 3>is true. Let me go and find out something new.

0:30:19.000 --> 0:30:22.040
<v Speaker 3>That's what kids do, that's what teenagers do. That's what

0:30:22.120 --> 0:30:26.480
<v Speaker 3>scientists do. Go out in the world, revise, change, think

0:30:26.520 --> 0:30:29.040
<v Speaker 3>about things in new ways, find out something new about

0:30:29.040 --> 0:30:29.400
<v Speaker 3>the world.

0:30:29.600 --> 0:30:30.600
<v Speaker 4>And that's exactly the.

0:30:30.520 --> 0:30:32.880
<v Speaker 3>Thing that ellms can't do. What elms can do is

0:30:32.880 --> 0:30:35.440
<v Speaker 3>summarize what all the other humans have said. They can't

0:30:35.480 --> 0:30:40.320
<v Speaker 3>go out and find something that's not just not just

0:30:40.320 --> 0:30:43.000
<v Speaker 3>an extrapolation from what people have already done.

0:30:44.560 --> 0:30:47.560
<v Speaker 1>So there's something really interesting about this, because what elms

0:30:47.600 --> 0:30:50.600
<v Speaker 1>are great at doing, of course, is interpolating between different

0:30:50.680 --> 0:30:54.840
<v Speaker 1>things that have been said, and sometimes those are lucune,

0:30:54.880 --> 0:30:58.320
<v Speaker 1>those are holes that humans haven't explored in before for

0:30:58.360 --> 0:31:01.880
<v Speaker 1>whatever reason. So what I and some of my colleagues

0:31:01.880 --> 0:31:04.000
<v Speaker 1>has been doing for a while is, you know, pitching

0:31:04.040 --> 0:31:08.720
<v Speaker 1>these really strange scientific questions at it and saying, you know,

0:31:09.080 --> 0:31:11.080
<v Speaker 1>what's your hypothesis, and it comes up with something and

0:31:11.080 --> 0:31:15.040
<v Speaker 1>we say, okay, you think you know. More broadly, just

0:31:15.760 --> 0:31:18.400
<v Speaker 1>come up with something that could explain this, and you know,

0:31:18.560 --> 0:31:20.360
<v Speaker 1>and you keep pushing it and it comes up with

0:31:20.440 --> 0:31:23.680
<v Speaker 1>pretty creative things. And they're creative in the sense that

0:31:23.720 --> 0:31:27.080
<v Speaker 1>they are remixes of what it's taken in before. And

0:31:27.120 --> 0:31:31.320
<v Speaker 1>it can do something, you know, interpolating between points that

0:31:31.360 --> 0:31:32.120
<v Speaker 1>are already known.

0:31:32.600 --> 0:31:35.040
<v Speaker 2>Now I totally agree with you. Of course, what it

0:31:35.120 --> 0:31:35.840
<v Speaker 2>can't do.

0:31:36.280 --> 0:31:39.400
<v Speaker 1>Is think about something outside of the sphere of human knowledge,

0:31:39.440 --> 0:31:42.320
<v Speaker 1>which is what let's say, you know Einstein does when

0:31:42.320 --> 0:31:44.200
<v Speaker 1>he says, what if I'm writing on a photon and

0:31:44.240 --> 0:31:45.480
<v Speaker 1>what would things look like?

0:31:45.560 --> 0:31:45.840
<v Speaker 2>And so on.

0:31:45.960 --> 0:31:49.480
<v Speaker 1>It comes up with a special theory of relativity that is,

0:31:50.720 --> 0:31:53.560
<v Speaker 1>as you say, taking all the stuff that's coming forward

0:31:53.560 --> 0:31:55.600
<v Speaker 1>and saying, hey, maybe that's not right and there's a.

0:31:55.600 --> 0:31:57.400
<v Speaker 2>Completely different way to look at it.

0:31:57.920 --> 0:32:02.520
<v Speaker 1>And of course with that, Record Buyers is the ability

0:32:02.560 --> 0:32:04.960
<v Speaker 1>to not only come up with a new idea, but

0:32:05.000 --> 0:32:09.080
<v Speaker 1>then simulate that idea to its conclusions and say, oh,

0:32:09.120 --> 0:32:12.280
<v Speaker 1>you know, that explains things better than what we currently have.

0:32:12.480 --> 0:32:15.120
<v Speaker 3>And a very important piece of that is it also

0:32:15.240 --> 0:32:18.200
<v Speaker 3>involves going out and testing it, so you know, it

0:32:18.240 --> 0:32:21.760
<v Speaker 3>wasn't just Einstein, it was people going out and actually,

0:32:21.840 --> 0:32:25.760
<v Speaker 3>you know, measuring the eclipses. That meant that that theory

0:32:25.880 --> 0:32:29.320
<v Speaker 3>was confirmed. So one of the things that even little

0:32:29.360 --> 0:32:33.280
<v Speaker 3>kids do is test things, go out experiment. When kids

0:32:33.320 --> 0:32:35.680
<v Speaker 3>do it, we call it getting into everything. But we've

0:32:35.720 --> 0:32:38.520
<v Speaker 3>been recently doing a lot of work showing that even

0:32:38.560 --> 0:32:41.040
<v Speaker 3>when little kids are getting into everything, what they're actually

0:32:41.080 --> 0:32:44.040
<v Speaker 3>doing is trying to get data that they can use

0:32:44.120 --> 0:32:46.840
<v Speaker 3>to change what they think, to revise what they think.

0:32:46.920 --> 0:32:50.400
<v Speaker 3>And that's a very human kind of intelligence. The reason

0:32:50.480 --> 0:32:55.280
<v Speaker 3>why the llms hallucinate is what people often call it.

0:32:55.040 --> 0:32:57.600
<v Speaker 3>It's not that they're hallucinating, it's that they just don't

0:32:58.000 --> 0:33:00.920
<v Speaker 3>They're not designed to know the difference between truth and falsehood.

0:33:01.160 --> 0:33:04.240
<v Speaker 3>They're not their objective function, as they say. The thing

0:33:04.240 --> 0:33:06.720
<v Speaker 3>they're trying to do is not to get the truth.

0:33:06.880 --> 0:33:09.200
<v Speaker 3>It's not going out and doing an experiment. It's not

0:33:09.320 --> 0:33:12.560
<v Speaker 3>changing their minds. It's trying to get the best summary

0:33:12.640 --> 0:33:15.760
<v Speaker 3>of the things that they already have heard from they

0:33:15.800 --> 0:33:17.520
<v Speaker 3>already have heard from other human beings.

0:33:17.760 --> 0:33:20.640
<v Speaker 1>Now, just for clarification, everything that you and I are

0:33:20.680 --> 0:33:25.000
<v Speaker 1>talking about right now is with current large models. But

0:33:25.520 --> 0:33:27.680
<v Speaker 1>what a lot of people are talking about now is

0:33:27.800 --> 0:33:30.840
<v Speaker 1>the third wave. The next wave of AI is probably

0:33:30.880 --> 0:33:34.560
<v Speaker 1>going to be agents who are experimenting in the world,

0:33:34.600 --> 0:33:36.800
<v Speaker 1>who are doing things in the world and gathering data

0:33:36.800 --> 0:33:39.960
<v Speaker 1>that way, because clearly we've already done the common crawl,

0:33:40.320 --> 0:33:43.480
<v Speaker 1>where these AI agents have crawled everything that has been

0:33:43.520 --> 0:33:46.800
<v Speaker 1>written by humans and there's no more data to be had.

0:33:47.240 --> 0:33:48.560
<v Speaker 2>So the next step.

0:33:48.320 --> 0:33:52.720
<v Speaker 1>Is run experiments in the real world, try out hypotheses,

0:33:53.240 --> 0:33:55.800
<v Speaker 1>and so that's going to change everything again.

0:33:56.800 --> 0:33:59.440
<v Speaker 3>Yeah, I mean that's where the kids are like a

0:33:59.440 --> 0:34:02.800
<v Speaker 3>wonderful because that's what kids are doing. And you know,

0:34:02.840 --> 0:34:05.320
<v Speaker 3>you mentioned something that we think about something like Einstein

0:34:05.360 --> 0:34:07.960
<v Speaker 3>as being this big change. But think about I think

0:34:07.960 --> 0:34:11.839
<v Speaker 3>about this with my uh my grandchildren. For example, when

0:34:11.880 --> 0:34:15.920
<v Speaker 3>I look at a phone, right or a computer, what

0:34:16.000 --> 0:34:18.120
<v Speaker 3>I say to myself is okay, well you use.

0:34:18.520 --> 0:34:20.239
<v Speaker 4>You use a keyboard.

0:34:20.480 --> 0:34:22.600
<v Speaker 3>But then there's these other gimmicks about, like I could

0:34:22.640 --> 0:34:24.200
<v Speaker 3>talk to it or I could touch it.

0:34:24.200 --> 0:34:24.799
<v Speaker 4>It would work.

0:34:25.280 --> 0:34:31.279
<v Speaker 3>My grandchildren, even my eighteen month old grandchildren, think, oh no,

0:34:31.640 --> 0:34:33.440
<v Speaker 3>this is a system that you talk to and you

0:34:33.520 --> 0:34:35.680
<v Speaker 3>touch and then things happen when you talk. You know,

0:34:35.719 --> 0:34:37.920
<v Speaker 3>if you actually have this little physical object and you

0:34:37.960 --> 0:34:40.800
<v Speaker 3>talk to it and touch it, you're going to get effects.

0:34:41.080 --> 0:34:42.960
<v Speaker 3>And they think that the you know, the keyboard is like,

0:34:43.040 --> 0:34:47.000
<v Speaker 3>what is this? This is this weird, strange, awkward thing

0:34:47.239 --> 0:34:49.040
<v Speaker 3>that you know, peripheral device.

0:34:49.080 --> 0:34:50.440
<v Speaker 4>We don't want to have anything to do with that.

0:34:50.960 --> 0:34:54.760
<v Speaker 3>Eighteen month olds, right aren't reading, let alone using a keyboard,

0:34:55.040 --> 0:34:57.840
<v Speaker 3>so they're even just you know, the difference between my

0:34:58.320 --> 0:35:01.440
<v Speaker 3>vision of what a computer is and what my eighteen

0:35:01.440 --> 0:35:08.120
<v Speaker 3>month old grandson's vision is is already a really radically

0:35:08.120 --> 0:35:08.840
<v Speaker 3>different vision.

0:35:09.200 --> 0:35:09.680
<v Speaker 2>That's right.

0:35:09.800 --> 0:35:13.319
<v Speaker 1>So the long arc of moral progress, as well as

0:35:13.400 --> 0:35:18.200
<v Speaker 1>the arc of human knowledge is all about is all

0:35:18.239 --> 0:35:20.640
<v Speaker 1>about us passing on to the next generation. Hey, here's

0:35:20.680 --> 0:35:23.000
<v Speaker 1>the things we've learned exactly, and then they pick it

0:35:23.080 --> 0:35:25.080
<v Speaker 1>up and they springboard right off of that.

0:35:25.480 --> 0:35:27.399
<v Speaker 3>So that's going to be you know, if you look

0:35:27.440 --> 0:35:29.600
<v Speaker 3>at the difference between how much progress has been made

0:35:29.600 --> 0:35:33.120
<v Speaker 3>with the LMS and something like robotics, where there's progress,

0:35:33.120 --> 0:35:36.880
<v Speaker 3>but it's much slower and still very very far removed

0:35:36.920 --> 0:35:41.719
<v Speaker 3>from what every baby's doing. You know, I do think

0:35:42.040 --> 0:35:44.719
<v Speaker 3>and we've been thinking about whether you could design a

0:35:44.760 --> 0:35:50.160
<v Speaker 3>system that, for instance, let me describe this one of

0:35:50.200 --> 0:35:54.840
<v Speaker 3>the kinds of AI systems, not an LM. But a

0:35:54.880 --> 0:35:58.520
<v Speaker 3>really different approach is what's called reinforcement learning. And reinforcement

0:35:58.600 --> 0:36:00.720
<v Speaker 3>learning is when a system does something and then learns

0:36:00.719 --> 0:36:03.360
<v Speaker 3>from the outcomes of what it does. But that's still

0:36:03.640 --> 0:36:09.000
<v Speaker 3>very very labor and computation intensive. It's hard, it's hard

0:36:09.040 --> 0:36:12.319
<v Speaker 3>to do efficiently, and it has a big problem, which

0:36:12.360 --> 0:36:15.120
<v Speaker 3>is in reinforcement learning, what those systems are doing is

0:36:15.160 --> 0:36:17.439
<v Speaker 3>just trying to, say, get a big higher score in game.

0:36:17.520 --> 0:36:20.320
<v Speaker 3>So reinforcement learning is how they solved go and chess,

0:36:20.320 --> 0:36:22.719
<v Speaker 3>So you can say, okay, you want to win the game.

0:36:22.719 --> 0:36:24.080
<v Speaker 3>What do you have to do to win the game.

0:36:24.480 --> 0:36:26.480
<v Speaker 3>But of course a lot of what children are doing

0:36:26.600 --> 0:36:27.880
<v Speaker 3>is just trying.

0:36:27.600 --> 0:36:29.040
<v Speaker 4>To figure out how the world works.

0:36:29.120 --> 0:36:31.360
<v Speaker 3>It doesn't really matter whether you're winning or losing, or

0:36:31.920 --> 0:36:35.400
<v Speaker 3>you're getting things or not. It's you just want to

0:36:35.440 --> 0:36:37.719
<v Speaker 3>figure out how the world works. So one thing we've

0:36:37.760 --> 0:36:39.640
<v Speaker 3>been doing is saying, what would happen if you had

0:36:39.640 --> 0:36:42.480
<v Speaker 3>a reinforcement learning system and instead of trying to get

0:36:42.480 --> 0:36:44.920
<v Speaker 3>a high score, it was trying to get more information,

0:36:45.280 --> 0:36:47.840
<v Speaker 3>or it was trying to figure out how to be

0:36:47.920 --> 0:36:50.640
<v Speaker 3>more effective or figure out how cause and effect worked.

0:36:50.920 --> 0:36:53.520
<v Speaker 3>Would that be a better way of describing that certainly

0:36:53.560 --> 0:36:55.560
<v Speaker 3>is much closer to what the kids are doing. But

0:36:55.680 --> 0:36:58.160
<v Speaker 3>part of the problem is that in the systems that

0:36:58.200 --> 0:37:00.480
<v Speaker 3>we have now you know, you were mentioning, well, you

0:37:00.520 --> 0:37:03.680
<v Speaker 3>could go to a chat subutee and say, okay, give

0:37:03.719 --> 0:37:06.960
<v Speaker 3>me five different ways of answering this question. But the

0:37:07.000 --> 0:37:11.920
<v Speaker 3>way that they currently are generating that kind of variability

0:37:11.920 --> 0:37:14.560
<v Speaker 3>and novelty is basically just by being random, just by

0:37:15.440 --> 0:37:18.080
<v Speaker 3>what they sometimes call turning up the temperature, just doing

0:37:18.120 --> 0:37:21.480
<v Speaker 3>things that are more different from one another. They're not

0:37:21.960 --> 0:37:26.160
<v Speaker 3>able to say, Okay, this is a plausible answer, this

0:37:26.239 --> 0:37:28.160
<v Speaker 3>could be the right answer, and this is just you know,

0:37:28.719 --> 0:37:31.920
<v Speaker 3>completely irrelevant. That's part of the reason why they hallucinate

0:37:32.080 --> 0:37:35.600
<v Speaker 3>is because they'll generate something and it seems like it

0:37:35.640 --> 0:37:38.480
<v Speaker 3>could be potentially be an answer, and they're not evaluating

0:37:38.600 --> 0:37:40.960
<v Speaker 3>does this make sense or not? And that's something that

0:37:41.400 --> 0:37:43.680
<v Speaker 3>we know that kids are doing, and we don't have

0:37:43.719 --> 0:37:45.160
<v Speaker 3>a very good account of.

0:37:45.120 --> 0:37:45.680
<v Speaker 4>How they do it.

0:37:45.800 --> 0:37:48.680
<v Speaker 3>So, you know, this is the old This shows how

0:37:48.760 --> 0:37:51.799
<v Speaker 3>old I am. There used to be a show about

0:37:51.840 --> 0:37:54.360
<v Speaker 3>you know, kids say the darkness things, and there's.

0:37:54.320 --> 0:37:58.400
<v Speaker 4>Slightly more expletive latent things like that on the internet.

0:37:58.719 --> 0:38:02.000
<v Speaker 3>Kids are always saying the creative strange things, but they're

0:38:02.000 --> 0:38:05.480
<v Speaker 3>not random, like they're not. It's not that they're just

0:38:06.040 --> 0:38:10.360
<v Speaker 3>randomly saying saying things that make no sense. They're really

0:38:10.480 --> 0:38:14.000
<v Speaker 3>different from what a grown up would ever say, but

0:38:14.040 --> 0:38:16.720
<v Speaker 3>they're not, but they kind of make sense. A lovely

0:38:16.760 --> 0:38:19.880
<v Speaker 3>example recently is one of a post doc was taking

0:38:19.920 --> 0:38:22.319
<v Speaker 3>her child out for her four year old out for

0:38:22.320 --> 0:38:24.920
<v Speaker 3>a walk on the Berkeley campus and the campus has

0:38:24.960 --> 0:38:27.279
<v Speaker 3>a companyli it's a bell tower that's, you know, a

0:38:27.360 --> 0:38:29.600
<v Speaker 3>clock that's very high up on top of a tower.

0:38:30.200 --> 0:38:32.399
<v Speaker 3>And a little boy looked at it and said, there's

0:38:32.440 --> 0:38:33.640
<v Speaker 3>a clock up there.

0:38:34.040 --> 0:38:36.600
<v Speaker 4>And then he thought to himself and he said, why

0:38:36.640 --> 0:38:39.680
<v Speaker 4>did they put the clock up there? And he said

0:38:40.520 --> 0:38:43.800
<v Speaker 4>it must be so the students and the children couldn't

0:38:43.800 --> 0:38:47.600
<v Speaker 4>break it, which is of course a wonderful explanation, right,

0:38:47.600 --> 0:38:48.880
<v Speaker 4>like you put it up really.

0:38:48.719 --> 0:38:52.280
<v Speaker 3>High so it'll be out of reach of the students

0:38:52.280 --> 0:38:55.200
<v Speaker 3>and they can't go and break the valuable clock. Not

0:38:55.320 --> 0:38:57.960
<v Speaker 3>something that a grown up would think of, but something

0:38:58.000 --> 0:39:02.239
<v Speaker 3>that's kind of plausible. And it's that kind of capacity

0:39:02.320 --> 0:39:05.799
<v Speaker 3>to generate something that isn't random but makes sense that

0:39:05.920 --> 0:39:08.160
<v Speaker 3>kids are really really good at doing. And we've done

0:39:08.239 --> 0:39:10.279
<v Speaker 3>a bunch of experiments at show. In some cases they're

0:39:10.320 --> 0:39:14.120
<v Speaker 3>better at doing that than grown ups are. But that's

0:39:14.120 --> 0:39:17.759
<v Speaker 3>something that's still very much not part of what's available

0:39:17.800 --> 0:39:22.240
<v Speaker 3>in AI. And there's an old observation in AI, sometimes

0:39:22.320 --> 0:39:27.000
<v Speaker 3>called Mirovich's paradox haunts Morovit was originally the person who

0:39:27.080 --> 0:39:29.480
<v Speaker 3>noticed it, which is that a lot of the things

0:39:29.480 --> 0:39:32.759
<v Speaker 3>that we as humans think are really really hard and

0:39:32.800 --> 0:39:36.680
<v Speaker 3>require a lot of intelligence, like playing chess, are actually

0:39:37.080 --> 0:39:40.040
<v Speaker 3>not that hard for AI systems. Things that we think

0:39:40.080 --> 0:39:42.800
<v Speaker 3>of is just taking for granted, like picking up a

0:39:42.920 --> 0:39:45.319
<v Speaker 3>chess piece and putting it on the board, turn out

0:39:45.320 --> 0:39:49.080
<v Speaker 3>to be a lot harder than playing chess. So the

0:39:49.120 --> 0:39:51.520
<v Speaker 3>thing that the kids can do, which is take a

0:39:51.560 --> 0:39:53.439
<v Speaker 3>bunch of you know, take a box full of mixed

0:39:53.480 --> 0:39:55.719
<v Speaker 3>up chess pieces and pull out the right ones and

0:39:55.719 --> 0:39:59.240
<v Speaker 3>put them in the right place, or have someone say,

0:39:59.760 --> 0:40:01.359
<v Speaker 3>are right, now, we're going to play chess, but we're

0:40:01.360 --> 0:40:03.560
<v Speaker 3>going to have a different role. You can move the

0:40:03.640 --> 0:40:07.360
<v Speaker 3>ponds as many spaces as you want. Those kinds of

0:40:07.440 --> 0:40:10.040
<v Speaker 3>abilities are exactly the ones that even the great chess

0:40:10.040 --> 0:40:12.680
<v Speaker 3>playing programs are going to have a hard time of doing.

0:40:28.280 --> 0:40:31.360
<v Speaker 1>That's right, and this is what I was mentioning before

0:40:31.440 --> 0:40:33.839
<v Speaker 1>about this third wave. So the first wave of AI

0:40:34.040 --> 0:40:38.239
<v Speaker 1>was really about reinforcement learning entirely. It was let's nail chess,

0:40:38.320 --> 0:40:41.120
<v Speaker 1>let's nail go, and it's all about let's play millions

0:40:41.160 --> 0:40:43.959
<v Speaker 1>of games new reinforcement. Then the second wave of AI

0:40:44.080 --> 0:40:46.480
<v Speaker 1>was totally different. It was, Hey, let's just absorb everything

0:40:46.520 --> 0:40:49.080
<v Speaker 1>that's out there. And so the third wave is really

0:40:49.280 --> 0:40:52.040
<v Speaker 1>something like becoming closer to a child than interacting with

0:40:52.080 --> 0:40:56.160
<v Speaker 1>the world and getting that feedback, right, And that's the future.

0:40:56.160 --> 0:40:58.279
<v Speaker 1>I mean, we don't have that yet in twenty twenty five,

0:40:58.360 --> 0:41:00.640
<v Speaker 1>but it'll be great to revi is that this part

0:41:00.680 --> 0:41:04.440
<v Speaker 1>of the conversation in twenty twenty eight and see where

0:41:04.480 --> 0:41:05.719
<v Speaker 1>things are and what they look like.

0:41:06.120 --> 0:41:08.880
<v Speaker 3>Yeah, I mean it maybe that we're headed for another

0:41:08.960 --> 0:41:12.400
<v Speaker 3>you know, the famous AI springs at AI Winters. My

0:41:13.000 --> 0:41:16.120
<v Speaker 3>intuition is that we're getting to the limits of what

0:41:16.239 --> 0:41:19.120
<v Speaker 3>the LLM cultural technology piece can do now that will

0:41:19.200 --> 0:41:21.920
<v Speaker 3>change the world, There's no question about it. And in

0:41:21.960 --> 0:41:25.120
<v Speaker 3>a way, you know the fact that kids can get

0:41:25.120 --> 0:41:27.759
<v Speaker 3>on the internet and find out kids anyone can pick

0:41:27.840 --> 0:41:30.040
<v Speaker 3>up something that's in their pocket and find out all

0:41:30.080 --> 0:41:32.200
<v Speaker 3>of this information about the world that is going to

0:41:32.600 --> 0:41:36.680
<v Speaker 3>change the world. But if we're thinking about intelligence, then

0:41:37.600 --> 0:41:39.279
<v Speaker 3>I think we still have a long way to go

0:41:39.320 --> 0:41:41.799
<v Speaker 3>to have something that has the kind of intelligence that

0:41:41.840 --> 0:41:42.920
<v Speaker 3>every child has.

0:41:42.719 --> 0:41:46.359
<v Speaker 1>Has in the AI being an intelligence being exactly right,

0:41:47.000 --> 0:41:47.640
<v Speaker 1>exactly right.

0:41:47.719 --> 0:41:48.960
<v Speaker 2>I'm curious what you think about this.

0:41:49.120 --> 0:41:51.400
<v Speaker 1>I've been sort of campaigning on this point for a

0:41:51.400 --> 0:41:53.799
<v Speaker 1>long time that I think the next generation is going

0:41:53.880 --> 0:41:57.279
<v Speaker 1>to be smarter than we are simply because of their

0:41:57.800 --> 0:42:02.120
<v Speaker 1>access to the world's knowledge. With the rectangle in their pocket,

0:42:01.480 --> 0:42:04.319
<v Speaker 1>the moment they're curious about a question and have the

0:42:04.360 --> 0:42:09.960
<v Speaker 1>right neurotransmitters present, they get the answer, and the exposure

0:42:10.680 --> 0:42:14.680
<v Speaker 1>to everything that's known is so incredible as far as

0:42:15.280 --> 0:42:17.759
<v Speaker 1>filling their warehouse so they can do remixes and come

0:42:17.840 --> 0:42:18.760
<v Speaker 1>up with new ideas.

0:42:18.920 --> 0:42:22.799
<v Speaker 2>What's your take hoting children or children growing up in

0:42:22.840 --> 0:42:23.960
<v Speaker 2>the digital children.

0:42:23.680 --> 0:42:24.640
<v Speaker 4>Growing up in the digital age.

0:42:24.680 --> 0:42:27.800
<v Speaker 3>Yeah, so I think that's a really good That's obviously

0:42:27.800 --> 0:42:31.000
<v Speaker 3>a really good, profound question. And I think again, if

0:42:31.040 --> 0:42:36.320
<v Speaker 3>you think about those past examples, like what did writing

0:42:36.600 --> 0:42:42.520
<v Speaker 3>and pictures and print and then the Internet itself, the

0:42:42.560 --> 0:42:45.760
<v Speaker 3>ways that that really changed in some ways that really

0:42:45.920 --> 0:42:51.520
<v Speaker 3>really beefed up human intelligence in really important significant ways.

0:42:51.600 --> 0:42:54.200
<v Speaker 3>The amount of things that's just the number of things

0:42:54.239 --> 0:42:56.680
<v Speaker 3>we know, right, the number of things we could know,

0:42:56.800 --> 0:42:58.280
<v Speaker 3>the number of things we could access.

0:42:58.360 --> 0:42:59.120
<v Speaker 4>All of those things.

0:42:59.280 --> 0:43:02.279
<v Speaker 3>Those culture technologies have really changed, and I think there's

0:43:02.320 --> 0:43:04.400
<v Speaker 3>every reason to believe that that will happen with the

0:43:04.440 --> 0:43:07.479
<v Speaker 3>new technologies as well. Now. At the same time, there's

0:43:07.520 --> 0:43:11.080
<v Speaker 3>always been these trade offs where other kinds of intelligence

0:43:11.120 --> 0:43:13.920
<v Speaker 3>that we had before, like the kind of intelligence that

0:43:13.960 --> 0:43:16.200
<v Speaker 3>you need to have to be a hunter gatherer, for example,

0:43:16.280 --> 0:43:18.719
<v Speaker 3>being able to be out in the world, take in

0:43:18.760 --> 0:43:22.680
<v Speaker 3>lots of information, go out and act, those kinds of

0:43:22.760 --> 0:43:27.200
<v Speaker 3>intelligences might suffer and probably will suffer as a result

0:43:27.280 --> 0:43:31.840
<v Speaker 3>of that. So, you know, being able to being able

0:43:31.880 --> 0:43:35.080
<v Speaker 3>to build a house, right, which is a really useful

0:43:35.080 --> 0:43:37.200
<v Speaker 3>thing to be able to do, is not something that

0:43:37.239 --> 0:43:40.680
<v Speaker 3>you can do just based on YouTube videos, although maybe

0:43:40.719 --> 0:43:45.000
<v Speaker 3>YouTube videos can maybe YouTube videos can help. My suspicion

0:43:45.120 --> 0:43:47.760
<v Speaker 3>is that what tends to happen with these technological changes

0:43:47.880 --> 0:43:51.080
<v Speaker 3>is they make all the difference in the world and

0:43:51.120 --> 0:43:53.640
<v Speaker 3>nobody realizes it because by the time they've made all

0:43:53.680 --> 0:43:56.000
<v Speaker 3>the difference, you just sort of take it for granted.

0:43:56.239 --> 0:43:58.600
<v Speaker 3>So an example that I like to give is as

0:43:58.640 --> 0:44:02.600
<v Speaker 3>I'm walking down the street. Now, I'm spending a lot

0:44:02.600 --> 0:44:06.280
<v Speaker 3>of time decoding text, So as I walk down the street,

0:44:06.600 --> 0:44:11.080
<v Speaker 3>I'm completely surrounded by these signs. You know, go call

0:44:11.160 --> 0:44:16.080
<v Speaker 3>our lawyer for your accident in case you're in an accident,

0:44:16.160 --> 0:44:19.080
<v Speaker 3>right or whatever, all the signs that are on the street.

0:44:19.520 --> 0:44:24.400
<v Speaker 3>I don't think to myself, God, it's exhausting every minute.

0:44:24.680 --> 0:44:26.560
<v Speaker 3>I never get to just walk down the street. I'm

0:44:26.600 --> 0:44:29.080
<v Speaker 3>always putting all this energy into trying to take all

0:44:29.080 --> 0:44:31.239
<v Speaker 3>this text and read it. And if you think about it,

0:44:31.280 --> 0:44:33.359
<v Speaker 3>if you were a pre literate person, it probably would

0:44:33.440 --> 0:44:35.280
<v Speaker 3>be exhausting if you had to sit down and figure

0:44:35.280 --> 0:44:37.080
<v Speaker 3>out what is it that each one of these signs

0:44:37.160 --> 0:44:40.240
<v Speaker 3>is saying. But of course we don't even I mean literally,

0:44:40.239 --> 0:44:43.000
<v Speaker 3>we're not even conscious of it. It's just part of

0:44:43.080 --> 0:44:45.839
<v Speaker 3>what goes on in the background. And we know from

0:44:46.080 --> 0:44:48.960
<v Speaker 3>neuroscience that in fact, parts of our brain have been

0:44:49.040 --> 0:44:52.040
<v Speaker 3>adapted to just do this kind of processing really quickly,

0:44:52.080 --> 0:44:54.080
<v Speaker 3>so we don't even notice that we're doing it. And

0:44:54.160 --> 0:44:56.320
<v Speaker 3>my suspicion is that that's what's going to happen with

0:44:57.760 --> 0:45:02.000
<v Speaker 3>the next generation and the the internet and information. We'll

0:45:02.040 --> 0:45:04.640
<v Speaker 3>be doing these things we won't even think about it.

0:45:04.680 --> 0:45:07.239
<v Speaker 3>But there's a really important caveat to that, which is

0:45:07.239 --> 0:45:10.120
<v Speaker 3>that if you go back to those examples of print

0:45:10.239 --> 0:45:16.000
<v Speaker 3>and writing, why is it that we didn't just have

0:45:16.239 --> 0:45:21.080
<v Speaker 3>all the evil misinformation and libel of the French Revolution indefinitely. Well,

0:45:21.120 --> 0:45:23.359
<v Speaker 3>what's happened is every time we've had one of these

0:45:23.440 --> 0:45:27.680
<v Speaker 3>new cultural technologies, we've also developed systems for keeping them

0:45:27.760 --> 0:45:32.040
<v Speaker 3>under control. So newspapers, you know, again we just sort

0:45:32.040 --> 0:45:35.040
<v Speaker 3>of take newspapers for granted, and newspapers are sort of disappearing,

0:45:35.320 --> 0:45:39.040
<v Speaker 3>But newspapers were away of taking that printed information and

0:45:39.120 --> 0:45:42.600
<v Speaker 3>curating it and having editors who said this is true,

0:45:42.640 --> 0:45:44.560
<v Speaker 3>and this isn't true, and this is something we want

0:45:44.600 --> 0:45:46.120
<v Speaker 3>to tell our readers, and this is something we don't

0:45:46.120 --> 0:45:48.799
<v Speaker 3>want to tell our readers. And you had norms that

0:45:48.920 --> 0:45:53.680
<v Speaker 3>developed things like journalism or journalism school or libel laws

0:45:53.719 --> 0:45:56.480
<v Speaker 3>that said, no, here's a way that we can take

0:45:56.520 --> 0:46:01.040
<v Speaker 3>this new culture and make control it in a way

0:46:01.080 --> 0:46:03.359
<v Speaker 3>that will let the good parts come out and keep

0:46:03.400 --> 0:46:06.239
<v Speaker 3>the bad parts under control. And every time there's a

0:46:06.239 --> 0:46:10.560
<v Speaker 3>new cultural technology, we've had new laws, we've had new norms,

0:46:10.600 --> 0:46:13.480
<v Speaker 3>we've had new institutions, we've had new kinds of people

0:46:13.840 --> 0:46:18.080
<v Speaker 3>who were there to try to make these institutions work

0:46:18.120 --> 0:46:19.839
<v Speaker 3>for and the same thing, by the way, it's true

0:46:19.840 --> 0:46:23.040
<v Speaker 3>for markets. You know, think about the way that markets

0:46:23.080 --> 0:46:25.960
<v Speaker 3>are great for coordinating people, but we all know all

0:46:26.000 --> 0:46:28.319
<v Speaker 3>the terrible things that can happen with markets do so

0:46:28.360 --> 0:46:30.920
<v Speaker 3>you need to have laws, you need to have institutions,

0:46:30.920 --> 0:46:33.239
<v Speaker 3>you need to have ways to regulate them. And I

0:46:33.239 --> 0:46:35.200
<v Speaker 3>think the same thing's going to be true with AI.

0:46:35.400 --> 0:46:38.040
<v Speaker 3>If we're going to succeed, we're both going to have

0:46:38.120 --> 0:46:38.560
<v Speaker 3>to have.

0:46:38.760 --> 0:46:41.440
<v Speaker 4>Internal you know, internal norms.

0:46:41.560 --> 0:46:43.960
<v Speaker 3>I think you see some of that happening with kids,

0:46:43.960 --> 0:46:47.279
<v Speaker 3>where kids will say things like, oh no, I don't

0:46:47.440 --> 0:46:49.520
<v Speaker 3>you know this is the wrong thing to pay attention to.

0:46:49.920 --> 0:46:52.080
<v Speaker 3>If you go to that YouTube site, it's full of

0:46:52.520 --> 0:46:54.960
<v Speaker 3>you know, it's full of nonsense. This one is actually

0:46:55.400 --> 0:46:59.239
<v Speaker 3>this one is actually better. And also we're going to

0:46:59.280 --> 0:47:02.440
<v Speaker 3>have to have all those boring things like legislation and

0:47:02.640 --> 0:47:08.759
<v Speaker 3>code and regulatory agencies that's already sort of starting that

0:47:08.800 --> 0:47:10.879
<v Speaker 3>will make sure that things are good rather than bad.

0:47:11.160 --> 0:47:14.319
<v Speaker 3>Another example, I like to give a great technology, technological

0:47:14.400 --> 0:47:17.240
<v Speaker 3>change that we don't think about very much. Think about

0:47:17.360 --> 0:47:19.879
<v Speaker 3>it's nineteen hundred and people are saying.

0:47:20.200 --> 0:47:21.000
<v Speaker 4>You know what we should do.

0:47:21.040 --> 0:47:23.080
<v Speaker 3>We should take all our wooden houses and we should

0:47:23.080 --> 0:47:25.880
<v Speaker 3>put electricity in them. So we should put wires that

0:47:25.920 --> 0:47:28.880
<v Speaker 3>have electricity, which we know burns things, and we should

0:47:28.880 --> 0:47:32.520
<v Speaker 3>put them in everybody's house, including everybody's wooden.

0:47:32.200 --> 0:47:35.200
<v Speaker 4>House, and it'll be fine, right, it'll be great.

0:47:35.760 --> 0:47:39.319
<v Speaker 3>Well. The only reason why we can do that, and

0:47:39.400 --> 0:47:41.279
<v Speaker 3>we did have a lot of houses burned down to

0:47:41.320 --> 0:47:44.600
<v Speaker 3>begin with, is because we have, as anyone who's you know,

0:47:44.640 --> 0:47:47.680
<v Speaker 3>anyone who's a contractor or has said a reno, Nos,

0:47:47.800 --> 0:47:50.040
<v Speaker 3>we have this thing called code, which is this book

0:47:50.160 --> 0:47:53.840
<v Speaker 3>that's like this thick that has all the rules about

0:47:53.960 --> 0:47:56.360
<v Speaker 3>here's what you have to do, here's how the wiring

0:47:56.400 --> 0:47:57.960
<v Speaker 3>has to work, here's all the things that you have

0:47:58.000 --> 0:48:01.520
<v Speaker 3>to do to make electricity work. And nobody thinks about

0:48:01.520 --> 0:48:05.440
<v Speaker 3>that big book of code as being a fantastic human invention.

0:48:05.560 --> 0:48:07.319
<v Speaker 3>We mostly think of it as, oh, God, that's why

0:48:07.360 --> 0:48:10.480
<v Speaker 3>my contractor is charging me so much. But that invention,

0:48:10.719 --> 0:48:12.840
<v Speaker 3>in a way, is just as important as the invention

0:48:12.880 --> 0:48:16.560
<v Speaker 3>of electricity itself. The invention of electricity itself was a

0:48:16.600 --> 0:48:21.600
<v Speaker 3>big scientific advance, but you couldn't use that unless you

0:48:21.600 --> 0:48:23.880
<v Speaker 3>had this other kind of advance, which was the code,

0:48:24.239 --> 0:48:27.640
<v Speaker 3>the laws, the regulations, the legislation. And I think that's

0:48:27.680 --> 0:48:30.680
<v Speaker 3>something that a lot of people in AI are realizing.

0:48:32.640 --> 0:48:35.400
<v Speaker 1>So when you're thinking about what will be the legislation

0:48:35.480 --> 0:48:37.919
<v Speaker 1>and the norms that are coming down the pike, when

0:48:37.920 --> 0:48:40.000
<v Speaker 1>you squint into the future, can you see what that's

0:48:40.040 --> 0:48:40.560
<v Speaker 1>going to look like?

0:48:41.360 --> 0:48:44.120
<v Speaker 3>I think that's a good question, and it will have

0:48:44.200 --> 0:48:46.680
<v Speaker 3>to be somewhat different. You know, each one of the

0:48:46.719 --> 0:48:48.920
<v Speaker 3>way that we deal with language, which is, you know,

0:48:49.000 --> 0:48:51.319
<v Speaker 3>sort of having a moral principle that you shouldn't lie,

0:48:51.360 --> 0:48:53.200
<v Speaker 3>for example, is different from what we have to do

0:48:53.239 --> 0:48:55.240
<v Speaker 3>with writing, is different from what we do with print,

0:48:55.680 --> 0:48:58.280
<v Speaker 3>is different from what we had to do with the Internet.

0:48:58.760 --> 0:48:59.719
<v Speaker 4>One of the points that.

0:48:59.680 --> 0:49:02.799
<v Speaker 3>We make in are in that paper that I think

0:49:02.920 --> 0:49:06.880
<v Speaker 3>is really important is that there are real dangers in

0:49:06.920 --> 0:49:10.960
<v Speaker 3>the fact that this technology has been monopolized by a

0:49:11.000 --> 0:49:14.040
<v Speaker 3>few big agencies, for example, a few big companies.

0:49:14.400 --> 0:49:15.200
<v Speaker 4>So one thing is.

0:49:15.160 --> 0:49:18.200
<v Speaker 3>How are we going to make sure that that it's democratized,

0:49:18.320 --> 0:49:22.520
<v Speaker 3>that in fact, people can use it outside of the

0:49:22.560 --> 0:49:25.120
<v Speaker 3>control of just a few big companies.

0:49:25.160 --> 0:49:28.040
<v Speaker 1>And because of just a quick interjection, I actually I

0:49:28.080 --> 0:49:29.960
<v Speaker 1>have no worries about that, and I'll tell you why.

0:49:30.000 --> 0:49:31.680
<v Speaker 2>It's because everything starts this way.

0:49:32.320 --> 0:49:35.080
<v Speaker 1>And you know, we saw when deep Sea came out

0:49:35.120 --> 0:49:39.000
<v Speaker 1>recently they had reportedly done it for six million bucks

0:49:39.040 --> 0:49:43.840
<v Speaker 1>instead of billions. And it's only going to get cheaper

0:49:43.920 --> 0:49:45.120
<v Speaker 1>and easier to do this.

0:49:46.280 --> 0:49:46.880
<v Speaker 2>So I think it.

0:49:46.880 --> 0:49:50.399
<v Speaker 1>Will quickly become democratized, just like every other example we've

0:49:50.440 --> 0:49:53.359
<v Speaker 1>had of things like this leg printing. I mean, now

0:49:53.400 --> 0:49:55.880
<v Speaker 1>we all have a printer on our desk at home, right,

0:49:56.080 --> 0:49:58.040
<v Speaker 1>whereas a printing press was a big.

0:49:57.880 --> 0:49:59.239
<v Speaker 2>Deal that only a few people had.

0:49:59.520 --> 0:50:01.760
<v Speaker 4>Yeah, well, I think that's partly true.

0:50:01.800 --> 0:50:04.919
<v Speaker 3>But on the other hand, the fact that these all

0:50:04.960 --> 0:50:09.480
<v Speaker 3>depend on these pre trained systems that do take an

0:50:09.600 --> 0:50:13.279
<v Speaker 3>enormous amount of money to get started, and even with

0:50:13.400 --> 0:50:15.920
<v Speaker 3>deep Seak, it seems like part of what Deepseek was

0:50:15.920 --> 0:50:20.160
<v Speaker 3>doing was taking advantage of the fact that this pre

0:50:20.280 --> 0:50:22.880
<v Speaker 3>training had already been done by other systems.

0:50:22.920 --> 0:50:24.799
<v Speaker 4>So I think that will happen.

0:50:24.880 --> 0:50:28.360
<v Speaker 3>But I think the question about who will have control

0:50:28.840 --> 0:50:33.360
<v Speaker 3>is a really important political and economic question. Here's another

0:50:33.560 --> 0:50:37.680
<v Speaker 3>interesting point that my colleague Henry Ferrell made in that

0:50:38.080 --> 0:50:41.200
<v Speaker 3>has made and we made in that science piece. If

0:50:41.200 --> 0:50:47.560
<v Speaker 3>you think about cultural technologies they intrinsically always involve attention

0:50:48.200 --> 0:50:52.560
<v Speaker 3>between the creators and the distributors. So the ideas cultural

0:50:52.600 --> 0:50:55.839
<v Speaker 3>technology is I can get information from other people. That's

0:50:55.920 --> 0:50:59.799
<v Speaker 3>essentially the idea, But that means someone has to make

0:50:59.840 --> 0:51:02.319
<v Speaker 3>up that information, someone has to generate it, someone has

0:51:02.360 --> 0:51:04.440
<v Speaker 3>to find out what's true, for example.

0:51:05.200 --> 0:51:09.960
<v Speaker 4>And there's this kind of paradoxical synergy between the people.

0:51:09.680 --> 0:51:12.200
<v Speaker 3>Who are out there creating things that are new and

0:51:12.239 --> 0:51:14.480
<v Speaker 3>then the people who are distributing them. So there's no

0:51:14.600 --> 0:51:17.399
<v Speaker 3>point in writing, as you and I both know as writers, right,

0:51:17.400 --> 0:51:19.880
<v Speaker 3>there's no point. Well, or maybe there's some point, but

0:51:19.880 --> 0:51:21.759
<v Speaker 3>there's not much point in writing a book unless you

0:51:21.760 --> 0:51:24.279
<v Speaker 3>have a publisher who can actually get it out into

0:51:24.280 --> 0:51:24.640
<v Speaker 3>the world.

0:51:24.840 --> 0:51:25.000
<v Speaker 4>Right.

0:51:26.160 --> 0:51:29.120
<v Speaker 3>But for the publisher, there's no point in being a

0:51:29.160 --> 0:51:32.239
<v Speaker 3>publisher unless you can get people to write books for you,

0:51:32.360 --> 0:51:35.120
<v Speaker 3>unless you can get new content, new ideas.

0:51:36.280 --> 0:51:37.800
<v Speaker 4>But that also means that there's.

0:51:37.560 --> 0:51:41.879
<v Speaker 3>This intrinsic tension, economic tension between who's going to pay right,

0:51:42.000 --> 0:51:44.319
<v Speaker 3>So for the distributors, it's always going to be in

0:51:44.360 --> 0:51:46.480
<v Speaker 3>their interest for the creators.

0:51:46.000 --> 0:51:48.840
<v Speaker 4>To get paid as little as possible.

0:51:48.680 --> 0:51:50.400
<v Speaker 3>And for the creators it's always going to be in

0:51:50.440 --> 0:51:51.920
<v Speaker 3>their interests for the distributors to.

0:51:51.920 --> 0:51:53.640
<v Speaker 4>Get paid as little as possible.

0:51:53.960 --> 0:51:56.440
<v Speaker 3>So if you're thinking about that from an economic perspective,

0:51:56.440 --> 0:51:59.280
<v Speaker 3>there's always going to be this tension between the people

0:51:59.280 --> 0:52:02.160
<v Speaker 3>who are creating and the people who are distributing. And

0:52:02.200 --> 0:52:04.960
<v Speaker 3>you can already see that in what's happened in journalism,

0:52:04.960 --> 0:52:08.880
<v Speaker 3>for instance, or what's happened to the death of local newspapers.

0:52:09.239 --> 0:52:13.160
<v Speaker 3>We had this kind of strange equilibrium where you know,

0:52:13.360 --> 0:52:16.400
<v Speaker 3>the want ads and the weather reports paid for the

0:52:16.440 --> 0:52:22.480
<v Speaker 3>investigative journalism and the arts criticism. And that's once that

0:52:22.560 --> 0:52:25.400
<v Speaker 3>two thousand digital convergence happened, that wasn't going to be

0:52:25.400 --> 0:52:26.200
<v Speaker 3>there anymore.

0:52:26.320 --> 0:52:27.840
<v Speaker 4>And now it's.

0:52:27.960 --> 0:52:30.879
<v Speaker 3>Really sort of up for grabs about who is how

0:52:30.920 --> 0:52:34.319
<v Speaker 3>people are going to get compensated, and who is going

0:52:34.360 --> 0:52:37.480
<v Speaker 3>to have the power of the distributors or the creators,

0:52:37.520 --> 0:52:39.400
<v Speaker 3>And I think that's going to be a really important

0:52:39.440 --> 0:52:41.239
<v Speaker 3>issue as we're going forward with this too.

0:52:41.560 --> 0:52:44.080
<v Speaker 1>One idea that people have been floating recently is to

0:52:44.200 --> 0:52:47.960
<v Speaker 1>have digital dividends come back to the creators, so that

0:52:48.000 --> 0:52:51.480
<v Speaker 1>the people whose work has been used to train the

0:52:51.520 --> 0:52:55.720
<v Speaker 1>models and maybe provided point one percent of the answer

0:52:55.800 --> 0:52:58.600
<v Speaker 1>that that model just gave gets a few pennies back

0:52:59.000 --> 0:53:04.320
<v Speaker 1>each time. That's idea how those economics models will evolve

0:53:05.000 --> 0:53:05.920
<v Speaker 1>over the coming year.

0:53:06.280 --> 0:53:07.920
<v Speaker 3>I mean, if you think about it, like what we

0:53:07.960 --> 0:53:11.200
<v Speaker 3>did for print was we invented copyright law, so we

0:53:11.239 --> 0:53:14.120
<v Speaker 3>have a whole lot of laws about Look, you can't,

0:53:14.560 --> 0:53:16.959
<v Speaker 3>you know, I can't just take a David Eagleman book

0:53:17.000 --> 0:53:19.640
<v Speaker 3>and copy it and say that it's mine. But of

0:53:19.680 --> 0:53:24.319
<v Speaker 3>course that involves the actual text. So copyright depends on

0:53:24.360 --> 0:53:26.799
<v Speaker 3>the fact that you're taking word for words something that

0:53:26.800 --> 0:53:30.279
<v Speaker 3>someone else has written. How about if Google decides to

0:53:30.360 --> 0:53:33.440
<v Speaker 3>use David Eagleman's book to train its large language model.

0:53:33.800 --> 0:53:37.080
<v Speaker 3>It's not taking every single word, but the reason why

0:53:37.160 --> 0:53:40.239
<v Speaker 3>it's giving you good answers about neuroscience is because it's

0:53:40.239 --> 0:53:42.560
<v Speaker 3>been trained on those books. And we know as a

0:53:42.560 --> 0:53:45.840
<v Speaker 3>matter of fact that those large models have been trained

0:53:45.920 --> 0:53:49.239
<v Speaker 3>on my books and your books and all the other

0:53:49.239 --> 0:53:50.200
<v Speaker 3>books that are out there.

0:53:50.280 --> 0:53:50.440
<v Speaker 4>Now.

0:53:50.440 --> 0:53:54.759
<v Speaker 3>They're not copying it down exactly, but the fact that

0:53:54.800 --> 0:53:56.799
<v Speaker 3>you could go to chat ChiPT and it could tell

0:53:56.800 --> 0:54:01.520
<v Speaker 3>you here's what Alison Govnik would say about about children's learning,

0:54:01.840 --> 0:54:04.719
<v Speaker 3>comes because it's gone out there and used all that

0:54:04.800 --> 0:54:06.280
<v Speaker 3>information in all of those books.

0:54:06.320 --> 0:54:07.880
<v Speaker 4>So it's really tricky.

0:54:08.120 --> 0:54:08.600
<v Speaker 2>Yeah.

0:54:08.960 --> 0:54:10.920
<v Speaker 1>What I think is really tricky about this is the

0:54:11.000 --> 0:54:12.960
<v Speaker 1>reason you and I are able to write books is

0:54:12.960 --> 0:54:16.920
<v Speaker 1>because we've sat in thousands of lectures, we've heard ideas

0:54:16.960 --> 0:54:18.480
<v Speaker 1>from people, and we've put things.

0:54:18.239 --> 0:54:20.280
<v Speaker 2>Together in our own way and our own voice.

0:54:20.960 --> 0:54:23.920
<v Speaker 1>And in a sense, the lllm's doing what humans do

0:54:24.000 --> 0:54:25.120
<v Speaker 1>in terms of creativity.

0:54:25.160 --> 0:54:27.080
<v Speaker 2>We're remixing all of our inputs.

0:54:27.320 --> 0:54:29.960
<v Speaker 4>Yeah, but of course we wouldn't.

0:54:30.840 --> 0:54:33.239
<v Speaker 3>Our books wouldn't be worth reading if they were just

0:54:33.320 --> 0:54:35.880
<v Speaker 3>summaries of everything that had gone before. So part of

0:54:35.880 --> 0:54:38.680
<v Speaker 3>what we want is to have exactly this.

0:54:39.360 --> 0:54:39.799
<v Speaker 4>People in.

0:54:41.600 --> 0:54:45.160
<v Speaker 3>A field called cultural evolution actually study how is it

0:54:45.200 --> 0:54:48.320
<v Speaker 3>that information comes from one generation to another.

0:54:48.719 --> 0:54:52.080
<v Speaker 4>And we've studied this in children. It's really fascinating.

0:54:52.200 --> 0:54:55.600
<v Speaker 3>In The Gardener and the Carpenter, which is my book

0:54:55.640 --> 0:54:59.399
<v Speaker 3>about parents and children, looking at how children learn from

0:54:59.400 --> 0:55:03.000
<v Speaker 3>other people, there's lots of examples in there about how

0:55:03.120 --> 0:55:07.120
<v Speaker 3>children can take information from their parents, for example, but

0:55:07.160 --> 0:55:10.080
<v Speaker 3>they don't just swallow it hole they think about it

0:55:10.120 --> 0:55:14.279
<v Speaker 3>in new ways, they remix it. In cultural evolution they

0:55:14.280 --> 0:55:17.239
<v Speaker 3>talk about this as the balance between imitation and innovation.

0:55:17.719 --> 0:55:18.520
<v Speaker 4>So you need both.

0:55:18.560 --> 0:55:20.440
<v Speaker 3>You need to be able to imitate the other people

0:55:20.480 --> 0:55:23.000
<v Speaker 3>around you to make progress, but then you also want

0:55:23.040 --> 0:55:25.359
<v Speaker 3>to do something that's not just what the other people

0:55:25.400 --> 0:55:27.000
<v Speaker 3>around you have done. I mean, there'd be no point

0:55:27.040 --> 0:55:30.320
<v Speaker 3>in imitating if somebody hadn't innovated.

0:55:29.920 --> 0:55:31.160
<v Speaker 4>At some point in the past.

0:55:31.600 --> 0:55:35.560
<v Speaker 3>And it's a really hard scientific problem about how do

0:55:35.600 --> 0:55:36.800
<v Speaker 3>you get that balance.

0:55:36.880 --> 0:55:38.759
<v Speaker 4>And what you see with children is that.

0:55:38.719 --> 0:55:43.760
<v Speaker 3>If you give children information, they will They're very likely,

0:55:43.880 --> 0:55:48.000
<v Speaker 3>especially little children, will kind of imitate you literally. You know,

0:55:48.400 --> 0:55:51.359
<v Speaker 3>you produce a gesture and the kids will imitate it.

0:55:51.840 --> 0:55:54.560
<v Speaker 3>But then they'll also change it depending on whether they

0:55:54.600 --> 0:55:55.640
<v Speaker 3>think it makes sense or not.

0:55:56.120 --> 0:56:00.399
<v Speaker 1>Yeah, And as you know, in the animal kingdom, see

0:56:00.440 --> 0:56:04.919
<v Speaker 1>that all animals have this trade off between exploration and exploitation.

0:56:05.080 --> 0:56:07.759
<v Speaker 1>So they'll spend eighty percent of a time exploiting the

0:56:07.840 --> 0:56:11.120
<v Speaker 1>things that they know, as in under that rock, there

0:56:11.120 --> 0:56:13.719
<v Speaker 1>are these grubs that I eat, and then they'll spend

0:56:13.719 --> 0:56:16.960
<v Speaker 1>twenty percent of their time exploring other places that they

0:56:16.960 --> 0:56:19.840
<v Speaker 1>haven't seen before. Across the animal kingdom, we see this

0:56:19.880 --> 0:56:22.000
<v Speaker 1>exploration exploitation trade off, and it strikes me that the

0:56:23.120 --> 0:56:28.160
<v Speaker 1>imitation innovation trade off is essentially the cognitive equivalent analogy

0:56:28.200 --> 0:56:28.480
<v Speaker 1>of that.

0:56:28.920 --> 0:56:31.760
<v Speaker 4>No I think that's exactly right. And what I've argued.

0:56:31.560 --> 0:56:35.560
<v Speaker 3>Is that if you think about childhood right, childhood is

0:56:35.680 --> 0:56:38.759
<v Speaker 3>very evolutionarily paradoxical, like why would we have this long

0:56:38.800 --> 0:56:41.120
<v Speaker 3>time when not only are we not producing anything, but

0:56:41.160 --> 0:56:47.520
<v Speaker 3>we're extracting resources from the other people around us. And

0:56:47.560 --> 0:56:51.520
<v Speaker 3>there's a really interesting biological generalization which is that the

0:56:51.600 --> 0:56:55.120
<v Speaker 3>smarter the adultonimal, the longer childhood it has. And that's

0:56:55.360 --> 0:57:00.319
<v Speaker 3>very very general. It's mammals, birds, marsupials, even insects see

0:57:00.320 --> 0:57:04.440
<v Speaker 3>this relationship between the length of childhood and the intelligency

0:57:04.440 --> 0:57:08.560
<v Speaker 3>in the anthropomorphic sense of the adult, how good the

0:57:08.600 --> 0:57:11.840
<v Speaker 3>adult is learning and figuring out the world. And I've

0:57:11.880 --> 0:57:14.840
<v Speaker 3>made the argument that this is really an example of

0:57:14.880 --> 0:57:18.800
<v Speaker 3>this explore exploit trade off. So childhood is evolution's way

0:57:19.040 --> 0:57:22.840
<v Speaker 3>of resolving the explore exploit trade off, because what it

0:57:22.880 --> 0:57:25.720
<v Speaker 3>does is it gives you this period of childhood.

0:57:25.880 --> 0:57:26.560
<v Speaker 4>Well you don't have.

0:57:26.520 --> 0:57:28.880
<v Speaker 3>To worry about exploiting, you don't actually have to worry

0:57:28.920 --> 0:57:30.960
<v Speaker 3>about taking care of yourself and going out in the

0:57:30.960 --> 0:57:34.360
<v Speaker 3>world and getting resources. As I always say, babies and

0:57:34.360 --> 0:57:38.080
<v Speaker 3>young children have exactly one utility function, as the economists say,

0:57:38.080 --> 0:57:40.320
<v Speaker 3>which is be as cute as you possibly can be,

0:57:40.600 --> 0:57:43.160
<v Speaker 3>and they're very very very good at maximizing that.

0:57:43.440 --> 0:57:45.280
<v Speaker 4>And as long as you're as cute as you possibly

0:57:45.280 --> 0:57:47.480
<v Speaker 4>can be, you don't have to worry about anything.

0:57:47.240 --> 0:57:49.640
<v Speaker 3>Else, right, you don't have to worry about getting fed

0:57:49.680 --> 0:57:50.560
<v Speaker 3>and taken care of.

0:57:51.040 --> 0:57:52.800
<v Speaker 4>But what that means is that that.

0:57:54.600 --> 0:57:57.440
<v Speaker 3>Frees you up to do the things that babies and

0:57:57.480 --> 0:58:02.000
<v Speaker 3>young children do, which is play and explore and experiment

0:58:02.200 --> 0:58:06.600
<v Speaker 3>and have weird, crazy, imaginative pretend and do all these

0:58:06.680 --> 0:58:09.480
<v Speaker 3>kinds of explore functions. And then because the children are

0:58:09.520 --> 0:58:12.240
<v Speaker 3>doing that, then the adults can take advantage of all

0:58:12.360 --> 0:58:16.680
<v Speaker 3>the novelty and innovation that the children are producing. So

0:58:17.800 --> 0:58:20.280
<v Speaker 3>and it's interesting that in the computer science literature they

0:58:20.280 --> 0:58:22.720
<v Speaker 3>also talk a lot about this explore exploit trade off,

0:58:23.080 --> 0:58:26.640
<v Speaker 3>and there's literally proofs that you can't. It's a trade off.

0:58:26.640 --> 0:58:28.760
<v Speaker 3>You can't have both of them at the same time.

0:58:29.240 --> 0:58:33.000
<v Speaker 3>And often the best solution is, and you know, I

0:58:33.040 --> 0:58:36.000
<v Speaker 3>think people can feel this intuitively, is start out exploring

0:58:36.360 --> 0:58:39.240
<v Speaker 3>when you're trying to solve.

0:58:38.200 --> 0:58:38.840
<v Speaker 4>A new problem.

0:58:39.200 --> 0:58:43.720
<v Speaker 3>Start out just brainstorming, having crazy ideas, not worrying too

0:58:43.760 --> 0:58:46.680
<v Speaker 3>much about whether you're getting there or not. And then

0:58:47.000 --> 0:58:49.960
<v Speaker 3>narrow in and say, Okay, now I have a solution.

0:58:50.040 --> 0:58:51.680
<v Speaker 3>Now I want to figure out how do I find

0:58:51.720 --> 0:58:56.440
<v Speaker 3>tune that solution. And I think childhood is nature's way

0:58:56.520 --> 0:59:01.520
<v Speaker 3>of implementing that idea. Explore first, explore later, and again,

0:59:01.640 --> 0:59:04.960
<v Speaker 3>if you're thinking about these artificial intelligence systems, it's a

0:59:05.000 --> 0:59:08.520
<v Speaker 3>really deep problem about how could you get design a

0:59:08.560 --> 0:59:12.760
<v Speaker 3>system that can trade off those two capacities in an

0:59:12.800 --> 0:59:13.720
<v Speaker 3>intelligent way?

0:59:14.040 --> 0:59:16.959
<v Speaker 1>And you know, I think that's that's the third wave

0:59:17.040 --> 0:59:19.640
<v Speaker 1>that's coming. Yeah, that's the part that we don't have currently,

0:59:19.760 --> 0:59:21.120
<v Speaker 1>But that's what's saying next.

0:59:21.280 --> 0:59:25.000
<v Speaker 3>What's interesting is in humans, the way that we do

0:59:25.080 --> 0:59:28.920
<v Speaker 3>it is because we have a particular what biologists call

0:59:29.000 --> 0:59:33.120
<v Speaker 3>life history, We have a particular developmental history. We start

0:59:33.120 --> 0:59:36.960
<v Speaker 3>out being children, we become adults. Then for humans, we

0:59:37.040 --> 0:59:40.320
<v Speaker 3>become elders, we become post mental pausal grandmothers. Now we

0:59:40.440 --> 0:59:44.280
<v Speaker 3>tend to think, and you hear a lot of this

0:59:44.600 --> 0:59:49.560
<v Speaker 3>from AI guys, with guys being the relevant term that

0:59:49.680 --> 0:59:53.680
<v Speaker 3>somehow the thirty five year old, the thirty five year

0:59:53.680 --> 0:59:57.880
<v Speaker 3>old adult male is like the peak of the psychologist

0:59:58.000 --> 1:00:00.439
<v Speaker 3>or the philosopher or the AI guy is the peak

1:00:00.520 --> 1:00:03.960
<v Speaker 3>of human intelligence. And then there's this thing called intelligence

1:00:03.960 --> 1:00:05.520
<v Speaker 3>that you can have a little of or a lot of,

1:00:05.560 --> 1:00:08.280
<v Speaker 3>and the thirty five year olds or maybe even the

1:00:08.320 --> 1:00:10.400
<v Speaker 3>twenty five year olds like they have the most of it,

1:00:10.480 --> 1:00:13.000
<v Speaker 3>and then childhood is just building up to it, and

1:00:13.040 --> 1:00:15.840
<v Speaker 3>then elderhood is just falling off from it. But that

1:00:15.880 --> 1:00:19.880
<v Speaker 3>doesn't make much sense from an evolutionary perspective. In fact,

1:00:20.040 --> 1:00:22.160
<v Speaker 3>what seems to be true is that we have these

1:00:22.280 --> 1:00:27.840
<v Speaker 3>different functions, really different, radically different kinds of intelligence that

1:00:27.960 --> 1:00:30.720
<v Speaker 3>trade off. They're not just different, they trade off against

1:00:30.800 --> 1:00:36.919
<v Speaker 3>one another, like exploration exploitation, and our developmental trajectory from

1:00:37.000 --> 1:00:40.920
<v Speaker 3>childhood to adulthood to elderhood is a way that we

1:00:41.040 --> 1:00:43.520
<v Speaker 3>managed to deal with those kinds of trade offs. So

1:00:43.520 --> 1:00:47.520
<v Speaker 3>we have this childhood that lets us explore and lets

1:00:47.600 --> 1:00:50.840
<v Speaker 3>us get information from other people. We have this adulthood

1:00:50.880 --> 1:00:54.400
<v Speaker 3>where we can go out and use that information to

1:00:54.480 --> 1:00:56.920
<v Speaker 3>do all the things that Gronim's do, like find mates

1:00:56.960 --> 1:01:00.200
<v Speaker 3>and resources and find our way in the hierarchy. And

1:01:00.240 --> 1:01:03.920
<v Speaker 3>then we have this elderhood where now we're motivated to

1:01:04.960 --> 1:01:07.800
<v Speaker 3>use our resources to help the next generation, to pass

1:01:07.880 --> 1:01:09.880
<v Speaker 3>on the wisdom and information that we've got to the

1:01:09.920 --> 1:01:10.640
<v Speaker 3>next generation.

1:01:11.160 --> 1:01:11.800
<v Speaker 4>So away.

1:01:11.840 --> 1:01:15.920
<v Speaker 3>I put this perhaps a little meanly sometimes, is basically,

1:01:15.960 --> 1:01:19.000
<v Speaker 3>we're humans up to puberty and after menopause, and in

1:01:19.040 --> 1:01:22.000
<v Speaker 3>between we're sort of basically glorified primates. In between we're

1:01:22.000 --> 1:01:25.640
<v Speaker 3>doing all those things like you know, mating and predating

1:01:25.680 --> 1:01:30.480
<v Speaker 3>and finding resources and all the fun stuff is what

1:01:30.520 --> 1:01:32.520
<v Speaker 3>the kids and the grandmom's fit to do, which is

1:01:32.720 --> 1:01:38.080
<v Speaker 3>play and explore and tell stories and pass on recipes

1:01:38.200 --> 1:01:40.880
<v Speaker 3>and do Broadway show tunes, all the things that I

1:01:40.920 --> 1:01:44.800
<v Speaker 3>do as a grandmother with my with my grandchildren. But

1:01:44.840 --> 1:01:46.880
<v Speaker 3>there is something that's deeper than that, which is that

1:01:47.280 --> 1:01:51.640
<v Speaker 3>typically AI has not kind of had this developmental perspective.

1:01:51.680 --> 1:01:54.880
<v Speaker 3>And I think that's another thing that developmental science can

1:01:54.920 --> 1:01:58.280
<v Speaker 3>contribute to AI, is think not just about that there's

1:01:58.320 --> 1:02:00.640
<v Speaker 3>this thing called intelligence that we want more of or

1:02:00.720 --> 1:02:05.080
<v Speaker 3>less of, but how do you in a society and

1:02:05.160 --> 1:02:08.760
<v Speaker 3>in an individual across time trade off these really different

1:02:08.800 --> 1:02:10.000
<v Speaker 3>kinds of intelligences.

1:02:12.200 --> 1:02:14.520
<v Speaker 1>So to zoom this back out to the big picture,

1:02:14.560 --> 1:02:18.880
<v Speaker 1>then the idea of not looking at AI as an

1:02:18.920 --> 1:02:22.439
<v Speaker 1>intelligent agent, but looking at it as a cultural technology.

1:02:22.520 --> 1:02:24.480
<v Speaker 1>One of the things that suggests, and you say this

1:02:24.520 --> 1:02:27.600
<v Speaker 1>in the paper, is that it's not just engineers who

1:02:27.640 --> 1:02:30.360
<v Speaker 1>should be studying AI, but it's social scientists who should

1:02:30.360 --> 1:02:31.240
<v Speaker 1>be commenting on it.

1:02:31.560 --> 1:02:34.920
<v Speaker 4>Yeah, so just yeah, it's sort of fascinating.

1:02:34.960 --> 1:02:37.680
<v Speaker 3>So if we wanted to say, let's try and figure

1:02:37.680 --> 1:02:40.160
<v Speaker 3>out what's going to be good, what's going to be bad,

1:02:40.280 --> 1:02:42.320
<v Speaker 3>how do we regulate it, how do we make.

1:02:42.200 --> 1:02:43.000
<v Speaker 4>It good or bad?

1:02:43.800 --> 1:02:46.240
<v Speaker 3>The people we should be talking to are people who

1:02:46.280 --> 1:02:51.440
<v Speaker 3>know something about how print and writing and pictures affected society,

1:02:51.640 --> 1:02:57.440
<v Speaker 3>or how developing markets and democracies and bureaucracies restructured society.

1:02:57.520 --> 1:03:01.720
<v Speaker 3>And they all had enormous effects, right, I mean in

1:03:01.760 --> 1:03:04.840
<v Speaker 3>a way like it's easy and fun to bring another

1:03:04.880 --> 1:03:08.280
<v Speaker 3>intelligent agent into the universe. It's a lot more fun

1:03:08.320 --> 1:03:10.400
<v Speaker 3>than coding. Actually, we know how to do that and

1:03:10.400 --> 1:03:12.960
<v Speaker 3>we're doing it all the time, and it makes a

1:03:12.960 --> 1:03:15.400
<v Speaker 3>little difference, but not an enormous difference. Bringing a new

1:03:15.400 --> 1:03:20.880
<v Speaker 3>cultural technology into the universe like print, that's a giant change.

1:03:21.080 --> 1:03:23.160
<v Speaker 3>But we've done it before. It's not as if this

1:03:23.240 --> 1:03:25.600
<v Speaker 3>is a singularity. It's not as if this isn't something

1:03:25.600 --> 1:03:28.240
<v Speaker 3>that humans have done before. The people who know about

1:03:28.280 --> 1:03:32.360
<v Speaker 3>it are psychologists and political scientists and sociologists and historians

1:03:32.360 --> 1:03:33.560
<v Speaker 3>of technology, and.

1:03:33.560 --> 1:03:35.240
<v Speaker 4>One thing that I think would be really.

1:03:35.000 --> 1:03:39.760
<v Speaker 3>Really helpful would be to get this historical perspective into

1:03:39.880 --> 1:03:42.720
<v Speaker 3>the way that we think about AI, rather than having

1:03:42.720 --> 1:03:46.400
<v Speaker 3>the sort of Gallum Frankenstein's story perspective of this is

1:03:46.440 --> 1:03:48.760
<v Speaker 3>something that's unlike anything that's ever happened in.

1:03:50.560 --> 1:03:51.560
<v Speaker 4>Humankind before.

1:03:51.680 --> 1:03:56.040
<v Speaker 3>It's something that is important and really has had significant effects,

1:03:56.040 --> 1:03:58.840
<v Speaker 3>but it is something that's happened in human life before,

1:03:58.920 --> 1:04:01.400
<v Speaker 3>and it's something that comes out of the way that

1:04:01.480 --> 1:04:04.880
<v Speaker 3>human beings work, not something that comes out of some

1:04:05.080 --> 1:04:08.120
<v Speaker 3>strange alien inhuman kind of development.

1:04:12.680 --> 1:04:15.800
<v Speaker 1>That was my interview with Alison Gopnik, professor at Berkeley.

1:04:16.080 --> 1:04:18.240
<v Speaker 1>There are two points I want to return to. I've

1:04:18.280 --> 1:04:21.880
<v Speaker 1>previously argued this point in Inner Cosmos in episode seventy two,

1:04:22.320 --> 1:04:25.760
<v Speaker 1>that AI is best thought of as a processor of

1:04:25.800 --> 1:04:30.720
<v Speaker 1>the intelligence of billions of humans. And that's because I

1:04:30.760 --> 1:04:33.680
<v Speaker 1>was noticing, even back then, the number of people who

1:04:33.720 --> 1:04:36.680
<v Speaker 1>typed in a sophisticated question and they got back what

1:04:36.720 --> 1:04:39.720
<v Speaker 1>appeared to be a sophisticated answer, and they concluded, Wow,

1:04:39.760 --> 1:04:44.200
<v Speaker 1>this thing is truly intelligent, but they were simply confusing

1:04:44.280 --> 1:04:49.280
<v Speaker 1>that with the intellectual endeavors of humans before them. Maybe

1:04:49.600 --> 1:04:52.360
<v Speaker 1>dozens of people had written about that topic, or maybe

1:04:52.440 --> 1:04:55.920
<v Speaker 1>hundreds or thousands, but they simply didn't know that, and

1:04:55.960 --> 1:05:00.400
<v Speaker 1>so they heard the echo and they misinterpreted that as

1:05:00.480 --> 1:05:03.800
<v Speaker 1>the proud voice of AI. So I named this the

1:05:04.040 --> 1:05:08.920
<v Speaker 1>intelligence echo illusion. Second point is, right after my conversation

1:05:09.000 --> 1:05:11.600
<v Speaker 1>with Alison, I found myself thinking a lot about the question,

1:05:12.200 --> 1:05:14.680
<v Speaker 1>when there's a giant machine that collects up data and

1:05:14.720 --> 1:05:19.440
<v Speaker 1>processes it, why are we so quick to anthropomorphize it,

1:05:19.520 --> 1:05:23.080
<v Speaker 1>to see it as a being. Well, it turns out

1:05:23.120 --> 1:05:26.080
<v Speaker 1>that brains are very quick to do that. For example,

1:05:26.480 --> 1:05:29.520
<v Speaker 1>imagine you hear some sound in your home at night.

1:05:29.920 --> 1:05:33.840
<v Speaker 1>You assume it's a living, intentional creature, even though you

1:05:33.920 --> 1:05:37.400
<v Speaker 1>might discover after some investigation with a flashlight in a

1:05:37.440 --> 1:05:39.960
<v Speaker 1>baseball bat that it was just the wind blowing and

1:05:40.040 --> 1:05:44.960
<v Speaker 1>knocking something around. Presumably this is a survival mechanism to

1:05:45.400 --> 1:05:49.360
<v Speaker 1>assume everything is alive and has intention. But I think

1:05:49.360 --> 1:05:52.040
<v Speaker 1>there's another aspect to it also, which is that we

1:05:52.120 --> 1:05:55.880
<v Speaker 1>seem unable to think about complex systems and so we

1:05:56.040 --> 1:06:00.800
<v Speaker 1>have to assign a central character to it. Examples of this,

1:06:00.840 --> 1:06:02.840
<v Speaker 1>which I'm going to talk about in an episode in

1:06:02.920 --> 1:06:05.240
<v Speaker 1>a few weeks from now, but for now, I'll just

1:06:05.360 --> 1:06:08.439
<v Speaker 1>use as an example what's known is the Great Man

1:06:08.680 --> 1:06:12.880
<v Speaker 1>view of history. We look to some historical figure, this

1:06:12.880 --> 1:06:15.680
<v Speaker 1>could be a woman as well, and we attribute a

1:06:15.920 --> 1:06:19.959
<v Speaker 1>historical outcome to that person. But this is a very

1:06:20.000 --> 1:06:25.160
<v Speaker 1>misleading narrative tendency, because anything that happens historically is a

1:06:25.360 --> 1:06:31.320
<v Speaker 1>hugely complex event, shaped by thousands of people and often institutions,

1:06:31.360 --> 1:06:36.160
<v Speaker 1>and sometimes random contingencies. We say that Hitler led Germany

1:06:36.200 --> 1:06:40.120
<v Speaker 1>into World War Two, but that glosses over the collaborators

1:06:40.160 --> 1:06:44.440
<v Speaker 1>and the predecessors and the cultural scaffolding that made all

1:06:44.480 --> 1:06:48.160
<v Speaker 1>these things possible. My assertion is that this isn't just

1:06:48.200 --> 1:06:53.440
<v Speaker 1>a storytelling shortcut. It reflects something deep about human cognition.

1:06:53.840 --> 1:06:57.280
<v Speaker 1>We're just not able to hold that level of complexity

1:06:57.320 --> 1:07:02.240
<v Speaker 1>in our heads, so we become story tellers. We anthropomorphize

1:07:02.360 --> 1:07:05.400
<v Speaker 1>to one or a few central characters. In other words,

1:07:05.520 --> 1:07:09.640
<v Speaker 1>it's really hard for our brains to grasp sprawling, distributed

1:07:09.680 --> 1:07:13.280
<v Speaker 1>networks of influence, so we turn a vast social tapestry

1:07:13.640 --> 1:07:17.080
<v Speaker 1>into a single silhouette. Okay, so now let's zoom back

1:07:17.080 --> 1:07:20.240
<v Speaker 1>out to the main picture. What happens when we stop

1:07:20.400 --> 1:07:24.280
<v Speaker 1>asking whether a large model is intelligent and instead ask

1:07:24.680 --> 1:07:28.640
<v Speaker 1>what kind of cultural machine it is. Today's conversation with

1:07:28.720 --> 1:07:32.680
<v Speaker 1>Alison Gopnik took us through a reframing that I think

1:07:32.800 --> 1:07:36.400
<v Speaker 1>is clarifying and possibly really important. When we think of

1:07:36.600 --> 1:07:41.560
<v Speaker 1>large models as minds, as proto agents, it's easy to

1:07:41.560 --> 1:07:43.920
<v Speaker 1>get swept up in a speculative drama.

1:07:44.280 --> 1:07:45.080
<v Speaker 2>We ask if.

1:07:44.920 --> 1:07:48.520
<v Speaker 1>They'll surpass us, if they'll enslave us. But when we

1:07:48.960 --> 1:07:53.400
<v Speaker 1>shift the lens, when we see these systems as cultural technologies,

1:07:53.760 --> 1:07:58.480
<v Speaker 1>more akin to libraries or markets, then our questions become

1:07:59.000 --> 1:08:00.960
<v Speaker 1>more grounded and actionable.

1:08:01.280 --> 1:08:03.080
<v Speaker 2>Large models are changing.

1:08:02.800 --> 1:08:06.120
<v Speaker 1>Everything about our lives, how we write, how we seek information,

1:08:06.240 --> 1:08:11.120
<v Speaker 1>how we work. They are reorganizing the structure of knowledge

1:08:11.240 --> 1:08:15.520
<v Speaker 1>in real time, and like every major information technology before them,

1:08:15.960 --> 1:08:20.240
<v Speaker 1>whether that's writing or printing or broadcast media, these tools

1:08:20.240 --> 1:08:24.760
<v Speaker 1>are going to shape culture not because they think, but

1:08:24.840 --> 1:08:28.800
<v Speaker 1>simply because they change the flow of ideas. So maybe

1:08:28.840 --> 1:08:32.320
<v Speaker 1>the first step towards wise stewardship of this new era

1:08:32.880 --> 1:08:38.400
<v Speaker 1>is to improve our metaphors less Golum and Frankenstein, more

1:08:38.800 --> 1:08:44.840
<v Speaker 1>printing press and library, less digital brain, more public infrastructure,

1:08:45.360 --> 1:08:47.560
<v Speaker 1>less will it become sentient?

1:08:47.840 --> 1:08:51.800
<v Speaker 2>And more? What cultural shifts does this give rise to?

1:08:52.200 --> 1:08:54.960
<v Speaker 1>There's a lot at stake and how we frame these systems,

1:08:55.000 --> 1:08:59.320
<v Speaker 1>because our analogies and our metaphors are never neutral. If

1:08:59.360 --> 1:09:03.599
<v Speaker 1>we imagine a large model as a proto, human will

1:09:03.680 --> 1:09:06.400
<v Speaker 1>fear it and will regulate it accordingly. But if we

1:09:06.479 --> 1:09:10.639
<v Speaker 1>see it as a new kind of social technology, something

1:09:10.720 --> 1:09:14.599
<v Speaker 1>like a market or a library or a sprawling editorial system,

1:09:15.080 --> 1:09:19.200
<v Speaker 1>we can draw on the history of how societies have

1:09:19.360 --> 1:09:23.960
<v Speaker 1>dealt with the cultural impact of information systems. In other words,

1:09:24.439 --> 1:09:29.000
<v Speaker 1>how we think about new technology will shape the world

1:09:29.360 --> 1:09:35.920
<v Speaker 1>we're about to live into. Go to eagleman dot com

1:09:35.920 --> 1:09:39.840
<v Speaker 1>slash podcast for more information and find further reading. Check

1:09:39.840 --> 1:09:42.200
<v Speaker 1>out my newsletter on substack and be a part of

1:09:42.240 --> 1:09:44.080
<v Speaker 1>the online conversation there.

1:09:44.600 --> 1:09:45.880
<v Speaker 2>Finally, you can watch.

1:09:45.680 --> 1:09:48.320
<v Speaker 1>Videos of Inner Cosmos on YouTube, where you can leave

1:09:48.400 --> 1:09:54.960
<v Speaker 1>comments Until next time. I'm David Eagleman and this is

1:09:55.080 --> 1:10:03.439
<v Speaker 1>inner Cosmos. You can help you to have to