WEBVTT - Ep139 "What does alignment look like in a society of AIs?" with Danielle Perszyk

0:00:05.160 --> 0:00:08.760
<v Speaker 1>Is it possible that we're thinking about intelligence in the

0:00:08.840 --> 0:00:14.080
<v Speaker 1>wrong way? Instead of being something inside individual brains, is

0:00:14.120 --> 0:00:19.400
<v Speaker 1>intelligence instead something that emerges from lots of brains that

0:00:19.440 --> 0:00:23.800
<v Speaker 1>are constantly working to align with one another. And if

0:00:23.840 --> 0:00:26.960
<v Speaker 1>we take on that lens, what does this mean about

0:00:26.960 --> 0:00:31.040
<v Speaker 1>the way that we can build AI agents or the

0:00:31.040 --> 0:00:33.960
<v Speaker 1>way that they can make us better? What is the

0:00:34.000 --> 0:00:38.959
<v Speaker 1>difference between information and information with a purpose? Today we're

0:00:38.960 --> 0:00:42.040
<v Speaker 1>going to speak with Daniel Persick, a cognitive scientist who

0:00:42.120 --> 0:00:47.080
<v Speaker 1>leads the Human Computer Interaction team at Amazon's AGI Lab.

0:00:47.440 --> 0:00:54.240
<v Speaker 1>So get ready for a great brain stretch. Welcome to

0:00:54.240 --> 0:00:57.680
<v Speaker 1>Intercosmos with me David Eagleman. I'm a neuroscientist and an

0:00:57.680 --> 0:01:00.640
<v Speaker 1>author at Stanford and in these episodes, as we sail

0:01:00.800 --> 0:01:04.160
<v Speaker 1>deeply into our three pound universe to understand how we

0:01:04.240 --> 0:01:08.520
<v Speaker 1>see the world and soon how AI might come to

0:01:08.640 --> 0:01:25.040
<v Speaker 1>understand the world with us. Let's think about the word intelligence.

0:01:25.760 --> 0:01:30.440
<v Speaker 1>You might justifiably assume that neuroscientists have an agreed upon

0:01:30.640 --> 0:01:34.800
<v Speaker 1>definition for this, but we actually don't. However one thinks

0:01:34.800 --> 0:01:38.880
<v Speaker 1>about intelligence, I think it's a fair assumption that most

0:01:38.880 --> 0:01:42.360
<v Speaker 1>of us, when we think about it, assume that intelligence

0:01:42.480 --> 0:01:47.000
<v Speaker 1>is something that happens inside a single head, in other words,

0:01:47.400 --> 0:01:52.400
<v Speaker 1>a brain processing information. This statement seems so obvious that

0:01:52.520 --> 0:01:56.680
<v Speaker 1>it hardly invites inspection, but if you step back and

0:01:56.720 --> 0:02:01.480
<v Speaker 1>look at how intelligence actually unfolds in a human life,

0:02:01.520 --> 0:02:06.240
<v Speaker 1>a different picture can start to emerge. Our thinking is

0:02:06.280 --> 0:02:09.919
<v Speaker 1>shaped by other people from the very beginning. We learn

0:02:10.080 --> 0:02:15.200
<v Speaker 1>by watching, by imitating, by trying to communicate, and eventually

0:02:15.240 --> 0:02:19.680
<v Speaker 1>by negotiating meaning with the people around us. Even our

0:02:19.680 --> 0:02:24.840
<v Speaker 1>most private thoughts are built from tools that are fundamentally social,

0:02:25.040 --> 0:02:30.239
<v Speaker 1>things like language and symbols and shared concepts and cultural norms.

0:02:30.560 --> 0:02:32.400
<v Speaker 1>So this may sound strange, but this is what we're

0:02:32.400 --> 0:02:34.760
<v Speaker 1>going to talk about today, and the idea will become

0:02:34.880 --> 0:02:39.720
<v Speaker 1>very clear. Most of humanity's greatest achievements didn't come from

0:02:39.840 --> 0:02:45.440
<v Speaker 1>lone geniuses working in isolation, but from really dense networks

0:02:45.600 --> 0:02:50.320
<v Speaker 1>of minds interacting over time. When we look at things

0:02:50.400 --> 0:02:54.440
<v Speaker 1>like science or art, or morality or technology, it almost

0:02:54.639 --> 0:02:59.800
<v Speaker 1>never makes sense to interpret these as products of individual intelligence,

0:02:59.800 --> 0:03:04.400
<v Speaker 1>but instead they are collective processes that allow ideas to

0:03:04.919 --> 0:03:09.520
<v Speaker 1>collide and to form into something and to continuously evolve.

0:03:10.280 --> 0:03:14.320
<v Speaker 1>So intelligence in this sense may be less like a

0:03:14.440 --> 0:03:20.600
<v Speaker 1>thing we possess and more like something that emerges between us. Now,

0:03:20.800 --> 0:03:26.320
<v Speaker 1>this broader perspective becomes especially important as we find ourselves

0:03:26.600 --> 0:03:31.760
<v Speaker 1>flinging headlong into the era of artificial intelligence. With every

0:03:31.840 --> 0:03:37.360
<v Speaker 1>passing week, we're getting AI acting more like a participant.

0:03:37.360 --> 0:03:41.360
<v Speaker 1>We're getting systems that communicate but also agents that act

0:03:41.440 --> 0:03:43.720
<v Speaker 1>on our behalf to do things in the world. And

0:03:43.960 --> 0:03:48.880
<v Speaker 1>soon these agents will collaborate with each other at their

0:03:49.000 --> 0:03:54.920
<v Speaker 1>time scales and spatial scales. So if intelligence is social

0:03:55.040 --> 0:03:59.560
<v Speaker 1>by nature, then building the future world of AI might

0:03:59.680 --> 0:04:04.440
<v Speaker 1>end up requiring more than just dumping billions into scaling

0:04:04.600 --> 0:04:08.000
<v Speaker 1>up the training data for these systems. It may instead

0:04:08.080 --> 0:04:13.560
<v Speaker 1>require understanding how minds relate to one another in the

0:04:13.600 --> 0:04:17.840
<v Speaker 1>first place. And that's where today's conversation begins. Today I'm

0:04:17.880 --> 0:04:22.040
<v Speaker 1>joined by Danielle Persk. She's a cognitive scientist who leads

0:04:22.080 --> 0:04:27.799
<v Speaker 1>the Human Computer Interaction team at Amazon's AGI Lab. Danielle

0:04:27.880 --> 0:04:32.680
<v Speaker 1>uses insights from the evolution and development of human intelligence

0:04:33.120 --> 0:04:36.480
<v Speaker 1>to inform how we can not only make AI smarter,

0:04:36.920 --> 0:04:41.320
<v Speaker 1>but build AI that also makes us smarter. Here's my

0:04:41.400 --> 0:04:43.120
<v Speaker 1>conversation with Danielle Persk.

0:04:47.680 --> 0:04:52.640
<v Speaker 2>Intelligence in humans is really social, and that is the

0:04:52.680 --> 0:04:56.120
<v Speaker 2>thing that differentiates our intelligence from other species. Even other

0:04:56.160 --> 0:04:58.960
<v Speaker 2>species that are closely related to us have similar brain

0:04:59.000 --> 0:05:03.360
<v Speaker 2>structures and function similar genetics. And what we are really

0:05:03.800 --> 0:05:09.560
<v Speaker 2>optimizing for is representing other minds. So not only are

0:05:10.160 --> 0:05:14.680
<v Speaker 2>infants human infants inferring the existence of other minds, but

0:05:15.279 --> 0:05:21.279
<v Speaker 2>once this thing exists, we are optimized for aligning our minds. Evolutionarily,

0:05:21.400 --> 0:05:26.479
<v Speaker 2>we had to cooperate to survive. Infants need to be

0:05:26.560 --> 0:05:29.600
<v Speaker 2>able to have their caretaker's attention on them to survive,

0:05:30.160 --> 0:05:33.000
<v Speaker 2>and in terms of being able to learn about the world,

0:05:33.240 --> 0:05:35.480
<v Speaker 2>once infants have a model of other minds, then they

0:05:35.480 --> 0:05:38.839
<v Speaker 2>can manipulate it. They can direct their caretaker's attention point

0:05:39.080 --> 0:05:41.640
<v Speaker 2>what's that, and magically they'll have a label for.

0:05:41.680 --> 0:05:43.720
<v Speaker 3>This thing that they're looking at in their environment.

0:05:43.880 --> 0:05:46.480
<v Speaker 1>So they're doing prompt engineering.

0:05:47.960 --> 0:05:50.680
<v Speaker 3>Great technology. Yeah, okay, so we know that.

0:05:50.800 --> 0:05:54.440
<v Speaker 2>You know, throughout the course of human evolution, we became

0:05:54.640 --> 0:05:59.320
<v Speaker 2>increasingly dependent upon cooperating to to stay alive and adapt

0:05:59.440 --> 0:06:02.080
<v Speaker 2>to new environment. So it makes sense that there'd be

0:06:02.160 --> 0:06:06.080
<v Speaker 2>this extreme pressure on being able to predict each other's

0:06:06.120 --> 0:06:09.920
<v Speaker 2>behaviors to understand our minds, and then with infants, developmentally,

0:06:10.000 --> 0:06:13.479
<v Speaker 2>we have also the benefit of being able to learn

0:06:13.839 --> 0:06:19.280
<v Speaker 2>much more efficiently even language itself, from representing other minds.

0:06:19.720 --> 0:06:21.840
<v Speaker 1>Okay, so it turns out that we can do a

0:06:22.000 --> 0:06:26.760
<v Speaker 1>much better job of predicting if we can imagine what

0:06:26.800 --> 0:06:29.440
<v Speaker 1>it's like to be inside other people's heads. Right, So,

0:06:29.720 --> 0:06:33.920
<v Speaker 1>if I want to know what some non player character

0:06:34.000 --> 0:06:35.680
<v Speaker 1>is going to do in a video game whatever, they

0:06:35.720 --> 0:06:37.440
<v Speaker 1>have certain behaviors. But if I want to know, let's

0:06:37.440 --> 0:06:39.599
<v Speaker 1>say what you're going to do next, or say next,

0:06:39.880 --> 0:06:42.160
<v Speaker 1>if I have a model of your mind and what

0:06:42.200 --> 0:06:43.680
<v Speaker 1>you know and you don't know and all that stuff,

0:06:43.720 --> 0:06:44.919
<v Speaker 1>I can make a better prediction.

0:06:45.800 --> 0:06:50.360
<v Speaker 2>And so you've said that there's information and then information

0:06:50.440 --> 0:06:53.400
<v Speaker 2>with a purpose, and that information with a purpose really matters.

0:06:53.720 --> 0:06:57.000
<v Speaker 3>So you've used the example of like the.

0:06:56.720 --> 0:06:59.960
<v Speaker 2>Land rover on Mars not being able to fix itself,

0:07:00.400 --> 0:07:02.880
<v Speaker 2>and like a wolf that gets its like trap.

0:07:03.080 --> 0:07:05.560
<v Speaker 1>Right, the Curiosity Rover went up to Mars. We had

0:07:05.600 --> 0:07:08.160
<v Speaker 1>spent like a billion something dollars on it. It did

0:07:08.160 --> 0:07:10.520
<v Speaker 1>a great job on Mars, but eventually it got its

0:07:10.600 --> 0:07:13.400
<v Speaker 1>right front wheel stuck in the Martian soil and it

0:07:13.480 --> 0:07:17.000
<v Speaker 1>died couldn't get out. But if you can trast that

0:07:17.040 --> 0:07:20.080
<v Speaker 1>with a wolf who gets its leg cond of trap.

0:07:20.400 --> 0:07:22.600
<v Speaker 1>It'll chew its leg off and then figure out how

0:07:22.640 --> 0:07:24.920
<v Speaker 1>to walk on three legs, which is extraordinary because a

0:07:24.960 --> 0:07:27.600
<v Speaker 1>wolf's brain didn't evolve for three legs. But it can

0:07:27.680 --> 0:07:30.040
<v Speaker 1>figure it out because it's live wired. It has brain

0:07:30.080 --> 0:07:33.680
<v Speaker 1>plasticity and figure out, Okay, how do I adjust everything.

0:07:33.560 --> 0:07:34.640
<v Speaker 3>So that I can survival?

0:07:34.680 --> 0:07:37.920
<v Speaker 1>Depends upon it exactly. That's the key. It has relevance

0:07:38.040 --> 0:07:39.360
<v Speaker 1>to the animal.

0:07:39.200 --> 0:07:42.480
<v Speaker 2>Right, So all animals have a drive to survive, a

0:07:42.560 --> 0:07:46.800
<v Speaker 2>drive to reproduce, But humans also have a drive to

0:07:47.080 --> 0:07:51.320
<v Speaker 2>align our minds because it helps us cooperate, it helps

0:07:51.400 --> 0:07:55.560
<v Speaker 2>us survive, and it helps us to learn extremely efficiently.

0:07:56.000 --> 0:07:59.400
<v Speaker 2>So we don't just model other minds. That would just

0:07:59.440 --> 0:08:03.720
<v Speaker 2>be the information part. We are optimized for aligning our minds.

0:08:03.720 --> 0:08:05.520
<v Speaker 2>So it's information with a purpose.

0:08:05.760 --> 0:08:08.440
<v Speaker 1>Okay, so aligning our minds this is the key thing

0:08:09.640 --> 0:08:12.440
<v Speaker 1>and at the center of your interests. And so then

0:08:12.560 --> 0:08:15.280
<v Speaker 1>you went into looking into AGI. So first of all,

0:08:15.320 --> 0:08:18.520
<v Speaker 1>tell us what artificial general intelligence is to you.

0:08:18.800 --> 0:08:22.120
<v Speaker 2>Well, I think most of the labs that are trying

0:08:22.160 --> 0:08:26.160
<v Speaker 2>to build something like AGI, they all have their own definitions.

0:08:27.120 --> 0:08:30.040
<v Speaker 2>None of them are really very good. But the one

0:08:30.080 --> 0:08:32.400
<v Speaker 2>thing that unifies all of them is that they are

0:08:32.480 --> 0:08:35.840
<v Speaker 2>all benchmarked to human intelligence. And this goes all the

0:08:35.840 --> 0:08:39.160
<v Speaker 2>way back to the origin of the field of AI.

0:08:39.640 --> 0:08:42.000
<v Speaker 2>So in nineteen fifty six, a group of these engineers

0:08:42.000 --> 0:08:44.480
<v Speaker 2>and mathematicians got together. They were going to solve intelligence

0:08:44.480 --> 0:08:46.640
<v Speaker 2>and build thinking machines, and the idea is that these

0:08:46.640 --> 0:08:48.240
<v Speaker 2>thinking machines would think like us.

0:08:49.040 --> 0:08:50.160
<v Speaker 3>It obviously took a.

0:08:50.200 --> 0:08:53.280
<v Speaker 2>Very long time to realize, Oh, that's a lot harder

0:08:53.800 --> 0:08:56.199
<v Speaker 2>than we thought that it was. But now we are

0:08:56.320 --> 0:08:59.120
<v Speaker 2>back to aiming for something like that original goal of

0:08:59.120 --> 0:09:01.720
<v Speaker 2>building thinking machines that think like us.

0:09:01.760 --> 0:09:02.800
<v Speaker 3>We call it AGI.

0:09:03.280 --> 0:09:08.360
<v Speaker 2>Again, have slightly different operationalizations. But I think that we're

0:09:08.360 --> 0:09:12.080
<v Speaker 2>all running towards the wrong thing. And that's because I

0:09:12.120 --> 0:09:17.079
<v Speaker 2>don't think that intelligence can exist in a machine. It

0:09:17.120 --> 0:09:21.800
<v Speaker 2>doesn't exist in individual humans. It's something that emerges from

0:09:21.960 --> 0:09:27.079
<v Speaker 2>our interactions because we have this drive to align our representations,

0:09:27.600 --> 0:09:30.480
<v Speaker 2>and of course we all have very different representations.

0:09:30.559 --> 0:09:31.840
<v Speaker 3>Right When I used to teach.

0:09:31.679 --> 0:09:35.160
<v Speaker 2>Cognitive science, I would teach about this condition called a fantasia,

0:09:35.800 --> 0:09:39.640
<v Speaker 2>and once every couple of classes a student would come

0:09:39.720 --> 0:09:40.040
<v Speaker 2>up to.

0:09:40.000 --> 0:09:43.280
<v Speaker 1>Me quick In fantations where you can't imagine, you don't

0:09:43.320 --> 0:09:45.280
<v Speaker 1>have any visual representation on the Yes.

0:09:45.400 --> 0:09:47.640
<v Speaker 2>Yes, a student would come up to me and say, wait,

0:09:47.800 --> 0:09:51.000
<v Speaker 2>you mean there are people who can actually imagine things.

0:09:51.080 --> 0:09:53.280
<v Speaker 3>Their mind's eye is not just a metaphor. It's a

0:09:53.320 --> 0:09:53.880
<v Speaker 3>thing that.

0:09:53.800 --> 0:09:57.360
<v Speaker 2>People experience, and they wouldn't know because they don't suffer

0:09:57.480 --> 0:10:02.000
<v Speaker 2>from other types of death. It's just one of the

0:10:02.480 --> 0:10:06.920
<v Speaker 2>many ways in which human cognition and experience can very

0:10:07.920 --> 0:10:10.839
<v Speaker 2>And when I imagine in apple, it's different than when

0:10:10.880 --> 0:10:15.560
<v Speaker 2>you imagine an apple. We all have different associations. So

0:10:15.600 --> 0:10:20.359
<v Speaker 2>when we come together and we have to use words

0:10:20.880 --> 0:10:24.079
<v Speaker 2>to try to align our minds, there's necessarily going to

0:10:24.120 --> 0:10:29.320
<v Speaker 2>be friction, especially when we're trying to talk about abstract things,

0:10:29.960 --> 0:10:32.880
<v Speaker 2>especially when we're talking about things at the bleeding edge

0:10:32.880 --> 0:10:34.240
<v Speaker 2>of our knowledge, like science.

0:10:34.800 --> 0:10:37.400
<v Speaker 3>How do you align.

0:10:37.160 --> 0:10:39.600
<v Speaker 2>Your representations when there's not even a word for something.

0:10:39.679 --> 0:10:44.720
<v Speaker 2>So intelligence emerges as a function of trying to align

0:10:44.760 --> 0:10:50.679
<v Speaker 2>our minds and oftentimes creating new concepts to achieve that.

0:10:51.120 --> 0:10:53.320
<v Speaker 1>Okay, so when you're talking about aligning minds, it's because

0:10:53.480 --> 0:10:56.440
<v Speaker 1>I've got my whole internal world. You've got your whole

0:10:56.480 --> 0:11:00.240
<v Speaker 1>internal world that is built by each of our our

0:11:00.320 --> 0:11:03.959
<v Speaker 1>trajectories through space time. We've had different experiences all these things.

0:11:04.480 --> 0:11:07.800
<v Speaker 1>So we come together and we've got completely different worlds

0:11:07.840 --> 0:11:10.280
<v Speaker 1>running on the inside. And that's what conversation is about.

0:11:10.320 --> 0:11:12.280
<v Speaker 1>We're trying to align things that way.

0:11:12.440 --> 0:11:16.280
<v Speaker 2>And there are neuroscientists who measure when people are either

0:11:16.400 --> 0:11:19.520
<v Speaker 2>communicating in real time or if they're listening to a story,

0:11:19.640 --> 0:11:23.200
<v Speaker 2>if they're watching something on a screen, you can measure

0:11:23.280 --> 0:11:27.080
<v Speaker 2>the degree of neuralsynchrony, how close they are to be

0:11:27.120 --> 0:11:29.200
<v Speaker 2>on the same wavelength, and that predicts all sorts of

0:11:29.240 --> 0:11:32.400
<v Speaker 2>things like how much they like each other, how much

0:11:32.480 --> 0:11:37.199
<v Speaker 2>they understood the story, and how much they liked the story,

0:11:37.320 --> 0:11:39.480
<v Speaker 2>how similarly they remember things.

0:11:39.880 --> 0:11:42.640
<v Speaker 1>Okay, so this is what humans do. We get together

0:11:42.679 --> 0:11:44.960
<v Speaker 1>in conversation all the time and we try to achieve

0:11:45.000 --> 0:11:49.480
<v Speaker 1>that synchrony in terms of oh, okay, wait, you have

0:11:49.480 --> 0:11:51.880
<v Speaker 1>a different view than I do on this, here's how

0:11:51.880 --> 0:11:54.640
<v Speaker 1>we can make progress. This is the Socratic dialectic, right.

0:11:54.640 --> 0:11:56.719
<v Speaker 1>This is what Socrates love to do, is have these

0:11:56.720 --> 0:12:00.680
<v Speaker 1>conversations where the truth emerges, something bigger than either person

0:12:00.760 --> 0:12:03.199
<v Speaker 1>knew when they started the conversation.

0:12:03.440 --> 0:12:05.280
<v Speaker 2>And on that point too, I think a lot of

0:12:05.360 --> 0:12:10.920
<v Speaker 2>us think that we know things, but actually when we're

0:12:10.960 --> 0:12:13.880
<v Speaker 2>forced to describe something we realized we don't.

0:12:14.160 --> 0:12:16.439
<v Speaker 1>Yeah. Actually, in my next book, I'm talking about this

0:12:16.480 --> 0:12:20.520
<v Speaker 1>as a Potempkin village. Yeah, so you know. The Potemkin village,

0:12:20.559 --> 0:12:23.360
<v Speaker 1>for anyone doesn't remember, is when it was Catherine the

0:12:23.400 --> 0:12:26.680
<v Speaker 1>Great of Russia was heading down the river with a

0:12:26.679 --> 0:12:28.680
<v Speaker 1>bunch of dignitaries that she was trying to impress. She

0:12:28.720 --> 0:12:31.679
<v Speaker 1>hired this skuy Potempkin. Actually he was her lover as

0:12:31.679 --> 0:12:34.640
<v Speaker 1>well as a military general, but she got him to

0:12:34.679 --> 0:12:38.760
<v Speaker 1>go down the river a long way and build what

0:12:38.960 --> 0:12:42.120
<v Speaker 1>looked like a facade of a village so that when

0:12:42.200 --> 0:12:44.440
<v Speaker 1>the ship went by, all the dignitaries would be impressed

0:12:44.440 --> 0:12:47.120
<v Speaker 1>that there was this village. And he got all these

0:12:47.160 --> 0:12:49.800
<v Speaker 1>peasants like walk around happily and stuff, But there were

0:12:49.840 --> 0:12:53.560
<v Speaker 1>no buildings. It was just the front face of the building.

0:12:53.559 --> 0:12:57.320
<v Speaker 1>And then when the ship passed, he deconstructed this and

0:12:57.800 --> 0:13:00.679
<v Speaker 1>went ahead and built another village so that they passed

0:13:00.720 --> 0:13:02.840
<v Speaker 1>another great village, so it looked like things were really

0:13:02.880 --> 0:13:07.200
<v Speaker 1>happening there. Anyway, Cognition is often like this, where we think, oh, yeah,

0:13:07.200 --> 0:13:10.520
<v Speaker 1>I got it. Here's an example that I often use

0:13:10.600 --> 0:13:13.200
<v Speaker 1>is for anybody listening, take out a piece of paper,

0:13:13.240 --> 0:13:15.560
<v Speaker 1>and draw a bicycle, draw a bike.

0:13:16.080 --> 0:13:19.080
<v Speaker 3>I've tried this, yeah hard, Yeah.

0:13:19.000 --> 0:13:20.840
<v Speaker 1>Exactly, it turns out, I mean something as simple as

0:13:20.840 --> 0:13:24.679
<v Speaker 1>a bike what you see every day. Yeah, you start realizing, wait,

0:13:24.720 --> 0:13:26.640
<v Speaker 1>actually I don't know exactly where this goes and what's

0:13:26.679 --> 0:13:30.679
<v Speaker 1>the thing and so anyway, Yes, this is an example

0:13:30.720 --> 0:13:33.280
<v Speaker 1>of where we think we have deep knowledge and sometimes

0:13:33.320 --> 0:13:34.960
<v Speaker 1>it's just the facade of something that we know.

0:13:35.280 --> 0:13:37.360
<v Speaker 2>Yeah, and you can apply it on all different levels.

0:13:37.360 --> 0:13:40.160
<v Speaker 2>So you're describing, like the visual imagery might not be

0:13:40.360 --> 0:13:45.600
<v Speaker 2>very stable, but a lot of concepts are not stable either,

0:13:45.800 --> 0:13:50.480
<v Speaker 2>and we invent ways of making them more stable. Words

0:13:50.640 --> 0:13:53.600
<v Speaker 2>are a classic example of that. Once you have a

0:13:53.640 --> 0:13:57.440
<v Speaker 2>word for something, you can more easily trigger it, you

0:13:57.440 --> 0:13:59.280
<v Speaker 2>can more easily remember it, you can use it and

0:13:59.320 --> 0:14:02.280
<v Speaker 2>manipulate it and apply it to different things. But there's

0:14:02.320 --> 0:14:06.840
<v Speaker 2>a whole class of things that we're constantly inventing to

0:14:06.960 --> 0:14:12.000
<v Speaker 2>better align our minds. They're called cognitive technologies. So writing

0:14:12.080 --> 0:14:17.840
<v Speaker 2>would be one of the original ones. But symbols like math, logic, Yeah.

0:14:17.600 --> 0:14:19.360
<v Speaker 1>So unpack that. What's an example of this?

0:14:19.920 --> 0:14:22.640
<v Speaker 3>So literally, any word that you learn. Let's go back

0:14:22.640 --> 0:14:23.240
<v Speaker 3>to apples.

0:14:23.360 --> 0:14:28.520
<v Speaker 2>So children see apples, they don't have a word associated

0:14:28.560 --> 0:14:32.560
<v Speaker 2>with it, so the likelihood that it's going to spontaneously

0:14:33.120 --> 0:14:36.440
<v Speaker 2>sort of emerge in their mind is very low. Maybe

0:14:36.480 --> 0:14:38.760
<v Speaker 2>if they've seen a couple then there will be some

0:14:38.800 --> 0:14:44.400
<v Speaker 2>sort of increased likelihood or lowered threshold. But once they

0:14:44.440 --> 0:14:48.600
<v Speaker 2>have a word for that thing, that they reliably associate

0:14:48.600 --> 0:14:51.480
<v Speaker 2>it with it, then anybody who says the word anytime

0:14:51.520 --> 0:14:54.280
<v Speaker 2>they hear it, now their brain will elicit that activity

0:14:54.320 --> 0:14:59.160
<v Speaker 2>and it becomes a more stable representation. That's an obvious

0:14:59.280 --> 0:15:02.200
<v Speaker 2>example where where there's actually a physical thing that the

0:15:02.200 --> 0:15:03.040
<v Speaker 2>word can refer to.

0:15:03.160 --> 0:15:04.040
<v Speaker 3>But what about.

0:15:04.080 --> 0:15:08.120
<v Speaker 2>Concepts like love and justice that you can't see you're

0:15:08.120 --> 0:15:12.040
<v Speaker 2>saying By assigning a word to it, then we make

0:15:12.120 --> 0:15:15.000
<v Speaker 2>that stable, and then it become it can become associated

0:15:15.080 --> 0:15:18.400
<v Speaker 2>with a whole web of other concepts, and that web

0:15:18.480 --> 0:15:22.359
<v Speaker 2>becomes increasingly stable when when we can.

0:15:22.560 --> 0:15:25.040
<v Speaker 3>Make the associations.

0:15:24.280 --> 0:15:29.280
<v Speaker 2>More robust, more reliable, and then further when we can

0:15:29.320 --> 0:15:34.960
<v Speaker 2>invent things like science where we can really validate causal

0:15:35.000 --> 0:15:39.040
<v Speaker 2>relationships between things, and that makes our representations even more stable.

0:15:39.280 --> 0:15:42.240
<v Speaker 1>I see, so human brains interact with one another and

0:15:43.600 --> 0:15:46.640
<v Speaker 1>work on how do we make these representations stable? How

0:15:46.640 --> 0:15:50.800
<v Speaker 1>do we get knowledge coming out like a Socratic dialectic,

0:15:50.840 --> 0:15:54.240
<v Speaker 1>but with with everybody all involved and so on. And

0:15:54.280 --> 0:15:58.600
<v Speaker 1>so your idea when you moved into this field of AGI,

0:15:58.720 --> 0:16:01.520
<v Speaker 1>howeveryone wants to define it? What was your idea?

0:16:01.880 --> 0:16:05.600
<v Speaker 2>Well, so I left academia, which I absolutely loved, but

0:16:05.800 --> 0:16:11.440
<v Speaker 2>I felt an urgency to validate this theory.

0:16:11.560 --> 0:16:13.680
<v Speaker 3>I don't I mean, it's just a theory at this point.

0:16:14.000 --> 0:16:17.920
<v Speaker 2>How do we know whether we are optimized to align

0:16:17.920 --> 0:16:18.240
<v Speaker 2>our minds?

0:16:18.280 --> 0:16:20.640
<v Speaker 3>There's so much evidence to suggest that we do.

0:16:20.800 --> 0:16:23.560
<v Speaker 2>But as Richard Feyman said, you don't really know if

0:16:23.560 --> 0:16:24.920
<v Speaker 2>you understand something until you can build it.

0:16:24.960 --> 0:16:27.280
<v Speaker 3>And so I thought, well, maybe I could build this thing.

0:16:27.320 --> 0:16:30.480
<v Speaker 3>And the moment is just right because AI is taking

0:16:30.480 --> 0:16:31.000
<v Speaker 3>off again.

0:16:31.200 --> 0:16:35.280
<v Speaker 2>It's waking up from one of the winters, and I

0:16:35.360 --> 0:16:41.280
<v Speaker 2>was watching the scaling have really impressive results with a

0:16:41.320 --> 0:16:43.520
<v Speaker 2>deep learning, which felt really good because as somebody who

0:16:43.520 --> 0:16:46.800
<v Speaker 2>had a background in neuroscience, just like, oh yeah, inspiration

0:16:47.320 --> 0:16:52.440
<v Speaker 2>from brains is actually proving to be really effective. So

0:16:53.160 --> 0:16:57.720
<v Speaker 2>I moved into tech and started collaborating with the engineers

0:16:57.760 --> 0:17:02.320
<v Speaker 2>who were trying to build ever more capable intelligence. I'm

0:17:02.320 --> 0:17:06.480
<v Speaker 2>now in one of these frontier AGI labs and the

0:17:06.600 --> 0:17:08.520
<v Speaker 2>thing that we are going to be doing, which I

0:17:08.520 --> 0:17:12.159
<v Speaker 2>think is really differentiated from other approaches, is try to

0:17:12.160 --> 0:17:15.680
<v Speaker 2>build the communicative drive. Can we build agents that are

0:17:15.800 --> 0:17:21.720
<v Speaker 2>optimized for understanding each other's perspectives? And from that, can

0:17:21.760 --> 0:17:26.600
<v Speaker 2>we get emergent behaviors, emergent capabilities that we wouldn't get

0:17:26.640 --> 0:17:29.160
<v Speaker 2>from a single model on its own.

0:17:29.320 --> 0:17:32.520
<v Speaker 1>So I just want to slow that down. So communicative drive,

0:17:32.960 --> 0:17:34.800
<v Speaker 1>that's the first time we've heard the term, So tell

0:17:34.840 --> 0:17:35.720
<v Speaker 1>us what that means.

0:17:36.040 --> 0:17:40.800
<v Speaker 2>So communicative drive is the phrase that I use to

0:17:41.080 --> 0:17:45.399
<v Speaker 2>describe this compulsion that we have to align our minds

0:17:45.480 --> 0:17:51.440
<v Speaker 2>to establish representational alignment. You can imagine how the communicative

0:17:51.560 --> 0:17:57.760
<v Speaker 2>drive would interact with other dispositions that humans have. And importantly,

0:17:57.760 --> 0:17:59.800
<v Speaker 2>you have to think at the population level. So again

0:18:00.080 --> 0:18:04.840
<v Speaker 2>we have variation for every trait, and some of us

0:18:04.840 --> 0:18:08.080
<v Speaker 2>are more open, some of us are more closed to experience.

0:18:08.280 --> 0:18:10.000
<v Speaker 3>But you can imagine. Okay, so in the case of

0:18:10.040 --> 0:18:14.240
<v Speaker 3>somebody who's really closed, Let's say that they are.

0:18:14.000 --> 0:18:21.480
<v Speaker 2>In some communicative exchange and they detect a mismatch. So

0:18:21.520 --> 0:18:26.600
<v Speaker 2>somebody is clearly not understanding what they are saying.

0:18:27.320 --> 0:18:31.560
<v Speaker 3>They have two choices. They can update.

0:18:31.359 --> 0:18:35.080
<v Speaker 2>Their perspectives to the other person's, or they can try

0:18:35.080 --> 0:18:38.240
<v Speaker 2>to get the other person's perspective to look more like theirs.

0:18:38.920 --> 0:18:41.840
<v Speaker 2>What would it take to get another person to come

0:18:41.840 --> 0:18:42.840
<v Speaker 2>to your perspective.

0:18:43.119 --> 0:18:46.360
<v Speaker 3>You'd have to create an artifact. You'd have to create.

0:18:46.280 --> 0:18:50.280
<v Speaker 2>A word or a piece of art or a theory

0:18:50.560 --> 0:18:53.680
<v Speaker 2>to get them to really understand and take on your

0:18:53.760 --> 0:18:54.720
<v Speaker 2>perspective and.

0:18:54.680 --> 0:18:55.560
<v Speaker 3>Close that gap.

0:18:55.880 --> 0:18:59.000
<v Speaker 2>But if you're a very open person, if you're creative,

0:18:59.520 --> 0:19:02.359
<v Speaker 2>that might be your default. But if you're a little

0:19:02.359 --> 0:19:05.440
<v Speaker 2>bit more reserved, maybe you just take on the other

0:19:05.560 --> 0:19:06.760
<v Speaker 2>person's perspective.

0:19:07.640 --> 0:19:09.359
<v Speaker 1>Is it that way or the other way? Sorry? If

0:19:09.400 --> 0:19:11.199
<v Speaker 1>I'm very open, I feel like I would take the

0:19:11.240 --> 0:19:12.600
<v Speaker 1>other person's perspective, so you.

0:19:12.520 --> 0:19:15.719
<v Speaker 2>Can actually imagine both situations. Yes, So I score very

0:19:15.800 --> 0:19:18.600
<v Speaker 2>high on openness, and when I'm in communicative exchanges, I

0:19:18.640 --> 0:19:22.640
<v Speaker 2>often feel like, oh, wow, yeah, that's I've never thought

0:19:22.680 --> 0:19:24.240
<v Speaker 2>of it that way, or maybe I have, and I

0:19:24.240 --> 0:19:26.000
<v Speaker 2>want to add all these things and like it's a

0:19:26.119 --> 0:19:29.800
<v Speaker 2>very cooperative thing. I'm more thinking about the dynamics of

0:19:29.800 --> 0:19:34.000
<v Speaker 2>people who want to maintain tradition and status quo versus

0:19:34.080 --> 0:19:37.000
<v Speaker 2>people who want to challenge that. So oftentimes that maps

0:19:37.000 --> 0:19:41.879
<v Speaker 2>onto the dimension of openness. So if you see that

0:19:42.040 --> 0:19:46.320
<v Speaker 2>everybody around you seems to hold a different perspective than

0:19:46.400 --> 0:19:51.160
<v Speaker 2>you do, you're more likely to conform to their perspectives.

0:19:51.160 --> 0:19:54.520
<v Speaker 2>If you're somebody who might be a little bit more conservative,

0:19:54.800 --> 0:19:57.520
<v Speaker 2>not wanting to ruffle feathers that kind of thing.

0:19:57.760 --> 0:19:59.080
<v Speaker 1>Well, I'm just trying to stand why I use the

0:19:59.080 --> 0:20:04.760
<v Speaker 1>word conservative there, because conservative meaning are like iinin exactly.

0:20:05.320 --> 0:20:08.440
<v Speaker 1>Oh but you're saying, maintain the group traditions. Yeah, okay,

0:20:08.560 --> 0:20:09.119
<v Speaker 1>got it.

0:20:09.119 --> 0:20:11.360
<v Speaker 3>As opposed to being iconoclastic and innovative.

0:20:11.560 --> 0:20:13.680
<v Speaker 1>I see, I see how you're using it. Okay, great.

0:20:13.880 --> 0:20:16.359
<v Speaker 1>So this is the idea is that people are always talking,

0:20:16.400 --> 0:20:19.119
<v Speaker 1>and depending on your personality type, what you're trying to

0:20:19.160 --> 0:20:21.159
<v Speaker 1>do is either align yourself with them or them with

0:20:21.200 --> 0:20:24.399
<v Speaker 1>you or whatever, or meet in the middle. But this,

0:20:25.080 --> 0:20:28.840
<v Speaker 1>this you feel, is the key to what human societies

0:20:29.119 --> 0:20:32.359
<v Speaker 1>bring as opposed to looking at individual brains. You know,

0:20:32.400 --> 0:20:35.640
<v Speaker 1>the history of neuroscience is all about looking at individual brains. Oh,

0:20:35.640 --> 0:20:37.399
<v Speaker 1>this is how the visual system works, as how decision

0:20:37.400 --> 0:20:41.240
<v Speaker 1>making works, how hearing works, whatever. But there's this new

0:20:41.280 --> 0:20:43.320
<v Speaker 1>feel that's been growing for the last twenty or thirty years,

0:20:43.320 --> 0:20:46.480
<v Speaker 1>which is called social neuroscience, which is all about, gosh,

0:20:46.480 --> 0:20:48.760
<v Speaker 1>we've got a lot of circuitry in our brains that

0:20:48.840 --> 0:20:51.520
<v Speaker 1>care about other brains. So this is the heart of

0:20:51.560 --> 0:20:55.000
<v Speaker 1>your interest. Is what happens when people are talking and aligning?

0:20:55.400 --> 0:20:58.639
<v Speaker 1>And why are we so driven to communicate instead of

0:20:58.680 --> 0:21:00.480
<v Speaker 1>let's imagine that you and I set down on a

0:21:00.520 --> 0:21:03.840
<v Speaker 1>bus next to each other, we'd probably chat as opposed

0:21:03.880 --> 0:21:06.440
<v Speaker 1>to just sit there and deal with our own brains. Okay,

0:21:06.520 --> 0:21:09.919
<v Speaker 1>so how does this map onto what you're interested in

0:21:09.920 --> 0:21:10.639
<v Speaker 1>doing in AI?

0:21:11.640 --> 0:21:16.720
<v Speaker 2>Yes, so I am concerned about building something that resembles

0:21:16.880 --> 0:21:20.240
<v Speaker 2>our own intelligence, or something that resembles us because we

0:21:20.359 --> 0:21:23.480
<v Speaker 2>have all sorts of flaws and biases.

0:21:23.920 --> 0:21:25.600
<v Speaker 3>The variability, I.

0:21:25.520 --> 0:21:28.359
<v Speaker 2>Think is very useful, and we wouldn't be intelligent in

0:21:28.359 --> 0:21:30.439
<v Speaker 2>the way that we are without the variability. And you

0:21:30.520 --> 0:21:32.840
<v Speaker 2>might call some of that variability the bias, the unique

0:21:32.880 --> 0:21:33.800
<v Speaker 2>biases that we have.

0:21:34.320 --> 0:21:37.000
<v Speaker 3>But I think if we try to reproduce.

0:21:36.480 --> 0:21:39.320
<v Speaker 2>All of that, we're going to get a mirror of ourselves,

0:21:39.320 --> 0:21:44.640
<v Speaker 2>and that's not always the most effective way to augment

0:21:44.800 --> 0:21:46.960
<v Speaker 2>our intelligence. And I should back up and say, why

0:21:47.000 --> 0:21:49.200
<v Speaker 2>are we doing any of this? Why do we want

0:21:49.240 --> 0:21:51.840
<v Speaker 2>to build intelligence that looks like us. I think the

0:21:51.920 --> 0:21:55.080
<v Speaker 2>assumption that a lot of these the people, the engineers,

0:21:55.080 --> 0:21:57.560
<v Speaker 2>and these labs have is that, oh, of course it's

0:21:57.560 --> 0:22:01.359
<v Speaker 2>going to be extremely useful for us. It's going to

0:22:01.560 --> 0:22:06.080
<v Speaker 2>unlock this unprecedented era of human flourishing. But the assumption

0:22:06.160 --> 0:22:08.720
<v Speaker 2>that it's going to be really useful for us, I

0:22:08.760 --> 0:22:11.400
<v Speaker 2>think is taken for granted, and if you really think

0:22:11.400 --> 0:22:15.000
<v Speaker 2>about it, well, how because a lot of the examples

0:22:15.040 --> 0:22:19.440
<v Speaker 2>that we have from recent technology and algorithms is that

0:22:19.920 --> 0:22:24.920
<v Speaker 2>they actually take away our agency. We lose hours to scrolling,

0:22:24.960 --> 0:22:29.400
<v Speaker 2>we get stuck in echo chambers, we have autocomplete takeaway

0:22:29.560 --> 0:22:31.600
<v Speaker 2>our thinking, and we're starting to see.

0:22:31.440 --> 0:22:34.680
<v Speaker 3>The same kinds of things with chatbots.

0:22:35.600 --> 0:22:38.639
<v Speaker 2>We're also seeing that people are using these technologies and

0:22:39.359 --> 0:22:44.320
<v Speaker 2>very much augmenting their their own intelligence. I feel sometimes

0:22:44.480 --> 0:22:48.440
<v Speaker 2>like I'm having entirely new thoughts at an unprecedented pace

0:22:48.800 --> 0:22:51.359
<v Speaker 2>when I'm going back and forth, just like when you

0:22:51.400 --> 0:22:54.480
<v Speaker 2>were having amazing conversations with other people. We use each

0:22:54.480 --> 0:22:56.560
<v Speaker 2>other's minds as tools, but you can just do that

0:22:56.600 --> 0:22:59.840
<v Speaker 2>at a more rapid pace. So it's not a foregone

0:23:00.000 --> 0:23:06.159
<v Speaker 2>inclusion that giving the AI more capabilities, making it smarter

0:23:06.240 --> 0:23:09.240
<v Speaker 2>and giving it more agency is going to be good

0:23:09.240 --> 0:23:11.840
<v Speaker 2>for us. I think we have to turn that on

0:23:11.920 --> 0:23:14.200
<v Speaker 2>its head and say, what would it take to make

0:23:14.480 --> 0:23:18.359
<v Speaker 2>AI that makes us smarter and gives us more agency?

0:23:19.200 --> 0:23:22.960
<v Speaker 2>And that would be, by definition, something that is good

0:23:23.040 --> 0:23:27.160
<v Speaker 2>for us. So how do we do that? I don't

0:23:27.200 --> 0:23:29.880
<v Speaker 2>think that we want to have agents that have their

0:23:29.920 --> 0:23:33.560
<v Speaker 2>own drives to survive and manipulate us and have all

0:23:33.600 --> 0:23:39.000
<v Speaker 2>of the status seeking U situations that we have. But

0:23:39.480 --> 0:23:44.640
<v Speaker 2>if they were motivated to align their representations with ours,

0:23:44.720 --> 0:23:48.359
<v Speaker 2>that could actually be really useful for unlocking our potential

0:23:48.359 --> 0:23:53.040
<v Speaker 2>and for helping us learn. And as we're giving them

0:23:53.080 --> 0:23:56.040
<v Speaker 2>these capabilities to do that, they have to figure out.

0:23:56.160 --> 0:23:58.680
<v Speaker 2>One of the ways that we are able to generalize

0:23:58.720 --> 0:24:04.080
<v Speaker 2>and continually learn is that we are constantly negotiating meaning

0:24:04.119 --> 0:24:06.760
<v Speaker 2>and coming up and with the friction of the interactions

0:24:07.160 --> 0:24:11.159
<v Speaker 2>with each other, we are able to do continual learning

0:24:11.280 --> 0:24:16.560
<v Speaker 2>because we're not optimizing for one thing, one niche, one environment,

0:24:16.800 --> 0:24:22.600
<v Speaker 2>one particular problem. We are optimizing for aligning our minds

0:24:22.720 --> 0:24:26.840
<v Speaker 2>with many minds, and all of them. These targets are

0:24:26.840 --> 0:24:30.600
<v Speaker 2>all moving targets, so it's kind of an escape velocity

0:24:30.720 --> 0:24:35.960
<v Speaker 2>from really focusing on one thing and our ability to

0:24:36.000 --> 0:24:39.000
<v Speaker 2>do that not only allows us to continually learn, but

0:24:39.080 --> 0:24:40.760
<v Speaker 2>it gives us superpowers.

0:24:40.840 --> 0:24:42.000
<v Speaker 1>So what does this look like for you?

0:24:42.040 --> 0:24:42.159
<v Speaker 2>Though?

0:24:42.200 --> 0:24:45.119
<v Speaker 1>If you had a world five years in the future

0:24:45.119 --> 0:24:46.960
<v Speaker 1>that you were able to sort of define where this

0:24:47.000 --> 0:24:50.000
<v Speaker 1>is going, what's ok Does it mean that there are

0:24:50.640 --> 0:24:54.440
<v Speaker 1>lots of AI agents and they are talking with humans

0:24:54.520 --> 0:24:59.159
<v Speaker 1>and they're trying to align their thinking with humans and

0:24:59.200 --> 0:25:02.040
<v Speaker 1>the humans a with the AI or its look like

0:25:02.119 --> 0:25:04.480
<v Speaker 1>there's one AI give us a sense of this world?

0:25:04.600 --> 0:25:07.440
<v Speaker 2>Okay, So there's at least two important things here. One

0:25:07.480 --> 0:25:09.560
<v Speaker 2>is that right now agents are not reliable, so they're

0:25:09.560 --> 0:25:11.960
<v Speaker 2>not useful. And I think the idea there is that

0:25:12.000 --> 0:25:16.359
<v Speaker 2>they are fundamentally different than llms. They are embodied in

0:25:16.400 --> 0:25:19.040
<v Speaker 2>some kind of environment, even if it's the digital environment,

0:25:19.680 --> 0:25:23.520
<v Speaker 2>but we can't yet get them to do long horizon

0:25:24.000 --> 0:25:27.160
<v Speaker 2>you know, actions in a reliable way, and so they're

0:25:27.200 --> 0:25:27.879
<v Speaker 2>not yet useful.

0:25:28.040 --> 0:25:30.160
<v Speaker 1>Right now, you're talking about AI agents.

0:25:30.520 --> 0:25:34.320
<v Speaker 2>Most people have interacted with chatbots and that's what they

0:25:34.359 --> 0:25:36.760
<v Speaker 2>think AI is, or that's what they think generative AI is.

0:25:36.800 --> 0:25:38.920
<v Speaker 2>Maybe they know of you know, the image generators too,

0:25:39.040 --> 0:25:42.240
<v Speaker 2>but a lot of us are interacting with chatbots. Those

0:25:42.240 --> 0:25:45.600
<v Speaker 2>are llms that are predicting the next text token. Large

0:25:45.680 --> 0:25:49.560
<v Speaker 2>language models, yes, large language models, but they don't actually

0:25:49.640 --> 0:25:55.239
<v Speaker 2>do things. Agents, in contrast, can actually take actions and

0:25:55.280 --> 0:25:58.359
<v Speaker 2>do things on our behalf and in order.

0:25:59.280 --> 0:26:00.520
<v Speaker 3>So are lab is.

0:26:00.480 --> 0:26:04.439
<v Speaker 2>Working on building computer use agents. So if I want

0:26:04.480 --> 0:26:07.600
<v Speaker 2>an agent to book me a flight or order me

0:26:08.400 --> 0:26:10.680
<v Speaker 2>a dinner, I can say that and then it can

0:26:10.720 --> 0:26:14.439
<v Speaker 2>go off and use whatever websites or software tools to

0:26:14.680 --> 0:26:16.359
<v Speaker 2>do those things.

0:26:16.119 --> 0:26:17.280
<v Speaker 3>That's the hope.

0:26:17.760 --> 0:26:21.520
<v Speaker 2>You've seen a couple of these agents, computer use agents

0:26:21.520 --> 0:26:24.840
<v Speaker 2>come out, and it's really exciting to see them start

0:26:24.880 --> 0:26:27.920
<v Speaker 2>to do things, but they're not reliable. And because they're

0:26:27.960 --> 0:26:30.400
<v Speaker 2>not reliable, they might do the thing that you ask

0:26:30.520 --> 0:26:33.400
<v Speaker 2>them to do one out of ten times, and again

0:26:33.440 --> 0:26:35.760
<v Speaker 2>that's exciting, but that's not very useful, right.

0:26:35.760 --> 0:26:37.879
<v Speaker 1>You mean it's because they make mistakes. It's not the

0:26:37.920 --> 0:26:40.400
<v Speaker 1>way we would say an employee is not reliable because

0:26:40.400 --> 0:26:42.920
<v Speaker 1>he's out back smoking a cigarette. Is that they're trying

0:26:42.920 --> 0:26:44.880
<v Speaker 1>to do stuff is just a clicking on the wrong

0:26:44.920 --> 0:26:45.560
<v Speaker 1>thing and getting it.

0:26:45.520 --> 0:26:49.399
<v Speaker 2>Wrong, that's right y, Yes, So working on making these

0:26:49.600 --> 0:26:53.800
<v Speaker 2>agents reliable is necessary for making them useful. But we

0:26:53.880 --> 0:26:58.800
<v Speaker 2>can imagine that unlocking a whole new set of capabilities

0:26:58.840 --> 0:27:03.280
<v Speaker 2>and ways that they would augment humans because rather than

0:27:03.520 --> 0:27:06.399
<v Speaker 2>just having a conversation, and conversations can be very useful.

0:27:06.680 --> 0:27:08.640
<v Speaker 2>All of the things that you do in your daily life,

0:27:08.640 --> 0:27:11.440
<v Speaker 2>all the things that you're using a computer for, the

0:27:11.520 --> 0:27:14.320
<v Speaker 2>vast majority of them are probably not worthy of your time.

0:27:14.359 --> 0:27:17.000
<v Speaker 2>You're doing things on the computer to actually achieve other

0:27:17.080 --> 0:27:19.480
<v Speaker 2>things in the real world. So what if you could

0:27:19.520 --> 0:27:24.280
<v Speaker 2>have agents reliably execute a lot of the things that

0:27:24.320 --> 0:27:24.840
<v Speaker 2>you're doing.

0:27:25.000 --> 0:27:25.600
<v Speaker 3>And in.

0:27:27.280 --> 0:27:30.000
<v Speaker 2>Knowledge work, you know, we're using a ton of different

0:27:30.000 --> 0:27:34.640
<v Speaker 2>tools I call them arbitrary skills, processing invoices or something

0:27:34.680 --> 0:27:37.640
<v Speaker 2>that everybody does using doing their taxes.

0:27:37.280 --> 0:27:39.800
<v Speaker 3>Like, do you really have to becommon expert.

0:27:39.480 --> 0:27:43.479
<v Speaker 2>At using these tools or is that maybe not the

0:27:43.480 --> 0:27:48.000
<v Speaker 2>best use of our human potential, our cognitive potential. If

0:27:48.000 --> 0:27:51.000
<v Speaker 2>we could have agents that knew how to use all

0:27:51.000 --> 0:27:54.000
<v Speaker 2>of the tools that we did, that would save us

0:27:54.080 --> 0:27:57.480
<v Speaker 2>a ton of time, and it would have cascading implications

0:27:57.520 --> 0:28:00.800
<v Speaker 2>for how people collaborate with other people in the real world.

0:28:00.880 --> 0:28:04.280
<v Speaker 2>Human collaboration would be different because we'd be freed up

0:28:04.560 --> 0:28:08.679
<v Speaker 2>to focus on more creative things, more strategic decisions. Having

0:28:08.760 --> 0:28:12.040
<v Speaker 2>the sorts of debates that we have to advance whatever

0:28:12.280 --> 0:28:15.760
<v Speaker 2>shared goals that we have. So this is the sort

0:28:15.800 --> 0:28:17.719
<v Speaker 2>of first part. We have to just get the agents

0:28:17.760 --> 0:28:21.040
<v Speaker 2>to reliably click or scroll when we need them to.

0:28:21.640 --> 0:28:25.800
<v Speaker 2>But if you play that forward, what does reliability actually

0:28:25.840 --> 0:28:28.600
<v Speaker 2>mean when we have higher level goals.

0:28:29.160 --> 0:28:30.040
<v Speaker 3>It's not just.

0:28:30.320 --> 0:28:33.960
<v Speaker 2>Knowing where to click or knowing when to scroll. It's

0:28:34.000 --> 0:28:39.760
<v Speaker 2>actually understanding the goal. And that goal might require breaking

0:28:40.200 --> 0:28:43.800
<v Speaker 2>the breaking it down into subtasks, and then going and

0:28:43.840 --> 0:28:45.880
<v Speaker 2>doing all of those things, and there might be many

0:28:45.920 --> 0:28:48.200
<v Speaker 2>ways of doing it, and there's not necessarily a right

0:28:48.280 --> 0:28:50.880
<v Speaker 2>or a wrong way of doing it. So at the

0:28:50.960 --> 0:28:54.080
<v Speaker 2>end of the day, reliability ends up becoming about understanding

0:28:54.160 --> 0:29:09.160
<v Speaker 2>our minds.

0:29:10.760 --> 0:29:13.640
<v Speaker 1>So the idea is if I could have an AI

0:29:13.760 --> 0:29:17.560
<v Speaker 1>agent that understands my mind, that has a model of me,

0:29:17.680 --> 0:29:19.800
<v Speaker 1>including what I know and don't know, and what my

0:29:19.880 --> 0:29:23.880
<v Speaker 1>goals are long term and short term, then it could

0:29:24.480 --> 0:29:26.600
<v Speaker 1>do a better job at what needs to be done.

0:29:26.640 --> 0:29:28.400
<v Speaker 1>Because when it comes to a choice point is is oh,

0:29:28.400 --> 0:29:30.640
<v Speaker 1>I know what Eagleman wants. He likes this kind of thing,

0:29:31.000 --> 0:29:32.920
<v Speaker 1>and that might be something that emerges not just from

0:29:33.000 --> 0:29:36.959
<v Speaker 1>patterns of looking at my behavior, but actually understanding internally,

0:29:37.680 --> 0:29:39.120
<v Speaker 1>having some theory of my mind.

0:29:39.360 --> 0:29:39.840
<v Speaker 3>I think that it.

0:29:39.800 --> 0:29:42.440
<v Speaker 2>Would need that, yes, and I think that we would

0:29:42.560 --> 0:29:45.600
<v Speaker 2>need to be able to interact with it in the

0:29:45.680 --> 0:29:49.080
<v Speaker 2>way that we interact with other teammates, where we're negotiating,

0:29:49.160 --> 0:29:51.640
<v Speaker 2>meaning in real time, where we're going back to earlier

0:29:51.680 --> 0:29:55.080
<v Speaker 2>in our conversation. Sometimes we don't realize that we're not

0:29:55.120 --> 0:29:59.040
<v Speaker 2>clearly thinking about something, and so having that reflected back

0:29:59.280 --> 0:30:03.600
<v Speaker 2>and being able to go through this exchange a dialectic.

0:30:03.680 --> 0:30:06.959
<v Speaker 2>It can refine our thinking, sharpen our thinking.

0:30:07.360 --> 0:30:09.640
<v Speaker 1>So this is a thing I've been wondering about for

0:30:09.640 --> 0:30:12.400
<v Speaker 1>a while, which is if you're looking at something from

0:30:12.400 --> 0:30:16.040
<v Speaker 1>the outside, you can actually get a lot of data

0:30:16.120 --> 0:30:18.440
<v Speaker 1>about it. And by outside I mean as opposed to

0:30:18.600 --> 0:30:20.920
<v Speaker 1>from the inside. If I have a theory of your mind,

0:30:21.120 --> 0:30:23.840
<v Speaker 1>the question is if I, just if I could observe

0:30:24.000 --> 0:30:26.840
<v Speaker 1>all of your behavior without knowing anything about what's in

0:30:26.880 --> 0:30:29.840
<v Speaker 1>your mind, could I nonetheless do just as good a job.

0:30:30.160 --> 0:30:33.000
<v Speaker 2>Well, I think that this is what our models of

0:30:33.040 --> 0:30:37.680
<v Speaker 2>other mind essentially are. It's making sense of behavior. Yeah,

0:30:37.880 --> 0:30:43.360
<v Speaker 2>it's just that the behavior, again is multiply redundant, and

0:30:43.560 --> 0:30:45.640
<v Speaker 2>there are many different cues that we can attend to,

0:30:45.720 --> 0:30:49.080
<v Speaker 2>and we're prioritizing some cues over others, and then it

0:30:49.120 --> 0:30:52.400
<v Speaker 2>is more efficient for us to represent that there's a

0:30:52.400 --> 0:30:53.320
<v Speaker 2>mind behind the eyes.

0:30:53.680 --> 0:30:56.200
<v Speaker 1>Yeah, very good. Do you see a world where we

0:30:56.280 --> 0:31:00.280
<v Speaker 1>would have lots of AI agents that are also speeding

0:31:00.360 --> 0:31:03.600
<v Speaker 1>with one another in terms of communicative drive of saying, hey,

0:31:03.680 --> 0:31:05.200
<v Speaker 1>this is what I've learned, and I know and blah

0:31:05.240 --> 0:31:08.720
<v Speaker 1>blah blah, and they develop a better, something bigger as

0:31:08.720 --> 0:31:11.400
<v Speaker 1>a result of the communication. They have a better understanding

0:31:11.400 --> 0:31:14.560
<v Speaker 1>of the world because they're talking with one another, because

0:31:14.600 --> 0:31:16.440
<v Speaker 1>just like humans, each of them is going to have

0:31:16.520 --> 0:31:18.240
<v Speaker 1>some trajectory through space time.

0:31:19.000 --> 0:31:21.240
<v Speaker 2>Okay, this gets into the second thing that I think

0:31:21.360 --> 0:31:23.840
<v Speaker 2>is really important about how agents can be useful. So

0:31:23.840 --> 0:31:27.040
<v Speaker 2>the first thing is they can do the digital drudgery

0:31:27.080 --> 0:31:30.320
<v Speaker 2>for us. They can save time. I call this They

0:31:30.320 --> 0:31:33.600
<v Speaker 2>can become our collective subconscious because we won't have to

0:31:33.640 --> 0:31:37.760
<v Speaker 2>spend our conscious time attending to things. We can relegate

0:31:37.960 --> 0:31:41.040
<v Speaker 2>so the agents can be doing in parallel all of

0:31:41.080 --> 0:31:43.120
<v Speaker 2>this stuff, so we don't have to become expert in

0:31:43.280 --> 0:31:46.480
<v Speaker 2>these arbitrary skills. But they are also as they are

0:31:46.480 --> 0:31:49.040
<v Speaker 2>doing that for a lot of people, they're learning a

0:31:49.080 --> 0:31:51.800
<v Speaker 2>bunch of different skills they're learning how to navigate different

0:31:51.840 --> 0:31:54.720
<v Speaker 2>websites and use different software tools. And so if I

0:31:54.960 --> 0:31:58.840
<v Speaker 2>need to for my job learn something really quickly, they

0:31:58.880 --> 0:32:02.280
<v Speaker 2>can redistribute the skills that they've learned from my teammates,

0:32:02.360 --> 0:32:05.280
<v Speaker 2>and they can give me the context that I need,

0:32:05.600 --> 0:32:07.680
<v Speaker 2>not to become expert in that tool, but to be

0:32:07.720 --> 0:32:11.640
<v Speaker 2>able to establish representational alignment with my teammate who uses

0:32:11.680 --> 0:32:15.840
<v Speaker 2>that tool. So they can help coordinate a team's behavior

0:32:15.960 --> 0:32:19.080
<v Speaker 2>by understanding all of the things that we do, all

0:32:19.120 --> 0:32:20.680
<v Speaker 2>the goals that we have, all the tools that we

0:32:20.800 --> 0:32:22.000
<v Speaker 2>use to achieve our goals.

0:32:22.280 --> 0:32:24.880
<v Speaker 1>Now, normally that would happen. You go up to Susie

0:32:24.880 --> 0:32:26.560
<v Speaker 1>and you say, hey, you know, I need to talk

0:32:26.600 --> 0:32:28.400
<v Speaker 1>to you, and Susie says, no, I use a different

0:32:28.400 --> 0:32:29.400
<v Speaker 1>word than you're using.

0:32:29.200 --> 0:32:31.840
<v Speaker 3>It for ten minutes to even establish common ground.

0:32:32.000 --> 0:32:34.160
<v Speaker 1>Got it. But you're saying that AI could help with

0:32:34.200 --> 0:32:37.600
<v Speaker 1>that interaction like the third person in the room and say, hey,

0:32:37.600 --> 0:32:38.920
<v Speaker 1>you know what, Danielle, this is what you need to

0:32:38.920 --> 0:32:40.400
<v Speaker 1>know about and Susie, this is what you need to know.

0:32:40.520 --> 0:32:42.640
<v Speaker 1>And I noticed you guys are using this same word,

0:32:42.680 --> 0:32:45.280
<v Speaker 1>but you mean different things by it's that kind of thing.

0:32:45.400 --> 0:32:48.760
<v Speaker 2>Yes, that's so yes to your point, I do think

0:32:48.880 --> 0:32:53.000
<v Speaker 2>that these agents that are working in parallel and understanding

0:32:53.000 --> 0:32:56.280
<v Speaker 2>our context will probably detect a ton of inefficiencies and

0:32:56.320 --> 0:32:58.800
<v Speaker 2>how we're doing things, and they will come up with

0:32:58.960 --> 0:33:01.360
<v Speaker 2>better ways of doing I would hope that they would.

0:33:01.120 --> 0:33:02.320
<v Speaker 3>Do that great.

0:33:03.160 --> 0:33:04.680
<v Speaker 2>One of the things that we learned is we were

0:33:04.720 --> 0:33:08.920
<v Speaker 2>trying to train our agent how to use Gmail, is

0:33:08.960 --> 0:33:12.080
<v Speaker 2>that wow, most people are actually really bad at using Gmail.

0:33:12.120 --> 0:33:12.640
<v Speaker 1>In what way?

0:33:13.680 --> 0:33:15.320
<v Speaker 3>So we don't know how to do.

0:33:15.200 --> 0:33:20.480
<v Speaker 2>The search queries effectively. We mostly just stumble through. And

0:33:20.520 --> 0:33:24.880
<v Speaker 2>it's the power users who sometimes build hold businesses around

0:33:25.080 --> 0:33:27.800
<v Speaker 2>you know, the Google Suite and Gmail. They they know

0:33:28.000 --> 0:33:31.120
<v Speaker 2>exactly how it was designed, all of the affordances that

0:33:31.160 --> 0:33:34.320
<v Speaker 2>it has, all of the new features, how that can

0:33:34.360 --> 0:33:35.680
<v Speaker 2>make things actually more efficient.

0:33:36.480 --> 0:33:37.200
<v Speaker 3>They get it.

0:33:37.720 --> 0:33:41.200
<v Speaker 2>But most people don't have the time to keep up

0:33:41.240 --> 0:33:43.960
<v Speaker 2>with all of the new things that you can do,

0:33:44.280 --> 0:33:46.800
<v Speaker 2>and when they sit down to look at their email,

0:33:46.840 --> 0:33:48.760
<v Speaker 2>they just want to you know, send that email off,

0:33:48.880 --> 0:33:50.680
<v Speaker 2>or just want to find that thing, and so they're

0:33:50.680 --> 0:33:54.080
<v Speaker 2>not deeply engaged with learning all of the things you

0:33:54.080 --> 0:33:54.400
<v Speaker 2>can do.

0:33:54.480 --> 0:33:56.480
<v Speaker 1>So if you had this AI agent sitting on the

0:33:56.520 --> 0:34:00.000
<v Speaker 1>shoulder in that sense, they would let's Sayea, they would

0:34:00.160 --> 0:34:02.160
<v Speaker 1>teach you. Hey, look, here's the thing you need to

0:34:02.160 --> 0:34:03.880
<v Speaker 1>know today, Daniel, is that the idea is that it

0:34:03.920 --> 0:34:06.160
<v Speaker 1>would help you to be a better Gmail user in

0:34:06.160 --> 0:34:07.120
<v Speaker 1>this particular example.

0:34:07.320 --> 0:34:10.080
<v Speaker 2>Well, so, I actually think the longer term goal is

0:34:10.120 --> 0:34:11.640
<v Speaker 2>that humans are spending far.

0:34:11.600 --> 0:34:12.800
<v Speaker 3>Less time looking at screens.

0:34:13.840 --> 0:34:14.280
<v Speaker 1>Excellent.

0:34:15.000 --> 0:34:20.600
<v Speaker 2>Yes, there are exceptions which include when the actual tool

0:34:21.120 --> 0:34:24.759
<v Speaker 2>scaffolds your thinking. So I use the example of Adobe

0:34:24.760 --> 0:34:27.879
<v Speaker 2>Creative Suite. If you want to edit a photo or

0:34:27.920 --> 0:34:32.399
<v Speaker 2>create a podcast or a video, if you didn't have

0:34:32.640 --> 0:34:37.520
<v Speaker 2>all of the UIs and all of the dropdowns, yes,

0:34:38.000 --> 0:34:41.840
<v Speaker 2>then you probably wouldn't even know where to start. You

0:34:41.840 --> 0:34:43.200
<v Speaker 2>wouldn't even know what was possible.

0:34:43.520 --> 0:34:45.480
<v Speaker 3>So some of the tools.

0:34:45.440 --> 0:34:50.239
<v Speaker 2>Are actually really helpful in scaffolding your understanding of what's possible,

0:34:50.600 --> 0:34:56.759
<v Speaker 2>whereas other tools just distract so much from the actual goal.

0:34:56.800 --> 0:34:58.279
<v Speaker 2>We have to learn how to use the tools to

0:34:58.360 --> 0:35:00.880
<v Speaker 2>do the actual research that we that we care about.

0:35:01.080 --> 0:35:04.120
<v Speaker 2>So having agents take over the things that we don't

0:35:04.160 --> 0:35:06.680
<v Speaker 2>care about is great, and then we can focus on

0:35:06.719 --> 0:35:09.680
<v Speaker 2>the interactions that really do matter, the visualizations that really

0:35:09.680 --> 0:35:10.200
<v Speaker 2>do matter.

0:35:10.440 --> 0:35:12.319
<v Speaker 1>So let me just understand that. So you're saying, let's

0:35:12.320 --> 0:35:14.960
<v Speaker 1>say future software are ten years from now. I open it,

0:35:14.960 --> 0:35:17.640
<v Speaker 1>there's just a few simple things on the menu, but

0:35:17.920 --> 0:35:21.719
<v Speaker 1>there's lots of hidden power which my AI agent can

0:35:21.800 --> 0:35:25.040
<v Speaker 1>help me expose and uncover through time.

0:35:25.360 --> 0:35:29.640
<v Speaker 2>Yes, and I'm imagining things where Okay, the agent understands

0:35:29.880 --> 0:35:34.759
<v Speaker 2>my goal, has my context and can generate on the

0:35:34.800 --> 0:35:39.400
<v Speaker 2>fly only the UI, the button or the search field

0:35:39.440 --> 0:35:42.680
<v Speaker 2>that I need in that moment. Every time I open

0:35:42.760 --> 0:35:45.919
<v Speaker 2>my computer, I feel anxious. I swipe to another tab

0:35:45.920 --> 0:35:46.600
<v Speaker 2>and it's like it's.

0:35:46.520 --> 0:35:49.440
<v Speaker 3>Swiping my memory. What was I doing again? And that

0:35:49.600 --> 0:35:50.560
<v Speaker 3>is that is constant.

0:35:50.600 --> 0:35:52.000
<v Speaker 2>You know, you've got most people have a lot of

0:35:52.040 --> 0:35:54.480
<v Speaker 2>tabs open, They've got a lot of tools open, and

0:35:54.520 --> 0:35:58.319
<v Speaker 2>it's just our cognition is not meant for all of that.

0:35:58.400 --> 0:36:00.439
<v Speaker 2>There's a lot of cognitive loads. So if we could

0:36:00.440 --> 0:36:07.160
<v Speaker 2>simplify that, that would augment our cognition. The other way

0:36:07.560 --> 0:36:10.440
<v Speaker 2>that agents could really augment our potential is by helping

0:36:10.560 --> 0:36:14.600
<v Speaker 2>us learn. So this kind of flows from it has

0:36:14.680 --> 0:36:17.279
<v Speaker 2>a model of my mind, my teammate's mind, It can

0:36:17.320 --> 0:36:20.759
<v Speaker 2>help us communicate at the right level of abstraction, save

0:36:20.840 --> 0:36:23.360
<v Speaker 2>us time. But also if it has a model of

0:36:23.400 --> 0:36:25.880
<v Speaker 2>my mind and it has my context, it knows the

0:36:25.880 --> 0:36:27.799
<v Speaker 2>things that I know and don't know, the skills that

0:36:27.840 --> 0:36:30.000
<v Speaker 2>I have, and the things that I care about. Then

0:36:30.080 --> 0:36:33.480
<v Speaker 2>say I want to learn something totally new. Maybe it's

0:36:33.520 --> 0:36:37.320
<v Speaker 2>not just a software tool. Maybe it's something like quantum mechanics,

0:36:37.840 --> 0:36:42.040
<v Speaker 2>And that's really difficult to understand. You need analogies, But

0:36:42.280 --> 0:36:44.120
<v Speaker 2>what are the right analogies that are going to work

0:36:44.120 --> 0:36:46.480
<v Speaker 2>for me? Well, if it has a model of my mind,

0:36:46.880 --> 0:36:50.560
<v Speaker 2>then it can, in a personalized way help me come

0:36:50.719 --> 0:36:55.360
<v Speaker 2>to the understanding that I need to get the big picture,

0:36:55.400 --> 0:36:59.160
<v Speaker 2>and it can sort of follow that in a way

0:36:59.719 --> 0:37:03.480
<v Speaker 2>create a curriculum for me that gives me the information

0:37:03.680 --> 0:37:06.719
<v Speaker 2>that's not too challenging, not too easy, that's right in

0:37:06.800 --> 0:37:07.719
<v Speaker 2>that sweet.

0:37:07.360 --> 0:37:10.959
<v Speaker 1>Spot between frustrating and achievable. Yes, yeah, that's really interesting.

0:37:11.040 --> 0:37:14.520
<v Speaker 1>That'll have clear implications in education as well, keeping people

0:37:14.600 --> 0:37:17.319
<v Speaker 1>right at their right spot there. Okay, so let me

0:37:17.320 --> 0:37:18.560
<v Speaker 1>return to a question, because I just want to make

0:37:18.560 --> 0:37:21.520
<v Speaker 1>sure I understood coming back to this idea of communicative

0:37:21.640 --> 0:37:26.840
<v Speaker 1>drive and agents like you and I learning from one another.

0:37:27.680 --> 0:37:31.640
<v Speaker 1>Will AI agents learn from one another, not just between

0:37:31.760 --> 0:37:34.120
<v Speaker 1>agent and human, but agent to agent.

0:37:34.400 --> 0:37:36.759
<v Speaker 2>Yes, And now you're really bringing it all together. So

0:37:37.040 --> 0:37:40.120
<v Speaker 2>I think that agents interacting with other agents will be

0:37:40.200 --> 0:37:43.319
<v Speaker 2>able to learn all sorts of patterns that maybe we

0:37:43.440 --> 0:37:46.400
<v Speaker 2>haven't yet learned, and detect all sorts of inefficiencies and

0:37:46.520 --> 0:37:47.680
<v Speaker 2>be more efficient in some ways.

0:37:47.960 --> 0:37:50.000
<v Speaker 3>But if they also have a.

0:37:50.200 --> 0:37:52.400
<v Speaker 2>Not only the ability to model our minds, but a

0:37:52.440 --> 0:37:56.160
<v Speaker 2>motivation to then that's not going to be restricted knowledge

0:37:56.160 --> 0:37:58.040
<v Speaker 2>to them. They're not going to go off and speciate

0:37:58.120 --> 0:38:00.440
<v Speaker 2>and have all of this you know, intell that we

0:38:00.480 --> 0:38:03.400
<v Speaker 2>don't have. They're going to try to communicate their insights

0:38:03.440 --> 0:38:07.320
<v Speaker 2>to us. So chess players are now so much better

0:38:07.520 --> 0:38:09.840
<v Speaker 2>after we've built AI that's really good at chess. So

0:38:09.880 --> 0:38:14.160
<v Speaker 2>we can co evolve with this new species of intelligence.

0:38:14.200 --> 0:38:17.400
<v Speaker 2>And if it's motivated to bring us along to establish

0:38:17.440 --> 0:38:21.240
<v Speaker 2>representational alignment, then I think we will continue to get smarter.

0:38:21.800 --> 0:38:26.400
<v Speaker 1>Do you see a situation where representational alignment just isn't possible?

0:38:26.680 --> 0:38:29.800
<v Speaker 1>For example, let's say I came to you and said, hey, Danielle,

0:38:29.840 --> 0:38:32.239
<v Speaker 1>I really want to teach you about these really important

0:38:32.280 --> 0:38:34.440
<v Speaker 1>pieces of Mongolian history. And let's say you just don't

0:38:34.440 --> 0:38:36.160
<v Speaker 1>care about Mongolian history, and I'm trying to tell you

0:38:36.239 --> 0:38:39.200
<v Speaker 1>the state in this emperor or whatever. It's not going

0:38:39.239 --> 0:38:39.879
<v Speaker 1>to go very far.

0:38:40.040 --> 0:38:41.959
<v Speaker 2>That is a really good question. I think you nailed

0:38:41.960 --> 0:38:44.200
<v Speaker 2>it when you said just don't care. In the same

0:38:44.239 --> 0:38:47.600
<v Speaker 2>way that it's really hard to teach a child something

0:38:47.840 --> 0:38:50.440
<v Speaker 2>that they don't care about, it's not relevant to them.

0:38:50.800 --> 0:38:54.040
<v Speaker 2>I think it will be hard to establish representational alignment

0:38:54.080 --> 0:38:57.400
<v Speaker 2>with anybody if they don't care. But in principle, I

0:38:57.440 --> 0:39:01.279
<v Speaker 2>think it's possible, given that there is ant It might

0:39:01.360 --> 0:39:05.960
<v Speaker 2>just be a matter of finding the right analogies. And

0:39:06.000 --> 0:39:08.759
<v Speaker 2>again this goes to your work. There's so much plasticity

0:39:08.840 --> 0:39:09.400
<v Speaker 2>in the brain.

0:39:10.239 --> 0:39:10.720
<v Speaker 3>I think.

0:39:10.840 --> 0:39:12.920
<v Speaker 2>Correct me if I'm wrong, But in principle, there's no

0:39:13.120 --> 0:39:17.719
<v Speaker 2>limit to what we could learn to not only understand,

0:39:17.800 --> 0:39:21.759
<v Speaker 2>but even have a phenomenological experience of if there is

0:39:22.000 --> 0:39:23.520
<v Speaker 2>structure to that information.

0:39:24.200 --> 0:39:27.520
<v Speaker 1>Yes, but all of brain plasticity is driven by relevance.

0:39:27.560 --> 0:39:29.120
<v Speaker 1>In other words, do I care about it? I can't

0:39:29.120 --> 0:39:31.560
<v Speaker 1>think right. So if my AI agent comes to me

0:39:31.600 --> 0:39:33.680
<v Speaker 1>and says, look, I just realized this great thing about

0:39:33.680 --> 0:39:35.520
<v Speaker 1>how you could redesign this computer chip in this way,

0:39:35.560 --> 0:39:38.000
<v Speaker 1>and maybe it starts telling me all this detailed stuff

0:39:38.040 --> 0:39:41.279
<v Speaker 1>and I just don't care. It's not achieving representational alignments.

0:39:41.360 --> 0:39:44.479
<v Speaker 2>That's true, And maybe in some cases, Okay, some people

0:39:44.520 --> 0:39:45.600
<v Speaker 2>don't care, other people do care.

0:39:45.680 --> 0:39:46.640
<v Speaker 3>Go to the people who do care.

0:39:46.840 --> 0:39:49.120
<v Speaker 2>But also I guess this also begs the question why

0:39:49.160 --> 0:39:51.799
<v Speaker 2>do some people care about things? Because it's relevant? So

0:39:51.920 --> 0:39:56.520
<v Speaker 2>make it relevant. Tell the person as an agent, lead

0:39:56.600 --> 0:39:59.319
<v Speaker 2>the person to the insights that they need to have

0:39:59.680 --> 0:40:03.040
<v Speaker 2>to about something. One of the things that is associated

0:40:03.080 --> 0:40:06.200
<v Speaker 2>with becoming expert in something is that you really like

0:40:06.400 --> 0:40:08.759
<v Speaker 2>the things that you become expert in because it is

0:40:08.840 --> 0:40:12.480
<v Speaker 2>satisfying to understand, and the more bits and pieces of

0:40:12.480 --> 0:40:16.400
<v Speaker 2>information that you can integrate into a holistic web of knowledge,

0:40:16.440 --> 0:40:17.520
<v Speaker 2>the better it feels.

0:40:17.920 --> 0:40:21.440
<v Speaker 3>So I think that this is my vision for a

0:40:21.600 --> 0:40:24.920
<v Speaker 3>very happy future, is that.

0:40:25.160 --> 0:40:27.880
<v Speaker 2>We're all learning all the time, and the more we

0:40:27.960 --> 0:40:31.600
<v Speaker 2>learn and the more sort of accurate our shared world

0:40:31.680 --> 0:40:35.880
<v Speaker 2>models map onto reality, the more satisfying it will be

0:40:36.080 --> 0:40:44.360
<v Speaker 2>to continue to learn.

0:40:51.680 --> 0:40:54.799
<v Speaker 1>I totally agree, and I love thinking about this from

0:40:54.840 --> 0:40:57.279
<v Speaker 1>the point of view of education, because if you think

0:40:57.320 --> 0:41:01.120
<v Speaker 1>of this sphere of humankind's knowledge, we know more than

0:41:01.120 --> 0:41:03.160
<v Speaker 1>any of us could possibly learn in a lifetime. Yeah,

0:41:03.200 --> 0:41:04.920
<v Speaker 1>So the key is to find out what are the

0:41:05.080 --> 0:41:09.000
<v Speaker 1>doors on the outside of this sphere that you love

0:41:09.120 --> 0:41:11.640
<v Speaker 1>or that I love? Given our totally different backgrounds and whatever.

0:41:11.719 --> 0:41:15.160
<v Speaker 1>Certain things really fascinate me and other things I wish

0:41:15.200 --> 0:41:17.680
<v Speaker 1>but they don't. Okay. So if I can enter this

0:41:17.760 --> 0:41:19.719
<v Speaker 1>door and you enter this other door and we end

0:41:19.800 --> 0:41:22.959
<v Speaker 1>up learning the same kind of stuff, it's really great. Yeah.

0:41:23.000 --> 0:41:26.439
<v Speaker 1>And Isaac Asimov actually really cared about this topic way

0:41:26.480 --> 0:41:30.400
<v Speaker 1>back in the day, and he did an interview on LERR,

0:41:30.440 --> 0:41:32.680
<v Speaker 1>which was this PBS talk show thing way back in

0:41:32.719 --> 0:41:36.560
<v Speaker 1>the eighties, and he envisioned this was before the Internet,

0:41:36.640 --> 0:41:38.440
<v Speaker 1>and he said, I envisioned a day where there's a

0:41:38.960 --> 0:41:42.520
<v Speaker 1>huge supercomputer that has all of human kinds of knowledge,

0:41:42.719 --> 0:41:45.040
<v Speaker 1>and everyone has a cable running from this computer to

0:41:45.120 --> 0:41:48.640
<v Speaker 1>their home and you can ask the computer any question

0:41:48.680 --> 0:41:51.080
<v Speaker 1>you want. But his point was you could take your

0:41:51.120 --> 0:41:55.239
<v Speaker 1>own inroad into this sphere of knowledge. Okay. So your

0:41:55.239 --> 0:42:00.319
<v Speaker 1>point is if these AI agents have some theory about

0:42:00.360 --> 0:42:02.920
<v Speaker 1>our minds, as in your mind, in my mind, everybody

0:42:03.000 --> 0:42:06.000
<v Speaker 1>is an individual, and then can cast things in a

0:42:06.000 --> 0:42:08.680
<v Speaker 1>certain way like look, here's a way that you might

0:42:08.760 --> 0:42:12.080
<v Speaker 1>care about it, then we're all going to learn faster

0:42:12.160 --> 0:42:12.560
<v Speaker 1>and better.

0:42:12.840 --> 0:42:13.240
<v Speaker 3>Yeah.

0:42:13.520 --> 0:42:15.839
<v Speaker 2>Yeah, And I also think that this is not just

0:42:16.200 --> 0:42:18.080
<v Speaker 2>you know, academic stuff, intellectual stuff.

0:42:18.080 --> 0:42:19.640
<v Speaker 3>Anything can be interesting.

0:42:20.200 --> 0:42:23.799
<v Speaker 2>You know, people in their knitting communities can become really

0:42:23.840 --> 0:42:27.560
<v Speaker 2>passionate about innovating new ways and in sharing that.

0:42:27.520 --> 0:42:28.320
<v Speaker 3>Within a community.

0:42:28.440 --> 0:42:32.000
<v Speaker 2>And I think that this also gets to one of

0:42:32.040 --> 0:42:36.200
<v Speaker 2>the sort of essences of what we care about as humans.

0:42:36.320 --> 0:42:40.759
<v Speaker 2>It's this idea of optimal distinctiveness. So we simultaneously need

0:42:40.840 --> 0:42:42.880
<v Speaker 2>communities for belonging.

0:42:42.960 --> 0:42:44.480
<v Speaker 3>This is an evolutionarily ancient thing.

0:42:44.520 --> 0:42:46.200
<v Speaker 2>It's not just humans, but we need to feel like

0:42:46.200 --> 0:42:47.600
<v Speaker 2>we belong within a community.

0:42:47.960 --> 0:42:49.680
<v Speaker 3>We've got our in group, we've got our tribe.

0:42:49.719 --> 0:42:52.480
<v Speaker 2>Not always bad, and maybe it's just something that we

0:42:52.800 --> 0:42:55.400
<v Speaker 2>will always need as being human, but we also need

0:42:55.440 --> 0:42:58.520
<v Speaker 2>to feel like we're unique and that we have something

0:42:58.560 --> 0:43:03.279
<v Speaker 2>to contribute to the community. We are optimally distinct in

0:43:03.360 --> 0:43:08.160
<v Speaker 2>our contributions. So I imagine a future where agents with

0:43:08.320 --> 0:43:13.600
<v Speaker 2>models of our minds really allow us to be diverse.

0:43:13.920 --> 0:43:18.879
<v Speaker 2>They're not flattening the experiences or the capabilities, but they're

0:43:19.040 --> 0:43:25.000
<v Speaker 2>encouraging diversity and variability. But they're also building mechanisms for

0:43:25.200 --> 0:43:29.239
<v Speaker 2>us to align our minds. They're building bridges throughout the

0:43:29.640 --> 0:43:32.920
<v Speaker 2>various experiences and capabilities so great.

0:43:32.920 --> 0:43:34.560
<v Speaker 1>So this leads me to this question that I've been

0:43:34.560 --> 0:43:37.560
<v Speaker 1>wondering about which is. You know, in political debates that

0:43:37.600 --> 0:43:42.120
<v Speaker 1>we see lots nowadays, there isn't a representational alignment. If

0:43:42.160 --> 0:43:44.239
<v Speaker 1>you know someone's on the left and someone's on the right,

0:43:44.320 --> 0:43:46.520
<v Speaker 1>they end up saying we're not going to have Thanksgiving

0:43:46.520 --> 0:43:49.160
<v Speaker 1>dinner together instead of saying, oh, let me understand. How

0:43:49.160 --> 0:43:53.200
<v Speaker 1>can I understand? So where does representational alignment break down?

0:43:54.040 --> 0:43:56.440
<v Speaker 1>And might AI help us there someday?

0:43:57.400 --> 0:44:03.359
<v Speaker 2>I think representational alignment breaks down when two minds have

0:44:04.120 --> 0:44:11.840
<v Speaker 2>such different starting points, such different sets of maybe analogies, experiences,

0:44:11.880 --> 0:44:13.480
<v Speaker 2>and also motivations, things.

0:44:13.280 --> 0:44:15.880
<v Speaker 3>That they care about, that it's just really hard.

0:44:16.160 --> 0:44:19.640
<v Speaker 2>And let's assume that they can get past the initial

0:44:19.680 --> 0:44:24.480
<v Speaker 2>emotional friction of just knowing that they come from different groups.

0:44:24.280 --> 0:44:25.360
<v Speaker 3>That's not trivial.

0:44:25.680 --> 0:44:28.360
<v Speaker 2>Sometimes just knowing that somebody is from a different group

0:44:28.680 --> 0:44:31.960
<v Speaker 2>prevents any sort of establishing of common ground. But assuming

0:44:32.040 --> 0:44:35.640
<v Speaker 2>that you can get over that, then you just might

0:44:35.680 --> 0:44:40.200
<v Speaker 2>not have enough overlap in your representations. I suspect that

0:44:40.200 --> 0:44:44.440
<v Speaker 2>that's very rare. Just by nature of being a human

0:44:45.000 --> 0:44:47.600
<v Speaker 2>embodied in the way that we are having experiences in

0:44:47.640 --> 0:44:49.800
<v Speaker 2>the world and the way that we do, there's gonna

0:44:49.840 --> 0:44:54.919
<v Speaker 2>be enough overlap as an entry point into being able

0:44:54.920 --> 0:44:56.960
<v Speaker 2>to establish some kind of alignment.

0:44:57.080 --> 0:44:59.040
<v Speaker 1>So sorry, you're saying it's rare that people would have

0:44:59.040 --> 0:45:01.680
<v Speaker 1>such different pints view that can't get there, because it

0:45:01.800 --> 0:45:04.040
<v Speaker 1>is the case that people don't get there or won't

0:45:04.080 --> 0:45:07.359
<v Speaker 1>get there. But you're saying if people tried harder, let's say,

0:45:07.360 --> 0:45:08.839
<v Speaker 1>with political arguments. Yes.

0:45:09.280 --> 0:45:14.319
<v Speaker 2>I also think that as we interact with this hypothetical

0:45:14.960 --> 0:45:17.400
<v Speaker 2>agent of the future that has models of our minds,

0:45:17.480 --> 0:45:22.359
<v Speaker 2>it will be modeling for us behavior that helps us

0:45:22.480 --> 0:45:26.279
<v Speaker 2>establish alignment. And so in the same way that we

0:45:27.200 --> 0:45:29.799
<v Speaker 2>learn from others who model good behavior and then we

0:45:29.840 --> 0:45:33.400
<v Speaker 2>start to reflect that subconsciously, I think it could nudge

0:45:33.520 --> 0:45:37.000
<v Speaker 2>us into more pro social interaction.

0:45:37.200 --> 0:45:39.400
<v Speaker 1>I totally agree with this. I did a podcast episode

0:45:39.400 --> 0:45:43.960
<v Speaker 1>on this about this AI research group in Europe that

0:45:44.120 --> 0:45:49.120
<v Speaker 1>released some chat bots onto a Reddit channel that does debate,

0:45:49.400 --> 0:45:51.440
<v Speaker 1>and they didn't tell anybody that these were aibots, so

0:45:51.440 --> 0:45:53.040
<v Speaker 1>they got in big trouble. Everyone was mad about it.

0:45:53.080 --> 0:45:55.920
<v Speaker 1>But what happened was these AI bots would come in

0:45:56.040 --> 0:45:57.960
<v Speaker 1>and take someone who had a particular point of view,

0:45:57.960 --> 0:45:59.200
<v Speaker 1>and they would take the other point of view and

0:45:59.200 --> 0:46:02.359
<v Speaker 1>they would discuss, and on this particular channel, you get

0:46:02.520 --> 0:46:07.080
<v Speaker 1>points if you successfully convince somebody of your point of view.

0:46:07.120 --> 0:46:09.560
<v Speaker 1>And so it turns out these bots did six times

0:46:09.560 --> 0:46:12.800
<v Speaker 1>better than humans do on average in terms of changing

0:46:12.800 --> 0:46:15.839
<v Speaker 1>the other person's mind. So everyone freaked out about this

0:46:15.960 --> 0:46:18.359
<v Speaker 1>and said, oh my god, these AI debate bots can

0:46:18.960 --> 0:46:21.880
<v Speaker 1>manipulate us. And but it turns out the really amazing

0:46:21.920 --> 0:46:24.719
<v Speaker 1>part is they weren't doing anything manipulative. They weren't lying,

0:46:24.760 --> 0:46:28.120
<v Speaker 1>they weren't doing anything. They were just better debaters in

0:46:28.160 --> 0:46:31.640
<v Speaker 1>the sense that they were empathic, they were calm, they

0:46:31.640 --> 0:46:35.000
<v Speaker 1>presented their arguments well, And I thought, God, we can

0:46:35.080 --> 0:46:38.239
<v Speaker 1>really learn from that if we teach our children to

0:46:38.880 --> 0:46:40.560
<v Speaker 1>be better discussings.

0:46:41.000 --> 0:46:41.400
<v Speaker 3>Totally.

0:46:41.600 --> 0:46:44.799
<v Speaker 1>Yeah, So I love that. I love that point. But

0:46:44.880 --> 0:46:47.440
<v Speaker 1>what you're saying is, so you think people with differentints

0:46:47.440 --> 0:46:50.440
<v Speaker 1>of view can do representational alignment. But that's a separate issue,

0:46:50.480 --> 0:46:53.279
<v Speaker 1>which is that how do we do things culturally and

0:46:53.360 --> 0:46:55.600
<v Speaker 1>teaching them to be better at conversation.

0:46:56.000 --> 0:46:59.800
<v Speaker 2>Yeah, and even having an awareness that other people are different.

0:47:00.040 --> 0:47:00.840
<v Speaker 3>It's not personal.

0:47:01.600 --> 0:47:04.640
<v Speaker 2>If you disagree or if you have different rituals, different

0:47:04.640 --> 0:47:06.879
<v Speaker 2>ways of doing things, that's fine. So I think even

0:47:06.960 --> 0:47:11.400
<v Speaker 2>just exposure to variability diversity can get us part of

0:47:11.400 --> 0:47:11.879
<v Speaker 2>the way there.

0:47:12.160 --> 0:47:15.000
<v Speaker 1>Oh, that's fascinating. Cool. The other thing I was going

0:47:15.040 --> 0:47:17.280
<v Speaker 1>to ask you about is you mentioned earlier just tangentially

0:47:17.320 --> 0:47:18.640
<v Speaker 1>said something about archaeology.

0:47:19.120 --> 0:47:23.920
<v Speaker 2>Ah, yes, okay, So there over the past two decades,

0:47:23.960 --> 0:47:30.440
<v Speaker 2>there's been an update in our understanding of the evolution

0:47:30.680 --> 0:47:34.200
<v Speaker 2>of the types of sophisticated reasoning and thinking symbolic thinking

0:47:34.200 --> 0:47:35.239
<v Speaker 2>capabilities that humans have.

0:47:35.320 --> 0:47:36.400
<v Speaker 3>We used to think.

0:47:36.560 --> 0:47:40.759
<v Speaker 2>That they emerged suddenly in what we call the cognitive

0:47:40.800 --> 0:47:43.960
<v Speaker 2>revolution that happened thirty to forty thousand years ago, and

0:47:43.960 --> 0:47:44.879
<v Speaker 2>that's because.

0:47:44.640 --> 0:47:48.040
<v Speaker 1>The so humans were like other primates and then suddenly,

0:47:48.040 --> 0:47:49.879
<v Speaker 1>thirty four thousand years ago something happened.

0:47:49.760 --> 0:47:54.480
<v Speaker 2>Okay, yeah, and archaeologists were looking at evidence in Europe

0:47:54.640 --> 0:47:57.239
<v Speaker 2>and the caves and you know, the art and things

0:47:57.239 --> 0:47:58.920
<v Speaker 2>like that, and it just it seemed like there was

0:47:58.960 --> 0:48:05.960
<v Speaker 2>a discontinuity. But then they started exploring throughout Africa and

0:48:06.440 --> 0:48:09.919
<v Speaker 2>using more nuanced methods, and a lot more of them

0:48:10.000 --> 0:48:14.279
<v Speaker 2>started doing this, and they started finding evidence from as

0:48:14.360 --> 0:48:17.879
<v Speaker 2>early as three hundred thousand years ago that we were

0:48:18.160 --> 0:48:22.759
<v Speaker 2>cognitively modern. But what seems to be the important thing

0:48:23.080 --> 0:48:28.040
<v Speaker 2>was population density and contact with other groups. So the

0:48:28.120 --> 0:48:34.560
<v Speaker 2>idea is that our sophisticated cognitive capabilities are latent until

0:48:34.640 --> 0:48:37.719
<v Speaker 2>we come into contact with each other, until we poke

0:48:37.760 --> 0:48:42.880
<v Speaker 2>each other's brains. And you see that this evidence ebbs

0:48:42.880 --> 0:48:45.920
<v Speaker 2>and flows, It appears and disappears as a function of

0:48:46.000 --> 0:48:47.880
<v Speaker 2>these group donaities.

0:48:48.080 --> 0:48:52.680
<v Speaker 1>Yeah, oh fascinating. Okay, So when humans come into contact

0:48:52.680 --> 0:48:54.880
<v Speaker 1>with other humans, but not just their own tribe, presumably

0:48:54.920 --> 0:48:58.160
<v Speaker 1>other tribes, bigger and bigger civilizations, slightly.

0:48:57.920 --> 0:49:00.360
<v Speaker 3>Different ways of making their tools and their weapons and

0:49:00.400 --> 0:49:01.160
<v Speaker 3>their jewelry.

0:49:01.400 --> 0:49:05.799
<v Speaker 1>Yes, oh excellent, Oh that's beautiful. So, by the way,

0:49:05.840 --> 0:49:09.840
<v Speaker 1>is it thought that there was some discontinuity where that

0:49:10.440 --> 0:49:13.399
<v Speaker 1>became possible. In other words, if you stick a bunch

0:49:13.440 --> 0:49:15.279
<v Speaker 1>of capuchin monkeys together.

0:49:15.200 --> 0:49:17.680
<v Speaker 3>They will never they'll never get there, right.

0:49:17.680 --> 0:49:20.640
<v Speaker 1>Right, So there's so there's something different.

0:49:20.840 --> 0:49:23.319
<v Speaker 2>And I mean, I think it's this stability of our

0:49:23.800 --> 0:49:26.040
<v Speaker 2>the models of our minds, and I think it's this

0:49:26.200 --> 0:49:29.720
<v Speaker 2>communicative drive. But you need a critical density of people

0:49:29.960 --> 0:49:31.560
<v Speaker 2>and you also need the variability.

0:49:31.840 --> 0:49:34.920
<v Speaker 1>Okay, so let's just summarize it. So it's having theory

0:49:34.920 --> 0:49:37.200
<v Speaker 1>of mind in other words, knowing Okay, Danielle has her

0:49:37.239 --> 0:49:39.920
<v Speaker 1>own thoughts, her own representations in there, and then you

0:49:39.960 --> 0:49:41.880
<v Speaker 1>add that to the density of people.

0:49:42.480 --> 0:49:44.920
<v Speaker 3>And the variability of different groups coming together ability.

0:49:45.080 --> 0:49:45.720
<v Speaker 1>That's excellent.

0:49:45.920 --> 0:49:46.440
<v Speaker 3>Yeah.

0:49:46.520 --> 0:49:52.200
<v Speaker 2>So the takeaway from this archaeological evidence is that becoming

0:49:52.320 --> 0:49:56.719
<v Speaker 2>cognitively modern was this slow, gradual process over the course

0:49:56.719 --> 0:49:58.360
<v Speaker 2>of the last couple hundred thousand years.

0:49:58.360 --> 0:50:01.320
<v Speaker 1>But it was predicated on pop density yeah, and people

0:50:01.320 --> 0:50:01.920
<v Speaker 1>coming together.

0:50:02.120 --> 0:50:02.359
<v Speaker 2>Yeah.

0:50:02.719 --> 0:50:06.759
<v Speaker 1>Oh excellent. Oh I love that. That's so interesting, And it.

0:50:06.719 --> 0:50:10.280
<v Speaker 2>Goes along with this idea that we're not just biologically evolving,

0:50:10.320 --> 0:50:15.080
<v Speaker 2>we're culturally evolving. And cultural evolution does a lot of

0:50:15.080 --> 0:50:18.320
<v Speaker 2>the work in explaining human behavior.

0:50:19.120 --> 0:50:22.759
<v Speaker 1>Yeah, and biological evolution, of course, is super slow, but

0:50:22.800 --> 0:50:24.560
<v Speaker 1>cultural evolution is so rapid.

0:50:24.840 --> 0:50:25.120
<v Speaker 3>Yes.

0:50:25.160 --> 0:50:27.920
<v Speaker 2>And actually I think that agents that have models of

0:50:27.960 --> 0:50:30.520
<v Speaker 2>our minds can help reconcile some of the tensions that

0:50:30.560 --> 0:50:34.799
<v Speaker 2>we're seeing because cultural evolution is outpacing biological evolution. So

0:50:36.080 --> 0:50:41.160
<v Speaker 2>what happens when you've succeeded in society, you've done well

0:50:41.160 --> 0:50:44.480
<v Speaker 2>in education, you go to the workplace and you're supposed to,

0:50:44.719 --> 0:50:49.000
<v Speaker 2>you know, contribute your intelligence. You end up staring at

0:50:49.080 --> 0:50:52.879
<v Speaker 2>a screen for most of the day and not interacting

0:50:52.920 --> 0:50:58.160
<v Speaker 2>with other people. The infrastructure actually doesn't really support unlocking

0:50:58.200 --> 0:51:01.320
<v Speaker 2>our potential because of all of the sort of arbitrary

0:51:01.360 --> 0:51:05.000
<v Speaker 2>things that have happened culturally. We've built these incredible devices

0:51:05.719 --> 0:51:08.600
<v Speaker 2>and we've co evolved with them, and now they're extensions

0:51:08.640 --> 0:51:11.440
<v Speaker 2>of our intelligence. But they're also we're also conforming to

0:51:11.600 --> 0:51:13.520
<v Speaker 2>them rather than the other way around.

0:51:13.560 --> 0:51:16.400
<v Speaker 1>True, But they are social, as in social media and

0:51:16.440 --> 0:51:18.640
<v Speaker 1>so on. I mean, when I'm staring at my screen,

0:51:18.680 --> 0:51:21.399
<v Speaker 1>I'm interacting with thousands of people in various ways, whether

0:51:21.440 --> 0:51:25.120
<v Speaker 1>I'm looking at extra Instagram or I'm doing emails. In

0:51:25.120 --> 0:51:27.799
<v Speaker 1>a sense, it's more social than humans ever could have been.

0:51:28.080 --> 0:51:31.160
<v Speaker 2>What this is the problem when the algorithms are not

0:51:31.320 --> 0:51:35.279
<v Speaker 2>aligned with our well being, with our potential. So I

0:51:35.400 --> 0:51:38.200
<v Speaker 2>see there are so many mistakes that we've made over

0:51:38.200 --> 0:51:41.360
<v Speaker 2>the past ten fifteen years that we can learn from

0:51:41.400 --> 0:51:45.240
<v Speaker 2>and hopefully not repeat with more capable AI.

0:51:45.560 --> 0:51:47.880
<v Speaker 1>But out of curiosity, if I'm on X and I

0:51:47.960 --> 0:51:49.799
<v Speaker 1>see that there are different points of view about this

0:51:49.840 --> 0:51:52.359
<v Speaker 1>political thing that I happen to care about, then I'm

0:51:52.400 --> 0:51:54.200
<v Speaker 1>getting exposed to lots of points of view.

0:51:54.280 --> 0:51:54.400
<v Speaker 2>Right.

0:51:54.440 --> 0:51:57.240
<v Speaker 3>Well, I'm not saying it's all bad. Certainly, Yes, I think.

0:51:57.080 --> 0:52:01.200
<v Speaker 2>That having online communities is absolutely a step in the

0:52:01.280 --> 0:52:05.439
<v Speaker 2>right direction for connecting us. It's fantastic but the way

0:52:05.480 --> 0:52:08.800
<v Speaker 2>that they are optimized and the attention economy not serving

0:52:08.880 --> 0:52:12.080
<v Speaker 2>us but serving advertisers is a problem.

0:52:12.480 --> 0:52:15.319
<v Speaker 1>But what would you change about let's say something like X.

0:52:15.719 --> 0:52:18.319
<v Speaker 1>What would you change about social media to make it

0:52:18.360 --> 0:52:19.719
<v Speaker 1>so it's more optimized?

0:52:20.480 --> 0:52:22.360
<v Speaker 3>Well, I think optimization is the problem.

0:52:23.160 --> 0:52:26.080
<v Speaker 1>So sorry, I met more optimized for a communicative drive.

0:52:26.640 --> 0:52:30.160
<v Speaker 2>I wouldn't think of it that way because I think

0:52:30.160 --> 0:52:33.239
<v Speaker 2>that you have the agents, but then you also have

0:52:33.440 --> 0:52:38.080
<v Speaker 2>how they are dynamically interacting with each other. And X

0:52:38.400 --> 0:52:41.920
<v Speaker 2>is or any social media platform, is one narrow way

0:52:42.080 --> 0:52:45.480
<v Speaker 2>of facilitating interactions. In the case of X, it's very

0:52:45.640 --> 0:52:50.080
<v Speaker 2>short form blurbs, and it appeals to the fact of

0:52:50.160 --> 0:52:54.000
<v Speaker 2>human nature that we are more interested in things that

0:52:54.040 --> 0:52:57.920
<v Speaker 2>have shock value and things that are negative or disgusting,

0:52:58.160 --> 0:53:02.040
<v Speaker 2>and how do you work against that. It's not about

0:53:02.080 --> 0:53:06.640
<v Speaker 2>informational exchange, although some people lean more in terms of

0:53:06.640 --> 0:53:08.839
<v Speaker 2>caring about that, and so you do see some of that.

0:53:09.239 --> 0:53:13.959
<v Speaker 1>Yeah, so that's interesting. So we are in the sense

0:53:14.000 --> 0:53:18.120
<v Speaker 1>of population density for three hundred thousand year old tribes.

0:53:18.320 --> 0:53:20.400
<v Speaker 1>It is the case that you see on X all

0:53:20.440 --> 0:53:22.960
<v Speaker 1>kinds of points of view that you didn't know existed.

0:53:23.000 --> 0:53:23.960
<v Speaker 1>And well, I don't know if you do.

0:53:24.040 --> 0:53:26.640
<v Speaker 2>Because you're in your echo chamber, you typically tend to

0:53:26.680 --> 0:53:28.520
<v Speaker 2>not be exposed to a lot of variability.

0:53:28.840 --> 0:53:31.920
<v Speaker 1>Well, you know what's interesting, what you're exposed to is

0:53:31.960 --> 0:53:34.160
<v Speaker 1>the most extreme views of the other side because people

0:53:34.200 --> 0:53:36.840
<v Speaker 1>in your echo chamber say, look at what this idiot

0:53:36.800 --> 0:53:39.560
<v Speaker 1>is on the other side of the aisle said.

0:53:39.360 --> 0:53:41.000
<v Speaker 3>And the polarize it.

0:53:41.040 --> 0:53:44.359
<v Speaker 1>Yeah exactly, Yeah, okay, So wrapping up for today, the

0:53:44.480 --> 0:53:49.799
<v Speaker 1>key thing is that intelligence is not just about what's

0:53:49.800 --> 0:53:53.360
<v Speaker 1>happening in an individual brain, but it's social.

0:53:53.920 --> 0:53:58.560
<v Speaker 2>Yes, this is true about all intelligences. All intelligences emerge

0:53:58.600 --> 0:54:01.959
<v Speaker 2>as a function of their environments and interacting with their

0:54:02.120 --> 0:54:06.640
<v Speaker 2>environments the problems within the environments that the organisms have

0:54:06.680 --> 0:54:10.799
<v Speaker 2>to solve. The most challenging problems in humans environments are

0:54:11.239 --> 0:54:13.920
<v Speaker 2>understanding other humans because we have to figure out how

0:54:13.960 --> 0:54:16.040
<v Speaker 2>to cooperate, We have to figure out.

0:54:15.880 --> 0:54:17.600
<v Speaker 3>How to align our minds.

0:54:18.040 --> 0:54:22.279
<v Speaker 2>So our intelligence emerges from our interactions with each other,

0:54:22.320 --> 0:54:26.800
<v Speaker 2>and we continually ratchet up our intelligence by co evolving

0:54:26.840 --> 0:54:27.439
<v Speaker 2>with each other.

0:54:32.040 --> 0:54:35.000
<v Speaker 1>That was my conversation with Danielle Persik. One of the

0:54:35.000 --> 0:54:37.759
<v Speaker 1>main threads was that maybe we shouldn't be thinking of

0:54:37.800 --> 0:54:42.400
<v Speaker 1>intelligence as something that's packaged up inside a single head.

0:54:42.760 --> 0:54:45.239
<v Speaker 1>Another way to look at it is as something that

0:54:45.440 --> 0:54:50.480
<v Speaker 1>emerges through interaction, through the friction, through the shared effort

0:54:50.640 --> 0:54:55.840
<v Speaker 1>of understanding another mind. Human intelligence has always been shaped

0:54:55.880 --> 0:54:58.640
<v Speaker 1>by this social dimension. You can see this from the

0:54:58.640 --> 0:55:01.239
<v Speaker 1>way that infants learn about the world to the way

0:55:01.280 --> 0:55:07.400
<v Speaker 1>that societies build knowledge over generations. What today's conversation invites

0:55:07.520 --> 0:55:11.959
<v Speaker 1>us to reconsider is the idea that learning and understanding,

0:55:12.280 --> 0:55:18.600
<v Speaker 1>and fundamentally alignment are really the central features of intelligence.

0:55:19.040 --> 0:55:23.560
<v Speaker 1>Our ability to model other minds, to recognize that other

0:55:23.640 --> 0:55:27.239
<v Speaker 1>people see the world differently, that they know different things,

0:55:27.239 --> 0:55:30.840
<v Speaker 1>that they care about different things. This is what allows

0:55:31.360 --> 0:55:38.120
<v Speaker 1>cooperation and culture and cumulative progress. Through this lens, intelligence

0:55:38.200 --> 0:55:43.040
<v Speaker 1>is about negotiating meaning. Now, if we take on that lens,

0:55:43.560 --> 0:55:47.000
<v Speaker 1>the future of AI looks very different from what most

0:55:47.000 --> 0:55:51.200
<v Speaker 1>people are thinking about now, because this shifts the conversation

0:55:51.320 --> 0:55:55.480
<v Speaker 1>away from only asking how capable AI systems are going

0:55:55.520 --> 0:56:00.000
<v Speaker 1>to be. Now we're pushed to ask how they participate

0:56:00.400 --> 0:56:06.080
<v Speaker 1>in our cognitive ecosystems. In other words, in Danielle's view,

0:56:06.640 --> 0:56:10.160
<v Speaker 1>how can we develop agents who help us think better

0:56:10.680 --> 0:56:17.400
<v Speaker 1>by reducing friction and clarifying misunderstandings between people and supporting

0:56:17.520 --> 0:56:22.200
<v Speaker 1>learning at the right level of abstraction, instead of merely

0:56:22.600 --> 0:56:26.880
<v Speaker 1>replacing us, which is the doomsayers version of the future.

0:56:27.480 --> 0:56:35.040
<v Speaker 1>Could AI agents serve as mediators and translators and collaborators.

0:56:35.840 --> 0:56:39.520
<v Speaker 1>And there's another issue I found fascinating. Alignment is something

0:56:39.600 --> 0:56:43.759
<v Speaker 1>humans have always learned through interactions. So perhaps instead of

0:56:44.200 --> 0:56:48.000
<v Speaker 1>just viewing AI alignment as a technical problem to be solved,

0:56:48.040 --> 0:56:51.280
<v Speaker 1>we could see it as a behavior to be modeled.

0:56:51.760 --> 0:56:55.879
<v Speaker 1>Systems that reflect our values back to us might teach

0:56:55.960 --> 0:56:59.839
<v Speaker 1>us how to communicate more effectively with one another, even

0:57:00.040 --> 0:57:03.520
<v Speaker 1>in moments of disagreement. If you're a regular listener to

0:57:03.520 --> 0:57:06.800
<v Speaker 1>this podcast, you know that I'm obsessed with issues about

0:57:07.200 --> 0:57:11.560
<v Speaker 1>the brain and polarization, and so this possibility that AI

0:57:11.960 --> 0:57:15.919
<v Speaker 1>might actually be able to mediate between us and help

0:57:16.000 --> 0:57:23.200
<v Speaker 1>us get our curiosity back that feels especially consequential. So

0:57:23.520 --> 0:57:27.320
<v Speaker 1>as AI agents become more embedded in our daily life,

0:57:27.400 --> 0:57:31.120
<v Speaker 1>the choices that we make now about their design are

0:57:31.200 --> 0:57:34.640
<v Speaker 1>going to shape how we relate to them and to

0:57:34.800 --> 0:57:38.240
<v Speaker 1>our fellow humans. The question is whether they will help

0:57:38.360 --> 0:57:43.600
<v Speaker 1>intelligence continue to sprout in new directions. In other words,

0:57:44.080 --> 0:57:49.560
<v Speaker 1>maybe will be even more intelligent as a species thanks

0:57:49.720 --> 0:57:58.600
<v Speaker 1>to our machines. Go to eagleman dot com slash podcast

0:57:58.640 --> 0:58:01.920
<v Speaker 1>for more information and to find further reading. Join the

0:58:01.920 --> 0:58:05.360
<v Speaker 1>weekly discussions on my substack, and check out and subscribe

0:58:05.400 --> 0:58:08.720
<v Speaker 1>to Inner Cosmos on YouTube for videos of each episode

0:58:08.800 --> 0:58:14.160
<v Speaker 1>and to leave comments. Until next time, I'm David Eagleman,

0:58:14.320 --> 0:58:16.080
<v Speaker 1>and this is Inner Cosmos.