WEBVTT - Ep78 "Does your brain have one model of the world or thousands?"

0:00:05.320 --> 0:00:09.399
<v Speaker 1>What is special about the wrinkly outer layer of the brain,

0:00:09.480 --> 0:00:11.640
<v Speaker 1>the cortex, And what does this have to do with

0:00:11.680 --> 0:00:15.720
<v Speaker 1>the way that you come to explore and understand the world.

0:00:16.079 --> 0:00:17.520
<v Speaker 2>And by the way, why do you.

0:00:17.480 --> 0:00:20.760
<v Speaker 1>See a whole image when you open your eyes even though.

0:00:20.680 --> 0:00:22.799
<v Speaker 2>Each part of your visual.

0:00:22.480 --> 0:00:26.720
<v Speaker 1>Cortex has access to only a tiny bit of the image.

0:00:27.280 --> 0:00:30.040
<v Speaker 1>And for that matter, the brain is divided into different

0:00:30.080 --> 0:00:32.639
<v Speaker 1>areas for sight and sound and touch and so on.

0:00:33.159 --> 0:00:36.559
<v Speaker 1>And so why when you're petting a cat, why does

0:00:36.560 --> 0:00:40.040
<v Speaker 1>the cat seem unified? Why doesn't the site of the

0:00:40.080 --> 0:00:44.159
<v Speaker 1>cat seem separate from the purring and the feel of

0:00:44.240 --> 0:00:47.640
<v Speaker 1>the fur. Can we build a new model of how

0:00:47.640 --> 0:00:50.720
<v Speaker 1>the brain works and in what ways is what the

0:00:50.760 --> 0:00:55.640
<v Speaker 1>brain doing something very different than what's happening in current AI.

0:00:59.040 --> 0:01:02.680
<v Speaker 1>Welcome to Intercouse with me David Eagleman. I'm a neuroscientist

0:01:02.680 --> 0:01:07.080
<v Speaker 1>at Stanford and in these episodes we sail deeply into

0:01:07.080 --> 0:01:10.040
<v Speaker 1>our three pound universe to understand.

0:01:09.600 --> 0:01:12.280
<v Speaker 2>Why and how our lives look the way they do.

0:01:24.040 --> 0:01:27.720
<v Speaker 1>Today's episode is about a new model of the brain

0:01:28.240 --> 0:01:31.160
<v Speaker 1>developed by my friend and colleague, Jeff Hawkins, and we'll

0:01:31.160 --> 0:01:33.920
<v Speaker 1>get into an interview with him shortly but let me

0:01:33.959 --> 0:01:38.200
<v Speaker 1>preface by saying that for centuries people have stared at

0:01:38.200 --> 0:01:41.240
<v Speaker 1>the brain and tried to figure out how this thing works.

0:01:41.280 --> 0:01:44.360
<v Speaker 1>Because when you stare at it, it's just a huge

0:01:44.840 --> 0:01:45.960
<v Speaker 1>lump of cells.

0:01:46.160 --> 0:01:46.880
<v Speaker 2>You can see that.

0:01:46.880 --> 0:01:51.400
<v Speaker 1>There's a wrinkled layer on the outside. And when people

0:01:51.400 --> 0:01:54.480
<v Speaker 1>dissect that, they can see that that part is about

0:01:54.600 --> 0:01:58.520
<v Speaker 1>three millimeters thick, and it looks a little different, looks grayer.

0:01:58.920 --> 0:02:01.280
<v Speaker 1>And so that part is called the gray matter. And

0:02:01.360 --> 0:02:05.520
<v Speaker 1>we call this the cortex, which means bark, like tree bark.

0:02:06.000 --> 0:02:09.160
<v Speaker 1>And the stuff below that thin layer is called white matter.

0:02:09.200 --> 0:02:13.040
<v Speaker 1>And it looks white because the tiny data cables coming

0:02:13.080 --> 0:02:16.240
<v Speaker 1>off the cells, the axons, these are wrapped in a.

0:02:16.200 --> 0:02:19.120
<v Speaker 2>Little sheath called myelin, which makes it look white.

0:02:19.320 --> 0:02:22.799
<v Speaker 1>Okay, Now, what you immediately noticed by looking at brains

0:02:22.919 --> 0:02:26.960
<v Speaker 1>across different mammals is that all the stuff you find

0:02:27.120 --> 0:02:31.520
<v Speaker 1>under the cortex, all the sub cortical stuff, looks essentially

0:02:31.600 --> 0:02:34.960
<v Speaker 1>the same. Horses and elephants and mice. They all have

0:02:35.080 --> 0:02:37.799
<v Speaker 1>the same architecture going on that we do.

0:02:37.840 --> 0:02:38.440
<v Speaker 2>They all have.

0:02:38.400 --> 0:02:42.840
<v Speaker 1>A thalamus and hippocampus and cerebellum and so on. But

0:02:42.919 --> 0:02:46.560
<v Speaker 1>there's one thing that really distinguishes us from our cousins,

0:02:47.040 --> 0:02:51.200
<v Speaker 1>and for that we return to the gray matter, the cortex.

0:02:51.360 --> 0:02:54.800
<v Speaker 1>It's not that our cousins don't have a cortex. What

0:02:54.960 --> 0:03:00.760
<v Speaker 1>distinguishes us is the absolute enormity of our cortex. We

0:03:00.919 --> 0:03:05.400
<v Speaker 1>humans have a ton of this stuff. So take four

0:03:05.440 --> 0:03:08.800
<v Speaker 1>pieces of paper from your printer and place them next

0:03:08.800 --> 0:03:11.120
<v Speaker 1>to each other to make one really large piece.

0:03:11.560 --> 0:03:13.440
<v Speaker 2>That's how much cortex.

0:03:13.080 --> 0:03:16.760
<v Speaker 1>A human has. If you were to spread out the wrinkles. Now,

0:03:16.880 --> 0:03:20.040
<v Speaker 1>our nearest cousins, the great apes only have about one

0:03:20.080 --> 0:03:22.799
<v Speaker 1>piece of paper worth, and most mammals have a lot

0:03:22.880 --> 0:03:26.880
<v Speaker 1>less than that. So something about the story of the

0:03:27.080 --> 0:03:30.600
<v Speaker 1>runaway human success has to do with the fact that

0:03:30.639 --> 0:03:34.200
<v Speaker 1>we have way more cortex for our body size than

0:03:34.320 --> 0:03:38.120
<v Speaker 1>any other creature. And side note, I'm really talking about

0:03:38.200 --> 0:03:42.280
<v Speaker 1>what's called the neocortex or new cortex, because we also

0:03:42.280 --> 0:03:45.720
<v Speaker 1>have a little bit of paleocortex or old cortex. But

0:03:45.760 --> 0:03:48.320
<v Speaker 1>the thing that really makes us outstanding is the amount

0:03:48.360 --> 0:03:50.280
<v Speaker 1>of neo cortex that we have.

0:03:51.400 --> 0:03:54.040
<v Speaker 2>But what is this neocortex doing.

0:03:54.920 --> 0:03:58.280
<v Speaker 1>Well, if you look at any neuroscience textbook, you'll see

0:03:58.320 --> 0:04:01.240
<v Speaker 1>that this part of the brain, the cortex, is often

0:04:01.720 --> 0:04:05.760
<v Speaker 1>drawn with different colored regions like this red region over

0:04:05.800 --> 0:04:08.000
<v Speaker 1>here is devoted to vision, and this green one is

0:04:08.040 --> 0:04:10.760
<v Speaker 1>devoted to hearing, and this yellow one to touch and

0:04:10.800 --> 0:04:14.320
<v Speaker 1>so on. But something I've been obsessed with and write

0:04:14.360 --> 0:04:16.640
<v Speaker 1>about in my latest book, Live Wired, is that this

0:04:16.720 --> 0:04:19.000
<v Speaker 1>is the wrong way to think about it, because the

0:04:19.000 --> 0:04:22.240
<v Speaker 1>neocortex is remarkably flexible.

0:04:22.279 --> 0:04:23.599
<v Speaker 2>It's not a fixed map.

0:04:24.160 --> 0:04:27.440
<v Speaker 1>If you are born blind, the part of your cortex

0:04:27.520 --> 0:04:30.000
<v Speaker 1>that we would have thought of as visual cortex gets

0:04:30.040 --> 0:04:33.720
<v Speaker 1>taken over by hearing and touch and so on. Now

0:04:33.839 --> 0:04:35.400
<v Speaker 1>let me just be really clear what I mean by

0:04:35.480 --> 0:04:39.679
<v Speaker 1>taking over. The neurons there are the same The cortex

0:04:39.760 --> 0:04:42.479
<v Speaker 1>looks exactly the same from the outside, but the function

0:04:42.600 --> 0:04:46.760
<v Speaker 1>of those particular neurons is now not visual. They have

0:04:46.839 --> 0:04:50.120
<v Speaker 1>nothing to do with visual information anymore. Now that same

0:04:50.240 --> 0:04:55.200
<v Speaker 1>neuron instead of firing when it detects a moving object,

0:04:55.600 --> 0:04:58.880
<v Speaker 1>now it responds to a touch on your.

0:04:58.720 --> 0:05:03.479
<v Speaker 2>Toe, or hearing a B flat note or whatever. So

0:05:03.600 --> 0:05:05.000
<v Speaker 2>the little labels that.

0:05:04.920 --> 0:05:08.240
<v Speaker 1>We draw onto the brain, these maps that we impose,

0:05:08.960 --> 0:05:09.400
<v Speaker 1>these are.

0:05:09.320 --> 0:05:11.240
<v Speaker 2>Actually massively flexible.

0:05:11.480 --> 0:05:13.320
<v Speaker 1>And as you may know, I gave a talk at

0:05:13.480 --> 0:05:15.520
<v Speaker 1>TED about this a while ago, where I showed that

0:05:15.880 --> 0:05:19.680
<v Speaker 1>you can feed in new kinds of information, let's say

0:05:19.680 --> 0:05:22.440
<v Speaker 1>through the ears or the skin, and the brain will

0:05:22.480 --> 0:05:25.320
<v Speaker 1>figure out how to deal with that data. It will

0:05:25.760 --> 0:05:30.200
<v Speaker 1>flexibly devote part of its cortical real estate to that.

0:05:30.839 --> 0:05:34.360
<v Speaker 1>And this line of thinking led some scientists, like Vernon

0:05:34.360 --> 0:05:37.920
<v Speaker 1>mount Castle some decades ago to realize that the cells

0:05:37.920 --> 0:05:39.839
<v Speaker 1>of the cortex are.

0:05:39.839 --> 0:05:41.120
<v Speaker 2>A one trick pony.

0:05:41.800 --> 0:05:46.840
<v Speaker 1>No neuron is inherently a visual neuron or a neuron

0:05:46.880 --> 0:05:50.120
<v Speaker 1>devoted to hearing or touch or smell or taste or

0:05:50.160 --> 0:05:54.039
<v Speaker 1>memory or whatever. All parts of the cortex are perfectly

0:05:54.120 --> 0:05:58.120
<v Speaker 1>capable and willing to take on any job. So that

0:05:58.240 --> 0:06:02.200
<v Speaker 1>suggests they're all running some sort of basic algorithm. And

0:06:02.200 --> 0:06:05.400
<v Speaker 1>it doesn't matter what kind of data you feed in.

0:06:05.440 --> 0:06:08.479
<v Speaker 1>Different parts of the cortex will say cool, I'll build

0:06:08.560 --> 0:06:11.479
<v Speaker 1>a representation of that data. I don't care if it

0:06:11.480 --> 0:06:15.520
<v Speaker 1>comes from photons or air compression waves or temperature or whatever.

0:06:15.960 --> 0:06:19.520
<v Speaker 1>I'm on the job here to build an understanding of

0:06:19.560 --> 0:06:24.680
<v Speaker 1>whatever is coming in locally. Now, it's not individual neurons

0:06:24.680 --> 0:06:28.560
<v Speaker 1>that are building models, but instead groups of many tens

0:06:28.680 --> 0:06:33.760
<v Speaker 1>of thousands of neurons arranged in a six layered cylinder.

0:06:33.920 --> 0:06:37.080
<v Speaker 1>So think about this like you're a geologist and you

0:06:37.200 --> 0:06:40.480
<v Speaker 1>drilled out a cylinder of rock and you saw six

0:06:40.600 --> 0:06:43.120
<v Speaker 1>layers in it, six sedimentary layers.

0:06:43.480 --> 0:06:45.359
<v Speaker 2>That's what the neocortex looks like.

0:06:45.520 --> 0:06:49.680
<v Speaker 1>Six layers. And it's built out of these columns which

0:06:49.720 --> 0:06:52.839
<v Speaker 1>have the same types of neurons with the same connection

0:06:53.000 --> 0:06:56.760
<v Speaker 1>patterns in each column. And so think about the cortex

0:06:57.160 --> 0:07:00.440
<v Speaker 1>as being made of lots of these columns, like taking

0:07:00.880 --> 0:07:03.600
<v Speaker 1>hundreds of thousands of grains of rice and standing them

0:07:03.680 --> 0:07:05.760
<v Speaker 1>up on their end and packing them all next to

0:07:05.800 --> 0:07:09.560
<v Speaker 1>each other. Now, people have known about cortical columns for

0:07:09.600 --> 0:07:13.600
<v Speaker 1>many decades since Vernon Mountcastle first discovered these in nineteen

0:07:13.680 --> 0:07:17.880
<v Speaker 1>fifty seven. But recently someone has pulled together several different

0:07:17.960 --> 0:07:22.640
<v Speaker 1>threads to propose how this could underlie what the cortex

0:07:22.760 --> 0:07:26.800
<v Speaker 1>is all about. And that's someone is Jeff Hawkins and

0:07:26.840 --> 0:07:30.040
<v Speaker 1>his team. And so I met with Jeff in my studio. Now,

0:07:30.200 --> 0:07:33.000
<v Speaker 1>Jeff is one of my favorite people because he does

0:07:33.600 --> 0:07:37.000
<v Speaker 1>theoretical neuroscience. He really tries to figure out the big

0:07:37.080 --> 0:07:40.360
<v Speaker 1>picture of what the brain is doing. Now, Jeff has

0:07:40.400 --> 0:07:43.160
<v Speaker 1>a very interesting history, so I'll just mention that in

0:07:43.200 --> 0:07:46.400
<v Speaker 1>the nineteen eighties he was a graduate student at Berkeley,

0:07:46.400 --> 0:07:50.119
<v Speaker 1>where he proposed a PhD thesis on a new theory

0:07:50.160 --> 0:07:53.560
<v Speaker 1>of the cortex, but his proposal was rejected, and so

0:07:53.640 --> 0:07:58.000
<v Speaker 1>he ended up pursuing his vision for mobile computing instead,

0:07:58.080 --> 0:08:01.640
<v Speaker 1>and in nineteen ninety two he launch the company Palm,

0:08:01.840 --> 0:08:05.160
<v Speaker 1>which made the Palm Pilot. If you remember that, this

0:08:05.320 --> 0:08:08.720
<v Speaker 1>was this little handheld device and you could write on

0:08:08.760 --> 0:08:11.880
<v Speaker 1>it with a stylus and it would translate your handwriting

0:08:11.880 --> 0:08:14.520
<v Speaker 1>into text. And you can use this for your address

0:08:14.520 --> 0:08:17.119
<v Speaker 1>book and your calendar and your contacts and note taking.

0:08:17.440 --> 0:08:21.400
<v Speaker 1>This was the first entrant into the world of portable computing,

0:08:21.480 --> 0:08:21.720
<v Speaker 1>and it.

0:08:21.680 --> 0:08:22.679
<v Speaker 2>Really changed the world.

0:08:23.480 --> 0:08:27.320
<v Speaker 1>Anyhow, A decade later, Jeff returned to his original love,

0:08:27.360 --> 0:08:31.440
<v Speaker 1>which was theoretical neuroscience, trying to figure out what's going

0:08:31.520 --> 0:08:34.440
<v Speaker 1>on with the brain, and he wrote a book in

0:08:34.440 --> 0:08:37.800
<v Speaker 1>two thousand and four called on Intelligence, which was very

0:08:37.880 --> 0:08:40.640
<v Speaker 1>influential on me and lots of other thinkers I know.

0:08:40.920 --> 0:08:43.360
<v Speaker 1>So I was very excited when Jeff recently came out

0:08:43.400 --> 0:08:47.319
<v Speaker 1>with his next book that represents his last decade and

0:08:47.400 --> 0:08:51.040
<v Speaker 1>a half of research. It's called One Thousand Brains, a

0:08:51.080 --> 0:08:54.559
<v Speaker 1>New Theory of Intelligence, and it describes his framework for

0:08:54.640 --> 0:08:57.880
<v Speaker 1>thinking about the brain. So, without further ado, let's dive

0:08:57.920 --> 0:09:06.560
<v Speaker 1>into a very cool new model of the brain. Okay, Jeff,

0:09:06.600 --> 0:09:09.079
<v Speaker 1>So you are a theoretician. You think about the brain

0:09:09.080 --> 0:09:11.840
<v Speaker 1>from a high level. We're in this era now of

0:09:12.000 --> 0:09:14.400
<v Speaker 1>AI where AI is doing all kinds of things that

0:09:14.440 --> 0:09:17.200
<v Speaker 1>are amazing and no one expected. But you see the

0:09:17.280 --> 0:09:19.920
<v Speaker 1>brain as being very different from what is going on

0:09:19.960 --> 0:09:22.520
<v Speaker 1>with let's say, large language models. So tell us about that.

0:09:22.840 --> 0:09:26.480
<v Speaker 2>That's absolutely true. You know, the current AI wave is

0:09:26.480 --> 0:09:29.200
<v Speaker 2>really amazing, but those models don't work at all like

0:09:29.280 --> 0:09:32.160
<v Speaker 2>the brain. And I think you could start with one

0:09:32.360 --> 0:09:37.720
<v Speaker 2>really fundamental difference. Brains work through movement. We move our

0:09:37.760 --> 0:09:40.640
<v Speaker 2>bodies through the world. We move our hands over objects

0:09:40.679 --> 0:09:42.520
<v Speaker 2>to touch and learn what they are. We move our

0:09:42.520 --> 0:09:45.600
<v Speaker 2>eyes constantly, so the inputs of the brain are constantly changing,

0:09:45.640 --> 0:09:48.800
<v Speaker 2>but mostly because we're moving through the world. And the

0:09:48.880 --> 0:09:51.520
<v Speaker 2>term for that is a centory motor system. And the

0:09:51.559 --> 0:09:54.560
<v Speaker 2>brain can't understand its inputs unless it knows how it's

0:09:54.559 --> 0:09:57.280
<v Speaker 2>moving through the world. So we learn by exploring, by

0:09:57.679 --> 0:10:01.880
<v Speaker 2>moving different places, picking things up, touching, so on, and

0:10:01.920 --> 0:10:05.120
<v Speaker 2>that's all Animals that move in the world learn this way.

0:10:05.320 --> 0:10:07.760
<v Speaker 2>So this idea that the brain is the central motor

0:10:07.800 --> 0:10:10.320
<v Speaker 2>system has been known back in the late eighteen hundreds,

0:10:10.480 --> 0:10:14.400
<v Speaker 2>but it's pretty much ignored by everybody. But it leads

0:10:14.440 --> 0:10:18.080
<v Speaker 2>to a very fundamental different way of how we acquire

0:10:18.120 --> 0:10:21.520
<v Speaker 2>knowledge and how knowledge is represented in the brain. Whereas

0:10:22.080 --> 0:10:25.000
<v Speaker 2>today's AI is most of it's built on well deep

0:10:25.080 --> 0:10:28.280
<v Speaker 2>learning of transformer technologies, which are essentially we feed data

0:10:28.320 --> 0:10:31.360
<v Speaker 2>to it. We don't it doesn't explore it, and we

0:10:31.400 --> 0:10:34.280
<v Speaker 2>feed to large language models. We just feed a language.

0:10:34.400 --> 0:10:37.280
<v Speaker 2>So there's no inherent knowledge about what these words mean,

0:10:37.400 --> 0:10:40.720
<v Speaker 2>only what these words mean in the context of other words. Right,

0:10:40.760 --> 0:10:43.320
<v Speaker 2>But you and I can pick up a cat and

0:10:43.400 --> 0:10:45.640
<v Speaker 2>touch it and feel it and know this warmth, and

0:10:45.640 --> 0:10:48.360
<v Speaker 2>we understand how its body's moved because no one has

0:10:48.440 --> 0:10:52.240
<v Speaker 2>to tell us that. We just experience it directly. So

0:10:52.880 --> 0:10:56.880
<v Speaker 2>this is a huge gap between brains. Pretty much all

0:10:56.920 --> 0:11:00.080
<v Speaker 2>brains work by century motor learning and almost all oh

0:11:00.559 --> 0:11:03.920
<v Speaker 2>it doesn't. And you can just peel the layers apart

0:11:03.960 --> 0:11:05.920
<v Speaker 2>and see what the differences are, and it makes a

0:11:06.040 --> 0:11:08.679
<v Speaker 2>huge difference. So I'm not a fan. I'm a fan

0:11:08.760 --> 0:11:10.920
<v Speaker 2>of AI today, but I don't think it's the future

0:11:10.960 --> 0:11:12.960
<v Speaker 2>of AI. I don't think it's going to get you

0:11:13.080 --> 0:11:15.480
<v Speaker 2>to what people really want or truly intelligent machines.

0:11:16.840 --> 0:11:19.000
<v Speaker 1>Okay, terrific, And we'll dive into that more in a

0:11:19.040 --> 0:11:19.480
<v Speaker 1>little bit.

0:11:19.559 --> 0:11:19.760
<v Speaker 2>Now.

0:11:19.960 --> 0:11:23.559
<v Speaker 1>When we look at let's say the human brain, there's

0:11:24.000 --> 0:11:26.760
<v Speaker 1>lots of areas that we can point to. There's the

0:11:26.920 --> 0:11:31.199
<v Speaker 1>cortext and wrinkly outer bit, there's all these subcortical areas.

0:11:31.440 --> 0:11:35.640
<v Speaker 1>When you think about intelligence and the stuff that we're

0:11:35.640 --> 0:11:37.280
<v Speaker 1>going to talk about today, what is the part that

0:11:37.320 --> 0:11:38.040
<v Speaker 1>you concentrate on.

0:11:38.160 --> 0:11:41.319
<v Speaker 2>Well, we concentrate first and foremost in the New York cortext,

0:11:41.360 --> 0:11:43.560
<v Speaker 2>which is about seventy five percent of the volume of

0:11:43.600 --> 0:11:45.120
<v Speaker 2>your brain. I mean it's what you see, as you said,

0:11:45.360 --> 0:11:46.760
<v Speaker 2>if you can take a scope, and that's what you

0:11:46.800 --> 0:11:50.240
<v Speaker 2>see in the New York cortext. And so it's a pretty

0:11:50.280 --> 0:11:53.600
<v Speaker 2>dominant part of what we think of intelligence. You can't

0:11:53.640 --> 0:11:56.120
<v Speaker 2>consider it completely on its own. I mean, it's connected

0:11:56.160 --> 0:11:59.000
<v Speaker 2>to all these other things. And so we also study

0:11:59.080 --> 0:12:02.160
<v Speaker 2>those other things and in service to the new or cortex.

0:12:02.320 --> 0:12:05.040
<v Speaker 2>So we study the thalmbists, and we study the cerebellopment,

0:12:05.080 --> 0:12:07.320
<v Speaker 2>We study the basic just because you have to know

0:12:07.360 --> 0:12:12.640
<v Speaker 2>how the cortex workship these other things. But our primarily

0:12:12.679 --> 0:12:15.400
<v Speaker 2>our goal and many neurosign this goal is to understand

0:12:15.440 --> 0:12:18.360
<v Speaker 2>the New York cortex, because that's what mammals have. We've

0:12:18.360 --> 0:12:21.280
<v Speaker 2>got a big one. You know, everything we think, most

0:12:21.320 --> 0:12:23.560
<v Speaker 2>of what we think about being intelligent, about our ability

0:12:23.600 --> 0:12:25.960
<v Speaker 2>to understand the world and generate language and see and

0:12:26.040 --> 0:12:28.080
<v Speaker 2>hear and and so on, is the New York cortext

0:12:28.160 --> 0:12:31.640
<v Speaker 2>not one hundred percent, but most of it. Unfortunately. Also,

0:12:31.800 --> 0:12:33.520
<v Speaker 2>not only is it the biggest structure, but it's a

0:12:33.679 --> 0:12:36.800
<v Speaker 2>very very regular structure. So you can look at this thing,

0:12:36.800 --> 0:12:38.760
<v Speaker 2>the New or Courts is like a sheet of cells.

0:12:38.880 --> 0:12:41.600
<v Speaker 2>It's you know, it's like a size of a large

0:12:41.600 --> 0:12:44.439
<v Speaker 2>dinner napkin and only a few millimeters thick, and it

0:12:44.480 --> 0:12:48.520
<v Speaker 2>gets wrinkly because it's stuck it in your head that

0:12:48.960 --> 0:12:53.239
<v Speaker 2>everywhere you look on it, it looks remarkably complicated and remarkably

0:12:53.280 --> 0:12:56.240
<v Speaker 2>the same. So the areas are doing vision, look like

0:12:56.520 --> 0:12:59.880
<v Speaker 2>the areas are doing languages, look like the areas are

0:12:59.880 --> 0:13:02.520
<v Speaker 2>doing touch. Look the areas that are doing everything. Really

0:13:02.960 --> 0:13:05.839
<v Speaker 2>and so there's been long speculated that there's sort of

0:13:05.880 --> 0:13:10.480
<v Speaker 2>a common algorithmic principle that's applying to everything everything we do,

0:13:10.520 --> 0:13:12.679
<v Speaker 2>all of our sensory inputs, all of our thinking, all

0:13:12.720 --> 0:13:16.160
<v Speaker 2>the language, it's hard to believe, but the evidence is overwhelming,

0:13:16.800 --> 0:13:19.560
<v Speaker 2>and so our research has really been to understand what

0:13:19.720 --> 0:13:23.679
<v Speaker 2>is that that algorithm, corical algorithm, often referred to as

0:13:23.679 --> 0:13:27.640
<v Speaker 2>a cortical column. You know, this repeated structure that seems

0:13:27.640 --> 0:13:29.760
<v Speaker 2>to underline vision and hearing and touch and thought and

0:13:29.800 --> 0:13:33.760
<v Speaker 2>everything we do. And that's that's just just an appealing

0:13:33.800 --> 0:13:35.920
<v Speaker 2>thing to try to understand. And we've cracked it. We've

0:13:35.960 --> 0:13:38.160
<v Speaker 2>actually we actually cracked it. We understand what's going on.

0:13:38.280 --> 0:13:38.880
<v Speaker 2>That's awesome.

0:13:38.920 --> 0:13:41.640
<v Speaker 1>Okay, so a couple of things, right, So the way

0:13:41.679 --> 0:13:43.959
<v Speaker 1>I sometimes phrase this to people is that if I

0:13:44.000 --> 0:13:46.720
<v Speaker 1>had a magical microscope and could show you a part

0:13:46.720 --> 0:13:48.360
<v Speaker 1>of the brain and you can see all the activity

0:13:48.480 --> 0:13:51.680
<v Speaker 1>running around in the cortext there, could you tell me

0:13:51.800 --> 0:13:54.599
<v Speaker 1>is that visual cortex are auditory or somatosensory? And the

0:13:54.640 --> 0:13:56.400
<v Speaker 1>answer is you couldn't tell me, and I couldn't tell you.

0:13:56.400 --> 0:13:59.120
<v Speaker 2>Guys, Right, it all looks the same, and there's a

0:13:59.280 --> 0:14:01.240
<v Speaker 2>and there's as you know, oh, there's these experiments people

0:14:01.240 --> 0:14:02.840
<v Speaker 2>have done where well, first of all, if you have

0:14:02.920 --> 0:14:05.319
<v Speaker 2>trauma to one part of the cortext other parts will

0:14:05.320 --> 0:14:08.800
<v Speaker 2>pick up the same function. You can also people re

0:14:08.960 --> 0:14:11.280
<v Speaker 2>routed sensory and puts the different parts of the cortext

0:14:11.559 --> 0:14:13.280
<v Speaker 2>in animals and they seem the work.

0:14:13.559 --> 0:14:17.480
<v Speaker 1>So, for example, you have visual information instead of going

0:14:17.480 --> 0:14:19.920
<v Speaker 1>to the back of the visual cortex, that gets rerouted

0:14:19.960 --> 0:14:24.520
<v Speaker 1>to the auditory cortex, and that auditory cortex becomes visual cortext.

0:14:24.640 --> 0:14:30.080
<v Speaker 2>Right, It's incredibly powerful and flexible system. And mammals, you know,

0:14:30.160 --> 0:14:32.800
<v Speaker 2>all we have we have a set of sensors, quite

0:14:32.800 --> 0:14:34.440
<v Speaker 2>a few action more than most people think because the

0:14:34.480 --> 0:14:37.800
<v Speaker 2>skin there's a lot of different sensors. But other animals

0:14:37.880 --> 0:14:40.680
<v Speaker 2>have different sensors and they have cortext too, And so

0:14:41.920 --> 0:14:45.760
<v Speaker 2>there's seems to be this universal algorithm that can be applied.

0:14:46.440 --> 0:14:50.000
<v Speaker 2>And now we know it's a century motor algorithm. It

0:14:50.000 --> 0:14:52.840
<v Speaker 2>can be applied, and and we've spent decades trying to

0:14:52.840 --> 0:14:55.480
<v Speaker 2>figure this out and we've cracked it. Oh that's awesome.

0:14:55.760 --> 0:14:57.520
<v Speaker 1>Just before you tell us about that, So tell us

0:14:57.520 --> 0:14:58.880
<v Speaker 1>what a cortical column is.

0:14:59.000 --> 0:15:01.240
<v Speaker 2>Okay, So imagine we talked about. The near cortex is

0:15:01.240 --> 0:15:04.800
<v Speaker 2>a sheet of cells like three milimeters stick. A cortical

0:15:04.920 --> 0:15:07.600
<v Speaker 2>column is a little section of that going through the

0:15:07.640 --> 0:15:11.480
<v Speaker 2>three free milimeters. It's convariating with from a third of

0:15:11.480 --> 0:15:14.040
<v Speaker 2>a millimeters to a millimeters in diameter. It's it's it's

0:15:14.040 --> 0:15:15.920
<v Speaker 2>a it's not something you would see. It's not like

0:15:15.960 --> 0:15:18.520
<v Speaker 2>it's sitting there to be plucked out. But we know

0:15:18.560 --> 0:15:21.920
<v Speaker 2>they exist, and so within that, let's let's say it's

0:15:21.960 --> 0:15:25.600
<v Speaker 2>a three millimeters tall and a half millimeters wide cylinder

0:15:25.640 --> 0:15:30.320
<v Speaker 2>that goes across the cortex that contains all the neural

0:15:30.400 --> 0:15:33.560
<v Speaker 2>machinery that you would see anywhere in the cortex and

0:15:33.560 --> 0:15:36.880
<v Speaker 2>in each cortical column, because they look like a little

0:15:36.880 --> 0:15:38.520
<v Speaker 2>grain of rice in some sense. Why you can imagine

0:15:38.560 --> 0:15:40.440
<v Speaker 2>what's a little brains of ice stacked next to each other.

0:15:41.160 --> 0:15:44.640
<v Speaker 2>Each cornico column gets input from some well in parts

0:15:44.640 --> 0:15:47.320
<v Speaker 2>of the bend they get from some pats of sensory input,

0:15:47.400 --> 0:15:49.520
<v Speaker 2>so from pats to the retina, from pats to the

0:15:49.560 --> 0:15:53.600
<v Speaker 2>cochlea of patrio skin. Other parts get information from parts

0:15:53.640 --> 0:15:56.560
<v Speaker 2>of the near cortex. So the cortex is connected to cortex,

0:15:57.200 --> 0:15:58.960
<v Speaker 2>but each one is looking at a small If you

0:15:58.960 --> 0:16:01.640
<v Speaker 2>think about the primary entry regions of the cortex, which

0:16:01.640 --> 0:16:06.240
<v Speaker 2>are quite large, they're getting input from a small sensory area, right,

0:16:06.320 --> 0:16:09.400
<v Speaker 2>And so people used to think that, well, if this

0:16:09.560 --> 0:16:11.480
<v Speaker 2>bil colm is only getting input from a small part

0:16:11.480 --> 0:16:13.840
<v Speaker 2>of the rent. Now, it can't really doing very much right,

0:16:13.880 --> 0:16:15.920
<v Speaker 2>It can't be very smart. All you could do is

0:16:15.960 --> 0:16:18.880
<v Speaker 2>process a little piece of information there, and therefore maybe

0:16:18.920 --> 0:16:20.760
<v Speaker 2>it's going to detect an edge or something like that,

0:16:20.800 --> 0:16:22.560
<v Speaker 2>and there's a lot of evidence for that. But we

0:16:22.600 --> 0:16:27.560
<v Speaker 2>now know what happens is that the cortical comms they

0:16:27.600 --> 0:16:30.800
<v Speaker 2>get input from over time, from different parts of the world.

0:16:30.880 --> 0:16:33.160
<v Speaker 2>So the eyes are moving like three times a second,

0:16:33.360 --> 0:16:35.320
<v Speaker 2>and so that cortical comms may be looking at three

0:16:35.360 --> 0:16:39.640
<v Speaker 2>different things every second, and it can integrate how the

0:16:39.720 --> 0:16:42.240
<v Speaker 2>sensor is moving, how your eyes are moving, with what

0:16:42.320 --> 0:16:45.400
<v Speaker 2>it's sensing to build models that are much larger than

0:16:45.400 --> 0:16:47.840
<v Speaker 2>it can sense. And the same way that you could

0:16:47.840 --> 0:16:51.280
<v Speaker 2>take your finger in a dark room and say, okay, David,

0:16:51.320 --> 0:16:53.280
<v Speaker 2>I want you to learn this new object. Let's call it.

0:16:53.320 --> 0:16:55.880
<v Speaker 2>You know, a coffee cup. You never touch it, and

0:16:55.880 --> 0:16:57.640
<v Speaker 2>so you could do is you touch the coffee cup

0:16:57.680 --> 0:16:59.600
<v Speaker 2>and you move your finger along and around, and as

0:16:59.640 --> 0:17:01.840
<v Speaker 2>you do, you build a three dimensional model of the cup.

0:17:01.920 --> 0:17:04.639
<v Speaker 2>Even though you're only getting input from one fingertip, the

0:17:04.720 --> 0:17:06.640
<v Speaker 2>eyes are doing the same thing. It's surprising you don't

0:17:06.680 --> 0:17:10.200
<v Speaker 2>realize this so every quarter do Colm, when we understand

0:17:10.240 --> 0:17:13.880
<v Speaker 2>now is doing this sort of processing movement, information and

0:17:13.960 --> 0:17:17.200
<v Speaker 2>sense for information, building what we call structure or three

0:17:17.280 --> 0:17:19.400
<v Speaker 2>D models of things in the world. So it's quite

0:17:19.440 --> 0:17:22.560
<v Speaker 2>different than even those neurosciences think about it, and there's

0:17:22.600 --> 0:17:24.080
<v Speaker 2>a lot of reasons we can talk about how it

0:17:24.160 --> 0:17:25.840
<v Speaker 2>was missed for all these years.

0:17:25.960 --> 0:17:28.200
<v Speaker 1>So in the court, you have a century six layers

0:17:28.240 --> 0:17:32.679
<v Speaker 1>of cells, and a column is all six layers, is

0:17:32.720 --> 0:17:36.639
<v Speaker 1>all six layers. It's going up and down. It's like

0:17:36.880 --> 0:17:38.440
<v Speaker 1>think of it like layers of a cake. And the

0:17:38.520 --> 0:17:41.480
<v Speaker 1>column is you're taking a straw and shoving it through

0:17:41.520 --> 0:17:43.000
<v Speaker 1>the top, and so you've got.

0:17:42.760 --> 0:17:46.320
<v Speaker 2>This, Okay, got a straw cake.

0:17:46.560 --> 0:17:49.280
<v Speaker 1>Okay, great, And so the idea is if you're looking

0:17:49.280 --> 0:17:54.120
<v Speaker 1>at some column in you know, in primary visual cortex. Yeah,

0:17:54.240 --> 0:17:57.600
<v Speaker 1>your point, Jeff was that, you know, it's it's like

0:17:57.680 --> 0:17:59.960
<v Speaker 1>looking at the world through a straw.

0:18:00.119 --> 0:18:02.439
<v Speaker 2>It only sees a little tiny piece of the world.

0:18:02.480 --> 0:18:05.520
<v Speaker 1>But because the eyes are moving arout, because you're exploring

0:18:05.560 --> 0:18:09.560
<v Speaker 1>the world, this is actually getting lots of parts of information.

0:18:09.600 --> 0:18:11.920
<v Speaker 1>It's exploring the world in the same way that your

0:18:11.920 --> 0:18:13.000
<v Speaker 1>finger typically.

0:18:12.640 --> 0:18:15.160
<v Speaker 2>Right, and it has to integrate information over time, that's

0:18:15.160 --> 0:18:17.040
<v Speaker 2>the key, right, And you can literally do this. You

0:18:17.040 --> 0:18:19.720
<v Speaker 2>can look at the world through a straw, right, and

0:18:19.720 --> 0:18:21.600
<v Speaker 2>and you can say, oh, what am I looking at? Well,

0:18:21.600 --> 0:18:24.320
<v Speaker 2>you can't tell them. You start moving the straw and

0:18:24.359 --> 0:18:26.600
<v Speaker 2>then you can start and you can also learn objects

0:18:26.640 --> 0:18:29.200
<v Speaker 2>that way. So literally you can learn by looking through

0:18:29.200 --> 0:18:31.640
<v Speaker 2>a straw, which is what sort of what one column

0:18:31.680 --> 0:18:32.920
<v Speaker 2>is doing? Got it?

0:18:33.040 --> 0:18:37.119
<v Speaker 1>And in your model there are thousands of such columns

0:18:37.320 --> 0:18:42.359
<v Speaker 1>and each one of these is learning a model of

0:18:42.400 --> 0:18:44.800
<v Speaker 1>the world as it's going. So tell us about right, right.

0:18:44.920 --> 0:18:48.720
<v Speaker 2>So I think this idea that there's all these columns

0:18:48.800 --> 0:18:50.960
<v Speaker 2>is not a new idea and that they have this

0:18:51.040 --> 0:18:54.320
<v Speaker 2>fundamental argithm. But what we were I think the first

0:18:54.320 --> 0:18:56.439
<v Speaker 2>people to kind of figure out what it is and

0:18:56.480 --> 0:18:59.119
<v Speaker 2>what it's doing. So the trick of this thing is

0:18:59.280 --> 0:19:01.680
<v Speaker 2>it's trick it here. You know, when you look out

0:19:01.680 --> 0:19:04.439
<v Speaker 2>at the world, you have a sense anybody you have

0:19:04.480 --> 0:19:06.200
<v Speaker 2>a sense where things are. I have a sense where

0:19:06.240 --> 0:19:07.760
<v Speaker 2>you are relative to me. I have a sense where

0:19:07.800 --> 0:19:09.240
<v Speaker 2>this microphone is relative to me. I know where my

0:19:09.240 --> 0:19:12.399
<v Speaker 2>hand is relative to this cop I Now there turns

0:19:12.400 --> 0:19:14.880
<v Speaker 2>out that you have any kind of sense of location

0:19:15.040 --> 0:19:17.840
<v Speaker 2>in space, you have to have neurons representing it. There's

0:19:17.880 --> 0:19:19.920
<v Speaker 2>nothing goes on in the brain if there aren't neurons

0:19:19.960 --> 0:19:22.960
<v Speaker 2>firing doing it. Turns out most of the machinery in

0:19:23.000 --> 0:19:25.600
<v Speaker 2>the New York cortex is keeping track of where things

0:19:25.640 --> 0:19:28.720
<v Speaker 2>are relative to other things. So those six layers, all

0:19:28.720 --> 0:19:32.640
<v Speaker 2>those cells, at least half of that circuitry is tracking

0:19:33.160 --> 0:19:35.679
<v Speaker 2>where the sensory input is coming from in the world.

0:19:36.000 --> 0:19:38.120
<v Speaker 2>So if I move my finger over this coffee cup,

0:19:38.760 --> 0:19:41.560
<v Speaker 2>the part that's getting information from the sensory like I'm

0:19:41.600 --> 0:19:44.399
<v Speaker 2>sensing an edge, for your example, as I move my finger,

0:19:44.560 --> 0:19:47.000
<v Speaker 2>it has to keep track of where my finger is,

0:19:47.040 --> 0:19:50.080
<v Speaker 2>a location of it and its orientation relative this cup.

0:19:50.160 --> 0:19:53.719
<v Speaker 2>Is quite complicated, but that's what it has to do

0:19:53.800 --> 0:19:55.639
<v Speaker 2>to build the models. And now we know how it

0:19:55.720 --> 0:19:59.000
<v Speaker 2>does it. There's all this evidence for it. So the

0:19:59.080 --> 0:20:01.280
<v Speaker 2>brain is just trying to track of where all of

0:20:01.280 --> 0:20:03.479
<v Speaker 2>its inputs are in the world, all relative other things.

0:20:03.520 --> 0:20:05.800
<v Speaker 2>Then it builds up these three dimensional models of the world.

0:20:06.000 --> 0:20:08.120
<v Speaker 2>So tell us about how it does that then, right,

0:20:08.200 --> 0:20:10.520
<v Speaker 2>so you can think about when you're in high school,

0:20:10.520 --> 0:20:13.360
<v Speaker 2>you learned about Cartesian coordinates, x, y, and z coordinates, right,

0:20:13.800 --> 0:20:15.359
<v Speaker 2>and so if I wanted to say where is something?

0:20:15.400 --> 0:20:17.399
<v Speaker 2>Where are your relative to me? I might say, okay,

0:20:17.440 --> 0:20:19.359
<v Speaker 2>your nose the origin, and I could say it's some

0:20:19.680 --> 0:20:21.879
<v Speaker 2>distance from here, and you know X, Y and Z.

0:20:22.520 --> 0:20:26.080
<v Speaker 2>Well you have to have something like that. But brains

0:20:26.119 --> 0:20:27.840
<v Speaker 2>don't do it that way. They do it another way.

0:20:27.880 --> 0:20:31.320
<v Speaker 2>And this was some very clever research in the last

0:20:31.359 --> 0:20:35.280
<v Speaker 2>twenty years that people discovered in the antarilo cortex and hippocampus.

0:20:35.280 --> 0:20:37.639
<v Speaker 2>These cells called grid cells and play cells, which actually

0:20:38.000 --> 0:20:42.280
<v Speaker 2>operate as reference frames. They are a way of neurons

0:20:42.320 --> 0:20:46.800
<v Speaker 2>to represent locations and they work differently than X, Y

0:20:46.840 --> 0:20:48.880
<v Speaker 2>and Z, so there's no origin. It's kind of really

0:20:48.920 --> 0:20:52.639
<v Speaker 2>clever how they work. The nature has discovered a different

0:20:52.640 --> 0:20:54.560
<v Speaker 2>way of doing this, so yeah, make sure you tell

0:20:54.640 --> 0:20:56.240
<v Speaker 2>us a little bit about that. Well, okay, but these

0:20:56.280 --> 0:20:58.160
<v Speaker 2>these are well known thing. It's just like grid cells,

0:20:58.160 --> 0:21:01.400
<v Speaker 2>which entronoic cord six. What they do is they these cells,

0:21:01.680 --> 0:21:04.000
<v Speaker 2>if you take a set of them, individual cells could

0:21:04.119 --> 0:21:06.240
<v Speaker 2>are not unique, and any d real sell me said,

0:21:06.280 --> 0:21:08.639
<v Speaker 2>I fired different locations in space, but if you take

0:21:08.640 --> 0:21:10.680
<v Speaker 2>a set of them, they're unique, and so you can

0:21:10.960 --> 0:21:14.600
<v Speaker 2>encode a unique location in space. And the key thing

0:21:14.640 --> 0:21:18.680
<v Speaker 2>about them is these cells automatically update as you move.

0:21:18.960 --> 0:21:20.960
<v Speaker 2>So the original grid cells are where your body is

0:21:20.960 --> 0:21:25.080
<v Speaker 2>in a room, and as you move, it's called past integration.

0:21:25.160 --> 0:21:27.800
<v Speaker 2>It says, okay, you're moving at this dis direction at

0:21:27.800 --> 0:21:30.720
<v Speaker 2>this speed, so we'll just automatically update these neurons. Is

0:21:30.720 --> 0:21:33.160
<v Speaker 2>if we know where you are right and so it's

0:21:33.400 --> 0:21:35.919
<v Speaker 2>it's what sales used to do dead reckoning. You just say, oh,

0:21:36.920 --> 0:21:38.960
<v Speaker 2>you know, I could I'm heading north for an hour

0:21:39.080 --> 0:21:41.479
<v Speaker 2>or three knots there for all these three miles in

0:21:41.520 --> 0:21:45.200
<v Speaker 2>this direction. So we know that these cells exist. They've

0:21:45.200 --> 0:21:47.359
<v Speaker 2>been well studied, people with Nobel Prize for these things.

0:21:47.880 --> 0:21:52.720
<v Speaker 2>So we speculated that the same neural mechanisms, these grid

0:21:52.760 --> 0:21:55.560
<v Speaker 2>cells and equivalents would be in the cortex at every

0:21:55.560 --> 0:21:59.560
<v Speaker 2>corticle home and sure enough they're finding that now. So

0:22:00.040 --> 0:22:02.320
<v Speaker 2>all kinds of research now they're finding in humans and

0:22:02.359 --> 0:22:05.320
<v Speaker 2>other animals that there are grid cell like structures in

0:22:05.359 --> 0:22:07.159
<v Speaker 2>cortical column And so what does that tell you? It

0:22:07.160 --> 0:22:09.520
<v Speaker 2>tells me that that's the mechanism by which the brain

0:22:09.640 --> 0:22:12.160
<v Speaker 2>uses for reference frames. And so literally, when you build

0:22:12.200 --> 0:22:13.840
<v Speaker 2>a model of something in the world, like a model

0:22:13.840 --> 0:22:16.240
<v Speaker 2>of a cup or a model of anything, it's essentially

0:22:16.320 --> 0:22:18.679
<v Speaker 2>what you're doing. You're just saying, here's the sensation, and

0:22:18.720 --> 0:22:22.000
<v Speaker 2>here it's location. Here's another sensation a different location. Here's

0:22:22.000 --> 0:22:24.479
<v Speaker 2>another sensation at a different location. You add all these together

0:22:24.560 --> 0:22:26.880
<v Speaker 2>and you get a three dimensional model. You can say,

0:22:26.960 --> 0:22:30.520
<v Speaker 2>this thing consists of these features in these locations relative

0:22:30.520 --> 0:22:34.160
<v Speaker 2>to each other. And so literally, in our head we

0:22:34.240 --> 0:22:37.159
<v Speaker 2>build models of the world that are three dimensional analogs

0:22:37.200 --> 0:22:41.400
<v Speaker 2>of the physical things we interact with. And that's why

0:22:41.440 --> 0:22:43.760
<v Speaker 2>you appears three dimensional to me. You know you're not

0:22:43.800 --> 0:22:46.040
<v Speaker 2>an image, You're a three dimensional structure because I have

0:22:46.080 --> 0:22:48.080
<v Speaker 2>a three dimensional model of humans, and I have a

0:22:48.080 --> 0:22:49.240
<v Speaker 2>special model for you, David.

0:22:51.040 --> 0:23:09.440
<v Speaker 1>Okay, great, okay. So you've got these columns in the cortex.

0:23:09.520 --> 0:23:12.280
<v Speaker 1>They're building three dimensional models or keeping track of where

0:23:12.320 --> 0:23:14.760
<v Speaker 1>your fingertips are, where your eyes are. So we've got

0:23:14.760 --> 0:23:18.720
<v Speaker 1>these different windows into the brain. You've got these data

0:23:18.760 --> 0:23:21.760
<v Speaker 1>cables coming in carrying spikes. It's all spikes, but some

0:23:21.800 --> 0:23:24.080
<v Speaker 1>of them carrying visual intraces to monitory is some touch.

0:23:25.320 --> 0:23:27.400
<v Speaker 2>Every brain doesn't know that, by the way, exactly right.

0:23:28.640 --> 0:23:32.840
<v Speaker 2>It's all the spikes exactly right. And so for any

0:23:32.920 --> 0:23:37.720
<v Speaker 2>particular column, it might only be getting a subset of

0:23:37.760 --> 0:23:40.720
<v Speaker 2>those tell us about that, right, right? Well, any well,

0:23:40.760 --> 0:23:41.960
<v Speaker 2>I'm not shore going to be a subset.

0:23:42.119 --> 0:23:44.239
<v Speaker 1>But what I mean is if I if I am

0:23:44.240 --> 0:23:45.800
<v Speaker 1>a cortical column that happens to be sitting in the

0:23:45.880 --> 0:23:48.200
<v Speaker 1>visual cortex, that I happen to be getting visual information,

0:23:48.320 --> 0:23:50.600
<v Speaker 1>but I'm not getting auditorial So.

0:23:50.560 --> 0:23:52.720
<v Speaker 2>There's a real One of the first things we had

0:23:52.720 --> 0:23:55.040
<v Speaker 2>to address with this series is why does the world

0:23:55.080 --> 0:23:59.280
<v Speaker 2>appear unified? Right? I don't feel like you know, I

0:23:59.480 --> 0:24:01.760
<v Speaker 2>I don't feel like, oh, I'm touching something with my

0:24:01.880 --> 0:24:03.680
<v Speaker 2>hands and I'm looking at something else in my eyes.

0:24:03.720 --> 0:24:05.639
<v Speaker 2>It's all one thing. There's this cup, right, and I

0:24:05.640 --> 0:24:07.439
<v Speaker 2>feel the warmth of it, and I know it. I mean,

0:24:07.440 --> 0:24:09.239
<v Speaker 2>it's one thing. It's and yet we have all these

0:24:09.280 --> 0:24:11.639
<v Speaker 2>different models. So it turns out you have models of

0:24:11.640 --> 0:24:14.560
<v Speaker 2>cups and that are taxile models. They're based on how

0:24:14.800 --> 0:24:16.760
<v Speaker 2>how it feels. You have models of how it looks.

0:24:17.520 --> 0:24:19.199
<v Speaker 2>You might even have a model how it sounds like

0:24:19.240 --> 0:24:21.800
<v Speaker 2>this particular ceramic cup. I have an expectation what it

0:24:21.920 --> 0:24:23.760
<v Speaker 2>sound like. I put on this account here my different

0:24:24.400 --> 0:24:29.080
<v Speaker 2>ceramic counter. And yet these models are they're all independent,

0:24:29.119 --> 0:24:32.120
<v Speaker 2>but they're not completely in pandit so there's these long

0:24:32.200 --> 0:24:34.960
<v Speaker 2>range connections in the cortex. They go from all different

0:24:34.960 --> 0:24:36.159
<v Speaker 2>sides to the left side of the brain and the right

0:24:36.200 --> 0:24:37.960
<v Speaker 2>side of the brain and all over the place. There's lots

0:24:38.000 --> 0:24:41.119
<v Speaker 2>of different types. What they're essentially doing is they're voting.

0:24:41.600 --> 0:24:44.200
<v Speaker 2>They're all saying, like one my finger says, I think

0:24:44.200 --> 0:24:46.240
<v Speaker 2>I'm touching something that feels like a cup, and I

0:24:46.280 --> 0:24:48.360
<v Speaker 2>may not be certain. Another thing, I have something too

0:24:48.440 --> 0:24:50.320
<v Speaker 2>that's I'm not really certain, and the nice thing and

0:24:50.359 --> 0:24:52.399
<v Speaker 2>they very quickly are reaching the set. The only thing

0:24:52.400 --> 0:24:55.119
<v Speaker 2>that makes sense for all our input is we're all

0:24:55.160 --> 0:24:57.720
<v Speaker 2>looking at the same object. And so there's like across

0:24:57.840 --> 0:25:02.399
<v Speaker 2>these long range connections, it'll into a percept that's what

0:25:02.440 --> 0:25:05.960
<v Speaker 2>you perceive. You don't actually normally perceive the individual sensations

0:25:05.960 --> 0:25:07.680
<v Speaker 2>from your eye or your fingers. You just say, I'm

0:25:07.680 --> 0:25:11.159
<v Speaker 2>holding this cup in my hand, and it's one percept.

0:25:11.280 --> 0:25:13.800
<v Speaker 2>And so it's these long raine connections and how these

0:25:13.840 --> 0:25:16.960
<v Speaker 2>columns vote all the time. This is why I can

0:25:17.000 --> 0:25:21.399
<v Speaker 2>flash an image in front of your eye and say, okay,

0:25:21.440 --> 0:25:23.800
<v Speaker 2>well each column is looking at part of that image.

0:25:23.960 --> 0:25:27.040
<v Speaker 2>Who decides what the whole image right? And by the way,

0:25:27.080 --> 0:25:28.600
<v Speaker 2>I don't even have time to move my eyes. Once

0:25:28.600 --> 0:25:30.399
<v Speaker 2>I've run into objects, I don't have to move my

0:25:30.440 --> 0:25:32.760
<v Speaker 2>eyes to recognize them. Man, what we call a flash inference.

0:25:33.800 --> 0:25:35.600
<v Speaker 2>The reason is because each part of the court to

0:25:35.720 --> 0:25:38.719
<v Speaker 2>visual cortext has a hypothesis about what it might be seeing,

0:25:39.000 --> 0:25:40.920
<v Speaker 2>and they vote, and the only thing that makes sense

0:25:40.960 --> 0:25:43.520
<v Speaker 2>is the final thing they agree upon. So I have

0:25:43.600 --> 0:25:46.000
<v Speaker 2>to learn by moving my eyes, by tending to different

0:25:46.000 --> 0:25:48.800
<v Speaker 2>things and my fingers. But I don't always have to

0:25:48.840 --> 0:25:51.359
<v Speaker 2>infer or recognize things by movement. I don't always have to.

0:25:51.480 --> 0:25:53.320
<v Speaker 2>I can just flash an image in front of you

0:25:53.520 --> 0:25:54.800
<v Speaker 2>and you say, I know what that is, and you

0:25:54.840 --> 0:25:57.119
<v Speaker 2>don't have time to move rise. This folded a lot

0:25:57.119 --> 0:25:59.199
<v Speaker 2>of vision researchers for many years because they assume that

0:25:59.320 --> 0:26:02.480
<v Speaker 2>the movement was necessary because I can flash an image

0:26:02.480 --> 0:26:04.520
<v Speaker 2>in front of you. But you can't learn that way.

0:26:04.720 --> 0:26:08.160
<v Speaker 2>You have to learn by attending to different things, quite right,

0:26:08.600 --> 0:26:10.480
<v Speaker 2>Just so it's clear to the audience.

0:26:10.520 --> 0:26:13.840
<v Speaker 1>So this issue about voting, it's not that they're all

0:26:13.840 --> 0:26:16.920
<v Speaker 1>submitting their votes to some central agency. It's that they're

0:26:16.920 --> 0:26:22.120
<v Speaker 1>all talking with one another simultaneously, simultaneously. And something about

0:26:22.119 --> 0:26:25.040
<v Speaker 1>the spike patterns holds into shape.

0:26:24.760 --> 0:26:27.199
<v Speaker 2>Right, right, well, we know exactly how this occurs. We

0:26:27.240 --> 0:26:30.960
<v Speaker 2>have models of it, and we've simulated and matches of neuroscience.

0:26:31.840 --> 0:26:33.320
<v Speaker 2>It's a little it takes a little while for people

0:26:33.359 --> 0:26:34.560
<v Speaker 2>to get the sense of it. You're right, there's no

0:26:34.720 --> 0:26:38.919
<v Speaker 2>central voting tally. It's like it's and I don't have

0:26:39.040 --> 0:26:40.399
<v Speaker 2>all the commns, don't have to talk to all the

0:26:40.400 --> 0:26:42.040
<v Speaker 2>other comms. It turns out they only have to stalk

0:26:42.040 --> 0:26:43.800
<v Speaker 2>to a few other commns as long as everyone talks

0:26:43.840 --> 0:26:45.800
<v Speaker 2>to somebody and the whole thing is connected, so they

0:26:45.800 --> 0:26:49.760
<v Speaker 2>don't have to like in Zilian connections. But it's it's

0:26:49.800 --> 0:26:53.199
<v Speaker 2>more like you have a neuro you know, magic neurons

0:26:53.200 --> 0:26:57.520
<v Speaker 2>are spiking, and in I have I have five thousand

0:26:57.520 --> 0:27:00.639
<v Speaker 2>neurons that representing what I'm seeing. That's not that man actually,

0:27:00.640 --> 0:27:03.400
<v Speaker 2>so five thousand neurons and in the brain. We're getting

0:27:03.400 --> 0:27:07.159
<v Speaker 2>a little technical here. Activations are typically sparse, meaning of

0:27:07.200 --> 0:27:10.919
<v Speaker 2>those five thousand cells, maybe only two percent or one

0:27:10.960 --> 0:27:12.960
<v Speaker 2>hundred are active at any point in time. The others

0:27:12.960 --> 0:27:18.600
<v Speaker 2>are silent. So I'm representing something by saying there's one

0:27:18.640 --> 0:27:21.359
<v Speaker 2>hundred neurons active out of five thousand. Now, if I

0:27:21.600 --> 0:27:25.560
<v Speaker 2>wasn't certain, I might say, oh, well, let's do this.

0:27:25.960 --> 0:27:27.560
<v Speaker 2>I'm going to say it could be object day, it

0:27:27.560 --> 0:27:29.239
<v Speaker 2>could be object being, could be object C. And I'm

0:27:29.240 --> 0:27:31.000
<v Speaker 2>gonna activate them all the same time. So now I

0:27:31.040 --> 0:27:34.959
<v Speaker 2>have three hundred neurons out of five hundred they're simultaneously active.

0:27:35.280 --> 0:27:39.080
<v Speaker 2>Now that might seem confusing, but it isn't. No trouble

0:27:39.119 --> 0:27:42.159
<v Speaker 2>is this and everybody's doing the same thing. They're all

0:27:42.280 --> 0:27:46.160
<v Speaker 2>doing multiple hy positives, and it very quickly says you're

0:27:46.200 --> 0:27:49.600
<v Speaker 2>supporting this positive and you're supporting this po. It happens simultaneously.

0:27:49.680 --> 0:27:52.000
<v Speaker 2>No one has to go through so early. There's no

0:27:52.160 --> 0:27:54.199
<v Speaker 2>like counting the vote. So let's try this like positive

0:27:54.200 --> 0:27:56.440
<v Speaker 2>in this it all settles very very quickly.

0:27:57.840 --> 0:28:00.199
<v Speaker 1>It's kind of cool thing if you thought about what

0:28:00.359 --> 0:28:05.159
<v Speaker 1>happens when you settle on a hypothesis and then you switch.

0:28:05.200 --> 0:28:07.560
<v Speaker 1>For example, looking at the Neckar cube, this cube made

0:28:07.560 --> 0:28:08.760
<v Speaker 1>out of it, yes, twelve lines.

0:28:09.040 --> 0:28:09.480
<v Speaker 2>What you know?

0:28:09.680 --> 0:28:11.240
<v Speaker 1>You see it one way, then you see it the

0:28:11.280 --> 0:28:14.239
<v Speaker 1>other way? What is it that allows it to switch? All?

0:28:14.320 --> 0:28:17.520
<v Speaker 2>Right? Now, a Necker cube is a two dimensional image, right,

0:28:17.560 --> 0:28:21.440
<v Speaker 2>it's a two dimensional image of a three dimensional wireframe

0:28:21.520 --> 0:28:24.679
<v Speaker 2>cube or something like that. Right, And so it's not

0:28:24.680 --> 0:28:28.480
<v Speaker 2>three dimensional. It's really two dimensional, but your brain wants

0:28:28.480 --> 0:28:31.280
<v Speaker 2>to make it three dimensional, right because it doesn't know

0:28:31.359 --> 0:28:34.680
<v Speaker 2>two dimensional things that look like that, And so everything

0:28:34.720 --> 0:28:36.880
<v Speaker 2>we try to do fits into our models, right, Right.

0:28:36.960 --> 0:28:38.680
<v Speaker 2>We don't say, oh, that's a two dimensional image. It

0:28:38.720 --> 0:28:40.200
<v Speaker 2>can't be a cube. No, you says, oh, no, that's

0:28:40.200 --> 0:28:41.640
<v Speaker 2>got to be a cube, because I know cubes. I

0:28:41.640 --> 0:28:43.719
<v Speaker 2>don't think it looks like that's not a cube. So

0:28:43.840 --> 0:28:46.440
<v Speaker 2>it wants to settle on a hypothesis. Is like, okay,

0:28:46.480 --> 0:28:48.400
<v Speaker 2>well this corner is in front of that corner, and

0:28:48.400 --> 0:28:49.920
<v Speaker 2>this corner is behind that corner, the corner to the

0:28:50.000 --> 0:28:51.760
<v Speaker 2>left of that corner. It just has to do that

0:28:51.800 --> 0:28:53.800
<v Speaker 2>to fit its models. That's right.

0:28:53.960 --> 0:28:56.600
<v Speaker 1>But why doesn't it land on a hypothesis and stick there.

0:28:56.680 --> 0:28:59.960
<v Speaker 2>Well, I don't really know, but there's other people hypothesis

0:29:00.200 --> 0:29:03.640
<v Speaker 2>about this is that the evidence goes both ways, right,

0:29:03.640 --> 0:29:07.200
<v Speaker 2>there's multiple hypositive and so neurons have a way of

0:29:07.200 --> 0:29:09.600
<v Speaker 2>getting tired about what they're doing. After a while, they say,

0:29:09.640 --> 0:29:12.480
<v Speaker 2>you know, literally, they have a way of they say,

0:29:12.480 --> 0:29:14.600
<v Speaker 2>you know, I'm not going to keep finding on this forever.

0:29:14.720 --> 0:29:16.640
<v Speaker 2>You know, things are changing the world. We don't just

0:29:16.680 --> 0:29:21.520
<v Speaker 2>get stuck. So there's various speculated mechanisms BEHUWD neurons, and

0:29:21.560 --> 0:29:24.560
<v Speaker 2>it's been observed. We'll sort of, you know, say okay,

0:29:24.600 --> 0:29:25.960
<v Speaker 2>I'll be active a little while and then I'm going

0:29:26.040 --> 0:29:28.400
<v Speaker 2>to stop U. Lets someone else try something, right.

0:29:28.400 --> 0:29:29.920
<v Speaker 1>Yeah, it, But what it means is that the other

0:29:30.000 --> 0:29:33.160
<v Speaker 1>hypothesis has to be kept alive somewhere somehow.

0:29:33.200 --> 0:29:35.720
<v Speaker 2>Well it maybe not. Maybe just like I have this

0:29:35.840 --> 0:29:38.560
<v Speaker 2>hypothesis that locked in on it, and now I'm going

0:29:38.640 --> 0:29:41.760
<v Speaker 2>to say that's no longer possible. Just go back to

0:29:41.800 --> 0:29:44.760
<v Speaker 2>square one. What is possible? You know? So it's not

0:29:44.920 --> 0:29:48.360
<v Speaker 2>like I have these two images in my head conceptual

0:29:48.600 --> 0:29:51.120
<v Speaker 2>or perceptually. You don't feel that way, right, You only

0:29:51.160 --> 0:29:53.080
<v Speaker 2>one or the other? Do you lock in the one?

0:29:53.320 --> 0:29:57.160
<v Speaker 2>The other is forgotten? But then if i'd say disabled

0:29:57.200 --> 0:29:59.320
<v Speaker 2>the first hypothesis, We're not gonna allow that be anymore.

0:29:59.520 --> 0:30:02.280
<v Speaker 2>Then it's okay, what's possible. This one's possible, I'll switch

0:30:02.320 --> 0:30:05.400
<v Speaker 2>to that one. It's not like they're both active. One's

0:30:05.440 --> 0:30:07.880
<v Speaker 2>active and then it gets tired and then the other, well,

0:30:08.120 --> 0:30:08.520
<v Speaker 2>I work.

0:30:09.480 --> 0:30:11.920
<v Speaker 1>So coming back to the main thing, one part that

0:30:11.960 --> 0:30:14.560
<v Speaker 1>I want to return to is just this issue that

0:30:15.080 --> 0:30:20.080
<v Speaker 1>a particular column might only be receiving touch information. Another

0:30:20.080 --> 0:30:22.960
<v Speaker 1>call might be receiving only auditory information, and so on.

0:30:23.080 --> 0:30:26.520
<v Speaker 2>Well, they build independent models, right they. I could have

0:30:26.560 --> 0:30:28.520
<v Speaker 2>a tactle model of an object, a visual model of

0:30:28.520 --> 0:30:30.880
<v Speaker 2>an object, right, they're not the same. The visual model

0:30:30.840 --> 0:30:32.760
<v Speaker 2>of the object will have color, perhaps the tact will

0:30:32.760 --> 0:30:37.320
<v Speaker 2>have temperature and texture and things like that. So they're

0:30:37.360 --> 0:30:40.280
<v Speaker 2>different models. But because they can vote, you have a

0:30:40.320 --> 0:30:42.400
<v Speaker 2>single percept of it. Yeah, okay.

0:30:42.440 --> 0:30:44.320
<v Speaker 1>And one of the things that's important here, which of

0:30:44.320 --> 0:30:46.280
<v Speaker 1>course you have, so you and I both emphasize this

0:30:46.320 --> 0:30:47.959
<v Speaker 1>is a lot in our books, is that all we

0:30:48.000 --> 0:30:50.640
<v Speaker 1>are ever seeing is our model of the world, right,

0:30:50.800 --> 0:30:53.960
<v Speaker 1>and so we don't have any direct access to what's

0:30:54.080 --> 0:30:54.960
<v Speaker 1>actually out there.

0:30:55.000 --> 0:30:56.160
<v Speaker 2>And so the fact.

0:30:57.480 --> 0:31:00.600
<v Speaker 1>You mentioned earlier the binding problem, if you mentioned it

0:31:00.640 --> 0:31:02.440
<v Speaker 1>by name. But the binding problem is this issue that

0:31:02.480 --> 0:31:04.480
<v Speaker 1>when the coffee cup is here and it's moving, how

0:31:04.480 --> 0:31:06.160
<v Speaker 1>come the color doesn't bleed off the cup?

0:31:06.200 --> 0:31:07.960
<v Speaker 2>And how come it seems like one thing and so on.

0:31:08.160 --> 0:31:11.200
<v Speaker 2>Buying problem is a poorly defined problem. Exactly. It means

0:31:11.240 --> 0:31:12.840
<v Speaker 2>a lot of different things, a lot of different people.

0:31:12.880 --> 0:31:15.120
<v Speaker 2>So you gotta be really careful a say, oh, I

0:31:15.360 --> 0:31:18.640
<v Speaker 2>let's talk about the binding problem, I might have a

0:31:18.640 --> 0:31:20.280
<v Speaker 2>different perception of what the binding problem is.

0:31:20.640 --> 0:31:20.880
<v Speaker 1>To me.

0:31:21.120 --> 0:31:23.200
<v Speaker 2>The binding problem is the one I've already discussed, which

0:31:23.280 --> 0:31:30.200
<v Speaker 2>is you have these different sensory inputs that but somehow

0:31:30.200 --> 0:31:33.080
<v Speaker 2>they lead to a single perscept and you can switch

0:31:33.160 --> 0:31:35.560
<v Speaker 2>back and forth. It's like, I don't It's like, how

0:31:35.600 --> 0:31:38.040
<v Speaker 2>do I bring these things together? How do I say

0:31:38.120 --> 0:31:41.080
<v Speaker 2>these are all the same thing? And people used to

0:31:41.120 --> 0:31:43.040
<v Speaker 2>think in the binding problems like, oh, if I have

0:31:43.120 --> 0:31:46.160
<v Speaker 2>the auditory cortex and the visual cortext and somautter centric

0:31:46.160 --> 0:31:50.400
<v Speaker 2>cortex touch, then they must all project to someplace where

0:31:50.480 --> 0:31:54.200
<v Speaker 2>they are binding together into a single model. And we

0:31:54.360 --> 0:31:57.440
<v Speaker 2>flip that on its head. They don't bind. They bind

0:31:57.480 --> 0:32:00.560
<v Speaker 2>together through just long range connections. But there's no place

0:32:00.600 --> 0:32:03.160
<v Speaker 2>I have to do that. There's no nobody's sitting on

0:32:03.200 --> 0:32:05.840
<v Speaker 2>top of it and saying, hey, what's your vote? What's

0:32:05.840 --> 0:32:08.200
<v Speaker 2>your vote? Now? It's just like, so there, we don't

0:32:08.240 --> 0:32:11.760
<v Speaker 2>need a model that incorporates all the aspects of objects.

0:32:11.840 --> 0:32:15.680
<v Speaker 2>We have independent models that we can invoke as needed,

0:32:15.800 --> 0:32:19.520
<v Speaker 2>and they all they all vote to reach common consensus.

0:32:19.520 --> 0:32:21.960
<v Speaker 2>So I have no problems navigating, you know, doing things

0:32:22.040 --> 0:32:23.840
<v Speaker 2>in the dark. I have no troubles doing things just

0:32:23.840 --> 0:32:25.680
<v Speaker 2>by vision. I have no I can do things sometimes

0:32:25.680 --> 0:32:28.720
<v Speaker 2>of audition, you know, like I know the same things

0:32:28.720 --> 0:32:30.800
<v Speaker 2>are going on. I have the same model in the world, right.

0:32:31.000 --> 0:32:33.120
<v Speaker 2>You know, if I if I'm walking at night between

0:32:33.120 --> 0:32:36.239
<v Speaker 2>my bed and the bathroom and it's pitch black, I

0:32:36.280 --> 0:32:38.400
<v Speaker 2>still have the same model in the house. I still

0:32:38.440 --> 0:32:40.160
<v Speaker 2>know where the door is going to be and everything else.

0:32:40.240 --> 0:32:42.560
<v Speaker 2>You know, if I can do with touch for the vision. Right,

0:32:43.200 --> 0:32:45.440
<v Speaker 2>So there isn't a central model that says, here's a

0:32:45.480 --> 0:32:47.240
<v Speaker 2>model of my house of touch and vision hearing. It

0:32:47.280 --> 0:32:49.320
<v Speaker 2>says all these independent models. Right.

0:32:49.680 --> 0:32:52.280
<v Speaker 1>And now the reason you called your book a thousand brains,

0:32:52.360 --> 0:32:53.200
<v Speaker 1>we call this ypods.

0:32:53.240 --> 0:32:56.400
<v Speaker 2>It's one thousand brains theary theory, right, is.

0:32:57.960 --> 0:33:01.120
<v Speaker 1>Precisely because you've got all these cortical calls and they're

0:33:01.280 --> 0:33:03.360
<v Speaker 1>each making a little model of the world, and they're

0:33:03.360 --> 0:33:06.000
<v Speaker 1>all talking to one another. Right, So you know, hi,

0:33:06.680 --> 0:33:08.760
<v Speaker 1>this is this feels like coffee cup. This looks like

0:33:08.800 --> 0:33:10.720
<v Speaker 1>coffee cup. It sounds like coffee cup when it plays down.

0:33:10.760 --> 0:33:13.360
<v Speaker 1>This is the temperature of coffee cup, and so and

0:33:13.440 --> 0:33:15.040
<v Speaker 1>so these are all talking with one another.

0:33:15.600 --> 0:33:18.240
<v Speaker 2>So so the reason they call the thousand brains. Is

0:33:18.280 --> 0:33:23.440
<v Speaker 2>that each cortical column is doing what the entire brain

0:33:23.520 --> 0:33:25.800
<v Speaker 2>is doing. Right, each quarter column is a senior model

0:33:25.840 --> 0:33:30.320
<v Speaker 2>learning system. And and when we ask where is a

0:33:30.360 --> 0:33:32.400
<v Speaker 2>model of something? We've been talking about this, where is

0:33:32.400 --> 0:33:35.360
<v Speaker 2>the model of the skull or the microphone or whatever.

0:33:36.160 --> 0:33:39.320
<v Speaker 2>So many things we know, where is that model? It's not,

0:33:39.600 --> 0:33:43.040
<v Speaker 2>it's it's in many different places. So there's a thousand

0:33:43.160 --> 0:33:46.760
<v Speaker 2>models of coffee cops, a thousand models. You don't perceive that,

0:33:47.680 --> 0:33:51.680
<v Speaker 2>but they exist. And and so it's like it's it

0:33:51.760 --> 0:33:54.600
<v Speaker 2>was really trying to capture that original idea that cortical

0:33:54.640 --> 0:33:57.080
<v Speaker 2>columns are common and that and that there's all these

0:33:57.120 --> 0:33:59.640
<v Speaker 2>different models out there that are different and they can

0:33:59.720 --> 0:34:02.560
<v Speaker 2>vote one day, they vote to reach a consensus.

0:34:02.680 --> 0:34:05.160
<v Speaker 1>Yeah, and it's certainly consistent with the idea that you know,

0:34:05.160 --> 0:34:07.560
<v Speaker 1>for example, if someone is born blind and then visual

0:34:07.640 --> 0:34:10.359
<v Speaker 1>cortex gets taken over by hearing and touches on, they

0:34:10.360 --> 0:34:13.239
<v Speaker 1>are better at hearing in touch presuming because they just

0:34:13.239 --> 0:34:14.800
<v Speaker 1>have a lot more real estates.

0:34:14.760 --> 0:34:17.040
<v Speaker 2>Voted right, right, or real estate is a lot more

0:34:17.080 --> 0:34:21.759
<v Speaker 2>practice too, right right. So it's amazing how flexible it is.

0:34:21.960 --> 0:34:26.399
<v Speaker 1>Yes, given your model of the brain. Let's talk about

0:34:26.440 --> 0:34:28.560
<v Speaker 1>AI and what you think is going on currently with

0:34:28.800 --> 0:34:31.120
<v Speaker 1>LLLMS and what that is missing.

0:34:31.400 --> 0:34:35.000
<v Speaker 2>Right, LMS flipp is interesting. Well, let me start with

0:34:35.040 --> 0:34:38.920
<v Speaker 2>the criticism AI in general. Okay, AI has always been

0:34:38.960 --> 0:34:42.279
<v Speaker 2>focused on what they call benchmarks, like how well can

0:34:42.320 --> 0:34:45.560
<v Speaker 2>you solve problems? So, how well came this system recognized images?

0:34:45.600 --> 0:34:47.360
<v Speaker 2>How well can play chess? How well can play go?

0:34:47.480 --> 0:34:49.839
<v Speaker 2>How well I can translate from one language to the other.

0:34:50.000 --> 0:34:52.280
<v Speaker 2>And you have all these benchmarks, and everyone competes against

0:34:52.280 --> 0:34:54.600
<v Speaker 2>these benchmarks that they're kind of diverse all over the place.

0:34:55.239 --> 0:34:58.680
<v Speaker 2>That's the wrong way to think about it. When we

0:34:59.120 --> 0:35:02.640
<v Speaker 2>let's use computer as an analogy. When we say something

0:35:02.640 --> 0:35:05.000
<v Speaker 2>as a computer, we don't based on what it's doing.

0:35:05.080 --> 0:35:07.960
<v Speaker 2>We based on how it works. Alan Turing and John

0:35:08.000 --> 0:35:10.680
<v Speaker 2>Fan Nordmann define what we now call a universal turning machine,

0:35:10.680 --> 0:35:13.200
<v Speaker 2>which is like, okay, if a system has memory and

0:35:13.640 --> 0:35:18.120
<v Speaker 2>a processor, and the memory has data and instructions, and

0:35:18.160 --> 0:35:20.200
<v Speaker 2>you can change the instructions and change the data, it

0:35:20.200 --> 0:35:23.120
<v Speaker 2>can do anything. And that is a computer. So I

0:35:23.160 --> 0:35:25.560
<v Speaker 2>can say my toaster is a computer, even though it's

0:35:25.560 --> 0:35:28.120
<v Speaker 2>a very limited computer, because it has one of those

0:35:28.120 --> 0:35:32.080
<v Speaker 2>things inside. Right, if it was hard coded with springs

0:35:32.080 --> 0:35:33.799
<v Speaker 2>and wires and stuff, it wouldn't be a computer. But

0:35:33.920 --> 0:35:37.080
<v Speaker 2>because it has a little microprocessor has those definitions, it's

0:35:37.120 --> 0:35:39.480
<v Speaker 2>a computer. So that's how we do it in the

0:35:39.480 --> 0:35:42.239
<v Speaker 2>computer world. We say, these are the functions that it

0:35:42.239 --> 0:35:44.080
<v Speaker 2>has to perform, and you can apply it to big problems,

0:35:44.080 --> 0:35:46.040
<v Speaker 2>little problems, different types of problems all over the place.

0:35:46.080 --> 0:35:48.520
<v Speaker 2>And AI we've been focused on this idea that oh

0:35:48.560 --> 0:35:50.719
<v Speaker 2>a benchmarks, you know, and we always want to be

0:35:50.719 --> 0:35:53.480
<v Speaker 2>beat some human Well, like a dog. Almost everyone who

0:35:53.520 --> 0:35:56.240
<v Speaker 2>has the dog says it's intelligent, right, but doesn't have language,

0:35:56.280 --> 0:35:58.000
<v Speaker 2>it doesn't play standswer, it doesn't play go. But why

0:35:58.040 --> 0:35:59.920
<v Speaker 2>do we say it's intelligent because we can tell the

0:36:00.080 --> 0:36:01.920
<v Speaker 2>that dog has an eternal model of the world. It's

0:36:01.960 --> 0:36:03.640
<v Speaker 2>kind of like my internal the model. He knows where

0:36:03.640 --> 0:36:05.160
<v Speaker 2>the door is, it knows how to get go on

0:36:05.160 --> 0:36:09.080
<v Speaker 2>the walk. And so why why focus on this issue

0:36:09.120 --> 0:36:10.919
<v Speaker 2>of like, well, it's not intelligent because it doesn't play

0:36:10.960 --> 0:36:14.680
<v Speaker 2>go better than the best human player. So I think

0:36:14.719 --> 0:36:17.080
<v Speaker 2>part of the problem was that people didn't know how

0:36:17.120 --> 0:36:19.040
<v Speaker 2>brains worked, and so if you don't, what are you

0:36:19.080 --> 0:36:21.719
<v Speaker 2>going to do? Right? We don't know it. We we

0:36:21.840 --> 0:36:23.960
<v Speaker 2>know enough to build this stuff. So I think in

0:36:24.000 --> 0:36:25.759
<v Speaker 2>the future that's what's going to be. We're going to

0:36:25.840 --> 0:36:28.440
<v Speaker 2>say AI systems don't have to be like humans. They

0:36:28.440 --> 0:36:30.120
<v Speaker 2>don't even have to do the same things humans do.

0:36:30.360 --> 0:36:32.000
<v Speaker 2>Some of them are going to be very dedicated. It's

0:36:32.120 --> 0:36:33.920
<v Speaker 2>very focused tasks, and we're going to be very broad

0:36:33.960 --> 0:36:37.120
<v Speaker 2>So might be you know, engineers building space stations, all

0:36:37.200 --> 0:36:38.880
<v Speaker 2>this huge rider, but they're all going to work on

0:36:38.920 --> 0:36:43.160
<v Speaker 2>the same principles that biology has discovered. Today's AI doesn't

0:36:43.160 --> 0:36:45.279
<v Speaker 2>work on those principles, you know, most of it. If

0:36:45.280 --> 0:36:47.800
<v Speaker 2>you talk about the large language models, these are transform

0:36:47.800 --> 0:36:51.160
<v Speaker 2>well models. We feed in a string of tokens basically

0:36:51.360 --> 0:36:54.920
<v Speaker 2>words or wordlike things, and it just learns the structure

0:36:54.960 --> 0:36:57.680
<v Speaker 2>of that string, and it's very good at what it does.

0:36:57.960 --> 0:37:00.840
<v Speaker 2>But there's no inherent knowledge of the actual the world.

0:37:01.160 --> 0:37:03.359
<v Speaker 2>It doesn't have a three dimensional models of the world.

0:37:03.360 --> 0:37:05.759
<v Speaker 2>It doesn't if someone's written about it, it'll tell you

0:37:05.800 --> 0:37:09.160
<v Speaker 2>about it, but it can't experience it itself. So you

0:37:09.200 --> 0:37:11.239
<v Speaker 2>couldn't send in one of these AI systems down the

0:37:11.280 --> 0:37:14.440
<v Speaker 2>space and say, you know, go to Mars explore and

0:37:14.600 --> 0:37:16.279
<v Speaker 2>see what's out there that we can build things with

0:37:16.400 --> 0:37:18.239
<v Speaker 2>and here's some tools and start building a structure. It

0:37:18.280 --> 0:37:20.200
<v Speaker 2>just no why they're gonna do this though it's not

0:37:20.200 --> 0:37:23.319
<v Speaker 2>gonna happen, but to contact. The tools we're working on

0:37:23.560 --> 0:37:26.279
<v Speaker 2>can do that. That's what humans do, and that's the

0:37:26.360 --> 0:37:30.080
<v Speaker 2>promise of AI. It's not just you know, targeting things

0:37:30.120 --> 0:37:32.440
<v Speaker 2>that humans can do, like high level things like you know,

0:37:32.480 --> 0:37:35.319
<v Speaker 2>translating language or writing poems or things like that. It's

0:37:35.360 --> 0:37:37.080
<v Speaker 2>really how do you build the system to understands the

0:37:37.080 --> 0:37:39.239
<v Speaker 2>world and knows how to act in that world. And

0:37:39.600 --> 0:37:54.760
<v Speaker 2>that's the key.

0:37:56.640 --> 0:37:58.040
<v Speaker 1>One of the things you wrote in your book that

0:37:58.080 --> 0:38:01.000
<v Speaker 1>I thought was great was, uh, you address this issue

0:38:01.040 --> 0:38:04.040
<v Speaker 1>of the existential threat of AI that a lot of

0:38:04.040 --> 0:38:07.040
<v Speaker 1>people are banging on about and you don't think it's

0:38:07.040 --> 0:38:07.399
<v Speaker 1>a threat.

0:38:07.480 --> 0:38:09.440
<v Speaker 2>I don't think it's a threat. I mean you have to,

0:38:09.719 --> 0:38:12.080
<v Speaker 2>you have to tease it apart because so many people

0:38:12.200 --> 0:38:14.960
<v Speaker 2>like there's different existential threats, but you know that I

0:38:15.000 --> 0:38:17.439
<v Speaker 2>one is called the alignment problem, Like all these AI

0:38:17.520 --> 0:38:19.680
<v Speaker 2>agents are gonna you know, you're gonna tell what to do,

0:38:19.719 --> 0:38:22.799
<v Speaker 2>but it won't be aligned with our values. And I'm

0:38:22.840 --> 0:38:26.600
<v Speaker 2>just saying they don't have any values and it's just

0:38:26.640 --> 0:38:28.960
<v Speaker 2>so far from reality. I just if once you understand

0:38:28.960 --> 0:38:30.840
<v Speaker 2>how brains work, you said, like I'm doing any of

0:38:30.880 --> 0:38:33.560
<v Speaker 2>that stuff. It's it's hard to it's hard for me

0:38:33.600 --> 0:38:36.160
<v Speaker 2>to give a succinct answer to this. But I don't

0:38:36.239 --> 0:38:39.560
<v Speaker 2>think that today's AI systems have any of these problems.

0:38:40.239 --> 0:38:42.399
<v Speaker 2>They're not gonna run away. They're not They're not gonna

0:38:42.400 --> 0:38:45.480
<v Speaker 2>have their own desires. They're not gonna say, hey, I'm awake,

0:38:45.600 --> 0:38:47.799
<v Speaker 2>I need to survive, you know.

0:38:48.880 --> 0:38:52.080
<v Speaker 1>Because because these current large language models are just statistical

0:38:52.120 --> 0:38:55.720
<v Speaker 1>parrots that are taking much language and spinning language back out.

0:38:55.880 --> 0:38:57.960
<v Speaker 2>Right, and you can apply they'll prime the robotics and

0:38:58.000 --> 0:39:02.960
<v Speaker 2>other things, but they're gonna be still so statistical parents exactly.

0:39:03.160 --> 0:39:05.719
<v Speaker 2>But and and by the way, they lack the human brain.

0:39:05.760 --> 0:39:07.640
<v Speaker 2>We talked earlier, the new or corts is the biggest

0:39:07.640 --> 0:39:08.640
<v Speaker 2>part of the brain, but we have a lot of

0:39:08.680 --> 0:39:11.920
<v Speaker 2>other parts of the brain, and our emotional centers and

0:39:12.320 --> 0:39:14.480
<v Speaker 2>how much of what makes us humans, our drives and

0:39:14.520 --> 0:39:18.279
<v Speaker 2>motivations are mostly not the near cortex. Right. There are

0:39:18.400 --> 0:39:21.560
<v Speaker 2>these other things. And if you provided an AI system

0:39:21.600 --> 0:39:24.000
<v Speaker 2>with those other things, I might start worrying about it.

0:39:24.080 --> 0:39:27.879
<v Speaker 2>But if you're just trying to model stuff, it's it's

0:39:27.920 --> 0:39:32.640
<v Speaker 2>no threat. It's we just assume that some AI system,

0:39:32.680 --> 0:39:35.399
<v Speaker 2>because it can spew back language, is going to think

0:39:35.480 --> 0:39:37.719
<v Speaker 2>like us and be like us and have our same motivations.

0:39:37.760 --> 0:39:38.640
<v Speaker 2>Nothing like it at all.

0:39:38.760 --> 0:39:41.680
<v Speaker 1>So tell us about the thousand brains projects and how

0:39:41.719 --> 0:39:42.640
<v Speaker 1>you're gonna make this happen.

0:39:42.760 --> 0:39:45.680
<v Speaker 2>Right, So, we kind of been working on this theory

0:39:45.719 --> 0:39:48.560
<v Speaker 2>for decades really, and maybe five or six years ago

0:39:48.600 --> 0:39:50.560
<v Speaker 2>we really had some breakthroughs and sort of all came

0:39:50.600 --> 0:39:53.400
<v Speaker 2>together and then we said, well, I always thought that

0:39:53.440 --> 0:39:55.280
<v Speaker 2>this is the way we're going to build tuly intelligent machines.

0:39:55.320 --> 0:39:57.040
<v Speaker 2>And this is at the same time as deep learning

0:39:57.080 --> 0:39:59.480
<v Speaker 2>and transformers are taken off and all this excitement about it.

0:40:00.440 --> 0:40:03.120
<v Speaker 2>But that didn't distract us. We said, Okay, let's see

0:40:03.120 --> 0:40:04.680
<v Speaker 2>if we can start building this stuff. So we for

0:40:04.680 --> 0:40:06.719
<v Speaker 2>a couple of years we had a small team that

0:40:06.880 --> 0:40:10.399
<v Speaker 2>was trying to implement the thousand brains theory, modeling quarter

0:40:10.440 --> 0:40:13.600
<v Speaker 2>u cooms, the voting, all this stuff, multisensories things, all

0:40:13.640 --> 0:40:17.080
<v Speaker 2>this stuff, were modeling it, and we decided earlier this

0:40:17.160 --> 0:40:19.160
<v Speaker 2>year that the best way to go forward was to

0:40:19.200 --> 0:40:22.200
<v Speaker 2>do this in an open source project. We haven't actually

0:40:22.239 --> 0:40:24.759
<v Speaker 2>told people we're doing this before. So we've created a

0:40:24.800 --> 0:40:29.160
<v Speaker 2>thousand Brains project. We're taking all of our our code

0:40:29.160 --> 0:40:31.680
<v Speaker 2>and putting an open source We are taking the patterns.

0:40:31.680 --> 0:40:32.920
<v Speaker 2>We have a lot of patents. We're going to make

0:40:33.040 --> 0:40:34.759
<v Speaker 2>a non a short clause. We've done that. I'm not

0:40:34.800 --> 0:40:37.680
<v Speaker 2>a short close on our patents. We have. We hire

0:40:37.680 --> 0:40:41.440
<v Speaker 2>a team of people to like open source project manager

0:40:41.480 --> 0:40:43.839
<v Speaker 2>for the outside people. We've already got quite a few

0:40:43.840 --> 0:40:48.239
<v Speaker 2>people interested. We already have received some funding from the

0:40:48.239 --> 0:40:50.600
<v Speaker 2>Gates Foundation for this significant funding and help fund the

0:40:50.600 --> 0:40:52.440
<v Speaker 2>project for a couple of years. We have there's a

0:40:52.480 --> 0:40:55.320
<v Speaker 2>guy named John hen at Connegie Mellen University who's building

0:40:55.320 --> 0:40:58.240
<v Speaker 2>silicon to implement quartical columns. So there's the people around

0:40:58.280 --> 0:41:00.600
<v Speaker 2>the world who've been excited about our work and following

0:41:00.640 --> 0:41:03.160
<v Speaker 2>it and want to join in on this day. So

0:41:03.160 --> 0:41:05.840
<v Speaker 2>we figured let's get them all together, let's build a

0:41:05.880 --> 0:41:10.360
<v Speaker 2>framework open source project. And so we built out this team.

0:41:10.840 --> 0:41:14.239
<v Speaker 2>We have just been run by the women Vision Clay,

0:41:14.239 --> 0:41:18.360
<v Speaker 2>who's just brilliant, and technical side is by Dunam Neils

0:41:19.280 --> 0:41:22.600
<v Speaker 2>and and so we're just starting this, you know. So

0:41:22.840 --> 0:41:25.560
<v Speaker 2>we actually haven't officially and we've talked about it, but

0:41:25.560 --> 0:41:28.279
<v Speaker 2>we haven't officially launched yet because not everything is open

0:41:28.360 --> 0:41:29.840
<v Speaker 2>yet and we have to there's a lot of stuff

0:41:29.840 --> 0:41:31.160
<v Speaker 2>you have to do to put in to get those

0:41:31.200 --> 0:41:33.920
<v Speaker 2>whole do work. But we're going full boring this and

0:41:33.960 --> 0:41:38.480
<v Speaker 2>I think my hope is that anyone who's excited about

0:41:38.520 --> 0:41:40.879
<v Speaker 2>the work, and there's quite a few people can help

0:41:40.960 --> 0:41:43.600
<v Speaker 2>join us and work on this and propel it forward

0:41:43.680 --> 0:41:47.120
<v Speaker 2>and really created what I not only just an alternate

0:41:47.160 --> 0:41:49.880
<v Speaker 2>form of AI centory motor AI based on brain principles,

0:41:50.000 --> 0:41:52.120
<v Speaker 2>but I think what's going to be actually the ultimate,

0:41:52.560 --> 0:41:57.360
<v Speaker 2>uh primary source of AI, which is brain modeling a

0:41:57.400 --> 0:41:59.759
<v Speaker 2>thousand brains projects. This is amazing. So how do people

0:41:59.800 --> 0:42:00.680
<v Speaker 2>get involved in this?

0:42:01.120 --> 0:42:01.239
<v Speaker 1>Uh?

0:42:01.520 --> 0:42:04.480
<v Speaker 2>You can just go to our website dementa and dot

0:42:04.520 --> 0:42:07.040
<v Speaker 2>com and you know, there's a lot of information already,

0:42:07.239 --> 0:42:09.480
<v Speaker 2>a tons of information. It's like we have all the

0:42:09.480 --> 0:42:14.160
<v Speaker 2>stuff with accumulated documentation, code, videos, were all that up there,

0:42:14.239 --> 0:42:16.759
<v Speaker 2>plus tutorials and so on. So you just you can

0:42:16.880 --> 0:42:19.319
<v Speaker 2>just you can you go to our nomenta dot com.

0:42:19.320 --> 0:42:21.520
<v Speaker 2>It'll be obvious how to sign up to be informed

0:42:21.520 --> 0:42:23.080
<v Speaker 2>of what's going on or how to get involved.

0:42:23.080 --> 0:42:25.759
<v Speaker 1>Great, So some listener to this podcast is I want

0:42:25.760 --> 0:42:27.640
<v Speaker 1>to get involved and understand more about this thing. Go

0:42:27.640 --> 0:42:29.440
<v Speaker 1>to Nementa dot com and they can.

0:42:29.760 --> 0:42:31.920
<v Speaker 2>They can. First of all, they'll start, they'll sign up

0:42:31.960 --> 0:42:34.839
<v Speaker 2>to getting notified about things are happening. They can get

0:42:34.960 --> 0:42:38.720
<v Speaker 2>educated on the whole project. They can I don't think

0:42:38.880 --> 0:42:42.239
<v Speaker 2>right yet they can contribute code yet, but that will

0:42:42.280 --> 0:42:44.839
<v Speaker 2>happen within a month or so. It'll be obvious how

0:42:44.880 --> 0:42:47.200
<v Speaker 2>to get started. There's there's a lot of information to learn.

0:42:47.320 --> 0:42:49.440
<v Speaker 2>I would think if you haven't. If you haven't, you

0:42:49.520 --> 0:42:51.479
<v Speaker 2>might want to start with just by reading the book

0:42:51.480 --> 0:42:53.719
<v Speaker 2>to the Thousand Brains, because it gives you that not

0:42:53.800 --> 0:42:56.080
<v Speaker 2>only the basics of the theory, but it also gives

0:42:56.080 --> 0:42:57.920
<v Speaker 2>you the vision about how this is going to play

0:42:57.920 --> 0:42:58.399
<v Speaker 2>out over time.

0:42:58.600 --> 0:43:00.840
<v Speaker 1>And so the idea is just so I'm straight on this.

0:43:00.920 --> 0:43:04.280
<v Speaker 1>So the idea is a person can download the code

0:43:04.320 --> 0:43:05.760
<v Speaker 1>and run this model.

0:43:06.480 --> 0:43:10.279
<v Speaker 2>Right the first we're making said that you can do that.

0:43:10.320 --> 0:43:13.920
<v Speaker 2>You can run our current experiments, you can recreate them,

0:43:13.960 --> 0:43:16.080
<v Speaker 2>you can apply them different ways. Great.

0:43:16.560 --> 0:43:18.520
<v Speaker 1>So something that you and I have in common that

0:43:18.560 --> 0:43:21.760
<v Speaker 1>we are obsessed about is this idea that we're living

0:43:22.120 --> 0:43:25.359
<v Speaker 1>inside our own internal models. This is all a construction.

0:43:26.000 --> 0:43:27.880
<v Speaker 1>And you had a line in the book that I loved,

0:43:27.920 --> 0:43:32.680
<v Speaker 1>which is that if you had different sensors for picking

0:43:32.760 --> 0:43:35.600
<v Speaker 1>up different information in the world, we would have a

0:43:35.680 --> 0:43:39.880
<v Speaker 1>different perceptual experience, a completely different experience of the universe.

0:43:40.080 --> 0:43:45.880
<v Speaker 2>Well maybe completely, not completely, Like a blind person is

0:43:46.000 --> 0:43:48.399
<v Speaker 2>we're learning the worth of touch, and a person who

0:43:49.480 --> 0:43:53.880
<v Speaker 2>is deaferent, a person maybe who has sensory problems on

0:43:54.000 --> 0:43:56.000
<v Speaker 2>his hand, they will end up with a similar structure.

0:43:56.239 --> 0:43:58.440
<v Speaker 1>Sorry, but what I mean is not in terms of

0:43:58.520 --> 0:44:00.400
<v Speaker 1>how we pick up on the visible light ratee. But

0:44:00.719 --> 0:44:02.680
<v Speaker 1>I pick up on infrared and you pick up on

0:44:02.840 --> 0:44:03.680
<v Speaker 1>radio waves.

0:44:03.680 --> 0:44:06.040
<v Speaker 2>Okay, right, you might if you if you really did that,

0:44:06.280 --> 0:44:07.919
<v Speaker 2>then you would have a different view of the world,

0:44:07.960 --> 0:44:10.840
<v Speaker 2>like it's often you know, like it take the issue

0:44:10.840 --> 0:44:13.399
<v Speaker 2>of color. It's often said that bees, you know, seeing

0:44:13.400 --> 0:44:16.160
<v Speaker 2>the ultra violet and we don't. So what looks like

0:44:16.280 --> 0:44:18.400
<v Speaker 2>toss is a white flower to them? Is this beautifully

0:44:18.400 --> 0:44:20.120
<v Speaker 2>colorful variegated flower.

0:44:20.920 --> 0:44:22.799
<v Speaker 1>But let's say you saw a totally different part of

0:44:22.800 --> 0:44:26.560
<v Speaker 1>the electromagic spectrum and so you see in the microwave range.

0:44:27.000 --> 0:44:29.319
<v Speaker 1>Question is would would we have color at all?

0:44:29.480 --> 0:44:32.360
<v Speaker 2>I don't know, It's hard to say, right, there's a

0:44:32.480 --> 0:44:37.399
<v Speaker 2>there's an underlying really interesting philosophical problem called qualitia, which

0:44:37.440 --> 0:44:40.840
<v Speaker 2>is like, why does color feel like color? Right? And

0:44:40.880 --> 0:44:45.279
<v Speaker 2>it doesn't feel like sounds or tactile sensations. And it's

0:44:45.320 --> 0:44:48.279
<v Speaker 2>an interesting challenge to understand that. I've written about it

0:44:48.320 --> 0:44:48.560
<v Speaker 2>a bit.

0:44:49.440 --> 0:44:51.680
<v Speaker 1>Yeah, do you have a hypothesis about this, I'll tell

0:44:51.680 --> 0:44:54.279
<v Speaker 1>you what mine is, But it is it's always sort

0:44:54.320 --> 0:44:56.080
<v Speaker 1>of half one, which is I think it's about the

0:44:56.320 --> 0:44:59.840
<v Speaker 1>structure of the data coming in defines the quality.

0:45:00.080 --> 0:45:02.239
<v Speaker 2>I don't know why or how that's true.

0:45:01.960 --> 0:45:04.359
<v Speaker 1>But you know, with the eyes, you've got two two

0:45:04.440 --> 0:45:08.440
<v Speaker 1>dimensional sheets of data coming in, and so vision feels

0:45:08.680 --> 0:45:12.120
<v Speaker 1>like something with hearing, it's a one dimensional signals just

0:45:12.120 --> 0:45:14.160
<v Speaker 1>going up and down and vibringing your ear drum. That

0:45:14.360 --> 0:45:17.080
<v Speaker 1>feels like something. You don't confuse vision with hearing. That's

0:45:17.280 --> 0:45:20.399
<v Speaker 1>like completely different worlds to you. My interest has been

0:45:20.440 --> 0:45:23.960
<v Speaker 1>in what happens when we feed news structures. We've done

0:45:23.960 --> 0:45:25.160
<v Speaker 1>a lot of interesting stuff in this area.

0:45:25.280 --> 0:45:28.640
<v Speaker 2>Exactly would you have a completely new quality? Is it possible? So?

0:45:28.840 --> 0:45:32.080
<v Speaker 2>I mean, certainly you can imagine. First of all, I

0:45:32.120 --> 0:45:36.719
<v Speaker 2>agree with you again, it's all spikes, right, So there's

0:45:36.719 --> 0:45:41.319
<v Speaker 2>nothing there's no color spikes, there's no heat, spikes. It's

0:45:41.360 --> 0:45:45.719
<v Speaker 2>just spikes. And so obviously the different quality it has

0:45:45.719 --> 0:45:48.279
<v Speaker 2>to come about somehow from the structure of the data

0:45:48.440 --> 0:45:52.359
<v Speaker 2>spatially and temporarily, and also sensory motory, you know, it's

0:45:52.360 --> 0:45:54.640
<v Speaker 2>like how things change as you move, and I think

0:45:54.640 --> 0:45:56.200
<v Speaker 2>that's a big part of it. So I agree with

0:45:56.280 --> 0:45:58.600
<v Speaker 2>on a fundamental level that it has to be some

0:45:58.840 --> 0:46:04.120
<v Speaker 2>in the data and it's nothing else. And we can

0:46:04.120 --> 0:46:06.560
<v Speaker 2>then ask ourselves something like, well, imagine you've been blind

0:46:06.680 --> 0:46:08.880
<v Speaker 2>your whole life. You don't have a sense of color.

0:46:09.400 --> 0:46:11.680
<v Speaker 2>You've never experienced color, and so to you would be

0:46:11.800 --> 0:46:14.120
<v Speaker 2>kind of mysterious things. Someone can say, well, can't you

0:46:14.160 --> 0:46:16.920
<v Speaker 2>tell that's that's you know, that's this type of orange

0:46:16.920 --> 0:46:19.600
<v Speaker 2>and that time? What are you talking about? Right? They'd

0:46:19.600 --> 0:46:21.480
<v Speaker 2>have to accept that you have some super sense and

0:46:21.520 --> 0:46:24.120
<v Speaker 2>the world looks different to you because you have vision

0:46:24.200 --> 0:46:26.479
<v Speaker 2>and I don't. And they may be able to touch

0:46:26.560 --> 0:46:28.640
<v Speaker 2>things that you know, sense things that I don't. Sign,

0:46:28.680 --> 0:46:30.000
<v Speaker 2>So I could be able to try to read braille

0:46:30.160 --> 0:46:32.160
<v Speaker 2>if you're not a braille reader, that feels like what

0:46:32.440 --> 0:46:35.759
<v Speaker 2>the stuff? It's a blur? Right? So they oh, no,

0:46:35.840 --> 0:46:38.359
<v Speaker 2>I feel everything there, right, So we can we can

0:46:38.400 --> 0:46:40.400
<v Speaker 2>just ask ourselves a questions like what's the world like

0:46:40.440 --> 0:46:43.360
<v Speaker 2>to different people, and sometimes we'll end up with a

0:46:43.400 --> 0:46:45.719
<v Speaker 2>similar model, like yeah, well you and I would have

0:46:45.760 --> 0:46:48.200
<v Speaker 2>no matter what censors you have, we'd have the model

0:46:48.200 --> 0:46:51.879
<v Speaker 2>of physical structure of a coffee cup. But other times

0:46:51.920 --> 0:46:54.080
<v Speaker 2>it could be quite different, you know, and certain if

0:46:54.080 --> 0:46:58.480
<v Speaker 2>you start like sensing parts of the radio spectrum or

0:46:59.400 --> 0:47:01.480
<v Speaker 2>other things, just be you know. One of the things

0:47:01.480 --> 0:47:03.200
<v Speaker 2>I always wondered, like, what would be like if you

0:47:03.200 --> 0:47:06.680
<v Speaker 2>had if you had smell sensors stand under your fingers, right,

0:47:07.160 --> 0:47:10.680
<v Speaker 2>and then everything you touch. Well, we kind of we have.

0:47:10.800 --> 0:47:13.560
<v Speaker 2>We have temperature sensors, and we have tapping with all

0:47:13.640 --> 0:47:15.480
<v Speaker 2>kinds of But what I could smell like you could

0:47:15.480 --> 0:47:18.640
<v Speaker 2>tell chemicals that were on the surface of objects. This

0:47:18.719 --> 0:47:21.120
<v Speaker 2>is what dogs do. You know. Dogs they don't just smell.

0:47:21.360 --> 0:47:23.319
<v Speaker 2>They stick their nose right on the thing and they smell.

0:47:23.400 --> 0:47:25.720
<v Speaker 2>They moved to the next bot it smell. Dogs build

0:47:25.760 --> 0:47:28.920
<v Speaker 2>this freedom, actual structure of smells. We don't have that

0:47:28.960 --> 0:47:30.919
<v Speaker 2>smell for us. It's kind of like wasting in from

0:47:30.920 --> 0:47:33.920
<v Speaker 2>some direction, right. Dogs have this incredible model of the

0:47:33.920 --> 0:47:36.799
<v Speaker 2>world smell mod and it's hard to imagine what it is.

0:47:36.840 --> 0:47:39.160
<v Speaker 2>But I'm sure they have it, so I think it's

0:47:39.239 --> 0:47:42.600
<v Speaker 2>fun to think about these things. I don't you know,

0:47:42.640 --> 0:47:44.719
<v Speaker 2>in the future will build machines that perceive the world

0:47:44.719 --> 0:47:46.120
<v Speaker 2>different than we do. But that'll be great.

0:47:46.640 --> 0:47:50.920
<v Speaker 1>Yeah, Okay, Jeff, this has been wonderful.

0:47:50.960 --> 0:47:53.560
<v Speaker 2>Thank you for being here too. Thanks David. It's always

0:47:53.560 --> 0:47:55.120
<v Speaker 2>great talking to you and I enjoy it. It's a

0:47:55.160 --> 0:47:58.920
<v Speaker 2>lot of fun we were and I love your podcast.

0:47:58.960 --> 0:48:06.920
<v Speaker 1>So that was Jeff Hawkins, theoretician and author of A

0:48:07.120 --> 0:48:08.200
<v Speaker 1>Thousand Brains.

0:48:08.680 --> 0:48:08.879
<v Speaker 2>Now.

0:48:08.920 --> 0:48:12.000
<v Speaker 1>I love his model because it builds on previous research

0:48:12.080 --> 0:48:15.239
<v Speaker 1>and gives us a possible starting point for how this

0:48:15.320 --> 0:48:18.759
<v Speaker 1>whole system might be working. This is a view of

0:48:18.800 --> 0:48:21.400
<v Speaker 1>the brain in which you don't have just a single

0:48:21.560 --> 0:48:25.200
<v Speaker 1>model of the world being constructed, but hundreds of thousands

0:48:25.200 --> 0:48:29.680
<v Speaker 1>of little models, each viewing the world through their little straw.

0:48:29.840 --> 0:48:33.600
<v Speaker 1>And these models are independent, but they're not completely independent,

0:48:33.680 --> 0:48:36.840
<v Speaker 1>so they communicate with each other and they vote, and

0:48:36.880 --> 0:48:40.720
<v Speaker 1>in this way, the whole system converges on its best

0:48:40.960 --> 0:48:44.120
<v Speaker 1>guess of what's going on out there in the world.

0:48:44.600 --> 0:48:48.520
<v Speaker 1>And by this mechanism we construct a full three dimensional

0:48:48.560 --> 0:48:52.000
<v Speaker 1>representation of the environment around us, with its sites and

0:48:52.120 --> 0:48:55.600
<v Speaker 1>sounds and three dimensional structure. So this gives us a

0:48:55.920 --> 0:48:59.719
<v Speaker 1>clear framework for thinking about the neocortex. Now, we might

0:48:59.719 --> 0:49:02.160
<v Speaker 1>not know, oh for a while, if this answers everything,

0:49:02.600 --> 0:49:06.000
<v Speaker 1>or it needs some tweaking, or if there are far

0:49:06.160 --> 0:49:09.719
<v Speaker 1>better models coming down the pike. But what I absolutely

0:49:09.800 --> 0:49:13.520
<v Speaker 1>love about this is that this is where the endeavor

0:49:13.560 --> 0:49:19.600
<v Speaker 1>of science shines. Taking something that seems insanely complex, eighty

0:49:19.600 --> 0:49:23.680
<v Speaker 1>six billion neurons with two hundred trillion connections, something of

0:49:24.080 --> 0:49:29.239
<v Speaker 1>such vast complexity that it bankrupts our language, and saying, wait,

0:49:29.600 --> 0:49:32.799
<v Speaker 1>what if there's a really simple principle at work here?

0:49:32.840 --> 0:49:35.839
<v Speaker 1>What if there's a way that we could reduce all

0:49:35.920 --> 0:49:39.080
<v Speaker 1>that complexity by just looking at this from a new angle.

0:49:39.600 --> 0:49:42.040
<v Speaker 1>So let me give an analogy here. Just think about

0:49:42.040 --> 0:49:44.759
<v Speaker 1>what it would be like if you had a magical

0:49:44.960 --> 0:49:48.319
<v Speaker 1>microscope with which you could look into a cell and

0:49:48.400 --> 0:49:51.520
<v Speaker 1>into the nucleus in the middle. What you would see

0:49:51.600 --> 0:49:56.440
<v Speaker 1>is mind boggling complexity. There. You'd see millions or billions

0:49:56.440 --> 0:50:01.279
<v Speaker 1>of molecules racing around and interacting and doing god knows what,

0:50:01.760 --> 0:50:04.520
<v Speaker 1>and you'd say, wow, there's no way.

0:50:04.360 --> 0:50:05.719
<v Speaker 2>We're ever going to understand this.

0:50:06.560 --> 0:50:10.520
<v Speaker 1>But then Krick and Watson come along and say, actually,

0:50:10.560 --> 0:50:15.600
<v Speaker 1>the important thing is this DNA molecule and keeping the

0:50:15.840 --> 0:50:17.680
<v Speaker 1>order of these base.

0:50:17.480 --> 0:50:20.640
<v Speaker 2>Pairs, and all the rest is housekeeping.

0:50:21.520 --> 0:50:26.239
<v Speaker 1>And suddenly the fog of confusion lifts. Now something that

0:50:26.360 --> 0:50:29.480
<v Speaker 1>seemed well beyond us can be described in a sentence

0:50:29.560 --> 0:50:34.440
<v Speaker 1>or two, and science leaps forward and things move fast

0:50:34.480 --> 0:50:36.719
<v Speaker 1>from there. I worked with Francis Crik when I was

0:50:36.719 --> 0:50:40.440
<v Speaker 1>in my postdoctoral years, and now I look around me

0:50:40.560 --> 0:50:44.000
<v Speaker 1>at Stanford and Silicon Valley, and there are thousands of

0:50:44.200 --> 0:50:49.040
<v Speaker 1>laboratories and companies doing amazing work with genomes, and their

0:50:49.160 --> 0:50:54.480
<v Speaker 1>existence results entirely from this one simplifying insight about DNA

0:50:54.840 --> 0:50:59.280
<v Speaker 1>in nineteen fifty three, that new model that suddenly clarified

0:50:59.760 --> 0:51:04.040
<v Speaker 1>what what is happening inside the nucleus. By the same token,

0:51:04.160 --> 0:51:07.080
<v Speaker 1>this is what we're trying to do with the brain.

0:51:07.320 --> 0:51:13.520
<v Speaker 1>Brains appear to be ferociously complex, and yet we have

0:51:13.680 --> 0:51:15.480
<v Speaker 1>lots of brains running around the planet.

0:51:15.480 --> 0:51:17.480
<v Speaker 2>We've got eight point two billion of them.

0:51:18.000 --> 0:51:22.719
<v Speaker 1>So something must be straightforward about their architecture, or else

0:51:22.800 --> 0:51:25.680
<v Speaker 1>Mother Nature wouldn't be able to build these over and

0:51:25.719 --> 0:51:30.480
<v Speaker 1>over with such reliability. You couldn't drop this massive quantity

0:51:30.560 --> 0:51:33.720
<v Speaker 1>into the world and have them all functioning well unless

0:51:33.719 --> 0:51:38.600
<v Speaker 1>there was something pretty uncomplicated about building and running a brain.

0:51:39.560 --> 0:51:44.000
<v Speaker 1>So that is the overarching game of science to take

0:51:44.040 --> 0:51:50.080
<v Speaker 1>the overwhelming complexity around us and to find new angles

0:51:50.160 --> 0:51:58.120
<v Speaker 1>to look at things to reveal simplicity. Go to eagleman

0:51:58.200 --> 0:52:02.000
<v Speaker 1>dot com slash podcast for more information and find further reading.

0:52:02.520 --> 0:52:05.360
<v Speaker 1>Send me an email at podcasts at eagleman dot com

0:52:05.360 --> 0:52:08.880
<v Speaker 1>with questions or discussion, and check out and subscribe to

0:52:09.080 --> 0:52:12.480
<v Speaker 1>Inner Cosmos on YouTube for videos of each episode and

0:52:12.520 --> 0:52:16.520
<v Speaker 1>to leave comments until next time. I'm David Eagleman, and

0:52:16.560 --> 0:52:18.319
<v Speaker 1>this is Inner Cosmos.