WEBVTT - Ep63  "Why do brains love faces?"

0:00:04.480 --> 0:00:10.039
<v Speaker 1>Brains love to look at faces, But why and what

0:00:10.200 --> 0:00:15.400
<v Speaker 1>is face blindness? And what is a super recognizer? Why

0:00:15.400 --> 0:00:19.079
<v Speaker 1>does your electrical plug sometimes look to you like a

0:00:19.120 --> 0:00:24.160
<v Speaker 1>little face? Did aliens plant a signal for us on Mars?

0:00:24.600 --> 0:00:28.200
<v Speaker 1>Or are we looking at a quirk of our own brains?

0:00:28.520 --> 0:00:30.200
<v Speaker 1>What does any of this have to do with the

0:00:30.640 --> 0:00:35.360
<v Speaker 1>neurologist Oliver Sachs and his inability to recognize most of

0:00:35.400 --> 0:00:38.160
<v Speaker 1>the people in his life, or looking at a magazine

0:00:38.280 --> 0:00:44.480
<v Speaker 1>upside down? Or what the mistakes in computerized face recognition technology?

0:00:45.080 --> 0:00:51.199
<v Speaker 1>Tell us Welcome to Inner Cosmos with me David Eagleman.

0:00:51.560 --> 0:00:55.000
<v Speaker 1>I'm a neuroscientist and an author at Stanford, and in

0:00:55.040 --> 0:00:59.600
<v Speaker 1>these episodes we dive deeply into our three pound universe

0:01:00.080 --> 0:01:08.880
<v Speaker 1>to understand some of the most surprising aspects of our lives.

0:01:12.440 --> 0:01:17.640
<v Speaker 1>Today's episode is about faces. Now, let's say I posed

0:01:18.080 --> 0:01:22.679
<v Speaker 1>the question to you, how does the brain recognize faces?

0:01:23.280 --> 0:01:26.600
<v Speaker 1>You may reasonably think, why is that even a question?

0:01:27.120 --> 0:01:32.440
<v Speaker 1>Recognizing people's faces isn't really that hard, But think about

0:01:32.480 --> 0:01:37.200
<v Speaker 1>how similar faces are to one another. They're insanely similar.

0:01:37.360 --> 0:01:42.160
<v Speaker 1>You have two eyes, nose, mouth, chin, ears, everyone's face

0:01:42.200 --> 0:01:46.840
<v Speaker 1>looks pretty much the same people's faces are pretty much

0:01:46.959 --> 0:01:51.240
<v Speaker 1>very tiny variations on a theme. The only reason you

0:01:51.320 --> 0:01:55.200
<v Speaker 1>are able to distinguish faces, literally thousands of faces from

0:01:55.240 --> 0:01:59.840
<v Speaker 1>one another, is because your brain devotes an enormous amount

0:01:59.880 --> 0:02:03.280
<v Speaker 1>of circuitry to it. It puts so much effort in

0:02:03.920 --> 0:02:07.360
<v Speaker 1>that it feels effortless to you. And we have all

0:02:07.440 --> 0:02:14.120
<v Speaker 1>this circuitry because face recognition is extremely salient. It's how

0:02:14.560 --> 0:02:19.359
<v Speaker 1>we recognize this package of identity from this other. Oh

0:02:19.440 --> 0:02:23.880
<v Speaker 1>that's my mother, that's my neighbor, that's my wife. There's

0:02:24.440 --> 0:02:27.440
<v Speaker 1>Tom Cruise and his face is similar but different from

0:02:27.800 --> 0:02:32.840
<v Speaker 1>Dwayne Johnson, and that's different from Joe Biden. It's massively

0:02:32.880 --> 0:02:37.400
<v Speaker 1>important for us as a social species to keep track

0:02:37.880 --> 0:02:42.200
<v Speaker 1>of identities. Your brain wants to know who it is

0:02:42.280 --> 0:02:45.679
<v Speaker 1>and everything that entails, like is that a friend or

0:02:45.720 --> 0:02:50.240
<v Speaker 1>an enemy? What are the risks and opportunities here? And

0:02:50.280 --> 0:02:53.040
<v Speaker 1>the answer to that is massively different. If you're looking

0:02:53.440 --> 0:02:57.160
<v Speaker 1>at the face of Ted Bundy versus your best pal,

0:02:57.480 --> 0:03:00.040
<v Speaker 1>and keep in mind that to a space alien and

0:03:00.080 --> 0:03:04.680
<v Speaker 1>these faces would be essentially indistinguishable. Now, when you're trying

0:03:04.680 --> 0:03:09.040
<v Speaker 1>to distinguish faces, there are lots of other cues like

0:03:09.520 --> 0:03:13.800
<v Speaker 1>hairline and hairstyle and height, and details of their skin

0:03:14.040 --> 0:03:18.519
<v Speaker 1>from color to smoothness, and even cues like how they

0:03:18.639 --> 0:03:23.600
<v Speaker 1>walk and their clothing choice. And context also plays a

0:03:23.680 --> 0:03:28.600
<v Speaker 1>large role. You've probably experienced some time when you saw

0:03:28.680 --> 0:03:31.320
<v Speaker 1>somebody you know well, like a neighbor or a colleague,

0:03:31.639 --> 0:03:35.640
<v Speaker 1>but it's in an unfamiliar setting and you struggle to

0:03:35.720 --> 0:03:39.720
<v Speaker 1>recognize them immediately. Why does that happen Because our brains

0:03:39.880 --> 0:03:44.760
<v Speaker 1>use contextual cues when identifying people, So our brains rely

0:03:45.000 --> 0:03:49.960
<v Speaker 1>not just on facial features, but also on surrounding context

0:03:49.960 --> 0:03:54.760
<v Speaker 1>of all sorts to identify people. But despite all these

0:03:54.800 --> 0:03:58.920
<v Speaker 1>other cues that we use, just note that facial recognition

0:03:59.200 --> 0:04:02.640
<v Speaker 1>is possible even over zoom, where many of those other

0:04:02.800 --> 0:04:06.160
<v Speaker 1>cues are gone. And even if you've got a bunch

0:04:06.240 --> 0:04:09.400
<v Speaker 1>of movie stars to put on wigs and change their

0:04:09.440 --> 0:04:12.440
<v Speaker 1>hairlines and so on, you'd still be pretty good at

0:04:12.440 --> 0:04:17.840
<v Speaker 1>distinguishing them just based on the details of the face itself.

0:04:18.360 --> 0:04:20.800
<v Speaker 1>And I think there are at least two ways to

0:04:20.880 --> 0:04:26.479
<v Speaker 1>appreciate how hard a challenge face recognition is to the brain.

0:04:26.880 --> 0:04:30.640
<v Speaker 1>The first is to understand how difficult it is to

0:04:30.680 --> 0:04:35.960
<v Speaker 1>get computers to recognize and distinguish human faces. Now you

0:04:36.040 --> 0:04:40.600
<v Speaker 1>might think, aren't we pretty good at this with modern technology. Yes,

0:04:41.000 --> 0:04:45.400
<v Speaker 1>but this has been a sixty year slog by very

0:04:45.440 --> 0:04:48.400
<v Speaker 1>smart people in the computer science world. Starting in the

0:04:48.480 --> 0:04:53.000
<v Speaker 1>nineteen sixties, people realized that face recognition was a really

0:04:53.040 --> 0:04:56.839
<v Speaker 1>hard problem, and they've been climbing that hill ever since.

0:04:57.279 --> 0:05:01.000
<v Speaker 1>So the way that researchers have gotten face recognition to

0:05:01.120 --> 0:05:05.800
<v Speaker 1>work on computers nowadays is by busting the problem up

0:05:06.040 --> 0:05:12.960
<v Speaker 1>into different computational stages. So first, you have algorithms just

0:05:13.000 --> 0:05:16.479
<v Speaker 1>for detecting a face, like is there a face out there?

0:05:17.200 --> 0:05:20.240
<v Speaker 1>Then you have a second step where you segment this

0:05:20.400 --> 0:05:23.880
<v Speaker 1>face from the background and figure out how the face

0:05:24.080 --> 0:05:26.480
<v Speaker 1>is aligned to tell you about the size of the

0:05:26.520 --> 0:05:29.760
<v Speaker 1>face and the illumination on the face and even the

0:05:29.839 --> 0:05:32.920
<v Speaker 1>pose of the face. Then you have a third step

0:05:33.160 --> 0:05:37.159
<v Speaker 1>where you extract the facial features, like here's the length

0:05:37.200 --> 0:05:39.960
<v Speaker 1>of the nose and the distance between the eyes, and

0:05:40.000 --> 0:05:42.520
<v Speaker 1>the shape of the mouth and the position of the ears.

0:05:42.880 --> 0:05:46.160
<v Speaker 1>All these things you pinpoint and you measure them, and

0:05:46.200 --> 0:05:50.279
<v Speaker 1>then in the fourth step, you match those numbers against

0:05:50.320 --> 0:05:53.719
<v Speaker 1>a database of faces to see if you can identify

0:05:53.800 --> 0:05:57.320
<v Speaker 1>this one. So this is a really complicated algorithm and

0:05:57.400 --> 0:06:01.160
<v Speaker 1>it's instructive to see how computer get things wrong. I

0:06:01.200 --> 0:06:05.400
<v Speaker 1>saw a great photograph where it's a family sitting on

0:06:05.440 --> 0:06:08.960
<v Speaker 1>their porch at Halloween, and the computer has identified each

0:06:09.000 --> 0:06:12.160
<v Speaker 1>of their faces and put a little label next to it,

0:06:12.279 --> 0:06:16.159
<v Speaker 1>like Henry and John and Sarah, and then it identifies

0:06:16.480 --> 0:06:21.040
<v Speaker 1>another face and says unknown person. But that face is

0:06:21.200 --> 0:06:23.880
<v Speaker 1>the Jack O lantern, the pumpkin that's sitting there on

0:06:23.920 --> 0:06:27.400
<v Speaker 1>the porch. Now, a human observer would never make that

0:06:27.560 --> 0:06:30.479
<v Speaker 1>kind of mistake. But this goes to show that it's

0:06:30.560 --> 0:06:36.760
<v Speaker 1>a very computationally challenging problem. So some version of this

0:06:36.880 --> 0:06:41.640
<v Speaker 1>very complicated algorithm, probably not exactly the same, but equally complex.

0:06:42.120 --> 0:06:44.960
<v Speaker 1>This is what your brain is doing under the hood

0:06:45.600 --> 0:06:49.280
<v Speaker 1>when you just glanced at some billboard and you think, oh,

0:06:49.400 --> 0:06:52.440
<v Speaker 1>that's Paul Giamati. I love that guy. And we'll come

0:06:52.520 --> 0:06:54.920
<v Speaker 1>back to what the brain is doing in just a moment,

0:06:55.320 --> 0:06:57.280
<v Speaker 1>but I just want to say for now that quite

0:06:57.279 --> 0:07:02.520
<v Speaker 1>often we find that the tasks that seem most effortless

0:07:02.560 --> 0:07:08.039
<v Speaker 1>to us are the things underpinned by the most brain circuitry.

0:07:08.440 --> 0:07:10.760
<v Speaker 1>And I want to look at a second way to

0:07:10.880 --> 0:07:15.360
<v Speaker 1>appreciate how hard a challenge face recognition is for the brain.

0:07:16.120 --> 0:07:20.920
<v Speaker 1>Just look at a bunch of Golden Retriever faces. If

0:07:20.960 --> 0:07:23.480
<v Speaker 1>you were in a situation where you were hanging out

0:07:23.520 --> 0:07:27.880
<v Speaker 1>with hundreds or thousands of Golden Retriever dogs, do they

0:07:27.920 --> 0:07:33.080
<v Speaker 1>really look that different to you? Or consider horse faces

0:07:33.400 --> 0:07:39.560
<v Speaker 1>or squirrel faces or cowfaces. Obviously we can imagine some

0:07:39.760 --> 0:07:42.200
<v Speaker 1>faces at the extreme that look a little bit different,

0:07:42.200 --> 0:07:44.960
<v Speaker 1>But in general, if I dropped you into the middle

0:07:44.960 --> 0:07:48.480
<v Speaker 1>of a giant ranch and you looked at thousands of cows,

0:07:48.680 --> 0:07:52.840
<v Speaker 1>you'd say, okay, well, each face has two eyes and

0:07:52.920 --> 0:07:55.640
<v Speaker 1>a snout and a mouth, and there's just not that

0:07:55.760 --> 0:07:59.880
<v Speaker 1>much difference from one face to another. You wouldn't meet

0:08:00.400 --> 0:08:03.400
<v Speaker 1>recognized if one of those cows had been in a

0:08:03.440 --> 0:08:06.760
<v Speaker 1>movie like oh, that's that famous cow, or if one

0:08:06.800 --> 0:08:10.600
<v Speaker 1>of those cows was wanted by the legal system, like

0:08:11.120 --> 0:08:14.200
<v Speaker 1>wait a minute, I recognize that cow from the poster

0:08:14.320 --> 0:08:17.000
<v Speaker 1>of the post office. And of course, if one of

0:08:17.040 --> 0:08:19.960
<v Speaker 1>these cows was looking at you and a bunch of

0:08:19.960 --> 0:08:23.840
<v Speaker 1>other humans, she would feel exactly the same way. There's

0:08:23.920 --> 0:08:28.480
<v Speaker 1>not that much difference between human faces. Now side note,

0:08:28.520 --> 0:08:31.560
<v Speaker 1>which we'll return to. If you were the rancher, you

0:08:31.640 --> 0:08:35.319
<v Speaker 1>could get better at distinguishing cow faces. This is because

0:08:35.360 --> 0:08:40.319
<v Speaker 1>of brain plasticity and because distinguishing the cows is salient

0:08:40.640 --> 0:08:44.200
<v Speaker 1>to your rancher brain. But for now, what I want

0:08:44.200 --> 0:08:49.079
<v Speaker 1>to surface is just how similar faces are before you

0:08:49.160 --> 0:08:54.280
<v Speaker 1>get good at them from experience. Faces are unbelievably similar.

0:08:54.640 --> 0:08:58.400
<v Speaker 1>And one expression of this difficulty in distinguishing faces comes

0:08:58.400 --> 0:09:02.720
<v Speaker 1>from a psychological effe fact called the other race effect,

0:09:03.200 --> 0:09:06.160
<v Speaker 1>and this refers to the tendency of people to have

0:09:06.679 --> 0:09:11.640
<v Speaker 1>more difficulty recognizing faces that belong to races that are

0:09:11.640 --> 0:09:14.200
<v Speaker 1>not their own race. This phenomenon is also known as

0:09:14.240 --> 0:09:18.440
<v Speaker 1>the cross race effect or the own race bias, and

0:09:18.520 --> 0:09:21.400
<v Speaker 1>it's been widely studied and you see it across all cultures.

0:09:21.800 --> 0:09:24.440
<v Speaker 1>Now people sometimes hear this and they take this as

0:09:24.440 --> 0:09:28.679
<v Speaker 1>evidence for racism. It's not racism in the sense of

0:09:28.720 --> 0:09:33.040
<v Speaker 1>you treating your own group better and other groups more poorly. Instead,

0:09:33.120 --> 0:09:36.440
<v Speaker 1>the other race effect simply results from the fact that

0:09:36.480 --> 0:09:39.880
<v Speaker 1>your brain gets trained up on the faces around you,

0:09:40.360 --> 0:09:44.640
<v Speaker 1>and you are better at recognizing those faces than other faces.

0:09:45.000 --> 0:09:48.800
<v Speaker 1>So if you grew up in Cambodia, you will be

0:09:48.960 --> 0:09:53.160
<v Speaker 1>excellent and distinguishing Cambodian faces, and maybe not so great

0:09:53.240 --> 0:09:57.080
<v Speaker 1>at distinguishing Norwegian faces. If you grew up in Norway,

0:09:57.600 --> 0:10:01.480
<v Speaker 1>you're excellent and distinguishing Norwegian f but not so great

0:10:01.559 --> 0:10:06.000
<v Speaker 1>at distinguishing Zimbabwean faces. And if you grew up in Zimbabwe,

0:10:06.080 --> 0:10:09.360
<v Speaker 1>you're not so good at Intuit Eskimo faces and so on.

0:10:09.760 --> 0:10:14.480
<v Speaker 1>So your brain becomes attuned to the type of faces

0:10:14.520 --> 0:10:18.920
<v Speaker 1>that you see most often, and this heightened sensitivity makes

0:10:19.000 --> 0:10:23.679
<v Speaker 1>us better at distinguishing among those familiar faces. As a

0:10:23.720 --> 0:10:27.280
<v Speaker 1>side note, can you mitigate the other race effect? Sure,

0:10:27.360 --> 0:10:31.920
<v Speaker 1>it's just about exposure. You can improve the ability to

0:10:32.000 --> 0:10:37.480
<v Speaker 1>recognize other race faces through training and exposure. Now, Hollywood

0:10:37.520 --> 0:10:39.640
<v Speaker 1>has been on the forefront of this for a long time,

0:10:39.960 --> 0:10:42.960
<v Speaker 1>making sure that there's a mixture that we all get

0:10:42.960 --> 0:10:46.800
<v Speaker 1>exposed to different sorts of faces, and that's presumably very useful.

0:10:47.160 --> 0:10:50.040
<v Speaker 1>It could be noted that Hollywood has tended to be

0:10:50.160 --> 0:10:52.960
<v Speaker 1>mostly in love with white faces and black faces, and

0:10:53.000 --> 0:10:55.080
<v Speaker 1>that leaves a lot of people out of the mix.

0:10:55.480 --> 0:10:58.840
<v Speaker 1>So we the viewing audience don't necessarily get much better

0:10:58.880 --> 0:11:03.320
<v Speaker 1>at Cambodian face or Norwegian faces, or Inuit Eskimo faces

0:11:03.400 --> 0:11:07.680
<v Speaker 1>or whatever. But it's a start to training our brains

0:11:07.760 --> 0:11:11.320
<v Speaker 1>on a broader diet of faces. So the other race

0:11:11.360 --> 0:11:14.920
<v Speaker 1>effect underscores how our brains can get tuned to the

0:11:14.960 --> 0:11:19.040
<v Speaker 1>social world around us, and it highlights our natural tendency

0:11:19.400 --> 0:11:35.280
<v Speaker 1>to recognize familiar faces more easily. Okay, so where are

0:11:35.280 --> 0:11:39.320
<v Speaker 1>we so far. It's really hard to distinguish faces unless

0:11:39.360 --> 0:11:42.360
<v Speaker 1>your brain has had lots and lots of practice at it.

0:11:42.840 --> 0:11:46.920
<v Speaker 1>But again it seems effortless to us now. Even though

0:11:47.120 --> 0:11:50.680
<v Speaker 1>we're not always great at distinguishing faces from one another,

0:11:50.880 --> 0:11:54.800
<v Speaker 1>unless we've had lots of practice, we are highly programmed

0:11:54.880 --> 0:11:58.640
<v Speaker 1>to identify that there is a face there and to

0:11:58.800 --> 0:12:02.640
<v Speaker 1>zoom in on it. Why because faces carry so much

0:12:02.640 --> 0:12:05.840
<v Speaker 1>information for us, And this is why you sometimes look

0:12:05.840 --> 0:12:09.240
<v Speaker 1>at the electrical plug in your wall and you see

0:12:09.240 --> 0:12:12.400
<v Speaker 1>a little face, or you see a pattern of burn

0:12:12.480 --> 0:12:14.800
<v Speaker 1>marks on a piece of toast and you think, oh,

0:12:14.840 --> 0:12:17.679
<v Speaker 1>that looks just like a face, or you look at

0:12:17.760 --> 0:12:20.080
<v Speaker 1>the surface of the moon and you think, oh, there's

0:12:20.200 --> 0:12:23.440
<v Speaker 1>a man there looking at me. In science, this is

0:12:23.480 --> 0:12:30.360
<v Speaker 1>called paradolia. Paradolia refers to our tendency to perceive specific

0:12:30.520 --> 0:12:35.240
<v Speaker 1>and often meaningful images in random patterns, and the most

0:12:35.280 --> 0:12:37.840
<v Speaker 1>common usage of this word is in the case of

0:12:37.960 --> 0:12:42.920
<v Speaker 1>perceiving the pattern of a face in random unrelated objects.

0:12:42.920 --> 0:12:45.760
<v Speaker 1>And we've all been there where your brain says, oh,

0:12:46.000 --> 0:12:47.880
<v Speaker 1>that looks like a face when you're looking at a

0:12:48.280 --> 0:12:53.960
<v Speaker 1>cloud or a rock formation, or some arrangement of fruits

0:12:53.960 --> 0:12:55.920
<v Speaker 1>on a table. And I don't know if you've ever

0:12:55.920 --> 0:12:59.480
<v Speaker 1>seen this image from the surface of Mars. I'll put

0:12:59.480 --> 0:13:03.520
<v Speaker 1>it on the show. But on the bumpy landscape of Mars,

0:13:03.720 --> 0:13:06.120
<v Speaker 1>there are some bumps that happen to fall into the

0:13:06.160 --> 0:13:09.120
<v Speaker 1>pattern that look like two y's and a nose and

0:13:09.160 --> 0:13:12.040
<v Speaker 1>a mouth. And many people go nuts about this image

0:13:12.080 --> 0:13:15.520
<v Speaker 1>because they suggest that it's a sculpture of a human

0:13:15.679 --> 0:13:20.080
<v Speaker 1>face planted there by aliens. But probably not. This is

0:13:20.200 --> 0:13:28.199
<v Speaker 1>merely our brains experiencing garden variety paradolia. We assign patterns

0:13:28.280 --> 0:13:33.560
<v Speaker 1>to random inputs, and in particular, we love to see faces. Okay,

0:13:33.800 --> 0:13:36.920
<v Speaker 1>but why does this happen to us so commonly. Well,

0:13:37.400 --> 0:13:42.000
<v Speaker 1>it's because we are hardwired to see faces everywhere, and

0:13:42.040 --> 0:13:46.920
<v Speaker 1>the reason is that faces are crucial for social interaction.

0:13:47.600 --> 0:13:51.840
<v Speaker 1>Seeing a face tells us about a person's identity, their

0:13:52.000 --> 0:13:55.920
<v Speaker 1>emotional state, whether they're a friend or a foe. This

0:13:56.080 --> 0:14:00.680
<v Speaker 1>ability helped our ancestors to survive by recognize is saying oh,

0:14:00.720 --> 0:14:04.439
<v Speaker 1>that's a tribe member. Oh, that's a potential threat. In fact,

0:14:04.520 --> 0:14:08.360
<v Speaker 1>even newborn babies prefer to look at faces. They look

0:14:08.360 --> 0:14:12.640
<v Speaker 1>at face like stimuli more than other stimuli. So take

0:14:13.040 --> 0:14:16.800
<v Speaker 1>two little circles and a little square beneath that, and

0:14:16.840 --> 0:14:20.560
<v Speaker 1>a little rectangle underneath that, they'll stare at that as

0:14:20.600 --> 0:14:24.800
<v Speaker 1>opposed to the same shapes in a different configuration. And

0:14:24.840 --> 0:14:30.360
<v Speaker 1>this underscores that brains are primed to focus on faces.

0:14:30.800 --> 0:14:35.120
<v Speaker 1>Paridolia is just an extension of this bias to recognize faces.

0:14:35.160 --> 0:14:39.960
<v Speaker 1>It's our brain's way of staying vigilant because in the wild,

0:14:40.160 --> 0:14:44.120
<v Speaker 1>it is safer to mistakenly see a face where there

0:14:44.200 --> 0:14:48.720
<v Speaker 1>isn't one than to miss a real face. So this

0:14:49.080 --> 0:14:53.720
<v Speaker 1>hypersensitivity to faces means the difference between life and death

0:14:53.760 --> 0:14:58.480
<v Speaker 1>in some situations, like spotting a lurking predator or an

0:14:58.520 --> 0:15:02.920
<v Speaker 1>aggressive intruder. By the way, I'll just mention that paradolia

0:15:03.040 --> 0:15:06.720
<v Speaker 1>is something that artists have always exploited. They turn random

0:15:06.760 --> 0:15:12.320
<v Speaker 1>patterns or abstract shapes into recognizable faces. Just as one example,

0:15:12.440 --> 0:15:15.880
<v Speaker 1>one of my favorite artists, Salvador Dhli. He very often

0:15:15.920 --> 0:15:20.800
<v Speaker 1>takes advantage of paradolia to create double images that play

0:15:20.880 --> 0:15:24.280
<v Speaker 1>with the viewer's perception. You can see this as a

0:15:24.520 --> 0:15:30.200
<v Speaker 1>rock formation or as a face. So paradolia highlights how

0:15:30.200 --> 0:15:32.520
<v Speaker 1>our brains are constantly trying to make sense of the

0:15:32.560 --> 0:15:37.200
<v Speaker 1>world around us, even when there isn't anything immediately recognizable.

0:15:37.720 --> 0:15:39.640
<v Speaker 1>And I want to give just one other angle on

0:15:39.760 --> 0:15:43.600
<v Speaker 1>paradolia here. If you're a regular listener to this podcast,

0:15:44.040 --> 0:15:46.480
<v Speaker 1>you know that I often talk about how the brain's

0:15:46.720 --> 0:15:50.600
<v Speaker 1>main job is to build an internal model of the

0:15:50.680 --> 0:15:56.720
<v Speaker 1>outside world, and our model influences our perception. So whatever

0:15:56.800 --> 0:16:00.920
<v Speaker 1>you specialize in, you'll see lots of that world. We're

0:16:00.960 --> 0:16:05.480
<v Speaker 1>all specialists at faces, and we often see those where

0:16:05.480 --> 0:16:09.480
<v Speaker 1>there isn't actually one, but it generalizes beyond that. So

0:16:09.640 --> 0:16:11.680
<v Speaker 1>for me personally, when I look around the world, I

0:16:11.760 --> 0:16:15.800
<v Speaker 1>tend to impose shapes that look like brains. I think, oh,

0:16:15.800 --> 0:16:18.880
<v Speaker 1>there's a brain, or there's a midsadgital section, or there's

0:16:18.920 --> 0:16:23.320
<v Speaker 1>a cerebellum. My friends who are pilots tend to see

0:16:23.520 --> 0:16:28.440
<v Speaker 1>airplane shapes and runways. My collogist friends see mushroom shapes,

0:16:28.840 --> 0:16:32.680
<v Speaker 1>my dermatologists friends see melanomas, and so on. Whatever you

0:16:32.840 --> 0:16:37.360
<v Speaker 1>specialize in makes you an expert in detecting that in

0:16:37.440 --> 0:16:41.840
<v Speaker 1>the world, and you impose that interpretation on lots of things.

0:16:42.280 --> 0:16:48.240
<v Speaker 1>It's paridolia writ large. Okay, so back to faces in particular.

0:16:48.680 --> 0:16:51.600
<v Speaker 1>Now we're ready to turn to how the brain does

0:16:51.640 --> 0:16:55.600
<v Speaker 1>its magic with face recognition. One of the key areas

0:16:55.640 --> 0:16:58.400
<v Speaker 1>in our brain that does this is in the temporal lobe.

0:16:58.440 --> 0:17:02.760
<v Speaker 1>It's a little region called the fusiform face area or

0:17:03.160 --> 0:17:07.000
<v Speaker 1>FFA fusiform face area. Now, we can see in brain

0:17:07.040 --> 0:17:12.439
<v Speaker 1>imaging that this area is particularly sensitive to faces. But

0:17:12.680 --> 0:17:15.239
<v Speaker 1>what's interesting is it's not just any part of the

0:17:15.280 --> 0:17:19.720
<v Speaker 1>face that gets attention from the FFA, it's the holistic

0:17:19.880 --> 0:17:25.080
<v Speaker 1>view of the face. So the FFA responds strongly to

0:17:25.200 --> 0:17:30.200
<v Speaker 1>the overall configuration of a face rather than individual features.

0:17:30.480 --> 0:17:34.720
<v Speaker 1>So that allows us to quickly recognize faces and familiar faces,

0:17:34.880 --> 0:17:37.040
<v Speaker 1>even if we only get a quick glimpse of them.

0:17:37.520 --> 0:17:40.639
<v Speaker 1>When you look at these brain imaging studies, for example,

0:17:40.720 --> 0:17:46.400
<v Speaker 1>with fMRI, you find that this fusiform face area responds

0:17:46.560 --> 0:17:51.200
<v Speaker 1>more to familiar faces than to unfamiliar faces. And given

0:17:51.240 --> 0:17:54.240
<v Speaker 1>what we saw a moment ago with the other race effect,

0:17:54.800 --> 0:17:57.960
<v Speaker 1>the FFA is more active when you're viewing faces of

0:17:58.000 --> 0:18:01.800
<v Speaker 1>your own race compared to faces of other races. So again,

0:18:02.080 --> 0:18:06.120
<v Speaker 1>just like recognizing the distinction between different cows if you're

0:18:06.200 --> 0:18:10.200
<v Speaker 1>the rancher, or distinguishing faces that you grew up with

0:18:10.240 --> 0:18:13.200
<v Speaker 1>as opposed to other cultures that you rarely see. Your

0:18:13.320 --> 0:18:18.600
<v Speaker 1>visual system is specialized to efficiently process faces that you

0:18:18.680 --> 0:18:21.960
<v Speaker 1>see a lot more. Now back to this issue about

0:18:22.040 --> 0:18:27.159
<v Speaker 1>recognizing the face holistically rather than detail by detail. This

0:18:27.280 --> 0:18:31.480
<v Speaker 1>is what underlies the phenomenon that's called the inversion effect,

0:18:31.600 --> 0:18:34.840
<v Speaker 1>which is when you see a face that is upside down,

0:18:35.480 --> 0:18:39.359
<v Speaker 1>it's really hard to recognize. So try this. Just open

0:18:39.400 --> 0:18:44.280
<v Speaker 1>a magazine upside down and flip through the pages and

0:18:44.280 --> 0:18:47.159
<v Speaker 1>see if you can recognize the people in it. And

0:18:47.200 --> 0:18:49.240
<v Speaker 1>then when you flip it right side up, it's a

0:18:49.280 --> 0:18:52.440
<v Speaker 1>whole different experience. And this is because this part of

0:18:52.480 --> 0:18:57.080
<v Speaker 1>the brain FFA is super specialized on faces, and it's

0:18:57.119 --> 0:19:00.960
<v Speaker 1>always seen faces in a particular way, and it's recognizing

0:19:00.960 --> 0:19:04.320
<v Speaker 1>the whole big picture with two eyes above the nose,

0:19:04.600 --> 0:19:08.160
<v Speaker 1>above the mouth, and if you throw a ranch in that,

0:19:08.600 --> 0:19:11.280
<v Speaker 1>it doesn't function so well. The key thing is that

0:19:11.359 --> 0:19:15.840
<v Speaker 1>this inversion effect is much stronger for faces than for

0:19:15.960 --> 0:19:22.800
<v Speaker 1>other objects, which highlights this very specialized nature of face processing. Now,

0:19:22.880 --> 0:19:25.479
<v Speaker 1>what's wild is that this part of the brain is

0:19:25.600 --> 0:19:29.320
<v Speaker 1>trying to see faces everywhere, as we saw in paridolia.

0:19:29.800 --> 0:19:32.520
<v Speaker 1>And so if someone puts an electrode into this part

0:19:32.520 --> 0:19:35.520
<v Speaker 1>of your brain, then you might look at, let's say,

0:19:35.520 --> 0:19:38.600
<v Speaker 1>a soccer ball, and when the electrode turns on and

0:19:38.680 --> 0:19:42.760
<v Speaker 1>stimulates this area with electricity, the hexagons of the soccer

0:19:42.800 --> 0:19:46.480
<v Speaker 1>ball seem to you to become the eyes and the

0:19:46.560 --> 0:19:49.879
<v Speaker 1>mouth of a face. So activity in this part of

0:19:49.880 --> 0:19:54.040
<v Speaker 1>the brain is always working over time to detect faces

0:19:54.040 --> 0:19:57.280
<v Speaker 1>in the world, and sometimes it does this by imposing

0:19:57.320 --> 0:20:02.240
<v Speaker 1>the interpretation onto the canvas of the world. Now, consistent

0:20:02.240 --> 0:20:06.679
<v Speaker 1>with what I mentioned before, the FFA isn't specialized just

0:20:06.800 --> 0:20:11.000
<v Speaker 1>for faces. It also plays a role in recognizing other

0:20:11.119 --> 0:20:15.920
<v Speaker 1>things that we're highly familiar with, like cars for car enthusiasts,

0:20:16.119 --> 0:20:20.480
<v Speaker 1>or birds for bird watchers. So that tells us slightly

0:20:20.520 --> 0:20:24.280
<v Speaker 1>more generally that this part of the brain specializes in

0:20:24.320 --> 0:20:28.159
<v Speaker 1>recognizing very detailed things that are relevant and that we

0:20:28.200 --> 0:20:31.920
<v Speaker 1>have a lot of experience seeing. So back to faces,

0:20:32.600 --> 0:20:35.640
<v Speaker 1>the FFA, it's not just that it's sensitive to faces.

0:20:36.040 --> 0:20:38.600
<v Speaker 1>It's actually critical if you want to see a face.

0:20:38.760 --> 0:20:42.240
<v Speaker 1>If you get damaged to the FFA or to its connections,

0:20:42.480 --> 0:20:47.159
<v Speaker 1>then you become unable to recognize faces. And will return

0:20:47.200 --> 0:20:49.440
<v Speaker 1>to that in a moment, but first I just want

0:20:49.480 --> 0:20:52.879
<v Speaker 1>to make one more thing clear. Like almost everything in

0:20:52.920 --> 0:20:56.840
<v Speaker 1>the brain, it's not just one area that's involved. Face

0:20:56.920 --> 0:21:02.480
<v Speaker 1>recognition uses a broader, specialized network. Beyond the FFA, you

0:21:02.560 --> 0:21:06.600
<v Speaker 1>have other areas involved, like the occipital face area and

0:21:06.640 --> 0:21:09.919
<v Speaker 1>the superior temporal sulcus. Don't worry about the details, but

0:21:09.960 --> 0:21:11.919
<v Speaker 1>the thing I want to surface here is that some

0:21:12.080 --> 0:21:15.320
<v Speaker 1>areas are more focused on the detailed features of the

0:21:15.359 --> 0:21:19.840
<v Speaker 1>face and others are sensitive to facial expressions and movements.

0:21:20.200 --> 0:21:24.119
<v Speaker 1>And the key is that together these areas form a

0:21:24.320 --> 0:21:29.360
<v Speaker 1>larger network that allows us to recognize faces and interpret

0:21:29.400 --> 0:21:32.080
<v Speaker 1>their expressions well. Your brain's also doing a lot of

0:21:32.119 --> 0:21:37.560
<v Speaker 1>work to gauge a person's emotions and then navigating social

0:21:37.560 --> 0:21:42.680
<v Speaker 1>interactions by carefully watching the reactions of someone's facial expressions.

0:21:43.200 --> 0:21:47.160
<v Speaker 1>We humans are super specialists at this because our well

0:21:47.200 --> 0:21:53.040
<v Speaker 1>being generally depends on our ability to recognize and understand

0:21:53.080 --> 0:21:55.439
<v Speaker 1>the people around us. And just to give you a

0:21:55.480 --> 0:21:59.080
<v Speaker 1>sense of how all these networks interact, there are studies

0:21:59.119 --> 0:22:04.000
<v Speaker 1>showing that you're interpretation of facial expressions is influenced by

0:22:04.040 --> 0:22:08.080
<v Speaker 1>the broader context. So just as an example, in one study,

0:22:08.400 --> 0:22:12.640
<v Speaker 1>you're shown faces with neutral expressions, but you're given different

0:22:13.119 --> 0:22:16.639
<v Speaker 1>contextual information about the person. So if you're told the

0:22:16.720 --> 0:22:21.879
<v Speaker 1>person is happy, you perceive the neutral face as slightly smiling,

0:22:22.160 --> 0:22:25.520
<v Speaker 1>and if you're told this person is sad, you perceive

0:22:25.600 --> 0:22:29.320
<v Speaker 1>that same face as slightly frowning. So, in other words,

0:22:29.320 --> 0:22:33.720
<v Speaker 1>your brain combines the facial information with the context of

0:22:33.800 --> 0:22:39.240
<v Speaker 1>the larger situation whenever you're interpreting emotions and for that matter, intentions.

0:22:39.720 --> 0:22:43.480
<v Speaker 1>So putting together the giant networks in the brain involved

0:22:43.480 --> 0:22:47.760
<v Speaker 1>in identifying and interpreting faces, the thing that becomes clear

0:22:47.840 --> 0:22:52.840
<v Speaker 1>is that we are massively finely tuned to process faces

0:22:53.320 --> 0:22:58.639
<v Speaker 1>in a way that benefits are social interactions and are survival. Now,

0:22:58.960 --> 0:23:02.119
<v Speaker 1>if you've been listening to other episodes of this podcast,

0:23:02.160 --> 0:23:05.560
<v Speaker 1>you'll know that I'm obsessed by the differences in the

0:23:05.600 --> 0:23:09.280
<v Speaker 1>internal life from person to person, from head to head.

0:23:09.800 --> 0:23:13.080
<v Speaker 1>And one place this really comes up is with individual

0:23:13.200 --> 0:23:18.560
<v Speaker 1>variability in face recognition. So, while most people are fairly

0:23:18.560 --> 0:23:23.080
<v Speaker 1>good at recognizing faces, there's a shocking amount of variation

0:23:23.520 --> 0:23:26.359
<v Speaker 1>in this ability. Some people are exceptionally good at it,

0:23:26.600 --> 0:23:29.600
<v Speaker 1>and other people struggle. And this brings us to the

0:23:29.680 --> 0:23:34.919
<v Speaker 1>concepts of super recognizers on the one hand, and face

0:23:35.160 --> 0:23:38.520
<v Speaker 1>blindness on the other hand. So let's start with the

0:23:38.880 --> 0:23:43.440
<v Speaker 1>super recognizers. These are people who have an extraordinary ability

0:23:43.760 --> 0:23:48.680
<v Speaker 1>to recognize faces. They can remember and identify faces even

0:23:48.720 --> 0:23:52.760
<v Speaker 1>after just brief encounters or after long periods of time.

0:23:53.040 --> 0:23:57.159
<v Speaker 1>Super recognizers are terrific at picking out their acquaintances and

0:23:57.320 --> 0:24:01.240
<v Speaker 1>really large crowds, and they sometimes end up getting employed

0:24:01.240 --> 0:24:05.879
<v Speaker 1>in security or law enforcement to identify perpetrators. Like they'll

0:24:05.880 --> 0:24:09.040
<v Speaker 1>be hired to watch the video of let's say a

0:24:09.080 --> 0:24:12.280
<v Speaker 1>subway entrance and you'll see thousands of people flowing in

0:24:12.320 --> 0:24:15.040
<v Speaker 1>and out, and they'll just stare at the feed for

0:24:15.160 --> 0:24:18.400
<v Speaker 1>hours or days, and then they'll say, oh, there's the guy. Now.

0:24:18.440 --> 0:24:20.920
<v Speaker 1>On the other end of the spectrum, we have what's

0:24:20.960 --> 0:24:25.000
<v Speaker 1>known as face blindness, and this is known as prosopagnosia.

0:24:25.320 --> 0:24:28.320
<v Speaker 1>This word comes from the Greek pro sopon meaning face,

0:24:28.480 --> 0:24:33.800
<v Speaker 1>and agnosia meaning lack of knowledge. So people with prosopagnosia

0:24:34.359 --> 0:24:38.879
<v Speaker 1>have a very hard time recognizing faces, even faces of

0:24:39.080 --> 0:24:42.120
<v Speaker 1>close family and friends. Now, by the way, this isn't

0:24:42.200 --> 0:24:46.360
<v Speaker 1>all or none. The condition ranges from mild to severe,

0:24:46.600 --> 0:24:52.640
<v Speaker 1>like difficulty recognizing familiar faces to an inability to distinguish

0:24:52.720 --> 0:24:55.240
<v Speaker 1>any faces at all. And at that far end of

0:24:55.280 --> 0:24:59.760
<v Speaker 1>the spectrum, just imagine not being able to recognize your

0:24:59.760 --> 0:25:03.880
<v Speaker 1>own spouse or child if you see them out of context.

0:25:04.320 --> 0:25:08.600
<v Speaker 1>And this is the reality for many people with prosopagnosia. Now,

0:25:08.800 --> 0:25:13.239
<v Speaker 1>how does prosopagnosia happen? Usually you're born with it. This

0:25:13.320 --> 0:25:17.160
<v Speaker 1>is called congenital prosopagnosia, just meaning it's present from birth.

0:25:17.800 --> 0:25:22.760
<v Speaker 1>In rarer cases, you can have acquired prosopagnosia, which just

0:25:22.840 --> 0:25:26.520
<v Speaker 1>means that you get it because of a brain injury

0:25:26.760 --> 0:25:29.760
<v Speaker 1>later in life, like a stroke or a traumatic brain injury.

0:25:30.000 --> 0:25:33.960
<v Speaker 1>And in these acquired cases, people notice the sudden change

0:25:33.960 --> 0:25:38.000
<v Speaker 1>in their ability to recognize faces, which is obviously distressing

0:25:38.040 --> 0:25:41.439
<v Speaker 1>and isolating. But what's fascinating is that people born with

0:25:41.600 --> 0:25:44.720
<v Speaker 1>this usually don't realize they have it because they've never

0:25:44.800 --> 0:25:48.280
<v Speaker 1>known anything different, and so they just get along by

0:25:48.359 --> 0:25:52.399
<v Speaker 1>using other cues like people's voices or their clothing or

0:25:52.440 --> 0:25:56.119
<v Speaker 1>the way they walk, and this is how they recognize people.

0:25:56.400 --> 0:25:59.679
<v Speaker 1>One example of a person who had prosopagnosia was the

0:25:59.800 --> 0:26:03.119
<v Speaker 1>neurologist Oliver Sacks, and he wrote an article in The

0:26:03.119 --> 0:26:05.560
<v Speaker 1>New Yorker in twenty ten on this. He talked about

0:26:05.560 --> 0:26:09.600
<v Speaker 1>his personal lifelong struggle with face blindness, but he never

0:26:09.720 --> 0:26:13.680
<v Speaker 1>realized there was something unusual about this until his middle age.

0:26:13.760 --> 0:26:16.520
<v Speaker 1>He'd always just been really bad with recognizing faces, but

0:26:16.600 --> 0:26:19.560
<v Speaker 1>that was just the way it was. But he went

0:26:19.600 --> 0:26:22.120
<v Speaker 1>to visit an older brother in Australia and they got

0:26:22.160 --> 0:26:24.720
<v Speaker 1>to talking and he realized that his brother had the

0:26:24.800 --> 0:26:27.919
<v Speaker 1>same problems with faces that he did, and it dawned

0:26:27.920 --> 0:26:31.520
<v Speaker 1>on him that this was something beyond normal variation, and

0:26:31.560 --> 0:26:35.320
<v Speaker 1>that they both had this trait of face blindness, and

0:26:35.480 --> 0:26:39.040
<v Speaker 1>he guessed that there was probably some distinctive genetic basis

0:26:39.040 --> 0:26:42.359
<v Speaker 1>to it. And very informally, I've noticed among people that

0:26:42.440 --> 0:26:47.680
<v Speaker 1>I know that those with prosopagnosia often don't enjoy movies

0:26:47.800 --> 0:26:50.720
<v Speaker 1>as much as other people, because in order to follow

0:26:50.800 --> 0:26:54.520
<v Speaker 1>a plot, especially if it's switching between an A story

0:26:54.520 --> 0:26:56.720
<v Speaker 1>and a B story and a C story, in order

0:26:56.760 --> 0:26:59.639
<v Speaker 1>to follow that plot, you really need to get it

0:27:00.040 --> 0:27:03.040
<v Speaker 1>when this actor comes into the scene who you haven't

0:27:03.040 --> 0:27:06.280
<v Speaker 1>seen for fifteen minutes and who's now wearing different clothes,

0:27:06.760 --> 0:27:09.679
<v Speaker 1>but this is just a continuation of his plot, and

0:27:09.760 --> 0:27:13.480
<v Speaker 1>everyone else in the audience recognizes him and immediately remembers

0:27:13.520 --> 0:27:15.399
<v Speaker 1>what was going on with him in the last act.

0:27:16.000 --> 0:27:19.000
<v Speaker 1>But just try to imagine how difficult it makes things

0:27:19.320 --> 0:27:22.840
<v Speaker 1>to follow along if you don't immediately recognize the person.

0:27:23.160 --> 0:27:25.639
<v Speaker 1>And by the way, if you don't have prosopagnosia, you

0:27:25.720 --> 0:27:28.240
<v Speaker 1>might be thinking what would that be like? How could

0:27:28.320 --> 0:27:32.960
<v Speaker 1>you not recognize a person? A common analogy is to

0:27:33.000 --> 0:27:36.440
<v Speaker 1>think about what it's like to walk through a forest

0:27:36.720 --> 0:27:40.200
<v Speaker 1>and just imagine if you had to remember and distinguish

0:27:40.520 --> 0:27:43.240
<v Speaker 1>every tree that you see. Imagine you had to have

0:27:43.280 --> 0:27:45.280
<v Speaker 1>a name for every tree, and you were quizzed on

0:27:45.359 --> 0:27:48.560
<v Speaker 1>it later. That sounds impossible to most of us, but

0:27:48.640 --> 0:27:51.840
<v Speaker 1>this is what daily life is like for the person

0:27:51.920 --> 0:27:56.160
<v Speaker 1>with face blindness. There wandering through a forest of people,

0:27:56.720 --> 0:28:01.320
<v Speaker 1>all of whom look essentially indistinguish. Now, I just want

0:28:01.320 --> 0:28:05.320
<v Speaker 1>to note beyond issues of following movie plots, prosopagnosia can

0:28:05.359 --> 0:28:10.080
<v Speaker 1>have pretty significant social effects like anxiety and embarrassment and

0:28:10.119 --> 0:28:15.040
<v Speaker 1>frustration because of and inability to recognize other people, And

0:28:15.160 --> 0:28:19.120
<v Speaker 1>for children with face blindness, social development can be tough

0:28:19.160 --> 0:28:22.479
<v Speaker 1>because you need to recognize your peers and your teacher,

0:28:23.040 --> 0:28:27.480
<v Speaker 1>and occasionally kids will get misdiagnosed with other learning or

0:28:27.480 --> 0:28:31.480
<v Speaker 1>behavioral disorders, which complicates things for them and an adults,

0:28:31.560 --> 0:28:36.520
<v Speaker 1>face blindness can lead to social isolation or in difficulties

0:28:36.560 --> 0:28:41.880
<v Speaker 1>in forming and maintaining relationships. So super recognizers and people

0:28:41.880 --> 0:28:48.040
<v Speaker 1>with prosopagnosia they represent extreme ends of the face recognition spectrum.

0:28:48.080 --> 0:28:51.840
<v Speaker 1>Most people fall somewhere in between, but with varying degrees

0:28:51.960 --> 0:28:57.040
<v Speaker 1>of ability and understanding the spectrum in face recognition. This

0:28:57.120 --> 0:29:00.880
<v Speaker 1>is how we can get insight into the differences between

0:29:01.080 --> 0:29:15.120
<v Speaker 1>people's realities. Okay, now I just want to say lest

0:29:15.200 --> 0:29:18.640
<v Speaker 1>you think that these issues about recognizing faces just show

0:29:18.720 --> 0:29:21.960
<v Speaker 1>up in podcasts, In fact, they show up in courtrooms

0:29:22.000 --> 0:29:25.680
<v Speaker 1>all the time, most notably in the very stormy world

0:29:26.040 --> 0:29:30.400
<v Speaker 1>of eyewitness identification. If you check out my episode nineteen,

0:29:30.880 --> 0:29:34.120
<v Speaker 1>I talked all about the difficulties of eyewitness identification, which

0:29:34.160 --> 0:29:38.920
<v Speaker 1>is generally very difficult because of fundamental flaws in memory.

0:29:39.240 --> 0:29:41.360
<v Speaker 1>But there are other challenges to this as well, and

0:29:41.400 --> 0:29:44.640
<v Speaker 1>the main one of interest here is when people are

0:29:44.800 --> 0:29:48.600
<v Speaker 1>asked to reproduce faces, as in who did you see

0:29:48.680 --> 0:29:51.480
<v Speaker 1>at the scene of the crime. So what police did

0:29:51.480 --> 0:29:55.280
<v Speaker 1>for centuries was to have a trained artist who sketches

0:29:55.360 --> 0:29:58.600
<v Speaker 1>what you describe. But of course it's really hard to

0:29:58.680 --> 0:30:01.840
<v Speaker 1>describe a face, and so by the late nineteen fifties,

0:30:01.920 --> 0:30:05.600
<v Speaker 1>the Los Angeles Police Department introduced a new way of

0:30:05.720 --> 0:30:10.120
<v Speaker 1>doing this where you line up individual strips for the eyes,

0:30:10.280 --> 0:30:13.440
<v Speaker 1>the eyebrows, the nose, the mouths, and so on. So

0:30:13.520 --> 0:30:16.560
<v Speaker 1>instead of me asking you, hey, can you describe that

0:30:16.640 --> 0:30:20.720
<v Speaker 1>guy's face? Instead, you now get a bunch of possible

0:30:20.760 --> 0:30:24.560
<v Speaker 1>eyes and possible noses and mouths and ears and chins,

0:30:24.840 --> 0:30:28.040
<v Speaker 1>and you try to reconstruct it that way. And this

0:30:28.200 --> 0:30:31.200
<v Speaker 1>ended up spreading from Los Angeles to Scotland Yard by

0:30:31.200 --> 0:30:34.760
<v Speaker 1>the nineteen sixties, and they had some successes, and eventually

0:30:34.760 --> 0:30:38.480
<v Speaker 1>this became a computerized system where people can piece this

0:30:38.560 --> 0:30:41.320
<v Speaker 1>together on a computer, and so this was considered a

0:30:41.400 --> 0:30:44.760
<v Speaker 1>real success about how you can use things beyond just

0:30:45.280 --> 0:30:52.600
<v Speaker 1>unreliable verbal descriptions to identify perpetrators. But when researchers subjected

0:30:52.640 --> 0:30:54.920
<v Speaker 1>this to more careful study, it turned out that this

0:30:54.960 --> 0:30:59.040
<v Speaker 1>approach of piecing faces together this is pretty imperfect, and

0:30:59.080 --> 0:31:02.520
<v Speaker 1>the fundamental problem is that the composite is built from

0:31:02.640 --> 0:31:05.600
<v Speaker 1>pieces and parts, like the guy's eyes look like that,

0:31:05.720 --> 0:31:07.680
<v Speaker 1>and his mouth looks like that, his nose like that,

0:31:07.760 --> 0:31:10.520
<v Speaker 1>and so on. But that's not the way the human

0:31:10.600 --> 0:31:15.280
<v Speaker 1>visual system works. It recognizes faces based on the whole picture,

0:31:15.760 --> 0:31:19.880
<v Speaker 1>the gestalt, where the whole is perceived as more than

0:31:19.880 --> 0:31:23.000
<v Speaker 1>the sum of the parts. And so it turns out

0:31:23.280 --> 0:31:27.480
<v Speaker 1>that reconstructing a face from pieces and parts is not

0:31:27.680 --> 0:31:32.040
<v Speaker 1>really so easy. I'll also mention another face recognition issue

0:31:32.040 --> 0:31:34.400
<v Speaker 1>which shows up in courts all the time, and that's

0:31:34.480 --> 0:31:37.840
<v Speaker 1>the other race effect which I mentioned earlier. People are

0:31:37.880 --> 0:31:42.840
<v Speaker 1>more likely to misidentify individuals of other races, for example,

0:31:43.080 --> 0:31:46.840
<v Speaker 1>a Hispanic person identifying a Japanese person and so on.

0:31:47.160 --> 0:31:51.400
<v Speaker 1>This is not racism, it's just neuroscience. Your brain comes

0:31:51.440 --> 0:31:55.400
<v Speaker 1>to represent what you see around you, and so in

0:31:55.480 --> 0:31:58.160
<v Speaker 1>many ways, the legal system has to chew on the

0:31:58.240 --> 0:32:03.320
<v Speaker 1>issue of how brain recognized faces and wear brains don't

0:32:03.320 --> 0:32:06.920
<v Speaker 1>do it so well. Okay, So moving beyond courts into

0:32:06.960 --> 0:32:11.160
<v Speaker 1>the broader society, we're now entering a brave new world

0:32:11.400 --> 0:32:14.920
<v Speaker 1>of face recognition technology. There was a big moment in

0:32:15.040 --> 0:32:20.080
<v Speaker 1>twenty eighteen in China where new face recognition technology picked

0:32:20.160 --> 0:32:24.280
<v Speaker 1>out a person that the authorities wanted out of a

0:32:24.360 --> 0:32:28.160
<v Speaker 1>crowd of sixty thousand people at a concert, and apparently

0:32:28.320 --> 0:32:30.520
<v Speaker 1>when they came up and grabbed the guy, he was

0:32:30.720 --> 0:32:34.760
<v Speaker 1>infinitely surprised because he assumed that being in a giant

0:32:34.880 --> 0:32:37.720
<v Speaker 1>crowd made him safe, that no one would ever be

0:32:37.760 --> 0:32:40.440
<v Speaker 1>able to spot him, and he was of course right

0:32:40.520 --> 0:32:43.760
<v Speaker 1>that no one could, but the computer did. Now, this

0:32:43.960 --> 0:32:48.040
<v Speaker 1>was an example where face recognition technology performed at a

0:32:48.200 --> 0:32:52.600
<v Speaker 1>superhuman level, but note that the technology often messes up

0:32:52.680 --> 0:32:57.600
<v Speaker 1>and also does something that's very human paradolia. We always

0:32:57.640 --> 0:33:02.280
<v Speaker 1>see algorithms mistakenly idea identifying faces where there aren't any,

0:33:02.520 --> 0:33:06.400
<v Speaker 1>which mimics the same tendencies that we have as humans.

0:33:06.720 --> 0:33:09.440
<v Speaker 1>For example, remember the jack O lantern that I mentioned

0:33:09.480 --> 0:33:14.600
<v Speaker 1>earlier that the Facebook algorithm mistakenly identified as a human face.

0:33:14.920 --> 0:33:18.440
<v Speaker 1>Computers tend to impose patterns as much as we do.

0:33:18.760 --> 0:33:20.480
<v Speaker 1>And I just want to say that even though we're

0:33:20.480 --> 0:33:24.640
<v Speaker 1>often worried about new technologies and its face recognition abilities,

0:33:24.800 --> 0:33:27.480
<v Speaker 1>what's also going to grow out of this are increasingly

0:33:27.520 --> 0:33:32.840
<v Speaker 1>sophisticated assist of devices. You have AI driven tools that

0:33:32.880 --> 0:33:37.200
<v Speaker 1>are going to offer real time face recognition support which

0:33:37.240 --> 0:33:40.200
<v Speaker 1>is going to allow you to recognize your friend when

0:33:40.240 --> 0:33:42.520
<v Speaker 1>you see him totally out of context. But more importantly,

0:33:43.400 --> 0:33:46.520
<v Speaker 1>this has the potential to enhance the quality of life

0:33:47.000 --> 0:33:51.960
<v Speaker 1>for people with prosopagnosia. So let's wrap up. Face recognition

0:33:52.280 --> 0:33:57.200
<v Speaker 1>is a remarkably computationally intensive thing that we do, and

0:33:57.280 --> 0:34:01.440
<v Speaker 1>even though it seems effortless, we have these massive, specialized

0:34:01.440 --> 0:34:05.640
<v Speaker 1>neural networks that underpin our ability to do this. This

0:34:05.800 --> 0:34:10.239
<v Speaker 1>ability highlights the very special role that faces play in

0:34:10.280 --> 0:34:13.799
<v Speaker 1>our social lives and in our interactions, and it's so

0:34:13.840 --> 0:34:17.759
<v Speaker 1>important that we all experience things like paradolia, where we

0:34:18.120 --> 0:34:23.319
<v Speaker 1>impose an interpretation of faces on other patterns all around us,

0:34:23.920 --> 0:34:27.719
<v Speaker 1>and this underscores our brain's need to make sense of

0:34:27.800 --> 0:34:32.360
<v Speaker 1>the world, to impose order on chaos, and to connect

0:34:32.600 --> 0:34:35.960
<v Speaker 1>what we see with what we know. And finally, we

0:34:36.000 --> 0:34:40.000
<v Speaker 1>saw that skill in face recognition varies widely from person

0:34:40.040 --> 0:34:43.600
<v Speaker 1>to person. We can measure this from brain imaging in

0:34:43.680 --> 0:34:49.040
<v Speaker 1>people's performances, and we find a spectrum from super recognizers

0:34:49.400 --> 0:34:53.680
<v Speaker 1>to those who are face blind. And understanding how different

0:34:53.719 --> 0:34:57.120
<v Speaker 1>reality can be on the inside is critical for living

0:34:57.120 --> 0:35:00.920
<v Speaker 1>in a society and understanding one another as it stands now,

0:35:01.000 --> 0:35:04.640
<v Speaker 1>most people on the planet are unaware of this spectrum.

0:35:04.640 --> 0:35:07.920
<v Speaker 1>They've never heard of something like prosapagnosia, and as a result,

0:35:08.280 --> 0:35:11.640
<v Speaker 1>they don't recognize it in themselves or in a loved one.

0:35:11.880 --> 0:35:15.160
<v Speaker 1>So this is one of the things we gain from

0:35:15.239 --> 0:35:20.239
<v Speaker 1>a deeper understanding of the brain, a broader empathy that

0:35:20.360 --> 0:35:25.799
<v Speaker 1>allows societies to interact more richly. So the next time

0:35:26.200 --> 0:35:30.160
<v Speaker 1>you see a newborn baby, lock onto its mother's face

0:35:30.320 --> 0:35:34.279
<v Speaker 1>and track the face and study it, just remember that

0:35:34.320 --> 0:35:38.480
<v Speaker 1>you're not just seeing something cute, you are catching insight

0:35:39.000 --> 0:35:46.560
<v Speaker 1>into deep circuitry of the inner cosmos. Go to Eagleman

0:35:46.600 --> 0:35:49.760
<v Speaker 1>dot com slash podcast for more information and to find

0:35:49.840 --> 0:35:53.840
<v Speaker 1>further reading. Send me an email at podcasts at egleman

0:35:53.880 --> 0:35:57.000
<v Speaker 1>dot com with questions or discussion, and check out and

0:35:57.040 --> 0:36:00.719
<v Speaker 1>subscribe to Inner Cosmos on YouTube for video of each

0:36:00.760 --> 0:36:05.600
<v Speaker 1>episode and to leave comments. Until next time. I'm David Eagleman,

0:36:05.800 --> 0:36:07.759
<v Speaker 1>and this is Inner Cosmos.