WEBVTT - Ep31 "Why do we see #TheDress differently?" 0:00:05.559 --> 0:00:08.959 What's up with those illusions on the Internet where you 0:00:09.000 --> 0:00:12.559 can hear the same sound one of two different ways 0:00:12.600 --> 0:00:15.640 depending on the word that you're looking at. And why 0:00:15.640 --> 0:00:19.159 do electrical outlets sometimes look like a face to you? 0:00:19.960 --> 0:00:23.440 How can you have full, rich visual experience with your 0:00:23.480 --> 0:00:27.080 eyes closed. And when you want to cross a street 0:00:27.160 --> 0:00:30.120 and you hit that crosswalk button, are some of those 0:00:30.120 --> 0:00:32.680 buttons fake and they don't actually do anything? 0:00:33.320 --> 0:00:34.519 And why are there some. 0:00:34.640 --> 0:00:38.040 Pictures that you can only see once you're told what 0:00:38.080 --> 0:00:41.920 you're looking at. And although brains are often celebrated for 0:00:41.960 --> 0:00:46.200 their parallel processing, what did they really be celebrated for. 0:00:49.880 --> 0:00:53.000 Welcome to Inner Cosmos with Me David Eagleman. I'm a 0:00:53.080 --> 0:00:57.560 neuroscientist and author at Stanford and in these episodes we 0:00:57.680 --> 0:01:02.160 sail deeply into our three pounds universe to understand why 0:01:02.200 --> 0:01:05.040 we perceive the world in the ways that we do. 0:01:13.520 --> 0:01:18.080 Today's episode is about expectations and what that has to 0:01:18.120 --> 0:01:24.480 do with perception. Unless you were living in outer space 0:01:24.720 --> 0:01:28.000 or off the grid in twenty fifteen, your life was 0:01:28.120 --> 0:01:33.240 touched by a very tiny, specific event that happened on 0:01:33.280 --> 0:01:37.679 a small island in Scotland. Two young people were going 0:01:37.720 --> 0:01:40.680 to get married there, and a week before the wedding, 0:01:41.080 --> 0:01:43.720 the mother of the bride was shopping around for what 0:01:43.840 --> 0:01:47.680 she was going to wear. So she finds some outfits 0:01:47.760 --> 0:01:50.720 at a store down in Chester, England that she thinks 0:01:50.760 --> 0:01:54.080 will look nice, and while she's making the decision, she 0:01:54.360 --> 0:01:57.480 snaps pictures of each of them and she buys one 0:01:57.520 --> 0:01:57.800 of them. 0:01:58.520 --> 0:01:59.880 So she's driving home. 0:01:59.720 --> 0:02:03.560 After words and she texts the pictures of the three 0:02:03.600 --> 0:02:07.320 outfits to her daughter and she tells her that she 0:02:07.480 --> 0:02:10.560 had bought the third one, and no one could have 0:02:10.720 --> 0:02:14.280 ever guessed that this particular piece of clothing that she 0:02:14.400 --> 0:02:18.680 sent a picture of, this one piece garment, is about 0:02:18.760 --> 0:02:22.440 to become the most famous outfit that ever existed in 0:02:22.480 --> 0:02:27.080 the history of humankind, because the daughter writes back to 0:02:27.280 --> 0:02:32.680 clarify which outfit the mother had bought, and she texts, oh, 0:02:32.919 --> 0:02:37.200 the white and gold one, and the mother texts back, no, 0:02:37.760 --> 0:02:43.240 it's blue and black, and the daughter replies, Mom, if 0:02:43.280 --> 0:02:45.600 you think that's blue and black, you need to go 0:02:45.639 --> 0:02:49.400 and see the doctor. So the mother shows the phone 0:02:49.440 --> 0:02:52.359 to her partner in the car, who, despite having been 0:02:52.400 --> 0:02:54.519 there and bought the dress with her, looks at the 0:02:54.560 --> 0:02:57.000 photo and says, yeah, I think it's white and gold. 0:02:57.680 --> 0:02:59.400 So when they get home, they show the picture to 0:02:59.440 --> 0:03:02.600 their younger, who agrees with the mother that the photo 0:03:02.639 --> 0:03:08.600 looks blue and black. So, given this funny disagreement, the 0:03:08.760 --> 0:03:11.880 bride to be posts the photo to her friends on 0:03:11.960 --> 0:03:16.560 Facebook to settle this, and to her surprise, she doesn't 0:03:16.600 --> 0:03:20.840 find consensus. Some think it's black and blue, others think 0:03:20.919 --> 0:03:25.800 it's white and gold, and each person feels totally certain 0:03:25.880 --> 0:03:29.160 about what they see. So for about a week, this 0:03:29.280 --> 0:03:34.200 debate bubbles around in this small island community. The day 0:03:34.200 --> 0:03:36.920 of the wedding arrives and the mother wears the dress 0:03:36.960 --> 0:03:40.280 to the event, and the issue about the photo becomes 0:03:40.480 --> 0:03:43.080 such a point of discussion that the musicians in the 0:03:43.120 --> 0:03:46.280 band allegedly almost didn't make it onto the stage to 0:03:46.360 --> 0:03:48.720 play because they were wrapped up in the debate. 0:03:49.720 --> 0:03:51.240 So a few days after. 0:03:50.960 --> 0:03:53.200 The wedding, one of the band members, who was a 0:03:53.280 --> 0:03:57.080 friend of the happy couple, she posts the photo to 0:03:57.200 --> 0:04:00.520 her blog on Tumblr, and by the end of the 0:04:00.600 --> 0:04:05.400 day it gets five thousand comments, and soon enough, the 0:04:05.640 --> 0:04:09.839 data scientists at Tumblr are examining this post because it's 0:04:09.880 --> 0:04:15.080 getting fourteen thousand views each second. That's close to a 0:04:15.160 --> 0:04:20.600 million views each minute. So a woman on the BuzzFeed 0:04:20.720 --> 0:04:24.360 social media team sets up a poll about the color 0:04:24.600 --> 0:04:27.680 for Tumblr users, and then she packs up and goes 0:04:27.720 --> 0:04:29.960 home on the subway. And by the time she gets 0:04:30.080 --> 0:04:34.440 off the subway, her phone is overwhelmed, and soon enough 0:04:34.480 --> 0:04:38.840 the BuzzFeed page hits new records for how many unique 0:04:38.920 --> 0:04:41.360 visitors were on the page at the same time, hitting 0:04:41.400 --> 0:04:45.599 almost seven hundred thousand. The number of comments on the 0:04:45.640 --> 0:04:50.200 original post increases tenfold that night. By late that night, 0:04:50.279 --> 0:04:56.120 there are five thousand tweets per minute using hashtag the dress, 0:04:56.320 --> 0:04:58.640 and by the middle of that night it's grown to 0:04:58.760 --> 0:05:02.560 eleven thousand tw wheets per minute. Within the week, more 0:05:02.600 --> 0:05:06.400 than ten million tweets are talking about the dress. This 0:05:06.680 --> 0:05:11.840 was the dress that, as they say, broke the Internet. Now, 0:05:11.920 --> 0:05:15.360 if you were, say a space alien, you might look 0:05:15.400 --> 0:05:18.920 at all this human activity and think, wait, what, why 0:05:19.080 --> 0:05:22.440 is the world stopping over a simple picture of a 0:05:22.560 --> 0:05:26.479 piece of clothing in the UK. Now, the answer, as 0:05:26.520 --> 0:05:28.640 you know, is that none of us humans would have 0:05:28.680 --> 0:05:33.040 found it interesting either, except that someone that you loved 0:05:33.040 --> 0:05:35.720 and trusted said, what do you mean you're seeing it 0:05:35.800 --> 0:05:39.880 that color? It's so clearly the other color, And you said, wait, 0:05:39.960 --> 0:05:42.599 what are you being serious? And they asked you the 0:05:42.640 --> 0:05:48.080 same and then the awe sets in. You both realize 0:05:48.120 --> 0:05:51.320 that you're looking at the same thing in the outside world, 0:05:51.680 --> 0:05:58.960 and you're having different perceptions a different experience on the inside. Now, 0:05:59.160 --> 0:06:02.880 no one was more excited about the dress than neuroscientists, 0:06:02.920 --> 0:06:08.000 because for neuroscientists this was a terrific demonstration of what 0:06:08.040 --> 0:06:11.400 we're going to talk about today. So to start things off, 0:06:11.520 --> 0:06:16.920 let's just point out how important these kinds of perceptual 0:06:17.000 --> 0:06:20.760 oddities are to neuroscience. I've spent a big chunk of 0:06:20.800 --> 0:06:26.760 my career studying illusions. I've published scientific papers about illusions 0:06:26.800 --> 0:06:30.280 in journals like Science and Nature, And some years ago 0:06:30.360 --> 0:06:33.960 I wrote a review article in the journal Nature Reviews Neuroscience, 0:06:34.240 --> 0:06:38.400 and I titled it Visual Illusions and the Brain, And 0:06:38.440 --> 0:06:42.760 in that article I laid out how powerful illusions are 0:06:43.000 --> 0:06:46.599 for figuring out what is under the hood. Sometimes I 0:06:46.600 --> 0:06:49.520 feel like illusions are interesting only to ten year olds 0:06:49.560 --> 0:06:53.880 and for most people they become nothing but entertainment. But truthfully, 0:06:54.040 --> 0:06:59.040 illusions are microscopes for understanding what is happening in the brain. 0:07:00.200 --> 0:07:04.640 Them we can reveal the systematic differences between what is 0:07:04.839 --> 0:07:07.719 actually out there in the world and what we believe 0:07:08.160 --> 0:07:12.600 is out there, And by dialing the illusion around carefully, 0:07:12.960 --> 0:07:16.840 we can usually put constraints on how the network of 0:07:16.960 --> 0:07:22.440 neurons must be operating. Now, most illusions are the type 0:07:22.440 --> 0:07:26.080 in which we measure what's being presented in the outside world, 0:07:26.240 --> 0:07:30.080 like two lines of identical lengths and you see it 0:07:30.160 --> 0:07:33.440 as two different lengths, and we say, ah, there's a 0:07:33.480 --> 0:07:37.400 systematic difference between what's on the page and what you perceive. 0:07:38.040 --> 0:07:40.000 Or maybe I show you two. 0:07:39.720 --> 0:07:43.520 Parallel lines against some background and you don't see them 0:07:43.640 --> 0:07:47.160 as parallel. Or you look at a totally static picture 0:07:47.200 --> 0:07:50.920 on a page and you swear that it's moving. But 0:07:51.000 --> 0:07:54.320 the dress was interesting because it wasn't that traditional kind 0:07:54.360 --> 0:07:58.800 of illusion. Instead, one person sees one thing and the 0:07:58.880 --> 0:08:01.480 person standing right next to them sees another. 0:08:02.640 --> 0:08:04.160 Now, what all. 0:08:03.880 --> 0:08:06.800 Illusions, including the dress, tell us right away is a 0:08:07.000 --> 0:08:11.160 foundational point that's not always intuitive, which is that we 0:08:11.240 --> 0:08:15.640 don't simply look at the world and passively receive what's 0:08:15.680 --> 0:08:22.240 out there. Instead, our brains actively construct our perception, and 0:08:22.640 --> 0:08:26.480 different brains can do so differently. So now let's move 0:08:26.600 --> 0:08:30.680 deeper into this mystery by turning to a different illusion. 0:08:31.000 --> 0:08:33.480 That took over the Internet a few years later, in 0:08:33.559 --> 0:08:35.240 May of twenty eighteen. 0:08:36.120 --> 0:08:40.040 Laurel Laurel, Laurel. 0:08:41.080 --> 0:08:44.520 Now, this was an audio file that was originally recorded 0:08:44.559 --> 0:08:47.800 by a reader in two thousand and seven for vocabulary 0:08:47.840 --> 0:08:51.360 dot com, and some students apparently re recorded that file 0:08:51.440 --> 0:08:54.679 while there was some background noise in a room. So 0:08:55.280 --> 0:08:59.360 a fifteen year old freshman in Georgia named Katie was 0:08:59.440 --> 0:09:03.120 listening to that recording and she realized that she was 0:09:03.200 --> 0:09:07.560 hearing some funny ambiguity, and she posted this little audio 0:09:07.600 --> 0:09:10.440 clip on Instagram, and the next day her friend posted 0:09:10.440 --> 0:09:12.400 it on Reddit, and then it got picked up on 0:09:12.440 --> 0:09:13.280 Twitter and. 0:09:13.320 --> 0:09:14.800 Soon it went nuts. 0:09:15.440 --> 0:09:20.120 Why Because just like the dress, people can have a 0:09:20.320 --> 0:09:25.040 different perception of the same item presented to their senses. 0:09:25.559 --> 0:09:29.240 About half the people hear the word yanny and the 0:09:29.400 --> 0:09:32.200 other half hear the word Laurel. 0:09:32.800 --> 0:09:38.600 Laurel, Laurel, Laurel, Laurel. 0:09:39.840 --> 0:09:44.640 Now, how can people hear different things? So hang tight, 0:09:44.720 --> 0:09:46.360 I'll tell you in a minute. But what I want 0:09:46.400 --> 0:09:48.720 to point out for now is that, just like the dress, 0:09:48.760 --> 0:09:52.040 some people have one experience, some people have another, same 0:09:52.160 --> 0:09:57.920 sound recording, different experiences. Now, the Yanny Laurel clip made 0:09:57.960 --> 0:10:00.960 its rounds on the internet, but it about the exact 0:10:01.040 --> 0:10:05.040 same time. In May of twenty eighteen, something even better 0:10:05.120 --> 0:10:09.920 surfaced on YouTube. A guy had posted a video where 0:10:09.960 --> 0:10:14.199 he was reviewing a children's toy from the ben Ten franchise, 0:10:14.679 --> 0:10:18.600 and the toy lights up and says something. And here's 0:10:18.640 --> 0:10:22.320 what it sounds like. It says the word green needle. 0:10:22.600 --> 0:10:34.080 So listen carefully for green needle. Okay, well, that's not 0:10:34.280 --> 0:10:37.079 actually what the toy was saying. It was actually saying 0:10:37.120 --> 0:10:41.600 the word brainstorm, which is the toy character's name. So 0:10:41.760 --> 0:10:52.320 listen for the word brainstorm. 0:10:52.440 --> 0:10:53.079 Now, I just. 0:10:53.040 --> 0:10:56.840 Played the exact same audio file in both cases, but 0:10:56.960 --> 0:11:01.800 depending on your expectation what you were listening for, you'll 0:11:01.920 --> 0:11:05.400 hear different things. So I'm going to play this file again, 0:11:05.840 --> 0:11:08.160 over and over for about twenty seconds, and I want 0:11:08.160 --> 0:11:13.240 you to think about brainstorm or think about green needle. 0:11:13.840 --> 0:11:16.840 Try to go back and forth about which one you're hearing. 0:11:17.000 --> 0:11:19.720 Switch your thinking from one to the other at any point. 0:11:39.200 --> 0:11:40.840 So, what the heck's going on here? 0:11:41.000 --> 0:11:45.480 How can a single audio file be heard two completely 0:11:45.480 --> 0:11:51.160 different ways? Seems like magic, but it's actually neuroscience. All 0:11:51.240 --> 0:11:56.679 these internet memes actually give deep insight into a fundamental 0:11:57.080 --> 0:12:00.640 and rarely appreciated property of the brain. So I'm going 0:12:00.679 --> 0:12:04.640 to unpack these illusions in a few steps. The first 0:12:04.720 --> 0:12:07.920 clue to the mystery is that the brain does not 0:12:08.240 --> 0:12:13.280 tolerate ambiguity. It really wants to come to a conclusion 0:12:13.440 --> 0:12:17.800 about exactly what's out there. Now, that's a major daily 0:12:17.920 --> 0:12:20.560 challenge for the brain because so much of what you 0:12:20.640 --> 0:12:25.080 see or hear is ambiguous. You have data points that 0:12:25.120 --> 0:12:28.400 come streaming into the brain through the eyes, or the ears, 0:12:28.480 --> 0:12:32.920 or the fingertips, but often they could be interpreted more 0:12:32.960 --> 0:12:36.120 than one way. So what does the brain do in 0:12:36.160 --> 0:12:41.520 this circumstance. It locks onto a single way of understanding it. 0:12:42.320 --> 0:12:46.320 In other words, if there are multiple possibilities, it'll force 0:12:46.440 --> 0:12:49.840 an answer. Now let's pause for just a moment to 0:12:49.880 --> 0:12:53.880 appreciate something here. When you read about the brain, you 0:12:53.960 --> 0:12:57.920 always see it celebrated for its parallel processing. It can 0:12:58.000 --> 0:13:01.280 do lots of things at once. But what it should 0:13:01.280 --> 0:13:04.240 be equally celebrated for, the thing that no one ever 0:13:04.320 --> 0:13:10.280 bothers to highlight is serialization. It takes lots of the 0:13:10.360 --> 0:13:13.840 activity and it squeezes it down to one thing. 0:13:14.080 --> 0:13:15.559 It serializes it. 0:13:15.559 --> 0:13:19.240 It takes an information that could be interpreted in lots 0:13:19.240 --> 0:13:22.160 of different ways, and it crunches it down to a 0:13:22.320 --> 0:13:23.520 single interpretation. 0:13:24.800 --> 0:13:28.760 Now, why is it so good at serializing, at. 0:13:28.600 --> 0:13:33.800 Getting possibilities down to a single answer, Because fundamentally, your 0:13:33.800 --> 0:13:38.240 brain has the challenge of controlling a giant body made 0:13:38.280 --> 0:13:41.800 of trillions of cells, and when you come to a 0:13:42.320 --> 0:13:45.880 tree in the path, it has to go either left 0:13:45.960 --> 0:13:48.719 or right around the tree. Because of the physics of 0:13:48.760 --> 0:13:51.040 the world, it cannot do both, and. 0:13:50.960 --> 0:13:53.360 So it has to make a single. 0:13:53.400 --> 0:13:57.520 Decision, go right or go left, and drag all those 0:13:57.559 --> 0:14:00.560 trillions of cells with it. Your brain it has to 0:14:00.640 --> 0:14:04.960 be good at taking possibilities and crushing them down to 0:14:05.080 --> 0:14:10.880 a single decision. And it's the same with your perceptual life. 0:14:11.360 --> 0:14:14.600 Your brain is used to dealing with a world where 0:14:14.640 --> 0:14:18.120 it has to come to conclusions, having to say, look, 0:14:18.160 --> 0:14:21.760 there are lots of possibilities here, but for me to 0:14:21.840 --> 0:14:24.520 function in the world, I have to make an assumption 0:14:25.000 --> 0:14:27.600 that what I am looking at is a piece of 0:14:27.640 --> 0:14:30.960 food or a boulder, or a bear at a distance 0:14:31.120 --> 0:14:36.000 or whatever. So the brain doesn't tolerate ambiguity, but it 0:14:36.080 --> 0:14:40.480 always says, all right, this is my answer okay, So 0:14:40.680 --> 0:14:45.120 now let's introduce one more perceptual illusion of this flavor, 0:14:45.600 --> 0:14:48.000 and then we're going to unpack what's going on. 0:14:49.480 --> 0:14:51.080 So surely you've seen this one before. 0:14:51.200 --> 0:14:54.080 You draw the outline of a cube on a piece 0:14:54.120 --> 0:14:57.240 of paper. You just draw a square, and then an 0:14:57.320 --> 0:15:00.720 offset square, and then lines connecting the corners of one 0:15:00.760 --> 0:15:03.320 to the corners of the other, so it's twelve lines. 0:15:03.400 --> 0:15:07.400 It's the outline of a cube. This little wireframe drawing 0:15:07.600 --> 0:15:09.920 is known as the Necker cube. 0:15:10.320 --> 0:15:13.479 Now you've seen this before, but as you know, if you've. 0:15:13.280 --> 0:15:18.080 Stared at one, it's perceptually ambiguous because if you stare 0:15:18.120 --> 0:15:21.320 at this little wireframe, it looks like it's coming out 0:15:21.400 --> 0:15:25.400 one way from the page, even though you could perceive. 0:15:25.080 --> 0:15:27.400 The same drawing in two different ways. 0:15:27.720 --> 0:15:30.600 Either the lower square is the face of the cube 0:15:30.640 --> 0:15:33.720 coming toward you, or the upper square is the one 0:15:33.760 --> 0:15:38.280 coming out toward you, but your brain makes a choice. Now, 0:15:38.320 --> 0:15:42.360 you could imagine a space alien who looks at this 0:15:42.400 --> 0:15:45.600 little drawing of the wireframe cube and says, okay, well, 0:15:46.040 --> 0:15:50.240 both configurations of the cube are equally probable, so I'll 0:15:50.240 --> 0:15:53.640 see it both ways at once. But we can't do that. 0:15:54.200 --> 0:15:56.960 We have to see it one way or the other. 0:15:57.120 --> 0:16:01.800 Your brain forces a single interpret and this is the 0:16:01.840 --> 0:16:06.280 same thing that's happening with the other illusions with the dress. 0:16:06.400 --> 0:16:09.640 You don't see it as both blue and black and 0:16:09.880 --> 0:16:12.520 white and gold. And in a minute we'll see why. 0:16:13.160 --> 0:16:14.680 The part I just want to say now is that 0:16:14.720 --> 0:16:17.960 your brain concludes that it is one or the other, 0:16:18.080 --> 0:16:22.440 and then it sticks with that. And likewise with Yanny Laurel. 0:16:23.080 --> 0:16:26.720 Both sounds are present in the audio file, but you 0:16:26.840 --> 0:16:30.640 don't hear Yanny and Laurel at the same time, stacked 0:16:30.680 --> 0:16:33.800 on one another. And it's exactly the same thing with 0:16:33.960 --> 0:16:39.440 brainstorm and green needle. Both interpretations are possible, but your 0:16:39.480 --> 0:16:43.920 brain won't do both at once. It collapses the possibilities 0:16:43.960 --> 0:16:48.600 to a single answer. In all these cases, even though 0:16:48.640 --> 0:16:52.160 the data is consistent with either interpretation, your brain makes 0:16:52.160 --> 0:16:55.280 a call. It goes left or right around the tree. 0:16:55.560 --> 0:16:58.800 You very clearly perceive one or the other. And this 0:16:58.920 --> 0:17:03.040 is because the brain isn't passively receiving the world. It's 0:17:03.200 --> 0:17:25.000 making choices. Okay, but how does your brain know how 0:17:25.040 --> 0:17:30.119 to collapse ambiguous data to a single interpretation. It does 0:17:30.160 --> 0:17:35.760 so by leveraging assumptions, so let's go a level deeper 0:17:35.880 --> 0:17:38.920 with the dress. Why does it happen that some people 0:17:38.960 --> 0:17:41.359 see it one way and some people the other. It 0:17:41.520 --> 0:17:44.560 happens because your brain sees a picture of a dress 0:17:44.560 --> 0:17:50.120 in the shop and it makes dozens of assumptions totally unconsciously. Now, 0:17:50.160 --> 0:17:54.600 what's amazing is that the assumptions aren't directly about the dress, 0:17:55.240 --> 0:17:58.760 but about things you didn't even know you were thinking about. 0:17:58.800 --> 0:18:03.280 What is the light source in the photograph? Is the 0:18:03.440 --> 0:18:07.840 dress mostly being lit by fluorescent lights or by sunlight? 0:18:08.760 --> 0:18:12.240 Is the dress facing a window or is the window 0:18:12.280 --> 0:18:15.840 behind it? What time of day is it, what season 0:18:16.000 --> 0:18:20.320 is it? Your brain is considering all of these questions, 0:18:20.840 --> 0:18:23.720 and fundamentally, this all has to do with a computation 0:18:23.840 --> 0:18:30.080 that it does known as color constancy. Color constancy is 0:18:30.160 --> 0:18:34.960 this sophisticated ability of our visual systems to perceive the 0:18:35.000 --> 0:18:39.000 color of something as constant even when the light source 0:18:39.040 --> 0:18:43.119 the illumination changes. So let's say I'm wearing a white 0:18:43.280 --> 0:18:46.160 T shirt and we're standing outside talking in the sunlight. 0:18:46.560 --> 0:18:50.120 You will see my shirt as white. Now we go 0:18:50.280 --> 0:18:54.479 indoors into the coffee shop and the illuminant changes. In 0:18:54.480 --> 0:18:57.720 other words, the light that's bouncing off my t shirt changes. 0:18:58.359 --> 0:19:03.280 Now it's fluorescent light compared to sunlight. The fluorescent light 0:19:03.359 --> 0:19:06.679 has a different spectrum of colors coming out, and so 0:19:06.720 --> 0:19:09.760 when those bounce off my shirt, you have a different 0:19:09.960 --> 0:19:14.600 spectrum of colors hitting your eyes, and yet you still 0:19:14.600 --> 0:19:17.800 see it as white. And then that night we go 0:19:17.920 --> 0:19:21.680 into a dance club and the lighting is blue, and 0:19:21.760 --> 0:19:25.280 yet you have no problem seeing the shirt as white, 0:19:25.600 --> 0:19:28.960 even though it's mostly blue light reflecting off the shirt 0:19:29.240 --> 0:19:32.520 into your eyes. And then afterwards we go sit by 0:19:32.640 --> 0:19:37.399 a campfire and my shirt still looks white. Your brain 0:19:37.520 --> 0:19:41.359 retains a constant perception of the color of the shirt 0:19:41.800 --> 0:19:45.240 even though the wavelengths bouncing off of it are very different. 0:19:46.320 --> 0:19:50.159 So what does this tell us, Well, it means that 0:19:50.240 --> 0:19:53.560 the way your brain determines the color is not just 0:19:53.680 --> 0:19:56.560 about the colors hitting your eye from the shirt. It 0:19:56.600 --> 0:19:59.720 has to do with something else. And that's something else 0:20:00.119 --> 0:20:04.560 is everything else in the scene. So when you're looking 0:20:04.600 --> 0:20:08.480 at my shirt, your eyes are drinking in everything else. 0:20:09.119 --> 0:20:12.840 The background, the color of the skin on my arms, 0:20:12.880 --> 0:20:17.000 the color of the floors and walls, the colors of 0:20:17.359 --> 0:20:18.120 all the other. 0:20:18.359 --> 0:20:20.680 Jeans and shirts and signs in the. 0:20:20.640 --> 0:20:24.720 Whole scene, and it uses all of that to estimate 0:20:24.800 --> 0:20:30.160 the background illumination and then make the right computation about 0:20:30.200 --> 0:20:34.199 the color of the shirt in the sunlight and the 0:20:34.280 --> 0:20:36.959 coffee shop, at the dance club, at the campfire. 0:20:37.359 --> 0:20:38.520 It's doing all of. 0:20:38.480 --> 0:20:43.080 These computations, and this is what allows it to subtract 0:20:43.320 --> 0:20:46.840 off the background lighting so that it can see what 0:20:47.040 --> 0:20:52.040 color things are most likely to actually be. That's the 0:20:52.080 --> 0:20:56.760 phenomenon of color constancy. The color of the shirt remains 0:20:56.840 --> 0:21:01.920 constant even under different illumination, and that's what allows us 0:21:01.920 --> 0:21:05.720 to see the colors of objects in the world consistently, 0:21:05.760 --> 0:21:10.120 whether we're looking under sunlight or moonlight or firelight or whatever. 0:21:11.000 --> 0:21:14.480 So the first lesson is you're not just seeing what's 0:21:14.600 --> 0:21:19.359 out there. Your brain is actively interpreting information and serving 0:21:19.480 --> 0:21:22.040 up a story to you. And I'll go into this 0:21:22.080 --> 0:21:24.399 more in a future episode. But this is why we 0:21:24.440 --> 0:21:29.600 can see strawberries as red. For example, when we change 0:21:29.600 --> 0:21:32.920 the background color such that the actual light bouncing off 0:21:32.920 --> 0:21:37.440 the strawberries is gray light, your brain can nonetheless say, okay, well, 0:21:37.640 --> 0:21:40.880 given that everything else in the scene is now greenish, 0:21:41.400 --> 0:21:44.160 I can subtrack that off and know that I'm looking 0:21:44.200 --> 0:21:48.080 at something red. Now, in order to do all of 0:21:48.119 --> 0:21:50.960 this that I've been talking about, your brain has to 0:21:51.040 --> 0:21:55.439 make lots of assumptions about what the color should be, 0:21:56.200 --> 0:22:01.239 and different brains do it differently. With the dress, you 0:22:01.320 --> 0:22:05.679 see it as either white and gold or blue and black, 0:22:06.240 --> 0:22:10.680 depending on the assumptions your brain is making. When you 0:22:10.840 --> 0:22:14.000 glance at the photo on your phone, you have no 0:22:14.119 --> 0:22:17.720 idea that your brain is doing all those sophisticated computations 0:22:17.840 --> 0:22:21.359 under the hood to tell you what is the actual 0:22:21.440 --> 0:22:25.760 color of this garment, given my assumptions about all the 0:22:25.880 --> 0:22:27.040 lighting details. 0:22:27.760 --> 0:22:29.240 The issue is that your. 0:22:29.000 --> 0:22:32.520 Brain grew up in a particular environment, maybe with a 0:22:32.560 --> 0:22:35.439 lot of snow or a lot of sunlight or fog, 0:22:36.119 --> 0:22:39.359 and your brain makes assumptions about the time of day 0:22:39.720 --> 0:22:43.199 and the season and the balance of artificial lighting to 0:22:43.320 --> 0:22:47.720 natural lighting. To make sense of this little photo, what 0:22:48.040 --> 0:22:52.199 hues does the lighting contain. If your brain ignores a 0:22:52.200 --> 0:22:54.920 bit of the blue side, you'll see the dress as 0:22:55.040 --> 0:22:58.159 white and gold. If your brain pays less attention to 0:22:58.200 --> 0:23:00.840 the yellow side of the spectrum, you'll see it as 0:23:01.040 --> 0:23:05.000 blue and black. You have no insight into the fact 0:23:05.240 --> 0:23:08.560 that your brain is making all these assumptions under the hood. 0:23:09.280 --> 0:23:11.919 Was the photo of the dress taken with the window 0:23:11.960 --> 0:23:13.120 facing it or behind it. 0:23:13.200 --> 0:23:15.080 Was it morning light or afternoon light? 0:23:15.760 --> 0:23:18.119 And is your experience of the world based on the 0:23:18.160 --> 0:23:21.919 fact that you are a mourning lark or you are. 0:23:21.800 --> 0:23:22.800 A night owl. 0:23:23.080 --> 0:23:26.600 One of my colleagues, Pascal Wallash, showed that people who 0:23:26.640 --> 0:23:30.119 were early risers were more likely to think that the 0:23:30.200 --> 0:23:33.600 dress was lit by natural light, and so they saw 0:23:33.640 --> 0:23:37.840 it as white and gold, but night owls presumably had 0:23:37.880 --> 0:23:41.720 more assumptions about artificial lighting, and they were more likely 0:23:41.760 --> 0:23:45.399 to see the dress as blue and black. Your brain 0:23:45.560 --> 0:23:48.159 is determining the color of the dress by comparing it 0:23:48.280 --> 0:23:51.240 against the other objects of the background of the photo 0:23:51.640 --> 0:23:56.520 and making its best guess at all these parameters. So 0:23:56.920 --> 0:24:00.320 your brain relies on the answers to question is that 0:24:00.359 --> 0:24:03.439 you didn't even think it was asking and the idea 0:24:03.600 --> 0:24:07.600 of imposing assumptions. This is the same with Yanny and 0:24:07.680 --> 0:24:11.840 Laurel in the auditory domain, or with green needle and brainstorm. 0:24:12.040 --> 0:24:16.840 Your brain is imposing an interpretation. But what's interesting in 0:24:16.880 --> 0:24:20.640 this case is that the assumption can be changed more easily, 0:24:21.000 --> 0:24:26.280 typically by just staring at the word visually. Because your 0:24:26.400 --> 0:24:31.280 brain is trying to disambiguate what it's hearing, and suddenly 0:24:31.520 --> 0:24:34.400 it has lots of help from the visual system because 0:24:34.600 --> 0:24:39.800 it sees a word. So the frequencies of both words 0:24:39.880 --> 0:24:43.800 yany and laurel or green needle, brainstorm, they're contained in 0:24:43.920 --> 0:24:48.240 the audio file, so just depending on how you listen 0:24:48.359 --> 0:24:52.560 for it, you can hear one or the other. So 0:24:52.600 --> 0:24:57.120 the brain constantly nails down its world by making assumptions, 0:24:57.640 --> 0:25:00.640 and we see this with everything. And even though these 0:25:01.200 --> 0:25:04.199 internet memes get all of our attention, the fact is 0:25:04.600 --> 0:25:07.560 that our brains have to make assumptions all the time. 0:25:07.880 --> 0:25:10.840 And this is because most of the inputs from the 0:25:10.880 --> 0:25:16.160 world are quite noisy. For example, you can still understand 0:25:16.280 --> 0:25:20.120 me even if my speech is choppy, or if I'm 0:25:20.160 --> 0:25:23.399 speaking and there's lots of background noise like at a restaurant. 0:25:24.160 --> 0:25:27.200 What's actually hitting your ears in these scenarios is a 0:25:27.320 --> 0:25:31.960 very messy signal, But the brain imposes an interpretation about 0:25:32.200 --> 0:25:34.920 what must have been said, and that's what you perceive 0:25:35.440 --> 0:25:38.800 what you believe you heard. A lot of your cell 0:25:38.880 --> 0:25:43.320 phone conversations are super noisy, but you typically don't realize 0:25:43.320 --> 0:25:48.919 it because you keep making your reasonable interpretations. Now, this 0:25:49.040 --> 0:25:52.880 is true of most of what is hitting your eyes 0:25:52.920 --> 0:25:56.399 and ears. You don't catch a fraction of the data, 0:25:56.800 --> 0:25:59.439 but your brain fills in the details to put together 0:25:59.480 --> 0:26:01.959 a story. And this, by the way, is what's at 0:26:02.000 --> 0:26:04.680 the heart of a lot of art and graphic design. 0:26:05.040 --> 0:26:08.000 You just see a few curves and you interpret it 0:26:08.080 --> 0:26:11.720 as a face, or a series of segmented lines and 0:26:11.800 --> 0:26:16.440 you interpret that as a body. We are always operating 0:26:16.480 --> 0:26:19.919 off thin data, but that doesn't stop us from coming 0:26:19.960 --> 0:26:24.440 to clear conclusions. And before I explain how our neural 0:26:24.480 --> 0:26:28.040 networks go about making these assumptions, let's just take a 0:26:28.080 --> 0:26:31.919 second to look at how your brain is so imperfect 0:26:32.040 --> 0:26:36.919 at this. Take paridolia, which is when you perceive a 0:26:37.119 --> 0:26:41.160