WEBVTT - Augmenting Your Reality

0:00:04.160 --> 0:00:07.160
<v Speaker 1>Get in text with technology with tex Stuff from how

0:00:07.240 --> 0:00:14.160
<v Speaker 1>stuff works dot com. Hey there, and welcome to tex Stuff.

0:00:14.200 --> 0:00:17.680
<v Speaker 1>I am your host, Jonathan Strickland, and today I'm going

0:00:17.760 --> 0:00:22.000
<v Speaker 1>to augment your reality, or at least tell you what

0:00:22.320 --> 0:00:24.720
<v Speaker 1>augmented reality and a R is all about. And if

0:00:24.800 --> 0:00:28.040
<v Speaker 1>this seems like it's deja vu, that means you are

0:00:28.240 --> 0:00:33.600
<v Speaker 1>a long time listener of tech stuff, because we did

0:00:33.840 --> 0:00:38.120
<v Speaker 1>our episode about a are way back in two thousand nine.

0:00:38.159 --> 0:00:40.839
<v Speaker 1>That's when Chris Palette was my co host and together

0:00:40.880 --> 0:00:43.879
<v Speaker 1>we recorded an episode that we called What in the

0:00:43.920 --> 0:00:47.640
<v Speaker 1>World Is Augmented Reality? And that was a pretty short

0:00:47.680 --> 0:00:50.720
<v Speaker 1>overview of the concept, like twenty minutes long, and that

0:00:50.800 --> 0:00:55.840
<v Speaker 1>included two different listener males. So I thought I would

0:00:55.840 --> 0:00:59.960
<v Speaker 1>do a deeper dive, a bigger explanation about what augment

0:01:00.040 --> 0:01:03.200
<v Speaker 1>the reality is, what it's all about, how it works,

0:01:03.560 --> 0:01:08.400
<v Speaker 1>and sort of the applications we might put a R

0:01:08.560 --> 0:01:12.520
<v Speaker 1>toward things that, you know, was it good for tons

0:01:12.520 --> 0:01:16.000
<v Speaker 1>of stuff? As it turns out, So the first thing

0:01:16.040 --> 0:01:19.520
<v Speaker 1>we should do, it's probably defined some terms, because if

0:01:19.560 --> 0:01:23.280
<v Speaker 1>you haven't really looked into augmented reality and you aren't

0:01:23.319 --> 0:01:27.039
<v Speaker 1>familiar with a R, you might just be lost. I'm

0:01:27.040 --> 0:01:30.240
<v Speaker 1>gonna define it all for you right now, because that's

0:01:30.240 --> 0:01:33.360
<v Speaker 1>the kind of stand up guy I am. Technically speaking,

0:01:33.400 --> 0:01:38.520
<v Speaker 1>augmented reality is using digital information to enhance or augment

0:01:39.200 --> 0:01:44.319
<v Speaker 1>and experience in our physical real world. So the way

0:01:44.360 --> 0:01:49.080
<v Speaker 1>we usually see this implemented involves some sort of display

0:01:49.200 --> 0:01:51.720
<v Speaker 1>that has an image of the real world on it,

0:01:51.800 --> 0:01:54.800
<v Speaker 1>and it overlays digital information on top of that image.

0:01:54.880 --> 0:01:58.840
<v Speaker 1>So think of like a camera's viewfinder, like an LCD

0:01:59.280 --> 0:02:02.840
<v Speaker 1>screen on a camera, and it actually labels the buildings

0:02:02.880 --> 0:02:05.240
<v Speaker 1>that are in view. When you're out on the street

0:02:05.320 --> 0:02:08.560
<v Speaker 1>and you hold the camera up, or a smartphone or

0:02:08.600 --> 0:02:12.200
<v Speaker 1>even a wearable device like a head mounted display that

0:02:12.280 --> 0:02:15.160
<v Speaker 1>you can look through so you can see the real world.

0:02:15.200 --> 0:02:17.760
<v Speaker 1>You're not just staring at a screen, or if you

0:02:17.800 --> 0:02:20.320
<v Speaker 1>are staring at a screen, you're staring at a video

0:02:20.440 --> 0:02:23.640
<v Speaker 1>feed that is provided by an external camera amounted just

0:02:23.720 --> 0:02:25.320
<v Speaker 1>on the other side of the screen. So it's like

0:02:25.400 --> 0:02:28.680
<v Speaker 1>you're looking through a display in the first place, but

0:02:28.760 --> 0:02:32.440
<v Speaker 1>then on top of that view you have this digital information.

0:02:33.000 --> 0:02:35.880
<v Speaker 1>That's the most common implementation we talk about, but it's

0:02:35.880 --> 0:02:38.520
<v Speaker 1>not the only one. Augmented reality does not have to

0:02:38.639 --> 0:02:43.760
<v Speaker 1>only be or even involve visual information at all. You

0:02:43.760 --> 0:02:47.880
<v Speaker 1>could have audio only augmented reality, for example, But the

0:02:47.919 --> 0:02:51.360
<v Speaker 1>whole idea is that it's something that has created digitally

0:02:51.720 --> 0:02:55.280
<v Speaker 1>to enhance your experience in the real world. Now we

0:02:55.320 --> 0:03:01.160
<v Speaker 1>can contrast this with the concept of virtual reality. Virtual reality,

0:03:01.200 --> 0:03:04.760
<v Speaker 1>of course, is a term where you create an experience

0:03:04.880 --> 0:03:10.200
<v Speaker 1>completely through computer generated means a computer is making all

0:03:10.240 --> 0:03:14.240
<v Speaker 1>the things you see and here, and maybe even beyond

0:03:14.280 --> 0:03:18.079
<v Speaker 1>that if you have really sophisticated UH setups, so you

0:03:18.160 --> 0:03:21.040
<v Speaker 1>might have some haptic feedback. Haptic refers to your sense

0:03:21.080 --> 0:03:23.520
<v Speaker 1>of touch, so if you have haptic feedback, that means

0:03:23.560 --> 0:03:27.840
<v Speaker 1>you're getting information feedback through your sense of touch. Common

0:03:27.880 --> 0:03:32.400
<v Speaker 1>example of this is a rumble pack inside a game controller,

0:03:32.440 --> 0:03:34.320
<v Speaker 1>where you know you fire a gun and a first

0:03:34.360 --> 0:03:39.080
<v Speaker 1>person shooter and your controller rumbles as a result, letting

0:03:39.080 --> 0:03:44.440
<v Speaker 1>you know that you are in fact, unleashing virtual destruction

0:03:44.600 --> 0:03:48.320
<v Speaker 1>upon all you survey. Well, the same thing can be

0:03:48.360 --> 0:03:52.880
<v Speaker 1>true with virtual reality set up. So virtual reality is

0:03:52.920 --> 0:03:58.400
<v Speaker 1>all about constructing an artificial reality a simulated reality. Augmented

0:03:58.440 --> 0:04:01.000
<v Speaker 1>reality is all about enhance and seeing the one that

0:04:01.080 --> 0:04:05.840
<v Speaker 1>we are actually in. And then there's also mixed reality.

0:04:06.240 --> 0:04:09.800
<v Speaker 1>Mixed reality is kind of sort of in between the two.

0:04:09.880 --> 0:04:14.560
<v Speaker 1>You might have some physical objects within a room that

0:04:14.760 --> 0:04:18.640
<v Speaker 1>are also mapped to a virtual environment, and then you

0:04:18.800 --> 0:04:22.200
<v Speaker 1>use something like a head molded display to enter the

0:04:22.279 --> 0:04:25.760
<v Speaker 1>virtual environment. That's what looks like you're inside. But you

0:04:25.800 --> 0:04:28.160
<v Speaker 1>have physical objects in the room around you that are

0:04:28.200 --> 0:04:30.680
<v Speaker 1>also mapped to the virtual world, Meaning you could pick

0:04:30.800 --> 0:04:33.720
<v Speaker 1>up this physical object and you would see that reflected

0:04:34.000 --> 0:04:36.719
<v Speaker 1>within the virtual world, where you might pick up a

0:04:36.760 --> 0:04:40.000
<v Speaker 1>sword and shield or move a chair or something along

0:04:40.040 --> 0:04:44.920
<v Speaker 1>those lines. So augmented reality, virtual reality, and mixed reality

0:04:45.000 --> 0:04:48.120
<v Speaker 1>are all kind of inter related, so much so that

0:04:48.160 --> 0:04:53.279
<v Speaker 1>their histories also are very much inter related. And there's

0:04:53.279 --> 0:04:56.920
<v Speaker 1>some people who try to collect these different technologies, these

0:04:56.960 --> 0:05:00.880
<v Speaker 1>different approaches and put them under a co an umbrella,

0:05:00.920 --> 0:05:05.520
<v Speaker 1>and they tend to use the phrase alternate reality, which

0:05:05.520 --> 0:05:09.039
<v Speaker 1>is unfortunate because that's also a r but alternate reality

0:05:09.080 --> 0:05:12.440
<v Speaker 1>is kind of the umbrella for virtual, augmented and mixed reality.

0:05:13.680 --> 0:05:16.680
<v Speaker 1>Uh that that that kind of gives you the definition

0:05:16.720 --> 0:05:19.640
<v Speaker 1>of those basic terms, and it is important to understand

0:05:19.680 --> 0:05:23.120
<v Speaker 1>them because they're becoming more and more important today. You

0:05:23.279 --> 0:05:27.280
<v Speaker 1>are already probably aware of a lot of VR headsets

0:05:27.320 --> 0:05:29.760
<v Speaker 1>that are out there on the market as well as VR.

0:05:30.640 --> 0:05:34.279
<v Speaker 1>Uh Well, they're they're kind of like cases that you

0:05:34.320 --> 0:05:38.920
<v Speaker 1>slide your smartphone into, so your smartphone becomes the actual

0:05:39.320 --> 0:05:42.160
<v Speaker 1>display on a VR headset. The headset itself is more

0:05:42.240 --> 0:05:46.200
<v Speaker 1>or less just a head mounted case for your phone.

0:05:47.400 --> 0:05:48.800
<v Speaker 1>We've seen a lot of those come out over the

0:05:48.880 --> 0:05:51.479
<v Speaker 1>last few years. We've also seen a lot of a

0:05:51.760 --> 0:05:56.000
<v Speaker 1>R applications come out, typically for things like iPads and smartphones,

0:05:56.080 --> 0:06:00.040
<v Speaker 1>but we've also seen some hardware come out that for

0:06:00.160 --> 0:06:04.479
<v Speaker 1>wearable devices that falls into the augmented reality category, stuff

0:06:04.480 --> 0:06:06.920
<v Speaker 1>like Google Glass, which I'll talk about more a little

0:06:06.920 --> 0:06:11.039
<v Speaker 1>bit later in this episode. For augmented reality to work

0:06:11.279 --> 0:06:16.240
<v Speaker 1>to get this enhanced experience of reality around you, there

0:06:16.240 --> 0:06:19.440
<v Speaker 1>are a lot of technological components that have to come

0:06:19.480 --> 0:06:24.839
<v Speaker 1>together so that you actually do get experience that is meaningful.

0:06:25.800 --> 0:06:28.719
<v Speaker 1>You know, you have to have technology that quote unquote

0:06:28.800 --> 0:06:32.000
<v Speaker 1>knows where you are and what you are looking at

0:06:32.120 --> 0:06:34.560
<v Speaker 1>or what you are close to in order to get

0:06:34.560 --> 0:06:38.080
<v Speaker 1>that augmented experience. It wouldn't do me any good if

0:06:38.120 --> 0:06:42.200
<v Speaker 1>I put on an augmented reality headset, for example, and

0:06:42.360 --> 0:06:47.000
<v Speaker 1>stared at let's say, a famous painting, and instead of

0:06:47.040 --> 0:06:50.280
<v Speaker 1>getting information about the famous painting I see an exploded

0:06:50.400 --> 0:06:53.720
<v Speaker 1>view of an car engine That would make no sense.

0:06:54.120 --> 0:06:57.240
<v Speaker 1>So you have to build in technologies in order for

0:06:57.360 --> 0:07:00.599
<v Speaker 1>the a R to understand what it is you're trying

0:07:00.600 --> 0:07:04.760
<v Speaker 1>to do and to augment that experience, which meant that

0:07:04.800 --> 0:07:08.240
<v Speaker 1>we had to wait a pretty good long time for

0:07:08.560 --> 0:07:13.640
<v Speaker 1>the various technologies that we used to to create this

0:07:13.800 --> 0:07:17.920
<v Speaker 1>relationship to mature to a point where it was possible.

0:07:18.320 --> 0:07:20.920
<v Speaker 1>Sometimes we had technologies that would allow us to do it,

0:07:20.960 --> 0:07:26.400
<v Speaker 1>but it required uh tethering headsets to very large computers,

0:07:26.400 --> 0:07:29.520
<v Speaker 1>which meant that you didn't have really any mobility, and

0:07:29.800 --> 0:07:34.760
<v Speaker 1>uh it really limited the usefulness of the actual application.

0:07:35.520 --> 0:07:38.960
<v Speaker 1>In other cases, you could say things like your head

0:07:38.960 --> 0:07:42.920
<v Speaker 1>tracking technology was absolutely necessary for a R to develop

0:07:42.960 --> 0:07:47.480
<v Speaker 1>the way it did. GPS technology as well. Remember it

0:07:47.520 --> 0:07:52.559
<v Speaker 1>wasn't that long ago that we ordinary mere mortals didn't

0:07:52.640 --> 0:07:57.320
<v Speaker 1>have access to really accurate GPS information. For a very

0:07:57.360 --> 0:08:02.720
<v Speaker 1>long time that was purposely made less accurate. It was

0:08:02.760 --> 0:08:07.000
<v Speaker 1>a matter of national defense. It wasn't until the nineties

0:08:07.040 --> 0:08:10.840
<v Speaker 1>that you started to ce GPS become more accurate for

0:08:10.960 --> 0:08:15.080
<v Speaker 1>the basic consumer. Wait, back in the day, you might

0:08:15.160 --> 0:08:18.800
<v Speaker 1>get accuracy of up to around a hundred meters, which

0:08:18.840 --> 0:08:21.080
<v Speaker 1>is not great if you're looking for the next place

0:08:21.120 --> 0:08:24.000
<v Speaker 1>to make your turn. If it's a hundred meters away,

0:08:24.040 --> 0:08:27.760
<v Speaker 1>that's that's pretty far. But now it's within a few feet,

0:08:27.880 --> 0:08:30.600
<v Speaker 1>so it's much better. That sort of stuff all had

0:08:30.640 --> 0:08:33.959
<v Speaker 1>to come together in order for augmented reality to become

0:08:34.600 --> 0:08:38.600
<v Speaker 1>a viable I almost said a reality, but that just

0:08:38.640 --> 0:08:43.040
<v Speaker 1>starts to sound redundant at any rate. Let's talk about

0:08:43.080 --> 0:08:46.960
<v Speaker 1>some of these technologies we were. We really need things

0:08:46.960 --> 0:08:53.959
<v Speaker 1>like gyroscopes, accelerometers. These help devices understand their orientation where

0:08:54.000 --> 0:08:57.480
<v Speaker 1>they are in respect to something else, like are they

0:08:57.840 --> 0:08:59.760
<v Speaker 1>For a smartphone, it might be is it in landscape

0:08:59.760 --> 0:09:03.439
<v Speaker 1>mode or portrait mode? But for a head mounted display,

0:09:03.520 --> 0:09:07.440
<v Speaker 1>it would help give the the unit the information it

0:09:07.440 --> 0:09:09.480
<v Speaker 1>needs to know which way you're looking, like are you

0:09:09.600 --> 0:09:11.760
<v Speaker 1>looking to the east or to the west. That kind

0:09:11.760 --> 0:09:17.520
<v Speaker 1>of thing. UH also compasses obviously very important GPS sensors,

0:09:17.640 --> 0:09:20.920
<v Speaker 1>image recognition software, but has become really important so that

0:09:21.280 --> 0:09:25.320
<v Speaker 1>when you are looking at something UH, the system can

0:09:25.360 --> 0:09:28.480
<v Speaker 1>actually identify what that is. In some cases you can

0:09:28.480 --> 0:09:31.840
<v Speaker 1>get around this. You can design an a R system

0:09:31.880 --> 0:09:34.880
<v Speaker 1>where let's say you make a movie poster and the

0:09:34.960 --> 0:09:40.000
<v Speaker 1>a R application has the movie poster animate in some

0:09:40.080 --> 0:09:43.480
<v Speaker 1>way if you hold up a smartphone that's running the

0:09:43.520 --> 0:09:47.880
<v Speaker 1>appropriate app. So I'm just gonna take a movie from

0:09:47.880 --> 0:09:50.600
<v Speaker 1>my past that does not have an a R movie

0:09:50.640 --> 0:09:53.600
<v Speaker 1>poster associated with it, but one that I can talk

0:09:53.679 --> 0:09:56.160
<v Speaker 1>about as if it were a good example, and that

0:09:56.280 --> 0:10:00.360
<v Speaker 1>has to be Big Trouble in Little China, universally cleared

0:10:00.400 --> 0:10:02.800
<v Speaker 1>the best movie that has ever been made. So you've

0:10:02.840 --> 0:10:05.360
<v Speaker 1>got your Big Trouble and Little China poster up on

0:10:05.400 --> 0:10:08.679
<v Speaker 1>the wall, and you hold up your smartphone and you

0:10:08.720 --> 0:10:12.520
<v Speaker 1>activate your Big Trouble and Little China movie marketing app,

0:10:13.320 --> 0:10:16.440
<v Speaker 1>and the camera on your phone detect the poster. It's

0:10:16.640 --> 0:10:19.120
<v Speaker 1>you know, the posters there. Well. The app and the

0:10:19.160 --> 0:10:23.600
<v Speaker 1>poster together are able to construct the augmented experience because

0:10:24.120 --> 0:10:26.840
<v Speaker 1>there have been elements put into the poster that the

0:10:26.880 --> 0:10:30.840
<v Speaker 1>app is looking for. And once the app identifies that, like,

0:10:30.920 --> 0:10:34.440
<v Speaker 1>it sees maybe eight different points on the poster, and

0:10:34.520 --> 0:10:37.720
<v Speaker 1>because of the orientation of those points, it knows what

0:10:37.920 --> 0:10:40.760
<v Speaker 1>angle it's at, what height it's at in relation to

0:10:40.800 --> 0:10:44.960
<v Speaker 1>the phone, and can give you on your display the

0:10:45.040 --> 0:10:49.560
<v Speaker 1>augmented reality experience. In this case, it's obviously Jack Burton

0:10:49.600 --> 0:10:53.040
<v Speaker 1>and the Pork Chop Express eating a sandwich, because, as

0:10:53.040 --> 0:10:57.559
<v Speaker 1>we know, the most riveting scene in the movie unfolds

0:10:57.559 --> 0:11:00.720
<v Speaker 1>in this way. So that would be kind of an

0:11:00.760 --> 0:11:04.000
<v Speaker 1>augmented reality experience where you didn't have to worry about

0:11:04.280 --> 0:11:07.440
<v Speaker 1>every possible application out in the real world. You made

0:11:07.480 --> 0:11:10.840
<v Speaker 1>it for something very specific, which means in your software

0:11:10.880 --> 0:11:15.280
<v Speaker 1>you can have the camera look quote unquote for these

0:11:15.559 --> 0:11:20.560
<v Speaker 1>particular points of reference and thus create the augmented experience

0:11:20.559 --> 0:11:22.960
<v Speaker 1>in that way. If you want to take that and

0:11:23.000 --> 0:11:25.439
<v Speaker 1>move it to the real world where you can see

0:11:25.480 --> 0:11:29.120
<v Speaker 1>augmented information about just the world around you, it becomes

0:11:29.200 --> 0:11:32.680
<v Speaker 1>way more complicated. You have to have very sophisticated image

0:11:32.679 --> 0:11:37.280
<v Speaker 1>recognition software so that the camera picks up the images,

0:11:37.520 --> 0:11:41.040
<v Speaker 1>the software processes the information, identifies what those images are,

0:11:41.400 --> 0:11:45.280
<v Speaker 1>and gives you the relevant information. So working with all

0:11:45.320 --> 0:11:48.880
<v Speaker 1>the sensors, augmented reality can make this a possibility. So

0:11:48.920 --> 0:11:52.400
<v Speaker 1>another example, let's say you're out on the street in Atlanta.

0:11:52.720 --> 0:11:56.240
<v Speaker 1>You're here in my hometown, Atlanta, Georgia, and you're looking

0:11:56.240 --> 0:11:57.960
<v Speaker 1>at a building and you wonder what it is, and

0:11:57.960 --> 0:11:59.920
<v Speaker 1>you hold up your phone and you've got your little

0:12:00.360 --> 0:12:03.400
<v Speaker 1>map app that allows you to look at a real

0:12:03.440 --> 0:12:06.800
<v Speaker 1>world setting and tells you information about it, and it

0:12:06.840 --> 0:12:09.280
<v Speaker 1>tells you it's the Georgia Aquarium. Well, first of all,

0:12:09.320 --> 0:12:11.280
<v Speaker 1>you would probably know that already because the signage there

0:12:11.320 --> 0:12:14.920
<v Speaker 1>is actually pretty good. But the point being that this

0:12:14.960 --> 0:12:18.040
<v Speaker 1>would be uh something that would tap into the GPS

0:12:18.080 --> 0:12:20.319
<v Speaker 1>coordinates on your phone, so it would know where your

0:12:20.360 --> 0:12:23.440
<v Speaker 1>location was and help narrow that down. The compass would

0:12:23.440 --> 0:12:28.680
<v Speaker 1>tell it what direction you are facing the camera angle. Also,

0:12:28.760 --> 0:12:32.160
<v Speaker 1>when you have some image recognition going on there, the

0:12:32.200 --> 0:12:36.040
<v Speaker 1>accelerometer tells the orientation of the phone itself. All of

0:12:36.080 --> 0:12:39.440
<v Speaker 1>this data together would give the software the information needed

0:12:39.840 --> 0:12:45.160
<v Speaker 1>for it to display the label Georgia Aquarium on your phone.

0:12:45.200 --> 0:12:48.040
<v Speaker 1>And it all happens in an instant that's pretty amazing.

0:12:49.280 --> 0:12:52.480
<v Speaker 1>Typically you also have to have some other method to

0:12:53.000 --> 0:12:57.160
<v Speaker 1>communicate with a larger infrastructure, because we don't have the

0:12:57.200 --> 0:13:01.640
<v Speaker 1>capability of building an enormous, sleep powerful computer that has

0:13:01.760 --> 0:13:05.240
<v Speaker 1>all this real world information programmed into it and make

0:13:05.280 --> 0:13:09.480
<v Speaker 1>it a handheld or wearable device. So usually you have

0:13:09.600 --> 0:13:13.680
<v Speaker 1>to pair these devices with some other larger infrastructure. Sometimes

0:13:13.679 --> 0:13:17.439
<v Speaker 1>it's a double handshake. For example, with Google Glass, you

0:13:17.480 --> 0:13:20.720
<v Speaker 1>would use Bluetooth to connect Google Glass to a smartphone.

0:13:21.080 --> 0:13:24.200
<v Speaker 1>Then the smartphone would have the connection to the larger

0:13:24.240 --> 0:13:30.920
<v Speaker 1>internet through your smartphones UH cell service provider. So while

0:13:30.960 --> 0:13:36.280
<v Speaker 1>you're experiencing the augmented reality through the Google Glass, it's

0:13:36.280 --> 0:13:39.640
<v Speaker 1>actually communicating through your phone to the infrastructure to get

0:13:39.679 --> 0:13:42.120
<v Speaker 1>the data it needs to show you the information. It's

0:13:42.120 --> 0:13:45.680
<v Speaker 1>showing you very important elements. And all of these components,

0:13:45.760 --> 0:13:48.400
<v Speaker 1>like I said, came together more or less around the

0:13:48.480 --> 0:13:52.640
<v Speaker 1>same time. Most of them were being developed independently of

0:13:52.679 --> 0:13:54.920
<v Speaker 1>each other, and it's just that now we're seeing them

0:13:54.920 --> 0:13:58.520
<v Speaker 1>all converge. That's an old favorite word here at tech

0:13:58.559 --> 0:14:02.599
<v Speaker 1>stuff converge together to create the augmented reality experience and

0:14:02.640 --> 0:14:05.480
<v Speaker 1>make it possible. So how did we get here? How

0:14:05.559 --> 0:14:09.560
<v Speaker 1>did these different elements develop? Well, there are a whole

0:14:09.559 --> 0:14:12.720
<v Speaker 1>bunch of technology pioneers who really created the foundation for

0:14:12.760 --> 0:14:15.960
<v Speaker 1>augmented reality as well as virtual reality and mixed reality.

0:14:16.400 --> 0:14:18.800
<v Speaker 1>But one that I think we really need to concentrate

0:14:18.840 --> 0:14:23.080
<v Speaker 1>on at first is Ivan Sutherland. Now. Sutherland was born

0:14:23.160 --> 0:14:26.880
<v Speaker 1>in Hastings, Nebraska in nineteen thirty eight, and as a

0:14:26.960 --> 0:14:32.240
<v Speaker 1>kid he was fascinated with mathematics, particularly geometry, and also

0:14:32.280 --> 0:14:36.520
<v Speaker 1>with engineering. He began to study and experiment with computers

0:14:36.680 --> 0:14:39.240
<v Speaker 1>while he was in school, and this was at a

0:14:39.240 --> 0:14:41.640
<v Speaker 1>time where personal computers weren't a thing. There were no

0:14:41.720 --> 0:14:44.440
<v Speaker 1>personal computers at this point. Computers were actually pretty rare

0:14:44.480 --> 0:14:47.960
<v Speaker 1>and they were huge, and in fact, they often would

0:14:48.000 --> 0:14:52.200
<v Speaker 1>rely upon physical media formats like punch punch cards or

0:14:52.280 --> 0:14:57.280
<v Speaker 1>paper tape to read a program. So you didn't even

0:14:57.280 --> 0:15:00.200
<v Speaker 1>have a disk or like certainly nothing like a US

0:15:00.200 --> 0:15:03.240
<v Speaker 1>Beat thumb drive or anything like that. You you actually

0:15:03.240 --> 0:15:06.080
<v Speaker 1>had to put physical media into the machine for it

0:15:06.120 --> 0:15:10.200
<v Speaker 1>to read and then execute whatever program you had designed

0:15:10.200 --> 0:15:14.320
<v Speaker 1>for that device. He went to college at what is

0:15:14.360 --> 0:15:18.320
<v Speaker 1>now Carnegie Mellon University on a full scholarship. He graduated

0:15:18.320 --> 0:15:20.280
<v Speaker 1>with a Bachelor of Science degree. He would then go

0:15:20.320 --> 0:15:22.680
<v Speaker 1>on to earn a master's degree at cal Tech and

0:15:22.800 --> 0:15:26.880
<v Speaker 1>a PhD in electrical engineering from m I T. And

0:15:27.000 --> 0:15:31.840
<v Speaker 1>actually his doctoral thesis supervisor was Claude Shannon, And we

0:15:31.920 --> 0:15:34.040
<v Speaker 1>talked about Claude Shannon back in the two thousand and

0:15:34.080 --> 0:15:38.040
<v Speaker 1>fourteen episode who is Claude Shannon? Um. We recorded that

0:15:38.120 --> 0:15:41.680
<v Speaker 1>not too long after Shannon's passing. So if you want

0:15:41.680 --> 0:15:45.280
<v Speaker 1>to hear a really interesting story about a pioneer in

0:15:45.280 --> 0:15:47.920
<v Speaker 1>computer science, you should go check out that two thousand

0:15:47.920 --> 0:15:52.240
<v Speaker 1>and fourteen episode. Back to Sutherland. For his thesis, he

0:15:52.360 --> 0:15:56.400
<v Speaker 1>created something called sketch Pad and that was really, by

0:15:56.440 --> 0:16:00.920
<v Speaker 1>most accounts, the first computer graphical user interface or gooey.

0:16:00.960 --> 0:16:05.160
<v Speaker 1>A graphic goal user interface means that you interact with

0:16:05.280 --> 0:16:10.640
<v Speaker 1>the computer through graphics representing various commands on the computer.

0:16:10.760 --> 0:16:13.960
<v Speaker 1>Windows and the Mac operating system are both examples of

0:16:13.960 --> 0:16:17.800
<v Speaker 1>graphical user interfaces, as is the interface on your smartphone.

0:16:17.880 --> 0:16:21.800
<v Speaker 1>If you have a smartphone where you choose applications on

0:16:21.840 --> 0:16:25.960
<v Speaker 1>a screen, that's a graphical user interface. Well, Sutherland created

0:16:26.000 --> 0:16:28.520
<v Speaker 1>what is largely considered to be the first one of those.

0:16:29.080 --> 0:16:32.520
<v Speaker 1>After college, he entered military service and he was assigned

0:16:32.560 --> 0:16:36.480
<v Speaker 1>to the National Security Agency. We have great friends there.

0:16:37.320 --> 0:16:40.600
<v Speaker 1>I assume I'm sure they're listening, because they're listening to

0:16:40.680 --> 0:16:43.800
<v Speaker 1>everything at any rate. He entered the n s A

0:16:43.880 --> 0:16:47.000
<v Speaker 1>as an electrical engineer, and in nineteen sixty four he

0:16:47.080 --> 0:16:50.040
<v Speaker 1>replaced J. C. R. Lick Lighter as the head of

0:16:50.120 --> 0:16:56.440
<v Speaker 1>DARPA's Information Processing Techniques Office, or TIPTOE. And also, by

0:16:56.560 --> 0:17:01.520
<v Speaker 1>back then, DARPA wasn't DARPA, it was just ARPA. Uh uh,

0:17:01.640 --> 0:17:03.400
<v Speaker 1>So this is the same group, by the way, that

0:17:03.400 --> 0:17:05.879
<v Speaker 1>would end up doing a lot of work that would

0:17:05.920 --> 0:17:09.400
<v Speaker 1>form the Arpanette a few years later, and the Arpanette

0:17:09.440 --> 0:17:13.560
<v Speaker 1>was the predecessor to the Internet in some ways. At least,

0:17:13.600 --> 0:17:16.840
<v Speaker 1>the Arpanet was what ended up being the the building

0:17:16.920 --> 0:17:21.560
<v Speaker 1>blocks for the infrastructure that would become the Internet. Now,

0:17:21.600 --> 0:17:24.200
<v Speaker 1>all of that work happened after Sutherland had already departed

0:17:24.200 --> 0:17:30.160
<v Speaker 1>the organization. His work became a fundamental component of both

0:17:30.240 --> 0:17:33.280
<v Speaker 1>virtual and augmented reality. As I mentioned earlier, in nine

0:17:34.600 --> 0:17:36.959
<v Speaker 1>he wrote a piece and essay. It's very short, it's

0:17:37.080 --> 0:17:39.280
<v Speaker 1>very easy read, and you can find it online. The

0:17:39.320 --> 0:17:42.640
<v Speaker 1>title of the essay is the Ultimate Display. And if

0:17:42.680 --> 0:17:46.720
<v Speaker 1>you ever do any research in virtual reality or augmented reality,

0:17:46.960 --> 0:17:49.880
<v Speaker 1>this essay is going to pop up in your research,

0:17:50.240 --> 0:17:52.480
<v Speaker 1>so go ahead and read it. It's like two pages long,

0:17:52.560 --> 0:17:56.240
<v Speaker 1>so it goes very quickly. In that essay he talked

0:17:56.280 --> 0:18:02.520
<v Speaker 1>about several ideas, including the idealized display, the ultimate display,

0:18:02.720 --> 0:18:06.760
<v Speaker 1>something that would be the the furthest you could go

0:18:08.240 --> 0:18:11.040
<v Speaker 1>with display technology. Now, keep in mind, in his time,

0:18:12.280 --> 0:18:13.879
<v Speaker 1>by his time he's still alive, by the way, but

0:18:14.320 --> 0:18:17.240
<v Speaker 1>this time in the nineties sixties, he uh, you know,

0:18:17.400 --> 0:18:22.879
<v Speaker 1>things were just restricted to monitors. You might have a

0:18:22.960 --> 0:18:26.879
<v Speaker 1>light pen, but usually you just use a keyboard. Like

0:18:27.119 --> 0:18:30.919
<v Speaker 1>it was pretty bare bones. They said, let's push this

0:18:31.040 --> 0:18:35.119
<v Speaker 1>as far as we can imagine it. And in his example,

0:18:35.240 --> 0:18:38.200
<v Speaker 1>he thought of a room that would be completely controlled

0:18:38.240 --> 0:18:42.520
<v Speaker 1>by computers. Everything you would experience within that room would

0:18:42.520 --> 0:18:47.600
<v Speaker 1>be generated by a computer. Everything you see here, smell, taste,

0:18:47.720 --> 0:18:51.320
<v Speaker 1>and touch, all of it generated by computers. The computer

0:18:51.359 --> 0:18:54.359
<v Speaker 1>would even be able to form physical objects out of

0:18:54.400 --> 0:18:58.280
<v Speaker 1>pure matter itself. Now, he wasn't suggesting that this would

0:18:58.320 --> 0:19:01.160
<v Speaker 1>ever be a device that we would actually be able

0:19:01.200 --> 0:19:03.960
<v Speaker 1>to build. He was just saying, what is the ultimate

0:19:04.600 --> 0:19:10.280
<v Speaker 1>incarnation of display technology? And if you read it, you realize, oh,

0:19:10.320 --> 0:19:12.840
<v Speaker 1>this is where the Star Trek Next Generation writers got

0:19:12.840 --> 0:19:16.119
<v Speaker 1>their idea for the Holidack. But unlike Star Trek the

0:19:16.160 --> 0:19:19.439
<v Speaker 1>Next Generation, the Ultimate Display would not go on the

0:19:19.480 --> 0:19:22.000
<v Speaker 1>Fritz every other episode and try to kill the crew.

0:19:23.040 --> 0:19:27.920
<v Speaker 1>It was better than that. The Ultimate Display was sort

0:19:27.920 --> 0:19:32.600
<v Speaker 1>of a a foundational like Philosophically, it was foundational for

0:19:33.320 --> 0:19:36.240
<v Speaker 1>virtual reality and augmented reality. This idea of a very

0:19:36.280 --> 0:19:42.000
<v Speaker 1>immersive experience where you, as a user, are surrounded somehow

0:19:42.320 --> 0:19:47.240
<v Speaker 1>by this computer generated experience. And that's true both with

0:19:47.280 --> 0:19:52.200
<v Speaker 1>augmented reality and virtual reality. And augmented reality, the real

0:19:52.240 --> 0:19:56.000
<v Speaker 1>world is still there, but you get this enhanced experience

0:19:56.040 --> 0:20:02.040
<v Speaker 1>that is completely computer generated. So in nineteen sixty eight,

0:20:02.680 --> 0:20:07.399
<v Speaker 1>Sutherland and a student named Danny Cohen would create a VR,

0:20:07.840 --> 0:20:12.640
<v Speaker 1>a r head mounted display or h MD, and they

0:20:12.760 --> 0:20:17.440
<v Speaker 1>nicknamed it the Sword of Damocles. Why because you had

0:20:17.480 --> 0:20:19.680
<v Speaker 1>to suspend it from the ceiling. It was too heavy

0:20:19.720 --> 0:20:22.439
<v Speaker 1>to wear on your head. You needed it to be

0:20:22.560 --> 0:20:27.440
<v Speaker 1>nice and sturdy. Uh. It included transparent lenses, which meant

0:20:27.480 --> 0:20:31.199
<v Speaker 1>you could overlay computer information on the lenses themselves, and

0:20:31.240 --> 0:20:33.680
<v Speaker 1>thus you could look through the lenses at the real

0:20:33.720 --> 0:20:38.239
<v Speaker 1>world and have these wire framed graphics on top of

0:20:38.240 --> 0:20:40.639
<v Speaker 1>what you were looking at. And it also had a

0:20:40.680 --> 0:20:44.640
<v Speaker 1>magnetic tracking system, meaning that it had sensors that could

0:20:44.640 --> 0:20:47.760
<v Speaker 1>detect magnetic fields, and as you turned your head or

0:20:47.920 --> 0:20:51.840
<v Speaker 1>you change the inclination of your head, it would change

0:20:51.880 --> 0:20:55.000
<v Speaker 1>the magnetic field and this would be relayed as a

0:20:55.119 --> 0:21:00.119
<v Speaker 1>command to the visual center the actual lenses themselves, so

0:21:00.240 --> 0:21:03.439
<v Speaker 1>that it would the change would be reflected in what

0:21:03.520 --> 0:21:06.679
<v Speaker 1>you saw. So if you have a virtual environment and

0:21:06.720 --> 0:21:08.760
<v Speaker 1>you turn your head to the left, you want the

0:21:08.880 --> 0:21:11.639
<v Speaker 1>view within the virtual environment to go to the left too,

0:21:11.920 --> 0:21:14.919
<v Speaker 1>but without head tracking technology that's impossible. So this was

0:21:14.960 --> 0:21:17.880
<v Speaker 1>a very early example of head tracking technology. And again

0:21:17.880 --> 0:21:22.080
<v Speaker 1>it used magnets magnetic fields in order to do that. Uh.

0:21:22.680 --> 0:21:26.760
<v Speaker 1>Obviously it's also really important for augmented reality. Again, if

0:21:27.080 --> 0:21:30.160
<v Speaker 1>the a R system doesn't detect that you are looking around,

0:21:30.680 --> 0:21:34.400
<v Speaker 1>then you're not getting relevant information, not for the specific

0:21:34.400 --> 0:21:38.399
<v Speaker 1>thing you are looking at. Anyway, as I said, the

0:21:38.400 --> 0:21:41.080
<v Speaker 1>graphics were pretty primitive. There were wire frame drawings, but

0:21:41.119 --> 0:21:46.960
<v Speaker 1>they still showed that this was a viable approach to

0:21:47.040 --> 0:21:51.760
<v Speaker 1>technology using HMD for augmented or virtual reality use. Oh

0:21:51.800 --> 0:21:54.120
<v Speaker 1>and one other note I should make so a lot

0:21:54.160 --> 0:21:56.520
<v Speaker 1>of people say the sort of Damocles was the first

0:21:56.520 --> 0:21:59.320
<v Speaker 1>head mounted display, and they say, you know, this is

0:21:59.359 --> 0:22:03.080
<v Speaker 1>the first m D was made in nineteen I take

0:22:03.200 --> 0:22:06.760
<v Speaker 1>issue with that. I don't think of the sort of

0:22:06.840 --> 0:22:10.359
<v Speaker 1>Damocles as the first head mounted display. That to me

0:22:10.560 --> 0:22:14.119
<v Speaker 1>should go to a different invention called the head site

0:22:15.240 --> 0:22:18.120
<v Speaker 1>h G A D S I G h T now

0:22:18.160 --> 0:22:22.280
<v Speaker 1>that was developed by Philco and unlike the sort of Damocles,

0:22:22.800 --> 0:22:27.200
<v Speaker 1>it didn't create a virtual world. Instead, the head site

0:22:27.480 --> 0:22:31.919
<v Speaker 1>was sort of a remote viewfinder for a video camera.

0:22:32.960 --> 0:22:36.920
<v Speaker 1>So imagine that you've got a camera mounted on a

0:22:36.960 --> 0:22:42.520
<v Speaker 1>mechanical uh swiveling mount, so you can move it left right,

0:22:42.560 --> 0:22:46.239
<v Speaker 1>you can change the orientation the inclination as well, and

0:22:46.280 --> 0:22:49.120
<v Speaker 1>then you have that mapped to a head mounted display,

0:22:49.200 --> 0:22:51.440
<v Speaker 1>so that if I put the display on and I

0:22:51.480 --> 0:22:53.960
<v Speaker 1>looked to the left, the camera pans to the left.

0:22:54.160 --> 0:22:55.880
<v Speaker 1>If I look to the right, it pans to the right.

0:22:56.000 --> 0:22:59.720
<v Speaker 1>That sort of thing it was meant to be away

0:22:59.800 --> 0:23:03.360
<v Speaker 1>for for people to operate a camera in a remote

0:23:03.400 --> 0:23:07.240
<v Speaker 1>location that might not be very friendly to a human

0:23:07.280 --> 0:23:11.040
<v Speaker 1>being standing there. For example, the exterior of an aircraft.

0:23:11.440 --> 0:23:13.520
<v Speaker 1>You could have a camera mounted on the outside of

0:23:13.520 --> 0:23:17.080
<v Speaker 1>your aircraft that would allow an engineer on the inside

0:23:17.520 --> 0:23:22.000
<v Speaker 1>to look around and maybe help a pilot land or

0:23:22.200 --> 0:23:25.840
<v Speaker 1>navigate in a dangerous situation, or just get an idea

0:23:26.000 --> 0:23:30.440
<v Speaker 1>of the status of the aircraft itself. This was very

0:23:30.520 --> 0:23:35.120
<v Speaker 1>much a technology that was being pushed by the military,

0:23:36.000 --> 0:23:39.879
<v Speaker 1>an idea to create more military uses using this technology

0:23:39.920 --> 0:23:44.560
<v Speaker 1>to make the military more competent, more adept at very

0:23:44.680 --> 0:23:50.520
<v Speaker 1>rapid changing situations on the technology front, So headsite preceded

0:23:50.960 --> 0:23:53.760
<v Speaker 1>the sort of damocles by about seven years. That came

0:23:53.760 --> 0:23:56.439
<v Speaker 1>out around nineteen sixty one, but again it wasn't a

0:23:56.480 --> 0:24:00.320
<v Speaker 1>virtual reality headset or an augmented reality headset. Was kind

0:24:00.320 --> 0:24:03.479
<v Speaker 1>of a like I said, a remote viewfinder. But still

0:24:03.800 --> 0:24:06.760
<v Speaker 1>I consider that to be the earliest head mounted display,

0:24:06.920 --> 0:24:10.639
<v Speaker 1>not the sort of damocles. However, Sutherland would end up

0:24:10.640 --> 0:24:13.760
<v Speaker 1>going on to make lots of other contributions in computer

0:24:13.800 --> 0:24:18.200
<v Speaker 1>graphics as well as the overall concepts that would guide

0:24:18.200 --> 0:24:22.119
<v Speaker 1>both virtual reality and augmented reality development over the next

0:24:22.119 --> 0:24:26.199
<v Speaker 1>several decades. But now it'll be time for me to

0:24:26.280 --> 0:24:29.080
<v Speaker 1>kind of move away from Sutherland to talk about some

0:24:29.160 --> 0:24:32.440
<v Speaker 1>other developments that were important in a R. And before

0:24:32.440 --> 0:24:35.800
<v Speaker 1>I get to that, let's take a quick break to

0:24:35.960 --> 0:24:46.800
<v Speaker 1>thank our sponsor. All Right, we just left off with

0:24:46.920 --> 0:24:52.040
<v Speaker 1>Ivan Sutherland. Now let's talk about a different father of

0:24:52.359 --> 0:24:58.879
<v Speaker 1>augmented reality, Myron Krueger or Dr Myron Krueger. In nineteen four,

0:24:59.000 --> 0:25:04.440
<v Speaker 1>Dr Krueger create an augmented reality lab called Video Place. Uh.

0:25:04.480 --> 0:25:09.000
<v Speaker 1>He was really into this idea of seeing the interaction

0:25:09.119 --> 0:25:13.560
<v Speaker 1>of technology and people in artistic ways. He really wanted

0:25:13.600 --> 0:25:19.640
<v Speaker 1>to explore artistic expressions using technology and people working together.

0:25:20.960 --> 0:25:23.600
<v Speaker 1>So he wanted to create an artificial reality environment that

0:25:23.680 --> 0:25:27.080
<v Speaker 1>didn't require the user to wear special equipment. You wouldn't

0:25:27.080 --> 0:25:29.600
<v Speaker 1>have to put on a head mold display, or wear

0:25:29.960 --> 0:25:35.080
<v Speaker 1>special gloves, or use any kind of device to control

0:25:35.280 --> 0:25:41.160
<v Speaker 1>your actions, because that's a barrier between you and the experience. Instead,

0:25:42.080 --> 0:25:45.800
<v Speaker 1>his version consisted of a laboratory that had several rooms

0:25:46.160 --> 0:25:51.080
<v Speaker 1>all network together, and each room had a video camera

0:25:51.160 --> 0:25:54.000
<v Speaker 1>in it and a projector and a screen. Now, the

0:25:54.080 --> 0:25:57.040
<v Speaker 1>video camera would pick up the motions of the person

0:25:57.080 --> 0:26:00.960
<v Speaker 1>inside the room. It would send information to the projector,

0:26:01.119 --> 0:26:05.159
<v Speaker 1>which would then project the person's silhouette on the screen.

0:26:05.480 --> 0:26:08.800
<v Speaker 1>And the silhouette was typically a really bright color, and

0:26:08.840 --> 0:26:11.560
<v Speaker 1>you could move around and your silhouette would move around,

0:26:11.640 --> 0:26:14.240
<v Speaker 1>So you almost became like a puppet master controlling your

0:26:14.280 --> 0:26:17.919
<v Speaker 1>own silhouette. But then he started to incorporate other things,

0:26:17.960 --> 0:26:21.919
<v Speaker 1>like other elements that were virtually on the screen. The

0:26:21.920 --> 0:26:24.760
<v Speaker 1>projector was projecting things that were on the screen but

0:26:24.840 --> 0:26:28.040
<v Speaker 1>not in the actual real room itself. So imagine a

0:26:28.080 --> 0:26:31.400
<v Speaker 1>ball and a ball is being projected on the screen. Well,

0:26:31.440 --> 0:26:33.719
<v Speaker 1>you could move around so that your silhouette would interact

0:26:33.720 --> 0:26:36.600
<v Speaker 1>with the ball and the ball would bounce away. That

0:26:36.760 --> 0:26:38.959
<v Speaker 1>sort of thing, so you would be able to interact

0:26:39.000 --> 0:26:43.000
<v Speaker 1>with virtual environments by moving around in a real physical space.

0:26:43.560 --> 0:26:49.400
<v Speaker 1>And while those objects weren't really there in front of you,

0:26:49.400 --> 0:26:51.919
<v Speaker 1>you could see the representation of them on the screen.

0:26:52.280 --> 0:26:54.680
<v Speaker 1>And this was really powerful stuff. And remember I said

0:26:54.720 --> 0:26:57.160
<v Speaker 1>these rooms were all network together, so you could actually

0:26:57.359 --> 0:27:00.200
<v Speaker 1>have a system where a person in one room and

0:27:00.240 --> 0:27:03.919
<v Speaker 1>a person in another room both have their silhouettes projected

0:27:04.119 --> 0:27:09.440
<v Speaker 1>together in their respective rooms on the screen, and your

0:27:09.880 --> 0:27:12.720
<v Speaker 1>silhouette would be one color, the other person's silhouette would

0:27:12.720 --> 0:27:15.840
<v Speaker 1>be a different color, and you could interact with one another.

0:27:16.560 --> 0:27:23.160
<v Speaker 1>And according to reports from this art experiment, they noticed

0:27:23.200 --> 0:27:26.800
<v Speaker 1>that whenever people would have their silhouettes cross one another,

0:27:27.240 --> 0:27:30.760
<v Speaker 1>they would actually recoil in their physical rooms. Keep in

0:27:30.800 --> 0:27:33.040
<v Speaker 1>mind they're in different rooms, they're not in the same

0:27:33.040 --> 0:27:35.840
<v Speaker 1>one together, they would recoil as if they had made

0:27:35.920 --> 0:27:39.960
<v Speaker 1>physical contact or bumped into someone. So it showed that

0:27:39.960 --> 0:27:44.600
<v Speaker 1>there was a very powerful psychological element to this virtual presence.

0:27:44.960 --> 0:27:48.440
<v Speaker 1>And again that psychological element plays a huge important role

0:27:48.640 --> 0:27:52.000
<v Speaker 1>in VR and a R research and development, not just

0:27:52.080 --> 0:27:55.840
<v Speaker 1>for creating products, but just to understand how we process

0:27:55.880 --> 0:28:00.520
<v Speaker 1>information and incorporated into our sense of reality. Not to

0:28:00.520 --> 0:28:04.480
<v Speaker 1>get too deep, for you guys. So experimentation in the

0:28:04.520 --> 0:28:07.840
<v Speaker 1>field continued over the years. In the early nineteen eighties,

0:28:07.920 --> 0:28:11.199
<v Speaker 1>Dr Krueger would write a book and publish it about

0:28:11.320 --> 0:28:15.800
<v Speaker 1>artificial realities. But while the principles for augmented reality were established,

0:28:15.920 --> 0:28:20.600
<v Speaker 1>the technologies were still rather unwieldy. They were large, they

0:28:20.600 --> 0:28:24.560
<v Speaker 1>weren't reliable, and it would require several years of work

0:28:24.760 --> 0:28:29.320
<v Speaker 1>to improve those technologies, to create miniaturization strategies to get

0:28:29.320 --> 0:28:32.280
<v Speaker 1>the elements down to a size that was more practical

0:28:32.520 --> 0:28:35.560
<v Speaker 1>for that sort of use and wouldn't require you to

0:28:35.600 --> 0:28:39.560
<v Speaker 1>have a head mound display mounted to the ceiling. And

0:28:39.600 --> 0:28:41.800
<v Speaker 1>all of that took time, but you could tell that

0:28:41.880 --> 0:28:46.800
<v Speaker 1>the ideas underlying augmented and virtual reality were already in place.

0:28:47.840 --> 0:28:52.720
<v Speaker 1>In n there was a Boeing researcher named Tom Coddell

0:28:52.960 --> 0:28:58.240
<v Speaker 1>who coined the term augmented reality, and he was specifically

0:28:58.320 --> 0:29:01.360
<v Speaker 1>using it to talk about this approach to overlaying digital

0:29:01.400 --> 0:29:03.959
<v Speaker 1>information on top of our physical world to enhance it

0:29:04.080 --> 0:29:08.120
<v Speaker 1>in some way. Now. Dr Cordell earned a PhD in

0:29:08.160 --> 0:29:12.480
<v Speaker 1>physics and astronomy from the University of Arizona, and before

0:29:12.480 --> 0:29:16.120
<v Speaker 1>contributing the term augmented reality to the public lexicon. He

0:29:16.200 --> 0:29:20.160
<v Speaker 1>did extensive work and artificial intelligence research and development. He

0:29:20.240 --> 0:29:22.640
<v Speaker 1>also became a professor in the fields of electrical and

0:29:22.640 --> 0:29:26.960
<v Speaker 1>computer engineering at the University of New Mexico. So when

0:29:26.960 --> 0:29:29.600
<v Speaker 1>he was working with Boeing, he used this phrase to

0:29:29.640 --> 0:29:32.760
<v Speaker 1>talk about specific system he was working on, an augmented

0:29:32.800 --> 0:29:35.360
<v Speaker 1>reality system, and the whole purpose of this was to

0:29:35.560 --> 0:29:41.680
<v Speaker 1>help people who were helping construct airplanes lay cables properly.

0:29:42.320 --> 0:29:46.680
<v Speaker 1>The whole idea was to use this system so that uh,

0:29:46.800 --> 0:29:49.760
<v Speaker 1>an electrician can see exactly where the cable needed to

0:29:49.840 --> 0:29:55.720
<v Speaker 1>go inside the partly constructed cabin of an aircraft, and

0:29:55.760 --> 0:29:58.520
<v Speaker 1>that way you could follow the directions that you see

0:29:58.560 --> 0:30:02.600
<v Speaker 1>through your display, lay the actual cable down where the

0:30:02.680 --> 0:30:06.440
<v Speaker 1>guide tells you to go, and then you would have

0:30:06.520 --> 0:30:11.840
<v Speaker 1>a properly wired airplane. Uh. And I'm sure, as we're

0:30:11.840 --> 0:30:17.560
<v Speaker 1>all aware, properly wired airplanes are good airplanes. Improperly wired

0:30:17.600 --> 0:30:20.440
<v Speaker 1>airplanes are not so good. So it was a very

0:30:20.480 --> 0:30:25.200
<v Speaker 1>important system to make this much more smooth and fast,

0:30:25.320 --> 0:30:28.080
<v Speaker 1>and it meant that you didn't have to have as

0:30:28.920 --> 0:30:33.560
<v Speaker 1>as many experts to guide the process. You could actually

0:30:33.560 --> 0:30:36.840
<v Speaker 1>have someone come in who had never done this before

0:30:37.320 --> 0:30:41.480
<v Speaker 1>and just follow the directions through this augmented reality set

0:30:41.560 --> 0:30:47.840
<v Speaker 1>system and they could wire the airplane properly, so really

0:30:48.000 --> 0:30:52.040
<v Speaker 1>clever means of using augmented reality. Also, we would end

0:30:52.080 --> 0:30:55.320
<v Speaker 1>up seeing that same sort of philosophy used again and

0:30:55.360 --> 0:30:59.920
<v Speaker 1>again in the future in more sophisticated UH types of technology.

0:31:00.040 --> 0:31:03.479
<v Speaker 1>It was the exact same approach, exact same idea underlying it.

0:31:04.720 --> 0:31:09.440
<v Speaker 1>In Lewis Rosenberg proposed a system that the Air Force

0:31:09.560 --> 0:31:12.400
<v Speaker 1>could use to allow someone to control devices from a

0:31:12.480 --> 0:31:17.000
<v Speaker 1>remote location, and that consisted of a video camera which

0:31:17.040 --> 0:31:20.040
<v Speaker 1>would provide the visual data to the user through a

0:31:20.040 --> 0:31:22.440
<v Speaker 1>head mounted display. They would wear the display on their

0:31:22.480 --> 0:31:25.120
<v Speaker 1>heads or they would look at a screen, but typically

0:31:25.160 --> 0:31:28.640
<v Speaker 1>they'd wear a display, and then they would also wear

0:31:28.720 --> 0:31:32.680
<v Speaker 1>an exoskeleton on their upper body that would allow them

0:31:32.720 --> 0:31:38.280
<v Speaker 1>to control some sort of robotic device, typically robotic arms.

0:31:38.960 --> 0:31:40.880
<v Speaker 1>And usually the way this would work is that the

0:31:40.960 --> 0:31:44.160
<v Speaker 1>display was designed in such a way with a video

0:31:44.240 --> 0:31:47.920
<v Speaker 1>camera so that the view that the person had it

0:31:47.960 --> 0:31:51.760
<v Speaker 1>made it look like the robot arms were their actual arms,

0:31:52.480 --> 0:31:55.000
<v Speaker 1>which required a little bit of trickery on the part

0:31:55.080 --> 0:31:59.000
<v Speaker 1>of Rosenberg. They had to fudge the distances between the

0:31:59.080 --> 0:32:02.240
<v Speaker 1>video camera and the robotic arms to give this this

0:32:02.320 --> 0:32:06.720
<v Speaker 1>sort of feeling that the robot arms represented your actual arms.

0:32:06.760 --> 0:32:09.360
<v Speaker 1>So you move your arms inside the exoskeleton, and the

0:32:09.440 --> 0:32:13.360
<v Speaker 1>robot arms would move as well at the remote location,

0:32:13.520 --> 0:32:16.280
<v Speaker 1>so you it's kind of like a really fancy remote control.

0:32:17.920 --> 0:32:23.280
<v Speaker 1>Now imagine that the robot arms are holding various tools. Uh.

0:32:23.320 --> 0:32:27.320
<v Speaker 1>The suit would also provide haptic feedback, that touch based

0:32:27.440 --> 0:32:29.920
<v Speaker 1>feedback to let a user know more about what is

0:32:29.960 --> 0:32:32.840
<v Speaker 1>going on when they're operating the arms. So if you

0:32:32.880 --> 0:32:36.280
<v Speaker 1>were to have do something that would make a robot

0:32:36.360 --> 0:32:41.080
<v Speaker 1>arm encounter resistance, then you would feel haptic feedback in

0:32:41.120 --> 0:32:43.920
<v Speaker 1>the suit that would indicate, oh, you're you're going beyond

0:32:44.000 --> 0:32:46.800
<v Speaker 1>the parameters of where this robot arm is capable of going.

0:32:47.600 --> 0:32:50.480
<v Speaker 1>So you learned very quickly where where you can operate

0:32:50.520 --> 0:32:54.560
<v Speaker 1>within that suit and make sure that you are not

0:32:54.640 --> 0:32:59.640
<v Speaker 1>pushing it beyond its limits. You could also uh end

0:32:59.720 --> 0:33:02.840
<v Speaker 1>up using these tools to do various things in this

0:33:02.920 --> 0:33:07.640
<v Speaker 1>remote environment. Now, Rosenberg called his system virtual fixtures, which

0:33:07.680 --> 0:33:09.960
<v Speaker 1>meant that the user would see these virtual overlays on

0:33:10.040 --> 0:33:12.320
<v Speaker 1>top of a real environment that they were looking at.

0:33:12.880 --> 0:33:16.280
<v Speaker 1>So I'm going to give a very basic example that

0:33:16.920 --> 0:33:19.400
<v Speaker 1>to illustrate this. Because it's hard to imagine, it's hard

0:33:19.440 --> 0:33:21.840
<v Speaker 1>to get it across in words. But let's say you're

0:33:21.840 --> 0:33:24.640
<v Speaker 1>looking through a headmund display and in front of you

0:33:24.960 --> 0:33:30.800
<v Speaker 1>is a board, wooden board, and it's just a regular

0:33:30.840 --> 0:33:33.640
<v Speaker 1>wooden board. There's nothing painted on it or anything in

0:33:33.640 --> 0:33:36.640
<v Speaker 1>the real world, and it's in a room that's across

0:33:36.720 --> 0:33:39.240
<v Speaker 1>the building from you. You cannot see this with your

0:33:39.240 --> 0:33:41.600
<v Speaker 1>own eyes. You can only see it through the video camera.

0:33:42.880 --> 0:33:47.080
<v Speaker 1>The virtual fixture overlay might be a series of circles,

0:33:47.360 --> 0:33:49.760
<v Speaker 1>and the circles are things that you were meant to

0:33:49.800 --> 0:33:52.360
<v Speaker 1>cut out of the board using the robot arms and

0:33:52.400 --> 0:33:56.320
<v Speaker 1>a tool that's right there inside the physical environment across

0:33:56.360 --> 0:33:59.640
<v Speaker 1>the building from you. So you follow the patterns that

0:33:59.720 --> 0:34:03.520
<v Speaker 1>you in this virtual overlay and you complete the task.

0:34:04.040 --> 0:34:07.520
<v Speaker 1>That's a very simple example. And uh, this system was

0:34:07.560 --> 0:34:10.239
<v Speaker 1>meant to allow for that. That's what he would call

0:34:10.280 --> 0:34:14.520
<v Speaker 1>the virtual fixtures, these overlays that you would see that

0:34:14.680 --> 0:34:17.960
<v Speaker 1>would appear to be real, but actually we're not present

0:34:18.040 --> 0:34:24.520
<v Speaker 1>in the physical environment itself. Now. Also in a group

0:34:24.600 --> 0:34:27.759
<v Speaker 1>of researchers at Columbia University were proposing a system that

0:34:27.840 --> 0:34:32.040
<v Speaker 1>they called the knowledge based Augmented Reality for maintenance assistance

0:34:33.080 --> 0:34:38.439
<v Speaker 1>a k a. Karma cute. Their approach was pretty novel.

0:34:38.480 --> 0:34:42.040
<v Speaker 1>They pointed out that while augmented reality had tremendous potential,

0:34:42.120 --> 0:34:44.920
<v Speaker 1>it also had a really big barrier, and that it

0:34:45.000 --> 0:34:49.560
<v Speaker 1>takes an enormous amount of time to design or animate

0:34:49.640 --> 0:34:53.359
<v Speaker 1>and implement these graphic overlays for a our applications. So

0:34:54.360 --> 0:34:56.200
<v Speaker 1>let's say you're in a room and you're looking at

0:34:56.200 --> 0:34:59.640
<v Speaker 1>different objects, and little labels are popping up for each object.

0:35:00.320 --> 0:35:02.120
<v Speaker 1>If you're having to do all that by hand, it

0:35:02.120 --> 0:35:05.239
<v Speaker 1>takes a huge amount of time. What they wanted to

0:35:05.280 --> 0:35:10.440
<v Speaker 1>do was create artificial intelligence systems, or at least techniques

0:35:10.960 --> 0:35:16.839
<v Speaker 1>to generate graphics automatically on the fly. So this would

0:35:16.880 --> 0:35:20.680
<v Speaker 1>be similar to using image recognition software, so that if

0:35:20.680 --> 0:35:25.040
<v Speaker 1>you look at a specific box, let's say, the image

0:35:25.040 --> 0:35:27.360
<v Speaker 1>recognition software might be able to map that box to

0:35:27.440 --> 0:35:30.319
<v Speaker 1>a specific product and thus give you an overlay of

0:35:30.520 --> 0:35:33.080
<v Speaker 1>information about the product that would be inside that box.

0:35:33.680 --> 0:35:35.359
<v Speaker 1>And it would be able to do all this automatically.

0:35:35.440 --> 0:35:38.759
<v Speaker 1>It would not require a human programmer to go through

0:35:39.280 --> 0:35:42.520
<v Speaker 1>and and look at every single product in every single

0:35:42.560 --> 0:35:45.320
<v Speaker 1>type of box and program all that out. That would

0:35:45.320 --> 0:35:49.120
<v Speaker 1>be ridiculous. It would take forever so it was the

0:35:49.160 --> 0:35:51.920
<v Speaker 1>work of this group with Karma that really started the

0:35:51.960 --> 0:35:55.640
<v Speaker 1>ball rolling with this AI approach to automatically fill in

0:35:55.640 --> 0:36:00.759
<v Speaker 1>that information and make a are a more practical experience.

0:36:02.280 --> 0:36:06.239
<v Speaker 1>Around this same time, between ninety two and nine, the

0:36:06.360 --> 0:36:10.840
<v Speaker 1>Laurel Western Development Labs, which was defense contractor, began to

0:36:10.880 --> 0:36:12.960
<v Speaker 1>work with the U. S Military to create a R

0:36:13.040 --> 0:36:16.480
<v Speaker 1>systems for military vehicles. And you can understand very quickly

0:36:16.640 --> 0:36:21.360
<v Speaker 1>how a R would have enormous potential for military applications.

0:36:21.760 --> 0:36:24.239
<v Speaker 1>And in fact, a R is very commonly used in

0:36:24.320 --> 0:36:27.760
<v Speaker 1>lots of different things, like pilot helmets, where it helps

0:36:27.840 --> 0:36:33.360
<v Speaker 1>pilots keep track of targets uh and identify potential threats,

0:36:33.440 --> 0:36:37.080
<v Speaker 1>that kind of thing. But in this case, they were

0:36:37.080 --> 0:36:41.799
<v Speaker 1>really looking at creating a augmented reality system that would

0:36:41.840 --> 0:36:47.480
<v Speaker 1>create virtual opponents for people working in simulated wartime conditions,

0:36:47.960 --> 0:36:51.520
<v Speaker 1>so really a training program. Imagine that you're operating an

0:36:51.560 --> 0:36:55.759
<v Speaker 1>actual military vehicle like a tank, and you have a

0:36:55.960 --> 0:37:00.400
<v Speaker 1>view outside that is really an augmented reality system, so

0:37:00.480 --> 0:37:02.600
<v Speaker 1>you're actually looking at the real world around you. You

0:37:02.640 --> 0:37:05.200
<v Speaker 1>aren't just sitting in a simulator inside of building. You

0:37:05.239 --> 0:37:08.520
<v Speaker 1>are out there in the field controlling a real vehicle

0:37:08.640 --> 0:37:12.320
<v Speaker 1>moving around in real terrain, but you also see virtual

0:37:12.960 --> 0:37:17.399
<v Speaker 1>representations of enemies in that real terrain, and you can

0:37:17.480 --> 0:37:21.200
<v Speaker 1>practice maneuvers and firing on enemies that sort of thing,

0:37:21.520 --> 0:37:25.319
<v Speaker 1>probably not using live ammunition at that point, but having

0:37:25.360 --> 0:37:28.239
<v Speaker 1>a more realistic simulation in a real environment, so that

0:37:28.280 --> 0:37:35.000
<v Speaker 1>you're not just trying to create a totally virtual scenario. Anyway,

0:37:35.040 --> 0:37:39.200
<v Speaker 1>that work was done in the world wouldn't really learn

0:37:39.200 --> 0:37:42.520
<v Speaker 1>about it at large until about ninety nine, because that's

0:37:42.520 --> 0:37:44.880
<v Speaker 1>the way the military works. They're not so eager to

0:37:44.880 --> 0:37:48.840
<v Speaker 1>talk about their stuff while are still doing it. Meanwhile,

0:37:49.000 --> 0:37:51.719
<v Speaker 1>at the same time, artists were continuing to explore the

0:37:51.760 --> 0:37:55.239
<v Speaker 1>relationships between physical performers and virtual elements. You remember I

0:37:55.239 --> 0:37:59.560
<v Speaker 1>talked about Dr Krueger earlier. While in different artist Julie

0:37:59.600 --> 0:38:04.399
<v Speaker 1>Martin would create a piece called Dancing in Cyberspace, and

0:38:04.520 --> 0:38:07.840
<v Speaker 1>in that piece, dancers on a physical space or a

0:38:07.840 --> 0:38:11.520
<v Speaker 1>physical stage. We're able to manipulate virtual objects, so an

0:38:11.560 --> 0:38:14.800
<v Speaker 1>audience would be able to see both the physical performance

0:38:14.840 --> 0:38:19.759
<v Speaker 1>by the dancers and the virtual reactions the things that

0:38:19.880 --> 0:38:22.279
<v Speaker 1>happened within the virtual environment as a result of the

0:38:22.360 --> 0:38:28.680
<v Speaker 1>dancers moving around their physical space. Pretty neat two researchers

0:38:28.840 --> 0:38:34.360
<v Speaker 1>Reki Moto and Nagao created there the first real handheld

0:38:34.440 --> 0:38:37.080
<v Speaker 1>a R display, but it was a tethered display. It

0:38:37.160 --> 0:38:40.120
<v Speaker 1>wasn't free form. You couldn't just take it anywhere. It

0:38:40.200 --> 0:38:43.080
<v Speaker 1>was called Navy Cam, and you had to have a

0:38:43.280 --> 0:38:46.759
<v Speaker 1>tether like cable essentially connect the Navy Cam to a workstation.

0:38:47.480 --> 0:38:50.759
<v Speaker 1>But it had a forward facing camera and you can

0:38:50.920 --> 0:38:55.280
<v Speaker 1>use a video feed to go through this handheld device

0:38:55.800 --> 0:38:59.920
<v Speaker 1>through the cable to the workstation, and it could detect

0:39:00.040 --> 0:39:03.719
<v Speaker 1>color coded markers in the camera image and display information

0:39:03.760 --> 0:39:05.960
<v Speaker 1>on a video see through view. So you can get

0:39:05.960 --> 0:39:09.759
<v Speaker 1>that augmented reality experience. Obviously very limited, you know, you

0:39:09.800 --> 0:39:12.359
<v Speaker 1>could not just carry this around with you everywhere you go,

0:39:12.520 --> 0:39:17.040
<v Speaker 1>but it showed the ideas behind augmented reality could in

0:39:17.080 --> 0:39:20.320
<v Speaker 1>fact be realized in a handheld format. Now. It was

0:39:20.360 --> 0:39:24.200
<v Speaker 1>just a matter of getting those different components small enough

0:39:24.239 --> 0:39:28.320
<v Speaker 1>to all fit in a self contained mobile form factor.

0:39:29.960 --> 0:39:33.000
<v Speaker 1>In the late nineties, we started seeing televised sporting events

0:39:33.160 --> 0:39:37.080
<v Speaker 1>featuring augmented reality elements, or at least you did. I

0:39:37.200 --> 0:39:40.920
<v Speaker 1>don't watch sports ball, that's not entirely true, but I

0:39:40.960 --> 0:39:44.239
<v Speaker 1>don't watch football or hockey. American football or hockey, and

0:39:44.360 --> 0:39:48.040
<v Speaker 1>both of those were the sports that really got them.

0:39:48.040 --> 0:39:52.360
<v Speaker 1>First off, I'm gonna backtrack. I used to watch hockey,

0:39:52.400 --> 0:39:59.160
<v Speaker 1>but then Winnipeg stole the Atlanta Thrashers from me. Winnipeg, Okay,

0:39:59.160 --> 0:40:02.120
<v Speaker 1>getting back to hockey. So hockey had the Fox track system,

0:40:02.239 --> 0:40:05.520
<v Speaker 1>which Fox put into hockey games so that you could

0:40:05.520 --> 0:40:08.120
<v Speaker 1>easily follow the puck. Instead of trying to watch this

0:40:08.239 --> 0:40:11.359
<v Speaker 1>little bitty black disc spinning around, you got to watch

0:40:11.400 --> 0:40:18.000
<v Speaker 1>this very bright, highlighted neon colored disc that everyone hated.

0:40:18.840 --> 0:40:22.120
<v Speaker 1>And after about two seasons, Fox stop doing it, and

0:40:22.239 --> 0:40:26.360
<v Speaker 1>people were happy. Untila Thrashers moved away, and then it

0:40:26.440 --> 0:40:31.160
<v Speaker 1>was just miserable. American football would follow suit in the

0:40:31.239 --> 0:40:34.799
<v Speaker 1>late nineties and have the first down line introduced, where

0:40:34.800 --> 0:40:39.320
<v Speaker 1>they could on live video overlay the first down line.

0:40:39.800 --> 0:40:43.359
<v Speaker 1>Usually it's a bright yellow line that indicates how far

0:40:43.440 --> 0:40:45.920
<v Speaker 1>the offensive team needs to go. And by offensive, I

0:40:45.960 --> 0:40:48.120
<v Speaker 1>mean they're on the offensive. I don't mean they offend

0:40:48.200 --> 0:40:52.480
<v Speaker 1>my sensibilities. I'm not that against American football, but it

0:40:52.480 --> 0:40:54.040
<v Speaker 1>showed how far they would need to go in order

0:40:54.080 --> 0:40:57.200
<v Speaker 1>to establish a first down, which I am told is

0:40:57.239 --> 0:41:01.160
<v Speaker 1>something you want to do. Uh. That would start to

0:41:01.200 --> 0:41:05.400
<v Speaker 1>get employed in and over time we would see that

0:41:05.960 --> 0:41:10.240
<v Speaker 1>increase where eventually Skycam was able to even use this system.

0:41:10.280 --> 0:41:12.800
<v Speaker 1>At first, it wasn't you could get a Skycamp you

0:41:12.920 --> 0:41:15.120
<v Speaker 1>but you couldn't do the overlay of the first intent

0:41:15.239 --> 0:41:19.600
<v Speaker 1>line until later. Well, I've got a lot more to

0:41:19.719 --> 0:41:24.040
<v Speaker 1>say about augmented reality, but before I do, let's take

0:41:24.080 --> 0:41:34.960
<v Speaker 1>another quick break to thank our sponsor. Okay, we're back.

0:41:35.200 --> 0:41:39.520
<v Speaker 1>Let's skip ahead to I guess it's not really skipping

0:41:39.560 --> 0:41:44.879
<v Speaker 1>I just talked about. Let's plot ahead to That's when

0:41:45.000 --> 0:41:51.000
<v Speaker 1>NASA's X spacecraft was using an a R system as

0:41:51.080 --> 0:41:54.960
<v Speaker 1>part of its navigational tools, so people back on Earth

0:41:55.160 --> 0:41:58.760
<v Speaker 1>could look at a view from the spacecraft a camera

0:41:59.000 --> 0:42:02.560
<v Speaker 1>mounted on the space ace craft, and on top of

0:42:02.600 --> 0:42:06.719
<v Speaker 1>that view they could overlay map data to help with navigation.

0:42:06.920 --> 0:42:08.839
<v Speaker 1>And all of that, of course was controlled back here

0:42:08.960 --> 0:42:12.000
<v Speaker 1>on Earth, but it was sort of an experiment to

0:42:12.040 --> 0:42:16.319
<v Speaker 1>see how augmented reality could be incorporated into space exploration

0:42:16.400 --> 0:42:20.520
<v Speaker 1>missions in the future and make them more effective. Also,

0:42:20.560 --> 0:42:25.920
<v Speaker 1>in the Navy began work on the Battlefield Augmented Reality

0:42:26.040 --> 0:42:29.400
<v Speaker 1>System or BARS, which is a wearable a R system

0:42:29.400 --> 0:42:33.560
<v Speaker 1>for soldiers. You've probably seen various implementations of this over

0:42:33.600 --> 0:42:38.120
<v Speaker 1>the years is obviously evolved since nine It's one of

0:42:38.120 --> 0:42:42.000
<v Speaker 1>those pieces of technology that some soldiers took to but

0:42:42.080 --> 0:42:47.080
<v Speaker 1>a lot just felt that it created unnecessary distractions. Technology

0:42:47.120 --> 0:42:52.560
<v Speaker 1>and warfare is very, very difficult because there's sometimes where

0:42:52.600 --> 0:42:56.239
<v Speaker 1>we think, oh, more information is always better, but in

0:42:56.320 --> 0:43:00.120
<v Speaker 1>some cases that doesn't seem to hold true. Uh, And

0:43:00.280 --> 0:43:03.520
<v Speaker 1>for some people with these head mounted displays or or

0:43:03.600 --> 0:43:07.480
<v Speaker 1>really it's heads up displays HUDs uh, that can sometimes

0:43:07.480 --> 0:43:11.640
<v Speaker 1>be the case, depends on the implementation. In two thousand,

0:43:12.000 --> 0:43:17.080
<v Speaker 1>hiro Katsu Kato created a software library called a R Toolkit.

0:43:18.040 --> 0:43:21.760
<v Speaker 1>Very important software library was also open source, so anyone

0:43:21.880 --> 0:43:26.160
<v Speaker 1>could contribute to it, modify it. Brent put out a

0:43:26.200 --> 0:43:29.760
<v Speaker 1>new version that sort of stuff, and it uses video

0:43:29.800 --> 0:43:33.040
<v Speaker 1>tracking to overlay computer graphics on a video camera feed,

0:43:33.880 --> 0:43:36.200
<v Speaker 1>and it's still a component for a lot of a

0:43:36.320 --> 0:43:39.920
<v Speaker 1>our experiences today. Later on in the two thousand's this

0:43:39.960 --> 0:43:43.120
<v Speaker 1>would be adapted so that it could also be used

0:43:43.200 --> 0:43:48.680
<v Speaker 1>in web experiences, not just native experiences to specific devices,

0:43:50.200 --> 0:43:53.440
<v Speaker 1>and we continue to see are built into new experiences

0:43:53.480 --> 0:43:57.560
<v Speaker 1>including smartphones and tablets. By two thousand four, some researchers

0:43:57.560 --> 0:44:00.200
<v Speaker 1>in Germany, we're creating a r apps that can take

0:44:00.239 --> 0:44:04.280
<v Speaker 1>advantage of a smartphones camera. But two thousand fours pretty

0:44:04.280 --> 0:44:08.200
<v Speaker 1>early for smartphones. It really would would be a few

0:44:08.320 --> 0:44:11.919
<v Speaker 1>years before this would truly take off, because that's when

0:44:12.000 --> 0:44:14.760
<v Speaker 1>Apple came out with the iPhone in two thousand seven.

0:44:15.560 --> 0:44:19.480
<v Speaker 1>That was the real revolution in smartphone technology. There had

0:44:19.520 --> 0:44:22.680
<v Speaker 1>been smartphones before the iPhone, don't get me wrong, and

0:44:22.719 --> 0:44:26.480
<v Speaker 1>many of them were really good, but the iPhone was

0:44:26.560 --> 0:44:30.880
<v Speaker 1>something that caught the public's attention and made smartphone sexy.

0:44:31.160 --> 0:44:33.800
<v Speaker 1>And because of that, there was a ton of money

0:44:34.239 --> 0:44:38.160
<v Speaker 1>poured into the smartphone industry as well as not just Apple,

0:44:38.200 --> 0:44:40.640
<v Speaker 1>but also to other companies, like the companies that were

0:44:40.680 --> 0:44:44.839
<v Speaker 1>offering Android smartphones. But I think we can really thank

0:44:44.880 --> 0:44:48.280
<v Speaker 1>Apple for all of that happening in the first place,

0:44:48.920 --> 0:44:51.640
<v Speaker 1>especially things like seeing that accelerometer where you could switch

0:44:51.680 --> 0:44:54.520
<v Speaker 1>from portrait to landscape mode. I remember everyone freaking out

0:44:54.600 --> 0:44:57.680
<v Speaker 1>about that when Steve Jobs showed it off in two

0:44:57.719 --> 0:45:01.200
<v Speaker 1>thousand seven at Macworld, and everyone thought, will this is

0:45:01.239 --> 0:45:03.520
<v Speaker 1>the this is amazing. Well, we take it for granted now,

0:45:03.560 --> 0:45:06.520
<v Speaker 1>but it was a big deal then. So once that

0:45:06.600 --> 0:45:12.479
<v Speaker 1>smartphone revolution happened, it was a landslide victory for both

0:45:12.560 --> 0:45:16.160
<v Speaker 1>augmented reality and virtual reality research and development, because it

0:45:16.200 --> 0:45:22.000
<v Speaker 1>meant that so much money was being poured into creating newer, thinner,

0:45:22.560 --> 0:45:27.839
<v Speaker 1>more capable smartphones that we saw an explosion in technological

0:45:27.880 --> 0:45:31.440
<v Speaker 1>development that could also be used for virtual and augmented

0:45:31.480 --> 0:45:36.760
<v Speaker 1>reality experiences. So, for example, I think of those sensors

0:45:36.800 --> 0:45:42.240
<v Speaker 1>I talked about earlier, accelerometers and gyroscopes, that sort of thing. Well,

0:45:42.360 --> 0:45:44.480
<v Speaker 1>we saw a lot of development in those spaces in

0:45:44.560 --> 0:45:47.560
<v Speaker 1>order to make smartphones better, and people who are working

0:45:47.560 --> 0:45:50.359
<v Speaker 1>in a R and VR experiences can take advantage of

0:45:50.400 --> 0:45:55.759
<v Speaker 1>those same sensors either creating apps specifically for smartphones. Thus,

0:45:55.840 --> 0:45:58.279
<v Speaker 1>you don't have to build any other hardware, you just

0:45:58.520 --> 0:46:02.120
<v Speaker 1>use existing hardware. But that limits how you can use it,

0:46:02.480 --> 0:46:05.040
<v Speaker 1>right because you don't typically wear your smartphone directly in

0:46:05.040 --> 0:46:08.359
<v Speaker 1>front of your face. Or they could end up taking

0:46:08.360 --> 0:46:12.760
<v Speaker 1>advantage of those new, smaller sensors and incorporate them directly

0:46:12.800 --> 0:46:16.160
<v Speaker 1>into brand new hardware like various types of wearables like

0:46:16.239 --> 0:46:18.200
<v Speaker 1>Google Glass for example, but that would be a few

0:46:18.200 --> 0:46:23.160
<v Speaker 1>more years. In two thousand and eleven, Nintendo launched the

0:46:23.280 --> 0:46:27.040
<v Speaker 1>Nintendo three D S, which included a camera. It was

0:46:27.080 --> 0:46:32.080
<v Speaker 1>you know, the three D capable handheld device and included

0:46:32.400 --> 0:46:36.000
<v Speaker 1>actually a pair of forward facing cameras so you could

0:46:36.280 --> 0:46:40.040
<v Speaker 1>take three D photos if you wanted to, and it

0:46:40.120 --> 0:46:43.480
<v Speaker 1>also had some a R software included with it. You

0:46:43.480 --> 0:46:48.400
<v Speaker 1>would get these special Nintendo cards kind of like playing cards,

0:46:49.239 --> 0:46:51.920
<v Speaker 1>and if you were to point the camera of the

0:46:51.960 --> 0:46:55.680
<v Speaker 1>three D S at the card and look at the screen,

0:46:56.280 --> 0:46:59.680
<v Speaker 1>you would see a little virtual three dimensional character pop

0:46:59.800 --> 0:47:03.320
<v Speaker 1>up on the card. So Mario would be an obvious example.

0:47:03.640 --> 0:47:06.200
<v Speaker 1>You put the Mario card down on the table, you

0:47:06.239 --> 0:47:07.920
<v Speaker 1>hold up the three D S, and you aim the

0:47:07.960 --> 0:47:10.000
<v Speaker 1>camera at the card, and you look at the screen

0:47:10.080 --> 0:47:13.120
<v Speaker 1>and there's Mario, and Mario appears to be jumping around

0:47:13.239 --> 0:47:17.600
<v Speaker 1>on your physical table. Now, obviously, if you look off

0:47:17.760 --> 0:47:21.680
<v Speaker 1>of the display, there's no Mario jumping around, but on

0:47:21.719 --> 0:47:24.600
<v Speaker 1>the display there he is, and it was pretty cute.

0:47:24.880 --> 0:47:28.920
<v Speaker 1>I remember being really impressed with this very simple implementation

0:47:28.960 --> 0:47:32.040
<v Speaker 1>of a R when we got our three DS, and

0:47:32.080 --> 0:47:35.319
<v Speaker 1>then I took our three DS apart, and then I

0:47:35.320 --> 0:47:38.520
<v Speaker 1>took pictures of it, and then I posted on Twitter

0:47:39.280 --> 0:47:43.359
<v Speaker 1>and people got sad, it's a great day. In two

0:47:43.400 --> 0:47:47.520
<v Speaker 1>thousand thirteen, Google introduced Google Glass that was the wearable

0:47:47.600 --> 0:47:51.719
<v Speaker 1>that included a small display position just above the right eye. Uh,

0:47:51.840 --> 0:47:55.239
<v Speaker 1>So when you look straightforward, you could tell that there

0:47:55.320 --> 0:47:59.840
<v Speaker 1>was something kind of above your natural eyeline. But it

0:48:00.800 --> 0:48:03.319
<v Speaker 1>get in the way too much. You too to look

0:48:03.320 --> 0:48:05.319
<v Speaker 1>at the screen. You actually had a glimpse, You had

0:48:05.320 --> 0:48:07.920
<v Speaker 1>a glance upward, and then you could see what was

0:48:07.960 --> 0:48:13.600
<v Speaker 1>on the display. Google Glass had augmented reality features like crazy.

0:48:13.680 --> 0:48:18.719
<v Speaker 1>You could see video calls. You could actually use the

0:48:19.520 --> 0:48:22.719
<v Speaker 1>glasses to not just to take a video call, but

0:48:22.800 --> 0:48:26.080
<v Speaker 1>show the other person what you are looking at so

0:48:26.239 --> 0:48:29.080
<v Speaker 1>they could see from your point of view. You could

0:48:29.080 --> 0:48:32.279
<v Speaker 1>also overlay directions, so if you're walking down street, you

0:48:32.320 --> 0:48:34.480
<v Speaker 1>could glance up at the screen and it would tell

0:48:34.520 --> 0:48:36.520
<v Speaker 1>you if you need to keep going straight or turn laughter,

0:48:36.600 --> 0:48:39.640
<v Speaker 1>turn right, that kind of thing. It was really useful. Um.

0:48:39.680 --> 0:48:42.400
<v Speaker 1>I had a pair of these Google Glass and I

0:48:42.480 --> 0:48:46.040
<v Speaker 1>really liked the direction they were going in. I felt

0:48:46.040 --> 0:48:49.040
<v Speaker 1>that it wasn't a fully realized product at the time,

0:48:49.080 --> 0:48:52.200
<v Speaker 1>and eventually Google agreed and after a couple of years

0:48:52.239 --> 0:48:55.239
<v Speaker 1>they took Google Glass off the market entirely, and now

0:48:55.280 --> 0:48:58.920
<v Speaker 1>you can't get them anymore. Uh. They were clever, but

0:48:59.280 --> 0:49:04.319
<v Speaker 1>they were expensive and they had some limitations, and like

0:49:04.440 --> 0:49:06.480
<v Speaker 1>I was saying earlier, you know, it's hard to build

0:49:06.520 --> 0:49:09.680
<v Speaker 1>all the components you need into one headset. So Google

0:49:09.719 --> 0:49:13.680
<v Speaker 1>Glass would communicate via Bluetooth to your smartphone, and your

0:49:13.719 --> 0:49:16.800
<v Speaker 1>smartphone would act as the actual nexus point to the Internet.

0:49:17.800 --> 0:49:21.080
<v Speaker 1>But it was a neat idea, uh, and I enjoyed

0:49:21.400 --> 0:49:24.040
<v Speaker 1>getting to use them while I did, so I keep

0:49:24.080 --> 0:49:28.440
<v Speaker 1>hoping to see a return of that kind of technology,

0:49:28.480 --> 0:49:32.360
<v Speaker 1>but perhaps in a more mature and less expensive format.

0:49:33.880 --> 0:49:37.680
<v Speaker 1>Now we've also seen applications similar to the ones we

0:49:37.719 --> 0:49:40.760
<v Speaker 1>mentioned earlier, the ones that are meant to guide people

0:49:40.920 --> 0:49:44.680
<v Speaker 1>into laying out or repairing a system. We've seen that

0:49:44.760 --> 0:49:47.960
<v Speaker 1>in the car world. Not too long ago, there was

0:49:48.040 --> 0:49:52.120
<v Speaker 1>the MARTA system introduced by Volkswagen. MARTA makes me chuckle

0:49:52.320 --> 0:49:55.640
<v Speaker 1>because that's also the name of Atlanta's public transportation system,

0:49:55.719 --> 0:49:58.840
<v Speaker 1>But in this case, it stands for Mobile Augmented Reality

0:49:58.880 --> 0:50:03.279
<v Speaker 1>Technical Assistance, and it's specifically designed for mechanics who are

0:50:03.320 --> 0:50:06.960
<v Speaker 1>working on the XL one vehicle. So if you hold

0:50:07.000 --> 0:50:10.480
<v Speaker 1>up an iPad that has this app on it, and

0:50:10.520 --> 0:50:13.040
<v Speaker 1>the camera has pointed at an XL one and you

0:50:13.080 --> 0:50:16.319
<v Speaker 1>look at the display, you'll see information overlaid on top

0:50:16.400 --> 0:50:20.360
<v Speaker 1>of the car, including labels for all the different parts.

0:50:20.360 --> 0:50:22.600
<v Speaker 1>So let's say you're a mechanic and you have to

0:50:22.640 --> 0:50:26.799
<v Speaker 1>do a specific repair on this vehicle. You hold up

0:50:26.840 --> 0:50:29.799
<v Speaker 1>the iPad, you look through the display, and you see

0:50:29.840 --> 0:50:31.600
<v Speaker 1>exactly what you need to do. It gives you a

0:50:31.600 --> 0:50:34.480
<v Speaker 1>set of instructions that shows you how you need to do.

0:50:34.560 --> 0:50:36.839
<v Speaker 1>It tells you where you need to stand based upon

0:50:36.960 --> 0:50:39.680
<v Speaker 1>the angle of the view. So if you hold it

0:50:39.760 --> 0:50:41.280
<v Speaker 1>up and it says no, you need to move about

0:50:41.320 --> 0:50:43.160
<v Speaker 1>a foot to the right, you can do that. Then

0:50:43.160 --> 0:50:45.080
<v Speaker 1>hold up the iPad again. I'll say, all right, you're

0:50:45.080 --> 0:50:48.280
<v Speaker 1>in the right spot. Make sure you loosen this particular

0:50:48.320 --> 0:50:51.319
<v Speaker 1>bolt first, that kind of thing. And it's meant to

0:50:51.400 --> 0:50:55.880
<v Speaker 1>be an interactive maintenance guide in a way maintenance and

0:50:55.920 --> 0:50:59.920
<v Speaker 1>repair guide. This is one of those applications of augmentary

0:51:00.080 --> 0:51:04.120
<v Speaker 1>lity I think is a no brainer to me. It's

0:51:04.120 --> 0:51:07.759
<v Speaker 1>a killer hap the idea of having an ability to

0:51:09.640 --> 0:51:14.399
<v Speaker 1>work with something you are not familiar with, but you're

0:51:14.440 --> 0:51:17.880
<v Speaker 1>able to leverage the expertise of people who either designed

0:51:17.880 --> 0:51:20.440
<v Speaker 1>it or built it, or just fully understand it and

0:51:20.520 --> 0:51:25.600
<v Speaker 1>get guidance based on their expertise in real time, so

0:51:25.640 --> 0:51:30.160
<v Speaker 1>you're not having to go and consult a an article

0:51:30.239 --> 0:51:34.000
<v Speaker 1>about it, or watch a YouTube video. You get step

0:51:34.000 --> 0:51:38.640
<v Speaker 1>by step instructions overlaid on top of your view of

0:51:38.680 --> 0:51:42.480
<v Speaker 1>that thing. To me, that's the most compelling use of

0:51:42.560 --> 0:51:45.520
<v Speaker 1>augmented reality from a practical standpoint. There are a lot

0:51:45.560 --> 0:51:47.640
<v Speaker 1>of other uses that they'll talk about towards the end

0:51:47.840 --> 0:51:50.160
<v Speaker 1>that I think are also really super cool. So don't

0:51:50.160 --> 0:51:53.959
<v Speaker 1>get me wrong, it's not the only one. But let's

0:51:54.000 --> 0:51:56.520
<v Speaker 1>move on to two thousand fifteen. That was when Microsoft

0:51:56.600 --> 0:51:59.920
<v Speaker 1>would unveil the hollow lens, something I still want to

0:52:00.000 --> 0:52:01.799
<v Speaker 1>try out. I have not had a chance to try

0:52:01.800 --> 0:52:04.880
<v Speaker 1>a hollow lens yet. That is a headset capable of

0:52:04.920 --> 0:52:07.880
<v Speaker 1>advanced a R applications everything from what I was just

0:52:07.960 --> 0:52:11.279
<v Speaker 1>talking about, giving you guidance, step by step instructions on

0:52:11.320 --> 0:52:14.000
<v Speaker 1>how to do like a repair job on say an

0:52:14.000 --> 0:52:18.600
<v Speaker 1>electrical outlet. You can even use a Skype system to

0:52:18.760 --> 0:52:22.520
<v Speaker 1>call an expert who can then view your point of

0:52:22.600 --> 0:52:27.080
<v Speaker 1>view and interact with that point of view. So let's

0:52:27.120 --> 0:52:31.480
<v Speaker 1>say I'm looking at the outlet. The expert electrician I'm

0:52:31.480 --> 0:52:35.319
<v Speaker 1>talking to can see what I see, and he or

0:52:35.360 --> 0:52:39.960
<v Speaker 1>she can also make notes on the display which shows

0:52:40.040 --> 0:52:42.880
<v Speaker 1>up in my field of view. So he or she

0:52:43.000 --> 0:52:45.920
<v Speaker 1>might circle a specific wire and say you need to

0:52:46.080 --> 0:52:48.399
<v Speaker 1>you need to remove that one first, and I know

0:52:48.719 --> 0:52:50.520
<v Speaker 1>I need to do that one first because I can

0:52:50.520 --> 0:52:52.640
<v Speaker 1>see which one they are talking about. Or they might

0:52:52.640 --> 0:52:56.680
<v Speaker 1>circle another wire and say, no, matter what you do,

0:52:56.680 --> 0:53:00.480
<v Speaker 1>don't cut this wire. Well, the toilet ups stairs will

0:53:00.480 --> 0:53:05.920
<v Speaker 1>explode like lethal weapon too, and I won't do that

0:53:06.040 --> 0:53:09.719
<v Speaker 1>because you know that guys like three days from retirement.

0:53:10.080 --> 0:53:13.680
<v Speaker 1>So I have a heart. But no, this is this

0:53:13.760 --> 0:53:17.480
<v Speaker 1>is a really neat idea, having this interactive ability to

0:53:18.320 --> 0:53:21.800
<v Speaker 1>overlay the information from the world, the digital world, onto

0:53:21.960 --> 0:53:24.839
<v Speaker 1>your physical world. And beyond that, the hollowns has lots

0:53:24.880 --> 0:53:28.600
<v Speaker 1>of other functions. It's not just something to do, you know,

0:53:29.000 --> 0:53:31.840
<v Speaker 1>home repairs around the house. You can also use it

0:53:31.840 --> 0:53:36.080
<v Speaker 1>for entertainment purposes, like you could create a screen that

0:53:36.160 --> 0:53:40.280
<v Speaker 1>can show you video from various sources and you can

0:53:40.320 --> 0:53:44.440
<v Speaker 1>assign it a place on a wall in your environment.

0:53:44.520 --> 0:53:48.120
<v Speaker 1>Let's say that you're in your living room and you

0:53:48.239 --> 0:53:50.920
<v Speaker 1>just create a screen so you can watch Netflix, and

0:53:50.960 --> 0:53:54.040
<v Speaker 1>you slap it on a wall and it will stay

0:53:54.400 --> 0:53:57.279
<v Speaker 1>in that same position relative to your point of view.

0:53:57.640 --> 0:53:59.400
<v Speaker 1>So if you look to the left or right, the

0:53:59.440 --> 0:54:02.600
<v Speaker 1>screens days where you put it as if it were

0:54:02.680 --> 0:54:05.239
<v Speaker 1>physically they're on your wall, but keep in mind it's

0:54:05.320 --> 0:54:08.319
<v Speaker 1>just a virtual screen, and when you look back to

0:54:08.400 --> 0:54:10.960
<v Speaker 1>that part of your wall, you'll see the virtual screen

0:54:11.000 --> 0:54:13.280
<v Speaker 1>there playing whatever it was that you wanted to watch.

0:54:14.600 --> 0:54:16.879
<v Speaker 1>I think that's a super cool idea. And they've also

0:54:16.880 --> 0:54:20.920
<v Speaker 1>shown off games like a game of Minecraft that uses

0:54:20.920 --> 0:54:23.920
<v Speaker 1>hollow lens so you can actually view a Minecraft world

0:54:24.680 --> 0:54:27.799
<v Speaker 1>sitting appearing to sit at any rate on top of

0:54:27.800 --> 0:54:31.080
<v Speaker 1>a table, so you can walk around the table and

0:54:31.200 --> 0:54:34.759
<v Speaker 1>view the Minecraft world from various angles and play that way.

0:54:35.520 --> 0:54:38.640
<v Speaker 1>I think that's super neat. Don't know how compelling it is,

0:54:38.640 --> 0:54:41.600
<v Speaker 1>because again I haven't tried it myself, but I really

0:54:41.640 --> 0:54:46.400
<v Speaker 1>like the idea. This year, two thousand and sixteen, are

0:54:46.480 --> 0:54:49.759
<v Speaker 1>got another big boost from a little game called Pokemon Go.

0:54:50.840 --> 0:54:54.040
<v Speaker 1>Although I have to admit this was a really primitive,

0:54:54.239 --> 0:54:58.200
<v Speaker 1>basic implementation of augmented reality. Really, it was not much

0:54:58.719 --> 0:55:00.839
<v Speaker 1>more than just a In fact, it was nothing more

0:55:01.280 --> 0:55:04.479
<v Speaker 1>than just an animated overlay that would exist on top

0:55:04.520 --> 0:55:08.759
<v Speaker 1>of the camera view of your of your device. So

0:55:08.800 --> 0:55:11.839
<v Speaker 1>I'd say I'm holding up my smartphone and I'm trying

0:55:11.840 --> 0:55:14.680
<v Speaker 1>to catch a Jiggli Puff, and the Jiggli Puff is

0:55:14.680 --> 0:55:17.799
<v Speaker 1>currently bouncing up and down on the sidewalk in front

0:55:17.840 --> 0:55:20.799
<v Speaker 1>of me. That's about as far as the augmented reality

0:55:21.040 --> 0:55:25.040
<v Speaker 1>actual experience would go. So very primitive. But because Pokemon

0:55:25.160 --> 0:55:30.239
<v Speaker 1>Go became so popular so quickly, it really pushed the

0:55:30.280 --> 0:55:33.600
<v Speaker 1>concept of a R to the front of the minds

0:55:33.640 --> 0:55:37.440
<v Speaker 1>of people everywhere, including business owners who immediately said, we

0:55:37.520 --> 0:55:41.600
<v Speaker 1>need an augmented reality app Whether they actually needed one

0:55:41.680 --> 0:55:44.360
<v Speaker 1>or not is beside the point. A lot of people

0:55:44.440 --> 0:55:47.840
<v Speaker 1>got into a are because of Pokemon Go, uh, for

0:55:47.960 --> 0:55:50.520
<v Speaker 1>both good and bad. I always think that you have

0:55:50.600 --> 0:55:52.719
<v Speaker 1>to come up with the experience first. You have to

0:55:53.400 --> 0:55:58.320
<v Speaker 1>understand why you need to use a specific strategy to

0:55:58.800 --> 0:56:03.640
<v Speaker 1>create a specific experience, and then build it. Not hey,

0:56:03.760 --> 0:56:08.319
<v Speaker 1>we need augmented reality makes something that's a R. To me,

0:56:08.440 --> 0:56:11.440
<v Speaker 1>that's the backwards way of going about it. But what

0:56:11.520 --> 0:56:14.520
<v Speaker 1>do I know. I'm not not a programmer, so I'm

0:56:14.520 --> 0:56:17.440
<v Speaker 1>sure the programmers feel in a similar way to me.

0:56:17.800 --> 0:56:21.360
<v Speaker 1>But that's just a guess. Now. The future of a

0:56:21.480 --> 0:56:24.719
<v Speaker 1>R depends heavily upon the applications we see in which

0:56:24.719 --> 0:56:28.360
<v Speaker 1>ones end up being successful and which ones aren't. Right now,

0:56:28.400 --> 0:56:30.799
<v Speaker 1>I would say that the best bet is to see

0:56:30.840 --> 0:56:35.600
<v Speaker 1>more a R features built into smartphones and tablets. Uh not,

0:56:35.680 --> 0:56:39.480
<v Speaker 1>maybe not necessarily built into them, but have apps available

0:56:39.560 --> 0:56:43.560
<v Speaker 1>that create a R experiences for very specific contexts. Like

0:56:43.640 --> 0:56:46.800
<v Speaker 1>let's say it's a museum app. You might download a

0:56:46.880 --> 0:56:48.960
<v Speaker 1>museum app on your phone, and when you go to

0:56:49.000 --> 0:56:51.680
<v Speaker 1>the museum and you use your phone, you can get

0:56:51.719 --> 0:56:55.440
<v Speaker 1>more information about the paintings and sculptures and other installations

0:56:55.440 --> 0:56:57.879
<v Speaker 1>that you see in the museum. That's an easy one

0:56:57.920 --> 0:57:00.680
<v Speaker 1>to understand. But that same app isn't going to be

0:57:00.760 --> 0:57:03.279
<v Speaker 1>useful once you leave the museum, you no longer have

0:57:03.520 --> 0:57:07.239
<v Speaker 1>the context that it is tied to. I think that

0:57:07.400 --> 0:57:10.680
<v Speaker 1>smartphones are probably going to be where the greatest development

0:57:10.719 --> 0:57:13.520
<v Speaker 1>is going to be in the near term, because wearables

0:57:13.560 --> 0:57:17.000
<v Speaker 1>is still really hard to do. We still don't have

0:57:17.240 --> 0:57:20.080
<v Speaker 1>a consumer version of the hollow lens out available for

0:57:20.120 --> 0:57:23.480
<v Speaker 1>anyone to purchase, and it may never come out as

0:57:23.480 --> 0:57:26.320
<v Speaker 1>a consumer product. Microsoft hasn't shown a whole lot of

0:57:26.360 --> 0:57:29.760
<v Speaker 1>interest in making it a consumer product. Maybe that will change,

0:57:30.240 --> 0:57:32.880
<v Speaker 1>but at the moment, I wouldn't hold my breath, so

0:57:33.600 --> 0:57:36.760
<v Speaker 1>I would argue smartphones and tablets are pretty much where

0:57:36.800 --> 0:57:41.760
<v Speaker 1>it's at maybe some implementation with some existing VR headsets

0:57:41.760 --> 0:57:44.480
<v Speaker 1>which have external cameras mounted on them as well, like

0:57:44.560 --> 0:57:48.040
<v Speaker 1>forward facing cameras. You could build a our experiences there.

0:57:48.240 --> 0:57:50.480
<v Speaker 1>Then it gets a little weird because you're you're also

0:57:50.600 --> 0:57:53.120
<v Speaker 1>you know, you're looking at a monitor, so you're looking

0:57:53.120 --> 0:57:55.640
<v Speaker 1>at a video feed of your surroundings, and on top

0:57:55.680 --> 0:57:57.760
<v Speaker 1>of the video feed, you get the overlay. Same thing

0:57:57.920 --> 0:57:59.919
<v Speaker 1>is true for your smartphones and tablets, by the way,

0:58:00.560 --> 0:58:03.960
<v Speaker 1>but different that from the Google glass implementation, where you're

0:58:03.960 --> 0:58:07.080
<v Speaker 1>looking at the actual physical world, not a video representation

0:58:07.120 --> 0:58:10.240
<v Speaker 1>of it, but the real world. And then because the

0:58:10.320 --> 0:58:13.760
<v Speaker 1>display itself that you are looking through is transparent, you're

0:58:13.800 --> 0:58:17.600
<v Speaker 1>looking at a transparent overlay of digital information that gives

0:58:17.600 --> 0:58:22.440
<v Speaker 1>you more info about the world you are in. I

0:58:22.480 --> 0:58:25.000
<v Speaker 1>think a R is super cool. I think it's really

0:58:25.040 --> 0:58:28.480
<v Speaker 1>got a lot of potential to change the world around

0:58:28.560 --> 0:58:30.320
<v Speaker 1>us and to change the way we interact with the

0:58:30.360 --> 0:58:35.720
<v Speaker 1>world around us. You could imagine a dystopian future implementation

0:58:35.760 --> 0:58:38.600
<v Speaker 1>of a R where we all have to wear glasses

0:58:38.640 --> 0:58:42.880
<v Speaker 1>and we're constantly getting personalized commercials beamed at us whenever

0:58:42.920 --> 0:58:46.600
<v Speaker 1>we look at anything, like imagine walking past a store

0:58:46.640 --> 0:58:48.880
<v Speaker 1>casually looking in the window and then getting a whole

0:58:48.880 --> 0:58:50.800
<v Speaker 1>bunch of ads for all the stuff that's in the

0:58:50.800 --> 0:58:53.760
<v Speaker 1>store window. That would be obnoxious, and it's easy to

0:58:53.840 --> 0:58:57.840
<v Speaker 1>understand how people would not want that, yet also easy

0:58:57.920 --> 0:59:00.200
<v Speaker 1>to understand how that could possibly become a few her

0:59:00.760 --> 0:59:04.280
<v Speaker 1>or think of the future where your privacy is no

0:59:04.320 --> 0:59:08.080
<v Speaker 1>longer even relevant, and you walk down the street and

0:59:08.120 --> 0:59:11.320
<v Speaker 1>you look at all the people's faces who are also

0:59:11.400 --> 0:59:15.000
<v Speaker 1>walking down the street, and you're getting names of everybody

0:59:15.040 --> 0:59:17.200
<v Speaker 1>and what they like and what they dislike. You know,

0:59:17.280 --> 0:59:19.840
<v Speaker 1>what music they tend to listen to, maybe what they're

0:59:19.880 --> 0:59:22.960
<v Speaker 1>listening to right now, And it's all because we've got

0:59:23.000 --> 0:59:27.520
<v Speaker 1>facial recognition technology. Almost everyone has some sort of social

0:59:27.560 --> 0:59:31.439
<v Speaker 1>media presence, so you can map that face to any

0:59:31.480 --> 0:59:34.480
<v Speaker 1>public profiles, try and find a match. If you found

0:59:34.480 --> 0:59:36.640
<v Speaker 1>a match, you can bring back information to the person

0:59:36.720 --> 0:59:41.400
<v Speaker 1>wearing the glasses. So I can look at somebody and say, oh, this, uh,

0:59:41.520 --> 0:59:45.200
<v Speaker 1>this cute kid over here, she's got she likes punk

0:59:45.280 --> 0:59:48.640
<v Speaker 1>rock music. I'm gonna I'm gonna impress her with my

0:59:49.400 --> 0:59:55.360
<v Speaker 1>knowledge of the cramps. That probably wouldn't work, but the

0:59:55.400 --> 0:59:58.240
<v Speaker 1>point being that's pretty creepy and invasive, and so there

0:59:58.280 --> 1:00:00.760
<v Speaker 1>are some negative implementations of a are that we have

1:00:00.800 --> 1:00:03.560
<v Speaker 1>to watch out for unless we get to a point

1:00:03.600 --> 1:00:05.840
<v Speaker 1>where we just don't care about privacy at all anymore.

1:00:06.320 --> 1:00:08.840
<v Speaker 1>Some would argue we're already there, and in that case,

1:00:09.040 --> 1:00:11.400
<v Speaker 1>this implementation of a R may not sound creepy at all.

1:00:11.400 --> 1:00:13.240
<v Speaker 1>It might just sound kind of cool, kind of the

1:00:13.280 --> 1:00:15.920
<v Speaker 1>equivalent of walking into a store seeing a person with

1:00:15.960 --> 1:00:18.640
<v Speaker 1>a name tag and addressing them by name. If they

1:00:18.640 --> 1:00:20.600
<v Speaker 1>don't remember they have a name tag on, they have

1:00:20.680 --> 1:00:23.160
<v Speaker 1>this moment where they think, do I know you? That.

1:00:23.200 --> 1:00:25.400
<v Speaker 1>If we're in a world where everyone can see everyone's

1:00:25.480 --> 1:00:27.800
<v Speaker 1>name all the time, then well, for one thing, I

1:00:27.840 --> 1:00:30.080
<v Speaker 1>won't ever have to worry about coming at a loss

1:00:30.080 --> 1:00:32.080
<v Speaker 1>when I have to introduce my wife to someone. So

1:00:32.200 --> 1:00:34.880
<v Speaker 1>that's a that's a plus side. Okay, I'm in favor

1:00:34.880 --> 1:00:37.280
<v Speaker 1>of it now, all right. Well that kind of wraps

1:00:37.400 --> 1:00:41.880
<v Speaker 1>up this deep dive on augmented reality. There's so many

1:00:41.920 --> 1:00:44.480
<v Speaker 1>different other things I could have mentioned. There's so much

1:00:44.480 --> 1:00:46.880
<v Speaker 1>more to the history and development of it and all

1:00:46.880 --> 1:00:50.000
<v Speaker 1>the pieces of technology, But this is one of those

1:00:50.600 --> 1:00:55.880
<v Speaker 1>strategies that arose because so many other elements had matured

1:00:56.000 --> 1:00:58.360
<v Speaker 1>over a great deal of time. If you guys have

1:00:58.400 --> 1:01:01.160
<v Speaker 1>any suggestions for future episodes tech Stuff, whether it's a

1:01:01.200 --> 1:01:06.120
<v Speaker 1>specific kind of technology, or how technology has affected us,

1:01:06.240 --> 1:01:09.560
<v Speaker 1>or how we affect tech. Maybe it's a specific company

1:01:09.680 --> 1:01:12.680
<v Speaker 1>or person in technology, or you just have a suggestion

1:01:12.760 --> 1:01:15.080
<v Speaker 1>for somebody I should have on the show, either as

1:01:15.160 --> 1:01:18.680
<v Speaker 1>a guest co host or someone I should interview. Let

1:01:18.680 --> 1:01:22.040
<v Speaker 1>me know, send me a message. The email to send

1:01:22.040 --> 1:01:25.640
<v Speaker 1>it to is tech Stuff at how stuff works dot com,

1:01:25.720 --> 1:01:27.720
<v Speaker 1>or drop me a line on social media. You can

1:01:27.760 --> 1:01:30.840
<v Speaker 1>find me on Twitter or on Facebook with the handle

1:01:31.000 --> 1:01:34.040
<v Speaker 1>tech stuff h s W and I'll talk to you

1:01:34.040 --> 1:01:42.880
<v Speaker 1>guys again really soon. For more on this and thousands

1:01:42.920 --> 1:01:55.040
<v Speaker 1>of other topics, is it how stuff works dot com.