WEBVTT - Rerun: Augmenting Your Reality

0:00:04.240 --> 0:00:07.240
<v Speaker 1>Welcome to tech Stuff, a production of I Heart Radios

0:00:07.320 --> 0:00:14.120
<v Speaker 1>How Stuff Works. Hey there, and welcome to tech Stuff.

0:00:14.200 --> 0:00:16.880
<v Speaker 1>I'm your host, Jonathan Strickland. I'm an executive producer with

0:00:16.960 --> 0:00:19.800
<v Speaker 1>iHeart Radio and I love all things tech, and we're

0:00:19.800 --> 0:00:23.239
<v Speaker 1>going to enjoy another classic episode of tech stuff in

0:00:23.239 --> 0:00:26.920
<v Speaker 1>this episode. And uh, I promised new ones are right

0:00:26.920 --> 0:00:29.360
<v Speaker 1>around the corner. I've got a cool one coming up

0:00:29.400 --> 0:00:32.159
<v Speaker 1>in just a little while that you definitely want to

0:00:32.240 --> 0:00:37.160
<v Speaker 1>check out. Very important, all about the dangers of browsing

0:00:37.159 --> 0:00:40.519
<v Speaker 1>the Internet in an insecure way. But before we get

0:00:40.520 --> 0:00:43.640
<v Speaker 1>to that, I thought maybe we could revisit this classic

0:00:43.680 --> 0:00:48.160
<v Speaker 1>episode in which I talk about augmented reality. So this

0:00:48.240 --> 0:00:52.080
<v Speaker 1>episode is titled Augmenting Your Reality, and it originally aired

0:00:52.120 --> 0:00:54.400
<v Speaker 1>just a couple of years ago, and I thought it'd

0:00:54.440 --> 0:00:56.800
<v Speaker 1>be fun to revisit it. So let's sit back and

0:00:56.920 --> 0:01:01.480
<v Speaker 1>enjoy this classic episode. So I thought I would do

0:01:01.800 --> 0:01:06.920
<v Speaker 1>a deeper dive, a bigger explanation about what augmented reality is,

0:01:06.959 --> 0:01:09.720
<v Speaker 1>what it's all about, how it works, and sort of

0:01:09.760 --> 0:01:15.560
<v Speaker 1>the applications we might put a r toward things that

0:01:15.720 --> 0:01:18.440
<v Speaker 1>you know, was it good for tons of stuff? As

0:01:18.440 --> 0:01:22.000
<v Speaker 1>it turns out, So the first thing we should do

0:01:22.600 --> 0:01:26.520
<v Speaker 1>is probably defined some terms, because if you haven't really

0:01:26.520 --> 0:01:30.080
<v Speaker 1>looked into augmented reality and you aren't familiar with a R,

0:01:30.520 --> 0:01:33.680
<v Speaker 1>you might just be lost. I'm gonna define it all

0:01:33.720 --> 0:01:36.320
<v Speaker 1>for you right now, because that's the kind of stand

0:01:36.400 --> 0:01:41.000
<v Speaker 1>up guy I am. Technically speaking, augmented reality is using

0:01:41.040 --> 0:01:45.840
<v Speaker 1>digital information to enhance or augment and experience in our

0:01:45.880 --> 0:01:51.000
<v Speaker 1>physical real world. So the way we usually see this

0:01:51.160 --> 0:01:56.120
<v Speaker 1>implemented involves some sort of display that has an image

0:01:56.120 --> 0:01:58.800
<v Speaker 1>of the real world on it and it overlays digital

0:01:58.840 --> 0:02:01.360
<v Speaker 1>information on top of that itch. So think of like

0:02:01.880 --> 0:02:06.160
<v Speaker 1>a camera's viewfinder, like an LCD screen on a camera,

0:02:06.280 --> 0:02:09.480
<v Speaker 1>and it actually labels the buildings that are in view.

0:02:09.520 --> 0:02:11.400
<v Speaker 1>When you're out on the street and you hold the

0:02:11.440 --> 0:02:15.760
<v Speaker 1>camera up or a smartphone or even a wearable device

0:02:15.840 --> 0:02:18.840
<v Speaker 1>like a head mounted display that you can look through

0:02:19.040 --> 0:02:21.120
<v Speaker 1>so you can see the real world, you're not just

0:02:21.160 --> 0:02:23.840
<v Speaker 1>staring at a screen, or if you are staring at

0:02:23.880 --> 0:02:26.720
<v Speaker 1>a screen, you're staring at a video feed that is

0:02:26.760 --> 0:02:29.560
<v Speaker 1>provided by an external camera amounted just on the other

0:02:29.600 --> 0:02:31.800
<v Speaker 1>side of the screen. So it's like you're looking through

0:02:32.120 --> 0:02:34.840
<v Speaker 1>a display in the first place, but then on top

0:02:34.960 --> 0:02:38.760
<v Speaker 1>of that view you have this digital information. That's the

0:02:38.800 --> 0:02:41.600
<v Speaker 1>most common implementation we talked about, but it's not the

0:02:41.639 --> 0:02:44.560
<v Speaker 1>only one. Augmented reality does not have to only be

0:02:44.880 --> 0:02:49.600
<v Speaker 1>or even involve visual information at all. You could have

0:02:49.720 --> 0:02:54.000
<v Speaker 1>audio only augmented reality, for example. But the whole idea

0:02:54.160 --> 0:02:58.519
<v Speaker 1>is that it's something that's created digitally to enhance your

0:02:58.720 --> 0:03:01.799
<v Speaker 1>experience in the real world. Now we can contrast this

0:03:02.320 --> 0:03:07.079
<v Speaker 1>with the concept of virtual reality. Virtual reality, of course,

0:03:07.680 --> 0:03:11.480
<v Speaker 1>is a term where you create an experience completely through

0:03:11.800 --> 0:03:16.240
<v Speaker 1>computer generated means computer is making all the things you

0:03:16.280 --> 0:03:20.280
<v Speaker 1>see and here, and maybe even beyond that if you

0:03:20.360 --> 0:03:23.960
<v Speaker 1>have really sophisticated UH setups, so you might have some

0:03:24.040 --> 0:03:27.400
<v Speaker 1>haptic feedback. Haptic refers to your sense of touch, So

0:03:27.440 --> 0:03:30.480
<v Speaker 1>if you have haptic feedback, that means you're getting information

0:03:30.520 --> 0:03:34.000
<v Speaker 1>feedback through your sense of touch. Common example of this

0:03:34.520 --> 0:03:38.280
<v Speaker 1>is a rumble pack inside a game controller, where you

0:03:38.320 --> 0:03:40.440
<v Speaker 1>know you fire a gun and a first person shooter

0:03:40.880 --> 0:03:44.800
<v Speaker 1>and your controller rumbles as a result, letting you know

0:03:44.920 --> 0:03:50.640
<v Speaker 1>that you are in fact, unleashing virtual destruction upon all

0:03:50.720 --> 0:03:54.240
<v Speaker 1>you survey. Well, the same thing can be true with

0:03:54.800 --> 0:03:58.680
<v Speaker 1>virtual reality set up. So virtual reality is all about

0:03:58.720 --> 0:04:04.480
<v Speaker 1>constructing an artificial reality a simulated reality. Augmented reality is

0:04:04.520 --> 0:04:07.680
<v Speaker 1>all about enhancing the one that we are actually in.

0:04:08.200 --> 0:04:13.160
<v Speaker 1>And then there's also mixed reality. Mixed reality is kind

0:04:13.160 --> 0:04:16.040
<v Speaker 1>of sort of in between the two. You might have

0:04:16.160 --> 0:04:21.240
<v Speaker 1>some physical objects within a room that are also mapped

0:04:21.279 --> 0:04:25.279
<v Speaker 1>to a virtual environment, and then you use something like

0:04:25.320 --> 0:04:28.880
<v Speaker 1>a head molded display to enter the virtual environment. That's

0:04:28.880 --> 0:04:32.159
<v Speaker 1>what looks like you're inside, but you have physical objects

0:04:32.160 --> 0:04:34.520
<v Speaker 1>in the room around you that are also mapped to

0:04:34.560 --> 0:04:37.040
<v Speaker 1>the virtual world, Meaning you could pick up this physical

0:04:37.080 --> 0:04:40.560
<v Speaker 1>object and you would see that reflected within the virtual world,

0:04:40.600 --> 0:04:43.480
<v Speaker 1>where you might pick up a sword and shield or

0:04:43.800 --> 0:04:48.320
<v Speaker 1>move a chair or something along those lines. So augmented reality,

0:04:48.400 --> 0:04:52.360
<v Speaker 1>virtual reality, and mixed reality are all kind of inter related,

0:04:52.680 --> 0:04:56.360
<v Speaker 1>so much so that their histories also are very much

0:04:56.440 --> 0:05:00.360
<v Speaker 1>inter related. And there's some people who try to elect

0:05:00.480 --> 0:05:05.080
<v Speaker 1>these different technologies, these different approaches and put them under

0:05:05.120 --> 0:05:08.120
<v Speaker 1>a common umbrella, and they tend to use the phrase

0:05:08.520 --> 0:05:12.880
<v Speaker 1>alternate reality, which is unfortunate because that's also a r

0:05:13.000 --> 0:05:15.960
<v Speaker 1>but alternate reality is kind of the umbrella for virtual,

0:05:16.240 --> 0:05:20.360
<v Speaker 1>augmented and mixed reality. Uh that that that kind of

0:05:20.400 --> 0:05:23.760
<v Speaker 1>gives you the definition of those basic terms, and it

0:05:23.839 --> 0:05:26.719
<v Speaker 1>is important to understand them because they're becoming more and

0:05:26.760 --> 0:05:30.839
<v Speaker 1>more important today. You are already probably aware of a

0:05:30.880 --> 0:05:33.760
<v Speaker 1>lot of VR headsets that are out there on the

0:05:33.800 --> 0:05:38.320
<v Speaker 1>market as well as VR uh Well, they're they're kind

0:05:38.320 --> 0:05:41.520
<v Speaker 1>of like cases that you slide your smartphone into, so

0:05:41.560 --> 0:05:46.599
<v Speaker 1>your smartphone becomes the actual display on a VR headset.

0:05:46.600 --> 0:05:50.000
<v Speaker 1>The headset itself is more or less just a head

0:05:50.080 --> 0:05:53.599
<v Speaker 1>mounted case for your phone. We've seen a lot of

0:05:53.600 --> 0:05:56.200
<v Speaker 1>those come out over the last few years. We've also

0:05:56.240 --> 0:05:59.359
<v Speaker 1>seen a lot of a R applications come out, typically

0:05:59.480 --> 0:06:03.039
<v Speaker 1>for things iPads and smartphones, but we've also seen some

0:06:03.080 --> 0:06:08.240
<v Speaker 1>hardware come out that for wearable devices that falls into

0:06:08.279 --> 0:06:11.040
<v Speaker 1>the augmented reality category, stuff like Google Glass, which i'll

0:06:11.040 --> 0:06:13.560
<v Speaker 1>talk about more a little bit later in this episode.

0:06:14.920 --> 0:06:19.640
<v Speaker 1>For augmented reality to work, to get this enhanced experience

0:06:19.760 --> 0:06:23.440
<v Speaker 1>of reality around you, there are a lot of technological

0:06:23.480 --> 0:06:27.880
<v Speaker 1>components that have to come together so that you actually

0:06:27.920 --> 0:06:31.960
<v Speaker 1>do get an experience that is meaningful. You know, you

0:06:31.960 --> 0:06:35.279
<v Speaker 1>you have to have technology that quote unquote knows where

0:06:35.320 --> 0:06:37.840
<v Speaker 1>you are and what you are looking at or what

0:06:37.920 --> 0:06:41.400
<v Speaker 1>you are close to in order to get that augmented experience.

0:06:41.760 --> 0:06:44.120
<v Speaker 1>It wouldn't do me any good. If I put on

0:06:44.440 --> 0:06:49.120
<v Speaker 1>an augmented reality headset, for example, and stared at let's

0:06:49.160 --> 0:06:53.680
<v Speaker 1>say a famous painting, and instead of getting information about

0:06:53.680 --> 0:06:56.600
<v Speaker 1>the famous painting, I see an exploded view of an

0:06:56.760 --> 0:07:00.080
<v Speaker 1>car engine, that would make no sense. So you have

0:07:00.120 --> 0:07:03.360
<v Speaker 1>to build in technologies in order for the a R

0:07:03.480 --> 0:07:07.120
<v Speaker 1>to understand what it is you're trying to do and

0:07:07.240 --> 0:07:10.560
<v Speaker 1>to augment that experience, which meant that we had to

0:07:10.600 --> 0:07:15.480
<v Speaker 1>wait a pretty good long time for the various technologies

0:07:16.080 --> 0:07:21.000
<v Speaker 1>that we used to to create this relationship to mature

0:07:21.200 --> 0:07:24.640
<v Speaker 1>to a point where it was possible. Sometimes we had

0:07:24.680 --> 0:07:26.760
<v Speaker 1>technologies that would allow us to do it, but it

0:07:26.920 --> 0:07:32.240
<v Speaker 1>required uh tethering headsets to very large computers, which meant

0:07:32.240 --> 0:07:35.600
<v Speaker 1>that you didn't have really any mobility, and uh it

0:07:35.720 --> 0:07:41.840
<v Speaker 1>really limited the usefulness of the actual application. In other cases,

0:07:42.600 --> 0:07:45.680
<v Speaker 1>you could say things like your head tracking technology was

0:07:45.720 --> 0:07:49.040
<v Speaker 1>absolutely necessary for a R to develop the way it did.

0:07:49.880 --> 0:07:54.000
<v Speaker 1>GPS technology as well. Remember it wasn't that long ago

0:07:54.520 --> 0:07:59.760
<v Speaker 1>that we ordinary mere mortals didn't have access to really

0:08:00.000 --> 0:08:04.880
<v Speaker 1>accurate GPS information. For a very long time, that was

0:08:04.960 --> 0:08:10.000
<v Speaker 1>purposefully made less accurate. It was a matter of national defense.

0:08:10.680 --> 0:08:13.400
<v Speaker 1>It wasn't until the nineties that you started to see

0:08:13.400 --> 0:08:19.600
<v Speaker 1>GPS become more accurate for the basic consumer. Wait, back

0:08:19.640 --> 0:08:22.840
<v Speaker 1>in the day, you might get accuracy of up to

0:08:22.920 --> 0:08:25.440
<v Speaker 1>around a hundred meters, which is not great if you're

0:08:25.480 --> 0:08:28.120
<v Speaker 1>looking for the next place to make your turn. If

0:08:28.200 --> 0:08:31.800
<v Speaker 1>it's a hundred meters away, that's that's pretty far. But

0:08:31.920 --> 0:08:34.160
<v Speaker 1>now it's within a few feet, so it's much better.

0:08:34.679 --> 0:08:36.880
<v Speaker 1>That sort of stuff all had to come together in

0:08:37.000 --> 0:08:42.640
<v Speaker 1>order for augmented reality to become a viable I almost

0:08:42.640 --> 0:08:45.800
<v Speaker 1>said a reality, but that just starts to sound redundant

0:08:46.720 --> 0:08:49.960
<v Speaker 1>at any rate. Let's talk about some of these technologies

0:08:50.240 --> 0:08:55.080
<v Speaker 1>we are We really need things like gyroscopes, accelerometers. These

0:08:55.120 --> 0:09:00.680
<v Speaker 1>help devices understand their orientation where they are in respect

0:09:00.760 --> 0:09:04.079
<v Speaker 1>to something else, like are they for a smartphone it

0:09:04.160 --> 0:09:06.280
<v Speaker 1>might be is it in landscape mode or portrait mode?

0:09:06.720 --> 0:09:10.679
<v Speaker 1>But for a head mounted display, it would help give

0:09:10.840 --> 0:09:13.680
<v Speaker 1>the the unit the information that needs to know which

0:09:13.679 --> 0:09:15.920
<v Speaker 1>way you're looking, like are you looking to the east

0:09:16.000 --> 0:09:19.520
<v Speaker 1>or to the west. That kind of thing. UH also

0:09:19.640 --> 0:09:24.760
<v Speaker 1>compasses obviously very important GPS sensors, image recognition software, but

0:09:24.800 --> 0:09:27.720
<v Speaker 1>has become really important so that when you are looking

0:09:27.880 --> 0:09:32.400
<v Speaker 1>at something UH, the system can actually identify what that is.

0:09:32.640 --> 0:09:35.319
<v Speaker 1>In some cases you can get around this. You can

0:09:35.360 --> 0:09:38.280
<v Speaker 1>design an a R system where let's say you make

0:09:38.320 --> 0:09:43.680
<v Speaker 1>a movie poster and the a R application has the

0:09:43.679 --> 0:09:46.840
<v Speaker 1>movie poster or animate in some way if you hold

0:09:46.960 --> 0:09:50.960
<v Speaker 1>up a smartphone that's running the appropriate app. So I'm

0:09:51.000 --> 0:09:54.360
<v Speaker 1>just gonna take a movie from my past that does

0:09:54.400 --> 0:09:57.360
<v Speaker 1>not have an a R movie poster or associated with it,

0:09:57.960 --> 0:09:59.880
<v Speaker 1>but one that I can talk about as if it.

0:10:00.000 --> 0:10:02.400
<v Speaker 1>We're a good example, and that has to be Big

0:10:02.440 --> 0:10:06.600
<v Speaker 1>Trouble in Little China, universally declared the best movie that

0:10:06.640 --> 0:10:09.240
<v Speaker 1>has ever been made. So you've got your Big Trouble

0:10:09.320 --> 0:10:12.000
<v Speaker 1>and Little China poster up on the wall, and you

0:10:12.040 --> 0:10:15.320
<v Speaker 1>hold up your smartphone and you activate your Big Trouble

0:10:15.360 --> 0:10:20.040
<v Speaker 1>and Little China movie marketing app, and the camera on

0:10:20.080 --> 0:10:22.760
<v Speaker 1>your phone detects the poster it you know the posters

0:10:22.800 --> 0:10:26.679
<v Speaker 1>there well. The app and the poster together are able

0:10:26.720 --> 0:10:30.600
<v Speaker 1>to construct the augmented experience because there have been elements

0:10:30.600 --> 0:10:33.360
<v Speaker 1>put into the poster that the app is looking for.

0:10:34.200 --> 0:10:37.600
<v Speaker 1>And once the app identifies that, like, it sees maybe

0:10:37.840 --> 0:10:40.560
<v Speaker 1>eight different points on the poster, and because of the

0:10:40.600 --> 0:10:44.200
<v Speaker 1>orientation of those points, it knows what angle it's at

0:10:44.280 --> 0:10:47.319
<v Speaker 1>what height it's at in relation to the phone, and

0:10:47.360 --> 0:10:52.199
<v Speaker 1>can give you on your display the augmented reality experience.

0:10:52.240 --> 0:10:55.480
<v Speaker 1>In this case, it's obviously Jack Burton and the pork

0:10:55.559 --> 0:10:59.640
<v Speaker 1>Chop Express eating a sandwich, because, as we know, the

0:10:59.679 --> 0:11:04.000
<v Speaker 1>most riveting scene in the movie unfolds in this way.

0:11:04.080 --> 0:11:07.440
<v Speaker 1>So that would be kind of an augmented reality experience

0:11:07.440 --> 0:11:11.240
<v Speaker 1>where you didn't have to worry about every possible application

0:11:11.280 --> 0:11:13.560
<v Speaker 1>down in the real world. You made it for something

0:11:13.679 --> 0:11:16.760
<v Speaker 1>very specific, which means in your software you can have

0:11:16.840 --> 0:11:22.320
<v Speaker 1>the camera look quote unquote for these particular points of

0:11:22.360 --> 0:11:26.680
<v Speaker 1>reference and thus create the augmented experience in that way.

0:11:26.880 --> 0:11:28.800
<v Speaker 1>If you want to take that and move it to

0:11:28.880 --> 0:11:32.760
<v Speaker 1>the real world where you can see augmented information about

0:11:32.920 --> 0:11:35.960
<v Speaker 1>just the world around you, it becomes way more complicated.

0:11:36.320 --> 0:11:39.800
<v Speaker 1>You have to have very sophisticated image recognition software so

0:11:39.840 --> 0:11:44.199
<v Speaker 1>that the camera picks up the images, the software processes

0:11:44.240 --> 0:11:47.360
<v Speaker 1>the information, identifies what those images are, and gives you

0:11:47.360 --> 0:11:52.240
<v Speaker 1>the relevant information. So working with all the sensors, augmented

0:11:52.240 --> 0:11:55.319
<v Speaker 1>reality can make this a possibility. So another example, let's

0:11:55.320 --> 0:11:58.559
<v Speaker 1>say you're out on the street in Atlanta. You're here

0:11:58.600 --> 0:12:01.680
<v Speaker 1>in my hometown at and a Georgia and you're looking

0:12:01.720 --> 0:12:03.400
<v Speaker 1>at a building and you wonder what it is, and

0:12:03.440 --> 0:12:05.440
<v Speaker 1>you hold up your phone and you've got your little

0:12:05.800 --> 0:12:08.840
<v Speaker 1>map app that allows you to look at a real

0:12:08.920 --> 0:12:12.240
<v Speaker 1>world setting and tells you information about it, and it

0:12:12.280 --> 0:12:14.760
<v Speaker 1>tells you it's the Georgia Aquarium. Well, first of all,

0:12:14.760 --> 0:12:16.720
<v Speaker 1>you would probably know that already because the signage there

0:12:16.800 --> 0:12:20.400
<v Speaker 1>is actually pretty good. But the point being that this

0:12:20.440 --> 0:12:24.040
<v Speaker 1>would be something that would tap into the GPS cordints

0:12:24.080 --> 0:12:26.280
<v Speaker 1>on your phone, so it would know where your location

0:12:26.440 --> 0:12:29.040
<v Speaker 1>was and help narrow that down. The compass would tell

0:12:29.040 --> 0:12:33.840
<v Speaker 1>it what direction you are facing the the camera angle.

0:12:33.880 --> 0:12:37.640
<v Speaker 1>Also we have some image recognition going on there. The

0:12:37.640 --> 0:12:41.520
<v Speaker 1>accelerometer tells the orientation of the phone itself. All of

0:12:41.520 --> 0:12:44.920
<v Speaker 1>this data together would give the software the information needed

0:12:45.280 --> 0:12:50.640
<v Speaker 1>for it to display the label Georgia Aquarium on your phone.

0:12:50.640 --> 0:12:55.120
<v Speaker 1>And it all happens in an instant that's pretty amazing. Typically,

0:12:55.160 --> 0:12:59.200
<v Speaker 1>you also have to have some other method to communicate

0:12:59.320 --> 0:13:03.280
<v Speaker 1>with a larger infrastructure, because we don't have the capability

0:13:03.400 --> 0:13:07.560
<v Speaker 1>of building an enormously powerful computer that has all this

0:13:07.720 --> 0:13:11.040
<v Speaker 1>real world information programmed into it and make it a

0:13:11.200 --> 0:13:15.440
<v Speaker 1>handheld or wearable device. So usually you have to pair

0:13:15.640 --> 0:13:19.400
<v Speaker 1>these devices with some other larger infrastructure. Sometimes it's a

0:13:19.440 --> 0:13:23.360
<v Speaker 1>double handshake. For example, with Google Glass, you would use

0:13:23.360 --> 0:13:26.800
<v Speaker 1>Bluetooth to connect Google Glass to a smartphone. Then the

0:13:26.840 --> 0:13:30.680
<v Speaker 1>smartphone would have the connection to the larger internet through

0:13:30.760 --> 0:13:38.760
<v Speaker 1>your smartphones UH cell service provider. So while you're experiencing

0:13:39.600 --> 0:13:43.080
<v Speaker 1>the augmented reality through the Google Glass, it's actually communicating

0:13:43.080 --> 0:13:45.559
<v Speaker 1>through your phone to the infrastructure to get the data

0:13:45.600 --> 0:13:48.040
<v Speaker 1>it needs to show you the information. It's showing you

0:13:49.000 --> 0:13:51.720
<v Speaker 1>very important elements. And all of these components, like I said,

0:13:51.840 --> 0:13:55.600
<v Speaker 1>came together more or less around the same time. Most

0:13:55.640 --> 0:13:59.040
<v Speaker 1>of them were being developed independently of each other, and

0:13:59.080 --> 0:14:01.960
<v Speaker 1>it's just that now we're seeing them all converge. That's

0:14:01.960 --> 0:14:05.840
<v Speaker 1>an old favorite word here at tech stuff converge together

0:14:05.880 --> 0:14:08.840
<v Speaker 1>to create the augmented reality experience and make it possible.

0:14:09.640 --> 0:14:11.800
<v Speaker 1>So how did we get here? How did these different

0:14:11.800 --> 0:14:16.000
<v Speaker 1>elements develop? Well, there are a whole bunch of technology

0:14:16.000 --> 0:14:19.440
<v Speaker 1>pioneers who really create the foundation for augmented reality as

0:14:19.440 --> 0:14:22.440
<v Speaker 1>well as virtual reality and mixed reality. But one that

0:14:22.520 --> 0:14:25.040
<v Speaker 1>I think we really need to concentrate on at first

0:14:25.120 --> 0:14:29.960
<v Speaker 1>is Ivan Sutherland. Now. Sutherland was born in Hastings, Nebraska

0:14:30.040 --> 0:14:33.080
<v Speaker 1>in nineteen thirty eight, and as a kid, he was

0:14:33.160 --> 0:14:39.400
<v Speaker 1>fascinated with mathematics, particularly geometry, and also with engineering. He

0:14:39.440 --> 0:14:42.800
<v Speaker 1>began to study and experiment with computers while he was

0:14:42.840 --> 0:14:45.440
<v Speaker 1>in school, and this was at a time where personal

0:14:45.440 --> 0:14:48.160
<v Speaker 1>computers weren't a thing. There were no personal computers at

0:14:48.160 --> 0:14:50.720
<v Speaker 1>this point. Computers were actually pretty rare, and they were huge,

0:14:51.280 --> 0:14:55.320
<v Speaker 1>and in fact, they often would rely upon physical media

0:14:55.400 --> 0:14:59.320
<v Speaker 1>formats like punch punch cards or paper tape to read

0:14:59.400 --> 0:15:03.320
<v Speaker 1>a program him. So you didn't even have a disc

0:15:03.480 --> 0:15:07.160
<v Speaker 1>or like certainly nothing like a USB thumb drive or

0:15:07.200 --> 0:15:09.920
<v Speaker 1>anything like that. You you actually had to put physical

0:15:10.000 --> 0:15:13.240
<v Speaker 1>media into the machine for it to read and then

0:15:13.360 --> 0:15:18.040
<v Speaker 1>execute whatever program you had designed for that device. He

0:15:18.280 --> 0:15:21.720
<v Speaker 1>went to college at what is now Carnegie Mellon University

0:15:21.760 --> 0:15:24.440
<v Speaker 1>on a full scholarship. He graduated with a Bachelor of

0:15:24.480 --> 0:15:26.440
<v Speaker 1>Science degree. He would then go on to earn a

0:15:26.480 --> 0:15:29.800
<v Speaker 1>master's degree at cal Tech and a PhD in electrical

0:15:29.880 --> 0:15:34.000
<v Speaker 1>engineering from m I T And actually his doctoral thesis

0:15:34.000 --> 0:15:38.560
<v Speaker 1>supervisor was Claude Shannon. And we talked about Claude Shannon

0:15:38.600 --> 0:15:40.720
<v Speaker 1>back in the two thousand and fourteen episode Who Is

0:15:40.760 --> 0:15:44.960
<v Speaker 1>Claude Shannon. Um. We recorded that not too long after

0:15:45.120 --> 0:15:48.280
<v Speaker 1>Shannon's passing, So if you want to hear a really

0:15:48.320 --> 0:15:52.280
<v Speaker 1>interesting story about a pioneer in computer science, you should

0:15:52.280 --> 0:15:55.360
<v Speaker 1>go check out that two thousand and fourteen episode. Back

0:15:55.400 --> 0:15:59.720
<v Speaker 1>to Sutherland. For his thesis, he created something called sketch

0:15:59.760 --> 0:16:02.720
<v Speaker 1>pack AD and that was really, by most accounts, the

0:16:02.800 --> 0:16:07.520
<v Speaker 1>first computer graphical user interface, or gooey. A graphical user

0:16:07.560 --> 0:16:13.520
<v Speaker 1>interface means that you interact with the computer through graphics

0:16:13.600 --> 0:16:17.560
<v Speaker 1>representing various commands on the computer. Windows and the Mac

0:16:17.560 --> 0:16:21.720
<v Speaker 1>operating system are both examples of graphical user interfaces, as

0:16:21.760 --> 0:16:23.880
<v Speaker 1>is the interface on your smartphone. If you have a

0:16:23.920 --> 0:16:28.760
<v Speaker 1>smartphone where you choose applications on a screen, that's a

0:16:28.800 --> 0:16:32.520
<v Speaker 1>graphical user interface. Well, Sutherland created what is largely considered

0:16:32.560 --> 0:16:35.320
<v Speaker 1>to be the first one of those. After college, he

0:16:35.400 --> 0:16:38.720
<v Speaker 1>entered military service and he was assigned to the National

0:16:38.760 --> 0:16:43.880
<v Speaker 1>Security Agency. We have great friends there. I assume I'm

0:16:43.880 --> 0:16:48.080
<v Speaker 1>sure they're listening, because they're listening to everything at any rate.

0:16:48.160 --> 0:16:50.680
<v Speaker 1>He entered the n s A as an electrical engineer,

0:16:50.800 --> 0:16:53.880
<v Speaker 1>and in nineteen sixty four he replaced J. C. R.

0:16:54.000 --> 0:16:58.040
<v Speaker 1>Lick Lighter as the head of DARPA's Information Processing Techniques

0:16:58.160 --> 0:17:03.240
<v Speaker 1>Office or ip TO. And also by back then DARPA

0:17:03.360 --> 0:17:07.600
<v Speaker 1>wasn't DARPA, it was just ARPA. Uh so this is

0:17:07.640 --> 0:17:09.439
<v Speaker 1>the same group, by the way, that would end up

0:17:09.480 --> 0:17:12.720
<v Speaker 1>doing a lot of work that would form the ARPANETTE

0:17:12.880 --> 0:17:15.800
<v Speaker 1>a few years later, and the Arpanette was the predecessor

0:17:15.920 --> 0:17:19.560
<v Speaker 1>to the Internet in some ways. At least, the ARPANET

0:17:19.720 --> 0:17:23.760
<v Speaker 1>was what ended up being the building blocks for the

0:17:23.800 --> 0:17:27.439
<v Speaker 1>infrastructure that would become the Internet. Now, all of that

0:17:27.480 --> 0:17:32.159
<v Speaker 1>work happened after Sutherland had already departed the organization. His

0:17:32.200 --> 0:17:37.280
<v Speaker 1>work became a fundamental component of both virtual and augmented reality.

0:17:37.320 --> 0:17:40.919
<v Speaker 1>As I mentioned earlier in n he wrote a piece

0:17:41.000 --> 0:17:43.280
<v Speaker 1>and essay. It's very short, it's very easy read, and

0:17:43.320 --> 0:17:45.680
<v Speaker 1>you can find it online. The title of the essay

0:17:45.800 --> 0:17:48.960
<v Speaker 1>is the Ultimate Display. And if you ever do any

0:17:49.040 --> 0:17:53.320
<v Speaker 1>research and virtual reality or augmented reality, this essay is

0:17:53.359 --> 0:17:56.280
<v Speaker 1>going to pop up in your research, so go ahead

0:17:56.280 --> 0:17:58.199
<v Speaker 1>and read it. It's like two pages long, so it

0:17:58.240 --> 0:18:03.040
<v Speaker 1>goes very quickly. In that essay he talked about several ideas,

0:18:03.119 --> 0:18:10.000
<v Speaker 1>including the idealized display, the ultimate display, something that would

0:18:10.040 --> 0:18:15.200
<v Speaker 1>be the furthest you could go with display technology. Now,

0:18:15.280 --> 0:18:18.440
<v Speaker 1>keep in mind in his time, by his time, he's

0:18:18.440 --> 0:18:20.439
<v Speaker 1>still alive, by the way, but this time in the

0:18:20.480 --> 0:18:25.440
<v Speaker 1>nineties sixties, he uh, you know, things were just restricted

0:18:25.480 --> 0:18:30.159
<v Speaker 1>to monitors. You might have a light pen, but usually

0:18:30.200 --> 0:18:33.760
<v Speaker 1>you would just use a keyboard. Like it was pretty

0:18:33.760 --> 0:18:36.880
<v Speaker 1>bare bones. But he said, let's push this as far

0:18:36.920 --> 0:18:40.760
<v Speaker 1>as we can imagine it. And in his example, he

0:18:40.840 --> 0:18:44.840
<v Speaker 1>thought of a room that would be completely controlled by computers.

0:18:44.880 --> 0:18:48.680
<v Speaker 1>Everything you would experience within that room would be generated

0:18:48.720 --> 0:18:53.800
<v Speaker 1>by a computer. Everything you see here, smell, taste, and touch,

0:18:54.680 --> 0:18:57.240
<v Speaker 1>all of it generated by computers. The computer would even

0:18:57.280 --> 0:19:01.520
<v Speaker 1>be able to form physical objects out of ror matter itself. Now,

0:19:01.560 --> 0:19:05.200
<v Speaker 1>he wasn't suggesting that this would ever be a device

0:19:05.280 --> 0:19:07.280
<v Speaker 1>that we would actually be able to build. He was

0:19:07.320 --> 0:19:13.000
<v Speaker 1>just saying, what is the ultimate incarnation of display technology?

0:19:13.280 --> 0:19:16.080
<v Speaker 1>And if you read it, you realize, oh, this is

0:19:16.080 --> 0:19:18.760
<v Speaker 1>where the Star Trek Next Generation writers got their idea

0:19:18.800 --> 0:19:22.359
<v Speaker 1>for the Holidack. But unlike Star Trek the Next Generation,

0:19:22.520 --> 0:19:25.760
<v Speaker 1>the Ultimate Display would not go on the fritz every

0:19:25.760 --> 0:19:28.760
<v Speaker 1>other episode and try to kill the crew. It was

0:19:28.800 --> 0:19:34.480
<v Speaker 1>better than that. The Ultimate Display was sort of a

0:19:33.960 --> 0:19:39.480
<v Speaker 1>a foundational like Philosophically, it was foundational for virtual reality

0:19:39.520 --> 0:19:43.320
<v Speaker 1>and augmented reality. This idea of a very immersive experience

0:19:43.400 --> 0:19:48.280
<v Speaker 1>where you, as a user are surrounded somehow by this

0:19:49.280 --> 0:19:53.600
<v Speaker 1>computer generated experience. And that's true both with augmented reality

0:19:53.640 --> 0:19:58.119
<v Speaker 1>and virtual reality and augmented reality. The real world is

0:19:58.160 --> 0:20:01.840
<v Speaker 1>still there, but you get this enhanced experience that is

0:20:01.880 --> 0:20:09.000
<v Speaker 1>completely computer generated. So in nineteen sixty eight, Sutherland and

0:20:09.240 --> 0:20:13.359
<v Speaker 1>a student named Danny Cohen would create a VR, a

0:20:13.720 --> 0:20:18.680
<v Speaker 1>R head mounted display or h MD, and they nicknamed

0:20:18.720 --> 0:20:23.000
<v Speaker 1>it the Sword of Damocles. Why because you had to

0:20:23.040 --> 0:20:25.280
<v Speaker 1>suspend it from the ceiling. It was too heavy to

0:20:25.359 --> 0:20:28.240
<v Speaker 1>wear on your head. You needed it to be nice

0:20:28.280 --> 0:20:33.000
<v Speaker 1>and sturdy. Uh. It included transparent lenses, which meant you

0:20:33.040 --> 0:20:36.920
<v Speaker 1>could overlay computer information on the lenses themselves, and thus

0:20:36.960 --> 0:20:39.520
<v Speaker 1>you could look through the lenses at the real world

0:20:39.880 --> 0:20:43.880
<v Speaker 1>and have these wire framed graphics on top of what

0:20:43.920 --> 0:20:46.639
<v Speaker 1>you were looking at. And it also had a magnetic

0:20:46.680 --> 0:20:50.479
<v Speaker 1>tracking system, meaning that it had sensors that could detect

0:20:50.520 --> 0:20:53.520
<v Speaker 1>magnetic fields, and as you turned your head or you

0:20:54.040 --> 0:20:57.399
<v Speaker 1>change the inclination of your head, it would change the

0:20:57.440 --> 0:21:01.040
<v Speaker 1>magnetic field and this would be relayed as a command

0:21:01.280 --> 0:21:05.880
<v Speaker 1>to the visual center. The actual lenses themselves, so that

0:21:06.000 --> 0:21:09.400
<v Speaker 1>it would the change would be reflected in what you saw.

0:21:10.240 --> 0:21:12.480
<v Speaker 1>So if you have a virtual environment and you turn

0:21:12.520 --> 0:21:14.920
<v Speaker 1>your head to the left, you want the view within

0:21:14.960 --> 0:21:17.480
<v Speaker 1>the virtual environment to go to the left too, But

0:21:17.560 --> 0:21:20.480
<v Speaker 1>without head tracking technology that's impossible. So this was a

0:21:20.600 --> 0:21:23.399
<v Speaker 1>very early example of head tracking technology. And again it

0:21:23.520 --> 0:21:27.560
<v Speaker 1>used magnets magnetic fields in order to do that. Uh.

0:21:28.160 --> 0:21:32.480
<v Speaker 1>Obviously it's also really important for augmented reality. Again, if

0:21:32.520 --> 0:21:35.600
<v Speaker 1>the A R system doesn't detect that you are looking around,

0:21:36.160 --> 0:21:39.840
<v Speaker 1>then you're not getting relevant information, not for the specific

0:21:39.880 --> 0:21:43.840
<v Speaker 1>thing you are looking at. Anyway, as I said, the

0:21:43.840 --> 0:21:46.520
<v Speaker 1>graphics were pretty primitive. There were wire frame drawings, but

0:21:46.600 --> 0:21:52.399
<v Speaker 1>they still showed that this was a viable approach to

0:21:52.480 --> 0:21:57.200
<v Speaker 1>technology using HMD for augmented or virtual reality use. Oh

0:21:57.240 --> 0:21:59.560
<v Speaker 1>and one other note I should make so a lot

0:21:59.600 --> 0:22:01.960
<v Speaker 1>of people say the sort of Damocles was the first

0:22:02.000 --> 0:22:04.800
<v Speaker 1>head mounted display, and they say, you know, this is

0:22:04.840 --> 0:22:09.000
<v Speaker 1>the first HMD was made in nineteen I take issue

0:22:09.080 --> 0:22:12.920
<v Speaker 1>with that. I don't think of the sort of Damocles

0:22:13.080 --> 0:22:16.359
<v Speaker 1>as the first head mounted display. That to me should

0:22:16.359 --> 0:22:20.760
<v Speaker 1>go to a different invention called the head site h

0:22:20.800 --> 0:22:23.679
<v Speaker 1>E A D s I g h T. Now that

0:22:23.800 --> 0:22:27.720
<v Speaker 1>was developed by Phil Coo, and unlike the sort of Damocles,

0:22:28.280 --> 0:22:32.679
<v Speaker 1>it didn't create a virtual world. Instead, the head site

0:22:32.920 --> 0:22:37.400
<v Speaker 1>was sort of a remote viewfinder for a video camera.

0:22:38.440 --> 0:22:43.120
<v Speaker 1>So imagine that you've got a camera mounted on a mechanical,

0:22:43.800 --> 0:22:48.000
<v Speaker 1>uh swiveling mount, so you can move it left right,

0:22:48.040 --> 0:22:51.679
<v Speaker 1>you can change the orientation the inclination as well, and

0:22:51.720 --> 0:22:54.520
<v Speaker 1>then you have that mapped to a head mounted display,

0:22:54.640 --> 0:22:56.879
<v Speaker 1>so that if I put the display on and I

0:22:56.920 --> 0:22:59.440
<v Speaker 1>looked to the left, the camera pans to the left.

0:22:59.600 --> 0:23:01.360
<v Speaker 1>If I to the right, it pans to the right.

0:23:01.440 --> 0:23:04.879
<v Speaker 1>That sort of thing. It was meant to be a

0:23:04.920 --> 0:23:08.359
<v Speaker 1>way for for people to operate a camera in a

0:23:08.400 --> 0:23:12.280
<v Speaker 1>remote location that might not be very friendly to a

0:23:12.400 --> 0:23:16.520
<v Speaker 1>human being standing there. For example, the exterior of an aircraft.

0:23:16.880 --> 0:23:18.960
<v Speaker 1>You could have a camera mounted on the outside of

0:23:18.960 --> 0:23:22.520
<v Speaker 1>your aircraft that would allow an engineer on the inside

0:23:22.960 --> 0:23:27.439
<v Speaker 1>to look around and maybe help a pilot land or

0:23:27.680 --> 0:23:31.280
<v Speaker 1>navigate in a dangerous situation, or just get an idea

0:23:31.440 --> 0:23:35.880
<v Speaker 1>of the status of the aircraft itself. This was very

0:23:35.960 --> 0:23:40.440
<v Speaker 1>much a technology that was being pushed by the military,

0:23:41.440 --> 0:23:45.320
<v Speaker 1>an idea to create more military uses using this technology.

0:23:45.359 --> 0:23:50.040
<v Speaker 1>To make the military more competent, more adept at very

0:23:50.119 --> 0:23:55.959
<v Speaker 1>rapid changing situations on the technology front. So headsite proceeded

0:23:56.400 --> 0:23:59.240
<v Speaker 1>the sort of Damocles by about seven years. That came

0:23:59.240 --> 0:24:01.879
<v Speaker 1>out around night sixty one. But again it wasn't a

0:24:01.960 --> 0:24:05.520
<v Speaker 1>virtual reality headset or an augmented reality headset. It was

0:24:05.600 --> 0:24:08.480
<v Speaker 1>kind of a like I said, a remote viewfinder. But

0:24:08.600 --> 0:24:12.240
<v Speaker 1>still I consider that to be the earliest head mounted display,

0:24:12.400 --> 0:24:16.119
<v Speaker 1>not the sort of Damocles. However, Sutherland would end up

0:24:16.119 --> 0:24:19.240
<v Speaker 1>going on to make lots of other contributions in computer

0:24:19.280 --> 0:24:23.639
<v Speaker 1>graphics as well as the overall concepts that would guide

0:24:23.640 --> 0:24:27.560
<v Speaker 1>both virtual reality and augmented reality development over the next

0:24:27.600 --> 0:24:31.639
<v Speaker 1>several decades. But now it'll be time for me to

0:24:31.720 --> 0:24:34.560
<v Speaker 1>kind of move away from Sutherland to talk about some

0:24:34.640 --> 0:24:37.919
<v Speaker 1>other developments that were important in a R. And before

0:24:37.920 --> 0:24:41.280
<v Speaker 1>I get to that, let's take a quick break to

0:24:41.400 --> 0:24:52.280
<v Speaker 1>thank our sponsor. All Right, we just left off with

0:24:52.359 --> 0:24:57.520
<v Speaker 1>Ivan Sutherland. Now let's talk about a different father of

0:24:57.800 --> 0:25:03.719
<v Speaker 1>augmented reality, Myra Krueger or Doctor Myron Krueger. In nineteen

0:25:03.840 --> 0:25:07.520
<v Speaker 1>seventy four, Dr Krueger created an augmented reality lab called

0:25:07.760 --> 0:25:12.640
<v Speaker 1>video Place. Uh, he was really into this idea of

0:25:12.680 --> 0:25:17.719
<v Speaker 1>seeing the interaction of technology and people in artistic ways.

0:25:18.240 --> 0:25:23.480
<v Speaker 1>He really wanted to explore artistic expressions using technology and

0:25:23.600 --> 0:25:27.880
<v Speaker 1>people working together. So he wanted to create an artificial

0:25:27.960 --> 0:25:32.040
<v Speaker 1>reality environment that didn't require the user to wear special equipment.

0:25:32.080 --> 0:25:34.280
<v Speaker 1>You wouldn't have to put on a head mold display,

0:25:34.560 --> 0:25:39.760
<v Speaker 1>or wear special gloves, or use any kind of device

0:25:39.920 --> 0:25:44.000
<v Speaker 1>to control your actions, because that's a barrier between you

0:25:44.080 --> 0:25:50.000
<v Speaker 1>and the experience. Instead, his version consisted of a laboratory

0:25:50.040 --> 0:25:54.879
<v Speaker 1>that had several rooms all network together, and each room

0:25:55.040 --> 0:25:58.240
<v Speaker 1>had a video camera in it and a projector and

0:25:58.320 --> 0:26:01.119
<v Speaker 1>a screen. Now, the video camera would pick up the

0:26:01.200 --> 0:26:04.720
<v Speaker 1>motions of the person inside the room, it would send

0:26:05.000 --> 0:26:08.760
<v Speaker 1>information to the projector, which would then project the person's

0:26:08.960 --> 0:26:12.520
<v Speaker 1>silhouette on the screen. And the silhouette was typically a

0:26:12.560 --> 0:26:15.800
<v Speaker 1>really bright color, and you could move around and your

0:26:15.840 --> 0:26:18.159
<v Speaker 1>silhouette would move around, so you almost became like a

0:26:18.160 --> 0:26:22.119
<v Speaker 1>puppet master controlling your own silhouette. But then he started

0:26:22.119 --> 0:26:26.280
<v Speaker 1>to incorporate other things, like other elements that were virtually

0:26:26.359 --> 0:26:29.480
<v Speaker 1>on the screen. The projector was projecting things that were

0:26:29.520 --> 0:26:32.600
<v Speaker 1>on the screen but not in the actual real room itself.

0:26:32.840 --> 0:26:35.359
<v Speaker 1>So imagine a ball and a ball is being projected

0:26:35.480 --> 0:26:37.879
<v Speaker 1>on the screen. Well, you could move around so that

0:26:37.880 --> 0:26:39.960
<v Speaker 1>your silhouette would interact with the ball and the ball

0:26:40.000 --> 0:26:43.720
<v Speaker 1>would bounce away. That sort of thing. So you would

0:26:43.720 --> 0:26:46.840
<v Speaker 1>be able to interact with virtual environments by moving around

0:26:46.840 --> 0:26:52.439
<v Speaker 1>in a real physical space. And while those objects weren't

0:26:52.520 --> 0:26:55.480
<v Speaker 1>really there in front of you, you could see the

0:26:55.520 --> 0:26:58.520
<v Speaker 1>representation of them on the screen. And this was really

0:26:58.520 --> 0:27:00.919
<v Speaker 1>powerful stuff. And remember I said these rooms were all

0:27:00.960 --> 0:27:04.200
<v Speaker 1>network together, so you could actually have a system where

0:27:04.359 --> 0:27:06.640
<v Speaker 1>a person in one room and a person in another

0:27:06.720 --> 0:27:11.560
<v Speaker 1>room both have their silhouettes projected together in their respective

0:27:11.720 --> 0:27:16.760
<v Speaker 1>rooms on the screen, and your silhouette would be one color,

0:27:16.840 --> 0:27:19.840
<v Speaker 1>the other person's silhouette would be a different color, and

0:27:19.920 --> 0:27:24.720
<v Speaker 1>you could interact with one another. And according to reports

0:27:25.119 --> 0:27:30.119
<v Speaker 1>from this art experiment, they noticed that whenever people would

0:27:30.160 --> 0:27:33.879
<v Speaker 1>have their silhouettes cross one another, they would actually recoil

0:27:34.760 --> 0:27:37.480
<v Speaker 1>in their physical rooms. Keep in mind, they're in different rooms,

0:27:37.600 --> 0:27:40.240
<v Speaker 1>they're not in the same one together, they would recoil

0:27:40.359 --> 0:27:44.040
<v Speaker 1>as if they had made physical contact or bumped into someone.

0:27:44.720 --> 0:27:47.480
<v Speaker 1>So it showed that there was a very powerful psychological

0:27:47.600 --> 0:27:52.159
<v Speaker 1>element to this virtual presence. And again that psychological element

0:27:52.200 --> 0:27:55.600
<v Speaker 1>plays a huge important role in VR and a R

0:27:55.720 --> 0:27:58.800
<v Speaker 1>research and development, not just for creating products, but just

0:27:58.880 --> 0:28:03.520
<v Speaker 1>to understand how we process information and incorporated into our

0:28:03.520 --> 0:28:08.359
<v Speaker 1>sense of reality. Not to get too deep for you guys,

0:28:08.400 --> 0:28:12.159
<v Speaker 1>So experimentation in the field continued over the years. In

0:28:12.200 --> 0:28:15.280
<v Speaker 1>the early nine eighties, Dr Krueger would write a book

0:28:15.560 --> 0:28:19.520
<v Speaker 1>and publish it about artificial realities. But while the principles

0:28:19.560 --> 0:28:24.520
<v Speaker 1>for augmented reality were established, the technologies were still rather unwieldy.

0:28:24.560 --> 0:28:28.360
<v Speaker 1>They were large, they weren't reliable, and it would require

0:28:28.520 --> 0:28:32.439
<v Speaker 1>several years of work to improve those technologies, to create

0:28:32.760 --> 0:28:36.240
<v Speaker 1>miniaturization strategies to get the elements down to a size

0:28:36.280 --> 0:28:39.920
<v Speaker 1>that was more practical for that sort of use and

0:28:40.000 --> 0:28:43.000
<v Speaker 1>wouldn't require you to have a head mound display mounted

0:28:43.040 --> 0:28:46.440
<v Speaker 1>to the ceiling. And all of that took time, but

0:28:46.600 --> 0:28:50.440
<v Speaker 1>you could tell that the ideas underlying augmented and virtual

0:28:50.480 --> 0:28:56.920
<v Speaker 1>reality were already in place. There was a Boeing researcher

0:28:57.080 --> 0:29:02.680
<v Speaker 1>named Tom Coudell who coined the term augmented reality, and

0:29:02.800 --> 0:29:05.760
<v Speaker 1>he was specifically using it to talk about this approach

0:29:05.800 --> 0:29:08.640
<v Speaker 1>to overlaying digital information on top of our physical world

0:29:08.720 --> 0:29:12.800
<v Speaker 1>to enhance it in some way. Now Dr Cardell earned

0:29:12.840 --> 0:29:17.320
<v Speaker 1>a PhD in physics and astronomy from the University of Arizona,

0:29:17.440 --> 0:29:21.280
<v Speaker 1>and before contributing the term augmented reality to the public lexicon,

0:29:21.480 --> 0:29:25.200
<v Speaker 1>he did extensive work and artificial intelligence research and development.

0:29:25.520 --> 0:29:28.000
<v Speaker 1>He also became a professor in the fields of electrical

0:29:28.000 --> 0:29:32.200
<v Speaker 1>and computer engineering at the University of New Mexico. So

0:29:32.240 --> 0:29:34.920
<v Speaker 1>when he was working with Boeing, he used this phrase

0:29:34.960 --> 0:29:37.640
<v Speaker 1>to talk about specific system he was working on, an

0:29:37.680 --> 0:29:40.680
<v Speaker 1>augmented reality system, and the whole purpose of this was

0:29:40.760 --> 0:29:47.120
<v Speaker 1>to help people who were helping construct airplanes lay cables properly.

0:29:47.800 --> 0:29:52.120
<v Speaker 1>The whole idea was to use this system so that uh,

0:29:52.240 --> 0:29:55.200
<v Speaker 1>an electrician can see exactly where the cable needed to

0:29:55.280 --> 0:30:00.240
<v Speaker 1>go inside the partly constructed cabin of an aircraft aft,

0:30:01.080 --> 0:30:03.640
<v Speaker 1>and that way you could follow the directions that you

0:30:03.680 --> 0:30:07.920
<v Speaker 1>see through your display, lay the actual cable down where

0:30:07.960 --> 0:30:11.760
<v Speaker 1>the guide tells you to go, and then you would

0:30:11.800 --> 0:30:17.120
<v Speaker 1>have a properly wired airplane. Uh. And I'm sure, as

0:30:17.120 --> 0:30:22.680
<v Speaker 1>we're all aware, properly wired airplanes are good airplanes. Improperly

0:30:22.760 --> 0:30:25.640
<v Speaker 1>wired airplanes are not so good. So it was a

0:30:25.720 --> 0:30:30.120
<v Speaker 1>very important system to make this much more smooth and

0:30:30.280 --> 0:30:33.040
<v Speaker 1>fast and it meant that you didn't have to have

0:30:33.280 --> 0:30:38.640
<v Speaker 1>as as many experts to guide the process. You could

0:30:38.680 --> 0:30:41.840
<v Speaker 1>actually have someone come in who had never done this

0:30:41.920 --> 0:30:46.320
<v Speaker 1>before and just follow the directions through this augmented reality

0:30:46.680 --> 0:30:52.720
<v Speaker 1>set system and they could wire the airplane properly. So

0:30:53.000 --> 0:30:57.320
<v Speaker 1>really clever means of using augmented reality. Also, we would

0:30:57.360 --> 0:31:00.640
<v Speaker 1>end up seeing that same sort of philosophy used again

0:31:00.680 --> 0:31:04.680
<v Speaker 1>and again in the future in more sophisticated UH types

0:31:04.720 --> 0:31:07.440
<v Speaker 1>of technology, but it was the exact same approach, exact

0:31:07.480 --> 0:31:14.040
<v Speaker 1>same idea underlying it. In Lewis Rosenberg proposed a system

0:31:14.120 --> 0:31:16.520
<v Speaker 1>that the Air Force could use to allow someone to

0:31:16.560 --> 0:31:20.760
<v Speaker 1>control devices from a remote location, and that consisted of

0:31:20.800 --> 0:31:23.960
<v Speaker 1>a video camera which would provide the visual data to

0:31:23.960 --> 0:31:27.080
<v Speaker 1>the user through a head mounted display. They would wear

0:31:27.120 --> 0:31:29.479
<v Speaker 1>the display on their heads or they would look at

0:31:29.520 --> 0:31:33.000
<v Speaker 1>a screen, but typically they'd wear a display, and then

0:31:33.200 --> 0:31:36.880
<v Speaker 1>they would also wear an exoskeleton on their upper body

0:31:37.000 --> 0:31:42.200
<v Speaker 1>that would allow them to control some sort of robotic device,

0:31:42.280 --> 0:31:45.840
<v Speaker 1>typically robotic arms. And usually the way this would work

0:31:45.920 --> 0:31:48.080
<v Speaker 1>is that the display was designed in such a way

0:31:49.040 --> 0:31:52.120
<v Speaker 1>with the video camera so that the view that the

0:31:52.160 --> 0:31:55.160
<v Speaker 1>person had it made it look like the robot arms

0:31:55.640 --> 0:31:59.200
<v Speaker 1>were their actual arms, which required a little bit of

0:31:59.200 --> 0:32:02.920
<v Speaker 1>trickery on the part of Rosenberg. They had to fudge

0:32:03.240 --> 0:32:06.240
<v Speaker 1>the distances between the video camera and the robotic arms

0:32:06.480 --> 0:32:09.760
<v Speaker 1>to give this this sort of feeling that the robot

0:32:09.880 --> 0:32:13.320
<v Speaker 1>arms represented your actual arms. So you move your arms

0:32:13.320 --> 0:32:17.080
<v Speaker 1>inside the exoskeleton and the robot arms would move as

0:32:17.120 --> 0:32:19.680
<v Speaker 1>well at their remote location, so you it's kind of

0:32:19.720 --> 0:32:25.480
<v Speaker 1>like a really fancy remote control. Now imagine that the

0:32:25.560 --> 0:32:29.680
<v Speaker 1>robot arms are holding various tools. Uh. The suit would

0:32:29.680 --> 0:32:33.720
<v Speaker 1>also provide haptic feedback, that touch based feedback to let

0:32:33.720 --> 0:32:36.120
<v Speaker 1>a user know more about what is going on when

0:32:36.160 --> 0:32:40.040
<v Speaker 1>they're operating the arms. So if you were to have

0:32:40.360 --> 0:32:43.200
<v Speaker 1>do something that would make a robot arm encounter resistance,

0:32:44.080 --> 0:32:47.120
<v Speaker 1>then you would feel haptic feedback in the suit that

0:32:47.120 --> 0:32:50.280
<v Speaker 1>would indicate, oh, you're you're going beyond the parameters of

0:32:50.280 --> 0:32:53.280
<v Speaker 1>where this robot arm is capable of going. So you

0:32:53.400 --> 0:32:56.480
<v Speaker 1>learned very quickly where where you can operate within that

0:32:56.560 --> 0:33:00.600
<v Speaker 1>suit and make sure that you are not pushing it

0:33:00.680 --> 0:33:05.720
<v Speaker 1>beyond its limits. You could also uh end up using

0:33:05.760 --> 0:33:09.480
<v Speaker 1>these tools to do various things in this remote environment. Now,

0:33:09.600 --> 0:33:13.480
<v Speaker 1>Rosenberg called his system virtual fixtures, which meant that the

0:33:13.560 --> 0:33:15.959
<v Speaker 1>user would see these virtual overlays on top of a

0:33:15.960 --> 0:33:19.200
<v Speaker 1>real environment that they were looking at. So I'm going

0:33:19.240 --> 0:33:23.320
<v Speaker 1>to give a very basic example that to illustrate this,

0:33:23.360 --> 0:33:25.200
<v Speaker 1>because it's hard to imagine, it's hard to get it

0:33:25.280 --> 0:33:27.840
<v Speaker 1>across in words. But let's say you're looking through a

0:33:27.880 --> 0:33:31.520
<v Speaker 1>headmund display and in front of you is a board,

0:33:32.560 --> 0:33:37.160
<v Speaker 1>wooden board, and it's just a regular wooden board. There's

0:33:37.200 --> 0:33:39.800
<v Speaker 1>nothing painted on it or anything in the real world,

0:33:39.920 --> 0:33:43.080
<v Speaker 1>and it's in a room that's across the building from you.

0:33:43.080 --> 0:33:45.680
<v Speaker 1>You cannot see this with your own eyes. You can

0:33:45.680 --> 0:33:49.400
<v Speaker 1>only see it through the video camera. The virtual fixture

0:33:49.480 --> 0:33:53.640
<v Speaker 1>overlay might be a series of circles, and the circles

0:33:53.720 --> 0:33:55.960
<v Speaker 1>are things that you were meant to cut out of

0:33:56.000 --> 0:33:58.960
<v Speaker 1>the board using the robot arms and a tool that's

0:33:59.040 --> 0:34:02.200
<v Speaker 1>right there in side the physical environment, across the building

0:34:02.240 --> 0:34:05.680
<v Speaker 1>from you. So you follow the patterns that you see

0:34:05.720 --> 0:34:09.640
<v Speaker 1>in this virtual overlay and you complete the task. That's

0:34:09.680 --> 0:34:13.239
<v Speaker 1>a very simple example, and uh, this system was meant

0:34:13.320 --> 0:34:15.800
<v Speaker 1>to allow for that. That's what he would call the

0:34:15.880 --> 0:34:20.359
<v Speaker 1>virtual fixtures, these overlays that you would see that would

0:34:20.400 --> 0:34:23.600
<v Speaker 1>appear to be real, but actually we're not present in

0:34:23.640 --> 0:34:30.160
<v Speaker 1>the physical environment itself. Now, also in a group of

0:34:30.160 --> 0:34:33.480
<v Speaker 1>researchers at Columbia University, we're proposing a system that they

0:34:33.520 --> 0:34:38.600
<v Speaker 1>called the knowledge based Augmented Reality for Maintenance Assistance a

0:34:38.760 --> 0:34:44.080
<v Speaker 1>k a. Karma Cute. Their approach was pretty novel. They

0:34:44.120 --> 0:34:47.640
<v Speaker 1>pointed out that while augmented reality had tremendous potential, it

0:34:47.680 --> 0:34:50.800
<v Speaker 1>also had a really big barrier and that it takes

0:34:50.960 --> 0:34:55.160
<v Speaker 1>an enormous amount of time to design or animate and

0:34:55.280 --> 0:35:00.239
<v Speaker 1>implement these graphic overlays for a R applications. So say

0:35:00.280 --> 0:35:02.520
<v Speaker 1>you're in a room and you're looking at different objects,

0:35:02.600 --> 0:35:05.960
<v Speaker 1>and little labels are popping up for each object. If

0:35:05.960 --> 0:35:07.839
<v Speaker 1>you're having to do all that by hand, it takes

0:35:07.840 --> 0:35:10.920
<v Speaker 1>a huge amount of time. What they wanted to do

0:35:11.440 --> 0:35:16.440
<v Speaker 1>was create artificial intelligence systems, or at least techniques to

0:35:16.640 --> 0:35:22.560
<v Speaker 1>generate graphics automatically on the fly. So this would be

0:35:22.760 --> 0:35:26.239
<v Speaker 1>similar to using image recognition software, so that if you

0:35:26.320 --> 0:35:31.000
<v Speaker 1>look at a specific box, let's say, the image recognition

0:35:31.000 --> 0:35:33.000
<v Speaker 1>software might be able to map that box to a

0:35:33.000 --> 0:35:36.480
<v Speaker 1>specific product and thus give you an overlay of information

0:35:36.520 --> 0:35:39.200
<v Speaker 1>about the product that would be inside that box, and

0:35:39.200 --> 0:35:40.960
<v Speaker 1>it would be able to do all this automatically. It

0:35:40.960 --> 0:35:45.200
<v Speaker 1>would not require a human programmer to go through and

0:35:45.200 --> 0:35:48.279
<v Speaker 1>and look at every single product in every single type

0:35:48.280 --> 0:35:52.120
<v Speaker 1>of box and program all that out. That would be ridiculous,

0:35:52.120 --> 0:35:54.960
<v Speaker 1>It would take forever. So it was the work of

0:35:54.960 --> 0:35:58.120
<v Speaker 1>this group with Karma that really started the ball rolling

0:35:58.120 --> 0:36:01.920
<v Speaker 1>with this AI approach to automatically fill in that information

0:36:02.200 --> 0:36:08.160
<v Speaker 1>and make a are a more practical experience. Around the

0:36:08.280 --> 0:36:12.680
<v Speaker 1>same time, between ninety two and nine, the Laurel Western

0:36:12.760 --> 0:36:16.560
<v Speaker 1>Development Labs, which was a defense contractor, began to work

0:36:16.600 --> 0:36:18.880
<v Speaker 1>with the U. S Military to create a R systems

0:36:18.920 --> 0:36:22.400
<v Speaker 1>for military vehicles. And you can understand very quickly how

0:36:22.880 --> 0:36:27.319
<v Speaker 1>a R would have enormous potential for military applications. And

0:36:27.360 --> 0:36:30.040
<v Speaker 1>in fact, a R is very commonly used in lots

0:36:30.040 --> 0:36:33.760
<v Speaker 1>of different things like pilot helmets where it helps pilots

0:36:33.840 --> 0:36:38.840
<v Speaker 1>keep track of targets UH and identify a potential threats

0:36:38.880 --> 0:36:42.520
<v Speaker 1>that kind of thing. But in this case they were

0:36:42.560 --> 0:36:47.279
<v Speaker 1>really looking at creating a augmented reality system that would

0:36:47.280 --> 0:36:52.920
<v Speaker 1>create virtual opponents for people working in simulated wartime conditions.

0:36:53.400 --> 0:36:57.000
<v Speaker 1>So really a training program. Imagine that you're operating an

0:36:57.040 --> 0:37:01.200
<v Speaker 1>actual military vehicle like a tank, and you have a

0:37:01.400 --> 0:37:05.880
<v Speaker 1>view outside that is really an augmented reality system, so

0:37:05.920 --> 0:37:08.080
<v Speaker 1>you're actually looking at the real world around you. You

0:37:08.120 --> 0:37:10.680
<v Speaker 1>aren't just sitting in a simulator inside of building. You

0:37:10.719 --> 0:37:14.000
<v Speaker 1>are out there in the field controlling a real vehicle

0:37:14.120 --> 0:37:17.760
<v Speaker 1>moving around in real terrain. But you also see virtual

0:37:18.400 --> 0:37:22.879
<v Speaker 1>representations of enemies in that real terrain, and you can

0:37:22.960 --> 0:37:26.680
<v Speaker 1>practice maneuvers and firing on enemies that sort of thing,

0:37:26.960 --> 0:37:30.799
<v Speaker 1>probably not using live ammunition at that point, but having

0:37:30.800 --> 0:37:33.680
<v Speaker 1>a more realistic simulation in a real environment, so that

0:37:33.719 --> 0:37:40.440
<v Speaker 1>you're not just trying to create a totally virtual scenario. Anyway,

0:37:40.480 --> 0:37:44.600
<v Speaker 1>that work was done in the world, wouldn't really learn

0:37:44.680 --> 0:37:48.040
<v Speaker 1>about it at large until about nine because that's the

0:37:48.040 --> 0:37:50.520
<v Speaker 1>way the military works. They're not so eager to talk

0:37:50.560 --> 0:37:54.600
<v Speaker 1>about their stuff while are still doing it. Meanwhile, at

0:37:54.600 --> 0:37:57.839
<v Speaker 1>the same time, artists were continuing to explore the relationships

0:37:57.840 --> 0:38:00.600
<v Speaker 1>between physical performers and virtual LM. And so you remember

0:38:00.640 --> 0:38:04.280
<v Speaker 1>I talked about Dr Krueger earlier. While in nine different

0:38:04.360 --> 0:38:08.680
<v Speaker 1>artist Julie Martin would create a piece called Dancing in Cyberspace,

0:38:09.680 --> 0:38:13.200
<v Speaker 1>and in that piece, dancers on a physical space or

0:38:13.239 --> 0:38:16.799
<v Speaker 1>a physical stage. We're able to manipulate virtual objects, so

0:38:16.880 --> 0:38:19.720
<v Speaker 1>an audience would be able to see both the physical

0:38:19.719 --> 0:38:25.200
<v Speaker 1>performance by the dancers and the virtual reactions the things

0:38:25.200 --> 0:38:27.640
<v Speaker 1>that happened within the virtual environment as a result of

0:38:27.680 --> 0:38:34.160
<v Speaker 1>the dancers moving around their physical space. Pretty neat. Two researchers,

0:38:34.320 --> 0:38:39.839
<v Speaker 1>Recky Motto and Nagao created there the first real handheld

0:38:39.920 --> 0:38:42.520
<v Speaker 1>a R display, but it was a tethered display. It

0:38:42.600 --> 0:38:45.600
<v Speaker 1>wasn't free form. You couldn't just take it anywhere. It

0:38:45.680 --> 0:38:48.560
<v Speaker 1>was called Navy Cam, and you had to have a

0:38:48.719 --> 0:38:52.200
<v Speaker 1>tether like cable essentially connect the Navy Cam to a workstation.

0:38:52.960 --> 0:38:56.239
<v Speaker 1>But it had a forward facing camera and you can

0:38:56.360 --> 0:39:00.359
<v Speaker 1>use a video feed to go through this and held

0:39:00.360 --> 0:39:05.000
<v Speaker 1>device through the cable to the workstation, and it could

0:39:05.000 --> 0:39:08.680
<v Speaker 1>detect color coded markers in the camera image and display

0:39:08.680 --> 0:39:11.200
<v Speaker 1>information on a video see through view. So you can

0:39:11.239 --> 0:39:15.080
<v Speaker 1>get that augmented reality experience. Obviously very limited, you know,

0:39:15.160 --> 0:39:17.520
<v Speaker 1>you could not just carry this around with you everywhere

0:39:17.520 --> 0:39:22.160
<v Speaker 1>you go, but it showed the ideas behind augmented reality

0:39:22.200 --> 0:39:25.560
<v Speaker 1>could in fact be realized in a handheld format. Now

0:39:25.600 --> 0:39:29.399
<v Speaker 1>it's just a matter of getting those different components small

0:39:29.520 --> 0:39:35.360
<v Speaker 1>enough to all fit in a self contained mobile form factor. Now,

0:39:35.400 --> 0:39:38.480
<v Speaker 1>in the late nineties, we started seeing televised sporting events

0:39:38.640 --> 0:39:42.600
<v Speaker 1>featuring augmented reality elements, or at least you did. I

0:39:42.640 --> 0:39:46.400
<v Speaker 1>don't watch sports ball, that's not entirely true, but I

0:39:46.400 --> 0:39:49.680
<v Speaker 1>don't watch football or hockey, American football or hockey, and

0:39:49.840 --> 0:39:53.480
<v Speaker 1>both of those were the sports that really got them.

0:39:53.520 --> 0:39:57.800
<v Speaker 1>First off, I'm gonna backtrack. I used to watch hockey,

0:39:57.840 --> 0:40:04.640
<v Speaker 1>but then Winnipeg stole the Atlanta Thrash years from me. Winnipeg, Okay,

0:40:04.640 --> 0:40:07.560
<v Speaker 1>getting back to hockey. So hockey had the Fox track system,

0:40:07.680 --> 0:40:10.960
<v Speaker 1>which Fox put into hockey games so that you could

0:40:11.000 --> 0:40:13.560
<v Speaker 1>easily follow the puck. Instead of trying to watch this

0:40:13.719 --> 0:40:16.839
<v Speaker 1>little bitty black disc spinning around, You've got to watch

0:40:16.840 --> 0:40:23.440
<v Speaker 1>this very bright, highlighted neon colored disc that everyone hated.

0:40:24.320 --> 0:40:27.600
<v Speaker 1>And after about two seasons, Fox stop doing it, and

0:40:27.680 --> 0:40:31.880
<v Speaker 1>people were happy. Adila Thrashers moved away, and then it

0:40:31.920 --> 0:40:36.640
<v Speaker 1>was just miserable. American football would follow suit in the

0:40:36.719 --> 0:40:40.279
<v Speaker 1>late nineties and have the first down line introduced, where

0:40:40.280 --> 0:40:44.760
<v Speaker 1>they could on live video overlay the first down line.

0:40:45.280 --> 0:40:48.799
<v Speaker 1>Usually it's a bright yellow line that indicates how far

0:40:48.880 --> 0:40:51.400
<v Speaker 1>the offensive team needs to go. And by offensive, I

0:40:51.400 --> 0:40:53.560
<v Speaker 1>mean they're on the offensive. I don't mean they offend

0:40:53.640 --> 0:40:57.920
<v Speaker 1>my sensibilities. I'm not that against American football, but it

0:40:57.960 --> 0:40:59.480
<v Speaker 1>showed how far they would need to go in order

0:40:59.520 --> 0:41:02.640
<v Speaker 1>to estab a first down, which I am told is

0:41:02.680 --> 0:41:06.600
<v Speaker 1>something you want to do. Uh. That would start to

0:41:06.640 --> 0:41:10.839
<v Speaker 1>get employed in and over time we would see that

0:41:11.400 --> 0:41:15.680
<v Speaker 1>increase where eventually Skycam was able to even use this system.

0:41:15.719 --> 0:41:18.319
<v Speaker 1>At first it wasn't. You could get a Skycamp view,

0:41:18.400 --> 0:41:20.560
<v Speaker 1>but you couldn't do the overlay of the first intent

0:41:20.680 --> 0:41:25.080
<v Speaker 1>line until later. Well, I've got a lot more to

0:41:25.160 --> 0:41:29.520
<v Speaker 1>say about augmented reality, but before I do, let's take

0:41:29.520 --> 0:41:40.400
<v Speaker 1>another quick break to thank our sponsor. Okay, we're back.

0:41:40.640 --> 0:41:45.000
<v Speaker 1>Let's skip ahead to I guess it's not really skipping

0:41:45.040 --> 0:41:50.359
<v Speaker 1>I just talked about. Let's plot ahead to That's when

0:41:50.440 --> 0:41:56.439
<v Speaker 1>NASA's X spacecraft was using an a R system as

0:41:56.560 --> 0:42:00.440
<v Speaker 1>part of its navigational tools, so people back on Earth

0:42:00.600 --> 0:42:04.240
<v Speaker 1>could look at a view from the spacecraft a camera

0:42:04.440 --> 0:42:08.479
<v Speaker 1>mounted on the spacecraft, and on top of that view

0:42:08.680 --> 0:42:12.440
<v Speaker 1>they could overlay map data to help with navigation. And

0:42:12.480 --> 0:42:14.960
<v Speaker 1>all of that, of course was controlled back here on Earth.

0:42:15.680 --> 0:42:18.040
<v Speaker 1>But it was sort of an experiment to see how

0:42:18.080 --> 0:42:22.319
<v Speaker 1>augmented reality could be incorporated into space exploration missions. In

0:42:22.360 --> 0:42:27.520
<v Speaker 1>the future and make them more effective. Also, in the

0:42:27.640 --> 0:42:32.840
<v Speaker 1>Navy began work on the Battlefield Augmented Reality System or BARS,

0:42:33.239 --> 0:42:35.719
<v Speaker 1>which is a wearable a R system for soldiers. You've

0:42:35.719 --> 0:42:39.719
<v Speaker 1>probably seen various implementations of this over the years. Is

0:42:39.760 --> 0:42:44.239
<v Speaker 1>obviously evolved since nine It's one of those pieces of

0:42:44.280 --> 0:42:48.239
<v Speaker 1>technology that some soldiers took to, but a lot just

0:42:48.360 --> 0:42:53.960
<v Speaker 1>felt that it created unnecessary distractions. Technology and warfare is

0:42:54.160 --> 0:42:59.080
<v Speaker 1>very very difficult because there are sometimes where we think, oh,

0:42:59.280 --> 0:43:02.400
<v Speaker 1>more information and is always better, but in some cases

0:43:02.440 --> 0:43:06.480
<v Speaker 1>that doesn't seem to hold true. And for some people

0:43:06.520 --> 0:43:10.120
<v Speaker 1>with these head mounted displays or or really it's heads

0:43:10.200 --> 0:43:13.439
<v Speaker 1>up displays HUDs uh, that can sometimes be the case,

0:43:13.520 --> 0:43:19.400
<v Speaker 1>depends on the implementation. In two thousand, hiro Katsu Kato

0:43:19.560 --> 0:43:24.440
<v Speaker 1>created a software library called a R Toolkit. Very important

0:43:24.440 --> 0:43:28.360
<v Speaker 1>software library was also open source, so anyone could contribute

0:43:28.360 --> 0:43:32.360
<v Speaker 1>to it, modify it. Brent put out a new version

0:43:32.640 --> 0:43:35.680
<v Speaker 1>that sort of stuff, and it uses video tracking to

0:43:35.719 --> 0:43:39.719
<v Speaker 1>overlay computer graphics on a video camera feed and it's

0:43:39.719 --> 0:43:43.120
<v Speaker 1>still a component for a lot of a our experiences today.

0:43:43.200 --> 0:43:46.680
<v Speaker 1>Later on in the two thousand's this would be adapted

0:43:47.000 --> 0:43:50.480
<v Speaker 1>so that it could also be used in Web experiences,

0:43:50.560 --> 0:43:56.320
<v Speaker 1>not just native experiences to specific devices, and we continue

0:43:56.360 --> 0:44:00.719
<v Speaker 1>to see are built into new experiences, including smartphone and tablets.

0:44:00.719 --> 0:44:04.360
<v Speaker 1>By two thousand four, some researchers in Germany were creating

0:44:04.440 --> 0:44:07.400
<v Speaker 1>a r apps that could take advantage of a smartphones camera.

0:44:08.040 --> 0:44:11.760
<v Speaker 1>But two thousand four's pretty early for smartphones. It really

0:44:11.800 --> 0:44:15.320
<v Speaker 1>would would be a few years before this would truly

0:44:15.360 --> 0:44:18.520
<v Speaker 1>take off, because that's when Apple came out with the

0:44:18.560 --> 0:44:22.480
<v Speaker 1>iPhone in two thousand seven. That was the real revolution

0:44:22.800 --> 0:44:26.840
<v Speaker 1>in smartphone technology. There have been smartphones before the iPhone,

0:44:26.960 --> 0:44:29.480
<v Speaker 1>don't get me wrong, and many of them were really good,

0:44:30.120 --> 0:44:34.200
<v Speaker 1>but the iPhone was something that caught the public's attention

0:44:34.360 --> 0:44:38.279
<v Speaker 1>and made smartphones sexy. And because of that, there was

0:44:38.320 --> 0:44:41.879
<v Speaker 1>a ton of money poured into the smartphone industry as

0:44:41.920 --> 0:44:45.040
<v Speaker 1>well as not just Apple, but also to other companies,

0:44:45.080 --> 0:44:49.200
<v Speaker 1>like the companies that were offering Android smartphones. But I

0:44:49.239 --> 0:44:51.879
<v Speaker 1>think we can really thank Apple for all of that

0:44:52.360 --> 0:44:55.520
<v Speaker 1>happening in the first place, especially things like seeing that

0:44:55.560 --> 0:44:58.560
<v Speaker 1>accelerometer where you could switch from portrait to landscape mode.

0:44:58.560 --> 0:45:02.080
<v Speaker 1>I remember everyone freaking out about that when Steve Jobs

0:45:02.200 --> 0:45:05.200
<v Speaker 1>showed it off in two thousand seven at Macworld and

0:45:05.560 --> 0:45:07.839
<v Speaker 1>everyone thought, Wow, this is the this is amazing. Well,

0:45:07.880 --> 0:45:09.400
<v Speaker 1>we take it for granted now, but it was a

0:45:09.440 --> 0:45:14.359
<v Speaker 1>big deal then. So once that smartphone revolution happened, it

0:45:14.440 --> 0:45:19.640
<v Speaker 1>was a landslide victory for both augmented reality and virtual

0:45:19.680 --> 0:45:22.960
<v Speaker 1>reality research and development because it meant that so much

0:45:23.000 --> 0:45:28.800
<v Speaker 1>money was being poured into creating newer, thinner, more capable

0:45:28.840 --> 0:45:33.880
<v Speaker 1>smartphones that we saw an explosion in technological development that

0:45:33.960 --> 0:45:39.319
<v Speaker 1>could also be used for virtual and augmented reality experiences. So,

0:45:39.400 --> 0:45:43.200
<v Speaker 1>for example, I think of those sensors I talked about earlier,

0:45:43.239 --> 0:45:48.120
<v Speaker 1>accelerometers and gyroscopes, that sort of thing. Well, we saw

0:45:48.160 --> 0:45:50.319
<v Speaker 1>a lot of development in those spaces in order to

0:45:50.360 --> 0:45:53.279
<v Speaker 1>make smartphones better, and people who are working in a

0:45:53.480 --> 0:45:56.359
<v Speaker 1>R and VR experiences can take advantage of those same

0:45:56.400 --> 0:46:01.600
<v Speaker 1>sensors either creating apps specifically four smartphones. Thus you don't

0:46:01.600 --> 0:46:05.200
<v Speaker 1>have to build any other hardware, you just use existing hardware,

0:46:05.280 --> 0:46:08.359
<v Speaker 1>but that limits how you can use it, right because

0:46:08.360 --> 0:46:10.799
<v Speaker 1>you don't typically wear your smartphone directly in front of

0:46:10.800 --> 0:46:14.600
<v Speaker 1>your face. Or they could end up taking advantage of

0:46:14.640 --> 0:46:18.880
<v Speaker 1>those new, smaller sensors and incorporate them directly into brand

0:46:18.880 --> 0:46:22.400
<v Speaker 1>new hardware, like various types of wearables like Google Glass,

0:46:22.400 --> 0:46:24.200
<v Speaker 1>for example, but that would be a few more years.

0:46:25.800 --> 0:46:30.080
<v Speaker 1>In two thousand eleven, Nintendo launched the Nintendo three D S,

0:46:30.800 --> 0:46:33.120
<v Speaker 1>which included a camera. It was, you know, the three

0:46:33.200 --> 0:46:38.680
<v Speaker 1>D capable handheld device, and included actually a pair of

0:46:39.320 --> 0:46:42.880
<v Speaker 1>forward facing cameras, so you could take three D photos

0:46:42.920 --> 0:46:46.800
<v Speaker 1>if you wanted to, and it also had some a

0:46:47.040 --> 0:46:50.400
<v Speaker 1>R software included with it. You would get these special

0:46:50.560 --> 0:46:55.480
<v Speaker 1>Nintendo cards kind of like playing cards, and if you

0:46:55.520 --> 0:46:58.040
<v Speaker 1>were to point the camera of the three D S

0:46:58.520 --> 0:47:02.000
<v Speaker 1>at the card and look at the screen, you would

0:47:02.000 --> 0:47:05.520
<v Speaker 1>see a little virtual three dimensional character pop up on

0:47:05.600 --> 0:47:09.200
<v Speaker 1>the card. So Mario would be an obvious example. You

0:47:09.239 --> 0:47:11.919
<v Speaker 1>put the Mario card down on the table, you hold

0:47:12.000 --> 0:47:13.760
<v Speaker 1>up the three D S, and you aim the camera

0:47:13.800 --> 0:47:15.640
<v Speaker 1>at the card, and you look at the screen and

0:47:15.640 --> 0:47:18.799
<v Speaker 1>there's Mario, and Mario appears to be jumping around on

0:47:18.960 --> 0:47:23.319
<v Speaker 1>your physical table. Now, obviously, if you look off of

0:47:23.400 --> 0:47:27.279
<v Speaker 1>the display, there's no Mario jumping around, but on the

0:47:27.360 --> 0:47:30.440
<v Speaker 1>display there he is, and it was pretty cute. I

0:47:30.440 --> 0:47:34.480
<v Speaker 1>remember being really impressed with this very simple implementation of

0:47:34.520 --> 0:47:37.680
<v Speaker 1>a R when we got our three DS, and then

0:47:37.680 --> 0:47:41.000
<v Speaker 1>I took our three DS apart and then I took

0:47:41.000 --> 0:47:44.880
<v Speaker 1>pictures of it, and then I posted on Twitter and

0:47:44.960 --> 0:47:49.640
<v Speaker 1>people got sad, it's a great day. In two thousand thirteen,

0:47:49.680 --> 0:47:53.719
<v Speaker 1>Google introduced Google Glass. That was the wearable that included

0:47:53.760 --> 0:47:57.160
<v Speaker 1>a small display position just above the right eye. Uh

0:47:57.320 --> 0:48:00.719
<v Speaker 1>so when you look straightforward, you could tell that there

0:48:00.760 --> 0:48:05.400
<v Speaker 1>was something kind of above your natural eyeline. But it

0:48:05.440 --> 0:48:08.560
<v Speaker 1>didn't get in the way too much. You too to

0:48:08.600 --> 0:48:10.640
<v Speaker 1>look at the screen. You actually had a glimpse, you know,

0:48:10.760 --> 0:48:13.400
<v Speaker 1>the glance upward, and then you could see what was

0:48:13.400 --> 0:48:19.040
<v Speaker 1>on the display. Google Glass had augmented reality features like crazy.

0:48:19.160 --> 0:48:24.200
<v Speaker 1>You could see video calls. You could actually use the

0:48:25.000 --> 0:48:28.200
<v Speaker 1>glasses to not just to take a video call, but

0:48:28.280 --> 0:48:31.520
<v Speaker 1>show the other person what you are looking at so

0:48:31.719 --> 0:48:34.520
<v Speaker 1>they could see from your point of view. You could

0:48:34.560 --> 0:48:37.759
<v Speaker 1>also overlay directions, so if you're walking down street, you

0:48:37.760 --> 0:48:39.960
<v Speaker 1>could glance up at the screen and it would tell

0:48:39.960 --> 0:48:42.000
<v Speaker 1>you if you need to keep going straight or turn laughter,

0:48:42.080 --> 0:48:44.440
<v Speaker 1>turn right, that kind of thing. It was really useful.

0:48:44.840 --> 0:48:47.560
<v Speaker 1>Um I had a pair of these Google Glass and

0:48:47.760 --> 0:48:51.200
<v Speaker 1>I really liked the direction they were going in. I

0:48:51.239 --> 0:48:54.480
<v Speaker 1>felt that it wasn't a fully realized product at the time,

0:48:54.520 --> 0:48:57.719
<v Speaker 1>and eventually Google agreed and after a couple of years,

0:48:57.719 --> 0:49:00.520
<v Speaker 1>they took Google Glass off the market and early and

0:49:00.560 --> 0:49:04.000
<v Speaker 1>now you can't get them anymore. Uh. They were clever,

0:49:04.320 --> 0:49:09.080
<v Speaker 1>but they were expensive, and they had some limitations. And

0:49:09.520 --> 0:49:11.640
<v Speaker 1>like I was saying earlier, you know, it's hard to

0:49:11.640 --> 0:49:14.719
<v Speaker 1>build all the components you need into one headset. So

0:49:14.800 --> 0:49:18.960
<v Speaker 1>Google Glass would communicate via Bluetooth to your smartphone, and

0:49:19.000 --> 0:49:21.640
<v Speaker 1>your smartphone would act as the actual nexus point to

0:49:21.680 --> 0:49:25.680
<v Speaker 1>the Internet. But it was a neat idea, uh, and

0:49:25.880 --> 0:49:28.560
<v Speaker 1>I enjoyed getting to use them while I did, so

0:49:29.120 --> 0:49:32.799
<v Speaker 1>I keep hoping to see a return of that kind

0:49:32.800 --> 0:49:36.680
<v Speaker 1>of technology, but perhaps in a more mature and less

0:49:36.680 --> 0:49:42.799
<v Speaker 1>expensive format. Now we've also seen applications similar to the

0:49:42.840 --> 0:49:45.319
<v Speaker 1>ones we mentioned earlier, the ones that are meant to

0:49:45.360 --> 0:49:49.759
<v Speaker 1>guide people into laying out or repairing a system. We've

0:49:49.800 --> 0:49:52.600
<v Speaker 1>seen that in the car world. Not too long ago,

0:49:53.040 --> 0:49:57.120
<v Speaker 1>there was the MARTA system introduced by Volkswagen. MARTA makes

0:49:57.120 --> 0:50:00.000
<v Speaker 1>me chuckle because that's also the name of Atlanta's public

0:50:00.040 --> 0:50:03.160
<v Speaker 1>transportation system, but in this case, it stands for Mobile

0:50:03.160 --> 0:50:08.400
<v Speaker 1>Augmented Reality Technical Assistance and it's specifically designed for mechanics

0:50:08.440 --> 0:50:12.080
<v Speaker 1>who are working on the XL one vehicle. So if

0:50:12.120 --> 0:50:15.160
<v Speaker 1>you hold up an iPad that has this app on it,

0:50:15.719 --> 0:50:18.359
<v Speaker 1>and the camera has pointed at an XCEL one and

0:50:18.400 --> 0:50:21.480
<v Speaker 1>you look at the display, you'll see information overlaid on

0:50:21.560 --> 0:50:25.759
<v Speaker 1>top of the car, including labels for all the different parts.

0:50:25.840 --> 0:50:28.080
<v Speaker 1>So let's say you're a mechanic and you have to

0:50:28.120 --> 0:50:32.239
<v Speaker 1>do a specific repair on this vehicle. You hold up

0:50:32.280 --> 0:50:35.279
<v Speaker 1>the iPad, you look through the display, and you see

0:50:35.280 --> 0:50:37.040
<v Speaker 1>exactly what you need to do. It gives you a

0:50:37.080 --> 0:50:39.960
<v Speaker 1>set of instructions that shows you how you need to do.

0:50:40.000 --> 0:50:42.319
<v Speaker 1>It tells you where you need to stand based upon

0:50:42.440 --> 0:50:45.160
<v Speaker 1>the angle of the view. So if you hold it

0:50:45.239 --> 0:50:46.760
<v Speaker 1>up and it says no, you need to move about

0:50:46.800 --> 0:50:48.600
<v Speaker 1>a foot to the right, you can do that. Then

0:50:48.640 --> 0:50:50.520
<v Speaker 1>hold up the iPad again. I'll say, all right, you're

0:50:50.560 --> 0:50:53.719
<v Speaker 1>in the right spot. Make sure you loosen this particular

0:50:53.760 --> 0:50:56.759
<v Speaker 1>bolt first. That kind of thing, and it's meant to

0:50:56.880 --> 0:51:01.360
<v Speaker 1>be an interactive maintenance guy in a way, maintenance and

0:51:01.400 --> 0:51:05.760
<v Speaker 1>repair guide. This is one of those applications of augmented reality.

0:51:05.800 --> 0:51:09.600
<v Speaker 1>I think is a no brainer to me. It's a

0:51:09.680 --> 0:51:15.400
<v Speaker 1>killer app The idea of having an ability to work

0:51:15.480 --> 0:51:20.240
<v Speaker 1>with something you are not familiar with, but you're able

0:51:20.280 --> 0:51:23.400
<v Speaker 1>to leverage the expertise of people who either designed it

0:51:23.520 --> 0:51:26.239
<v Speaker 1>or built it, or just fully understand it, and get

0:51:26.320 --> 0:51:31.279
<v Speaker 1>guidance based on their expertise in real time, so you're

0:51:31.320 --> 0:51:36.000
<v Speaker 1>not having to go and consult a an article about

0:51:36.000 --> 0:51:39.600
<v Speaker 1>it or watch a YouTube video. You get step by

0:51:39.600 --> 0:51:44.680
<v Speaker 1>step instructions overlaid on top of your view of that thing.

0:51:45.800 --> 0:51:49.160
<v Speaker 1>To me, that's the most compelling use of augmented reality

0:51:49.400 --> 0:51:51.279
<v Speaker 1>from a practical standpoint. There are a lot of other

0:51:51.360 --> 0:51:53.520
<v Speaker 1>uses that I'll talk about towards the end that I

0:51:53.520 --> 0:51:56.080
<v Speaker 1>think are also really super cool. So don't get me wrong,

0:51:56.120 --> 0:51:59.920
<v Speaker 1>It's not the only one. But let's move on to

0:52:00.040 --> 0:52:04.080
<v Speaker 1>thousand fifteen. That was when Microsoft would unveil the hollow lens,

0:52:04.080 --> 0:52:06.319
<v Speaker 1>something I still want to try out. I have not

0:52:06.440 --> 0:52:09.120
<v Speaker 1>had a chance to try a hollow lens yet. That

0:52:09.239 --> 0:52:12.520
<v Speaker 1>is a headset cable of advanced a R applications everything

0:52:12.600 --> 0:52:15.160
<v Speaker 1>from what I was just talking about, giving you guidance,

0:52:15.200 --> 0:52:18.120
<v Speaker 1>step by step instructions on how to do like a

0:52:18.160 --> 0:52:21.279
<v Speaker 1>repair job on say an electrical outlet. You can even

0:52:21.440 --> 0:52:26.520
<v Speaker 1>use a Skype system to call an expert who can

0:52:26.560 --> 0:52:31.400
<v Speaker 1>then view your point of view and interact with that

0:52:31.480 --> 0:52:34.280
<v Speaker 1>point of view. So Let's say I'm looking at the outlet.

0:52:35.080 --> 0:52:38.880
<v Speaker 1>The expert electrician i'm talking to can see what I see,

0:52:39.480 --> 0:52:44.000
<v Speaker 1>and he or she can also make notes on the

0:52:44.120 --> 0:52:47.719
<v Speaker 1>display which shows up in my field of view. So

0:52:48.000 --> 0:52:50.959
<v Speaker 1>he or she might circle a specific wire and say

0:52:50.960 --> 0:52:52.960
<v Speaker 1>you need to you need to remove that one first,

0:52:53.360 --> 0:52:55.400
<v Speaker 1>and I know I need to do that one first

0:52:55.440 --> 0:52:57.600
<v Speaker 1>because I can see which one they are talking about.

0:52:57.680 --> 0:53:00.600
<v Speaker 1>Or they might circle another wire and say, no matter

0:53:00.719 --> 0:53:05.200
<v Speaker 1>why you do, don't cut this wire where the toilet

0:53:05.280 --> 0:53:11.080
<v Speaker 1>upstairs will explode like lethal weapon too, and I won't

0:53:11.080 --> 0:53:14.399
<v Speaker 1>do that because you know that guys like three days

0:53:14.440 --> 0:53:18.920
<v Speaker 1>from retirement. So I have a heart. But no, this

0:53:18.960 --> 0:53:21.839
<v Speaker 1>is this is a really neat idea, having this interactive

0:53:21.880 --> 0:53:27.000
<v Speaker 1>ability to overlay the information from the world, the digital world,

0:53:27.000 --> 0:53:30.080
<v Speaker 1>onto your physical world and beyond that. The hollowlens has

0:53:30.120 --> 0:53:33.640
<v Speaker 1>lots of other functions. It's not just something to do,

0:53:33.840 --> 0:53:36.919
<v Speaker 1>you know, home repairs around the house. You can also

0:53:37.040 --> 0:53:40.480
<v Speaker 1>use it for entertainment purposes, like you could create a

0:53:40.760 --> 0:53:45.279
<v Speaker 1>screen that can show you video from various sources and

0:53:45.320 --> 0:53:49.080
<v Speaker 1>you can assign it a place on a wall in

0:53:49.160 --> 0:53:52.880
<v Speaker 1>your environment. Let's say that you're in your living room

0:53:53.000 --> 0:53:56.280
<v Speaker 1>and you just create a screen so you can watch Netflix,

0:53:56.320 --> 0:53:59.040
<v Speaker 1>and you slap it on a wall and it will

0:53:59.120 --> 0:54:02.719
<v Speaker 1>stay in that same position relative to your point of view.

0:54:03.080 --> 0:54:04.840
<v Speaker 1>So if you look to the left or right, the

0:54:04.920 --> 0:54:08.080
<v Speaker 1>screen stays where you put it as if it were

0:54:08.120 --> 0:54:10.719
<v Speaker 1>physically there on your wall. But keep in mind it's

0:54:10.760 --> 0:54:13.759
<v Speaker 1>just a virtual screen, and when you look back to

0:54:13.880 --> 0:54:16.400
<v Speaker 1>that part of your wall, you'll see the virtual screen

0:54:16.440 --> 0:54:18.760
<v Speaker 1>there playing whatever it was that you wanted to watch.

0:54:20.080 --> 0:54:22.319
<v Speaker 1>I think that's a super cool idea. And they've also

0:54:22.360 --> 0:54:26.359
<v Speaker 1>shown off games like a game of Minecraft that uses

0:54:26.400 --> 0:54:29.360
<v Speaker 1>hollow lens so you can actually view a Minecraft world

0:54:30.120 --> 0:54:33.239
<v Speaker 1>sitting appearing to sit at any rate on top of

0:54:33.239 --> 0:54:36.520
<v Speaker 1>a table, so you can walk around the table and

0:54:36.640 --> 0:54:40.200
<v Speaker 1>view the Minecraft world from various angles and play that way.

0:54:41.000 --> 0:54:44.080
<v Speaker 1>I think that's super neat. Don't know how compelling it is,

0:54:44.120 --> 0:54:47.040
<v Speaker 1>because again I haven't tried it myself, but I really

0:54:47.080 --> 0:54:52.080
<v Speaker 1>like the idea. This year two thousand sixteen, are got

0:54:52.120 --> 0:54:55.160
<v Speaker 1>another big boost from a little game called Pokemon Go.

0:54:56.280 --> 0:54:59.480
<v Speaker 1>Although I have to admit this was a really primitive,

0:54:59.640 --> 0:55:03.200
<v Speaker 1>base sick implementation of augmented reality. Really, it was not

0:55:03.440 --> 0:55:06.000
<v Speaker 1>much more than just a In fact, it was nothing

0:55:06.080 --> 0:55:09.600
<v Speaker 1>more than just an animated overlay that would exist on

0:55:09.719 --> 0:55:13.240
<v Speaker 1>top of the camera view of your of your device.

0:55:14.120 --> 0:55:17.000
<v Speaker 1>So I'd say, I'm holding up my smartphone and I'm

0:55:17.080 --> 0:55:20.000
<v Speaker 1>trying to catch a Jiggli Puff, and the Jiggli Puff

0:55:20.080 --> 0:55:22.920
<v Speaker 1>is currently bouncing up and down on the sidewalk in

0:55:23.000 --> 0:55:25.520
<v Speaker 1>front of me. That's about as far as the augmented

0:55:25.600 --> 0:55:29.880
<v Speaker 1>reality actual experience would go. So very primitive. But because

0:55:29.960 --> 0:55:35.360
<v Speaker 1>Pokemon Go became so popular so quickly, it really pushed

0:55:35.520 --> 0:55:38.320
<v Speaker 1>the concept of a R to the front of the

0:55:38.440 --> 0:55:42.759
<v Speaker 1>minds of people everywhere, including business owners who immediately said,

0:55:42.840 --> 0:55:46.880
<v Speaker 1>we need an augmented reality app. Whether they actually needed

0:55:46.920 --> 0:55:49.480
<v Speaker 1>one or not is beside the point. A lot of

0:55:49.520 --> 0:55:52.920
<v Speaker 1>people got into a R because of Pokemon Go. Uh,

0:55:53.239 --> 0:55:55.759
<v Speaker 1>for both good and bad. I always think that you

0:55:55.880 --> 0:55:58.000
<v Speaker 1>have to come up with the experience first. You have

0:55:58.160 --> 0:56:03.160
<v Speaker 1>to understand why you need to use a specific strategy

0:56:03.719 --> 0:56:08.719
<v Speaker 1>to create a specific experience and then build it. Not Hey,

0:56:09.200 --> 0:56:13.759
<v Speaker 1>we need augmented reality makes something that's a R. To me,

0:56:13.880 --> 0:56:16.880
<v Speaker 1>that's the backwards way of going about it. But what

0:56:16.960 --> 0:56:19.960
<v Speaker 1>do I know. I'm not not a programmer, so I'm

0:56:19.960 --> 0:56:22.840
<v Speaker 1>sure the programmers feel in a similar way to me.

0:56:23.280 --> 0:56:26.800
<v Speaker 1>But that's just a guess. Now. The future of a

0:56:26.960 --> 0:56:30.120
<v Speaker 1>R depends heavily upon the applications we see, in which

0:56:30.200 --> 0:56:33.680
<v Speaker 1>ones end up being successful, in which ones aren't. Right now,

0:56:33.880 --> 0:56:36.239
<v Speaker 1>I would say that the best bet is to see

0:56:36.320 --> 0:56:41.040
<v Speaker 1>more a R features built into smartphones and tablets. Uh not,

0:56:41.160 --> 0:56:44.920
<v Speaker 1>maybe not necessarily built into them, but have apps available

0:56:45.000 --> 0:56:48.960
<v Speaker 1>that create a OUR experiences for very specific contexts, Like

0:56:49.120 --> 0:56:52.239
<v Speaker 1>let's say it's a museum app. You might download a

0:56:52.320 --> 0:56:54.360
<v Speaker 1>museum app on your phone, and when you go to

0:56:54.440 --> 0:56:57.120
<v Speaker 1>the museum and you use your phone, you can get

0:56:57.160 --> 0:57:00.839
<v Speaker 1>more information about the paintings and sculptures and or installations

0:57:00.920 --> 0:57:03.279
<v Speaker 1>that you see in the museum. That's an easy one

0:57:03.360 --> 0:57:06.080
<v Speaker 1>to understand. But that same app isn't going to be

0:57:06.200 --> 0:57:08.680
<v Speaker 1>useful once you leave the museum, you no longer have

0:57:09.000 --> 0:57:12.680
<v Speaker 1>the context that it is tied to. I think that

0:57:12.840 --> 0:57:16.040
<v Speaker 1>smartphones are probably going to be where the greatest development

0:57:16.160 --> 0:57:18.960
<v Speaker 1>is going to be in the near term, because wearables

0:57:19.040 --> 0:57:22.360
<v Speaker 1>is still really hard to do. We still don't have

0:57:22.680 --> 0:57:25.520
<v Speaker 1>a consumer version of the hollow lens out available for

0:57:25.560 --> 0:57:28.920
<v Speaker 1>anyone to purchase, and it may never come out as

0:57:28.960 --> 0:57:31.720
<v Speaker 1>a consumer product. Microsoft hasn't shown a whole lot of

0:57:31.800 --> 0:57:35.160
<v Speaker 1>interest in making it a consumer product. Maybe that will change,

0:57:35.680 --> 0:57:38.160
<v Speaker 1>but at the moment, I wouldn't hold my breath, so

0:57:39.040 --> 0:57:42.240
<v Speaker 1>I would argue smartphones and tablets are pretty much where

0:57:42.240 --> 0:57:47.200
<v Speaker 1>it's at. Maybe some implementation with some existing VR headsets

0:57:47.240 --> 0:57:49.880
<v Speaker 1>which have external cameras mounted on them as well, like

0:57:50.000 --> 0:57:53.480
<v Speaker 1>forward facing cameras, you could build a our experiences there.

0:57:53.680 --> 0:57:55.880
<v Speaker 1>Then it gets a little weird because you're you're also

0:57:56.040 --> 0:57:58.520
<v Speaker 1>you know, you're looking at a monitor, so you're looking

0:57:58.560 --> 0:58:01.080
<v Speaker 1>at a video feed of your roundings, and on top

0:58:01.120 --> 0:58:03.240
<v Speaker 1>of the video feed you get the overlay. Same thing

0:58:03.360 --> 0:58:05.320
<v Speaker 1>is true for your smartphones and tablets, by the way,

0:58:06.040 --> 0:58:09.360
<v Speaker 1>but different that from the Google Glass implementation, where you're

0:58:09.440 --> 0:58:12.520
<v Speaker 1>looking at the actual physical world, not a video representation

0:58:12.600 --> 0:58:15.640
<v Speaker 1>of it, but the real world. And then because the

0:58:15.760 --> 0:58:19.200
<v Speaker 1>display itself that you are looking through is transparent, you're

0:58:19.240 --> 0:58:23.000
<v Speaker 1>looking at a transparent overlay of digital information that gives

0:58:23.080 --> 0:58:27.840
<v Speaker 1>you more info about the world you are in. I

0:58:27.960 --> 0:58:30.439
<v Speaker 1>think a R is super cool. I think it's really

0:58:30.520 --> 0:58:33.880
<v Speaker 1>got a lot of potential to change the world around

0:58:34.000 --> 0:58:35.800
<v Speaker 1>us and to change the way we interact with the

0:58:35.800 --> 0:58:41.160
<v Speaker 1>world around us. You could imagine a dystopian future implementation

0:58:41.240 --> 0:58:44.040
<v Speaker 1>of a r where we all have to wear glasses

0:58:44.120 --> 0:58:48.320
<v Speaker 1>and we're constantly getting personalized commercials beamed at us whenever

0:58:48.400 --> 0:58:51.480
<v Speaker 1>we look at anything. Like imagine walking past a store

0:58:52.080 --> 0:58:54.320
<v Speaker 1>casually looking in the window and then getting a whole

0:58:54.360 --> 0:58:56.200
<v Speaker 1>bunch of ads for all the stuff that's in the

0:58:56.280 --> 0:58:59.160
<v Speaker 1>store window. That would be obnoxious, and it's easy to

0:58:59.280 --> 0:59:03.280
<v Speaker 1>understand how people would not want that, yet also easy

0:59:03.360 --> 0:59:06.240
<v Speaker 1>to understand how that could possibly become a future or

0:59:06.840 --> 0:59:10.080
<v Speaker 1>think of the future where your privacy is no longer

0:59:10.360 --> 0:59:13.640
<v Speaker 1>even relevant, and you walk down the street and you

0:59:13.720 --> 0:59:17.120
<v Speaker 1>look at all the people's faces who are also walking

0:59:17.160 --> 0:59:20.520
<v Speaker 1>down the street, and you're getting names of everybody and

0:59:20.680 --> 0:59:22.760
<v Speaker 1>what they like and what they dislike. You know, what

0:59:22.960 --> 0:59:25.560
<v Speaker 1>music they tend to listen to, maybe what they're listening

0:59:25.600 --> 0:59:28.760
<v Speaker 1>to right now, And it's all because we've got facial

0:59:28.840 --> 0:59:33.840
<v Speaker 1>recognition technology. Almost everyone has some sort of social media presence,

0:59:34.400 --> 0:59:38.040
<v Speaker 1>so you can map that face to any public profiles,

0:59:38.120 --> 0:59:40.200
<v Speaker 1>try and find a match. If you found a match,

0:59:40.280 --> 0:59:43.120
<v Speaker 1>you could bring back information to the person wearing the glasses,

0:59:43.640 --> 0:59:46.560
<v Speaker 1>so I can look at somebody and say, oh, this, uh,

0:59:46.960 --> 0:59:50.600
<v Speaker 1>this cute kid over here, she's got she likes punk

0:59:50.720 --> 0:59:54.040
<v Speaker 1>rock music. I'm gonna I'm gonna impress her with my

0:59:54.880 --> 1:00:00.800
<v Speaker 1>knowledge of the cramps. That probably wouldn't work. But the

1:00:00.880 --> 1:00:03.360
<v Speaker 1>point being, this is pretty creepy and invasive, and so

1:00:03.520 --> 1:00:06.040
<v Speaker 1>there are some negative implementations of a ARE that we

1:00:06.120 --> 1:00:08.800
<v Speaker 1>have to watch out for. Unless we get to a

1:00:08.840 --> 1:00:11.280
<v Speaker 1>point where we just don't care about privacy at all anymore.

1:00:11.760 --> 1:00:14.240
<v Speaker 1>Some would argue we're already there, and in that case,

1:00:14.520 --> 1:00:16.840
<v Speaker 1>this implementation of a R may not sound creepy at all.

1:00:16.880 --> 1:00:18.680
<v Speaker 1>It might just sound kind of cool, kind of the

1:00:18.720 --> 1:00:21.320
<v Speaker 1>equivalent of walking into a store, seeing a person with

1:00:21.440 --> 1:00:24.080
<v Speaker 1>a name tag and addressing them by name. If they

1:00:24.080 --> 1:00:26.040
<v Speaker 1>don't remember they have a name tag on, they have

1:00:26.160 --> 1:00:28.600
<v Speaker 1>this moment where they think, do I know you? But

1:00:28.640 --> 1:00:30.840
<v Speaker 1>if we're in a world where everyone can see everyone's

1:00:30.920 --> 1:00:33.240
<v Speaker 1>name all the time, then well, for one thing, I

1:00:33.280 --> 1:00:35.520
<v Speaker 1>won't ever have to worry about coming at a loss

1:00:35.520 --> 1:00:37.440
<v Speaker 1>when I have to introduce my wife to someone. So

1:00:37.680 --> 1:00:40.320
<v Speaker 1>that's a that's a plus side. And I hope you

1:00:40.400 --> 1:00:44.080
<v Speaker 1>guys enjoyed that classic episode of text stuff. If you

1:00:44.160 --> 1:00:47.120
<v Speaker 1>have any comments or questions, reach out on social media.

1:00:47.280 --> 1:00:49.960
<v Speaker 1>We are available at text Stuff h s W both

1:00:50.080 --> 1:00:52.680
<v Speaker 1>on Twitter and on Facebook. You can also go to

1:00:52.760 --> 1:00:56.560
<v Speaker 1>our podcast page that's tech Stuff podcast dot com. You

1:00:56.640 --> 1:00:59.480
<v Speaker 1>can find a link to every episode we've ever recorded. There.

1:00:59.720 --> 1:01:02.439
<v Speaker 1>You'll also find a link to our online store, where

1:01:02.440 --> 1:01:05.400
<v Speaker 1>there's lots of tech Stuff merchandise, so you can go

1:01:05.480 --> 1:01:09.280
<v Speaker 1>ahead and spend all that hard earned cash on really

1:01:09.320 --> 1:01:12.000
<v Speaker 1>cool tech stuff products. I mean, seriously, we do have

1:01:12.080 --> 1:01:13.560
<v Speaker 1>some really neat ones. If you haven't checked it out,

1:01:13.640 --> 1:01:15.080
<v Speaker 1>go at least take a look at the designs. I

1:01:15.120 --> 1:01:17.800
<v Speaker 1>think you'll find them amusing, and who knows, maybe you'll

1:01:17.800 --> 1:01:21.360
<v Speaker 1>find a new favorite tote bag or t shirt. Then remember,

1:01:21.400 --> 1:01:23.000
<v Speaker 1>every purchase you make goes to help the show, and

1:01:23.040 --> 1:01:25.160
<v Speaker 1>we greatly appreciate it, and I will talk to you

1:01:25.280 --> 1:01:33.040
<v Speaker 1>again really soon. Tex Stuff is a production of I

1:01:33.160 --> 1:01:36.120
<v Speaker 1>Heart Radio's How Stuff Works. For more podcasts from my

1:01:36.240 --> 1:01:39.800
<v Speaker 1>heart Radio, visit the i heart Radio app, Apple Podcasts,

1:01:39.960 --> 1:01:41.920
<v Speaker 1>or wherever you listen to your favorite shows.