WEBVTT - Zoom and Enhance: The Sound Edition

0:00:00.160 --> 0:00:07.160
<v Speaker 1>Brought to you by Toyota. Let's go places. Welcome to

0:00:07.360 --> 0:00:14.920
<v Speaker 1>Forward Thinking. Pay there and welcome to Forward Thinking, the

0:00:15.000 --> 0:00:17.680
<v Speaker 1>podcast that looks at the future and says, but my

0:00:17.800 --> 0:00:21.239
<v Speaker 1>words like silent raindrops fell and echoed in the wells

0:00:21.360 --> 0:00:27.960
<v Speaker 1>of silence. I'm Jonathan Strickland and I'm Joe McCormick. And today, Yes, Joe,

0:00:28.800 --> 0:00:32.680
<v Speaker 1>we're going to be zooming in and enhancing the topic

0:00:32.720 --> 0:00:35.440
<v Speaker 1>we've talked about before. Yeah. We we talked about zoom

0:00:35.440 --> 0:00:39.120
<v Speaker 1>and Enhanced a while ago, right, we did one way

0:00:39.159 --> 0:00:43.440
<v Speaker 1>back in Uh, you know, I haven't gone back and

0:00:43.479 --> 0:00:46.600
<v Speaker 1>listened to any of our episodes from in a while,

0:00:46.680 --> 0:00:50.800
<v Speaker 1>but I don't know. I feel like I'd be embarrassed

0:00:50.800 --> 0:00:54.440
<v Speaker 1>to hear myself. Then we have learned a lot about podcasting.

0:00:55.480 --> 0:01:00.240
<v Speaker 1>I've forgotten more than I've ever learned. Wait what No, anyway, So, yeah,

0:01:00.280 --> 0:01:02.960
<v Speaker 1>we we recorded one about zoom and Enhanced with image

0:01:03.600 --> 0:01:06.480
<v Speaker 1>imagery back in August two thousand teen. We're recording this

0:01:06.520 --> 0:01:08.680
<v Speaker 1>at the beginning of September two thousand fifteen, so it's

0:01:08.680 --> 0:01:11.639
<v Speaker 1>been more than two years. So clearly, first of all,

0:01:12.400 --> 0:01:15.040
<v Speaker 1>we finally solve that zoom and enhanced for video and

0:01:15.400 --> 0:01:18.280
<v Speaker 1>pictures right, That's that's done. Now you can take like

0:01:18.319 --> 0:01:22.680
<v Speaker 1>a highly pixelated JPEG and turn it into a full

0:01:22.720 --> 0:01:25.360
<v Speaker 1>motion three D video and it doesn't matter if that

0:01:25.480 --> 0:01:29.280
<v Speaker 1>image was taken seventy years ago. You can still do it,

0:01:29.720 --> 0:01:33.320
<v Speaker 1>you know. I came across just recently. Uh, my wife

0:01:33.440 --> 0:01:36.480
<v Speaker 1>Rachel and I have been going back through The X

0:01:36.520 --> 0:01:41.360
<v Speaker 1>Files because it's up on Netflix, and and and uh Christian,

0:01:41.440 --> 0:01:43.720
<v Speaker 1>our our colleague Christian who's also on Stuff to Blow

0:01:43.760 --> 0:01:45.640
<v Speaker 1>your mind with me and Lauren, there have been like,

0:01:45.680 --> 0:01:48.280
<v Speaker 1>you've got to watch the X Files, so we have,

0:01:48.760 --> 0:01:52.920
<v Speaker 1>and we came across an episode that has probably the

0:01:52.920 --> 0:01:56.680
<v Speaker 1>most egregious and ridiculous case of zoom and enhanced I

0:01:56.760 --> 0:02:00.680
<v Speaker 1>have ever seen in any show. It's an episode where

0:02:00.720 --> 0:02:05.400
<v Speaker 1>there is a character who does psychic photography, so he

0:02:05.520 --> 0:02:08.079
<v Speaker 1>like gets near a camera and whatever is going on

0:02:08.160 --> 0:02:11.200
<v Speaker 1>in his head ends up on the film, and it's

0:02:11.280 --> 0:02:17.560
<v Speaker 1>this polaroid of somebody's vision of some hellish nightmare escape

0:02:17.600 --> 0:02:21.480
<v Speaker 1>with these ghosts called howlers screaming around a woman's face,

0:02:21.800 --> 0:02:25.079
<v Speaker 1>and then in the background there's this little blur and

0:02:25.080 --> 0:02:27.520
<v Speaker 1>and Moulder goes to the lab where they zoom and

0:02:27.639 --> 0:02:30.280
<v Speaker 1>enhance the little blur and then they get this perfectly

0:02:30.360 --> 0:02:34.239
<v Speaker 1>resolved picture of a guy's face from a polaroid. Yeah,

0:02:34.320 --> 0:02:37.960
<v Speaker 1>seems like that might be a little far fetched. Um. Yeah,

0:02:38.000 --> 0:02:40.920
<v Speaker 1>as it turns out, we do not have this magical capability.

0:02:40.960 --> 0:02:44.679
<v Speaker 1>Things have improved since two thousand thirteen. You know, the

0:02:45.040 --> 0:02:48.680
<v Speaker 1>basic premise of zoom and enhances all about you take

0:02:48.919 --> 0:02:53.320
<v Speaker 1>you take an existing video or image, you concentrate your

0:02:53.400 --> 0:02:57.600
<v Speaker 1>view on one particular sector of that image, and then

0:02:57.680 --> 0:03:00.320
<v Speaker 1>you zoom that in and you are in and swing

0:03:00.360 --> 0:03:03.000
<v Speaker 1>the pictures so that it is more uh, that you

0:03:03.040 --> 0:03:05.399
<v Speaker 1>can see what it is. And for one thing, we've

0:03:05.440 --> 0:03:09.280
<v Speaker 1>got super high resolution cameras now, so there's some that

0:03:09.320 --> 0:03:12.560
<v Speaker 1>are so high resolution that a normal view you'd be

0:03:12.600 --> 0:03:15.560
<v Speaker 1>looking at you know, like a like a YouTube video,

0:03:15.720 --> 0:03:18.480
<v Speaker 1>something along that size. And then it turns out that

0:03:18.560 --> 0:03:22.160
<v Speaker 1>the resolution is so high you could digitally punch into

0:03:22.200 --> 0:03:25.359
<v Speaker 1>parts of that video and not lose a lot of resolutions.

0:03:25.400 --> 0:03:27.480
<v Speaker 1>So it gives you the effect of zoom and enhanced.

0:03:27.480 --> 0:03:29.960
<v Speaker 1>But really it's not enhancing. It's just that information is

0:03:30.000 --> 0:03:33.320
<v Speaker 1>already there. Yeah, and that's the key. The information is

0:03:33.360 --> 0:03:37.280
<v Speaker 1>already there. We have never gotten and we will never

0:03:37.360 --> 0:03:41.720
<v Speaker 1>get to a point where there can be information retrieved

0:03:41.800 --> 0:03:44.680
<v Speaker 1>from an image that was not recorded in the image.

0:03:44.680 --> 0:03:47.320
<v Speaker 1>Now we can have it simulated. Right. That's exactly what

0:03:47.360 --> 0:03:50.880
<v Speaker 1>we're going to talk about today, is finding ways to

0:03:51.000 --> 0:03:54.280
<v Speaker 1>use what information is recorded in the image and do

0:03:54.520 --> 0:03:57.760
<v Speaker 1>very smart things with it with computer programs. Right. And

0:03:57.800 --> 0:04:01.040
<v Speaker 1>the really cool thing is that we're not talking about

0:04:01.400 --> 0:04:05.680
<v Speaker 1>actual like images as the end result in this case, right,

0:04:05.800 --> 0:04:11.280
<v Speaker 1>we are talking about using images to reconstruct sound. Yeah.

0:04:11.320 --> 0:04:14.920
<v Speaker 1>And this all comes to us because we watched a

0:04:15.000 --> 0:04:19.080
<v Speaker 1>TED talk in which a computer scientist named Aid Davis

0:04:19.480 --> 0:04:25.080
<v Speaker 1>talked about a really interesting project that involved cameras and

0:04:25.640 --> 0:04:30.080
<v Speaker 1>uh not necessarily well, I guess inanimate objects really and

0:04:30.200 --> 0:04:34.240
<v Speaker 1>being able to reconstruct sound that took place near that

0:04:34.279 --> 0:04:39.040
<v Speaker 1>inanimate object as if that inanimate object itself were a microphone.

0:04:40.160 --> 0:04:43.800
<v Speaker 1>Say that all again? All right, so, um, there's this

0:04:43.800 --> 0:04:46.520
<v Speaker 1>guy named Aimed Davis. No no, no, no, no. The

0:04:46.600 --> 0:04:50.800
<v Speaker 1>relevant part it does what sound? So yeah, it acts

0:04:50.800 --> 0:04:54.520
<v Speaker 1>as if any object itself is a microphone, not in

0:04:54.560 --> 0:04:58.400
<v Speaker 1>the sense of amplifying what was said, but in the

0:04:58.480 --> 0:05:02.400
<v Speaker 1>sense of of being being a record of what was said,

0:05:02.440 --> 0:05:07.120
<v Speaker 1>Like the vibrations of the material itself are able to

0:05:07.320 --> 0:05:11.599
<v Speaker 1>inform us enough that we can we can replicate the

0:05:11.680 --> 0:05:14.480
<v Speaker 1>sound that was created. Yeah, he's he's part of this

0:05:14.560 --> 0:05:17.279
<v Speaker 1>team that includes other researchers at m I t he's

0:05:17.320 --> 0:05:19.800
<v Speaker 1>a he's a grad student or possibly a graduate of

0:05:19.880 --> 0:05:22.880
<v Speaker 1>m I t UH and also scientists over at Microsoft

0:05:22.920 --> 0:05:26.560
<v Speaker 1>and Adobe. And this is so cool. Yes, this, So

0:05:26.600 --> 0:05:31.479
<v Speaker 1>this is a synesthesia machine. It sees sound. This it's

0:05:31.520 --> 0:05:34.560
<v Speaker 1>like it took a bunch of LSD and began to

0:05:34.680 --> 0:05:39.440
<v Speaker 1>see the music. Except that's what literally that they can do. Now. Yeah,

0:05:39.440 --> 0:05:42.040
<v Speaker 1>it's and and when you start to break down what's

0:05:42.080 --> 0:05:46.080
<v Speaker 1>going on, it starts to be less magical, but no

0:05:46.400 --> 0:05:49.640
<v Speaker 1>less amazing. All right, So let's let's take some of

0:05:49.680 --> 0:05:52.560
<v Speaker 1>the mysticism and magic out first. And to do that,

0:05:52.640 --> 0:05:54.880
<v Speaker 1>we have to talk about what sound is, which we've

0:05:54.920 --> 0:05:56.960
<v Speaker 1>done on this show before, but I'm going to give

0:05:56.960 --> 0:06:01.800
<v Speaker 1>a quick rundown. So sound is really the energy of vibration, right,

0:06:01.839 --> 0:06:05.440
<v Speaker 1>So when something vibrates, that's when it's creating a sound.

0:06:06.000 --> 0:06:09.360
<v Speaker 1>And as long as there is some sort of medium

0:06:09.560 --> 0:06:13.120
<v Speaker 1>for that sound to travel through, such as air exactly,

0:06:13.440 --> 0:06:16.080
<v Speaker 1>water or the wood of a table that you've put

0:06:16.080 --> 0:06:19.160
<v Speaker 1>your head down on, right, any of these things. Metal

0:06:19.200 --> 0:06:27.840
<v Speaker 1>mask around your head hatchet. Now you're just giving Aaron

0:06:27.880 --> 0:06:31.839
<v Speaker 1>Cooper father for the next next image. Alright, So but yes,

0:06:31.880 --> 0:06:34.240
<v Speaker 1>as long as there's a medium through which sound can travel,

0:06:34.279 --> 0:06:36.840
<v Speaker 1>it will travel as far as it possibly can before

0:06:36.839 --> 0:06:41.040
<v Speaker 1>the energy has essentially dissipated. And this is why sound

0:06:41.040 --> 0:06:43.679
<v Speaker 1>does not really does not travel in space because space

0:06:43.760 --> 0:06:47.159
<v Speaker 1>is effectively a vacuum. So there are no particles, there's

0:06:47.160 --> 0:06:50.080
<v Speaker 1>no medium through which the sound can travel. But here

0:06:50.160 --> 0:06:53.560
<v Speaker 1>on Earth we've got air, thank goodness, because this is

0:06:53.560 --> 0:06:57.039
<v Speaker 1>where I keep all my stuff, and air can act

0:06:57.120 --> 0:07:00.200
<v Speaker 1>as a medium through which sound can travel. So what

0:07:00.279 --> 0:07:03.679
<v Speaker 1>happens is when something vibrates, h, it begins to pull

0:07:03.720 --> 0:07:07.400
<v Speaker 1>and push uh, the air around it. So if you

0:07:07.440 --> 0:07:10.560
<v Speaker 1>imagine a vibration, some of those vibrations are going to

0:07:10.720 --> 0:07:15.320
<v Speaker 1>move inward based on your perspective, that's gonna pull air

0:07:15.400 --> 0:07:18.920
<v Speaker 1>toward it. Sometimes it's going to be moving outward, pushing

0:07:18.960 --> 0:07:22.360
<v Speaker 1>air away from it. So think of like a a

0:07:22.480 --> 0:07:26.600
<v Speaker 1>vibrating string on a guitar or a vibrating drumhead. Uh,

0:07:26.640 --> 0:07:30.440
<v Speaker 1>that's going to be pushing and pulling air. Now that air,

0:07:30.680 --> 0:07:32.640
<v Speaker 1>in turn, is going to be pushing and pulling the

0:07:32.640 --> 0:07:35.200
<v Speaker 1>air molecules around it, and so on and so forth.

0:07:35.200 --> 0:07:38.160
<v Speaker 1>It's this great, big chain reaction because our atmosphere is

0:07:38.160 --> 0:07:41.400
<v Speaker 1>a giant fluid, right, It's it's a gas, but it's

0:07:41.400 --> 0:07:44.960
<v Speaker 1>it's a It acts as a fluid. So these various

0:07:44.960 --> 0:07:48.400
<v Speaker 1>molecules will continue to push and pull, and then eventually

0:07:48.520 --> 0:07:52.000
<v Speaker 1>that motion will make it into the air inside of

0:07:52.040 --> 0:07:55.120
<v Speaker 1>your ears. So it's not that the air molecules that

0:07:55.160 --> 0:07:57.600
<v Speaker 1>were next to the strumming string on the guitar have

0:07:57.760 --> 0:08:00.520
<v Speaker 1>magically made their way to your ear. It's rather that

0:08:00.520 --> 0:08:04.400
<v Speaker 1>that motion has continued to move up at the speed

0:08:04.440 --> 0:08:07.960
<v Speaker 1>of push to your to the air inside your ears. Sure,

0:08:08.320 --> 0:08:11.160
<v Speaker 1>now at that point it ends up vibrating your ear drum,

0:08:11.240 --> 0:08:15.120
<v Speaker 1>which then goes through this whole complicated series of maneuvers

0:08:15.160 --> 0:08:17.760
<v Speaker 1>where you're talking about tiny bones and the cochlea and

0:08:17.960 --> 0:08:19.960
<v Speaker 1>fluid and we're not going to get into this to

0:08:20.120 --> 0:08:23.080
<v Speaker 1>how hearing works. You can actually read an amazing article

0:08:23.080 --> 0:08:25.160
<v Speaker 1>at how stuff works dot com on how hearing works

0:08:25.200 --> 0:08:30.960
<v Speaker 1>that explains it. But our brains ultimately interpret this motion

0:08:31.440 --> 0:08:35.000
<v Speaker 1>as sound. Now, of course, the key fact here is

0:08:35.080 --> 0:08:39.840
<v Speaker 1>that sound is vibration, and that vibration is something that

0:08:39.920 --> 0:08:43.400
<v Speaker 1>you could in theory. See, yeah, if you could see

0:08:43.440 --> 0:08:45.880
<v Speaker 1>fast enough, and if you could tell what you were

0:08:45.880 --> 0:08:49.719
<v Speaker 1>looking at. Yeah, if you could see with the ability

0:08:49.760 --> 0:08:55.120
<v Speaker 1>to really notice minute changes. I said fast enough, I

0:08:55.160 --> 0:08:58.240
<v Speaker 1>should have said I guess fast enough enough frames per

0:08:58.280 --> 0:09:01.960
<v Speaker 1>second and with enough resolution ship Yeah right. So, uh,

0:09:02.000 --> 0:09:04.760
<v Speaker 1>you know this is like there's certain videos where if

0:09:04.800 --> 0:09:08.880
<v Speaker 1>you do you know, high speed photography, high speed film,

0:09:08.960 --> 0:09:12.520
<v Speaker 1>you can see how how something like a tuning fork,

0:09:13.240 --> 0:09:16.000
<v Speaker 1>when you strike it and it's vibrating, you can actually

0:09:16.040 --> 0:09:21.080
<v Speaker 1>see how it's moving in and out of its normal alignment,

0:09:21.160 --> 0:09:24.520
<v Speaker 1>and it looks really freaky because when you just look

0:09:24.559 --> 0:09:27.600
<v Speaker 1>at it with our normal eyes are normal ability to perceive,

0:09:28.200 --> 0:09:31.360
<v Speaker 1>it doesn't really have that, you know, you don't see

0:09:31.360 --> 0:09:33.839
<v Speaker 1>it distorting like it does in that high speed video.

0:09:34.760 --> 0:09:39.240
<v Speaker 1>But uh, if we could see it, and if we

0:09:39.320 --> 0:09:42.640
<v Speaker 1>could then interpret those vibrations, if we if we knew,

0:09:43.120 --> 0:09:46.920
<v Speaker 1>all right, it's vibrating at this speed and this amplitude,

0:09:47.320 --> 0:09:50.679
<v Speaker 1>that would tell us the pitch and volume of the

0:09:50.760 --> 0:09:53.880
<v Speaker 1>sound that was affecting it, if we knew enough of

0:09:53.920 --> 0:09:58.800
<v Speaker 1>the properties of the material itself. So that's the basis

0:09:58.920 --> 0:10:01.640
<v Speaker 1>of the experiment that these folks from in my Tea

0:10:01.800 --> 0:10:06.200
<v Speaker 1>were following, and it was all about kind of pointing

0:10:06.280 --> 0:10:10.840
<v Speaker 1>a camera at an object, a camera that was capable

0:10:10.960 --> 0:10:15.920
<v Speaker 1>of detecting these minute changes, these these movements of that object,

0:10:16.360 --> 0:10:19.320
<v Speaker 1>and then feeding that through a computer that had an

0:10:19.360 --> 0:10:24.000
<v Speaker 1>algorithm that could interpret those changes as sound and then

0:10:25.080 --> 0:10:29.440
<v Speaker 1>reconstruct the sound that must have happened to produce those changes.

0:10:30.000 --> 0:10:33.920
<v Speaker 1>And the results are pretty amazing. Yeah, I have to

0:10:33.960 --> 0:10:39.079
<v Speaker 1>say I was really impressed. I am astonished. Yeah. Yeah,

0:10:39.160 --> 0:10:42.240
<v Speaker 1>it was one of those things where well, first of all, uh,

0:10:42.480 --> 0:10:45.800
<v Speaker 1>they decided to use Mary had a Little Lamb as

0:10:45.920 --> 0:10:49.120
<v Speaker 1>a lot of their their you know what they would

0:10:49.120 --> 0:10:53.480
<v Speaker 1>try to record, right, which is a throwback to experiments

0:10:53.520 --> 0:10:56.880
<v Speaker 1>that Edison was doing way back when. Yeah. The first,

0:10:57.000 --> 0:11:00.280
<v Speaker 1>the earliest recording that we know of that tom as

0:11:00.400 --> 0:11:04.520
<v Speaker 1>and made dates to eighteen seventy eight. It was on

0:11:04.559 --> 0:11:11.319
<v Speaker 1>a device that recorded messages onto tinfoil. Interestingly, the scholarship

0:11:11.360 --> 0:11:14.280
<v Speaker 1>suggests that it wasn't Edison himself that provided the voice.

0:11:14.320 --> 0:11:18.480
<v Speaker 1>It was probably someone else, but the voice says he made.

0:11:20.480 --> 0:11:23.559
<v Speaker 1>The voice says Mary had a little Lamb, And it's

0:11:23.679 --> 0:11:27.520
<v Speaker 1>very loud and very deliberate because the technology was brand

0:11:27.520 --> 0:11:30.560
<v Speaker 1>new and it was not high fidelity by any stretch

0:11:30.559 --> 0:11:33.280
<v Speaker 1>of the imagination. So this is kind of a a

0:11:34.720 --> 0:11:37.439
<v Speaker 1>sort of genuflecting to history, saying, well, that this was

0:11:37.480 --> 0:11:42.160
<v Speaker 1>a significant moment in history. We're going to use that same, uh,

0:11:42.200 --> 0:11:46.720
<v Speaker 1>that same idea when we're trying this new experiment. And

0:11:46.760 --> 0:11:49.720
<v Speaker 1>it worked like that. They did both tones of the

0:11:49.760 --> 0:11:52.960
<v Speaker 1>song Mary had a Little Lamb, and they also did

0:11:53.080 --> 0:11:56.720
<v Speaker 1>spoken variations of Mary had a Little Lamp, which in

0:11:56.800 --> 0:11:58.719
<v Speaker 1>the Ted talk I highly recommend you watch it. It's

0:11:58.800 --> 0:12:02.920
<v Speaker 1>very entertaining. Abe Davis is actually very entertaining presented. He

0:12:03.000 --> 0:12:05.560
<v Speaker 1>talks about how he he shot a you know, one

0:12:05.559 --> 0:12:08.479
<v Speaker 1>of the videos. He shows the video of him shouting

0:12:08.920 --> 0:12:12.360
<v Speaker 1>at an empty bag of potato chips. Yeah. Yeah, it

0:12:12.480 --> 0:12:16.000
<v Speaker 1>is a very technical experiment that definitely involves M. I. T.

0:12:16.240 --> 0:12:19.560
<v Speaker 1>Grad students yelling the lyrics to Marry had a Little

0:12:19.640 --> 0:12:22.800
<v Speaker 1>lamb at empty bags of potato chips. Right. He even

0:12:22.840 --> 0:12:25.640
<v Speaker 1>talks about how, you know, originally they wanted to have

0:12:26.280 --> 0:12:30.079
<v Speaker 1>the best possible um a chance to be able to

0:12:30.120 --> 0:12:33.120
<v Speaker 1>pick up these vibrations. They knew that these uh, these

0:12:33.400 --> 0:12:36.439
<v Speaker 1>vibrations were going to be tiny like a micrometer, like

0:12:36.440 --> 0:12:39.880
<v Speaker 1>like a tenth of a micrometer. Yeah, that's super tiny.

0:12:40.559 --> 0:12:42.520
<v Speaker 1>So they wanted to be able to get that with

0:12:42.760 --> 0:12:45.480
<v Speaker 1>a pretty high resolution high speed camera, and they had

0:12:45.520 --> 0:12:48.240
<v Speaker 1>to use a lot of light because these high speed

0:12:48.280 --> 0:12:50.400
<v Speaker 1>cameras that you know, the shutter speed is so fast

0:12:50.760 --> 0:12:52.200
<v Speaker 1>that you need a lot of light to light your

0:12:52.240 --> 0:12:54.520
<v Speaker 1>scene in order to get an image of what you're

0:12:54.760 --> 0:12:57.680
<v Speaker 1>you're pointing the camera at. And he even talked about

0:12:57.720 --> 0:13:00.000
<v Speaker 1>how the lights were so hot that on a previous

0:13:00.200 --> 0:13:04.080
<v Speaker 1>experiment they melted the bag, the empty bag of potato

0:13:04.160 --> 0:13:06.040
<v Speaker 1>chips as a result of this. So it was a

0:13:06.040 --> 0:13:09.760
<v Speaker 1>lot of trial and error early on, but it worked. Yeah. So,

0:13:09.760 --> 0:13:13.120
<v Speaker 1>so they were using objects like like bags of potato chips,

0:13:13.160 --> 0:13:17.079
<v Speaker 1>empty bags of potato chips, and potted plants. And the

0:13:17.120 --> 0:13:20.240
<v Speaker 1>camera that they were using for these first experiments was

0:13:20.240 --> 0:13:24.440
<v Speaker 1>was a high speed camera could capture at two thousand

0:13:24.480 --> 0:13:27.640
<v Speaker 1>to six thousand frames per second, which is a higher

0:13:27.679 --> 0:13:31.520
<v Speaker 1>frequency than the audio signal. But it certainly isn't like

0:13:31.520 --> 0:13:35.320
<v Speaker 1>like the highest possible end high speed camera on the market,

0:13:35.400 --> 0:13:38.680
<v Speaker 1>Phantom or anything. Yeah. Yeah, yeah, the the highest speed

0:13:38.679 --> 0:13:42.280
<v Speaker 1>cameras run something like a thousand a hundred thousand frames

0:13:42.280 --> 0:13:47.079
<v Speaker 1>per second. Sorry, And the software could could pick up

0:13:47.120 --> 0:13:52.160
<v Speaker 1>these tiny, these tiny, tiny movements. Um a tenth of

0:13:52.160 --> 0:13:56.000
<v Speaker 1>a micrometer is something like five thousands of a pixel,

0:13:56.960 --> 0:14:00.680
<v Speaker 1>and it could do that thanks to very subtle changes

0:14:00.800 --> 0:14:04.080
<v Speaker 1>and in each pixels color values at the edges of

0:14:04.120 --> 0:14:08.600
<v Speaker 1>the objects that were being studied. So yeah, and it's

0:14:08.640 --> 0:14:11.719
<v Speaker 1>also that he pointed out in the TED talk that

0:14:12.200 --> 0:14:15.120
<v Speaker 1>it's not like the camera was pointed at one particular,

0:14:15.240 --> 0:14:18.120
<v Speaker 1>tiny little edge of one of these objects. It could

0:14:18.200 --> 0:14:22.520
<v Speaker 1>actually take into account all of the different vibrations happening

0:14:22.520 --> 0:14:26.520
<v Speaker 1>across the object, and that collectively that provided the data

0:14:26.600 --> 0:14:30.000
<v Speaker 1>necessary for them to be able to reconstruct the audio. Right.

0:14:30.080 --> 0:14:33.720
<v Speaker 1>It's rooted in research from from m I t S

0:14:33.760 --> 0:14:38.080
<v Speaker 1>Computer Science and Artificial Intelligence Laboratory, and the software that

0:14:38.080 --> 0:14:41.320
<v Speaker 1>that team was developing was originally intended to amplify color

0:14:41.440 --> 0:14:43.480
<v Speaker 1>changes in video, but then they realized that that it

0:14:43.520 --> 0:14:46.840
<v Speaker 1>could thereby amplify motion, and so they bent it to

0:14:47.680 --> 0:14:52.920
<v Speaker 1>to UH tasks like monitoring blood flow unobtrusively and then

0:14:53.080 --> 0:14:55.200
<v Speaker 1>show that in the video right that they show the

0:14:55.280 --> 0:14:59.160
<v Speaker 1>pulse of someone's arm. Because of these minute changes, they're

0:14:59.200 --> 0:15:01.360
<v Speaker 1>able to amplif by that to the point where you

0:15:01.360 --> 0:15:03.960
<v Speaker 1>can actually see the pulse, which is, by the way,

0:15:04.160 --> 0:15:08.960
<v Speaker 1>a little freaky sure, but also pretty cool. And and

0:15:09.040 --> 0:15:11.640
<v Speaker 1>so this this new team, this this acoustic team in

0:15:13.160 --> 0:15:16.440
<v Speaker 1>built on top of that software, adding the algorithms that

0:15:16.480 --> 0:15:20.520
<v Speaker 1>would identify the whole object and monitor its overall movements

0:15:20.560 --> 0:15:27.360
<v Speaker 1>in order to create the the sound goodness. And it

0:15:27.400 --> 0:15:30.680
<v Speaker 1>was interesting because once they determined that they could capture

0:15:31.320 --> 0:15:35.640
<v Speaker 1>the motion under those quote unquote ideal circumstances with the

0:15:35.760 --> 0:15:39.120
<v Speaker 1>bright lighting and the high speed camera, they started to

0:15:39.560 --> 0:15:43.720
<v Speaker 1>test how far outside of those ideal circumstances they could

0:15:43.760 --> 0:15:47.600
<v Speaker 1>still capture meaningful information and be able to replicate the

0:15:47.680 --> 0:15:51.960
<v Speaker 1>sound that occurred next to that physical object. And you know,

0:15:52.080 --> 0:15:54.040
<v Speaker 1>the idea being that you would be able to replicate

0:15:54.040 --> 0:15:57.720
<v Speaker 1>sound even if there were no microphones, no official microphones working.

0:15:58.560 --> 0:16:04.120
<v Speaker 1>So they it up testing it with normal daylight, providing

0:16:04.160 --> 0:16:07.480
<v Speaker 1>the lighting and shooting through a soundproofed window. So the

0:16:07.520 --> 0:16:10.080
<v Speaker 1>camera was on one side of the window, the object

0:16:10.160 --> 0:16:12.040
<v Speaker 1>was on the other side of the window, another empty

0:16:12.080 --> 0:16:14.880
<v Speaker 1>back of chips, yep, and and that's where the sound

0:16:14.920 --> 0:16:17.360
<v Speaker 1>was generator was on the other side of the soundproof windows.

0:16:17.360 --> 0:16:20.040
<v Speaker 1>So uh, in theory, there shouldn't have been any sound

0:16:20.080 --> 0:16:23.880
<v Speaker 1>bleed over into the camera and they could still pick

0:16:23.960 --> 0:16:28.240
<v Speaker 1>up sound that way clearly. Yeah. Um, and with normal

0:16:28.560 --> 0:16:32.120
<v Speaker 1>indoor light. They filmed a pair of earbuds like normal plastic,

0:16:32.200 --> 0:16:35.520
<v Speaker 1>cheap out earbuds, and then reconstructed the music that they

0:16:35.520 --> 0:16:40.280
<v Speaker 1>were playing well enough that they successfully shazammed the music

0:16:40.480 --> 0:16:42.960
<v Speaker 1>and it was under pressure, which I realized now that

0:16:42.960 --> 0:16:44.960
<v Speaker 1>I should have used a lyric from that from the

0:16:45.000 --> 0:16:48.640
<v Speaker 1>beginning of the show, but never mind that. Uh They

0:16:48.720 --> 0:16:51.760
<v Speaker 1>also furthermore found that they could use a standard camera,

0:16:51.920 --> 0:16:55.880
<v Speaker 1>not a high faluten camera. And we're talking standard like

0:16:55.880 --> 0:16:59.440
<v Speaker 1>like sixty frames per second smartphone camera or you know something.

0:16:59.480 --> 0:17:02.360
<v Speaker 1>You could run out and buy a target. And this

0:17:02.440 --> 0:17:05.280
<v Speaker 1>is thanks to a quirk and how standard digital cameras

0:17:05.320 --> 0:17:09.720
<v Speaker 1>handle fast moving objects. It would be more accurate for

0:17:09.720 --> 0:17:12.520
<v Speaker 1>for them to read measurements off of your whole array

0:17:12.600 --> 0:17:15.720
<v Speaker 1>of photo detectors at the same time, but that is

0:17:15.800 --> 0:17:20.400
<v Speaker 1>kind of expensive, so uh so cameras that are less

0:17:20.400 --> 0:17:24.160
<v Speaker 1>expensive than than super high speed cameras instead read off

0:17:24.200 --> 0:17:27.480
<v Speaker 1>of their photo detectors one row at a time, sort

0:17:27.520 --> 0:17:29.840
<v Speaker 1>of like scan line televisions. And it does this very

0:17:29.920 --> 0:17:33.720
<v Speaker 1>quickly but not instantaneously sure, and it can lead to

0:17:33.760 --> 0:17:36.040
<v Speaker 1>that weird lag that you might have noticed in some

0:17:36.200 --> 0:17:39.200
<v Speaker 1>videos of high speed objects, like sort of jagged edges

0:17:39.359 --> 0:17:42.199
<v Speaker 1>or extra pixelation when the object is moving faster than

0:17:42.240 --> 0:17:45.440
<v Speaker 1>the software can can handle. Yeah. I think in the

0:17:45.560 --> 0:17:48.600
<v Speaker 1>m I T article we read, they use the example

0:17:48.600 --> 0:17:51.800
<v Speaker 1>of a rotor blade of a helicopter. Right. Sure, sure,

0:17:52.240 --> 0:17:54.760
<v Speaker 1>it might be spinning so fast that it's not going

0:17:54.840 --> 0:17:57.239
<v Speaker 1>to capture it the same way your eye would see it,

0:17:57.280 --> 0:17:59.560
<v Speaker 1>but it's going to scan the blade in a different

0:17:59.560 --> 0:18:03.520
<v Speaker 1>position each line, right, and uh on, on a much

0:18:03.720 --> 0:18:08.240
<v Speaker 1>smaller level, invisible to the naked eye. This flaw in

0:18:08.640 --> 0:18:12.640
<v Speaker 1>in normal cameras creates visual artifacts that the researchers found

0:18:12.680 --> 0:18:16.600
<v Speaker 1>out that they could use in order to measure subtle vibrations.

0:18:16.720 --> 0:18:19.639
<v Speaker 1>So the audio reconstructions that they got out of this

0:18:19.720 --> 0:18:23.720
<v Speaker 1>experiment weren't as close to the original audio, but the

0:18:23.760 --> 0:18:26.639
<v Speaker 1>researchers did report that they could probably still identify like

0:18:26.680 --> 0:18:29.160
<v Speaker 1>the number of speakers in a room, or the identity

0:18:29.240 --> 0:18:32.320
<v Speaker 1>of a speaker, given that you have an audio profile

0:18:32.359 --> 0:18:35.400
<v Speaker 1>of the person's voice to begin with. Yeah, so, uh

0:18:35.440 --> 0:18:39.639
<v Speaker 1>definitely a little more like um muted and a little

0:18:39.640 --> 0:18:42.359
<v Speaker 1>more distorted, to the point where if you had not

0:18:42.480 --> 0:18:44.800
<v Speaker 1>already heard what was being said, you might not be

0:18:44.840 --> 0:18:48.800
<v Speaker 1>able to necessarily reconstruct it. Um. Our brains are interesting, right,

0:18:48.880 --> 0:18:51.520
<v Speaker 1>Like if we hear what we're supposed to hear, and

0:18:51.560 --> 0:18:54.480
<v Speaker 1>then we hear the sound played, were more likely to

0:18:54.520 --> 0:18:57.480
<v Speaker 1>pick it out. Uh. This is something that you find

0:18:57.480 --> 0:19:00.440
<v Speaker 1>and people who claim for for like you know, hidden

0:19:00.480 --> 0:19:04.280
<v Speaker 1>messages and backmasking and that kind of stuff. Yeah, if

0:19:04.280 --> 0:19:08.000
<v Speaker 1>you if you listen to the raw sound file without

0:19:08.040 --> 0:19:11.720
<v Speaker 1>any prompting, you may end up saying I didn't make

0:19:11.720 --> 0:19:13.560
<v Speaker 1>anything out and then someone says, oh, you need to

0:19:13.600 --> 0:19:17.080
<v Speaker 1>listen for the phrase, um help me, Jonathan has me

0:19:17.119 --> 0:19:20.080
<v Speaker 1>trapped in the basement. You might end up hearing it.

0:19:20.680 --> 0:19:23.880
<v Speaker 1>I try very hard to make sure don't hear it,

0:19:24.880 --> 0:19:30.960
<v Speaker 1>but you might hear it. Okay. So anyway, that's when

0:19:31.000 --> 0:19:38.400
<v Speaker 1>you play under pressure backwards. Oh vo vovo oh, Freddie Mercury.

0:19:39.200 --> 0:19:41.440
<v Speaker 1>Other methods that you can use to pick up sound

0:19:41.440 --> 0:19:45.000
<v Speaker 1>from a distance. Uh. Some of them use a similar method,

0:19:45.040 --> 0:19:48.880
<v Speaker 1>Like there's one that uses lasers, and in fact, at

0:19:48.880 --> 0:19:52.639
<v Speaker 1>the TED talk, Davis actually says this particular approach, a

0:19:52.720 --> 0:19:55.359
<v Speaker 1>lot of people might immediately spring to the conclusion that

0:19:55.400 --> 0:19:57.440
<v Speaker 1>you would use it to spy on someone, you would

0:19:58.000 --> 0:20:01.120
<v Speaker 1>aim a camera in at something that would be kind

0:20:01.119 --> 0:20:03.359
<v Speaker 1>of unobtrusive, like a potted plant that happens to be

0:20:03.400 --> 0:20:05.920
<v Speaker 1>near a person's desk, and then you could end up

0:20:05.920 --> 0:20:09.520
<v Speaker 1>replicating conversations that went on inside that room by just

0:20:09.680 --> 0:20:12.679
<v Speaker 1>measuring the vibrations of that plant and then running it

0:20:12.680 --> 0:20:16.760
<v Speaker 1>through this algorithm to recreate the sound that happened. Most

0:20:16.800 --> 0:20:19.920
<v Speaker 1>evil plots take place in a room with a potted plant. Yeah,

0:20:20.000 --> 0:20:24.240
<v Speaker 1>you know. Uh, yeah, that's why I only use cacti now,

0:20:24.280 --> 0:20:29.320
<v Speaker 1>But even they have have been deceptive. So the the

0:20:29.359 --> 0:20:32.439
<v Speaker 1>point that Davis makes is that this is pretty low fidelity.

0:20:32.520 --> 0:20:33.920
<v Speaker 1>If you want to do something like that, and if

0:20:33.920 --> 0:20:37.080
<v Speaker 1>you really want to do that, there are alternatives that

0:20:37.200 --> 0:20:42.359
<v Speaker 1>provide much higher quality recordings, uh, mainly using laser microphones. Now,

0:20:42.560 --> 0:20:44.920
<v Speaker 1>you guys heard of laser microphones before we started looking

0:20:44.920 --> 0:20:47.639
<v Speaker 1>into this, No, I don't think I had. It was

0:20:47.680 --> 0:20:49.280
<v Speaker 1>one of those things that I had heard about only

0:20:49.320 --> 0:20:53.200
<v Speaker 1>because I was looking through the Spy Museum in Washington,

0:20:53.280 --> 0:20:56.600
<v Speaker 1>d C. And And read about them. So they work

0:20:56.600 --> 0:21:00.119
<v Speaker 1>on a very similar principle, except instead of detecting the

0:21:00.200 --> 0:21:03.520
<v Speaker 1>vibrations optically, what they're doing is they're using it's still optical,

0:21:03.600 --> 0:21:07.480
<v Speaker 1>but it's not you know, like a visual approach. Um

0:21:07.560 --> 0:21:11.040
<v Speaker 1>you're shooting a laser out at an object and as

0:21:11.040 --> 0:21:15.000
<v Speaker 1>that object vibrates when it is exposed to sound the

0:21:15.080 --> 0:21:17.520
<v Speaker 1>returning laser light. Because you know, it's all based on

0:21:17.680 --> 0:21:20.120
<v Speaker 1>shooting a laser out and then detecting when it comes back,

0:21:20.560 --> 0:21:24.159
<v Speaker 1>the returning laser light will have slightly different arrival times

0:21:24.160 --> 0:21:27.160
<v Speaker 1>than when it was sent out based on those vibrations

0:21:27.160 --> 0:21:29.040
<v Speaker 1>of it's vibrating out, it's going to be a little

0:21:29.080 --> 0:21:32.479
<v Speaker 1>shorter than if it's vibrating in. And while that sounds

0:21:32.480 --> 0:21:36.560
<v Speaker 1>incredibly minute, and it is, it's enough to be able

0:21:36.680 --> 0:21:40.400
<v Speaker 1>to take that data, feed it back through and create

0:21:40.480 --> 0:21:43.679
<v Speaker 1>a sound file based on it, so you could replicate

0:21:43.800 --> 0:21:46.199
<v Speaker 1>things that are being said or other sounds that are

0:21:46.240 --> 0:21:48.879
<v Speaker 1>going on. And in fact, there are a lot of

0:21:48.880 --> 0:21:53.280
<v Speaker 1>places that that due to their classified nature, due to

0:21:53.320 --> 0:21:56.359
<v Speaker 1>the secrecy of stuff that goes on inside, they take

0:21:56.440 --> 0:22:01.480
<v Speaker 1>great pains to try and obvious skate any view into

0:22:01.600 --> 0:22:04.840
<v Speaker 1>the place, whether that's creating sort of a double glazing

0:22:04.840 --> 0:22:07.080
<v Speaker 1>on the window so it disperses the laser beam and

0:22:07.080 --> 0:22:10.119
<v Speaker 1>thus the laser beam can't get a good read um

0:22:10.240 --> 0:22:14.119
<v Speaker 1>or other elements as well. So that's super spy stuff

0:22:14.160 --> 0:22:16.320
<v Speaker 1>that most of us don't have to worry about. But

0:22:16.400 --> 0:22:21.240
<v Speaker 1>as Davis would say, that is a more relevant fear

0:22:21.400 --> 0:22:24.760
<v Speaker 1>than someone using a camera to look in Um, that's

0:22:24.840 --> 0:22:29.200
<v Speaker 1>not as likely. I mean, why would you do. It's expensive,

0:22:31.840 --> 0:22:35.440
<v Speaker 1>but you do have to get your access to the

0:22:35.480 --> 0:22:38.119
<v Speaker 1>algorithm though. That's true, that's true. No, I've got another

0:22:38.119 --> 0:22:41.920
<v Speaker 1>technology in mind. That's where you just grab somebody from

0:22:42.000 --> 0:22:44.280
<v Speaker 1>the room and throw them in a van and demand

0:22:44.320 --> 0:22:45.960
<v Speaker 1>to know what was said. That's not so much a

0:22:46.000 --> 0:22:51.280
<v Speaker 1>technology as it is a valid strategy. Hey, vans are

0:22:51.320 --> 0:22:54.960
<v Speaker 1>a technology. You could just as easily throw them in

0:22:55.000 --> 0:22:57.240
<v Speaker 1>the back of a horse drawn carriage. I mean, it's

0:22:57.240 --> 0:23:00.880
<v Speaker 1>just which are also a technology? Options are list uh

0:23:00.920 --> 0:23:04.520
<v Speaker 1>there of course also long range microphones. You probably have

0:23:04.560 --> 0:23:06.800
<v Speaker 1>seen these advertise in the back of a comic book.

0:23:07.160 --> 0:23:10.080
<v Speaker 1>Parabolic mic. Yeah, that's that's the really popular one. Like

0:23:10.160 --> 0:23:12.800
<v Speaker 1>that's if you ever see, like the spy kits that

0:23:12.840 --> 0:23:14.760
<v Speaker 1>are made for kids who are interested in this kind

0:23:14.760 --> 0:23:16.919
<v Speaker 1>of stuff, there's usually some sort of parabolic mic and

0:23:17.080 --> 0:23:21.040
<v Speaker 1>involved in that. Parabolic mics are meant to be sensitive

0:23:21.080 --> 0:23:24.399
<v Speaker 1>and directional. They're not terribly good at being directional, but

0:23:24.440 --> 0:23:27.879
<v Speaker 1>the really high powered ones are fairly sensitive. Uh. So

0:23:27.960 --> 0:23:30.480
<v Speaker 1>the idea is that you're you're concentrating on a specific

0:23:30.480 --> 0:23:32.600
<v Speaker 1>area to try and pick up sound from that area

0:23:32.680 --> 0:23:36.000
<v Speaker 1>while trying to block sound from as other from all

0:23:36.040 --> 0:23:38.479
<v Speaker 1>the other directions as much as possible. Right, Because as

0:23:38.600 --> 0:23:41.720
<v Speaker 1>as you keep turning up sensitivity in any kind of recorder,

0:23:41.760 --> 0:23:44.080
<v Speaker 1>you're going to increase your noise. Yeah, it's it's like

0:23:44.119 --> 0:23:46.840
<v Speaker 1>turning up the gain on a microphone. If you crank

0:23:46.920 --> 0:23:48.840
<v Speaker 1>that way up, you're gonna get a lot of noise,

0:23:48.880 --> 0:23:50.960
<v Speaker 1>and and it's gonna be harder for you to concentrate

0:23:50.960 --> 0:23:53.119
<v Speaker 1>on the signal. Uh. There are a lot of different

0:23:53.160 --> 0:23:56.760
<v Speaker 1>models of long range microphones, and they all have varying

0:23:56.800 --> 0:24:01.360
<v Speaker 1>degrees of directionality and sensitivity and and uh, but they

0:24:01.359 --> 0:24:04.920
<v Speaker 1>all work on basically the same principle, the idea of

0:24:05.280 --> 0:24:07.920
<v Speaker 1>boosting an acoustic signal to the point where you can

0:24:08.000 --> 0:24:09.959
<v Speaker 1>hear it where normally you would not be able to.

0:24:10.400 --> 0:24:13.400
<v Speaker 1>There are not very many out there that are uh

0:24:13.440 --> 0:24:16.600
<v Speaker 1>powerful enough and and effective enough to be able to

0:24:16.600 --> 0:24:20.480
<v Speaker 1>listen to a conversation beyond like fifty meters away, unless

0:24:20.480 --> 0:24:22.440
<v Speaker 1>people are shouting, in which case you probably go a

0:24:22.480 --> 0:24:25.959
<v Speaker 1>little further, but um, you know that they have their limitations.

0:24:25.960 --> 0:24:28.000
<v Speaker 1>Whereas the laser, I mean, as long as you have

0:24:28.000 --> 0:24:31.359
<v Speaker 1>a line of sight, you could in theory be really

0:24:31.480 --> 0:24:34.919
<v Speaker 1>far away from the actual conversation taking place and be

0:24:34.960 --> 0:24:38.320
<v Speaker 1>able to replicate it. Also, depending on who the person

0:24:38.400 --> 0:24:40.400
<v Speaker 1>spuying on you is, they could just listen to you

0:24:40.440 --> 0:24:42.560
<v Speaker 1>through your cell phone. Yeah, there are a lot of

0:24:42.560 --> 0:24:45.960
<v Speaker 1>other options right, um, but at any rate, so if

0:24:46.000 --> 0:24:51.239
<v Speaker 1>this is not really good as a spy technology, what

0:24:51.400 --> 0:24:55.000
<v Speaker 1>else could we use it for? Well, it could be

0:24:55.119 --> 0:24:59.440
<v Speaker 1>used to reconstruct conversations for court cases and and forensics

0:24:59.680 --> 0:25:02.560
<v Speaker 1>within the justice system. Yep. Of course it requires that

0:25:02.600 --> 0:25:04.560
<v Speaker 1>you have had a camera in place of that area

0:25:04.560 --> 0:25:06.760
<v Speaker 1>in the first place. But yes, but many places do

0:25:07.359 --> 0:25:11.439
<v Speaker 1>be paranoid everyone. It could help out in hospitals that

0:25:11.560 --> 0:25:14.879
<v Speaker 1>the kind of software could monitor vital signs like like

0:25:14.920 --> 0:25:16.760
<v Speaker 1>breath and blood flow from a distance, like we were

0:25:16.800 --> 0:25:19.600
<v Speaker 1>talking about earlier, which would be especially useful in cases

0:25:19.600 --> 0:25:22.480
<v Speaker 1>where you want to disturb the patient as little as possible,

0:25:22.520 --> 0:25:25.760
<v Speaker 1>which I would I would posit as everyone, but but

0:25:25.920 --> 0:25:30.600
<v Speaker 1>especially like infants. Yeah, theah people who who might not

0:25:30.640 --> 0:25:34.359
<v Speaker 1>be able to alert you in a change like especially

0:25:34.400 --> 0:25:37.440
<v Speaker 1>a sudden change in their health. This would be very

0:25:37.520 --> 0:25:39.800
<v Speaker 1>useful to monitor that type of stuff and get a

0:25:39.920 --> 0:25:43.639
<v Speaker 1>very early look at it. Or hey, uh, laper scopy

0:25:43.840 --> 0:25:47.560
<v Speaker 1>is pretty cool putting cameras where cameras don't usually go

0:25:47.680 --> 0:25:52.680
<v Speaker 1>into human bodies, and you could use this to take

0:25:52.800 --> 0:25:55.879
<v Speaker 1>more precise measurements of what's going on inside of a

0:25:55.960 --> 0:25:58.600
<v Speaker 1>human body. Joe, have you ever wondered what your spleen

0:25:58.760 --> 0:26:05.120
<v Speaker 1>sounds like? I don't need to wonder. It's voice whisper

0:26:05.200 --> 0:26:07.520
<v Speaker 1>sweet nothings in my ear. That's better than my answer,

0:26:07.520 --> 0:26:10.800
<v Speaker 1>which was I can show you, all right. So, going

0:26:10.840 --> 0:26:13.960
<v Speaker 1>back to the Ted talk, Davis actually talked about how

0:26:14.160 --> 0:26:18.439
<v Speaker 1>this could even tell us more about materials themselves and

0:26:18.480 --> 0:26:21.800
<v Speaker 1>how they behave, which was really cool. The idea that

0:26:21.960 --> 0:26:25.240
<v Speaker 1>it's kind of like reversing the the approach that they

0:26:25.280 --> 0:26:28.120
<v Speaker 1>had been taking, where they were looking at the vibrations

0:26:28.160 --> 0:26:31.600
<v Speaker 1>and determining what sound was being made to cause those vibrations. Instead,

0:26:32.280 --> 0:26:35.960
<v Speaker 1>they end up measuring vibrations and then figuring out how

0:26:36.000 --> 0:26:40.040
<v Speaker 1>that material would behave uh, depending upon the different types

0:26:40.080 --> 0:26:43.359
<v Speaker 1>of stresses being put on that material. So the first

0:26:43.400 --> 0:26:46.600
<v Speaker 1>example he shows was a it was a wire uh,

0:26:47.320 --> 0:26:50.240
<v Speaker 1>like a wire sculpture of a of a human figure,

0:26:51.000 --> 0:26:53.800
<v Speaker 1>and they had it on a on a surface like

0:26:53.840 --> 0:26:55.679
<v Speaker 1>it's on a little pedestal and it was resting on

0:26:55.760 --> 0:26:58.639
<v Speaker 1>like a shelf table or something. And you see a

0:26:58.640 --> 0:27:02.720
<v Speaker 1>hand come into frame and bang on the surface quite

0:27:02.720 --> 0:27:06.840
<v Speaker 1>a few times to create little vibrations in the wire frame.

0:27:07.680 --> 0:27:09.560
<v Speaker 1>It was. It was to the tune of shaven a

0:27:09.560 --> 0:27:14.040
<v Speaker 1>haircut two bits, And they ended up measuring this with

0:27:14.160 --> 0:27:17.280
<v Speaker 1>their camera equipment, but instead of trying to recreate the

0:27:17.359 --> 0:27:20.600
<v Speaker 1>shaven a haircut knock, they did it to find out

0:27:20.680 --> 0:27:23.520
<v Speaker 1>how this material behaves when stresses are put on it,

0:27:24.040 --> 0:27:29.320
<v Speaker 1>and their algorithm could then simulate that material's behavior and

0:27:29.400 --> 0:27:34.439
<v Speaker 1>they could virtually drag any part of that material and

0:27:34.480 --> 0:27:37.120
<v Speaker 1>then let go to see how it would snap back

0:27:37.119 --> 0:27:41.280
<v Speaker 1>into place, uh, creating kind of this virtual model but

0:27:41.520 --> 0:27:44.840
<v Speaker 1>using real imagery. It was really interesting. Yeah. Yeah, And

0:27:45.040 --> 0:27:47.080
<v Speaker 1>the this little wire figure was not the only thing

0:27:47.160 --> 0:27:51.000
<v Speaker 1>that he showed a test of, right, he showed a bush.

0:27:51.160 --> 0:27:54.639
<v Speaker 1>They ended up shooting video of a bush as it

0:27:54.720 --> 0:27:57.000
<v Speaker 1>was moving through because of I believe it was a

0:27:57.040 --> 0:27:58.760
<v Speaker 1>breeze that was just blowing. Yeah, it was just a

0:27:58.760 --> 0:28:01.840
<v Speaker 1>shrub outside of his apartment, I think, and then they

0:28:01.840 --> 0:28:05.359
<v Speaker 1>were able to with getting enough of that information, they

0:28:05.400 --> 0:28:08.719
<v Speaker 1>were able to then recreate that as well. And so

0:28:08.840 --> 0:28:10.920
<v Speaker 1>you have this image of the bush and he could

0:28:10.920 --> 0:28:14.439
<v Speaker 1>pick any point of it and virtually pull on it,

0:28:14.520 --> 0:28:17.080
<v Speaker 1>and it would show how that bush would move, and

0:28:17.119 --> 0:28:19.520
<v Speaker 1>it looks like it's just video that's been shot of

0:28:19.600 --> 0:28:22.119
<v Speaker 1>this bush as someone has you know, kicked it or something.

0:28:22.960 --> 0:28:26.040
<v Speaker 1>And also showed a like a curtain or a tapestry

0:28:26.040 --> 0:28:29.480
<v Speaker 1>hanging and how that would move based upon the same

0:28:29.520 --> 0:28:31.520
<v Speaker 1>sort of thing, like based upon the source of stresses

0:28:31.560 --> 0:28:33.680
<v Speaker 1>you could put on it, whether you're tugging at one

0:28:33.720 --> 0:28:36.600
<v Speaker 1>corner or the center or whatever, how it would ripple

0:28:36.680 --> 0:28:39.960
<v Speaker 1>back into place. And this was based on extremely small,

0:28:40.000 --> 0:28:42.320
<v Speaker 1>like like invisible emotions to the eye. In the case

0:28:42.320 --> 0:28:45.360
<v Speaker 1>of the curtain, it was just just tiny currents of

0:28:45.360 --> 0:28:48.000
<v Speaker 1>air that we're moving through the room. And so it

0:28:48.360 --> 0:28:51.960
<v Speaker 1>was really really striking getting to watch it, you know. Artistically,

0:28:52.040 --> 0:28:55.160
<v Speaker 1>this kind of thing could let filmmakers adjust the motion

0:28:55.200 --> 0:29:00.040
<v Speaker 1>of objects after a video has already been shot, like

0:29:00.040 --> 0:29:03.440
<v Speaker 1>if if if hair was was blowing the wrong way

0:29:03.920 --> 0:29:06.480
<v Speaker 1>in in in a shot, because you've because you've kind

0:29:06.480 --> 0:29:08.640
<v Speaker 1>of you've kind of plastered a couple different a couple

0:29:08.680 --> 0:29:11.200
<v Speaker 1>different frames together or something like that. Then you could

0:29:11.840 --> 0:29:15.960
<v Speaker 1>move the hair in another direction. Do you remember that

0:29:16.080 --> 0:29:20.120
<v Speaker 1>phase in the history of video games, video games when

0:29:20.160 --> 0:29:23.840
<v Speaker 1>they were no when everyone was obsessed with hair blowing

0:29:23.840 --> 0:29:27.200
<v Speaker 1>in the wind. I don't think we're over there. Remember

0:29:27.280 --> 0:29:30.040
<v Speaker 1>it's happening today. I mean, that's like the thing that

0:29:30.200 --> 0:29:34.960
<v Speaker 1>all graphics boasting referred to. Look at this character's hair.

0:29:35.520 --> 0:29:38.800
<v Speaker 1>It's the hair and it's water. Doesn't like the water

0:29:38.880 --> 0:29:41.200
<v Speaker 1>effects and the hair effects. But no, I was gonna say, like,

0:29:41.240 --> 0:29:43.640
<v Speaker 1>this could create a new era of full motion video

0:29:43.720 --> 0:29:49.840
<v Speaker 1>video games. Finally, finally, we've been waiting for it exactly.

0:29:49.880 --> 0:29:51.920
<v Speaker 1>The reason we don't have that yet is that we

0:29:52.000 --> 0:29:54.360
<v Speaker 1>don't know what to make these characters look like when

0:29:54.400 --> 0:29:58.640
<v Speaker 1>they're supposed to jiggle. I am looking forward to the

0:29:58.680 --> 0:30:03.320
<v Speaker 1>resurgence of the emotion video video game because I need

0:30:03.360 --> 0:30:06.760
<v Speaker 1>to make sure that I have a transition plan for

0:30:06.840 --> 0:30:09.880
<v Speaker 1>when I am finished doing my futurism. I need to

0:30:09.920 --> 0:30:12.840
<v Speaker 1>have another career to move into, and I think FMV

0:30:13.120 --> 0:30:17.320
<v Speaker 1>actor in low budget video games is my niche. You know,

0:30:17.520 --> 0:30:20.280
<v Speaker 1>if Tim Curry can do it, you can do it too. Yeah,

0:30:20.360 --> 0:30:24.120
<v Speaker 1>nowhere near the panache that Mr Curry had in things

0:30:24.160 --> 0:30:27.600
<v Speaker 1>like that Frankenstein game. Well, work on it. Also, your

0:30:27.640 --> 0:30:30.920
<v Speaker 1>listeners over tech stuff would be so happy. That's true.

0:30:31.160 --> 0:30:34.280
<v Speaker 1>That's true. So we will see if that day arrives.

0:30:34.440 --> 0:30:38.840
<v Speaker 1>But more practically, it could let engineers test structures for whear,

0:30:39.040 --> 0:30:43.360
<v Speaker 1>like like predict damage from earthquakes, important things like that. Yeah,

0:30:43.440 --> 0:30:46.920
<v Speaker 1>it's really kind of an interesting way of advancing our

0:30:46.960 --> 0:30:53.000
<v Speaker 1>knowledge of material science in an indirect fashion. It's pretty amazing. Uh.

0:30:53.240 --> 0:30:56.080
<v Speaker 1>Also could tell us things about like resonant frequencies, that

0:30:56.120 --> 0:30:58.840
<v Speaker 1>sort of stuff, which again is very similar to making

0:30:58.840 --> 0:31:01.600
<v Speaker 1>sure you're you're in engineering things properly in the case

0:31:01.600 --> 0:31:04.360
<v Speaker 1>of earthquakes or you know, we've we've heard the horror

0:31:04.440 --> 0:31:08.200
<v Speaker 1>stories about bridges and resonant frequencies. That kind of stuff

0:31:08.840 --> 0:31:12.480
<v Speaker 1>really fascinating And uh, we're sitting in an audio studio

0:31:12.560 --> 0:31:14.560
<v Speaker 1>right now. I wonder if it could have applications in

0:31:14.640 --> 0:31:17.960
<v Speaker 1>trying to figure out how to best insulate how the

0:31:18.000 --> 0:31:25.600
<v Speaker 1>best baffle sound studio, which we often find baffling. All right, Well,

0:31:25.920 --> 0:31:29.600
<v Speaker 1>on that note, members of the audience, have you ever

0:31:29.680 --> 0:31:33.160
<v Speaker 1>heard drilling in a forward thinking podcast? We tried to

0:31:33.280 --> 0:31:36.680
<v Speaker 1>edit most to the drilling out, but sometimes some drilling

0:31:36.760 --> 0:31:39.840
<v Speaker 1>might get through. Yeah, I blame Ben Bolan and his

0:31:39.920 --> 0:31:45.000
<v Speaker 1>search for lost gold, but uh, you know. Anyway, Guys,

0:31:45.080 --> 0:31:48.080
<v Speaker 1>if you have suggestions for future topics we can tackle

0:31:48.200 --> 0:31:51.560
<v Speaker 1>here on Forward Thinking, whatever they may be, you should

0:31:51.560 --> 0:31:54.400
<v Speaker 1>write us and let us know. The email address is

0:31:54.720 --> 0:31:58.800
<v Speaker 1>FW thinking at how Stuff Works dot com, or you

0:31:58.800 --> 0:32:01.560
<v Speaker 1>can drop us a line on Twitter or Google Plus.

0:32:01.800 --> 0:32:04.560
<v Speaker 1>We are FW thinking at both of those places. Our

0:32:04.600 --> 0:32:07.120
<v Speaker 1>search FW thinking and Facebook will pop right up. You

0:32:07.120 --> 0:32:09.440
<v Speaker 1>can leave us a message and we'll talk to you

0:32:09.480 --> 0:32:17.800
<v Speaker 1>again really soon for more on this topic and the

0:32:17.880 --> 0:32:32.080
<v Speaker 1>future of technology. This is forward sinking dot Com, brought

0:32:32.080 --> 0:32:34.600
<v Speaker 1>to you by Toyota. Let's go places,