WEBVTT - Big Applications for Big Data

0:00:00.160 --> 0:00:07.400
<v Speaker 1>Brought to you by Toyota. Let's go places. Welcome to

0:00:07.560 --> 0:00:15.120
<v Speaker 1>Forward Thinking. Hey there everyone, and welcome to Forward Thinking,

0:00:15.200 --> 0:00:17.400
<v Speaker 1>the podcast that looks at the future and says, here's

0:00:17.400 --> 0:00:19.880
<v Speaker 1>a little story. I've got to tell about three bad podcasters.

0:00:19.880 --> 0:00:24.120
<v Speaker 1>You know, so well, I'm Jonathan Strickland and I'm Joe McCormick,

0:00:24.320 --> 0:00:27.320
<v Speaker 1>and you just you don't dig the BC boys they're

0:00:27.360 --> 0:00:29.800
<v Speaker 1>Lauren No, no, I do that. That was That was lovely,

0:00:29.960 --> 0:00:33.199
<v Speaker 1>Thank you, terrific, thank you. Uh, and we want to

0:00:33.240 --> 0:00:36.279
<v Speaker 1>talk a little bit more about big data. You know,

0:00:36.720 --> 0:00:39.960
<v Speaker 1>we talked about big data. It's okay, I'm going to

0:00:40.080 --> 0:00:45.040
<v Speaker 1>use both pronunciations. Okay, fair enough, so big info, uh no,

0:00:45.200 --> 0:00:47.720
<v Speaker 1>big data because we're all you are talking about computer

0:00:47.880 --> 0:00:50.800
<v Speaker 1>information here and not the android from Star not the

0:00:50.840 --> 0:00:53.920
<v Speaker 1>android from Star Trek. Yeah, just get that all the way.

0:00:54.320 --> 0:00:58.320
<v Speaker 1>So we're we wanted to talk more about applications of

0:00:58.360 --> 0:01:02.200
<v Speaker 1>what what different organizations, companies, governments are using big data

0:01:02.240 --> 0:01:04.600
<v Speaker 1>for what they're mining out of this huge amount of

0:01:04.640 --> 0:01:07.679
<v Speaker 1>information that we are generating every day. Now. You may

0:01:07.680 --> 0:01:11.560
<v Speaker 1>remember in our last podcast we said that we're generating

0:01:11.600 --> 0:01:16.080
<v Speaker 1>about two point five quintillion bytes of information per day

0:01:16.160 --> 0:01:18.440
<v Speaker 1>and not just humans, but you know, sensors, things that

0:01:18.480 --> 0:01:22.320
<v Speaker 1>are uh indirectly yeah, internet things, stuff that's connected to

0:01:22.360 --> 0:01:25.559
<v Speaker 1>the Internet that we're not directly in putting data into.

0:01:25.880 --> 0:01:28.240
<v Speaker 1>And for those of us who missed the last podcast,

0:01:28.319 --> 0:01:31.640
<v Speaker 1>what's the difference between this paradigm of big data and

0:01:31.680 --> 0:01:35.919
<v Speaker 1>just say a lot of data. Big data? We're talking huge, huge,

0:01:36.160 --> 0:01:39.600
<v Speaker 1>enormous amounts of information. When we're talking two point five

0:01:39.720 --> 0:01:45.200
<v Speaker 1>quintillion bytes, that's half of all the spoken words that

0:01:45.360 --> 0:01:49.720
<v Speaker 1>humans have uttered since the dawn of language. So we're

0:01:49.760 --> 0:01:53.680
<v Speaker 1>talking about in two days, you generate as much information

0:01:53.880 --> 0:01:57.040
<v Speaker 1>as all the words we've ever spoken ever of all

0:01:57.080 --> 0:02:00.120
<v Speaker 1>the people. So there's this issue of volume, but also

0:02:00.400 --> 0:02:04.520
<v Speaker 1>these characteristics we talked about, like velocity and variety. So

0:02:04.600 --> 0:02:07.960
<v Speaker 1>it's not just the amount of data, but it's interacting

0:02:08.560 --> 0:02:13.440
<v Speaker 1>exactly extremely fast. We're gathering it at an incredible pace.

0:02:13.639 --> 0:02:17.000
<v Speaker 1>I mean, you're you're gathering data at an unprecedented pace.

0:02:17.160 --> 0:02:20.320
<v Speaker 1>You are. It's rich and intense and everywhere. Yeah, and

0:02:20.360 --> 0:02:23.400
<v Speaker 1>it's all different types of information. And also there's a

0:02:23.400 --> 0:02:27.560
<v Speaker 1>fourth v that we can mention, which is voracity, which

0:02:27.600 --> 0:02:31.160
<v Speaker 1>is the quality of the information truthiness. Yeah, yeah, there

0:02:31.160 --> 0:02:35.280
<v Speaker 1>you go. Uh we're gonna tell you some strategicies that

0:02:35.360 --> 0:02:38.440
<v Speaker 1>people use with big data. Uh it does mean truthiness.

0:02:38.480 --> 0:02:42.639
<v Speaker 1>I mean it means that the data is essentially high quality. Right. Yeah.

0:02:42.760 --> 0:02:45.880
<v Speaker 1>Voracity is really kind of their way of saying, how

0:02:45.960 --> 0:02:48.919
<v Speaker 1>good is your information? And sometimes you don't know how

0:02:48.919 --> 0:02:51.239
<v Speaker 1>good your information is, because again, you've got a lot

0:02:51.280 --> 0:02:53.560
<v Speaker 1>of it. Until you analyze it, you don't really know

0:02:54.040 --> 0:02:58.280
<v Speaker 1>if there's anything useful there. Uh. IBM is in the

0:02:58.400 --> 0:03:02.760
<v Speaker 1>business of of leveraging big data and helping other companies

0:03:02.840 --> 0:03:07.600
<v Speaker 1>leverage it. And uh and and they say very clearly

0:03:07.639 --> 0:03:10.200
<v Speaker 1>on their website and in their white papers that the

0:03:10.240 --> 0:03:12.760
<v Speaker 1>important part here is that you have to figure out

0:03:12.800 --> 0:03:16.280
<v Speaker 1>what your goal is before you start just looking at

0:03:16.280 --> 0:03:18.080
<v Speaker 1>big data and saying we need to be part of this.

0:03:18.639 --> 0:03:21.600
<v Speaker 1>Because whatever your goal is, that's going to end up

0:03:21.720 --> 0:03:25.440
<v Speaker 1>informing your approach to using that information in a way

0:03:25.480 --> 0:03:28.560
<v Speaker 1>that makes sense. Otherwise, you're just talking about an enormous

0:03:28.600 --> 0:03:32.200
<v Speaker 1>resource that may not be directly useful to you. You're

0:03:32.240 --> 0:03:35.400
<v Speaker 1>just kind of looking at it and thinking, I want

0:03:36.000 --> 0:03:38.240
<v Speaker 1>to make use of that information. I just it's too

0:03:38.280 --> 0:03:41.000
<v Speaker 1>big a problem for me to even get a grasp

0:03:41.040 --> 0:03:44.440
<v Speaker 1>on how I want to use it. So you can't

0:03:44.480 --> 0:03:47.080
<v Speaker 1>just run at it and say, give me some ones

0:03:47.120 --> 0:03:49.960
<v Speaker 1>and zeros, I'm gonna make magic happen. You have to

0:03:49.960 --> 0:03:52.000
<v Speaker 1>have a plan in place first. But I've got a

0:03:52.000 --> 0:03:54.720
<v Speaker 1>lot of really clever people have figured out really cool

0:03:54.760 --> 0:03:58.720
<v Speaker 1>stuff to do with this information. Sure. In the last podcast,

0:03:58.840 --> 0:04:01.960
<v Speaker 1>we talked a little bit about traffic analysis, which was

0:04:02.080 --> 0:04:04.800
<v Speaker 1>a very kind of you know, you know, it's an

0:04:04.800 --> 0:04:08.520
<v Speaker 1>easy to understand application of big data, right, So let

0:04:08.520 --> 0:04:13.320
<v Speaker 1>me give you an example using Google's approach. So the

0:04:13.320 --> 0:04:17.120
<v Speaker 1>way Google would generate traffic on Google Maps. If you

0:04:17.160 --> 0:04:20.000
<v Speaker 1>were using Google Maps on a mobile device and you

0:04:20.080 --> 0:04:22.080
<v Speaker 1>wanted to try and get from point A to point B,

0:04:22.200 --> 0:04:25.120
<v Speaker 1>and you wanted traffic to be part of that that equation,

0:04:25.480 --> 0:04:28.880
<v Speaker 1>that route, then what it would do is it would

0:04:28.880 --> 0:04:32.560
<v Speaker 1>start looking at information of other Google users. Uh. There

0:04:32.560 --> 0:04:36.479
<v Speaker 1>are other systems that use this, the Dash system, which

0:04:36.960 --> 0:04:39.800
<v Speaker 1>doesn't really exist anymore, but uh, it used a very

0:04:39.839 --> 0:04:44.920
<v Speaker 1>similar approach where it would send anonymous data about vehicles

0:04:44.960 --> 0:04:48.160
<v Speaker 1>that were moving through a particular region and the speed

0:04:48.240 --> 0:04:52.800
<v Speaker 1>at which they were moving. It was just sampling the vehicle, yeah,

0:04:52.800 --> 0:04:55.680
<v Speaker 1>sampling the vehicle's location. It would get the GPS coordinates

0:04:56.000 --> 0:04:57.880
<v Speaker 1>and it would just sample it over a certain amount

0:04:57.880 --> 0:05:01.440
<v Speaker 1>of time and derive how fat that vehicle was moving

0:05:01.480 --> 0:05:04.600
<v Speaker 1>down the street based upon the information, right saying, okay,

0:05:04.600 --> 0:05:06.880
<v Speaker 1>well it was at this point at this time, and

0:05:06.920 --> 0:05:09.000
<v Speaker 1>it was at this other point a little bit later,

0:05:09.080 --> 0:05:10.760
<v Speaker 1>and this other point a little bit later. Therefore, that

0:05:10.760 --> 0:05:13.839
<v Speaker 1>means traffic is moving at this speed down this particular street,

0:05:14.400 --> 0:05:18.080
<v Speaker 1>and then it's extrapolates that sends it out to everyone

0:05:18.120 --> 0:05:21.120
<v Speaker 1>so that you know which routes have the heaviest traffic.

0:05:22.000 --> 0:05:26.000
<v Speaker 1>Now the Yeah, it's it's a very simple approach to

0:05:26.160 --> 0:05:29.039
<v Speaker 1>big data in the sense that it's just taking real

0:05:29.120 --> 0:05:33.600
<v Speaker 1>time information, analyzing it, and sending the results back very quickly.

0:05:33.600 --> 0:05:36.880
<v Speaker 1>It's not storing information, it's not trying to be transformative

0:05:36.920 --> 0:05:39.240
<v Speaker 1>with the information. It's just trying to make sense of

0:05:39.320 --> 0:05:41.680
<v Speaker 1>all these different pieces of information that are coming in

0:05:42.040 --> 0:05:44.720
<v Speaker 1>and then making it meaningful to the people who are

0:05:44.800 --> 0:05:49.240
<v Speaker 1>using the service. So that's one example, but that's just one.

0:05:49.560 --> 0:05:53.760
<v Speaker 1>You can actually see some pretty interesting patterns when you

0:05:53.800 --> 0:05:56.640
<v Speaker 1>get huge amounts of information. You can see patterns where

0:05:56.680 --> 0:05:59.640
<v Speaker 1>you might have thought there was just chaos before. So

0:05:59.760 --> 0:06:03.640
<v Speaker 1>you you can look at a system that you might say, well,

0:06:04.279 --> 0:06:07.560
<v Speaker 1>from the outset, it just looks like stuff is happening.

0:06:07.880 --> 0:06:10.279
<v Speaker 1>But now when I see all this information, it's broken

0:06:10.279 --> 0:06:12.679
<v Speaker 1>down like this, I can actually see trends where before

0:06:12.720 --> 0:06:16.800
<v Speaker 1>I just saw stuff. So education is a good example

0:06:16.800 --> 0:06:19.480
<v Speaker 1>of this. So let's say you're a teacher and you're

0:06:19.480 --> 0:06:25.520
<v Speaker 1>teaching a class, and you have your classes submitting schoolwork

0:06:25.560 --> 0:06:28.800
<v Speaker 1>in a into a system that can then analyze the

0:06:28.839 --> 0:06:32.040
<v Speaker 1>school work. So you're grading the kids. You you might

0:06:32.080 --> 0:06:34.880
<v Speaker 1>actually just be inserting the grades into the system. In fact,

0:06:34.880 --> 0:06:37.960
<v Speaker 1>it may not have any connection with the kids directly

0:06:38.000 --> 0:06:41.200
<v Speaker 1>at all. You might just be the teacher in the system.

0:06:41.240 --> 0:06:44.240
<v Speaker 1>The system would be able to, if you're using a

0:06:44.360 --> 0:06:47.520
<v Speaker 1>very sophisticated approach, be able to start detecting trends in

0:06:47.600 --> 0:06:52.880
<v Speaker 1>each individual student's progress. So you might be able to say, oh,

0:06:53.120 --> 0:06:59.320
<v Speaker 1>while while student A isn't failing, the trend indicates that

0:06:59.720 --> 0:07:02.120
<v Speaker 1>the student is beginning to struggle. So I need to

0:07:02.200 --> 0:07:05.960
<v Speaker 1>adjust my way of reaching this student so that I

0:07:06.360 --> 0:07:10.080
<v Speaker 1>am not leaving the student without any support. Oh, I wonder,

0:07:10.200 --> 0:07:15.280
<v Speaker 1>so would that involve comparing uh little signals with millions

0:07:15.280 --> 0:07:18.080
<v Speaker 1>of other students, Like if we we've seen that when

0:07:18.120 --> 0:07:22.480
<v Speaker 1>these things start to happen, statistically, that means like we're

0:07:22.480 --> 0:07:25.679
<v Speaker 1>heading towards failure. It would mean that It would also

0:07:25.760 --> 0:07:28.040
<v Speaker 1>mean that on a larger level, you might see that

0:07:28.120 --> 0:07:31.120
<v Speaker 1>an entire classroom is having some issues, which would tell

0:07:31.160 --> 0:07:33.800
<v Speaker 1>the teacher I need to change my approach. I I

0:07:34.240 --> 0:07:37.040
<v Speaker 1>you know this, this concept that I've tried to teach.

0:07:37.080 --> 0:07:39.880
<v Speaker 1>Obviously this has not worked out. So I need to

0:07:39.920 --> 0:07:42.720
<v Speaker 1>find a new way of getting this across in a

0:07:42.760 --> 0:07:44.880
<v Speaker 1>way that makes sense to my students, or perhaps help

0:07:44.920 --> 0:07:47.720
<v Speaker 1>an entire school system figure out how to how to

0:07:48.520 --> 0:07:51.960
<v Speaker 1>grade and test better. Exactly. Yeah, you know how if

0:07:52.000 --> 0:07:55.000
<v Speaker 1>you take a survey um and your survey has just

0:07:55.040 --> 0:07:57.440
<v Speaker 1>a hundred people in it, well, it's probably not very

0:07:57.440 --> 0:08:00.240
<v Speaker 1>representative of the entire population, but the big but you

0:08:00.280 --> 0:08:03.080
<v Speaker 1>can play family feud, right if you have a thousand people,

0:08:03.120 --> 0:08:06.400
<v Speaker 1>it's better. If you keep increasing your sample size, your

0:08:06.440 --> 0:08:11.240
<v Speaker 1>statistics become better and better at representing stronger trends that

0:08:11.280 --> 0:08:13.800
<v Speaker 1>you want to look for. And this is the same

0:08:13.840 --> 0:08:16.880
<v Speaker 1>thing you'd see. It's why big data is great. Right,

0:08:16.560 --> 0:08:20.720
<v Speaker 1>you're increasing your sample size, right, Yes, As you increase

0:08:20.760 --> 0:08:23.440
<v Speaker 1>that sample size, then you can actually start to recognize

0:08:23.480 --> 0:08:25.760
<v Speaker 1>things that are truly trends and not just a one off.

0:08:26.400 --> 0:08:29.520
<v Speaker 1>They're less likely to be anomalies. Yes, exactly, And this

0:08:29.600 --> 0:08:32.920
<v Speaker 1>has tracked a lot in especially in consumer segments. I mean,

0:08:32.920 --> 0:08:34.880
<v Speaker 1>you know, like every time you buy something on Amazon,

0:08:34.960 --> 0:08:37.360
<v Speaker 1>it it collects a little group of other things that

0:08:37.440 --> 0:08:40.000
<v Speaker 1>people who have bought that thing have also bought in

0:08:40.080 --> 0:08:42.600
<v Speaker 1>case you want to buy that thing, um and and

0:08:42.720 --> 0:08:45.840
<v Speaker 1>can be very useful not just for not just for

0:08:45.880 --> 0:08:49.160
<v Speaker 1>that basic you know, like we want to sell more stuff,

0:08:49.440 --> 0:08:53.560
<v Speaker 1>but uh yeah, I mean with Amazon sometimes it's creepy

0:08:53.679 --> 0:08:57.400
<v Speaker 1>how how on spot it is? Or it's creepy how

0:08:57.440 --> 0:09:00.000
<v Speaker 1>not on spot it is, and you wonder if they're

0:09:00.120 --> 0:09:02.680
<v Speaker 1>something you don't know about yourself. My my favorite, my

0:09:02.760 --> 0:09:05.440
<v Speaker 1>favorite is that if you start to find the products

0:09:05.440 --> 0:09:08.840
<v Speaker 1>that are on Amazon that have the ridiculous reviews, like

0:09:08.920 --> 0:09:11.480
<v Speaker 1>the ones that are just you know, it's the product

0:09:11.559 --> 0:09:19.000
<v Speaker 1>itself is absurd for or the gallon of milk there

0:09:19.120 --> 0:09:21.840
<v Speaker 1>they're there are examples of products that are on Amazon

0:09:22.000 --> 0:09:26.720
<v Speaker 1>that people have written like novellas in review and the

0:09:26.800 --> 0:09:30.360
<v Speaker 1>novellas are hilarious. Like you get to a point where

0:09:30.360 --> 0:09:32.760
<v Speaker 1>it's like it's like a soap opera that opens up

0:09:32.800 --> 0:09:34.320
<v Speaker 1>and it is this whole thing, and then the last

0:09:34.360 --> 0:09:37.720
<v Speaker 1>line will be some throwaway review of the product or whatever,

0:09:38.120 --> 0:09:41.040
<v Speaker 1>and it's it's I mean, I love it for the absurdity.

0:09:41.240 --> 0:09:44.800
<v Speaker 1>What's interesting is that the related items always tend to

0:09:44.800 --> 0:09:48.120
<v Speaker 1>be the other ones that have similar ridiculous reviews, which

0:09:48.160 --> 0:09:51.439
<v Speaker 1>means that even then always tend to include horse mask.

0:09:51.800 --> 0:09:56.680
<v Speaker 1>Horsehead mask. You know, yeah, people like you bought this thing, right,

0:09:56.720 --> 0:10:01.560
<v Speaker 1>but at any rate, the I wonder if you click

0:10:01.600 --> 0:10:04.040
<v Speaker 1>on that, if you also get like the Godfather as

0:10:04.040 --> 0:10:08.320
<v Speaker 1>a suggestion, you've seen horsehead mask? Right, I've seen horsehead mask. Yes,

0:10:08.440 --> 0:10:11.480
<v Speaker 1>I've used the internet, so yes, I have seen it.

0:10:12.080 --> 0:10:15.520
<v Speaker 1>But yeah, that is another example. Right. Amazon uses this

0:10:15.640 --> 0:10:20.480
<v Speaker 1>in order to make more sales, because the thought is,

0:10:20.520 --> 0:10:23.000
<v Speaker 1>if you are interested in this one particular kind of product,

0:10:23.200 --> 0:10:25.719
<v Speaker 1>then you're probably interested in these other products, especially if

0:10:25.720 --> 0:10:29.000
<v Speaker 1>there's a history of other people having bought those together,

0:10:29.240 --> 0:10:31.720
<v Speaker 1>or at least you know, I bought them at some

0:10:31.800 --> 0:10:35.120
<v Speaker 1>other point in their and their history. And then you

0:10:35.120 --> 0:10:37.760
<v Speaker 1>can you can start pouring that information not just into

0:10:38.160 --> 0:10:41.640
<v Speaker 1>what are people going to buy next, but into why

0:10:41.640 --> 0:10:44.720
<v Speaker 1>are people buying this thing, and start tracking things like

0:10:44.800 --> 0:10:48.800
<v Speaker 1>flu outbreaks. Right, Okay, so this is this is This

0:10:48.880 --> 0:10:50.880
<v Speaker 1>is one of those things that I thought was really,

0:10:51.720 --> 0:10:54.440
<v Speaker 1>uh an interesting example of using big data in a

0:10:54.480 --> 0:10:57.960
<v Speaker 1>way that you wouldn't necessarily first think about. This was

0:10:58.000 --> 0:11:01.080
<v Speaker 1>something that Google wrote up a white paper on there's

0:11:01.120 --> 0:11:06.040
<v Speaker 1>actually a full paper about Google using information to detect

0:11:06.160 --> 0:11:10.000
<v Speaker 1>influenza outbreaks. And the way that they did it was

0:11:10.040 --> 0:11:14.280
<v Speaker 1>they essentially found search queries that people were putting in

0:11:14.840 --> 0:11:18.800
<v Speaker 1>that indicated that someone was feeling sick, especially things like

0:11:18.960 --> 0:11:22.920
<v Speaker 1>various symptoms and stuff. And then by relating that information

0:11:23.000 --> 0:11:27.440
<v Speaker 1>to specific regions in the world and seeing multiple people

0:11:27.640 --> 0:11:32.400
<v Speaker 1>requesting this information from say a particular city, they could say,

0:11:32.440 --> 0:11:35.400
<v Speaker 1>this looks like this is an outbreak of the flu.

0:11:35.679 --> 0:11:37.960
<v Speaker 1>They said that there was a reporting lag of about

0:11:38.120 --> 0:11:42.640
<v Speaker 1>one day, so a day after a certain you know,

0:11:43.320 --> 0:11:46.120
<v Speaker 1>a large enough sample size of people are looking for this,

0:11:46.400 --> 0:11:50.400
<v Speaker 1>Google could say there's a potential flu outbreak in this

0:11:50.880 --> 0:11:54.320
<v Speaker 1>very specific area of the world. Maybe we need to

0:11:54.520 --> 0:11:57.680
<v Speaker 1>you know, by a learning something like the Center for

0:11:57.760 --> 0:12:00.560
<v Speaker 1>Disease Control the CDC, They could say, we need to

0:12:00.600 --> 0:12:03.679
<v Speaker 1>head this off before it becomes some sort of pandemic, right,

0:12:03.800 --> 0:12:06.280
<v Speaker 1>which basically says to me that that that because of

0:12:06.320 --> 0:12:08.640
<v Speaker 1>Google and big data we're going to be able to

0:12:08.800 --> 0:12:13.880
<v Speaker 1>uh prevent the inevitable zombie. But exactly the scary thing

0:12:13.920 --> 0:12:17.320
<v Speaker 1>is will probably never even find out about it, right

0:12:18.280 --> 0:12:21.680
<v Speaker 1>because of stuff like this. They'll get to the government

0:12:21.800 --> 0:12:25.320
<v Speaker 1>before the public knows. No, there there has to be

0:12:25.520 --> 0:12:28.960
<v Speaker 1>enough queries of dead uncle trying to eat my face

0:12:30.040 --> 0:12:32.560
<v Speaker 1>for at least a few people to take notice. If

0:12:32.640 --> 0:12:37.400
<v Speaker 1>you happen to be checking Google trends on dead uncle

0:12:37.400 --> 0:12:40.720
<v Speaker 1>turned on Twitter long before. Yeah, you know, just the

0:12:40.760 --> 0:12:44.000
<v Speaker 1>tumbler alone would be But this is funny. Actually, Google

0:12:44.040 --> 0:12:48.640
<v Speaker 1>trends is a great, really simple, really straightforward example of

0:12:48.800 --> 0:12:53.079
<v Speaker 1>how big data is really interesting, like looking at um

0:12:53.120 --> 0:12:56.360
<v Speaker 1>the popularity of a search term over a period of time.

0:12:56.640 --> 0:12:59.040
<v Speaker 1>I mean, it's so cool. You can get lost in

0:12:59.080 --> 0:13:01.920
<v Speaker 1>these bore to see is watching the spikes when you

0:13:01.960 --> 0:13:05.000
<v Speaker 1>know different movies or books or cultural events happened. Yeah,

0:13:05.360 --> 0:13:08.640
<v Speaker 1>when Kanye West does something embarrassing and immediately shoots to

0:13:08.679 --> 0:13:11.760
<v Speaker 1>the top. One I love is when you can look

0:13:11.800 --> 0:13:15.559
<v Speaker 1>at the historical ones because they have every book ever written, right,

0:13:15.920 --> 0:13:18.600
<v Speaker 1>They've scanned that in and then they can you say,

0:13:18.840 --> 0:13:22.079
<v Speaker 1>historical ones. I just said, they're thinking like bad play.

0:13:22.240 --> 0:13:27.319
<v Speaker 1>My American cousin just think like Google trends throughout the centurial.

0:13:27.480 --> 0:13:29.360
<v Speaker 1>You can look at you can look at like back

0:13:29.400 --> 0:13:32.400
<v Speaker 1>to eighteen hundred in the books they've scanned, right, um,

0:13:32.440 --> 0:13:35.679
<v Speaker 1>and you can you can chart changes in spelling. Right.

0:13:35.880 --> 0:13:39.840
<v Speaker 1>You can compare spelling a word one way versus spelling

0:13:39.840 --> 0:13:42.200
<v Speaker 1>it another way, and watch one go down with while

0:13:42.240 --> 0:13:44.600
<v Speaker 1>the other goes up one one data artists that I'm

0:13:44.600 --> 0:13:46.760
<v Speaker 1>going to talk about in a little bit created a

0:13:46.840 --> 0:13:51.960
<v Speaker 1>graph of the use of hope versus despair and in

0:13:52.040 --> 0:13:55.240
<v Speaker 1>recent years, just just watching the times when when despair

0:13:55.400 --> 0:13:59.079
<v Speaker 1>overlapped hope. Interesting, beautiful. I can't wait to talk about that. Well,

0:13:59.120 --> 0:14:02.959
<v Speaker 1>and and you know, let's start with other applications. Well,

0:14:03.000 --> 0:14:04.360
<v Speaker 1>I was going to mention that, you know, when you

0:14:04.360 --> 0:14:08.760
<v Speaker 1>think about Google, Google's mission statement is all about big data,

0:14:08.800 --> 0:14:14.440
<v Speaker 1>because they're about organizing the universe's information, which when you

0:14:14.480 --> 0:14:16.560
<v Speaker 1>think about that, you know they want to index and

0:14:16.760 --> 0:14:21.200
<v Speaker 1>organize all the information everywhere that we ever encounter. That

0:14:21.400 --> 0:14:26.720
<v Speaker 1>is big data. That's that's as clear as you can get. Yeah,

0:14:26.760 --> 0:14:29.360
<v Speaker 1>And so the fact that they're able to demonstrate the

0:14:29.480 --> 0:14:34.200
<v Speaker 1>usefulness of this proves the the the the utility of

0:14:34.240 --> 0:14:36.960
<v Speaker 1>their company, right because otherwise if if all they did

0:14:37.040 --> 0:14:41.360
<v Speaker 1>was index this and there was no uh, actually useful, Yeah,

0:14:41.400 --> 0:14:43.240
<v Speaker 1>then you'd be like, well, this company is just not

0:14:43.240 --> 0:14:47.280
<v Speaker 1>going to stick around. So besides education or predicting a

0:14:47.320 --> 0:14:51.320
<v Speaker 1>flu outbreak, you can actually use it to monitor cybersecurity

0:14:51.360 --> 0:14:54.040
<v Speaker 1>and check a network's health. So if you see a

0:14:54.080 --> 0:14:56.160
<v Speaker 1>spike in network activity, you can check it out and

0:14:56.160 --> 0:14:58.400
<v Speaker 1>make sure that it's not a d d o S attack,

0:14:58.400 --> 0:15:00.760
<v Speaker 1>a distributed denial of service attack, so that a hacker

0:15:01.000 --> 0:15:05.560
<v Speaker 1>hasn't said, hey, this website has raised my ire, I

0:15:05.600 --> 0:15:09.560
<v Speaker 1>shall direct my zombie computers to attack it. UM. You know,

0:15:09.840 --> 0:15:11.800
<v Speaker 1>being able to see that kind of stuff and respond

0:15:11.800 --> 0:15:13.600
<v Speaker 1>to it in real time is really useful. And you

0:15:13.680 --> 0:15:16.960
<v Speaker 1>obviously need to have a robust, robust system to deal

0:15:17.040 --> 0:15:19.320
<v Speaker 1>with a lot of information because it may just be

0:15:19.560 --> 0:15:24.080
<v Speaker 1>that it's a heavy amount of traffic for completely legitimate reasons.

0:15:24.520 --> 0:15:28.480
<v Speaker 1>So that's another implementation of big data. UH. It's also

0:15:28.560 --> 0:15:30.280
<v Speaker 1>part of what they're talking about when they talk about

0:15:30.320 --> 0:15:34.280
<v Speaker 1>the smart grid UM for electrical electrical companies to be

0:15:34.320 --> 0:15:35.880
<v Speaker 1>able to get energy to the right places at the

0:15:35.960 --> 0:15:39.840
<v Speaker 1>right times and prevent brownouts and blackouts. UM system overloads.

0:15:40.160 --> 0:15:43.200
<v Speaker 1>People have talked about using UM tracking the number of

0:15:43.280 --> 0:15:47.440
<v Speaker 1>ups packages sent to track how well the economy is doing. Interesting. Yeah,

0:15:47.440 --> 0:15:49.480
<v Speaker 1>I've definitely heard about the smart grid stuff. I mean,

0:15:49.520 --> 0:15:52.280
<v Speaker 1>there are a lot of utility companies that are running

0:15:52.360 --> 0:15:56.680
<v Speaker 1>at close to full capacity, and being able to to

0:15:56.760 --> 0:15:59.680
<v Speaker 1>see where a demand is going to be at any

0:15:59.720 --> 0:16:04.120
<v Speaker 1>given time means that you are reducing the demand on

0:16:04.240 --> 0:16:07.320
<v Speaker 1>any individual power company because they can all work in

0:16:07.400 --> 0:16:11.520
<v Speaker 1>concert together and that way you don't have these you

0:16:11.640 --> 0:16:16.720
<v Speaker 1>reduce the possibility of a brown out or a blackout. Um. So,

0:16:16.760 --> 0:16:18.800
<v Speaker 1>I mean that's clearly important, but that is a lot

0:16:18.800 --> 0:16:21.960
<v Speaker 1>of information. You're constantly getting feedback from all the different

0:16:22.800 --> 0:16:25.760
<v Speaker 1>meters essentially smart meters, and even if you get down

0:16:25.760 --> 0:16:28.440
<v Speaker 1>to it, you can have smart appliances that are very

0:16:28.480 --> 0:16:32.120
<v Speaker 1>specifically giving both you and the network more information about

0:16:32.160 --> 0:16:36.440
<v Speaker 1>power consumption. So that's all also important. Weather forecasting another

0:16:36.480 --> 0:16:40.920
<v Speaker 1>important part. Talk about gathering all the information from weather

0:16:41.240 --> 0:16:45.480
<v Speaker 1>sensors around the world and looking at the information and

0:16:45.560 --> 0:16:50.000
<v Speaker 1>detecting patterns because our forecasting abilities, don't know if you noticed,

0:16:50.800 --> 0:16:54.600
<v Speaker 1>not so great. Sometimes it's hilarious because when you think

0:16:54.640 --> 0:16:59.480
<v Speaker 1>about it, we have so much power and technology devoted

0:16:59.560 --> 0:17:03.120
<v Speaker 1>to produce acting the weather and sometimes we're still so

0:17:04.480 --> 0:17:07.600
<v Speaker 1>it's such a complex night. The other night, the hourly

0:17:07.640 --> 0:17:11.800
<v Speaker 1>weather on online was telling me zero percent chance of precipitation.

0:17:12.080 --> 0:17:15.120
<v Speaker 1>We had a storm that was knocking limbs out of trees.

0:17:15.280 --> 0:17:18.920
<v Speaker 1>It was like you couldn't see ten feet for the rain. Yeah. Yeah,

0:17:19.000 --> 0:17:21.600
<v Speaker 1>And it's hilarious when you take a look and say, wait,

0:17:21.720 --> 0:17:25.879
<v Speaker 1>they forecast ten days out, How how reliable is that

0:17:25.960 --> 0:17:31.480
<v Speaker 1>tenth day? There wrong about what's happening right now. Well,

0:17:31.680 --> 0:17:35.600
<v Speaker 1>and beyond that you have things like uh fraud detection,

0:17:35.920 --> 0:17:40.800
<v Speaker 1>and also governments can use big data for tax collection purposes,

0:17:41.119 --> 0:17:43.840
<v Speaker 1>looking at trends in taxes and the way that people

0:17:43.880 --> 0:17:47.000
<v Speaker 1>are paying taxes, and maybe comparing the way people are

0:17:47.000 --> 0:17:50.359
<v Speaker 1>paying taxes versus what they supposedly, oh in Texas, and

0:17:50.400 --> 0:17:53.080
<v Speaker 1>finding out if there are big gaps there, because right now,

0:17:53.119 --> 0:17:55.560
<v Speaker 1>the way it tends to work is it's after the fact,

0:17:55.720 --> 0:17:58.399
<v Speaker 1>right people file their taxes, and then a certain number

0:17:58.400 --> 0:18:03.719
<v Speaker 1>of those taxes tax reports are picked to be looked

0:18:03.760 --> 0:18:06.720
<v Speaker 1>over in more careful detail, and it's only if they

0:18:06.760 --> 0:18:12.000
<v Speaker 1>start to detect a pretty uh significant pattern that they'll

0:18:12.040 --> 0:18:16.320
<v Speaker 1>look at any individual's taxes specifically, unless you're part of

0:18:16.359 --> 0:18:20.120
<v Speaker 1>some political controversy which we won't get into, but this

0:18:20.320 --> 0:18:22.560
<v Speaker 1>big data thing would allow you to take a look

0:18:22.600 --> 0:18:27.040
<v Speaker 1>at a much larger scale and focus in on particular problems,

0:18:28.160 --> 0:18:31.960
<v Speaker 1>as opposed to just hoping that the sheet of reports

0:18:31.960 --> 0:18:35.120
<v Speaker 1>that you just pulled from the printer includes people who

0:18:35.200 --> 0:18:39.720
<v Speaker 1>are not paying their fair share. So I've got a question. Yeah,

0:18:39.880 --> 0:18:45.359
<v Speaker 1>now that we're talking about the government using big data

0:18:45.440 --> 0:18:49.720
<v Speaker 1>to predict near duells, and uh, I think I see

0:18:49.720 --> 0:18:53.240
<v Speaker 1>where this is going. Yeah, y'all seen that movie Minority Report,

0:18:53.400 --> 0:18:57.159
<v Speaker 1>documentary Minority Report. Well, okay, so I want to explain

0:18:57.200 --> 0:18:59.359
<v Speaker 1>a little bit. In that movie, they've got a they've

0:18:59.359 --> 0:19:04.119
<v Speaker 1>got a division of law enforcement called pre crime. Where

0:19:04.160 --> 0:19:07.000
<v Speaker 1>they are now in the movie, it's kind of they've

0:19:07.000 --> 0:19:10.920
<v Speaker 1>got these like psychic prelugs. But let's just say replace

0:19:11.040 --> 0:19:15.720
<v Speaker 1>the psychics with really, really really powerful computers, right that

0:19:15.800 --> 0:19:21.000
<v Speaker 1>look at trends pattern com make extremely accurate predictions about

0:19:21.000 --> 0:19:26.159
<v Speaker 1>what's about to happen. I can definitely foresee a future

0:19:26.240 --> 0:19:30.320
<v Speaker 1>where it might not be all that impossible for computers

0:19:30.359 --> 0:19:33.639
<v Speaker 1>to predict when somebody is very likely to commit a crime?

0:19:35.280 --> 0:19:37.080
<v Speaker 1>Could I can? I can tell you. Let me give

0:19:37.080 --> 0:19:40.240
<v Speaker 1>you a little more, a little more insight. From my perspective,

0:19:40.280 --> 0:19:41.760
<v Speaker 1>I don't know that we're going to get to a

0:19:41.760 --> 0:19:43.680
<v Speaker 1>point where we're going to be able to predict when

0:19:44.240 --> 0:19:47.119
<v Speaker 1>a specific individual is likely to commit a crime. We

0:19:47.160 --> 0:19:49.840
<v Speaker 1>can definitely get a little more probabilistic, you know, sit

0:19:49.880 --> 0:19:52.600
<v Speaker 1>there and say, what is the probability of any person

0:19:52.680 --> 0:19:56.480
<v Speaker 1>at any given time to commit a crime? Um, there

0:19:56.480 --> 0:19:58.640
<v Speaker 1>are some things that we can say. For example, there

0:19:58.640 --> 0:20:01.120
<v Speaker 1>are law enforcement agencies there that are now using big

0:20:01.200 --> 0:20:05.560
<v Speaker 1>data in order to predict crime trends. So not a

0:20:05.600 --> 0:20:09.800
<v Speaker 1>specific person not saying, you know, yeah yeah ne'er dowell,

0:20:09.960 --> 0:20:13.919
<v Speaker 1>Johnny today is gonna knock over the liquor store. They

0:20:13.920 --> 0:20:15.800
<v Speaker 1>are not doing that. What they are doing is saying,

0:20:16.720 --> 0:20:19.080
<v Speaker 1>looking at this big data, I'm seeing this trend where

0:20:19.080 --> 0:20:22.920
<v Speaker 1>this particular part of town tends to be a target

0:20:23.000 --> 0:20:26.680
<v Speaker 1>for vandalization. In burglary, let's say those those are two

0:20:26.720 --> 0:20:29.840
<v Speaker 1>crimes that often tons of factors, you know, based on

0:20:29.960 --> 0:20:34.240
<v Speaker 1>weather or right. Apparently things like burglaries, um kind of

0:20:34.240 --> 0:20:37.040
<v Speaker 1>go in rashes. Yeah. That's another thing is that if

0:20:37.040 --> 0:20:41.280
<v Speaker 1>a place is hit by burglars, then there is there

0:20:41.320 --> 0:20:44.760
<v Speaker 1>tends to be a increased risk of the same thing

0:20:44.760 --> 0:20:48.040
<v Speaker 1>happening in and in the sanginal area. Yeah, so if

0:20:48.080 --> 0:20:52.160
<v Speaker 1>there's a successful burglary attempt in one particular home, for example,

0:20:52.200 --> 0:20:55.960
<v Speaker 1>other homes in that neighborhood could be also um prone

0:20:56.040 --> 0:20:59.399
<v Speaker 1>to being hit by burglars. So that's one example that

0:20:59.480 --> 0:21:01.280
<v Speaker 1>law enforced we can use. They can use it as

0:21:01.320 --> 0:21:04.600
<v Speaker 1>a reactionary thing, saying all right, well, because we know this,

0:21:04.880 --> 0:21:07.919
<v Speaker 1>we should end up increasing patrols in this area for

0:21:07.960 --> 0:21:10.520
<v Speaker 1>the time being so that we can discourage any other

0:21:10.600 --> 0:21:13.679
<v Speaker 1>crime or catch the criminals before they're able to hit

0:21:13.720 --> 0:21:17.680
<v Speaker 1>another another house or another business. Another thing is that

0:21:18.359 --> 0:21:21.159
<v Speaker 1>for crimes like burglary and vandalism, those are crimes that

0:21:21.280 --> 0:21:26.240
<v Speaker 1>generally go down when you increase patrols. They they are

0:21:26.760 --> 0:21:31.280
<v Speaker 1>considered low intensity but high frequency crimes. So if you

0:21:31.359 --> 0:21:35.960
<v Speaker 1>were to adjust patrols so that there is a more

0:21:36.119 --> 0:21:40.360
<v Speaker 1>frequent patrol of police through that area, you reduce the

0:21:40.440 --> 0:21:43.920
<v Speaker 1>likelihood of those crimes being committed. And by using big

0:21:44.000 --> 0:21:47.159
<v Speaker 1>data and and and really analyzing where these crimes are

0:21:47.160 --> 0:21:50.520
<v Speaker 1>taking place within a city, you can redraw patrol routes

0:21:50.880 --> 0:21:55.720
<v Speaker 1>so that police are taking the most efficient patrol they can,

0:21:55.800 --> 0:21:58.119
<v Speaker 1>so they're not having to patrol an area that's way

0:21:58.240 --> 0:22:00.480
<v Speaker 1>larger than what they're capable of doing in a in

0:22:00.520 --> 0:22:04.680
<v Speaker 1>a given shift, and you also will hit the areas

0:22:04.720 --> 0:22:08.359
<v Speaker 1>that are most likely to be targeted and help reduce crime.

0:22:08.400 --> 0:22:10.439
<v Speaker 1>That way, you're preventing it from happening. So you're not

0:22:10.480 --> 0:22:12.879
<v Speaker 1>going out and arresting someone for a crime they haven't

0:22:12.920 --> 0:22:17.320
<v Speaker 1>committed yet. That's not the same thing at all. But

0:22:17.520 --> 0:22:21.480
<v Speaker 1>you can help attack those sort of crimes, things like

0:22:21.720 --> 0:22:26.200
<v Speaker 1>murder much less you know, much less prone to any

0:22:26.240 --> 0:22:30.280
<v Speaker 1>sort of pattern that you can predict. It's that's something

0:22:30.280 --> 0:22:34.280
<v Speaker 1>that's a high intensity but low frequency crime as opposed

0:22:34.320 --> 0:22:38.160
<v Speaker 1>to low intensity, high frequency like vandalism and burglary. So

0:22:38.760 --> 0:22:41.160
<v Speaker 1>they don't tend to take that kind of crime into

0:22:41.160 --> 0:22:44.439
<v Speaker 1>consideration when they're looking at this big data in this sense,

0:22:44.840 --> 0:22:48.160
<v Speaker 1>other than to uh perhaps say that, you know, this

0:22:48.280 --> 0:22:51.199
<v Speaker 1>particular area of town needs to have a stronger police

0:22:51.240 --> 0:22:53.399
<v Speaker 1>presence in order to help can back to what we

0:22:53.480 --> 0:22:57.080
<v Speaker 1>were talking about with sample size, Yeah, and uh, Like

0:22:57.119 --> 0:23:00.919
<v Speaker 1>in Santa Cruz, California, police use this approach to and

0:23:01.359 --> 0:23:03.840
<v Speaker 1>identify homes that were more likely to be hit by

0:23:03.880 --> 0:23:06.919
<v Speaker 1>a burglar so they could redraw their patrol roots to

0:23:07.040 --> 0:23:10.719
<v Speaker 1>take that into consideration and prevent that from happening. Now,

0:23:11.520 --> 0:23:14.919
<v Speaker 1>when we do talk about criminals and the likelihood of

0:23:14.960 --> 0:23:19.080
<v Speaker 1>someone to to commit a particular crime, there is some

0:23:19.160 --> 0:23:23.159
<v Speaker 1>statistical evidence to suggest that people who are who have

0:23:23.240 --> 0:23:27.639
<v Speaker 1>committed a crime are more likely to commit another crime

0:23:27.720 --> 0:23:30.480
<v Speaker 1>than someone who has never committed a crime like that,

0:23:31.000 --> 0:23:34.960
<v Speaker 1>there's like a fort recidivism rate. But part of that

0:23:35.200 --> 0:23:38.280
<v Speaker 1>is due to the way that we handle criminals and

0:23:38.320 --> 0:23:41.639
<v Speaker 1>how we try to reintroduce criminals to society. So it

0:23:41.720 --> 0:23:45.919
<v Speaker 1>may not be that people just have this statistical likelihood

0:23:45.920 --> 0:23:48.600
<v Speaker 1>of committing a crime again once they've already done. So

0:23:49.119 --> 0:23:52.159
<v Speaker 1>some of its institutionalize, right, It's a social construct, not

0:23:52.240 --> 0:23:56.360
<v Speaker 1>a personal propensity. Right, So therefore that wouldn't it wouldn't

0:23:56.359 --> 0:23:58.600
<v Speaker 1>be the issue what the cause was. It would just

0:23:58.640 --> 0:24:01.280
<v Speaker 1>be like that, you see it. Well, no, there's an

0:24:01.280 --> 0:24:02.919
<v Speaker 1>issue about what the cause was, because if you can

0:24:02.920 --> 0:24:07.800
<v Speaker 1>treat the cause, then you remove the I'm saying that

0:24:07.840 --> 0:24:12.639
<v Speaker 1>criminals matter, and Joe, you're just throwing them away. I

0:24:12.160 --> 0:24:14.679
<v Speaker 1>think you know, you know, I'm talking about what the

0:24:14.760 --> 0:24:20.840
<v Speaker 1>cause was. Wouldn't affect how how how well? Just like

0:24:22.280 --> 0:24:24.240
<v Speaker 1>I was, I was teasing Joe, but we were very

0:24:24.240 --> 0:24:26.800
<v Speaker 1>clearly talking about two different things of the same problem. Yeah,

0:24:26.800 --> 0:24:28.480
<v Speaker 1>And and what is scary about all of this is

0:24:28.520 --> 0:24:30.800
<v Speaker 1>the thought that someone could say, well, all of all

0:24:30.840 --> 0:24:34.160
<v Speaker 1>of this is is reactive in looking for this kind

0:24:34.200 --> 0:24:36.880
<v Speaker 1>of crime, and why can't we be proactive and well

0:24:36.960 --> 0:24:41.119
<v Speaker 1>and getting into that scary minority. And that's and and

0:24:41.160 --> 0:24:43.600
<v Speaker 1>I think, I mean, I'm not saying that we'll never

0:24:43.640 --> 0:24:48.880
<v Speaker 1>get to a point where we where where statistical models

0:24:48.920 --> 0:24:51.760
<v Speaker 1>won't give, at least again, a probabilistic approach of how

0:24:51.800 --> 0:24:54.639
<v Speaker 1>likely is person A to commit a crime versus person B.

0:24:54.800 --> 0:24:57.399
<v Speaker 1>And you take everything into account, and you compare that

0:24:57.440 --> 0:25:00.399
<v Speaker 1>against all the information you've ever gathered and come up

0:25:00.440 --> 0:25:03.680
<v Speaker 1>with a probability that's probably gonna happen at some point.

0:25:03.680 --> 0:25:05.439
<v Speaker 1>But I don't think that we're ever going to act

0:25:05.560 --> 0:25:08.520
<v Speaker 1>on that. I'm just saying that, I think if a

0:25:08.560 --> 0:25:12.160
<v Speaker 1>computer can predict that Jonathan Strickland is likely to buy

0:25:12.160 --> 0:25:16.200
<v Speaker 1>a horsehead mask, it can also probably predict that Jonathan

0:25:16.240 --> 0:25:20.280
<v Speaker 1>Strickland is more likely than the average person to Robert

0:25:20.320 --> 0:25:25.800
<v Speaker 1>Jimmy Johns. But what was what horrifying how accurate you are?

0:25:26.760 --> 0:25:28.840
<v Speaker 1>But there's a Jimmy Johns with then walking distance of

0:25:28.880 --> 0:25:31.560
<v Speaker 1>this office. What we should always keep in mind is

0:25:31.760 --> 0:25:34.760
<v Speaker 1>even if computers are that good, we shouldn't ever let

0:25:34.800 --> 0:25:38.920
<v Speaker 1>that prejudice our approach to Jonathan Strickland, because he may

0:25:39.080 --> 0:25:41.840
<v Speaker 1>very well not buy a horsehead mask, and he may

0:25:41.960 --> 0:25:45.480
<v Speaker 1>very well not Robert Jimmy Johns. That's true. That was

0:25:45.560 --> 0:25:50.119
<v Speaker 1>my point. Okay. So yeah, So while while Minority Report

0:25:50.160 --> 0:25:53.320
<v Speaker 1>definitely had this sort of scary science fictionary approach to

0:25:53.960 --> 0:25:58.159
<v Speaker 1>you know, uh, stopping people arresting people for crimes they

0:25:58.160 --> 0:26:00.960
<v Speaker 1>had not yet committed, but we're going to come it. Uh.

0:26:01.000 --> 0:26:02.879
<v Speaker 1>And and they, you know, they had the benefit of

0:26:02.960 --> 0:26:07.480
<v Speaker 1>having psychics who are apparently infallible, except they're not. When

0:26:07.480 --> 0:26:12.200
<v Speaker 1>you watch the movie spoiler. Yeah for a movie that's

0:26:12.240 --> 0:26:18.880
<v Speaker 1>that old. I'm sorry anyone who's listened to this. Yeah.

0:26:19.000 --> 0:26:22.399
<v Speaker 1>So anyway, Uh, I don't think we're ever gonna I

0:26:22.400 --> 0:26:23.600
<v Speaker 1>don't think we're ever going to get to a point

0:26:23.640 --> 0:26:26.399
<v Speaker 1>where big day is when pre crime comes up. Vote

0:26:26.440 --> 0:26:30.240
<v Speaker 1>no right to your representative, say no to pre crime things.

0:26:30.240 --> 0:26:32.440
<v Speaker 1>You got to vote no one. Vote no one pre

0:26:32.520 --> 0:26:35.040
<v Speaker 1>crime and vote no on giving artificial intelligence the right

0:26:35.080 --> 0:26:36.720
<v Speaker 1>to vote. Those are the two things you have to

0:26:36.760 --> 0:26:40.439
<v Speaker 1>make sure deals. Yeah, those are two big strikes, Lauren,

0:26:40.480 --> 0:26:43.439
<v Speaker 1>can you tell us something happy? I can? Well, okay, So,

0:26:43.520 --> 0:26:46.760
<v Speaker 1>so part of what part of what is scary about

0:26:46.800 --> 0:26:48.560
<v Speaker 1>all this data is that it's really hard for us

0:26:48.640 --> 0:26:53.600
<v Speaker 1>to understand what's out there, what we're generating, what it's

0:26:53.640 --> 0:26:55.880
<v Speaker 1>being used for, and what all that looks like. I mean,

0:26:55.880 --> 0:26:57.560
<v Speaker 1>because you know, like we were talking about like at

0:26:57.600 --> 0:27:01.760
<v Speaker 1>a certain point, we're like, oh, sure, a quadrille, what's

0:27:01.800 --> 0:27:05.240
<v Speaker 1>what's the number. It's a lot, Yeah, it's it's a

0:27:05.280 --> 0:27:09.760
<v Speaker 1>it's a one with fifteen zeros. And that's a lot

0:27:09.760 --> 0:27:14.119
<v Speaker 1>of zeros. And and there are a group of data

0:27:14.200 --> 0:27:18.399
<v Speaker 1>artists out there. Are you giggling at more than a

0:27:18.520 --> 0:27:22.520
<v Speaker 1>dozen zeros? That is more than a dozen zero? Let's

0:27:22.600 --> 0:27:26.479
<v Speaker 1>thank you. Let's not let Joe talk anymore. Take the

0:27:26.760 --> 0:27:30.119
<v Speaker 1>X away from Joe. I haven't been touching it. Please

0:27:30.160 --> 0:27:32.399
<v Speaker 1>please go on. Um, they're there are a group of

0:27:32.480 --> 0:27:37.760
<v Speaker 1>data artists out there who are working to to put

0:27:37.800 --> 0:27:40.560
<v Speaker 1>all of this into some kind of meaningful and and

0:27:40.960 --> 0:27:46.200
<v Speaker 1>also culturally meaningful unit that that we can process and

0:27:45.800 --> 0:27:47.960
<v Speaker 1>u And there there's there's one particular fellow by the

0:27:48.040 --> 0:27:50.280
<v Speaker 1>name of Jared Thorpe who used to be the data

0:27:50.359 --> 0:27:53.200
<v Speaker 1>artist in residence at the New York Times and has

0:27:53.240 --> 0:27:59.440
<v Speaker 1>as of I think December or January of UM gone

0:27:59.440 --> 0:28:02.080
<v Speaker 1>off and found did the Office for Creative Research as

0:28:02.119 --> 0:28:06.199
<v Speaker 1>as it's being called a company of his UM and

0:28:06.200 --> 0:28:08.679
<v Speaker 1>and he he posits that that this data art is

0:28:08.680 --> 0:28:11.680
<v Speaker 1>going to help people understand what all of this data

0:28:11.720 --> 0:28:14.640
<v Speaker 1>means and what it's being used for. UM and he's

0:28:14.640 --> 0:28:18.000
<v Speaker 1>got some really interesting just personal projects that he did

0:28:18.000 --> 0:28:20.879
<v Speaker 1>a TED talk that's that's pretty pretty terrific UM or

0:28:20.960 --> 0:28:23.840
<v Speaker 1>you can you can see, you know, he's he's taken

0:28:24.720 --> 0:28:28.240
<v Speaker 1>Twitter data from from people saying good morning and putting

0:28:28.240 --> 0:28:30.760
<v Speaker 1>it into this kind of gorgeous bouncy map of of

0:28:30.840 --> 0:28:33.399
<v Speaker 1>just of just tracking when people are waking up and

0:28:33.480 --> 0:28:38.080
<v Speaker 1>saying good morning to Twitter. He also shows what time

0:28:38.280 --> 0:28:40.560
<v Speaker 1>they say good morning based upon the color of the

0:28:40.600 --> 0:28:43.480
<v Speaker 1>block that appears. So if they say good morning earlier,

0:28:43.720 --> 0:28:47.080
<v Speaker 1>it's a green block, and the later they say good morning,

0:28:47.200 --> 0:28:49.800
<v Speaker 1>it goes into goes into the reds. And so he

0:28:49.880 --> 0:28:52.520
<v Speaker 1>also could show trends that way, like around the world,

0:28:52.640 --> 0:28:55.400
<v Speaker 1>showing trends of when people would say good morning and

0:28:55.800 --> 0:28:58.080
<v Speaker 1>uh in general, Let's say the West Coast wakes up

0:28:58.120 --> 0:29:04.160
<v Speaker 1>at around eleven am and the East coast we're early risers,

0:29:04.680 --> 0:29:08.280
<v Speaker 1>not only because the sun gets to us first, but

0:29:08.400 --> 0:29:11.080
<v Speaker 1>because the west coast is sleeping in. It's all those

0:29:11.120 --> 0:29:16.160
<v Speaker 1>actors who say good morning at two pm. You're an actor,

0:29:16.440 --> 0:29:19.959
<v Speaker 1>that's true, I am, but I get up at you know,

0:29:20.920 --> 0:29:22.800
<v Speaker 1>five in the morning. The trick is you don't say

0:29:22.840 --> 0:29:29.040
<v Speaker 1>good morning, just a grump coffee everyone. That's pretty much

0:29:29.040 --> 0:29:33.720
<v Speaker 1>meet hate everyone? That what what when am I not

0:29:33.800 --> 0:29:37.640
<v Speaker 1>tweeting that I hate everyone? So so that's an interesting

0:29:37.720 --> 0:29:42.280
<v Speaker 1>example of data visualization right right, and and and that's

0:29:42.320 --> 0:29:45.600
<v Speaker 1>that's what they're working for, is that visualization of getting

0:29:45.680 --> 0:29:50.080
<v Speaker 1>something down to a graphic scale where we can go like, oh,

0:29:50.280 --> 0:29:52.600
<v Speaker 1>that's still a way too huge for me to comprehend,

0:29:52.640 --> 0:29:54.520
<v Speaker 1>but at least it looks kind of pretty and I

0:29:54.560 --> 0:29:57.200
<v Speaker 1>get it now. I can totally see that how that

0:29:57.240 --> 0:30:00.360
<v Speaker 1>would help people understand what data means. H um he

0:30:00.640 --> 0:30:03.880
<v Speaker 1>was talking about in one UM one talk that he

0:30:03.920 --> 0:30:08.040
<v Speaker 1>gave it pop tech. I believe about about how people

0:30:08.040 --> 0:30:10.920
<v Speaker 1>have been saying that data is the new oil, and

0:30:10.920 --> 0:30:13.520
<v Speaker 1>and how kind of grandiose and lovely that sounds. For

0:30:13.560 --> 0:30:16.920
<v Speaker 1>a second, because people are thinking like oil, oil is money, money,

0:30:17.040 --> 0:30:23.640
<v Speaker 1>is good. But but but it's how how terrifying that

0:30:23.760 --> 0:30:26.480
<v Speaker 1>is in a certain way because because oil for for

0:30:26.640 --> 0:30:29.720
<v Speaker 1>you know, a very specific example has been a resource

0:30:29.720 --> 0:30:33.560
<v Speaker 1>that has been so misused and is so poisonous and

0:30:35.880 --> 0:30:38.680
<v Speaker 1>terrible global instability and war and and all of this.

0:30:38.880 --> 0:30:42.480
<v Speaker 1>But and that that you know, similarly, this data could

0:30:42.480 --> 0:30:46.680
<v Speaker 1>be used or misused rather for um, you know, not

0:30:46.840 --> 0:30:49.120
<v Speaker 1>very good purposes like like we were talking about. But

0:30:49.160 --> 0:30:52.840
<v Speaker 1>if we you know, if if we use these kind

0:30:52.880 --> 0:30:56.280
<v Speaker 1>of resources, and by resources, I mean people who are

0:30:56.400 --> 0:31:00.920
<v Speaker 1>processing this, um too, get away from all of the

0:31:02.200 --> 0:31:05.160
<v Speaker 1>capitalism that the capitalism is terrible. But but but using

0:31:05.160 --> 0:31:08.240
<v Speaker 1>this data for the greater common good, for some things

0:31:08.280 --> 0:31:12.120
<v Speaker 1>like predicting influenza outbreaks and being able to respond quickly

0:31:12.160 --> 0:31:15.240
<v Speaker 1>before it becomes a pandemic. I mean, clearly you're talking

0:31:15.240 --> 0:31:21.560
<v Speaker 1>about benefiting potentially millions of people. We've seen flu outbreaks

0:31:21.640 --> 0:31:24.960
<v Speaker 1>affect millions of people, and if you're able to respond

0:31:25.280 --> 0:31:28.400
<v Speaker 1>fast enough so that you could contain that, then that

0:31:28.400 --> 0:31:32.520
<v Speaker 1>would be an obvious, you know, benefit to everybody. So yeah,

0:31:32.600 --> 0:31:34.560
<v Speaker 1>that's and that's just one example. There's some that are

0:31:34.600 --> 0:31:37.760
<v Speaker 1>more like, well, this makes my life easier. The traffic

0:31:37.800 --> 0:31:40.120
<v Speaker 1>stuff for example. But even even in the bigger scheme,

0:31:40.200 --> 0:31:43.080
<v Speaker 1>if you talk about traffic, that seems kind of trivial.

0:31:43.480 --> 0:31:45.320
<v Speaker 1>You know, all it means that I don't have to

0:31:45.360 --> 0:31:48.040
<v Speaker 1>spend you know, extra time sitting in traffic. That also

0:31:48.080 --> 0:31:50.920
<v Speaker 1>means you're spending less time running a gasoline power at

0:31:50.920 --> 0:31:53.360
<v Speaker 1>engine I mean, unless you have an electric vehicle or whatever.

0:31:53.440 --> 0:31:56.360
<v Speaker 1>But and the stress levels which relate to your to

0:31:56.440 --> 0:31:59.400
<v Speaker 1>your heart rate and health, um, the your productivity at work.

0:31:59.720 --> 0:32:02.360
<v Speaker 1>If you could get everyone in Atlanta to work in

0:32:03.040 --> 0:32:05.240
<v Speaker 1>half an hour less than they currently spend on the road,

0:32:05.720 --> 0:32:07.400
<v Speaker 1>I mean, we would probably just be looking at pictures

0:32:07.400 --> 0:32:11.400
<v Speaker 1>of cats on the internet anyway. But um, but either way, yeah, yeah, no,

0:32:11.520 --> 0:32:15.680
<v Speaker 1>I agree entirely. So there are and you know, I

0:32:15.720 --> 0:32:18.560
<v Speaker 1>like the I like the artistic vision of showing this

0:32:18.640 --> 0:32:22.280
<v Speaker 1>as a way to demonstrate this is just one way

0:32:22.280 --> 0:32:25.160
<v Speaker 1>of looking at the information. And uh, and you know,

0:32:25.280 --> 0:32:27.840
<v Speaker 1>the ways that you've heard of are just the tip

0:32:27.920 --> 0:32:31.040
<v Speaker 1>of the iceberg. We haven't even really explored the full

0:32:32.120 --> 0:32:34.840
<v Speaker 1>extent of what we can use this data for. And

0:32:35.240 --> 0:32:38.560
<v Speaker 1>in some cases it may be truly transformative. We won't

0:32:38.560 --> 0:32:43.120
<v Speaker 1>have to necessarily reinvent or or invent brand new technology

0:32:43.480 --> 0:32:47.160
<v Speaker 1>to make the world a better place. We may have

0:32:47.200 --> 0:32:49.760
<v Speaker 1>all the tools already, it's just in that information we

0:32:49.800 --> 0:32:52.360
<v Speaker 1>have to feagure. Yeah, and then part of that is

0:32:52.400 --> 0:32:56.560
<v Speaker 1>getting people interested in this field and creating a culture

0:32:56.640 --> 0:33:00.480
<v Speaker 1>around it. Yeah. Yeah, agreed. Well that's awesome. I mean

0:33:00.560 --> 0:33:04.680
<v Speaker 1>it's I've only seen one of those, uh those examples.

0:33:04.720 --> 0:33:07.760
<v Speaker 1>I saw the good morning example for Twitter. It was

0:33:07.880 --> 0:33:11.360
<v Speaker 1>a spinning globe and all the little uh pop ups

0:33:11.360 --> 0:33:14.320
<v Speaker 1>of showing where people had said good morning. Um. It

0:33:14.320 --> 0:33:16.960
<v Speaker 1>also made me feel better about my tweets because I don't.

0:33:17.040 --> 0:33:19.680
<v Speaker 1>I don't tend to say good morning. No, I don't know.

0:33:19.800 --> 0:33:22.680
<v Speaker 1>I also don't LaVar Burton does. Is LaVar Burton not

0:33:22.720 --> 0:33:25.040
<v Speaker 1>good enough for you? No, LaVar Burton is good enough

0:33:25.120 --> 0:33:29.000
<v Speaker 1>for me. Um. You know, I I like to take

0:33:30.080 --> 0:33:33.160
<v Speaker 1>He was data. That was terrible. His data's best friend

0:33:33.200 --> 0:33:35.760
<v Speaker 1>was about to say, uh, what was this just the

0:33:35.800 --> 0:33:40.920
<v Speaker 1>worst next generation? Well, yeah, okay, So LaVar Burton can

0:33:40.960 --> 0:33:44.040
<v Speaker 1>say good morning, that's fantastic. I don't. I don't have

0:33:44.160 --> 0:33:48.200
<v Speaker 1>enough followers to say good morning. I do occasionally quote

0:33:48.320 --> 0:33:50.680
<v Speaker 1>half of a song lyric here's a there's a shock

0:33:50.840 --> 0:33:52.600
<v Speaker 1>just to see how many people who follow me know

0:33:52.720 --> 0:33:57.480
<v Speaker 1>what I'm quoting and see. Yeah, so if you know

0:33:57.800 --> 0:34:00.200
<v Speaker 1>the end of the sky is blue and all the

0:34:00.200 --> 0:34:02.400
<v Speaker 1>grass is green, my heart's as full as a baked potato.

0:34:02.800 --> 0:34:05.440
<v Speaker 1>You let me know, all right. Well, I think that

0:34:05.440 --> 0:34:09.840
<v Speaker 1>wraps up our discussion about applications of big data and

0:34:09.840 --> 0:34:12.319
<v Speaker 1>and what we're using it for. And again, it's just

0:34:12.480 --> 0:34:14.319
<v Speaker 1>kind of a hint at what big data will be

0:34:14.400 --> 0:34:19.680
<v Speaker 1>used for. And while yes, there are certainly examples of

0:34:19.760 --> 0:34:22.959
<v Speaker 1>how companies, governments could abuse big data in a way

0:34:23.000 --> 0:34:28.440
<v Speaker 1>that are legitimately scary, there are also some truly amazing

0:34:28.560 --> 0:34:31.680
<v Speaker 1>uses that could be very beneficial. So I don't think

0:34:31.719 --> 0:34:34.920
<v Speaker 1>we should shy away from it because of the uh,

0:34:34.960 --> 0:34:37.600
<v Speaker 1>the possibility of things being used in a scary way.

0:34:37.640 --> 0:34:39.200
<v Speaker 1>We just need to be aware of it and be

0:34:39.960 --> 0:34:42.200
<v Speaker 1>and make sure that we don't go down that pathway

0:34:42.360 --> 0:34:44.960
<v Speaker 1>because the benefits are too great for us just to ignore. Well,

0:34:44.960 --> 0:34:47.520
<v Speaker 1>of course, it can go either way. I mean, it's elemental,

0:34:47.760 --> 0:34:51.560
<v Speaker 1>it's knowledge and know. Yeah, it's a tool, and you

0:34:51.560 --> 0:34:54.800
<v Speaker 1>know a tool is it's going to be used the

0:34:54.840 --> 0:34:56.799
<v Speaker 1>way the person who's using the tool wants to use it,

0:34:56.880 --> 0:35:01.840
<v Speaker 1>So like a drill, so a third so so not

0:35:02.040 --> 0:35:05.200
<v Speaker 1>nice you really carried it home there, We're just going

0:35:05.239 --> 0:35:08.600
<v Speaker 1>to I'm just gonna end here, guys. If you have

0:35:08.760 --> 0:35:12.520
<v Speaker 1>suggestions for future episode topics or you want to tell

0:35:12.560 --> 0:35:15.399
<v Speaker 1>me the end of the song lyric I quoted rite

0:35:15.480 --> 0:35:18.000
<v Speaker 1>us let's know. Are you know? Just as FW Thinking

0:35:18.080 --> 0:35:20.480
<v Speaker 1>at discovery dot com or go to f W thinking

0:35:20.520 --> 0:35:23.239
<v Speaker 1>dot com. Check out our blogs, check out the podcasts,

0:35:23.719 --> 0:35:26.560
<v Speaker 1>check out the videos. We've got some really fun ones

0:35:26.680 --> 0:35:28.480
<v Speaker 1>up there. I think you guys will really like it,

0:35:28.840 --> 0:35:35.200
<v Speaker 1>and we'll talk to you again really soon. For more

0:35:35.200 --> 0:35:37.719
<v Speaker 1>on this topic in the future of technology, is it

0:35:37.840 --> 0:35:51.439
<v Speaker 1>Forward Thinking dot Com Brought to you by Toyota. Let's

0:35:51.480 --> 0:35:52.160
<v Speaker 1>go Places,