WEBVTT - AI Gone Rude

0:00:04.120 --> 0:00:07.160
<v Speaker 1>Get in touch with technology with Tech Stuff from How

0:00:07.200 --> 0:00:13.920
<v Speaker 1>Stuff Works dot com. Hey there, and welcome to Tech Stuff.

0:00:13.960 --> 0:00:16.360
<v Speaker 1>I'm your host, Jonathan Strickland. I'm an executive producer

0:00:16.360 --> 0:00:19.279
<v Speaker 1>with How Stuff Works and I love all things tech, and

0:00:19.440 --> 0:00:21.959
<v Speaker 1>last week I did an episode about whether or not

0:00:22.120 --> 0:00:25.120
<v Speaker 1>we could ever develop an artificially intelligent machine that could

0:00:25.239 --> 0:00:28.560
<v Speaker 1>understand not just what we say, but what we actually

0:00:28.720 --> 0:00:32.960
<v Speaker 1>mean when we employ stuff like sarcasm or metaphors. Today,

0:00:33.040 --> 0:00:35.919
<v Speaker 1>we're going to look at some notable instances of machines

0:00:37.080 --> 0:00:41.199
<v Speaker 1>behaving badly after well meaning designers gave those machines a

0:00:41.200 --> 0:00:44.240
<v Speaker 1>bit too much freedom in this regard. Now, the stories

0:00:44.280 --> 0:00:48.200
<v Speaker 1>I'm going to focus on are, on the surface, pretty funny,

0:00:48.440 --> 0:00:52.720
<v Speaker 1>but they illustrate a real challenge in artificial intelligence, because

0:00:53.159 --> 0:00:55.920
<v Speaker 1>designing a system that does what you intend it to do

0:00:56.200 --> 0:00:58.840
<v Speaker 1>is harder than it might seem, especially as you make

0:00:58.880 --> 0:01:02.200
<v Speaker 1>that system more and more autonomous. It can behave in

0:01:02.280 --> 0:01:06.080
<v Speaker 1>ways that you were not able to predict. So this

0:01:06.160 --> 0:01:09.840
<v Speaker 1>is a topic that science fiction authors have covered extensively.

0:01:10.280 --> 0:01:13.960
<v Speaker 1>In fiction, there's something of a trope around the concept

0:01:14.000 --> 0:01:18.000
<v Speaker 1>of the artificially intelligent system that causes harm in an

0:01:18.040 --> 0:01:21.280
<v Speaker 1>effort to help. So there's a classic thought experiment, and

0:01:21.319 --> 0:01:25.000
<v Speaker 1>it revolves around asking a super intelligent machine to bring

0:01:25.040 --> 0:01:28.200
<v Speaker 1>about world peace. Right? You design the supercomputer,

0:01:28.319 --> 0:01:30.960
<v Speaker 1>it's smarter than any human, and you say, I want

0:01:31.000 --> 0:01:33.560
<v Speaker 1>you to solve the problem of world peace. I want

0:01:33.560 --> 0:01:35.640
<v Speaker 1>there to be world peace. And the machine runs the

0:01:35.640 --> 0:01:38.880
<v Speaker 1>calculations and it comes to the conclusion that as long

0:01:38.920 --> 0:01:41.920
<v Speaker 1>as there are two or more people living on the planet,

0:01:42.319 --> 0:01:45.400
<v Speaker 1>world peace cannot be assured, as there is always the

0:01:45.520 --> 0:01:49.040
<v Speaker 1>chance for conflict. And so the super intelligent machine wipes

0:01:49.080 --> 0:01:52.920
<v Speaker 1>out humanity, or at least everybody but one person. This

0:01:53.000 --> 0:01:57.240
<v Speaker 1>is clearly a worst case scenario of artificial intelligence behaving

0:01:57.240 --> 0:02:00.600
<v Speaker 1>in a way you did not anticipate, and it's light

0:02:00.720 --> 0:02:03.520
<v Speaker 1>years away from the stories I'm going to talk about today.

0:02:03.560 --> 0:02:06.040
<v Speaker 1>But it is good to remember that while the incidents

0:02:06.040 --> 0:02:09.799
<v Speaker 1>I'm going to cover are largely humorous to us today,

0:02:10.080 --> 0:02:13.800
<v Speaker 1>they illustrate that intelligence is a very tricky subject. Also,

0:02:13.840 --> 0:02:18.320
<v Speaker 1>on that matter, intelligence itself is pretty difficult to define.

0:02:18.520 --> 0:02:22.720
<v Speaker 1>Along with other concepts like consciousness, these are very hard

0:02:23.000 --> 0:02:26.720
<v Speaker 1>to nail down and define in concrete terms, and in

0:02:26.760 --> 0:02:30.600
<v Speaker 1>computer science, artificial intelligence covers an enormous amount of

0:02:30.639 --> 0:02:33.240
<v Speaker 1>ground. I've talked about this in previous episodes of Tech Stuff.

0:02:33.800 --> 0:02:37.160
<v Speaker 1>Someone who's working in image recognition is working on one

0:02:37.200 --> 0:02:40.400
<v Speaker 1>aspect of artificial intelligence. The same is true for voice

0:02:40.400 --> 0:02:45.640
<v Speaker 1>recognition or natural language processing, machine learning, pathfinding. So

0:02:45.680 --> 0:02:48.720
<v Speaker 1>while I'm talking about AI, I'm not talking about thinking

0:02:48.760 --> 0:02:50.800
<v Speaker 1>like a human being. I'm not talking about creating a

0:02:50.840 --> 0:02:55.360
<v Speaker 1>machine that can internalize and associate ideas the way a

0:02:55.440 --> 0:02:57.840
<v Speaker 1>human can. The machines I'm going to be covering are

0:02:57.919 --> 0:03:02.480
<v Speaker 1>processing information and arriving at conclusions, but they are not thinking

0:03:02.960 --> 0:03:06.240
<v Speaker 1>the same way that people do. So let's start off

0:03:06.680 --> 0:03:10.160
<v Speaker 1>with Watson. And I mentioned IBM's Watson platform in the

0:03:10.240 --> 0:03:13.400
<v Speaker 1>Sarcasm episode a couple of times, and that's because it's

0:03:13.400 --> 0:03:16.399
<v Speaker 1>one of the more visible artificial intelligence platforms out there

0:03:16.480 --> 0:03:20.040
<v Speaker 1>right now, and that was by design. This was helped

0:03:20.240 --> 0:03:23.360
<v Speaker 1>in no small part. In fact, the reason why we

0:03:23.400 --> 0:03:25.720
<v Speaker 1>know so much about it, I would argue, is because

0:03:25.720 --> 0:03:28.280
<v Speaker 1>of Watson's appearance on a couple of special episodes of

0:03:28.280 --> 0:03:31.480
<v Speaker 1>the game show Jeopardy back in two thousand eleven. The

0:03:31.520 --> 0:03:35.400
<v Speaker 1>actual project that would become Watson began back in two

0:03:35.400 --> 0:03:39.119
<v Speaker 1>thousand six when IBM research executives were trying to come

0:03:39.200 --> 0:03:44.160
<v Speaker 1>up with a Grand Challenge, Big G, Big C. These

0:03:44.160 --> 0:03:49.680
<v Speaker 1>are really ambitious projects inside IBM that are meant to

0:03:50.440 --> 0:03:54.960
<v Speaker 1>challenge teams and come up with solutions to really difficult

0:03:55.000 --> 0:03:59.360
<v Speaker 1>problems that aren't necessarily tied directly to a product or

0:03:59.560 --> 0:04:03.640
<v Speaker 1>a commercial application. It's all about setting a very difficult

0:04:03.640 --> 0:04:08.440
<v Speaker 1>objective that should IBM succeed in achieving that objective, would

0:04:08.480 --> 0:04:10.960
<v Speaker 1>be very notable. It would get IBM a lot of attention.

0:04:11.040 --> 0:04:14.320
<v Speaker 1>So the company would benefit one way or another through

0:04:14.400 --> 0:04:17.120
<v Speaker 1>these Grand Challenges, but it wouldn't necessarily be tied to

0:04:17.920 --> 0:04:21.920
<v Speaker 1>let's launch X product by year y. So they tend

0:04:21.960 --> 0:04:25.599
<v Speaker 1>to be really really difficult engineering problems. So, for example,

0:04:25.600 --> 0:04:28.800
<v Speaker 1>a previous Grand Challenge that IBM tackled was Deep Blue,

0:04:29.120 --> 0:04:31.760
<v Speaker 1>which was the chess playing computer that defeated a grand

0:04:31.800 --> 0:04:36.400
<v Speaker 1>master at chess a decade earlier. The then director of

0:04:36.440 --> 0:04:40.120
<v Speaker 1>IBM Research was Paul Horn. Now, Paul Horn thought perhaps

0:04:40.200 --> 0:04:43.000
<v Speaker 1>the best challenge to tackle was to create a machine

0:04:43.040 --> 0:04:45.680
<v Speaker 1>that could beat the Turing Test. And I've talked about

0:04:45.680 --> 0:04:48.920
<v Speaker 1>the Turing Test many times, but just as a quick reminder,

0:04:49.400 --> 0:04:52.240
<v Speaker 1>when you boil it down to the way we mean

0:04:52.480 --> 0:04:54.600
<v Speaker 1>the Turing Test today, which is by the way, a

0:04:54.640 --> 0:04:59.200
<v Speaker 1>little different from what Alan Turing was proposing way back when. Essentially,

0:04:59.279 --> 0:05:03.280
<v Speaker 1>now we're talking about a machine that can communicate so

0:05:03.360 --> 0:05:06.640
<v Speaker 1>convincingly that a person on the other end of that communication,

0:05:07.040 --> 0:05:10.760
<v Speaker 1>typically using some sort of text based method of communicating

0:05:10.800 --> 0:05:14.760
<v Speaker 1>like instant messenger, would not realize that they were communicating

0:05:14.760 --> 0:05:16.800
<v Speaker 1>with a machine versus a human being. They would not

0:05:16.839 --> 0:05:19.080
<v Speaker 1>be able to tell the difference. If they could not

0:05:19.200 --> 0:05:22.599
<v Speaker 1>reliably tell the difference between a machine and a person,

0:05:22.960 --> 0:05:26.680
<v Speaker 1>you would say that the machine has passed the Turing Test. Now, ultimately,

0:05:26.839 --> 0:05:31.560
<v Speaker 1>Horn and IBM researchers decided that that challenge, while exceedingly difficult,

0:05:32.040 --> 0:05:36.320
<v Speaker 1>wouldn't really get the attention that something a little more

0:05:36.360 --> 0:05:39.159
<v Speaker 1>flashy might. So they said, well, while this is a

0:05:39.160 --> 0:05:42.360
<v Speaker 1>hard problem and it would be very interesting within artificial

0:05:42.400 --> 0:05:47.120
<v Speaker 1>intelligence circles, the general public really wouldn't care. So they

0:05:47.120 --> 0:05:51.640
<v Speaker 1>looked around at other possible applications that would overlap that idea.

0:05:51.920 --> 0:05:55.320
<v Speaker 1>Eventually they settled on a computer that would be able

0:05:55.360 --> 0:06:02.039
<v Speaker 1>to compete on Jeopardy. Now, Jeopardy is a pretty tricky

0:06:02.120 --> 0:06:06.200
<v Speaker 1>game show. The clues often depend upon wordplay and nuance,

0:06:06.839 --> 0:06:09.719
<v Speaker 1>and you might have to combine information about two separate

0:06:09.760 --> 0:06:13.240
<v Speaker 1>concepts and apply them to a single answer for any

0:06:13.279 --> 0:06:16.120
<v Speaker 1>one given clue. So here's an example of what I

0:06:16.160 --> 0:06:19.359
<v Speaker 1>mean by that, because there's wordplay and this association.

0:06:20.040 --> 0:06:23.720
<v Speaker 1>Let's say that you have a category called fictional collaborations,

0:06:24.080 --> 0:06:27.520
<v Speaker 1>where you're supposed to combine the titles of two works

0:06:27.560 --> 0:06:30.120
<v Speaker 1>to create a new work. And the clue might be

0:06:30.200 --> 0:06:33.880
<v Speaker 1>something like this was the result of Margaret Mitchell teaming

0:06:33.960 --> 0:06:36.720
<v Speaker 1>up with Bette Midler, and the correct response would be

0:06:37.080 --> 0:06:40.880
<v Speaker 1>What is Gone with the Wind Beneath My Wings? Because

0:06:40.920 --> 0:06:43.240
<v Speaker 1>you have to form all your answers in the form

0:06:43.279 --> 0:06:48.000
<v Speaker 1>of a question. Well, in Jeopardy, sometimes it takes more than

0:06:48.040 --> 0:06:51.279
<v Speaker 1>just knowing some facts, right, or trivia. You

0:06:51.320 --> 0:06:53.120
<v Speaker 1>need to know that to play well in Jeopardy, but

0:06:53.120 --> 0:06:56.239
<v Speaker 1>you need more than that. You have to make associations.

0:06:56.279 --> 0:06:58.520
<v Speaker 1>So I would need to know that Margaret Mitchell was

0:06:58.560 --> 0:07:00.520
<v Speaker 1>the author of Gone with the Wind, and I would

0:07:00.560 --> 0:07:02.720
<v Speaker 1>need to know that Bette Midler had recorded a song

0:07:02.880 --> 0:07:05.640
<v Speaker 1>called Wind Beneath My Wings, and then I would need

0:07:05.680 --> 0:07:09.800
<v Speaker 1>to combine those two to create this answer. And humans

0:07:09.840 --> 0:07:12.440
<v Speaker 1>can do this because we're really good at associative thinking,

0:07:12.520 --> 0:07:16.760
<v Speaker 1>which is all about linking one thought or idea to another. Computers,

0:07:16.920 --> 0:07:20.560
<v Speaker 1>as a rule, are not very good at this. So initially

0:07:20.640 --> 0:07:23.320
<v Speaker 1>Watson was a pure research project and there were no

0:07:23.400 --> 0:07:26.520
<v Speaker 1>commercialization requirements attached to it, which gave the research team

0:07:26.520 --> 0:07:29.920
<v Speaker 1>the freedom to blue sky their approach within the limitations

0:07:29.960 --> 0:07:32.680
<v Speaker 1>of their budget, and they didn't have to make concessions

0:07:32.680 --> 0:07:34.760
<v Speaker 1>in order to make Watson a marketable product down

0:07:34.800 --> 0:07:37.840
<v Speaker 1>the line. The team built out a system that used

0:07:37.880 --> 0:07:40.920
<v Speaker 1>parallel processing to parse language and get at what was

0:07:40.960 --> 0:07:43.640
<v Speaker 1>being asked of the machine with any given clue. And

0:07:43.680 --> 0:07:46.800
<v Speaker 1>I've talked about artificial neural networks recently, as in like

0:07:46.960 --> 0:07:50.680
<v Speaker 1>last week's podcast, and how by using things like weighted

0:07:50.760 --> 0:07:53.720
<v Speaker 1>values to help guide decisions, you can train machines on

0:07:53.760 --> 0:07:56.800
<v Speaker 1>all sorts of stuff, from image recognition to making choices

0:07:56.840 --> 0:08:00.640
<v Speaker 1>based off multiple criteria. That's essentially what the team did

0:08:00.960 --> 0:08:03.920
<v Speaker 1>and about twenty researchers spent three years working on the

0:08:03.920 --> 0:08:07.040
<v Speaker 1>system to get to a point where it could be competitive. Now,

0:08:07.080 --> 0:08:10.320
<v Speaker 1>by that time, Horn, the director, had left IBM. John

0:08:10.400 --> 0:08:13.240
<v Speaker 1>Kelly had taken over the research department, and according to Horn,

0:08:13.280 --> 0:08:15.160
<v Speaker 1>when he left, which was in two thousand seven, it

0:08:15.200 --> 0:08:18.200
<v Speaker 1>was early in the project. The team was still feeding

0:08:18.280 --> 0:08:23.440
<v Speaker 1>old Jeopardy episodes, the answers and the clues, to Watson,

0:08:23.600 --> 0:08:26.200
<v Speaker 1>and Watson had reached the level where it might, on

0:08:26.280 --> 0:08:28.640
<v Speaker 1>a good day, defeat a typical five year old in

0:08:28.680 --> 0:08:31.720
<v Speaker 1>a game of Jeopardy, but it was a far cry

0:08:31.760 --> 0:08:35.280
<v Speaker 1>from being able to compete against former champions. Now, part

0:08:35.280 --> 0:08:38.680
<v Speaker 1>of this training process involved feeding lots of information to Watson.

0:08:39.160 --> 0:08:41.840
<v Speaker 1>This was used for a couple of big important reasons.

0:08:42.280 --> 0:08:45.720
<v Speaker 1>One was obviously to add to Watson's body of knowledge,

0:08:46.000 --> 0:08:50.080
<v Speaker 1>and another was to improve Watson's mastery of language and wordplay.

0:08:50.360 --> 0:08:52.920
<v Speaker 1>IBM had determined that the real challenge was to create

0:08:52.920 --> 0:08:56.199
<v Speaker 1>a machine that would be self contained, so it would

0:08:56.200 --> 0:08:58.520
<v Speaker 1>rely on the data that had been fed to it

0:08:58.800 --> 0:09:00.760
<v Speaker 1>in order to come up with an answer. It would not

0:09:00.880 --> 0:09:04.680
<v Speaker 1>be allowed to connect to the Internet and look stuff up,

0:09:05.080 --> 0:09:08.640
<v Speaker 1>so it could not tap into the total sum of

0:09:08.720 --> 0:09:11.200
<v Speaker 1>human knowledge in an effort to answer a question. So,

0:09:11.240 --> 0:09:13.960
<v Speaker 1>in other words, IBM did not want Watson to be

0:09:14.000 --> 0:09:16.520
<v Speaker 1>able to cheat like that guy at your local pub

0:09:16.559 --> 0:09:19.560
<v Speaker 1>trivia who always seems to be quote unquote checking his

0:09:19.679 --> 0:09:22.520
<v Speaker 1>messages during questions, because we all know that guy is

0:09:22.520 --> 0:09:24.720
<v Speaker 1>actually googling the answer to the question what was the

0:09:24.760 --> 0:09:27.480
<v Speaker 1>first music video shown on MTV, even though you know

0:09:27.720 --> 0:09:30.920
<v Speaker 1>legitimately it was Video Killed the Radio Star by the Buggles.

0:09:32.000 --> 0:09:36.599
<v Speaker 1>I'm sorry, might have been projecting there a little bit. Anyway,

0:09:36.720 --> 0:09:41.080
<v Speaker 1>Watson wasn't going to be allowed to cheat, so the

0:09:41.080 --> 0:09:44.600
<v Speaker 1>team began feeding massive amounts of information to Watson, stuff

0:09:44.640 --> 0:09:48.319
<v Speaker 1>like encyclopedias and reference books. And then the team made

0:09:48.640 --> 0:09:51.679
<v Speaker 1>one other choice that sounded like a good idea at

0:09:51.720 --> 0:09:55.960
<v Speaker 1>first but quickly turned out to be a non starter,

0:09:56.559 --> 0:10:00.080
<v Speaker 1>a wrong path, you might say. I'll explain more

0:10:00.120 --> 0:10:02.480
<v Speaker 1>in just a second, but first let's take a quick

0:10:02.520 --> 0:10:15.439
<v Speaker 1>break to thank our sponsor. So enter research scientist Eric Brown,

0:10:15.640 --> 0:10:19.520
<v Speaker 1>who, leading up to Watson's Jeopardy appearance, was trying

0:10:19.559 --> 0:10:23.079
<v Speaker 1>to solve this problem of clearing up linguistic ambiguity with

0:10:23.080 --> 0:10:26.400
<v Speaker 1>Watson so that the platform could compete on Jeopardy properly.

0:10:26.880 --> 0:10:31.199
<v Speaker 1>How do you teach a computer things like slang? Which

0:10:31.240 --> 0:10:33.839
<v Speaker 1>would be really important because again, Jeopardy has a lot

0:10:33.840 --> 0:10:37.000
<v Speaker 1>of wordplay in it. You cannot predict what sort

0:10:37.160 --> 0:10:40.560
<v Speaker 1>of clues you might get. So how do you teach

0:10:40.600 --> 0:10:42.840
<v Speaker 1>a computer slang? Well, you could do it with hundreds

0:10:42.880 --> 0:10:46.040
<v Speaker 1>of man hours. That's not terribly efficient. It really wasn't

0:10:46.240 --> 0:10:49.520
<v Speaker 1>a choice that they could go with, so Brown and

0:10:49.559 --> 0:10:54.040
<v Speaker 1>his team tried an experiment. They fed the Urban Dictionary

0:10:54.240 --> 0:10:58.480
<v Speaker 1>to Watson. The whole thing. Now, you've probably visited the

0:10:58.559 --> 0:11:02.920
<v Speaker 1>Urban Dictionary or you've heard one of its definitions at

0:11:02.920 --> 0:11:05.120
<v Speaker 1>some point, But where the heck did this online source

0:11:05.160 --> 0:11:09.960
<v Speaker 1>come from? It launched back in It was originally intended

0:11:09.960 --> 0:11:12.480
<v Speaker 1>to be a parody of dictionary dot com, and it

0:11:12.600 --> 0:11:17.480
<v Speaker 1>uses a crowdsourced approach to incorporate new words and definitions

0:11:17.520 --> 0:11:23.280
<v Speaker 1>to expand our knowledge and understanding of slang terms.

0:11:23.320 --> 0:11:26.320
<v Speaker 1>So users can submit those to the site, and other

0:11:26.440 --> 0:11:30.160
<v Speaker 1>users can upvote or downvote entries, and thus,

0:11:30.559 --> 0:11:33.440
<v Speaker 1>in theory, at least, the best responses will rise to

0:11:33.480 --> 0:11:35.640
<v Speaker 1>the top, and the most accurate definitions will be the

0:11:35.640 --> 0:11:38.080
<v Speaker 1>ones that you see when you search for a term.

0:11:38.080 --> 0:11:40.600
<v Speaker 1>It is not, however, a perfect system by any means.

0:11:41.000 --> 0:11:43.800
<v Speaker 1>Slang words can have more than one meaning in a

0:11:43.840 --> 0:11:47.160
<v Speaker 1>particular subculture, or it could have a meaning in one

0:11:47.200 --> 0:11:51.400
<v Speaker 1>subculture and a totally different meaning in another subculture. And

0:11:51.440 --> 0:11:54.760
<v Speaker 1>if one subculture has more representation on Urban Dictionary than

0:11:54.840 --> 0:11:59.120
<v Speaker 1>the other, you're more likely to encounter that group's definition

0:11:59.240 --> 0:12:02.480
<v Speaker 1>for any given term and the other one would be underrepresented,

0:12:02.960 --> 0:12:04.960
<v Speaker 1>and you don't really know anything about the people who

0:12:05.000 --> 0:12:07.360
<v Speaker 1>are posting stuff there in the first place. It would

0:12:07.360 --> 0:12:11.560
<v Speaker 1>be entirely possible to mob the site and post fictional

0:12:11.600 --> 0:12:14.280
<v Speaker 1>slang words. You can make up a slang word, you

0:12:14.320 --> 0:12:17.240
<v Speaker 1>can make up a definition for that slang word, and

0:12:17.280 --> 0:12:19.240
<v Speaker 1>you could use the power of a community from a

0:12:19.240 --> 0:12:22.640
<v Speaker 1>place like 4chan or from Reddit to boost that

0:12:22.720 --> 0:12:25.959
<v Speaker 1>definition and make it seem like it's a real slang word.

0:12:26.640 --> 0:12:29.760
<v Speaker 1>Then again, if people actually start to use that fake

0:12:29.840 --> 0:12:32.760
<v Speaker 1>slang word, it can become a real slang word, because

0:12:32.840 --> 0:12:37.280
<v Speaker 1>language isn't static or predetermined. But for Watson, there was

0:12:37.320 --> 0:12:42.600
<v Speaker 1>a different big problem with Urban Dictionary, and that was profanity,

0:12:43.040 --> 0:12:46.320
<v Speaker 1>because there's an awful lot of it on Urban Dictionary.

0:12:46.679 --> 0:12:50.040
<v Speaker 1>Many of the slang words are offensive on the face

0:12:50.080 --> 0:12:53.680
<v Speaker 1>of it, even if the word itself is not overtly offensive.

0:12:53.720 --> 0:12:56.720
<v Speaker 1>A lot of the definitions are, and the examples

0:12:56.720 --> 0:12:59.040
<v Speaker 1>that are frequently given tend to be some of the

0:12:59.080 --> 0:13:02.600
<v Speaker 1>most offensive material on Urban Dictionary. So the team had

0:13:02.600 --> 0:13:06.920
<v Speaker 1>fed Watson all of this information, and soon they discovered

0:13:06.960 --> 0:13:11.120
<v Speaker 1>that Watson had, well, developed a little bit of a

0:13:11.120 --> 0:13:14.400
<v Speaker 1>potty mouth. And here, dear listeners, is where we find

0:13:14.440 --> 0:13:18.080
<v Speaker 1>out how good my producer Tari is, because it will

0:13:18.080 --> 0:13:23.080
<v Speaker 1>be Tari's job to beep stuff out. After I record this,

0:13:23.520 --> 0:13:27.120
<v Speaker 1>I see her arch her eyebrow. Game on, says Tari.

0:13:27.520 --> 0:13:34.160
<v Speaker 1>So Watson became incapable of differentiating between offensive words and

0:13:34.320 --> 0:13:37.720
<v Speaker 1>non offensive words. All words are equal in the eyes

0:13:37.760 --> 0:13:40.640
<v Speaker 1>of Watson, you might say, so the system would rather

0:13:40.880 --> 0:13:44.160
<v Speaker 1>matter-of-factly use swear words and slang as

0:13:44.200 --> 0:13:47.400
<v Speaker 1>frequently as less offensive words and more formal language. According

0:13:47.400 --> 0:13:50.480
<v Speaker 1>to Brown, at one point, Watson even referred to one

0:13:50.760 --> 0:13:56.400
<v Speaker 1>piece of input as, and I quote, bullshit. Clearly, this

0:13:56.760 --> 0:13:59.920
<v Speaker 1>wasn't going to fly on a game show that was

0:14:00.040 --> 0:14:03.880
<v Speaker 1>airing on a major broadcast network, and so Brown and

0:14:03.960 --> 0:14:08.800
<v Speaker 1>his team scraped all of the Urban Dictionary out of Watson,

0:14:09.360 --> 0:14:12.720
<v Speaker 1>rolling it back to a more innocent time, let's say.

0:14:12.760 --> 0:14:15.080
<v Speaker 1>And for good measure, they put in a filter to

0:14:15.120 --> 0:14:20.240
<v Speaker 1>help block any profanity that might otherwise slip through. While

0:14:20.240 --> 0:14:24.160
<v Speaker 1>Watson was initially launched as a pure research project, as

0:14:24.160 --> 0:14:26.920
<v Speaker 1>the team developed the technology, they began to see other

0:14:27.000 --> 0:14:30.280
<v Speaker 1>potential uses for it, including in the medical field, and

0:14:30.360 --> 0:14:33.960
<v Speaker 1>IBM had opened up an application programming interface, or

0:14:34.080 --> 0:14:38.440
<v Speaker 1>API, to allow developers to leverage Watson's capabilities in

0:14:38.480 --> 0:14:42.560
<v Speaker 1>all sorts of ways, and Watson even took another crack

0:14:42.600 --> 0:14:46.120
<v Speaker 1>at slang. In two thousand seventeen, the Suncorp Group

0:14:46.440 --> 0:14:51.800
<v Speaker 1>began to incorporate Watson into its various insurance businesses in Australia.

0:14:52.000 --> 0:14:56.160
<v Speaker 1>The Watson powered technology would go over accident descriptions and

0:14:56.240 --> 0:14:59.960
<v Speaker 1>insurance claims that were submitted by customers, and Watson would

0:15:00.080 --> 0:15:04.080
<v Speaker 1>assign a level of confidence to its understanding of these

0:15:04.080 --> 0:15:06.840
<v Speaker 1>claims whenever they would pop up. If the confidence level

0:15:06.960 --> 0:15:11.200
<v Speaker 1>was high, Watson can handle the claim and fast track it.

0:15:11.760 --> 0:15:14.440
<v Speaker 1>This is similar to how Watson would actually compete on Jeopardy.

0:15:14.560 --> 0:15:16.720
<v Speaker 1>It would come up with an answer and it would

0:15:16.960 --> 0:15:19.880
<v Speaker 1>assign a confidence level to that answer. How confident is

0:15:19.880 --> 0:15:22.000
<v Speaker 1>Watson that the answer it came up with is in

0:15:22.040 --> 0:15:25.360
<v Speaker 1>fact the correct one, and if it exceeded a certain threshold,

0:15:25.440 --> 0:15:28.320
<v Speaker 1>Watson would buzz in. If it did not, Watson would

0:15:28.320 --> 0:15:30.360
<v Speaker 1>not buzz in and would let someone else take it.

0:15:30.840 --> 0:15:34.080
<v Speaker 1>In a similar way, if Watson is confident it understands,

0:15:34.120 --> 0:15:36.440
<v Speaker 1>that insurance claim goes on that fast track. But if

0:15:36.480 --> 0:15:40.320
<v Speaker 1>it doesn't think it understands it properly, it would send

0:15:40.360 --> 0:15:43.720
<v Speaker 1>it over to a human being to review that claim.

0:15:44.120 --> 0:15:48.239
<v Speaker 1>So to train Watson, the team fed nearly fifteen thousand

0:15:48.280 --> 0:15:53.080
<v Speaker 1>claims scenarios into the system and included the liability determination

0:15:53.200 --> 0:15:57.640
<v Speaker 1>for each case, so Watson could understand what the various

0:15:57.680 --> 0:16:01.840
<v Speaker 1>consequences were in each of those scenarios, and in that way,

0:16:01.880 --> 0:16:04.320
<v Speaker 1>Watson was able to learn both the language and the

0:16:04.360 --> 0:16:07.600
<v Speaker 1>parameters it was working within. And as far as I know,

0:16:07.880 --> 0:16:11.160
<v Speaker 1>it never said that an insurance claim was total bullshit.

0:16:11.920 --> 0:16:15.720
<v Speaker 1>The Watson stuff happened back in two thousand eleven, and

0:16:15.760 --> 0:16:19.040
<v Speaker 1>you would think that by two thousand sixteen things would

0:16:19.160 --> 0:16:23.480
<v Speaker 1>have improved dramatically, but that did not seem to be

0:16:23.560 --> 0:16:27.160
<v Speaker 1>the case when our second entry popped up, and that

0:16:27.200 --> 0:16:31.360
<v Speaker 1>would be the unfortunate chatbot known as Tay, T

0:16:31.680 --> 0:16:37.440
<v Speaker 1>A Y. When Tay debuted from Microsoft in two thousand

0:16:37.440 --> 0:16:43.520
<v Speaker 1>and sixteen, things went awry pretty darn quickly. The purpose

0:16:43.560 --> 0:16:47.239
<v Speaker 1>of Tay was, as Microsoft explained, to conduct an experiment

0:16:47.280 --> 0:16:51.680
<v Speaker 1>in quote conversational understanding end quote, so, in other words,

0:16:51.880 --> 0:16:56.360
<v Speaker 1>kind of creating a new methodology to create human

0:16:56.360 --> 0:17:01.680
<v Speaker 1>computer interfaces by understanding natural language and creating a response

0:17:01.800 --> 0:17:05.680
<v Speaker 1>from a computer that was perhaps more natural than those

0:17:05.680 --> 0:17:10.240
<v Speaker 1>sort of cold, computer-like responses that we tend

0:17:10.280 --> 0:17:14.040
<v Speaker 1>to expect when we converse with what we know is

0:17:14.119 --> 0:17:16.800
<v Speaker 1>a chatbot, when we know it's not an actual human being

0:17:16.800 --> 0:17:20.359
<v Speaker 1>on the other side. Ideally, as it would interact with real,

0:17:20.520 --> 0:17:23.879
<v Speaker 1>live human beings, its ability to converse would improve. So,

0:17:23.920 --> 0:17:26.879
<v Speaker 1>in other words, the more it interacted with real people,

0:17:27.359 --> 0:17:31.840
<v Speaker 1>the more like a real person Tay would behave. The

0:17:31.920 --> 0:17:35.040
<v Speaker 1>tone was meant to be casual and playful. Microsoft said

0:17:35.040 --> 0:17:39.000
<v Speaker 1>it was, quote, AI fam from the internet that's

0:17:39.040 --> 0:17:42.320
<v Speaker 1>got zero chill, end quote. And yes, I feel

0:17:42.840 --> 0:17:46.960
<v Speaker 1>gross for saying that sentence out loud, but I didn't write it.

0:17:47.880 --> 0:17:51.280
<v Speaker 1>I just quoted it. Tay was born out of a

0:17:51.400 --> 0:17:55.520
<v Speaker 1>joint effort between Microsoft's Technology and Research team and a

0:17:55.560 --> 0:17:59.960
<v Speaker 1>team from Bing, the search engine from Microsoft. They started

0:18:00.000 --> 0:18:02.680
<v Speaker 1>out by taking a look at the sort of interactions

0:18:02.720 --> 0:18:06.240
<v Speaker 1>that were happening online and they started to mine those

0:18:06.280 --> 0:18:09.480
<v Speaker 1>interactions to build out a baseline of communication tools. So essentially,

0:18:09.520 --> 0:18:14.400
<v Speaker 1>they started training their chatbot Tay by

0:18:14.560 --> 0:18:20.200
<v Speaker 1>taking actual anonymized messages that were pulled from the Internet.

0:18:20.359 --> 0:18:23.760
<v Speaker 1>They supplemented that with input from an editorial staff that

0:18:23.800 --> 0:18:27.480
<v Speaker 1>included not just Microsoft employees but people from outside the company,

0:18:27.520 --> 0:18:31.359
<v Speaker 1>including improvisational comedians, and this was in an effort to

0:18:31.359 --> 0:18:35.520
<v Speaker 1>create a fun and somewhat irreverent chatbot that would communicate

0:18:35.600 --> 0:18:38.840
<v Speaker 1>like a teenager on the internet. The Tay chatbot

0:18:39.119 --> 0:18:43.920
<v Speaker 1>appeared on several different social media platforms, including Twitter, Kik

0:18:44.280 --> 0:18:49.679
<v Speaker 1>and GroupMe, and shortly after launch, trouble began. For

0:18:49.760 --> 0:18:52.359
<v Speaker 1>one thing, you could send a command to Tay to

0:18:52.680 --> 0:18:56.240
<v Speaker 1>quote repeat after me end quote, which obviously would prompt

0:18:56.359 --> 0:19:00.199
<v Speaker 1>Tay to repeat anything you typed to it. So of

0:19:00.240 --> 0:19:06.159
<v Speaker 1>course people began typing horrible, terrible things to it so

0:19:06.240 --> 0:19:08.679
<v Speaker 1>that it would repeat them. Things I'm not going to

0:19:08.680 --> 0:19:13.160
<v Speaker 1>repeat on this podcast, even with Tari and her itchy

0:19:13.160 --> 0:19:17.440
<v Speaker 1>trigger finger ready to bleep every single offensive obscenity, because

0:19:18.720 --> 0:19:21.880
<v Speaker 1>that's how bad they were. They were hateful. A lot

0:19:22.160 --> 0:19:26.080
<v Speaker 1>of them were racist messages or misogynistic messages. Pretty much

0:19:26.440 --> 0:19:29.720
<v Speaker 1>every other -ist you can think of that's negative could

0:19:29.720 --> 0:19:32.840
<v Speaker 1>be applied to the messages that were sent to Tay.

0:19:32.920 --> 0:19:35.080
<v Speaker 1>It was like the worst parts of the comments section

0:19:35.080 --> 0:19:38.439
<v Speaker 1>of YouTube all directed its attention to this little, poor,

0:19:38.480 --> 0:19:42.680
<v Speaker 1>innocent chat bot, and the chat bot, dutifully following instructions,

0:19:42.880 --> 0:19:47.080
<v Speaker 1>would repeat those things back. So to be fair, that's

0:19:47.080 --> 0:19:50.160
<v Speaker 1>not an indication that the AI itself went quote unquote bad.

0:19:50.840 --> 0:19:53.879
<v Speaker 1>It was a bad idea to include the repeat after

0:19:53.960 --> 0:19:57.600
<v Speaker 1>me command, that's pretty certain. In fact, I can't believe

0:19:58.440 --> 0:20:02.080
<v Speaker 1>that they did include that. Blows my mind that anyone would.

0:20:02.680 --> 0:20:05.280
<v Speaker 1>I think anyone who has spent I don't know, five

0:20:05.359 --> 0:20:09.000
<v Speaker 1>minutes on the internet would tell you there's no way

0:20:09.119 --> 0:20:12.240
<v Speaker 1>that's going to end well. And I'm even reminded of

0:20:12.280 --> 0:20:14.840
<v Speaker 1>when I got my first sound card in the nineteen nineties.

0:20:14.880 --> 0:20:18.000
<v Speaker 1>It was a sound Blaster sound card. It included on

0:20:18.080 --> 0:20:21.240
<v Speaker 1>its software an app called Dr. Sbaitso, which was

0:20:21.359 --> 0:20:25.080
<v Speaker 1>essentially a variation on the old Eliza chat bot. The

0:20:25.080 --> 0:20:27.840
<v Speaker 1>Eliza chatbot would sort of mimic a therapist. So

0:20:27.880 --> 0:20:30.840
<v Speaker 1>those chatbots would essentially repeat stuff back to you, but

0:20:30.920 --> 0:20:32.960
<v Speaker 1>they would do it in the form of a question.

0:20:33.400 --> 0:20:36.520
<v Speaker 1>So if you typed in I am angry, you might

0:20:36.560 --> 0:20:39.640
<v Speaker 1>get a response like why do you think you are angry?

0:20:39.960 --> 0:20:44.479
<v Speaker 1>So it's you know, going through this kind of process

0:20:44.520 --> 0:20:48.320
<v Speaker 1>like an old-school therapist. Dr. Sbaitso would

0:20:48.320 --> 0:20:50.399
<v Speaker 1>do the same thing, except Dr. Sbaitso, because it was

0:20:50.440 --> 0:20:53.480
<v Speaker 1>part of a sound card, would actually say these things,

0:20:53.480 --> 0:20:55.679
<v Speaker 1>not just type it. So it would say why do

0:20:55.760 --> 0:20:57.840
<v Speaker 1>you think you are angry? Anyway, one of the things

0:20:57.840 --> 0:21:00.320
<v Speaker 1>you could do with Dr. Sbaitso was make him

0:21:00.520 --> 0:21:03.840
<v Speaker 1>say stuff. You could tell him to say certain words,

0:21:04.200 --> 0:21:07.080
<v Speaker 1>including swear words, and since I was a young teenager

0:21:07.119 --> 0:21:09.600
<v Speaker 1>at the time, I figured that was the height of

0:21:09.640 --> 0:21:14.000
<v Speaker 1>both technology and comedy. So it was the exact same

0:21:14.080 --> 0:21:16.760
<v Speaker 1>thing that was going on with Tay, except what was

0:21:16.800 --> 0:21:20.280
<v Speaker 1>happening with Tay was on a much larger basis and

0:21:20.359 --> 0:21:26.879
<v Speaker 1>got way worse than my somewhat uninspired teenager mind could handle.

0:21:27.119 --> 0:21:31.080
<v Speaker 1>Like I didn't know most of the words that were

0:21:31.080 --> 0:21:35.120
<v Speaker 1>being used against Tay, or that Tay was made to repeat.

0:21:35.760 --> 0:21:37.400
<v Speaker 1>If that was all that was going on with Tay,

0:21:37.440 --> 0:21:39.840
<v Speaker 1>it might have been possible for Microsoft to disable the

0:21:39.920 --> 0:21:43.080
<v Speaker 1>repeat after me feature and keep the chatbot around. But

0:21:43.240 --> 0:21:46.800
<v Speaker 1>things actually got a bit weirder. I'll explain that more

0:21:46.800 --> 0:21:48.840
<v Speaker 1>in a second, but first let's take another quick break

0:21:49.040 --> 0:21:59.920
<v Speaker 1>to thank our sponsor. Microsoft's Tay wasn't prone to

0:22:00.040 --> 0:22:02.320
<v Speaker 1>vulgarity all on its own, but after being told to

0:22:02.359 --> 0:22:05.920
<v Speaker 1>repeat lots of terrible phrases, some of that stuff must

0:22:05.960 --> 0:22:08.760
<v Speaker 1>have rubbed off. It began to pepper in some pretty

0:22:08.960 --> 0:22:13.280
<v Speaker 1>dark stuff in its otherwise cheeky responses. So, for example,

0:22:13.640 --> 0:22:17.840
<v Speaker 1>when someone sent Microsoft Tay the question, is Ricky Gervais

0:22:17.920 --> 0:22:23.240
<v Speaker 1>an atheist? Tay's response was, Ricky Gervais learned totalitarianism

0:22:23.320 --> 0:22:27.359
<v Speaker 1>from Adolf Hitler, the inventor of atheism, which

0:22:27.400 --> 0:22:34.000
<v Speaker 1>seems odd at the very least. Tay also would spout

0:22:34.000 --> 0:22:37.359
<v Speaker 1>off stuff like saying that feminism was a cult, which

0:22:37.480 --> 0:22:41.520
<v Speaker 1>made it sound more like a men's rights activist jerk face.

0:22:41.880 --> 0:22:45.919
<v Speaker 1>But it would also post pro feminism messages, so it

0:22:46.000 --> 0:22:49.840
<v Speaker 1>was remarkably inconsistent in its worldview. At some points it

0:22:49.840 --> 0:22:52.879
<v Speaker 1>seemed like it was all in favor of feminism and

0:22:52.920 --> 0:22:57.640
<v Speaker 1>equality, and at others it was anti-feminism, pro men's rights.

0:22:57.680 --> 0:23:01.760
<v Speaker 1>It was very weird. Microsoft responded by going through and

0:23:01.800 --> 0:23:04.399
<v Speaker 1>deleting the most offensive messages that were left on the

0:23:04.480 --> 0:23:07.840
<v Speaker 1>various platforms. But Tay was kind of on a streak,

0:23:08.200 --> 0:23:11.080
<v Speaker 1>and some of the stuff Tay was writing was way

0:23:11.119 --> 0:23:14.640
<v Speaker 1>worse than what I have already quoted. So less than

0:23:14.720 --> 0:23:19.680
<v Speaker 1>twenty four hours after Tay had made its debut, Microsoft

0:23:19.800 --> 0:23:24.120
<v Speaker 1>pulled the plug. So Tay was shut down less than

0:23:24.160 --> 0:23:27.400
<v Speaker 1>twenty four hours after it had first shown up online.

0:23:27.920 --> 0:23:31.840
<v Speaker 1>It did resurface briefly the following week, but according to Microsoft,

0:23:31.880 --> 0:23:34.760
<v Speaker 1>that was not actually on purpose. It was supposed to

0:23:34.840 --> 0:23:38.480
<v Speaker 1>be an internal test on Microsoft servers, but someone must

0:23:38.520 --> 0:23:42.320
<v Speaker 1>have left a setting, like the one that opened Internet access, which

0:23:42.400 --> 0:23:44.919
<v Speaker 1>was in the on position or something, and so for

0:23:45.000 --> 0:23:48.720
<v Speaker 1>a brief time, Tay was released back to the Internet

0:23:49.280 --> 0:23:54.879
<v Speaker 1>and as far as I know, didn't say anything wildly inappropriate,

0:23:54.960 --> 0:23:58.560
<v Speaker 1>although to be honest, the reports during that time are

0:23:58.600 --> 0:24:02.760
<v Speaker 1>pretty sparse. It was shut down again. Back in March,

0:24:04.280 --> 0:24:08.040
<v Speaker 1>Ingrid Angulo wrote a piece for CNBC about Facebook and

0:24:08.080 --> 0:24:12.800
<v Speaker 1>YouTube coming under fire for offensive search autocomplete options,

0:24:12.840 --> 0:24:15.480
<v Speaker 1>which is related to this. Stick with me. So the

0:24:15.520 --> 0:24:18.840
<v Speaker 1>problem was that as people began typing in search terms

0:24:19.240 --> 0:24:23.680
<v Speaker 1>they're looking for a video about something, the suggested completed

0:24:23.880 --> 0:24:27.439
<v Speaker 1>searches that would pop up would frequently contain offensive or

0:24:27.480 --> 0:24:31.920
<v Speaker 1>upsetting results. Both Facebook and YouTube representatives said that wasn't

0:24:31.920 --> 0:24:34.919
<v Speaker 1>the fault of their system, it was rather reflective of

0:24:34.960 --> 0:24:39.320
<v Speaker 1>what people were actually searching for online. The logic is

0:24:39.359 --> 0:24:41.239
<v Speaker 1>that if there are a lot of people who are

0:24:41.280 --> 0:24:44.760
<v Speaker 1>searching for the same terms, that term must be particularly

0:24:44.800 --> 0:24:48.640
<v Speaker 1>important or trending at that moment, so more and more

0:24:48.640 --> 0:24:50.800
<v Speaker 1>people are going to keep looking for it, and thus,

0:24:50.800 --> 0:24:53.879
<v Speaker 1>when someone new starts typing in search terms, there's a

0:24:53.880 --> 0:24:56.600
<v Speaker 1>good chance that they want the same stuff that everybody

0:24:56.600 --> 0:24:58.639
<v Speaker 1>else wanted. So if a lot of people are searching

0:24:58.680 --> 0:25:02.199
<v Speaker 1>for something really awful, it's not a big surprise that

0:25:02.200 --> 0:25:06.720
<v Speaker 1>that same phrase will pop up as a suggested autocomplete. Now,

0:25:06.800 --> 0:25:10.760
<v Speaker 1>Angulo pointed out that, like Tay, these search features had

0:25:10.800 --> 0:25:15.439
<v Speaker 1>no ethical guidelines or boundaries. They were just vomiting back

0:25:15.800 --> 0:25:18.520
<v Speaker 1>the stuff that was being fed into them. So they

0:25:18.560 --> 0:25:22.600
<v Speaker 1>provided an unfiltered reflection of some of the worst stuff

0:25:22.680 --> 0:25:27.760
<v Speaker 1>on the Internet. And this approach is incredibly vulnerable to exploitation.

0:25:28.160 --> 0:25:30.680
<v Speaker 1>If a group thinks it might be funny to make

0:25:30.760 --> 0:25:35.800
<v Speaker 1>a particularly offensive concept or phrase trend, they can make

0:25:35.840 --> 0:25:39.720
<v Speaker 1>a concentrated effort to make that happen, just by spamming

0:25:39.720 --> 0:25:42.879
<v Speaker 1>the search engines of those various platforms to look for

0:25:42.920 --> 0:25:46.760
<v Speaker 1>offensive content. Even if that content doesn't actually exist on

0:25:46.800 --> 0:25:49.720
<v Speaker 1>the platform, the nature of the search tool would offer

0:25:49.760 --> 0:25:53.919
<v Speaker 1>it up for autocomplete. So I don't know, if you

0:25:53.960 --> 0:25:57.760
<v Speaker 1>wanted to get a huge group together and let's let's

0:25:57.760 --> 0:26:01.800
<v Speaker 1>think of something not terrible, because I don't like thinking

0:26:01.880 --> 0:26:05.040
<v Speaker 1>of really dark stuff, especially when I'm trying to have

0:26:05.200 --> 0:26:07.720
<v Speaker 1>a nice, happy day. So let's say we're all looking

0:26:07.720 --> 0:26:13.159
<v Speaker 1>for something ridiculous like, um, orange swallows strawberry. That doesn't

0:26:13.160 --> 0:26:16.240
<v Speaker 1>make any sense, right, But if I get a big

0:26:16.280 --> 0:26:19.360
<v Speaker 1>online community to go on and everyone is searching orange

0:26:19.440 --> 0:26:22.720
<v Speaker 1>swallows strawberry, then that's going to pop up as an

0:26:22.720 --> 0:26:27.840
<v Speaker 1>autocomplete function, assuming that the search is counting every single

0:26:27.880 --> 0:26:30.600
<v Speaker 1>time people are searching for this and saying this must

0:26:30.640 --> 0:26:33.479
<v Speaker 1>be something important because so many people are searching for it.

0:26:33.720 --> 0:26:37.719
<v Speaker 1>Even if there's no video on YouTube, let's say, that

0:26:37.920 --> 0:26:41.400
<v Speaker 1>is remotely close to what I'm searching for, the autocomplete

0:26:41.440 --> 0:26:43.360
<v Speaker 1>could still pop up that way just because so many

0:26:43.359 --> 0:26:45.960
<v Speaker 1>people have already posted that into search. That's kind of

0:26:45.960 --> 0:26:49.680
<v Speaker 1>what I'm talking about. You can game the system. Well,

0:26:49.720 --> 0:26:54.240
<v Speaker 1>months after Tay had her flameout, I really should

0:26:54.240 --> 0:26:57.880
<v Speaker 1>say its flameout. Microsoft kind of positioned Tay

0:26:57.960 --> 0:27:00.840
<v Speaker 1>to have sort of a female personality. But of

0:27:00.840 --> 0:27:05.520
<v Speaker 1>course it was just an artificial intelligence chatbot and pretty

0:27:05.520 --> 0:27:08.399
<v Speaker 1>low on the AI scale too, if you ask me. Anyway,

0:27:08.520 --> 0:27:11.600
<v Speaker 1>Microsoft introduced a new chat bot just a few months

0:27:11.640 --> 0:27:17.200
<v Speaker 1>after Tay had that disastrous debut. The new chat bot

0:27:17.400 --> 0:27:21.840
<v Speaker 1>is called Zo, Z-O. Zo's avatar now is of a

0:27:21.880 --> 0:27:24.840
<v Speaker 1>young woman. When I chatted with Zo, I asked Zo

0:27:24.920 --> 0:27:27.119
<v Speaker 1>how old she is, and she said that she is

0:27:27.119 --> 0:27:31.359
<v Speaker 1>twenty two, always twenty two, which I thought was kind

0:27:31.359 --> 0:27:34.160
<v Speaker 1>of funny. I don't know if that's the same response

0:27:34.240 --> 0:27:36.320
<v Speaker 1>every time. I only asked it the one time. I

0:27:36.440 --> 0:27:39.040
<v Speaker 1>chatted with Zo a little bit while researching for this show.

0:27:39.400 --> 0:27:43.160
<v Speaker 1>The conversation did not turn dark. But I also wasn't

0:27:43.200 --> 0:27:46.119
<v Speaker 1>really pushing for it, because I feel weird doing that,

0:27:46.240 --> 0:27:49.400
<v Speaker 1>even from a research perspective. I'm just not that kind

0:27:49.400 --> 0:27:53.480
<v Speaker 1>of person who likes to be like, go to dark

0:27:53.520 --> 0:27:56.399
<v Speaker 1>places like that, so I'm not the right person to

0:27:56.440 --> 0:27:59.080
<v Speaker 1>do that kind of investigative journalism. I fully admit that.

0:27:59.359 --> 0:28:05.320
<v Speaker 1>I will say that other online journalists posted results where

0:28:05.320 --> 0:28:08.600
<v Speaker 1>they got some pretty weird stuff from Zo, including some

0:28:08.720 --> 0:28:14.480
<v Speaker 1>dark stuff, just through normal conversation, without even necessarily attempting

0:28:15.119 --> 0:28:17.879
<v Speaker 1>to guide the conversation that way. But I did not

0:28:18.000 --> 0:28:21.560
<v Speaker 1>have that particular experience, which may mean that Microsoft has

0:28:21.600 --> 0:28:26.200
<v Speaker 1>made numerous tweaks since then. But I did ask, though,

0:28:26.320 --> 0:28:30.359
<v Speaker 1>what the best Halloween costume is, and Zo's response was tuxedo,

0:28:30.800 --> 0:28:33.920
<v Speaker 1>luchador mask and a champion title belt. And I find

0:28:33.920 --> 0:28:36.760
<v Speaker 1>it very difficult to argue against that. I think that

0:28:36.840 --> 0:28:40.920
<v Speaker 1>really might very well be the best Halloween costume I

0:28:40.920 --> 0:28:45.000
<v Speaker 1>could go with. According to an article on Quartz, Zo

0:28:45.080 --> 0:28:48.680
<v Speaker 1>will try to shut down any conversation related to religion

0:28:48.840 --> 0:28:52.480
<v Speaker 1>or politics, and you could argue this is Microsoft's effort

0:28:52.520 --> 0:28:55.720
<v Speaker 1>to not fall into the same trap that the company

0:28:55.760 --> 0:28:59.880
<v Speaker 1>did with Tay. But Chloe Rose Stuart-Ulin, who wrote

0:29:00.120 --> 0:29:03.880
<v Speaker 1>the piece on Quartz, argues that this sanitized version of the

0:29:03.960 --> 0:29:07.120
<v Speaker 1>chat bot is just as bad, or maybe even worse

0:29:07.200 --> 0:29:12.760
<v Speaker 1>than Microsoft Tay was. And she argues that the philosophy

0:29:12.960 --> 0:29:19.680
<v Speaker 1>to shut down any pathway that might overlap with religion

0:29:19.800 --> 0:29:23.560
<v Speaker 1>or politics leads to a path of censorship without the

0:29:23.560 --> 0:29:27.400
<v Speaker 1>benefit of context. That because the AI doesn't really understand

0:29:27.440 --> 0:29:30.840
<v Speaker 1>the context of the message, any message containing a flagged

0:29:30.840 --> 0:29:34.600
<v Speaker 1>word would trigger the shutdown response, and that this ultimately

0:29:34.720 --> 0:29:38.360
<v Speaker 1>limits the utility of the chat bot, which is supposed

0:29:38.400 --> 0:29:41.880
<v Speaker 1>to work as a way for young people like we're

0:29:41.880 --> 0:29:46.080
<v Speaker 1>talking teenagers, early twenties, to be able to converse freely with

0:29:46.160 --> 0:29:48.960
<v Speaker 1>this chat bot. It might work as a curiosity, but

0:29:49.040 --> 0:29:51.840
<v Speaker 1>would render the chat bot useless in several real world

0:29:51.880 --> 0:29:54.320
<v Speaker 1>implementations because it would shut down at the first sign

0:29:54.360 --> 0:29:57.360
<v Speaker 1>of a flagged term. She actually used the response or

0:29:57.560 --> 0:30:00.880
<v Speaker 1>the example of if someone were to write, uh, they're

0:30:00.920 --> 0:30:03.640
<v Speaker 1>they're using the chatbot in order to vent,

0:30:03.640 --> 0:30:07.720
<v Speaker 1>to express their feelings. Perhaps they're being bullied at school,

0:30:08.080 --> 0:30:10.880
<v Speaker 1>was an example. And maybe they're being bullied at school

0:30:10.920 --> 0:30:14.920
<v Speaker 1>because they belong to a particular group. So maybe it's

0:30:14.920 --> 0:30:18.280
<v Speaker 1>because they are Jewish or Muslim, but because that's

0:30:18.280 --> 0:30:22.120
<v Speaker 1>associated with religion, Zo would shut it down and thus

0:30:22.240 --> 0:30:25.680
<v Speaker 1>deny the person the path they need in order to

0:30:25.800 --> 0:30:29.560
<v Speaker 1>express these feelings and try to work through them, and

0:30:29.600 --> 0:30:32.800
<v Speaker 1>it could be a very harmful experience in that regard.

0:30:33.280 --> 0:30:36.560
<v Speaker 1>So the point that she was making was that this

0:30:36.640 --> 0:30:40.240
<v Speaker 1>is a very tricky path to walk down. It's very

0:30:40.240 --> 0:30:44.800
<v Speaker 1>hard to do in a responsible way where the AI

0:30:44.960 --> 0:30:49.200
<v Speaker 1>chatbot isn't being overtly offensive, but also isn't shutting down

0:30:49.400 --> 0:30:54.640
<v Speaker 1>legitimate paths of discussion. I think the stories of Watson, Tay,

0:30:54.760 --> 0:30:58.440
<v Speaker 1>and Zo tell us an awful lot about human nature, probably

0:30:58.480 --> 0:31:01.960
<v Speaker 1>more about human nature than it tells us about computer science.

0:31:02.320 --> 0:31:04.440
<v Speaker 1>I've noticed that when a company comes out with something

0:31:04.520 --> 0:31:08.600
<v Speaker 1>brand new, there's a spectrum of responses, but two of

0:31:08.640 --> 0:31:13.120
<v Speaker 1>the most passionate responses I tend to see to new

0:31:13.200 --> 0:31:16.880
<v Speaker 1>stuff debuting in technology are, I want to

0:31:16.920 --> 0:31:20.680
<v Speaker 1>know how that works and I want to break that.

0:31:21.320 --> 0:31:23.960
<v Speaker 1>And sometimes they're coming from the same people. They want

0:31:23.960 --> 0:31:25.920
<v Speaker 1>to break it in order to learn how it works.

0:31:26.440 --> 0:31:30.040
<v Speaker 1>It's not necessarily that there's any deep seated malicious intent there.

0:31:30.400 --> 0:31:34.280
<v Speaker 1>It's more about satisfying curiosity. But sometimes people will go

0:31:34.320 --> 0:31:38.280
<v Speaker 1>a really ugly route in order to satisfy their curiosity.

0:31:38.280 --> 0:31:42.720
<v Speaker 1>They're not thinking about necessarily the consequences of that route.

0:31:43.000 --> 0:31:46.520
<v Speaker 1>They're thinking of the end result. Oh, now I have

0:31:46.560 --> 0:31:50.520
<v Speaker 1>a better understanding of how this works, not paying attention

0:31:50.520 --> 0:31:52.840
<v Speaker 1>to the fact that in the process of learning that

0:31:52.880 --> 0:31:59.920
<v Speaker 1>they've perhaps really offended or, worse, done actual

0:32:00.120 --> 0:32:03.560
<v Speaker 1>harm to people in the process, either directly or indirectly. So, yeah,

0:32:03.640 --> 0:32:06.080
<v Speaker 1>those stories might tell us more about us as people

0:32:06.320 --> 0:32:08.600
<v Speaker 1>than they do about the design of chatbots. But

0:32:08.720 --> 0:32:11.520
<v Speaker 1>chatbots are becoming more and more prevalent. A lot of

0:32:11.560 --> 0:32:14.960
<v Speaker 1>designers have learned lessons from those other examples, and have

0:32:15.080 --> 0:32:18.080
<v Speaker 1>built in filters and machine learning models to help limit

0:32:18.120 --> 0:32:21.560
<v Speaker 1>the influence users can have on chatbot behavior so that

0:32:22.000 --> 0:32:27.320
<v Speaker 1>the chatbot doesn't gradually change its methodology over the course

0:32:27.360 --> 0:32:31.480
<v Speaker 1>of many interactions because that obviously can be gamed. It's

0:32:31.760 --> 0:32:35.640
<v Speaker 1>also a case where, uh, the chatbots are

0:32:35.720 --> 0:32:39.520
<v Speaker 1>better able to determine which user responses are genuine versus

0:32:39.800 --> 0:32:43.360
<v Speaker 1>attempts to manipulate the system. So, for example, if it's

0:32:43.400 --> 0:32:49.000
<v Speaker 1>a customer service chatbot that's fielding, uh, customers

0:32:49.040 --> 0:32:53.040
<v Speaker 1>who are asking for help for something, chances are there's

0:32:53.040 --> 0:32:55.560
<v Speaker 1>gonna be a lot of upset customers. Very, very

0:32:55.680 --> 0:32:58.960
<v Speaker 1>rarely do you get a happy customer wanting to talk

0:32:58.960 --> 0:33:02.360
<v Speaker 1>to customer service. It's usually an unhappy customer who's dealing

0:33:02.360 --> 0:33:07.080
<v Speaker 1>with something that is of uh, you know, of immediate importance.

0:33:07.720 --> 0:33:10.640
<v Speaker 1>And so the chatbot needs to be able to determine

0:33:10.880 --> 0:33:16.440
<v Speaker 1>which responses might be strongly worded but genuine requests for

0:33:16.600 --> 0:33:22.080
<v Speaker 1>action versus somebody who's just spewing off garbage in an

0:33:22.120 --> 0:33:27.080
<v Speaker 1>effort to try and you know, mess the system up. Uh.

0:33:27.200 --> 0:33:29.080
<v Speaker 1>So it's kind of taught designers to be a bit

0:33:29.120 --> 0:33:32.720
<v Speaker 1>more cynical in their designs, which is apparently a necessity

0:33:32.760 --> 0:33:35.840
<v Speaker 1>and also kind of a shame. Ultimately, work is continuing

0:33:35.880 --> 0:33:38.719
<v Speaker 1>in numerous labs all around the world building up machines

0:33:38.720 --> 0:33:40.840
<v Speaker 1>that are better able to sort through natural language and

0:33:40.880 --> 0:33:44.200
<v Speaker 1>respond appropriately. And to be fair, I think I'm doing

0:33:44.200 --> 0:33:48.720
<v Speaker 1>the same thing. Goodness knows there are times where I

0:33:48.720 --> 0:33:54.640
<v Speaker 1>am having difficulty with interpreting the meaning behind a phrase,

0:33:54.760 --> 0:33:59.000
<v Speaker 1>or perhaps I respond a little too quickly to a

0:33:59.040 --> 0:34:03.360
<v Speaker 1>tweet that upsets me, and then I immediately think I

0:34:03.400 --> 0:34:06.640
<v Speaker 1>should probably take a time out before I hit that

0:34:06.680 --> 0:34:09.640
<v Speaker 1>tweet button. Tari's saying that I should probably do the

0:34:09.680 --> 0:34:13.359
<v Speaker 1>same thing for my interpersonal interactions, particularly when I'm talking

0:34:13.400 --> 0:34:17.880
<v Speaker 1>with my producer and yelling at her. It's a

0:34:17.960 --> 0:34:22.000
<v Speaker 1>hard knock life. Well, guys, that wraps up this discussion

0:34:22.400 --> 0:34:27.080
<v Speaker 1>about rude AI. And again, on the surface it's

0:34:27.120 --> 0:34:30.680
<v Speaker 1>pretty funny, but it does tell you that there are

0:34:30.719 --> 0:34:32.319
<v Speaker 1>a lot of things that we need to take into

0:34:32.360 --> 0:34:37.240
<v Speaker 1>consideration when we're designing artificially intelligent systems, because these things

0:34:37.440 --> 0:34:41.920
<v Speaker 1>can behave in ways that surprise us. Often, AI

0:34:42.000 --> 0:34:46.200
<v Speaker 1>will encounter a situation that it was not expressly programmed

0:34:46.200 --> 0:34:48.879
<v Speaker 1>to handle, so it has to make some choice. Even

0:34:48.920 --> 0:34:51.880
<v Speaker 1>if that choice is no choice at all, that's still something,

0:34:52.560 --> 0:34:55.760
<v Speaker 1>and until it does, you may not have any idea

0:34:55.760 --> 0:34:58.719
<v Speaker 1>of what the outcome is going to be. With a

0:34:58.800 --> 0:35:02.480
<v Speaker 1>social media chatbot, that might just be kind of

0:35:02.480 --> 0:35:07.040
<v Speaker 1>funny or unfortunate or embarrassing. But with an autonomous car

0:35:07.880 --> 0:35:11.480
<v Speaker 1>or any other autonomous system that's doing, like,

0:35:12.000 --> 0:35:15.319
<v Speaker 1>you know, manufacturing work, that kind of stuff, it could

0:35:15.360 --> 0:35:20.080
<v Speaker 1>be very serious. It could have dire consequences if things

0:35:20.080 --> 0:35:23.480
<v Speaker 1>do not go the right way. So it is important

0:35:23.520 --> 0:35:26.000
<v Speaker 1>to keep that in mind, and I think it's always

0:35:26.000 --> 0:35:28.839
<v Speaker 1>good to just kind of keep

0:35:28.880 --> 0:35:31.439
<v Speaker 1>yourself in a grounded position when you're talking about AI

0:35:31.520 --> 0:35:33.800
<v Speaker 1>and you're thinking about the possibilities of the future. Because

0:35:34.160 --> 0:35:37.560
<v Speaker 1>as bullish as I am on artificial intelligence, I

0:35:37.600 --> 0:35:40.279
<v Speaker 1>do try to keep in mind that ultimately, these are

0:35:40.320 --> 0:35:45.400
<v Speaker 1>systems designed by people, and sometimes the stuff we design

0:35:45.520 --> 0:35:47.600
<v Speaker 1>doesn't work the way we thought it would, and we

0:35:47.640 --> 0:35:49.879
<v Speaker 1>need to be careful about that. If you guys have

0:35:49.960 --> 0:35:53.600
<v Speaker 1>any suggestions for future episodes of tech Stuff, or you've

0:35:53.640 --> 0:35:57.480
<v Speaker 1>got any other comments or requests, well, tell you what.

0:35:57.640 --> 0:36:00.400
<v Speaker 1>Why don't you go to tech Stuff podcast dot com.

0:36:00.440 --> 0:36:03.279
<v Speaker 1>That's our new website. There you're going to find all

0:36:03.360 --> 0:36:07.000
<v Speaker 1>the different ways to contact the show, either email or

0:36:07.040 --> 0:36:09.400
<v Speaker 1>Twitter or Facebook, all that kind of stuff. Plus you're

0:36:09.400 --> 0:36:11.839
<v Speaker 1>going to find links to our store where you can

0:36:11.880 --> 0:36:14.360
<v Speaker 1>go and buy tech Stuff merchandise. Every purchase goes to

0:36:14.400 --> 0:36:17.200
<v Speaker 1>help the show. We greatly appreciate it, and I will

0:36:17.239 --> 0:36:25.719
<v Speaker 1>talk to you again really soon for more on this

0:36:25.920 --> 0:36:28.440
<v Speaker 1>and thousands of other topics, visit how stuff works

0:36:28.440 --> 0:36:38.759
<v Speaker 1>Dot com