WEBVTT - Smart Talks with IBM: The Debating AI

0:00:00.080 --> 0:00:02.719
<v Speaker 1>In this episode, we'll be focusing on Project Debater, which

0:00:02.720 --> 0:00:06.800
<v Speaker 1>is an AI system designed to process evidence and persuasive

0:00:06.880 --> 0:00:10.039
<v Speaker 1>arguments and text so that it can ultimately understand and

0:00:10.080 --> 0:00:13.800
<v Speaker 1>participate in human debate. To get to the heart of

0:00:13.840 --> 0:00:16.880
<v Speaker 1>this effort, we're going to share two interviews we recorded

0:00:16.880 --> 0:00:20.799
<v Speaker 1>with leaders at IBM. The first is with Noam slow Name,

0:00:21.120 --> 0:00:24.200
<v Speaker 1>who is a distinguished engineer at IBM Research and founder

0:00:24.239 --> 0:00:27.600
<v Speaker 1>of Project Debater, and the second chat will be with

0:00:27.680 --> 0:00:31.840
<v Speaker 1>matdou Coachar, who is Vice President Offering Management for IBM

0:00:31.920 --> 0:00:34.680
<v Speaker 1>Data and AI. So today's episode is going to be

0:00:34.760 --> 0:00:38.000
<v Speaker 1>the third of four episodes in this series that Robert

0:00:38.040 --> 0:00:39.840
<v Speaker 1>and I are releasing here on the Stuff to Blow

0:00:39.840 --> 0:00:42.720
<v Speaker 1>Your Mind feed. If you'd like to hear more episodes,

0:00:42.800 --> 0:00:45.159
<v Speaker 1>you can check out the ones labeled smart Talks that

0:00:45.200 --> 0:00:47.479
<v Speaker 1>we've released over the past few weeks, and you can

0:00:47.520 --> 0:00:50.480
<v Speaker 1>also listen to the first four episodes of smart Talks,

0:00:50.520 --> 0:00:52.840
<v Speaker 1>which were released not on our show but in the feed.

0:00:52.880 --> 0:00:55.400
<v Speaker 1>For the podcast Text Stuff. You can find them on

0:00:55.440 --> 0:00:57.720
<v Speaker 1>the I Heart Radio app or wherever you get your podcast.

0:00:57.800 --> 0:01:00.000
<v Speaker 1>Just look up text Stuff and click on the episode

0:01:00.040 --> 0:01:02.840
<v Speaker 1>has labeled Smart Talks, and of course stay tuned for

0:01:02.840 --> 0:01:04.920
<v Speaker 1>the one remaining episode in the series, which is going

0:01:05.000 --> 0:01:06.839
<v Speaker 1>to be published in our feed in a couple of weeks.

0:01:07.200 --> 0:01:10.320
<v Speaker 1>And now straight onto our conversation with no One Slowly,

0:01:12.920 --> 0:01:14.880
<v Speaker 1>no One, thanks so much for joining us today. Can

0:01:14.920 --> 0:01:18.360
<v Speaker 1>you start by introducing yourself and talking about your role

0:01:18.400 --> 0:01:22.880
<v Speaker 1>at IBM? Sure, thank you for hosting me. So I'm

0:01:22.920 --> 0:01:26.200
<v Speaker 1>no One Slownym. I'm a distinguished engineer at IBMI Research.

0:01:27.440 --> 0:01:30.920
<v Speaker 1>I did my PhD in the Hebew University quite a

0:01:30.920 --> 0:01:36.400
<v Speaker 1>few years ago walking on machine learning, staff and artificial intelligence,

0:01:37.040 --> 0:01:39.600
<v Speaker 1>and then I did a past doc at Princeton University

0:01:39.680 --> 0:01:44.240
<v Speaker 1>and I joined the IBM research in two thousand and seven,

0:01:45.319 --> 0:01:50.960
<v Speaker 1>and uh in two thousand and eleven, I suggested the

0:01:50.960 --> 0:01:53.320
<v Speaker 1>project that I guess we're going to talk about today,

0:01:53.520 --> 0:01:56.440
<v Speaker 1>and of course that project was Project Debat, right do

0:01:56.480 --> 0:01:58.280
<v Speaker 1>you do? You want to mention a little bit about

0:01:58.280 --> 0:02:01.640
<v Speaker 1>the origins of that. In IBM research, we have this

0:02:02.120 --> 0:02:08.840
<v Speaker 1>interesting tradition of grand challenges in artificial intelligence. Back in

0:02:08.919 --> 0:02:12.440
<v Speaker 1>the nineties, idem introduced the Blue that was able to

0:02:12.480 --> 0:02:16.800
<v Speaker 1>defeat Gary customers in chess, and in two thousand eleven

0:02:16.880 --> 0:02:19.720
<v Speaker 1>id AM introduced Watson that was able to defeat the

0:02:19.760 --> 0:02:23.760
<v Speaker 1>all time winners of the TV trivia game Jeopardy. And

0:02:23.919 --> 0:02:27.440
<v Speaker 1>just a few days after this event, an email was

0:02:27.520 --> 0:02:31.120
<v Speaker 1>sent to all the thousands of researchers in i DM

0:02:31.200 --> 0:02:35.880
<v Speaker 1>across the globe, myself included, asking us what should be

0:02:35.919 --> 0:02:41.119
<v Speaker 1>the next grand challenge for IDM research and uh I

0:02:41.160 --> 0:02:44.160
<v Speaker 1>was intrigued by that, so I offered my office mate

0:02:44.600 --> 0:02:48.480
<v Speaker 1>at the time to brainstone together, and this is what

0:02:48.520 --> 0:02:50.680
<v Speaker 1>we did. We set in the office in Tel Aviv

0:02:50.880 --> 0:02:54.440
<v Speaker 1>and we raised many different ideas that probably I should

0:02:54.480 --> 0:02:58.600
<v Speaker 1>not share with you today, but at some point towards

0:02:58.639 --> 0:03:02.640
<v Speaker 1>the end of the hour, well I suggested this notion

0:03:03.160 --> 0:03:08.320
<v Speaker 1>of developing a machine that we'll be able to debate humans,

0:03:08.520 --> 0:03:11.399
<v Speaker 1>and that this is how we will demonstrate the technology

0:03:11.480 --> 0:03:15.440
<v Speaker 1>for a full life debate between this envisioned system and

0:03:15.520 --> 0:03:20.720
<v Speaker 1>an expert human debate. And we submitted that the only

0:03:20.720 --> 0:03:23.240
<v Speaker 1>guidance that we got from the management was really to

0:03:23.280 --> 0:03:27.200
<v Speaker 1>submit the proposals in a single side so they will

0:03:27.240 --> 0:03:30.320
<v Speaker 1>not be swamped with too many details. And we were

0:03:30.400 --> 0:03:34.400
<v Speaker 1>able to helpfully follow these guidelines and we submitted a

0:03:34.440 --> 0:03:37.120
<v Speaker 1>single slide. This was fair Boy in two thousand eleven,

0:03:37.840 --> 0:03:40.920
<v Speaker 1>and this started a fairly long, and the thought review

0:03:41.000 --> 0:03:44.880
<v Speaker 1>process that lasted for a year, and in February two

0:03:44.920 --> 0:03:48.280
<v Speaker 1>thousand and twelve, this proposal was selected as the next

0:03:48.320 --> 0:03:52.520
<v Speaker 1>Man Challenge for IBM research and we started to walk

0:03:52.560 --> 0:03:56.440
<v Speaker 1>a few months later with a small team that gradually expanded,

0:03:57.480 --> 0:04:01.839
<v Speaker 1>and we walked on that intensively for I would say

0:04:01.880 --> 0:04:06.160
<v Speaker 1>six and a half yels dedicated solewly to dismission of

0:04:06.600 --> 0:04:09.880
<v Speaker 1>developing a machine that will be able to debate humans.

0:04:10.960 --> 0:04:15.840
<v Speaker 1>And eventually we demonstrated this system in a in a

0:04:15.880 --> 0:04:18.240
<v Speaker 1>full life debate. It was a little bit more than

0:04:18.279 --> 0:04:21.680
<v Speaker 1>a year ago, and it was a debate between this

0:04:21.760 --> 0:04:26.279
<v Speaker 1>system now being called the project debate and one of

0:04:26.400 --> 0:04:31.880
<v Speaker 1>the legendary debates in the history of university debate competitions,

0:04:32.000 --> 0:04:35.400
<v Speaker 1>and it still Harris Naam. It was in San Francisco,

0:04:35.680 --> 0:04:39.240
<v Speaker 1>and and it was a full life debate, surprisingly reminiscent

0:04:39.320 --> 0:04:42.640
<v Speaker 1>to division that we had back in the office in

0:04:42.640 --> 0:04:46.960
<v Speaker 1>Tel Aviv quite a few fields earlier in that single side.

0:04:47.360 --> 0:04:49.520
<v Speaker 1>So the topic of debate brings with it a few

0:04:49.520 --> 0:04:52.920
<v Speaker 1>different connotations, um, you know, and therefore the idea of

0:04:53.240 --> 0:04:56.000
<v Speaker 1>AI entering the frame might might be a bit confusing

0:04:56.080 --> 0:04:58.679
<v Speaker 1>for for some you know, we might imagine a computer

0:04:58.839 --> 0:05:01.840
<v Speaker 1>designed to defeat play or or perhaps a robot that

0:05:01.920 --> 0:05:06.279
<v Speaker 1>can shout louder and a televised US presidential debate to Daddy,

0:05:06.320 --> 0:05:09.360
<v Speaker 1>and can you walk us through what Project Debater is

0:05:09.600 --> 0:05:14.160
<v Speaker 1>and perhaps what it isn't. Yes, absolutely so. So first

0:05:14.160 --> 0:05:17.120
<v Speaker 1>of all, it is worth explaining what we mean, indeed

0:05:17.120 --> 0:05:21.800
<v Speaker 1>by a debate between an AI system like Project Debata

0:05:21.960 --> 0:05:26.960
<v Speaker 1>and a human opponent. So the debate starts with with

0:05:27.080 --> 0:05:30.640
<v Speaker 1>a motion in the debate jargon that defines what we're

0:05:30.680 --> 0:05:34.880
<v Speaker 1>going to debate. And in the event in San Francisco,

0:05:35.000 --> 0:05:38.080
<v Speaker 1>the topic was whether or not the government should subsidize

0:05:38.320 --> 0:05:42.159
<v Speaker 1>the schools. Uh. There are many considerations around how this

0:05:42.320 --> 0:05:44.839
<v Speaker 1>topic is being selected which we can skip, but the

0:05:44.920 --> 0:05:48.160
<v Speaker 1>only thing we should really emphasize is that this topic

0:05:49.080 --> 0:05:52.279
<v Speaker 1>is selected from a list of topics that were never

0:05:52.400 --> 0:05:57.039
<v Speaker 1>included in the training of the system, So the system

0:05:57.120 --> 0:06:00.160
<v Speaker 1>was never able to train on this particular topic. It

0:06:00.320 --> 0:06:03.880
<v Speaker 1>was trying to debate a new topic from from the

0:06:04.240 --> 0:06:08.000
<v Speaker 1>perspective of the machine. And then we are on the

0:06:08.000 --> 0:06:10.680
<v Speaker 1>side of the governments of Project Debta is supporting the

0:06:10.760 --> 0:06:14.200
<v Speaker 1>motion and how the issues on the opposition, and we

0:06:14.320 --> 0:06:18.039
<v Speaker 1>have a full minutes opening speeches for each side and

0:06:18.160 --> 0:06:23.480
<v Speaker 1>full minutely bottom speeches and two minutes closing statements. So

0:06:23.520 --> 0:06:26.400
<v Speaker 1>all you know, we are talking about a little more

0:06:26.440 --> 0:06:29.760
<v Speaker 1>than twenty to twenty five minutes of a discussion that

0:06:29.880 --> 0:06:34.800
<v Speaker 1>we hope we will be a meaningful discussion between Project

0:06:34.839 --> 0:06:38.000
<v Speaker 1>Debata and and and a human plish in these particularly

0:06:38.760 --> 0:06:42.240
<v Speaker 1>so to clarify for people who might not be familiar

0:06:42.279 --> 0:06:45.920
<v Speaker 1>with competitive debating. So competitive debating does not involve what

0:06:46.080 --> 0:06:48.640
<v Speaker 1>people might be more familiar with, which is like passionately

0:06:48.760 --> 0:06:52.279
<v Speaker 1>arguing your actual point of view. It involves having a

0:06:52.360 --> 0:06:56.080
<v Speaker 1>position selected for you that you then must get up

0:06:56.120 --> 0:06:59.440
<v Speaker 1>and defend in front of the judges. Correct, yes, this

0:06:59.520 --> 0:07:02.320
<v Speaker 1>is called act and and this is indeed important to

0:07:02.360 --> 0:07:06.719
<v Speaker 1>emphasize because you do not know in advance what is

0:07:06.720 --> 0:07:10.440
<v Speaker 1>going to be your side. And and even if you

0:07:10.480 --> 0:07:12.080
<v Speaker 1>know in advance that you are going to be on

0:07:12.120 --> 0:07:15.440
<v Speaker 1>the side of the government, we should bear in mind

0:07:15.440 --> 0:07:19.360
<v Speaker 1>the motion could have been phrased we should not subsidize previously,

0:07:20.280 --> 0:07:23.640
<v Speaker 1>and then you should actually contest that. So you do

0:07:23.720 --> 0:07:25.800
<v Speaker 1>not know in advance what is going to be your

0:07:25.880 --> 0:07:29.080
<v Speaker 1>stance to the topic. This is true for Project Debata

0:07:29.160 --> 0:07:32.960
<v Speaker 1>and also for the for the human opponent, and you

0:07:33.080 --> 0:07:36.240
<v Speaker 1>have only ten to fifteen minutes to PerPell. You don't

0:07:36.240 --> 0:07:38.680
<v Speaker 1>know the topic in advance. This is again true for

0:07:39.160 --> 0:07:44.240
<v Speaker 1>project debata and for the human opponent, and uh, your

0:07:44.320 --> 0:07:47.280
<v Speaker 1>goal is really to to persuade the audience. And this

0:07:47.440 --> 0:07:50.960
<v Speaker 1>actually touches on an interesting question of how do you

0:07:51.200 --> 0:07:56.240
<v Speaker 1>do you measure who won the debate? Because in chess

0:07:56.320 --> 0:07:59.040
<v Speaker 1>and in other games this is very clear and and

0:07:59.240 --> 0:08:03.280
<v Speaker 1>really part of the problem with with with debate in

0:08:03.320 --> 0:08:07.840
<v Speaker 1>general and with developing artificial intelligence that is capable of

0:08:07.880 --> 0:08:11.640
<v Speaker 1>debating in particular now is that it is very hard

0:08:12.040 --> 0:08:15.160
<v Speaker 1>to to be fine who actually won the debate. Yeah,

0:08:15.200 --> 0:08:17.680
<v Speaker 1>I know. There are a couple of different metrics. So

0:08:17.760 --> 0:08:20.200
<v Speaker 1>of course one would just be like, what is the

0:08:20.360 --> 0:08:23.320
<v Speaker 1>percentage of the audience that is convinced to either side?

0:08:23.320 --> 0:08:25.560
<v Speaker 1>But that can be problematic because people come in with

0:08:25.600 --> 0:08:28.800
<v Speaker 1>their own opinions already formed on an issue. So one

0:08:29.680 --> 0:08:33.480
<v Speaker 1>metric I've seen is how much the percentages change. They

0:08:33.559 --> 0:08:37.920
<v Speaker 1>ask people before and afterward what their positions are, and

0:08:37.960 --> 0:08:41.680
<v Speaker 1>then after word they say, okay, which side has one

0:08:41.840 --> 0:08:45.199
<v Speaker 1>over more people? Whatever the starting percentages were is, And

0:08:45.440 --> 0:08:48.600
<v Speaker 1>I assume you all had a metric like that precisely so,

0:08:48.600 --> 0:08:51.800
<v Speaker 1>so this is exactly the point, because if you simply

0:08:51.840 --> 0:08:54.520
<v Speaker 1>ask people who is more convinced, you need somehow to

0:08:54.559 --> 0:08:58.600
<v Speaker 1>take into account the opinions to begin with, and and

0:08:58.640 --> 0:09:01.800
<v Speaker 1>the it is done exactly as as you described it.

0:09:01.880 --> 0:09:06.559
<v Speaker 1>And all this event was in collaboration with with Intelligence

0:09:06.720 --> 0:09:10.000
<v Speaker 1>as well, which is really I think the leading platform

0:09:10.160 --> 0:09:14.640
<v Speaker 1>in the US for organizing such a high profile competitive debate.

0:09:15.480 --> 0:09:19.719
<v Speaker 1>It was hosted, the moderator was the moderator Form Intelligence

0:09:19.800 --> 0:09:23.480
<v Speaker 1>as well, John Dunvan, and and the voting was done

0:09:23.520 --> 0:09:27.840
<v Speaker 1>exactly as you described and as being done with the

0:09:27.880 --> 0:09:30.760
<v Speaker 1>show of Intelligence Square. That is, the audience is voting

0:09:31.240 --> 0:09:35.400
<v Speaker 1>before the debate starts, and they vote again after the

0:09:35.440 --> 0:09:38.680
<v Speaker 1>debate ends, and you win if you were able to

0:09:38.720 --> 0:09:41.640
<v Speaker 1>move more people to to your side. Now I think

0:09:41.640 --> 0:09:43.720
<v Speaker 1>a lot of people might be wondering, how on earth

0:09:43.720 --> 0:09:47.640
<v Speaker 1>would you even begin to organize a persuasive argument from

0:09:47.679 --> 0:09:49.680
<v Speaker 1>an AI point of view? Could you walk us through

0:09:49.720 --> 0:09:53.960
<v Speaker 1>the technical specifics of how Project Debater would put together

0:09:54.040 --> 0:09:57.280
<v Speaker 1>an argument. Yes, so we were asking ourselves the same

0:09:57.400 --> 0:10:03.240
<v Speaker 1>question actually when when we started this project. And I

0:10:03.280 --> 0:10:07.480
<v Speaker 1>think this is part of the of the nature of

0:10:07.640 --> 0:10:10.920
<v Speaker 1>such a grand challenge that you do not really know

0:10:11.120 --> 0:10:15.800
<v Speaker 1>how exactly you are going to to approach the problem.

0:10:15.840 --> 0:10:21.000
<v Speaker 1>But we did what computer scientists often do, and this

0:10:21.080 --> 0:10:25.480
<v Speaker 1>is to take this big and somewhat amorphic problem and

0:10:25.600 --> 0:10:31.320
<v Speaker 1>break it into more modular and hopefully more tangible tasks.

0:10:31.480 --> 0:10:37.640
<v Speaker 1>And so in general, the debated system had uh two

0:10:37.760 --> 0:10:41.520
<v Speaker 1>major sources of information. One of them is the massive

0:10:41.600 --> 0:10:48.480
<v Speaker 1>collection of around four hundred million newspaper articles, and when

0:10:48.480 --> 0:10:54.679
<v Speaker 1>the debate starts, the system was using various AI artificial

0:10:54.720 --> 0:11:00.840
<v Speaker 1>intelligence engines in order to try and pinpoint short pieces

0:11:00.880 --> 0:11:04.480
<v Speaker 1>of text within this massive collection. We're talking about ten

0:11:04.600 --> 0:11:09.120
<v Speaker 1>billion sentences, so we were trying to automatically pinpoint these

0:11:09.200 --> 0:11:14.880
<v Speaker 1>short pieces of text that should satisfy three criteria. They

0:11:14.880 --> 0:11:19.200
<v Speaker 1>should be relevant to the topic, they should be argumentative

0:11:19.240 --> 0:11:22.880
<v Speaker 1>in nature, they should argue something about the topic, and

0:11:22.920 --> 0:11:26.720
<v Speaker 1>they should support our side of the debate. And this

0:11:26.800 --> 0:11:30.280
<v Speaker 1>is quite a formidable challenge. But assuming that you are

0:11:30.320 --> 0:11:33.360
<v Speaker 1>capable of finding these short pieces of tax, the system

0:11:33.440 --> 0:11:38.040
<v Speaker 1>is then using other AI capabilities in order to try

0:11:38.160 --> 0:11:42.559
<v Speaker 1>and glue them together into a meaningful narrative. So this

0:11:42.679 --> 0:11:47.080
<v Speaker 1>is one major source of information for the system. The

0:11:47.160 --> 0:11:50.600
<v Speaker 1>second important source of information for the system was a

0:11:50.800 --> 0:11:58.480
<v Speaker 1>unique collection of more principled arguments that were actually written

0:11:58.800 --> 0:12:03.320
<v Speaker 1>by by humans, and we are talking about thousands of

0:12:03.480 --> 0:12:07.400
<v Speaker 1>more principled arguments. And the role of the system was

0:12:07.440 --> 0:12:11.080
<v Speaker 1>when the debate starts, was really to navigate within this

0:12:11.200 --> 0:12:14.880
<v Speaker 1>collection and find the most relevant principled arguments and use

0:12:14.960 --> 0:12:17.280
<v Speaker 1>them in the right timing. So so, to make this

0:12:17.360 --> 0:12:21.839
<v Speaker 1>more concrete what we mean by a principal argument, imagine

0:12:21.880 --> 0:12:25.200
<v Speaker 1>that we are debating whether or not to ban organ

0:12:25.280 --> 0:12:28.400
<v Speaker 1>trade or whether or not to ban the sale of alcohol.

0:12:28.880 --> 0:12:32.040
<v Speaker 1>In both cases, the opposition may argue that if you

0:12:32.120 --> 0:12:35.520
<v Speaker 1>ban something, you are at the risk of the emergence

0:12:35.559 --> 0:12:38.120
<v Speaker 1>of a black market. So a black market is a

0:12:38.160 --> 0:12:41.280
<v Speaker 1>principled argument that can be used almost in the same

0:12:41.360 --> 0:12:46.400
<v Speaker 1>way in many different contexts. So one may naively assume

0:12:47.360 --> 0:12:51.760
<v Speaker 1>that this is kind of a simple keyword matching thing.

0:12:52.040 --> 0:12:55.560
<v Speaker 1>If we ban something, then the opposition is going to

0:12:55.679 --> 0:12:58.240
<v Speaker 1>use the black market argument, and we should be prepared

0:12:58.320 --> 0:13:02.000
<v Speaker 1>for that. But obviously this is far from true. So

0:13:02.120 --> 0:13:07.679
<v Speaker 1>imagine a debate about banning breastfeeding in public. Obviously there

0:13:07.800 --> 0:13:11.400
<v Speaker 1>is little risk for a black market in this contract.

0:13:11.520 --> 0:13:15.000
<v Speaker 1>Or imagine a debate about banning internet cookies. We're not

0:13:15.120 --> 0:13:18.320
<v Speaker 1>going to tee a black market of internet cookies if

0:13:18.360 --> 0:13:22.560
<v Speaker 1>we band these. So the system really needs to develop

0:13:22.640 --> 0:13:27.960
<v Speaker 1>a more subtle understanding after human language in order to

0:13:28.080 --> 0:13:31.920
<v Speaker 1>be able to identify the most relevant principle argument and

0:13:31.960 --> 0:13:35.400
<v Speaker 1>need use them doing a debate. And and this is,

0:13:35.440 --> 0:13:40.199
<v Speaker 1>by the way, just what all this description is before

0:13:40.320 --> 0:13:43.800
<v Speaker 1>listening to the opponent. This is just what we're going

0:13:43.880 --> 0:13:47.840
<v Speaker 1>to say on our side. And and the most the

0:13:47.920 --> 0:13:51.960
<v Speaker 1>most challenging part is really too uh to listen to

0:13:52.000 --> 0:13:55.199
<v Speaker 1>the opponent. And it's some kind of a battle to

0:13:55.320 --> 0:13:59.040
<v Speaker 1>the arguments generated by the opponment raised by the And

0:13:59.200 --> 0:14:03.920
<v Speaker 1>we do that you using uh an arsenal of technique

0:14:04.320 --> 0:14:07.240
<v Speaker 1>that most of them rely on the same principle. We

0:14:07.360 --> 0:14:11.800
<v Speaker 1>start by listening to the world articulated by the opponment,

0:14:11.920 --> 0:14:14.760
<v Speaker 1>and for that we simply use what's on speech recognition

0:14:15.200 --> 0:14:17.839
<v Speaker 1>capabilities out of the box. But of course we need

0:14:17.880 --> 0:14:20.360
<v Speaker 1>to go to beyond the world, and we need to

0:14:20.440 --> 0:14:23.520
<v Speaker 1>understand the gist of the arguments of the opponent. And

0:14:23.560 --> 0:14:27.640
<v Speaker 1>in order to do that we try using various smackloads

0:14:27.680 --> 0:14:33.200
<v Speaker 1>to anticipate in advance what kind of arguments the opposition

0:14:33.800 --> 0:14:38.520
<v Speaker 1>mind you and then listen to determine whether he did

0:14:38.560 --> 0:14:43.720
<v Speaker 1>the opposition was making these arguments and then responded cold yeah.

0:14:43.800 --> 0:14:47.920
<v Speaker 1>That calls to mind the question of the difference between, say,

0:14:47.960 --> 0:14:51.720
<v Speaker 1>what's a sound argument versus what's a persuasive argument? I mean,

0:14:51.960 --> 0:14:55.840
<v Speaker 1>we know from reality that often the most persuasive appeals

0:14:55.840 --> 0:15:00.680
<v Speaker 1>and debates rely on just straightforwardly false claims and logical fallacies,

0:15:00.840 --> 0:15:03.960
<v Speaker 1>or even on little emotional cues that have little to

0:15:04.000 --> 0:15:06.680
<v Speaker 1>do with the matter at hand. I was thinking about

0:15:06.680 --> 0:15:09.240
<v Speaker 1>how in live debates, if you can get a laugh

0:15:09.400 --> 0:15:12.560
<v Speaker 1>at your opponent's expense, that's worth you know, a dozen

0:15:13.640 --> 0:15:18.200
<v Speaker 1>sound arguments or claims. So to what degree can AI

0:15:18.360 --> 0:15:21.840
<v Speaker 1>understand these sorts of persuasive appeals that that go beyond

0:15:22.000 --> 0:15:24.560
<v Speaker 1>just like what kind of evidence you can bring and

0:15:24.640 --> 0:15:29.880
<v Speaker 1>the appeals based on style you're right in in in

0:15:29.880 --> 0:15:33.040
<v Speaker 1>in debate and in the methods. We know already from

0:15:33.080 --> 0:15:38.320
<v Speaker 1>the ancient weeks that that we have free elaps, we

0:15:38.440 --> 0:15:43.240
<v Speaker 1>have logos, and we have ethos, and we have afforts,

0:15:43.280 --> 0:15:47.120
<v Speaker 1>and humans are using a mixture of these pilas when

0:15:47.440 --> 0:15:51.600
<v Speaker 1>they are debating one another. And just as a quick clarification, logos,

0:15:51.640 --> 0:15:55.080
<v Speaker 1>pathos and ethos are the types of appeals that were

0:15:55.120 --> 0:15:58.840
<v Speaker 1>identified in the study of classical rhetoric. Where logos is

0:15:58.920 --> 0:16:03.160
<v Speaker 1>appeals based on our logical arguments and evidence, Pathos is

0:16:03.160 --> 0:16:06.200
<v Speaker 1>the appeal to the emotions or the passions, and ethos

0:16:06.320 --> 0:16:09.440
<v Speaker 1>is an appeal based on the credibility or authority of

0:16:09.480 --> 0:16:14.360
<v Speaker 1>the speaker. I mean, as you know broadly understood and

0:16:14.360 --> 0:16:18.800
<v Speaker 1>and the technology that we developed, and and by the way,

0:16:18.800 --> 0:16:23.600
<v Speaker 1>it should be stated that there is a rapidly emerging

0:16:23.680 --> 0:16:28.880
<v Speaker 1>community of scientists across the globe that are investigating this

0:16:29.080 --> 0:16:32.120
<v Speaker 1>kind of topic. It is all under the umbrella of

0:16:32.240 --> 0:16:38.240
<v Speaker 1>this emerging field, yeah, referred to as a computational argumentation.

0:16:38.960 --> 0:16:41.760
<v Speaker 1>And when we started in two thousand and twelve, there

0:16:41.920 --> 0:16:46.720
<v Speaker 1>was a handful of teams pursuing that, and we see

0:16:46.720 --> 0:16:50.160
<v Speaker 1>a very dramatic increase in the result in these areas

0:16:50.200 --> 0:16:54.160
<v Speaker 1>of the last few years is very I think from

0:16:55.000 --> 0:17:01.880
<v Speaker 1>exacting and as I mentioned, the technology that we developed

0:17:01.880 --> 0:17:06.159
<v Speaker 1>a most focused on logos, and you can see in

0:17:06.200 --> 0:17:09.640
<v Speaker 1>the debate between proper Debate and Hali. By the way,

0:17:09.680 --> 0:17:13.879
<v Speaker 1>this this debate is is fully available on YouTube, and

0:17:14.080 --> 0:17:19.080
<v Speaker 1>you can see that indeed a woman is better in

0:17:19.240 --> 0:17:23.560
<v Speaker 1>making in using path as and perhaps in using ethos

0:17:23.600 --> 0:17:27.040
<v Speaker 1>and it is harder for the machine. And indeed most

0:17:27.040 --> 0:17:30.800
<v Speaker 1>of the research being done by by the by the

0:17:30.880 --> 0:17:35.560
<v Speaker 1>relevant research communities around logos, but there are already attempt

0:17:36.040 --> 0:17:40.320
<v Speaker 1>trying to model and to capture additional aspect of path

0:17:40.359 --> 0:17:44.400
<v Speaker 1>of and ethos in all the further enhanced this kind

0:17:44.400 --> 0:17:48.240
<v Speaker 1>of technology. So another question I have is debater has

0:17:48.280 --> 0:17:52.879
<v Speaker 1>to source claims and facts and arguments from existing written

0:17:52.880 --> 0:17:55.400
<v Speaker 1>work produced by humans, which of course we know can

0:17:55.440 --> 0:17:58.280
<v Speaker 1>be full of all sorts of flaws. Is there any

0:17:58.280 --> 0:18:01.600
<v Speaker 1>way at this point for it to to have an

0:18:01.600 --> 0:18:05.960
<v Speaker 1>analytical function to tell a say, factually true claim or

0:18:06.000 --> 0:18:09.960
<v Speaker 1>a logically valid argument from just something that is wrong

0:18:10.080 --> 0:18:12.920
<v Speaker 1>or dubious but repeated a lot in writing, or are

0:18:13.000 --> 0:18:19.200
<v Speaker 1>we not there yet? This is a very kindly important

0:18:19.240 --> 0:18:24.000
<v Speaker 1>and difficult problem, and that is receiving going attempting over

0:18:24.280 --> 0:18:30.639
<v Speaker 1>over the previous teams and go to tackle that. This

0:18:30.840 --> 0:18:35.080
<v Speaker 1>is certainly not bullet bof and and the problem is

0:18:35.080 --> 0:18:39.520
<v Speaker 1>is quite complex because one may say, you know, okay, fine,

0:18:39.600 --> 0:18:43.600
<v Speaker 1>maybe I should only take my argument from highly credibally

0:18:43.640 --> 0:18:49.760
<v Speaker 1>so and by boxy I can assume that that these

0:18:49.880 --> 0:18:54.720
<v Speaker 1>arguments are our valid. But this is not necessarily the case. Right.

0:18:54.800 --> 0:18:58.240
<v Speaker 1>You can see you can lead an opinion article in

0:18:58.359 --> 0:19:05.240
<v Speaker 1>a highly respectable newspaper which is actually quoting a false

0:19:05.359 --> 0:19:08.879
<v Speaker 1>argument that was made as well, and if you're not

0:19:08.920 --> 0:19:13.440
<v Speaker 1>careful enough, you you might be your system is going

0:19:13.480 --> 0:19:17.440
<v Speaker 1>to pull this argument without understanding that something is happening.

0:19:18.119 --> 0:19:21.199
<v Speaker 1>So we try to develop and we actually part of

0:19:21.240 --> 0:19:26.800
<v Speaker 1>Project Debate included some kind of filtering mechanism in order

0:19:26.920 --> 0:19:30.119
<v Speaker 1>to to filter out these kind of cases. And the

0:19:30.200 --> 0:19:34.359
<v Speaker 1>way we did that was really once a specific claim

0:19:34.920 --> 0:19:37.800
<v Speaker 1>was affected and by the way to being ordered, the

0:19:37.960 --> 0:19:41.200
<v Speaker 1>claim is not a full sentence. A claim is often

0:19:41.600 --> 0:19:44.600
<v Speaker 1>only a part of a tentence. Even if you were

0:19:44.680 --> 0:19:48.720
<v Speaker 1>able to detect sentence that contains a claim relevant one

0:19:48.800 --> 0:19:51.800
<v Speaker 1>that supportal side out of the billions of sentences in

0:19:51.840 --> 0:19:55.160
<v Speaker 1>the popos, you still need to find the coret boundaries

0:19:55.560 --> 0:19:58.520
<v Speaker 1>after claim within the sentence, and you have hundreds of

0:19:58.600 --> 0:20:02.440
<v Speaker 1>options and only all of them is correct. So this

0:20:02.520 --> 0:20:05.320
<v Speaker 1>is just going back why this this problem is it

0:20:05.440 --> 0:20:08.760
<v Speaker 1>so talenting? But until you do that and found this

0:20:08.960 --> 0:20:12.119
<v Speaker 1>claim and asked what is the stance of this claim,

0:20:12.440 --> 0:20:15.560
<v Speaker 1>and if the stance is supporting your side, you can

0:20:15.600 --> 0:20:18.920
<v Speaker 1>still ask what is the stance of the full sentence?

0:20:20.200 --> 0:20:22.439
<v Speaker 1>And if the stance of the full sentences in the

0:20:22.480 --> 0:20:26.520
<v Speaker 1>opposite direction, you may suspect that something is going on.

0:20:27.359 --> 0:20:31.280
<v Speaker 1>And perhaps this this claim is quoted in order to

0:20:31.920 --> 0:20:35.680
<v Speaker 1>contradict and not because it is true. And then perhaps

0:20:35.680 --> 0:20:39.879
<v Speaker 1>it is there it is safer to avoid using it.

0:20:39.920 --> 0:20:44.120
<v Speaker 1>But but this is just one safety mechanism, and and

0:20:44.200 --> 0:20:46.880
<v Speaker 1>the problem that you raise is actually a much more

0:20:46.960 --> 0:20:52.080
<v Speaker 1>beneval one, and and I think many teams are working

0:20:52.119 --> 0:20:55.359
<v Speaker 1>on that, and we try to address that as well.

0:20:55.600 --> 0:21:00.280
<v Speaker 1>And I think it has many interesting dimensions because it

0:21:00.400 --> 0:21:04.600
<v Speaker 1>is not even just about the validity of the argument. Often,

0:21:04.680 --> 0:21:08.600
<v Speaker 1>when when you show people to arguments, they will agree

0:21:08.640 --> 0:21:11.720
<v Speaker 1>that one of them is better than the other. But

0:21:11.920 --> 0:21:15.920
<v Speaker 1>what are the underlying mechanisms that I'd ask to the

0:21:16.160 --> 0:21:19.439
<v Speaker 1>one argument over the other, And how do you train

0:21:19.800 --> 0:21:23.639
<v Speaker 1>an artificial as in system to make the distinction. This

0:21:23.840 --> 0:21:27.240
<v Speaker 1>is kind of another example of the problems that welcome

0:21:27.280 --> 0:21:30.440
<v Speaker 1>to them. I have a question about what could come

0:21:30.520 --> 0:21:33.960
<v Speaker 1>out of AI research like this, because I would say,

0:21:33.960 --> 0:21:36.879
<v Speaker 1>from my personal perspective, I think studying rhetoric and debate

0:21:37.040 --> 0:21:43.040
<v Speaker 1>is extremely important, but not necessarily because getting into debates

0:21:43.160 --> 0:21:45.760
<v Speaker 1>is a good way to figure out what's true and

0:21:45.920 --> 0:21:48.040
<v Speaker 1>establish you know, the right thing to do. I think

0:21:48.119 --> 0:21:50.720
<v Speaker 1>one of the most important reasons to study rhetoric and

0:21:50.760 --> 0:21:54.800
<v Speaker 1>debate is so that you can understand how other people's

0:21:54.960 --> 0:21:58.760
<v Speaker 1>arguments and persuasive appeals are operating on you, or are

0:21:58.880 --> 0:22:02.280
<v Speaker 1>designed to operate you. A clear understanding of rhetoric can

0:22:02.280 --> 0:22:04.959
<v Speaker 1>be a kind of suit of armor for going into

0:22:05.160 --> 0:22:08.879
<v Speaker 1>you know, the world and seeing how political actors and

0:22:08.960 --> 0:22:11.960
<v Speaker 1>business actors and advertising and all that is trying to

0:22:12.040 --> 0:22:15.720
<v Speaker 1>affect you. Do you see project debate or serving any

0:22:15.800 --> 0:22:19.080
<v Speaker 1>kind of educational purpose like this in the world today.

0:22:19.560 --> 0:22:25.080
<v Speaker 1>So there are several levels by which I can I

0:22:25.119 --> 0:22:30.680
<v Speaker 1>can answer that. The first one is that this kind

0:22:30.680 --> 0:22:36.320
<v Speaker 1>of technology is is definitely relevant and we believe highly

0:22:36.520 --> 0:22:42.200
<v Speaker 1>valuable in the context of education. You can imagine using

0:22:42.240 --> 0:22:47.080
<v Speaker 1>the technology in order to build better arguments and more

0:22:47.119 --> 0:22:53.640
<v Speaker 1>of all, to perform a more analytical and perhaps more

0:22:53.680 --> 0:23:01.640
<v Speaker 1>objective analysis off complex and controversial topics. This is one aspect.

0:23:02.560 --> 0:23:06.280
<v Speaker 1>There is another aspect, but often when we debate is

0:23:06.920 --> 0:23:13.639
<v Speaker 1>other humans. There are many layouts that that are involved

0:23:13.840 --> 0:23:16.560
<v Speaker 1>in this discussion. In this debate. What all of them

0:23:16.560 --> 0:23:20.119
<v Speaker 1>are related? To the facts and to the arguments that

0:23:20.200 --> 0:23:23.400
<v Speaker 1>we are raising. Perhaps we have history with that Belton,

0:23:23.880 --> 0:23:28.040
<v Speaker 1>Perhaps we have history with ourselves that actually impact our

0:23:28.119 --> 0:23:32.600
<v Speaker 1>on part and decisions. Perhaps other people are listening and

0:23:32.680 --> 0:23:38.480
<v Speaker 1>this actually improvides contact, uh that impact what is happening.

0:23:38.960 --> 0:23:42.720
<v Speaker 1>And we are curious about this option of the dating

0:23:42.800 --> 0:23:47.680
<v Speaker 1>with the machine in the privacy of your office. Maybe

0:23:47.720 --> 0:23:51.840
<v Speaker 1>this is a different form of a discussion that to

0:23:52.000 --> 0:23:58.159
<v Speaker 1>some extent is perhaps all free of of external biases

0:23:58.240 --> 0:24:04.280
<v Speaker 1>and maybe will enable treat some people to identify situations

0:24:04.280 --> 0:24:08.000
<v Speaker 1>where they have a blind book and to better listen

0:24:08.320 --> 0:24:12.159
<v Speaker 1>to the other side. So I think in this case

0:24:12.280 --> 0:24:17.400
<v Speaker 1>the whole of the technology could be quite instrumental and positive.

0:24:17.840 --> 0:24:21.880
<v Speaker 1>The false business applications that are also very interesting from

0:24:21.920 --> 0:24:28.240
<v Speaker 1>the IBM perspective and uh, and this is another another dimension,

0:24:28.520 --> 0:24:37.280
<v Speaker 1>another level by which we can consider the technology as exacuable. Again,

0:24:37.280 --> 0:24:39.720
<v Speaker 1>big thanks to No One slow name for taking time

0:24:39.760 --> 0:24:41.600
<v Speaker 1>to chat with us. And now we're going to go

0:24:41.640 --> 0:24:49.240
<v Speaker 1>straight into our second talk on the subject with Madu Matt.

0:24:49.520 --> 0:24:51.960
<v Speaker 1>Thanks so much for joining us today. Could you start

0:24:52.000 --> 0:24:56.320
<v Speaker 1>off by introducing yourself and talking about your role at IBM. Yeah, absolutely,

0:24:56.440 --> 0:24:59.920
<v Speaker 1>and really nice to meet you. Robert and Joe uh

0:25:00.040 --> 0:25:06.040
<v Speaker 1>maduco Chi, vice President Offering Management in Data and AI IBM,

0:25:06.080 --> 0:25:09.159
<v Speaker 1>And the role of offering management is really all about

0:25:09.680 --> 0:25:14.040
<v Speaker 1>laying down the strategy and then delivering and executing towards

0:25:14.119 --> 0:25:18.520
<v Speaker 1>such strategy. And I'm based out of San Jose, Sunny, California, excellent.

0:25:19.359 --> 0:25:21.679
<v Speaker 1>So just to kick things off here, um, you know

0:25:21.680 --> 0:25:24.600
<v Speaker 1>we're gonna be talking a lot about AI here, and

0:25:25.440 --> 0:25:28.280
<v Speaker 1>it makes sense to to to really get into what

0:25:28.359 --> 0:25:31.520
<v Speaker 1>we mean when we're talking about AI for business. How

0:25:31.560 --> 0:25:35.560
<v Speaker 1>does AI serve business compared to the way it serves consumers.

0:25:36.359 --> 0:25:39.520
<v Speaker 1>That's a great question to get started on. UM so

0:25:40.160 --> 0:25:44.679
<v Speaker 1>redeveloped a thesis a couple of years ago about really

0:25:44.720 --> 0:25:50.000
<v Speaker 1>how AI for business would be different from consumer AI.

0:25:50.280 --> 0:25:53.159
<v Speaker 1>Think of consumer AI, which we all know work with

0:25:53.200 --> 0:25:58.200
<v Speaker 1>our smartphones, smart speakers, social media, photos, everything what it comes.

0:25:58.280 --> 0:26:01.440
<v Speaker 1>But when it comes for AI for business, it's really

0:26:01.560 --> 0:26:06.880
<v Speaker 1>very very different. AI for business is all about automation,

0:26:07.280 --> 0:26:12.560
<v Speaker 1>optimization and making better predictions, and it requires really a

0:26:12.680 --> 0:26:16.000
<v Speaker 1>very different set of technical capabilities, like you would have

0:26:16.040 --> 0:26:19.080
<v Speaker 1>to understand how to deal with language, have to deal

0:26:19.119 --> 0:26:23.240
<v Speaker 1>with what does automation means, and then be able to

0:26:23.680 --> 0:26:28.120
<v Speaker 1>have the explainability and trust up AI. UM. So that's

0:26:28.119 --> 0:26:31.200
<v Speaker 1>sort of the big difference between commercial AI and AI

0:26:31.280 --> 0:26:33.560
<v Speaker 1>for business. So we know that one of the big

0:26:33.600 --> 0:26:36.280
<v Speaker 1>AI projects at IBM is Watson. Could you tell us

0:26:36.320 --> 0:26:39.880
<v Speaker 1>about Watson and explain how Watson fits into the broader

0:26:39.920 --> 0:26:44.960
<v Speaker 1>picture of recent advancements in AI. Sure you you might

0:26:45.000 --> 0:26:47.760
<v Speaker 1>have heard of Watson, and our audience might have heard

0:26:47.800 --> 0:26:50.800
<v Speaker 1>of Watson, which came out when we first did our

0:26:52.000 --> 0:26:56.239
<v Speaker 1>UH in Jeopardy and people remember Watson from there. But

0:26:56.680 --> 0:26:59.840
<v Speaker 1>fast forward, a lot of work done around Watson. Think

0:26:59.840 --> 0:27:05.160
<v Speaker 1>of Watson as our definition of IBM AI. We evolved

0:27:05.200 --> 0:27:10.120
<v Speaker 1>a lot um since then, and our strategic intent always

0:27:10.160 --> 0:27:15.240
<v Speaker 1>has been to have what's an available anywhere meaning available

0:27:15.359 --> 0:27:20.600
<v Speaker 1>on any cloud. UH. We have focused on Watson. UH

0:27:21.080 --> 0:27:24.400
<v Speaker 1>we call with Watson meaning it's embedded in almost all

0:27:24.440 --> 0:27:29.080
<v Speaker 1>your applications. So for example, UM, I use the world

0:27:29.119 --> 0:27:32.159
<v Speaker 1>a lot for AI for AI. What does that mean? Like,

0:27:32.280 --> 0:27:35.879
<v Speaker 1>how do we embed AI in our data sciences and

0:27:35.960 --> 0:27:40.840
<v Speaker 1>in our data data platforms and such. The other parts

0:27:40.880 --> 0:27:44.480
<v Speaker 1>of evolution has been you know, as I said earlier,

0:27:44.720 --> 0:27:48.320
<v Speaker 1>from our AI for business is all about automation. How

0:27:48.359 --> 0:27:52.960
<v Speaker 1>do we UH evolve into the workflow AI that matters

0:27:53.359 --> 0:27:57.960
<v Speaker 1>for our clients and our our society. So the workflows

0:27:58.280 --> 0:28:02.439
<v Speaker 1>could definition could be you know, customer care, uh in

0:28:02.680 --> 0:28:07.600
<v Speaker 1>I t asset management, in your regulatory or compliance, in

0:28:07.760 --> 0:28:11.960
<v Speaker 1>supply chain or in your planning and budgeting. Right, these

0:28:12.000 --> 0:28:15.679
<v Speaker 1>are how you can really embed AI and that is

0:28:15.720 --> 0:28:20.720
<v Speaker 1>where Watson has really evolved into. And we have also

0:28:20.960 --> 0:28:25.840
<v Speaker 1>been delivering now Watson an AI capability in a in

0:28:25.880 --> 0:28:30.520
<v Speaker 1>our integrated single platform we call cloud Pacer data. So

0:28:30.880 --> 0:28:33.480
<v Speaker 1>a long way. We came from Jeopardy Days and then

0:28:33.560 --> 0:28:36.359
<v Speaker 1>you just heard from nome where we landed with Debater.

0:28:36.920 --> 0:28:42.000
<v Speaker 1>So speaking of Debater, what capabilities has IBM commercialized from

0:28:42.080 --> 0:28:46.840
<v Speaker 1>Project Debater into Watson? So that's a great question. Um,

0:28:47.000 --> 0:28:52.200
<v Speaker 1>A lot of commercialization has happened. We have uh pretty

0:28:52.240 --> 0:28:56.080
<v Speaker 1>good rich set of products like what's an assistant, what's

0:28:56.120 --> 0:28:59.800
<v Speaker 1>on discovery, what's on knowledge, language understanding? And I know

0:28:59.840 --> 0:29:02.120
<v Speaker 1>the are just works, but let me just give a

0:29:03.040 --> 0:29:05.960
<v Speaker 1>bit of a background on what what's an assistant is?

0:29:06.040 --> 0:29:11.560
<v Speaker 1>What's an assistant is? Our conversational AI platform really helps

0:29:11.600 --> 0:29:17.800
<v Speaker 1>provide customer fast, straightforward answer, accurate answers UM across any application,

0:29:17.960 --> 0:29:21.720
<v Speaker 1>device or cloud right, UM and our discovery is all

0:29:21.760 --> 0:29:27.480
<v Speaker 1>about enterprise search and AI search technology that truly retrieves

0:29:27.720 --> 0:29:32.400
<v Speaker 1>specific answers to your questions while you're analyzing trends and

0:29:32.480 --> 0:29:36.760
<v Speaker 1>relationships in the enterprise data. So we've been looking at

0:29:36.840 --> 0:29:39.320
<v Speaker 1>debater and some of the key technologies. Let me give

0:29:39.320 --> 0:29:44.840
<v Speaker 1>you an example of few UM like sentiment analysis. Uh,

0:29:45.120 --> 0:29:48.200
<v Speaker 1>let me pose a problem statement, what does that really mean? So,

0:29:48.320 --> 0:29:54.600
<v Speaker 1>for example, today Watson does not understand idioms or sentiment shifters,

0:29:54.600 --> 0:29:58.040
<v Speaker 1>and neither does any other competitor operates out there also,

0:29:58.520 --> 0:30:04.040
<v Speaker 1>So think of elements which include hardly helpful, over the moon,

0:30:04.640 --> 0:30:08.000
<v Speaker 1>cold feet, UM all years. You know, how do you

0:30:08.080 --> 0:30:12.040
<v Speaker 1>make that analysis and figure figure this out? What is

0:30:12.080 --> 0:30:15.520
<v Speaker 1>the real context behind this? So what we have done

0:30:15.720 --> 0:30:19.000
<v Speaker 1>with that is that now what's on leverages this debat

0:30:19.160 --> 0:30:24.760
<v Speaker 1>technology and looks at these idioms and sentiment shifters and

0:30:24.840 --> 0:30:28.840
<v Speaker 1>does the analysis starting with better understanding of this sentiment

0:30:28.880 --> 0:30:31.840
<v Speaker 1>and analysis is one of the most widely used API

0:30:31.920 --> 0:30:35.400
<v Speaker 1>s for us UM. This already exists today in our

0:30:35.480 --> 0:30:40.520
<v Speaker 1>product portfolio. What's coming into the future is UM. It's

0:30:40.600 --> 0:30:44.320
<v Speaker 1>around all around documents. So let me put a perspective

0:30:44.360 --> 0:30:49.800
<v Speaker 1>around a problem statement. There are many regulatory documents such

0:30:49.800 --> 0:30:57.080
<v Speaker 1>as contracts or security filings which contains important clauses that

0:30:57.160 --> 0:31:02.120
<v Speaker 1>have really really serious business implications for example, payment terms,

0:31:02.680 --> 0:31:08.680
<v Speaker 1>obligations made to regulatory bodies, or warranties. Such humans can

0:31:08.720 --> 0:31:14.240
<v Speaker 1>spend countless hours reading and extracting the information so they

0:31:14.280 --> 0:31:18.560
<v Speaker 1>remain compliant. Although we can provide some of the out

0:31:18.560 --> 0:31:21.640
<v Speaker 1>of the box models for contracts and invoices and such,

0:31:22.120 --> 0:31:25.320
<v Speaker 1>but it creates UM but client may still need to

0:31:25.320 --> 0:31:28.920
<v Speaker 1>create their own element classifications of business classes. So the

0:31:29.080 --> 0:31:34.440
<v Speaker 1>solution has been with our debaters birth based classification technology

0:31:34.520 --> 0:31:37.720
<v Speaker 1>into these products so we can learn with few one

0:31:38.160 --> 0:31:42.960
<v Speaker 1>samples to do new classification of elements. Business documents could

0:31:43.000 --> 0:31:47.200
<v Speaker 1>include contracts, invoices, and procurement contracts. The end of the day,

0:31:47.240 --> 0:31:51.840
<v Speaker 1>it really really excelerates the outcomes what the businesses would

0:31:51.880 --> 0:31:58.720
<v Speaker 1>be looking for. UM. Other technology is around summarization. So

0:31:58.840 --> 0:32:02.040
<v Speaker 1>the problem statement here is like when you're looking for

0:32:02.720 --> 0:32:07.680
<v Speaker 1>information customer or employee who may have aggregate research from

0:32:07.720 --> 0:32:12.959
<v Speaker 1>different sources, clicking through multiple links and pages and finding

0:32:13.040 --> 0:32:17.000
<v Speaker 1>exactly what they need can be very very difficult, right.

0:32:17.000 --> 0:32:21.320
<v Speaker 1>It can take months, weeks and months sometimes to your years.

0:32:21.360 --> 0:32:25.880
<v Speaker 1>So with Watson and Debater technology, we can analyze variety

0:32:25.960 --> 0:32:29.600
<v Speaker 1>of these sources and provide a summary or brief of

0:32:29.680 --> 0:32:34.240
<v Speaker 1>the ideas and the information which is contained within UM

0:32:34.280 --> 0:32:37.120
<v Speaker 1>that's coming up. We're going to be leveraging this technology

0:32:37.120 --> 0:32:42.280
<v Speaker 1>in our Watson discovery portfolio in second half. The other

0:32:42.520 --> 0:32:47.160
<v Speaker 1>interesting UM issues we see today is like in our

0:32:47.200 --> 0:32:53.960
<v Speaker 1>traditional UH rule based systems for contact centers, it categorizes

0:32:54.280 --> 0:32:57.760
<v Speaker 1>large fraction of calls in a very generic bucket like

0:32:57.840 --> 0:33:01.160
<v Speaker 1>it says, you know, like not uncommon to see more

0:33:01.200 --> 0:33:05.400
<v Speaker 1>than maybe of calling a call center for a bank,

0:33:05.800 --> 0:33:08.160
<v Speaker 1>which says, hey, this this call was just made for

0:33:08.640 --> 0:33:12.480
<v Speaker 1>generalized checking, and it prevents the company from creating any

0:33:12.640 --> 0:33:17.280
<v Speaker 1>robust self service. So with Debater technology, now we can

0:33:17.360 --> 0:33:22.880
<v Speaker 1>leverage advanced topic clustering, which enables users to cluster this

0:33:23.040 --> 0:33:28.760
<v Speaker 1>incoming data in a meaningful topics of related information and

0:33:28.840 --> 0:33:33.680
<v Speaker 1>automatically this can be analyzed. So think of discovery of

0:33:33.760 --> 0:33:37.120
<v Speaker 1>a content minor which will be enhanced with this type

0:33:37.120 --> 0:33:42.080
<v Speaker 1>of a technology to extract better topics from very large

0:33:42.160 --> 0:33:46.640
<v Speaker 1>data sets and then make the topic extraction more business

0:33:46.720 --> 0:33:50.200
<v Speaker 1>user friendly. So a lot of stuff. I give a

0:33:50.240 --> 0:33:53.120
<v Speaker 1>lot of examples, but sort of the gist of all

0:33:53.200 --> 0:33:59.680
<v Speaker 1>this is, Look, it's going to impact businesses real outcomes, right,

0:33:59.760 --> 0:34:02.280
<v Speaker 1>It's going to save them time, is going to automate

0:34:02.320 --> 0:34:04.880
<v Speaker 1>the process, it's going to remove a lot of human

0:34:05.040 --> 0:34:09.880
<v Speaker 1>error which comes with it, and really speak towards the productivity.

0:34:10.200 --> 0:34:14.040
<v Speaker 1>Is going to speak towards the clients UM and P

0:34:14.200 --> 0:34:17.239
<v Speaker 1>as their own promoter scores and such, and so that's

0:34:17.280 --> 0:34:19.560
<v Speaker 1>really the gist of what we're looking to drive out

0:34:19.560 --> 0:34:22.920
<v Speaker 1>of the debater technology. If I'm understanding this correctly, this

0:34:23.000 --> 0:34:26.200
<v Speaker 1>is interesting that it's interesting that this kind of functionality

0:34:26.280 --> 0:34:30.000
<v Speaker 1>would come out of an AI debate tool, because debate

0:34:30.040 --> 0:34:32.600
<v Speaker 1>and persuasion that will seem like the kinds of things

0:34:32.640 --> 0:34:35.880
<v Speaker 1>that would be inherently the most difficult to master with AI,

0:34:35.960 --> 0:34:38.719
<v Speaker 1>because you've got all these elements of style and subtlety

0:34:39.120 --> 0:34:41.959
<v Speaker 1>things that are really difficult to quantify to make into

0:34:42.160 --> 0:34:45.960
<v Speaker 1>two understandable data. But out of the debater technology, it

0:34:46.000 --> 0:34:48.000
<v Speaker 1>sounds like you're saying that you're actually getting a lot

0:34:48.000 --> 0:34:52.080
<v Speaker 1>of derivative technologies that are good at dealing with algorithmic

0:34:52.200 --> 0:34:55.680
<v Speaker 1>types of text like legal documents. Am I getting this right?

0:34:55.800 --> 0:34:59.200
<v Speaker 1>Like that you could have a piece of software that

0:34:59.280 --> 0:35:02.920
<v Speaker 1>works like a lawyer. Uh, and it can explain this

0:35:03.080 --> 0:35:05.600
<v Speaker 1>contract to you when it's going over your head. And

0:35:05.840 --> 0:35:08.759
<v Speaker 1>this kind of thing is possible now because of how

0:35:08.840 --> 0:35:12.719
<v Speaker 1>formulaic and algorithmic legal documents tend to be. Would that

0:35:12.760 --> 0:35:16.040
<v Speaker 1>be a correct understanding? Yeah, no, totally And if I may,

0:35:16.360 --> 0:35:20.560
<v Speaker 1>UM give you one of the client example, especially as

0:35:20.560 --> 0:35:25.200
<v Speaker 1>you started talking about legal UM, Legal Nations platform actually

0:35:25.200 --> 0:35:29.719
<v Speaker 1>provides this in house legal teams and outside console the

0:35:29.719 --> 0:35:33.960
<v Speaker 1>ability to respond to their lawsuits UM and draft their

0:35:34.000 --> 0:35:37.720
<v Speaker 1>initial round up discovery requests literally less than two minutes

0:35:37.840 --> 0:35:43.279
<v Speaker 1>right um and which shaved off about ten hours of

0:35:43.480 --> 0:35:47.319
<v Speaker 1>attorney times on each of these lawsuits. So the real

0:35:47.400 --> 0:35:50.879
<v Speaker 1>direct outcomes of usage of this technology. So you've been

0:35:50.880 --> 0:35:53.919
<v Speaker 1>talking about big business applications, but I also wonder about

0:35:53.920 --> 0:35:58.160
<v Speaker 1>applications directly for the consumer. Where, for example, because you

0:35:58.200 --> 0:36:01.960
<v Speaker 1>have a program that ingests legal documents, so you you

0:36:02.040 --> 0:36:04.920
<v Speaker 1>feed it some contract you're thinking about signing, and then

0:36:04.960 --> 0:36:07.120
<v Speaker 1>you say, I have a question because I'm not a lawyer,

0:36:07.239 --> 0:36:09.759
<v Speaker 1>I don't understand what I would be bound to do

0:36:09.960 --> 0:36:13.080
<v Speaker 1>under this agreement. And then you could feed the contract

0:36:13.160 --> 0:36:16.960
<v Speaker 1>in and pose questions to your AI legal assistant in

0:36:17.080 --> 0:36:21.359
<v Speaker 1>natural language. Can you see a future like that. We do,

0:36:21.640 --> 0:36:25.319
<v Speaker 1>and we already have a product like what'son Assistant, which

0:36:25.320 --> 0:36:27.680
<v Speaker 1>is for customer care. It feeds on a lot of

0:36:27.760 --> 0:36:31.120
<v Speaker 1>you know, pre train models, like especially now in COVID

0:36:31.239 --> 0:36:37.720
<v Speaker 1>nineteen right, Uh, a situation where our government offices and

0:36:38.400 --> 0:36:42.000
<v Speaker 1>our healthcare are getting in dated by calls. Right, So

0:36:43.080 --> 0:36:48.000
<v Speaker 1>leveraging this UM what'son Assistant in front is really helping

0:36:48.040 --> 0:36:50.960
<v Speaker 1>them deflect a lot of those phone calls and get

0:36:51.000 --> 0:36:54.759
<v Speaker 1>the accurate answers in hands of the consumers. So you know,

0:36:55.120 --> 0:36:57.399
<v Speaker 1>this is what we are focusing on around customer care.

0:36:57.680 --> 0:37:00.640
<v Speaker 1>But yeah, in the future, I mean this similar technology

0:37:00.640 --> 0:37:05.960
<v Speaker 1>and leveraging UM the from debater, we can actually go

0:37:06.120 --> 0:37:10.440
<v Speaker 1>into any domain. Right, we have the right framework and

0:37:10.480 --> 0:37:15.000
<v Speaker 1>we have the right technology to go pursue those different domains.

0:37:15.040 --> 0:37:17.040
<v Speaker 1>I guess this sets us up for a bigger question,

0:37:17.160 --> 0:37:20.200
<v Speaker 1>which is what is the overall role of natural language

0:37:20.200 --> 0:37:24.040
<v Speaker 1>processing in the landscape of AI today and also which

0:37:24.040 --> 0:37:27.240
<v Speaker 1>are the elements of natural language processing that we've really

0:37:27.280 --> 0:37:29.239
<v Speaker 1>gotten a lot better at and which are the ones

0:37:29.280 --> 0:37:32.879
<v Speaker 1>that are still a major challenge. Yeah, great question. As

0:37:32.880 --> 0:37:35.719
<v Speaker 1>we all know, right, language have existed. I don't know

0:37:35.760 --> 0:37:39.720
<v Speaker 1>a hundred thousand plus years. You know, started as speech

0:37:39.800 --> 0:37:43.480
<v Speaker 1>probably people started to talk and the writing came perhaps

0:37:43.560 --> 0:37:47.359
<v Speaker 1>much later. Um, and we write in ways we don't

0:37:47.400 --> 0:37:52.000
<v Speaker 1>talk also, right, it's a lot more descriptive and more reflective.

0:37:52.760 --> 0:37:57.719
<v Speaker 1>And so now with things where we can compute at

0:37:57.840 --> 0:38:02.399
<v Speaker 1>larger with open data sets and transfer learnings, n LP

0:38:02.680 --> 0:38:06.719
<v Speaker 1>natural language processing really really is the inflection point, right,

0:38:06.760 --> 0:38:10.080
<v Speaker 1>And some of the examples I shared earlier around the

0:38:10.160 --> 0:38:16.720
<v Speaker 1>sentiment analysis and summarization and clustering, these are all such

0:38:16.880 --> 0:38:23.000
<v Speaker 1>critical aspects of taking LP, not just natural language processing,

0:38:23.040 --> 0:38:27.080
<v Speaker 1>but natural language understanding, natural language generations is all going

0:38:27.120 --> 0:38:30.160
<v Speaker 1>to come through all of that. And we really think

0:38:30.719 --> 0:38:34.440
<v Speaker 1>with the Debater technology it really puts us in a

0:38:35.160 --> 0:38:37.880
<v Speaker 1>in a leader quadrant here a lot more work to

0:38:37.920 --> 0:38:41.400
<v Speaker 1>be done, but the the end goal is yes, we

0:38:41.440 --> 0:38:44.280
<v Speaker 1>can continue to research on these things, but how quickly

0:38:44.360 --> 0:38:47.600
<v Speaker 1>we commercialize it and how quick quickly we help our

0:38:47.640 --> 0:38:51.200
<v Speaker 1>clients and users to see the outcomes what are needed

0:38:51.239 --> 0:38:54.520
<v Speaker 1>here and make them a lot more productive. So how

0:38:54.560 --> 0:38:58.879
<v Speaker 1>many languages does Project Debater and Watson together, how many

0:38:58.920 --> 0:39:03.400
<v Speaker 1>do they understand support today? We started with obviously English,

0:39:03.560 --> 0:39:10.200
<v Speaker 1>we are expanding now to French, Spanish, German in in

0:39:10.320 --> 0:39:12.880
<v Speaker 1>the second half of this year, and then very soon

0:39:12.960 --> 0:39:18.799
<v Speaker 1>will expand to Dutch, French, Arabic, Chinese both traditional and simplified,

0:39:19.360 --> 0:39:24.440
<v Speaker 1>and Italian. UM And obviously we are choosing these based

0:39:24.440 --> 0:39:26.920
<v Speaker 1>on where we are seeing most of our growth and

0:39:27.960 --> 0:39:31.560
<v Speaker 1>an adoption. What are some additional examples of how these

0:39:31.560 --> 0:39:35.839
<v Speaker 1>commercialized capabilities can be used by clients? Great question, um.

0:39:35.880 --> 0:39:40.000
<v Speaker 1>I gave you an example earlier on legal missions. The

0:39:40.080 --> 0:39:42.880
<v Speaker 1>other one, which is very close to my heart is

0:39:43.800 --> 0:39:50.640
<v Speaker 1>um RBS with Watson. Watson RBS built Cora, which is it?

0:39:50.800 --> 0:39:53.719
<v Speaker 1>Which is their digital assistant that helps better serve their

0:39:53.760 --> 0:39:59.719
<v Speaker 1>customers through first time problem resolutions. Cora is trained with

0:39:59.800 --> 0:40:06.200
<v Speaker 1>the were one thousand responses to more than two customer queries. However,

0:40:06.480 --> 0:40:09.160
<v Speaker 1>if she doesn't know an answer or she sends that

0:40:09.239 --> 0:40:13.759
<v Speaker 1>customer is getting angry or frustrated, she will transfer it

0:40:13.800 --> 0:40:18.680
<v Speaker 1>to a live agent. Now, with improved sentiment analysis from Debater,

0:40:18.840 --> 0:40:22.680
<v Speaker 1>as I mentioned earlier, we hope that clients like RBS

0:40:22.760 --> 0:40:26.920
<v Speaker 1>will be able to better serve their customers by having

0:40:27.080 --> 0:40:32.200
<v Speaker 1>digital assistance that better understand the subtleties of the of

0:40:32.280 --> 0:40:36.000
<v Speaker 1>the sentiments of the clients. So for example, the phrase

0:40:36.760 --> 0:40:41.080
<v Speaker 1>over the moon might be interpreted as literally about the

0:40:41.080 --> 0:40:47.000
<v Speaker 1>planetary satellite and not as excited or elated. Right. So

0:40:47.200 --> 0:40:50.600
<v Speaker 1>this is what with Project Debater core AI built into

0:40:50.680 --> 0:40:55.520
<v Speaker 1>IBM Watson, it can understand these idioms helping clients like

0:40:55.760 --> 0:41:00.560
<v Speaker 1>RBS to better serve their customers. The other example switching

0:41:00.560 --> 0:41:05.400
<v Speaker 1>into financial like Credit Mutual, they had over five thousand

0:41:05.480 --> 0:41:11.560
<v Speaker 1>branches and they receive more than three fifty thousand online

0:41:11.560 --> 0:41:15.399
<v Speaker 1>inquiries a day and the volume is growing at least

0:41:15.440 --> 0:41:19.080
<v Speaker 1>twenty three percent a year. So now with Watson infused

0:41:19.200 --> 0:41:25.360
<v Speaker 1>email analyzer, they can help deflect and address of the

0:41:25.440 --> 0:41:31.760
<v Speaker 1>three daily emails received by banks client advisors. So the

0:41:31.800 --> 0:41:36.480
<v Speaker 1>implementation of the topic clustering from Debater, we believe now

0:41:36.560 --> 0:41:40.600
<v Speaker 1>clients with similar needs that Credit Mutual will enable more

0:41:40.640 --> 0:41:45.600
<v Speaker 1>self service by identifying clusters are commonly as topics and

0:41:45.680 --> 0:41:50.200
<v Speaker 1>can be converted into self service content. Right. So to me,

0:41:50.320 --> 0:41:53.600
<v Speaker 1>the examples like this are just amazing because I can

0:41:53.680 --> 0:41:58.040
<v Speaker 1>totally then connect the dots between technology, the usage and

0:41:58.080 --> 0:42:02.360
<v Speaker 1>the outcome, right, a win win situation. We've got multiple

0:42:02.400 --> 0:42:06.040
<v Speaker 1>other examples as well, Roberts, and we're going to continue

0:42:06.040 --> 0:42:10.279
<v Speaker 1>to be focusing on how do we really not just

0:42:10.400 --> 0:42:15.480
<v Speaker 1>commercialize it, but I believe in AI is really meant

0:42:15.520 --> 0:42:19.759
<v Speaker 1>to improve our society as well, right, make us more

0:42:19.800 --> 0:42:23.279
<v Speaker 1>productive and do better things, especially the world we are

0:42:23.320 --> 0:42:25.920
<v Speaker 1>living in with COVID and other things which are happening

0:42:26.000 --> 0:42:29.520
<v Speaker 1>around us. Right, Um, the goodness of AI needs to

0:42:29.560 --> 0:42:33.640
<v Speaker 1>be there, so very critical overall, what do you see

0:42:33.680 --> 0:42:36.839
<v Speaker 1>as the best possible role for AI, not just as

0:42:36.840 --> 0:42:40.080
<v Speaker 1>a tool for business, but as a society. What could

0:42:40.160 --> 0:42:43.080
<v Speaker 1>it do for us in the best case scenario? Yeah,

0:42:43.160 --> 0:42:47.200
<v Speaker 1>I mean that's a great question, right um. To me fundamentally,

0:42:47.239 --> 0:42:50.280
<v Speaker 1>I mean there are many examples, but one most critical

0:42:50.360 --> 0:42:53.600
<v Speaker 1>which comes to my mind is how AI can really

0:42:53.640 --> 0:42:56.919
<v Speaker 1>help us detect bias? Right, A lot of our data

0:42:56.920 --> 0:43:03.520
<v Speaker 1>sets and it has been built by humans with unbiased

0:43:04.120 --> 0:43:08.000
<v Speaker 1>goes into those data. Right, AI can really start separating

0:43:08.040 --> 0:43:13.160
<v Speaker 1>that help us detect bias and and make our products better,

0:43:13.560 --> 0:43:16.360
<v Speaker 1>makes our society better. So that to me is the

0:43:17.480 --> 0:43:19.399
<v Speaker 1>would be sort of the holy grail if I can

0:43:19.440 --> 0:43:24.720
<v Speaker 1>achieve that. All right, So there you have it. Thanks

0:43:24.800 --> 0:43:27.960
<v Speaker 1>once again to know I'm slow name and Maduka char

0:43:28.120 --> 0:43:30.000
<v Speaker 1>for taking time out of their busy days to chat

0:43:30.040 --> 0:43:33.880
<v Speaker 1>with us about this topic. For more information on smart Talks,

0:43:34.040 --> 0:43:37.680
<v Speaker 1>visit IBM dot com slash smart Talks, and if you'd

0:43:37.680 --> 0:43:39.480
<v Speaker 1>like to learn more about n LP, you can go

0:43:39.520 --> 0:43:44.320
<v Speaker 1>to IBM dot com slash Watson, Slash Natural dash Language,

0:43:44.400 --> 0:43:47.239
<v Speaker 1>dash Processing. And if you would like to learn more

0:43:47.280 --> 0:43:49.799
<v Speaker 1>about our show, well, you can find us wherever you

0:43:49.840 --> 0:43:53.239
<v Speaker 1>get your podcasts and wherever that happens to be. Just

0:43:53.320 --> 0:43:58.040
<v Speaker 1>make sure you rate, review and subscribe. Huge thanks as

0:43:58.040 --> 0:44:01.480
<v Speaker 1>always to our excellent audio producers eth Nicholas Johnson. If

0:44:01.520 --> 0:44:02.960
<v Speaker 1>you would like to get in touch with us with

0:44:03.120 --> 0:44:05.680
<v Speaker 1>feedback on this episode or any other, to suggest a

0:44:05.719 --> 0:44:07.960
<v Speaker 1>topic for the future, or just to say hello, you

0:44:07.960 --> 0:44:10.799
<v Speaker 1>can email us at contact at stuff to Blow your

0:44:10.800 --> 0:44:21.120
<v Speaker 1>Mind dot com. Stuff to Blow Your Mind is production

0:44:21.160 --> 0:44:23.920
<v Speaker 1>of I Heart Radio. For more podcasts for my Heart Radio,

0:44:24.120 --> 0:44:26.800
<v Speaker 1>visit the i Heart Radio app, Apple Podcasts, or wherever

0:44:26.840 --> 0:44:36.120
<v Speaker 1>you listening to your favorite shows,