WEBVTT - Smart Talks With IBM: Transformations in AI: why foundation models are the future

0:00:00.120 --> 0:00:02.840
<v Speaker 1>Hey everyone, it's Robert and Joe here. Today we've got

0:00:02.840 --> 0:00:04.720
<v Speaker 1>something a little bit different to share with you. It

0:00:04.840 --> 0:00:08.000
<v Speaker 1>is a new season of the Smart Talks with IBM

0:00:08.119 --> 0:00:09.119
<v Speaker 1>podcast series.

0:00:09.600 --> 0:00:11.680
<v Speaker 2>Today we are witnessed to one of those rare moments

0:00:11.680 --> 0:00:14.360
<v Speaker 2>in history, the rise of an innovative technology with the

0:00:14.360 --> 0:00:18.680
<v Speaker 2>potential to radically transform business and society forever. The technology,

0:00:18.760 --> 0:00:22.160
<v Speaker 2>of course, is artificial intelligence, and it's the central focus

0:00:22.239 --> 0:00:24.800
<v Speaker 2>for this new season of Smart Talks with IBM.

0:00:25.320 --> 0:00:28.400
<v Speaker 1>Join hosts from your favorite Pushkin podcasts as they talk

0:00:28.480 --> 0:00:31.640
<v Speaker 1>with industry experts and leaders to explore how businesses can

0:00:31.680 --> 0:00:35.360
<v Speaker 1>integrate AI into their workflows and help drive real change

0:00:35.400 --> 0:00:38.160
<v Speaker 1>in this new era of AI. And of course, host

0:00:38.280 --> 0:00:40.440
<v Speaker 1>Malcolm Gladwell will be there to guide you through the

0:00:40.479 --> 0:00:42.640
<v Speaker 1>season and throw in his two cents as well.

0:00:43.120 --> 0:00:46.120
<v Speaker 2>Look out for new episodes of Smart Talks with IBM

0:00:46.400 --> 0:00:49.519
<v Speaker 2>every other week on the iHeartRadio app, Apple Podcasts, or

0:00:49.560 --> 0:00:53.360
<v Speaker 2>wherever you get your podcasts, and learn more at IBM

0:00:53.479 --> 0:00:55.480
<v Speaker 2>dot com slash smart Talks.

0:00:57.720 --> 0:01:01.160
<v Speaker 3>Hello, Hello, Welcome to Smart Talks with IBA, a podcast

0:01:01.200 --> 0:01:06.480
<v Speaker 3>from Pushkin Industries, iHeartRadio and IBM. I'm Malcolm Gabwell. This

0:01:06.560 --> 0:01:11.200
<v Speaker 3>season we're continuing our conversation with new creators visionaries who

0:01:11.240 --> 0:01:15.319
<v Speaker 3>are creatively applying technology in business to drive change, but

0:01:15.440 --> 0:01:19.600
<v Speaker 3>with a focus on the transformative power of artificial intelligence

0:01:19.920 --> 0:01:22.640
<v Speaker 3>and what it means to leverage AI as a game

0:01:22.720 --> 0:01:27.440
<v Speaker 3>changing multiplier for your business. Our guest today is doctor

0:01:27.520 --> 0:01:32.720
<v Speaker 3>David Cox, VP of AI Models at IBM Research and

0:01:32.959 --> 0:01:37.520
<v Speaker 3>IBM Director of the MIT IBM Watson AI Lab, a

0:01:37.600 --> 0:01:42.480
<v Speaker 3>first of its kind industry academic collaboration between IBM and

0:01:42.640 --> 0:01:48.240
<v Speaker 3>MIT focused on the fundamental research of artificial intelligence. Over

0:01:48.240 --> 0:01:51.960
<v Speaker 3>the course of decades, David Cox watched as the AI

0:01:52.080 --> 0:01:55.920
<v Speaker 3>revolution steadily grew from the simmering ideas of a few

0:01:55.960 --> 0:02:00.840
<v Speaker 3>academics and technologists into the industrial boom we are experiencing today.

0:02:01.800 --> 0:02:04.720
<v Speaker 3>Having dedicated his life to pushing the field of AI

0:02:04.840 --> 0:02:09.280
<v Speaker 3>towards new horizons, David has both contributed to and presided

0:02:09.320 --> 0:02:14.520
<v Speaker 3>over many of the major breakthroughs in artificial intelligence. In

0:02:14.560 --> 0:02:18.680
<v Speaker 3>today's episode, you'll hear David explain some of the conceptual

0:02:18.840 --> 0:02:23.760
<v Speaker 3>underpinnings of the current AI landscape, things like foundation models,

0:02:23.800 --> 0:02:27.680
<v Speaker 3>in surprisingly comprehensible terms. I might add, we'll also get

0:02:27.720 --> 0:02:31.320
<v Speaker 3>into some of the amazing practical applications for AI in business,

0:02:31.480 --> 0:02:34.440
<v Speaker 3>as well as what implications AI will have for the

0:02:34.440 --> 0:02:38.200
<v Speaker 3>future of work and design. David spoke with Jacob Goldstein,

0:02:38.480 --> 0:02:42.560
<v Speaker 3>host of the Pushkin podcast What's Your Problem. A veteran

0:02:42.639 --> 0:02:46.000
<v Speaker 3>business journalist, Jacob has reported for The Wall Street Journal,

0:02:46.280 --> 0:02:49.160
<v Speaker 3>the Miami Herald, and was a longtime host of the

0:02:49.280 --> 0:02:58.960
<v Speaker 3>NPR program Planet Money. Okay, let's get to the interview.

0:03:00.840 --> 0:03:02.799
<v Speaker 4>Tell me about your job at IBM.

0:03:03.240 --> 0:03:03.360
<v Speaker 2>So.

0:03:03.680 --> 0:03:06.640
<v Speaker 5>I wear two hats at IBM. So one, I'm the

0:03:06.680 --> 0:03:10.080
<v Speaker 5>IBM Director of the MIT, IBM Watson AI Lab. So

0:03:10.160 --> 0:03:13.040
<v Speaker 5>that's a joint lab between IBM and MIT where we

0:03:13.600 --> 0:03:15.880
<v Speaker 5>try and invent what's next in AI. It's been running

0:03:15.919 --> 0:03:18.920
<v Speaker 5>for about five years, and then more recently I started

0:03:19.040 --> 0:03:21.800
<v Speaker 5>as the vice president for AI Models, and I'm in

0:03:21.880 --> 0:03:26.400
<v Speaker 5>charge of building IBM's foundation models, you know, building these

0:03:26.520 --> 0:03:28.600
<v Speaker 5>these big models, generative models that allow us to have

0:03:28.639 --> 0:03:30.680
<v Speaker 5>all kinds of new exciting capabilities in AI.

0:03:31.240 --> 0:03:33.160
<v Speaker 4>So, so I want to talk to you a lot

0:03:33.200 --> 0:03:36.640
<v Speaker 4>about foundation models, about genitive AI. But before we get

0:03:36.640 --> 0:03:38.720
<v Speaker 4>to that, let's just spend a minute on the on

0:03:38.840 --> 0:03:44.480
<v Speaker 4>the IBM MIT collaboration. Where did that partnership start, How

0:03:44.480 --> 0:03:45.280
<v Speaker 4>did it originate?

0:03:46.440 --> 0:03:49.280
<v Speaker 5>Yeah, So, actually it turns out that MIT and IBM

0:03:49.480 --> 0:03:52.560
<v Speaker 5>have been collaborating for a very long time in the

0:03:52.600 --> 0:03:56.680
<v Speaker 5>area of AI. In fact, the term artificial intelligence was

0:03:56.720 --> 0:04:00.440
<v Speaker 5>coined in a nineteen fifty six workshop that was held

0:04:00.440 --> 0:04:02.440
<v Speaker 5>at Dartmouth, but it was actually organized by an IBM

0:04:02.560 --> 0:04:05.720
<v Speaker 5>or Nathaniel Rochester, who led the development of the IBM

0:04:05.760 --> 0:04:08.920
<v Speaker 5>seven and one. So we've really been together in AIS

0:04:08.960 --> 0:04:13.840
<v Speaker 5>since the beginning, and as AI kept accelerating more and

0:04:13.880 --> 0:04:17.320
<v Speaker 5>more and more, I think there was a really interesting

0:04:17.360 --> 0:04:19.880
<v Speaker 5>decision to say, let's make this a formal partnership, so

0:04:20.080 --> 0:04:22.320
<v Speaker 5>IBM in twenty seventeen and also to be committing close

0:04:22.360 --> 0:04:25.200
<v Speaker 5>to a quarter billion dollars over ten years to have

0:04:25.279 --> 0:04:29.360
<v Speaker 5>this joint lab with MIT, and we located ourselves right

0:04:29.400 --> 0:04:31.640
<v Speaker 5>on the campus and we've been developing very very deep

0:04:31.680 --> 0:04:34.280
<v Speaker 5>relationships where we can really get to know each other,

0:04:34.400 --> 0:04:37.719
<v Speaker 5>work shoulder to shoulder, conceiving what we should work on next,

0:04:37.760 --> 0:04:41.720
<v Speaker 5>and then executing the projects. And it's really very few

0:04:42.360 --> 0:04:46.040
<v Speaker 5>entities like this exist between academia industry. It's been really

0:04:46.080 --> 0:04:48.320
<v Speaker 5>fun the last five years to be a part of it.

0:04:48.960 --> 0:04:50.400
<v Speaker 4>And what do you think are some of the most

0:04:50.440 --> 0:04:54.000
<v Speaker 4>important outcomes of this collaboration between IBM and MIT.

0:04:55.400 --> 0:04:58.039
<v Speaker 5>Yeah, so we're really kind of the tip of the

0:04:58.080 --> 0:05:02.720
<v Speaker 5>sphere for IBM the I strategy. So we're we're really

0:05:02.720 --> 0:05:05.560
<v Speaker 5>looking what, you know, what's coming ahead, and you know,

0:05:05.560 --> 0:05:08.239
<v Speaker 5>in areas like Foundation models, you know, as the field

0:05:08.400 --> 0:05:11.600
<v Speaker 5>changes and I T people are interested in working on

0:05:11.760 --> 0:05:13.719
<v Speaker 5>you know, faculty, students and staff are interested in working

0:05:13.800 --> 0:05:15.520
<v Speaker 5>on what's the latest thing, what's the next thing. We

0:05:15.560 --> 0:05:18.960
<v Speaker 5>at IBM Research are very much interested in the same.

0:05:19.240 --> 0:05:21.240
<v Speaker 5>So we can kind of put out feelers, you know,

0:05:21.320 --> 0:05:24.880
<v Speaker 5>interesting things that we're seeing in our research, interesting things

0:05:24.920 --> 0:05:26.479
<v Speaker 5>we're hearing in the field. We can go and chase

0:05:26.480 --> 0:05:29.599
<v Speaker 5>those opportunities. So when something big comes, like the big

0:05:29.680 --> 0:05:32.560
<v Speaker 5>change that's been happening lately with Foundation Models, we're ready

0:05:32.560 --> 0:05:35.000
<v Speaker 5>to jump on it. That's really the purpose, that's that's

0:05:35.000 --> 0:05:38.120
<v Speaker 5>the lab functioning the way it should. We're also really

0:05:38.120 --> 0:05:41.640
<v Speaker 5>interested in how do we advance you know AI that

0:05:41.680 --> 0:05:44.600
<v Speaker 5>can help with climate change or you know, build better

0:05:44.720 --> 0:05:47.240
<v Speaker 5>materials and all these kinds of things that are you know,

0:05:47.320 --> 0:05:50.440
<v Speaker 5>a broader aperture sometimes than than what we might consider

0:05:50.600 --> 0:05:53.479
<v Speaker 5>just looking at the product portfolio of IBM, and that

0:05:53.480 --> 0:05:55.240
<v Speaker 5>that gives us again a breadth where we can see

0:05:55.240 --> 0:05:58.240
<v Speaker 5>connections that we might not have seen otherwise. We can

0:05:58.520 --> 0:06:01.039
<v Speaker 5>you know, think things that help out society and also

0:06:01.040 --> 0:06:02.080
<v Speaker 5>help out our customers.

0:06:03.240 --> 0:06:07.680
<v Speaker 4>So the last whatever six months, say, there has been

0:06:07.720 --> 0:06:13.400
<v Speaker 4>this wild rise in the public's interest in AI, right

0:06:13.480 --> 0:06:16.880
<v Speaker 4>clearly coming out of these generative AI models that are

0:06:16.920 --> 0:06:21.039
<v Speaker 4>really accessible, you know, certainly chat GPT language models like that,

0:06:21.080 --> 0:06:24.159
<v Speaker 4>as well as models that generate images like mid Journey.

0:06:24.760 --> 0:06:27.520
<v Speaker 4>I mean, can you just sort of briefly talk about

0:06:27.680 --> 0:06:31.680
<v Speaker 4>the breakthroughs in AI that have made this moment feel

0:06:31.760 --> 0:06:35.240
<v Speaker 4>so exciting, so revolutionary for artificial intelligence.

0:06:36.320 --> 0:06:41.039
<v Speaker 5>Yeah. You know, I've been studying AI basically my entire

0:06:41.080 --> 0:06:43.200
<v Speaker 5>adult life. Before I came to IBM, I was a

0:06:43.200 --> 0:06:45.760
<v Speaker 5>professor at Harvard. I've been doing this a long time,

0:06:46.000 --> 0:06:48.280
<v Speaker 5>and I've gotten used to being surprised. It sounds like

0:06:48.320 --> 0:06:51.400
<v Speaker 5>a joke, but it's serious, Like I'm getting used to

0:06:51.440 --> 0:06:55.359
<v Speaker 5>being surprised at the acceleration of the pace Again. It

0:06:55.440 --> 0:06:58.320
<v Speaker 5>tracks actually a long way back. You know, there's lots

0:06:58.360 --> 0:07:00.920
<v Speaker 5>of things where there was an idea that just simmered

0:07:01.600 --> 0:07:04.960
<v Speaker 5>for a really long time. Some of the key math

0:07:05.440 --> 0:07:09.000
<v Speaker 5>behind the stuff that we have today, which is amazing

0:07:09.720 --> 0:07:12.960
<v Speaker 5>there's an algorithm called backpropagation, which is sort of key

0:07:13.000 --> 0:07:15.600
<v Speaker 5>to training neural networks that's been around, you know, since

0:07:15.680 --> 0:07:19.520
<v Speaker 5>the eighties in wide use. And really what happened was

0:07:19.840 --> 0:07:23.560
<v Speaker 5>it simmered for a long time and then enough data

0:07:23.800 --> 0:07:27.440
<v Speaker 5>and enough compute came. So we had enough data because

0:07:28.080 --> 0:07:31.240
<v Speaker 5>you know, we all started carrying multiple cameras around with us.

0:07:31.280 --> 0:07:34.080
<v Speaker 5>Our mobile phones have all, you know, all these cameras

0:07:34.160 --> 0:07:36.600
<v Speaker 5>and this we put everything on the Internet, and there's

0:07:36.600 --> 0:07:39.119
<v Speaker 5>all this data out there. We caught a lucky break

0:07:39.120 --> 0:07:41.400
<v Speaker 5>that there was something called a graphics processing unit, which

0:07:41.680 --> 0:07:43.680
<v Speaker 5>turns out to be really useful for doing these kinds

0:07:43.720 --> 0:07:46.240
<v Speaker 5>of algorithms, maybe even more useful than it is for

0:07:46.320 --> 0:07:50.400
<v Speaker 5>doing graphics. They're greater graphics too, And things just kept

0:07:50.480 --> 0:07:53.680
<v Speaker 5>kind of adding to the snowball. So we had deep learning,

0:07:54.120 --> 0:07:57.720
<v Speaker 5>which is sort of a rebrand of neural networks that

0:07:57.760 --> 0:07:59.960
<v Speaker 5>I mentioned from from the eighties, and that was enable

0:08:00.160 --> 0:08:03.800
<v Speaker 5>again by data because we digitalize the world and compute

0:08:03.800 --> 0:08:06.240
<v Speaker 5>because because we kept building faster and faster and more

0:08:06.280 --> 0:08:09.520
<v Speaker 5>powerful computers, and then that allowed us to make this

0:08:09.520 --> 0:08:13.200
<v Speaker 5>this big breakthrough. And then you know, more recently, using

0:08:13.280 --> 0:08:17.360
<v Speaker 5>the same building blocks that inexorable rise of more and

0:08:17.400 --> 0:08:21.920
<v Speaker 5>more and more data met a technology called self supervised learning,

0:08:22.400 --> 0:08:27.120
<v Speaker 5>where the key difference there in traditional deep learning, you know,

0:08:27.160 --> 0:08:29.760
<v Speaker 5>for classifying images, you know, like is this a cat

0:08:29.880 --> 0:08:33.120
<v Speaker 5>or is this a dog? And a picture, those technologies

0:08:33.520 --> 0:08:36.880
<v Speaker 5>require super visions, so you have to take what you

0:08:36.960 --> 0:08:38.320
<v Speaker 5>have and then you have to label it. So you

0:08:38.320 --> 0:08:39.679
<v Speaker 5>have to take a picture of a cat, and then

0:08:39.679 --> 0:08:42.240
<v Speaker 5>you label it as a cat. And it turns out

0:08:42.280 --> 0:08:44.600
<v Speaker 5>that you know, that's very powerful, that it takes a

0:08:44.600 --> 0:08:47.600
<v Speaker 5>lot of time to label gaps and to label dogs,

0:08:47.640 --> 0:08:50.040
<v Speaker 5>and there's only so many labels that exist in the world.

0:08:50.440 --> 0:08:54.000
<v Speaker 5>So what really changed more recently is that we have

0:08:54.080 --> 0:08:56.560
<v Speaker 5>self supervised learning where you don't have to have the labels.

0:08:56.559 --> 0:08:59.040
<v Speaker 5>We can just take unannotated data. And what that does

0:08:59.160 --> 0:09:02.240
<v Speaker 5>is it lots you use even more data. And that's

0:09:02.280 --> 0:09:06.200
<v Speaker 5>really what drove this latest sort of rage. And then

0:09:06.559 --> 0:09:08.199
<v Speaker 5>and then all of a sudden we start getting these

0:09:08.559 --> 0:09:14.600
<v Speaker 5>really powerful models. And then really this has been simmering technologies, right,

0:09:15.120 --> 0:09:19.280
<v Speaker 5>this has been happening for a while and progressively getting

0:09:19.280 --> 0:09:21.880
<v Speaker 5>more and more powerful. One of the things that really

0:09:22.120 --> 0:09:27.320
<v Speaker 5>happened with CHATGBT and technologies like stable diffusion and mid

0:09:27.360 --> 0:09:30.480
<v Speaker 5>journey was that they made it visible to the public.

0:09:31.160 --> 0:09:33.080
<v Speaker 5>You know, you put it out there, the public can

0:09:33.120 --> 0:09:35.280
<v Speaker 5>touch and feel and they're like, wow, not only is

0:09:35.280 --> 0:09:38.560
<v Speaker 5>there palpable change and wow this you know, I could

0:09:38.559 --> 0:09:40.720
<v Speaker 5>talk to this thing. Wow, this thing can generate an image.

0:09:41.040 --> 0:09:43.679
<v Speaker 5>Not only that, but everyone can touch and feel and try.

0:09:44.520 --> 0:09:49.160
<v Speaker 5>My kids can use some of these AI art generation technologies.

0:09:49.679 --> 0:09:53.200
<v Speaker 5>And that's really just launched. You know. It's like a

0:09:53.600 --> 0:09:57.199
<v Speaker 5>propelled slingshot at us into a different regime in terms

0:09:57.200 --> 0:09:58.960
<v Speaker 5>of the public awareness of these technologies.

0:09:59.640 --> 0:10:02.800
<v Speaker 4>You mentioned earlier in the conversation foundation models, and I

0:10:02.840 --> 0:10:04.679
<v Speaker 4>want to talk a little bit about that. I mean,

0:10:04.720 --> 0:10:08.120
<v Speaker 4>can you just tell me, you know, what are foundation

0:10:08.360 --> 0:10:11.080
<v Speaker 4>models for AI and why are they a big deal?

0:10:12.280 --> 0:10:16.000
<v Speaker 5>Yeah, So this term foundation model was coined by a

0:10:16.000 --> 0:10:19.760
<v Speaker 5>group at Stanford, and I think it's actually a really

0:10:19.800 --> 0:10:23.319
<v Speaker 5>apt term because remember I said, you know, one of

0:10:23.360 --> 0:10:26.680
<v Speaker 5>the big things that unlocked this latest excitement was the

0:10:26.679 --> 0:10:30.280
<v Speaker 5>fact that we could use large amounts of unannotated data.

0:10:30.840 --> 0:10:32.200
<v Speaker 5>We could train a model. We don't have to go

0:10:32.240 --> 0:10:35.760
<v Speaker 5>through the painful effort of labeling each and every example.

0:10:36.320 --> 0:10:38.560
<v Speaker 5>You still need to have your model do something you

0:10:38.559 --> 0:10:40.720
<v Speaker 5>wanted to do you still need to tell it what

0:10:40.800 --> 0:10:42.360
<v Speaker 5>you want to do. You can't just have a model

0:10:42.400 --> 0:10:45.600
<v Speaker 5>that doesn't have any purpose, but what a foundation model

0:10:45.640 --> 0:10:49.280
<v Speaker 5>that provides a foundation, like a literal foundation, you can

0:10:49.320 --> 0:10:51.320
<v Speaker 5>sort of stand on the shoulders of giants. You can

0:10:51.400 --> 0:10:53.960
<v Speaker 5>have one of these massively trained models and then do

0:10:54.040 --> 0:10:56.400
<v Speaker 5>a little bit on top. You know, you could use

0:10:56.440 --> 0:10:59.280
<v Speaker 5>just a few examples of what you're looking for and

0:10:59.559 --> 0:11:01.880
<v Speaker 5>you can get what you want from the model. So

0:11:02.040 --> 0:11:04.040
<v Speaker 5>just a little bit on top now gets to the

0:11:04.080 --> 0:11:06.120
<v Speaker 5>results that a huge amount of effort used to have

0:11:06.200 --> 0:11:08.400
<v Speaker 5>to put in, you know, to get from the ground

0:11:08.520 --> 0:11:10.120
<v Speaker 5>up to that level.

0:11:10.400 --> 0:11:14.320
<v Speaker 4>I was trying to think of of an analogy for

0:11:14.440 --> 0:11:17.480
<v Speaker 4>sort of foundation models versus what came before, and I

0:11:17.480 --> 0:11:19.960
<v Speaker 4>don't know that I came up with a good one,

0:11:20.000 --> 0:11:21.719
<v Speaker 4>but the best I could do was this. I want

0:11:21.720 --> 0:11:24.640
<v Speaker 4>you to tell me if it's plausible. It's like before

0:11:24.760 --> 0:11:28.280
<v Speaker 4>foundation models, it was like you had these sort of

0:11:28.360 --> 0:11:31.560
<v Speaker 4>single use kitchen appliances. You could make a waffle iron

0:11:31.600 --> 0:11:34.120
<v Speaker 4>if you wanted waffles, or you could make a toaster

0:11:34.280 --> 0:11:36.839
<v Speaker 4>if you wanted to make toast. But a foundation model

0:11:36.880 --> 0:11:39.480
<v Speaker 4>is like like an oven with a range on top.

0:11:39.520 --> 0:11:41.319
<v Speaker 4>So it's like this machine, and you could just cook

0:11:41.400 --> 0:11:43.240
<v Speaker 4>anything with this machine.

0:11:43.880 --> 0:11:48.360
<v Speaker 5>Yeah, that's a great analogy. They're very versatile. The other

0:11:48.480 --> 0:11:51.040
<v Speaker 5>piece of it, too, is that they dramatically lowered the

0:11:51.160 --> 0:11:54.320
<v Speaker 5>effort that it takes to do something that you want

0:11:54.360 --> 0:11:57.640
<v Speaker 5>to do. And I used to say about the old

0:11:57.679 --> 0:11:59.440
<v Speaker 5>world of AI, would say, you know, the problem with

0:11:59.480 --> 0:12:03.240
<v Speaker 5>automation is that it's too labor intensive, which sounds like

0:12:03.280 --> 0:12:04.160
<v Speaker 5>I'm making a joke.

0:12:04.400 --> 0:12:08.920
<v Speaker 4>Indeed, famously, if automation does one thing, it substitutes machines

0:12:09.040 --> 0:12:12.280
<v Speaker 4>or computing power for labor, right, So what does that

0:12:12.360 --> 0:12:16.640
<v Speaker 4>mean to say AI is or automation is too labor intensive.

0:12:17.120 --> 0:12:19.120
<v Speaker 5>It sounds like I'm making a joke, but I'm actually serious.

0:12:19.320 --> 0:12:22.520
<v Speaker 5>What I mean is that the effort it took the

0:12:22.640 --> 0:12:26.480
<v Speaker 5>old regime to automate something was very very high. So

0:12:26.640 --> 0:12:29.520
<v Speaker 5>if I need to go and curate all this data,

0:12:29.600 --> 0:12:32.800
<v Speaker 5>collect all this data, and then carefully label all these examples,

0:12:33.200 --> 0:12:37.600
<v Speaker 5>that labeling itself might be incredibly expensive and time. And

0:12:37.640 --> 0:12:40.240
<v Speaker 5>we estimate anywhere between eighty to ninety percent of the

0:12:40.320 --> 0:12:43.200
<v Speaker 5>effort it takes to feel an AI solution actually is

0:12:43.360 --> 0:12:46.679
<v Speaker 5>just spent on data, so that that has some consequences,

0:12:47.000 --> 0:12:52.360
<v Speaker 5>which is the threshold for bothering. You know, if you're

0:12:52.360 --> 0:12:54.559
<v Speaker 5>going to only get a little bit of value back

0:12:54.800 --> 0:12:57.040
<v Speaker 5>from something, are you going to go through this huge

0:12:57.040 --> 0:13:00.560
<v Speaker 5>effort to curate all this data and then when it

0:13:00.559 --> 0:13:02.960
<v Speaker 5>comes time to train the model you need highly skilled

0:13:03.000 --> 0:13:06.680
<v Speaker 5>people defensive or hard to find in the labor market.

0:13:07.200 --> 0:13:08.719
<v Speaker 5>You know, are you really going to do something that's

0:13:08.760 --> 0:13:10.800
<v Speaker 5>just a title incremental thing? Now you're going to do

0:13:10.840 --> 0:13:15.840
<v Speaker 5>the only the highest value things that weren't right level because.

0:13:15.440 --> 0:13:18.840
<v Speaker 4>You have to essentially build the whole machine from scratch,

0:13:19.040 --> 0:13:21.960
<v Speaker 4>and there aren't many things where it's worth that much

0:13:21.960 --> 0:13:24.040
<v Speaker 4>work to build a machine that's only going to do

0:13:24.160 --> 0:13:25.319
<v Speaker 4>one narrow thing.

0:13:25.800 --> 0:13:28.719
<v Speaker 5>That's right, and then you tackle the next problem and

0:13:28.840 --> 0:13:31.160
<v Speaker 5>you basically have to start over. And you know, there

0:13:31.160 --> 0:13:34.000
<v Speaker 5>are some nuances here, like for images, you can pre

0:13:34.040 --> 0:13:36.560
<v Speaker 5>train a model on some other task and change it around.

0:13:36.559 --> 0:13:39.520
<v Speaker 5>So there are some examples of this, like non recurring

0:13:39.640 --> 0:13:42.160
<v Speaker 5>cost that we have in the old world too, But

0:13:42.240 --> 0:13:44.800
<v Speaker 5>by and large, it's just a lot of effort. It's hard.

0:13:45.080 --> 0:13:49.360
<v Speaker 5>It takes you know, a large level of skill to implement.

0:13:50.160 --> 0:13:52.959
<v Speaker 5>One analogy that I like is, you know, think about

0:13:52.960 --> 0:13:55.080
<v Speaker 5>it as you know, you have a river of data,

0:13:55.440 --> 0:13:58.840
<v Speaker 5>you know, running through your company or your institution. Traditional

0:13:58.840 --> 0:14:01.320
<v Speaker 5>AI solutions are kind of like building a dam on

0:14:01.360 --> 0:14:04.840
<v Speaker 5>that river. You know, dams are very expensive things to build.

0:14:05.160 --> 0:14:09.439
<v Speaker 5>They require highly specialized skills and lots of planning. And

0:14:09.640 --> 0:14:11.360
<v Speaker 5>you know, you're only going to put a dam on

0:14:11.720 --> 0:14:14.440
<v Speaker 5>a river that's big enough that you're gonna get enough

0:14:14.520 --> 0:14:16.400
<v Speaker 5>energy out of it that it was worth your trouble.

0:14:16.800 --> 0:14:18.040
<v Speaker 5>You're going to get a lot of value out of

0:14:18.040 --> 0:14:19.920
<v Speaker 5>that dam. If you have a river like that, you know,

0:14:20.000 --> 0:14:23.560
<v Speaker 5>a river of data, but it's actually the vast majority

0:14:23.600 --> 0:14:26.160
<v Speaker 5>of the water you know in your kingdom, actually isn't

0:14:26.160 --> 0:14:30.360
<v Speaker 5>in that river. It's in puddles and creeks and bable bricks,

0:14:30.400 --> 0:14:33.840
<v Speaker 5>And you know, there's a lot of value left on

0:14:33.880 --> 0:14:36.440
<v Speaker 5>the table because it's like, well, I can't there's nothing

0:14:36.440 --> 0:14:38.240
<v Speaker 5>you can do about it. It's just that that's too

0:14:39.240 --> 0:14:42.360
<v Speaker 5>low value, so it takes too much effort, so I'm

0:14:42.400 --> 0:14:43.960
<v Speaker 5>just not going to do it. The return around investment

0:14:44.320 --> 0:14:47.080
<v Speaker 5>just isn't there, so you just end up not automating things.

0:14:47.320 --> 0:14:50.640
<v Speaker 5>It's too much of a pain. Now what foundation models

0:14:50.640 --> 0:14:52.720
<v Speaker 5>do is they say, well, actually no, we can train

0:14:53.600 --> 0:14:55.680
<v Speaker 5>a base model, a foundation that you can work on

0:14:55.800 --> 0:14:57.760
<v Speaker 5>and we don't we don't care. We have specifying what

0:14:57.760 --> 0:14:59.280
<v Speaker 5>the task is ahead of time. We just need to

0:14:59.680 --> 0:15:02.440
<v Speaker 5>learn about the domain of data. So if we want

0:15:02.440 --> 0:15:05.760
<v Speaker 5>to build something that can understand English language, there's a

0:15:05.800 --> 0:15:09.080
<v Speaker 5>ton of English language text available out in the world.

0:15:09.280 --> 0:15:13.040
<v Speaker 5>We can now train models on huge quantities of it,

0:15:13.520 --> 0:15:17.280
<v Speaker 5>and then it learned the structure, learned how language you know,

0:15:17.400 --> 0:15:20.280
<v Speaker 5>good part of how language works on all that unlabeled data,

0:15:20.360 --> 0:15:22.600
<v Speaker 5>and then when you roll up with your task, you know,

0:15:22.840 --> 0:15:26.240
<v Speaker 5>I want to solve this particular problem. You don't have

0:15:26.320 --> 0:15:29.000
<v Speaker 5>to start from scratch. You're starting from a very very

0:15:29.080 --> 0:15:31.960
<v Speaker 5>very high place. So that just gives you the ability

0:15:32.040 --> 0:15:34.440
<v Speaker 5>to just you know, now all of a sudden, everything

0:15:34.680 --> 0:15:37.560
<v Speaker 5>is accessible. All the puddles and greeks and babbling brooks

0:15:37.720 --> 0:15:42.000
<v Speaker 5>and klipons, you know, those are all accessible now. And

0:15:42.040 --> 0:15:44.920
<v Speaker 5>that's that's very exciting. But it just changes the equation

0:15:45.040 --> 0:15:47.600
<v Speaker 5>on what kinds of problems you could use AI to solve.

0:15:47.720 --> 0:15:53.160
<v Speaker 4>And so foundation models basically mean that automating some new

0:15:53.280 --> 0:15:56.520
<v Speaker 4>task is much less labor intensive. The sort of marginal

0:15:56.600 --> 0:15:59.560
<v Speaker 4>effort to do some new automation thing is much lower

0:15:59.560 --> 0:16:02.880
<v Speaker 4>because you're building on top of the foundation model rather

0:16:02.960 --> 0:16:07.320
<v Speaker 4>than starting from scratch. Absolutely, So that is that is

0:16:07.440 --> 0:16:11.040
<v Speaker 4>like the exciting good news. I do feel like there's

0:16:11.840 --> 0:16:14.400
<v Speaker 4>a little bit of a countervailing idea that's worth talking

0:16:14.480 --> 0:16:16.800
<v Speaker 4>about here, and that is the idea that even though

0:16:16.840 --> 0:16:20.880
<v Speaker 4>there are these foundation models that are really powerful, that

0:16:20.920 --> 0:16:23.960
<v Speaker 4>are relatively easy to build on top of, it's still

0:16:24.000 --> 0:16:27.200
<v Speaker 4>the case right that there is not some one size

0:16:27.240 --> 0:16:30.920
<v Speaker 4>fits all foundation model. So you know, what does that

0:16:31.040 --> 0:16:33.280
<v Speaker 4>mean and why is that important to think about in

0:16:33.320 --> 0:16:34.040
<v Speaker 4>this context?

0:16:34.640 --> 0:16:38.440
<v Speaker 5>Yeah, so we believe very strongly that there isn't just

0:16:38.560 --> 0:16:41.400
<v Speaker 5>one model to rule them all. There's a number of

0:16:41.480 --> 0:16:44.440
<v Speaker 5>reasons why that could be true. One which I think

0:16:44.520 --> 0:16:48.560
<v Speaker 5>is important and very relevant today is how much energy

0:16:48.880 --> 0:16:53.640
<v Speaker 5>these models can consume. So these models, you know, can

0:16:53.680 --> 0:16:59.120
<v Speaker 5>get very very large. So one thing that we're starting

0:16:59.160 --> 0:17:01.880
<v Speaker 5>to see or starting to believe, is that you probably

0:17:01.920 --> 0:17:07.040
<v Speaker 5>shouldn't use one giant sledgehammer model to solve every single problem,

0:17:07.200 --> 0:17:09.160
<v Speaker 5>you know, like we should pick the right size model

0:17:09.200 --> 0:17:12.000
<v Speaker 5>to solve the problem. We shouldn't necessarily assume that we

0:17:12.040 --> 0:17:16.600
<v Speaker 5>need the biggest, baddest model for every little use case.

0:17:17.040 --> 0:17:19.280
<v Speaker 5>And we're also seeing that, you know, small models that

0:17:19.320 --> 0:17:23.520
<v Speaker 5>are trained like to specialize on particular domains can actually

0:17:23.520 --> 0:17:27.080
<v Speaker 5>outperform much bigger models. So bigger isn't always even better.

0:17:27.440 --> 0:17:30.040
<v Speaker 4>So they're more efficient and they do the thing you

0:17:30.080 --> 0:17:31.680
<v Speaker 4>want them to do better as well.

0:17:32.240 --> 0:17:35.520
<v Speaker 5>That's right. So Stanford, for instance, a group of Stanford

0:17:35.520 --> 0:17:38.639
<v Speaker 5>trained a model. It is a two point seven billion

0:17:38.680 --> 0:17:41.840
<v Speaker 5>parameter model, which isn't terribly big by today's standards. They

0:17:41.840 --> 0:17:44.119
<v Speaker 5>trained it just on the biomedical literature, you know, this

0:17:44.160 --> 0:17:46.520
<v Speaker 5>is the kind of thing that universities do. And what

0:17:46.560 --> 0:17:50.080
<v Speaker 5>they showed was that this model was better at answering

0:17:50.160 --> 0:17:52.680
<v Speaker 5>questions about the biomedical literature than some models that are

0:17:53.200 --> 0:17:56.919
<v Speaker 5>one hundred billion parameters, you know, many times larger. So

0:17:57.080 --> 0:17:59.600
<v Speaker 5>it's a little bit like you know, asking an expert

0:18:00.119 --> 0:18:03.919
<v Speaker 5>for help on something versus asking the smartest person you know. Ye,

0:18:04.000 --> 0:18:06.440
<v Speaker 5>the smartest person you know may be very smart, but

0:18:06.560 --> 0:18:09.439
<v Speaker 5>they're not going to be expertise. And then as an

0:18:09.480 --> 0:18:11.919
<v Speaker 5>added bonus, you know, this is now a much smaller model,

0:18:12.000 --> 0:18:13.919
<v Speaker 5>it's much more efficient to run, we are you know,

0:18:14.480 --> 0:18:18.400
<v Speaker 5>you know, it's cheaper, so there's lots of different advantages there.

0:18:18.440 --> 0:18:22.000
<v Speaker 5>So I think we're going to see attention in the

0:18:22.080 --> 0:18:25.360
<v Speaker 5>industry between vendors that say hey, this is the one,

0:18:25.560 --> 0:18:27.920
<v Speaker 5>you know, big model, and then others that say, well, actually,

0:18:28.200 --> 0:18:30.720
<v Speaker 5>you know, there's there's you know, lots of different tools

0:18:30.720 --> 0:18:32.720
<v Speaker 5>we can use that all have this nice quality that

0:18:32.760 --> 0:18:35.440
<v Speaker 5>we outlined at the beginning, and then we should really

0:18:35.440 --> 0:18:36.960
<v Speaker 5>pick the one that makes the most sense for the

0:18:37.320 --> 0:18:38.040
<v Speaker 5>task at hand.

0:18:39.320 --> 0:18:43.720
<v Speaker 4>So there's sustainability basically efficiency. Another kind of set of

0:18:43.720 --> 0:18:46.000
<v Speaker 4>issues that come up a lot with ai A are

0:18:46.200 --> 0:18:50.000
<v Speaker 4>bias hallucination. Can you talk a little bit about bias

0:18:50.240 --> 0:18:52.480
<v Speaker 4>and hallucination, what they are and how you're working to

0:18:52.880 --> 0:18:54.000
<v Speaker 4>mitigate those problems.

0:18:54.400 --> 0:18:57.240
<v Speaker 5>Yeah, so there are lots of issues still as amazing

0:18:57.280 --> 0:19:00.240
<v Speaker 5>as these technologies are, and they are amazing, let's let's

0:19:00.280 --> 0:19:02.720
<v Speaker 5>be very clear, lots of great things we're going to

0:19:02.840 --> 0:19:06.640
<v Speaker 5>enable with these kinds of technologies. Bias isn't a new problem,

0:19:07.000 --> 0:19:11.600
<v Speaker 5>so you know, basically we've seen this since the beginning

0:19:11.600 --> 0:19:14.520
<v Speaker 5>of AI. If you train a model on data that

0:19:14.960 --> 0:19:17.080
<v Speaker 5>has a bias in it, the model is going to

0:19:17.119 --> 0:19:21.680
<v Speaker 5>recapitulate that bias when it provides its answers. So every time,

0:19:21.880 --> 0:19:24.399
<v Speaker 5>you know, if all the text you have says, you know,

0:19:24.440 --> 0:19:27.520
<v Speaker 5>it's more likely to refer to female nurses and male scientists.

0:19:27.560 --> 0:19:29.639
<v Speaker 5>Then you're going to you know, get models that you know.

0:19:29.720 --> 0:19:32.760
<v Speaker 5>For instance, there was an example where a machine learning

0:19:32.800 --> 0:19:37.200
<v Speaker 5>based translation system translated from Hungarian to English. Hungarian doesn't

0:19:37.240 --> 0:19:40.520
<v Speaker 5>have gendered pronouns. English does, and when you ask them

0:19:40.560 --> 0:19:42.879
<v Speaker 5>to translate, it would translate they are a nurse to

0:19:43.320 --> 0:19:46.160
<v Speaker 5>she is a nurse, would translate they are a scientist

0:19:46.200 --> 0:19:48.399
<v Speaker 5>to he is a scientist. And that's not because the

0:19:49.240 --> 0:19:51.800
<v Speaker 5>people who wrote the algorithm were building in bias and

0:19:51.920 --> 0:19:53.640
<v Speaker 5>coding in like oh, it's got to be this way.

0:19:53.720 --> 0:19:55.960
<v Speaker 5>It's because the data was like that. You know, we

0:19:56.040 --> 0:20:00.159
<v Speaker 5>have biases in our society and they're reflected in in

0:20:00.200 --> 0:20:04.000
<v Speaker 5>our data and our text and our images everywhere. And

0:20:04.040 --> 0:20:06.920
<v Speaker 5>then the models they're just mapping from what they've what

0:20:06.960 --> 0:20:09.480
<v Speaker 5>they've seen in their training data to to the result

0:20:09.520 --> 0:20:11.400
<v Speaker 5>that you're trying to get them to do and to give,

0:20:11.440 --> 0:20:15.240
<v Speaker 5>and then these biases come out. So there's a very

0:20:15.600 --> 0:20:19.439
<v Speaker 5>active program of research, and you know, we we do

0:20:19.800 --> 0:20:22.600
<v Speaker 5>quite a bit at IBM research and my T but

0:20:22.840 --> 0:20:25.960
<v Speaker 5>also all over the community and industry and academia trying

0:20:25.960 --> 0:20:29.320
<v Speaker 5>to figure out how do we explicitly remove these biases,

0:20:29.359 --> 0:20:31.560
<v Speaker 5>how do we identify them, how do you know, how

0:20:31.600 --> 0:20:33.840
<v Speaker 5>do we build tools that allow people to audit their

0:20:33.840 --> 0:20:36.439
<v Speaker 5>systems to make sure they aren't biased. So this is

0:20:36.440 --> 0:20:38.560
<v Speaker 5>a really important thing. And you know, again this was

0:20:38.600 --> 0:20:43.200
<v Speaker 5>here since the beginning, you know of machine learning and AI,

0:20:43.680 --> 0:20:47.240
<v Speaker 5>but foundation models and large language models and generative AI

0:20:48.119 --> 0:20:50.880
<v Speaker 5>just bring it into sharper even sharper focus because there's

0:20:50.880 --> 0:20:53.399
<v Speaker 5>just so much data and it's sort of building in

0:20:54.000 --> 0:20:56.960
<v Speaker 5>baking and all these different biases we have, so that

0:20:57.080 --> 0:21:01.240
<v Speaker 5>that's that's absolutely a problem that these model have. Another

0:21:01.280 --> 0:21:04.720
<v Speaker 5>one that you mentioned was hallucinations. So even the most

0:21:04.720 --> 0:21:08.840
<v Speaker 5>impressive of our models will often just make stuff up.

0:21:09.280 --> 0:21:11.840
<v Speaker 5>You know, the technical term that the heels chosen as

0:21:11.960 --> 0:21:15.280
<v Speaker 5>is hallucination. To give you an example, I asked chat

0:21:15.359 --> 0:21:19.920
<v Speaker 5>tbt to create a biography of David Cox IBM, and

0:21:20.040 --> 0:21:22.320
<v Speaker 5>you know, it started off really well. You know, they

0:21:22.440 --> 0:21:24.639
<v Speaker 5>identified that I was the director of the mt IBM

0:21:24.680 --> 0:21:27.320
<v Speaker 5>Watsonay and said a few words about that, and then

0:21:27.359 --> 0:21:32.119
<v Speaker 5>it proceeded to create an authoritative but completely fake biography

0:21:32.320 --> 0:21:34.639
<v Speaker 5>of me. Where I was British. I was born in

0:21:34.640 --> 0:21:39.040
<v Speaker 5>the UK. I went to British university, you know universities

0:21:39.040 --> 0:21:41.080
<v Speaker 5>in the UK. I was professorating the authority.

0:21:41.160 --> 0:21:44.720
<v Speaker 4>Right, it's the certainty that that is weird about it, right,

0:21:44.720 --> 0:21:48.000
<v Speaker 4>it's it's dead certain that you're from the UK, et cetera.

0:21:48.560 --> 0:21:51.920
<v Speaker 5>Absolutely, yeah, as all kinds of flourishes like I want

0:21:52.080 --> 0:21:56.880
<v Speaker 5>awards in the UK. So yeah, it's it's problematic because

0:21:57.040 --> 0:21:59.399
<v Speaker 5>it kind of pokes a lot of weak spots in

0:21:59.440 --> 0:22:04.880
<v Speaker 5>our humans psychology, where if something sounds coherent, we're likely

0:22:04.920 --> 0:22:07.480
<v Speaker 5>to assume it's true. We're not used to interacting with

0:22:07.520 --> 0:22:12.120
<v Speaker 5>people who eloquently and authoritatively you know, emit complete nonsense

0:22:12.200 --> 0:22:14.960
<v Speaker 5>like yeah, you know we can debate about that, but.

0:22:15.000 --> 0:22:18.440
<v Speaker 4>Yeah, we could debate about that. But yes, the sort

0:22:18.480 --> 0:22:22.120
<v Speaker 4>of blive confidence throws you off when you realize it's

0:22:22.119 --> 0:22:22.879
<v Speaker 4>completely wrong.

0:22:23.000 --> 0:22:25.639
<v Speaker 5>Right, that's right. And and we do have a little

0:22:25.640 --> 0:22:28.280
<v Speaker 5>bit of like a great and powerful AWS sort of

0:22:28.680 --> 0:22:31.080
<v Speaker 5>vibe going sometimes where we're like, well, you know, the

0:22:31.080 --> 0:22:34.880
<v Speaker 5>AI is all knowing and therefore whatever it says must

0:22:34.920 --> 0:22:37.159
<v Speaker 5>be true. But but these things will make up stuff,

0:22:37.440 --> 0:22:42.119
<v Speaker 5>you know, very aggressively, and you know, you everyone can

0:22:42.200 --> 0:22:45.120
<v Speaker 5>try asking it for their their bio. You'll you'll get

0:22:45.119 --> 0:22:47.879
<v Speaker 5>something that You'll always get something that's of the right form,

0:22:48.359 --> 0:22:50.480
<v Speaker 5>that has the right tone. But you know, the facts

0:22:50.680 --> 0:22:53.800
<v Speaker 5>just aren't necessarily there, So that's obviously a problem. We

0:22:53.840 --> 0:22:55.840
<v Speaker 5>need to figure out how to close those gaps, fix

0:22:55.920 --> 0:22:58.600
<v Speaker 5>those problems. There's lots of ways we can use them

0:22:59.040 --> 0:22:59.840
<v Speaker 5>much more easily.

0:23:00.359 --> 0:23:03.080
<v Speaker 3>I'd just like to say, faced with the awesome potential

0:23:03.119 --> 0:23:06.160
<v Speaker 3>of what these technologies might do, it's a bit encouraging

0:23:06.200 --> 0:23:09.720
<v Speaker 3>to hear that even chat GPT has a weakness for

0:23:09.840 --> 0:23:14.960
<v Speaker 3>inventing flamboyant, if fictional versions of people's lives. And while

0:23:15.080 --> 0:23:18.639
<v Speaker 3>entertaining ourselves with chat GPT and mid journey is important,

0:23:19.080 --> 0:23:23.160
<v Speaker 3>the way lay people use consumer facing chatbots and generative

0:23:23.280 --> 0:23:27.560
<v Speaker 3>AI is just fundamentally different from the way an enterprise

0:23:27.600 --> 0:23:31.119
<v Speaker 3>business uses AI. How can we harness the abilities of

0:23:31.240 --> 0:23:34.600
<v Speaker 3>artificial intelligence to help us solve the problems we face

0:23:34.680 --> 0:23:38.280
<v Speaker 3>in business and technology. Let's listen on as David and

0:23:38.359 --> 0:23:40.199
<v Speaker 3>Jacob continue their conversation.

0:23:40.960 --> 0:23:43.920
<v Speaker 4>We've been talking in a somewhat abstract way about AI

0:23:44.040 --> 0:23:46.760
<v Speaker 4>in the ways it can be used. Let's talk in

0:23:46.800 --> 0:23:49.520
<v Speaker 4>a little bit more of a specific way. Can you

0:23:50.200 --> 0:23:54.000
<v Speaker 4>just talk about some examples of business challenges that can

0:23:54.040 --> 0:23:57.400
<v Speaker 4>be solved with automation? With this kind of automation we're

0:23:57.400 --> 0:23:58.320
<v Speaker 4>talking about.

0:23:58.880 --> 0:24:02.359
<v Speaker 5>Yeah, so really really, this guy's the limit. There's a

0:24:02.359 --> 0:24:06.480
<v Speaker 5>whole set of different applications that these models are really

0:24:06.480 --> 0:24:09.119
<v Speaker 5>good at. And basically it's a super set of everything

0:24:09.119 --> 0:24:12.159
<v Speaker 5>we used to use ALI for in business. So you know,

0:24:12.840 --> 0:24:14.520
<v Speaker 5>the simple kinds of things are like, hey, if I

0:24:14.520 --> 0:24:16.680
<v Speaker 5>have text and I you know, if I have product

0:24:16.720 --> 0:24:18.600
<v Speaker 5>reviews and I want to be able to tell if

0:24:18.600 --> 0:24:20.760
<v Speaker 5>these are positive or negative. You know, like, let's look

0:24:20.760 --> 0:24:22.280
<v Speaker 5>at all the negative reviews so we can have a

0:24:22.320 --> 0:24:25.400
<v Speaker 5>human look through them and see what was up. Very

0:24:25.440 --> 0:24:28.440
<v Speaker 5>common business use case. You can do it with traditional

0:24:28.480 --> 0:24:32.080
<v Speaker 5>deep learning based AI. So so there's things like that

0:24:32.080 --> 0:24:34.199
<v Speaker 5>that are you know, it's very prosaic sort that we

0:24:34.200 --> 0:24:35.679
<v Speaker 5>were already doing that, We've been doing it for a

0:24:35.720 --> 0:24:39.400
<v Speaker 5>long time. Then you get situations that are that were

0:24:39.400 --> 0:24:40.040
<v Speaker 5>harder for.

0:24:39.960 --> 0:24:40.479
<v Speaker 2>The old day.

0:24:40.600 --> 0:24:44.359
<v Speaker 5>I like, if i'm I want to compress something like

0:24:44.400 --> 0:24:46.040
<v Speaker 5>I want to I have like say I have a

0:24:46.160 --> 0:24:49.480
<v Speaker 5>chat transcript, like a customer called in and they had

0:24:49.480 --> 0:24:53.760
<v Speaker 5>a complaint. They call back, Okay, now a new you know,

0:24:54.080 --> 0:24:56.000
<v Speaker 5>a person on the line needs to go read the

0:24:56.040 --> 0:24:59.000
<v Speaker 5>old transcript to catch up. Wouldn't it be better if

0:24:59.000 --> 0:25:01.360
<v Speaker 5>we could just summariz that, just condense it all down

0:25:01.680 --> 0:25:03.840
<v Speaker 5>quick little paragraph, you know, customer call they're up said

0:25:03.880 --> 0:25:05.920
<v Speaker 5>about this, rather than having to read the blow by blow.

0:25:06.359 --> 0:25:09.480
<v Speaker 5>There's just lots of settings like that where summarization is

0:25:09.560 --> 0:25:12.600
<v Speaker 5>really helpful. Hey, you have a meeting and I'd like

0:25:12.680 --> 0:25:15.359
<v Speaker 5>to just automatically, you know, have have that meeting or

0:25:15.400 --> 0:25:17.240
<v Speaker 5>that email or whatever. I'd like to just have a

0:25:17.240 --> 0:25:19.480
<v Speaker 5>condensed down so I can really quickly get to the

0:25:19.480 --> 0:25:22.080
<v Speaker 5>heart of the matter. These models are are really good

0:25:22.119 --> 0:25:24.760
<v Speaker 5>at doing that. They're also a really good at question answering.

0:25:25.080 --> 0:25:27.400
<v Speaker 5>So if I want to find out what's how many

0:25:27.440 --> 0:25:31.040
<v Speaker 5>vacation days do I have? I can now interact in

0:25:31.200 --> 0:25:34.600
<v Speaker 5>natural language with a system that can go and that

0:25:34.760 --> 0:25:37.439
<v Speaker 5>has access to our HR policies, and I can actually

0:25:37.440 --> 0:25:40.120
<v Speaker 5>have a you know, a multi turn conversation where I can,

0:25:40.200 --> 0:25:42.359
<v Speaker 5>you know, like I would have with you know, somebody,

0:25:42.480 --> 0:25:47.520
<v Speaker 5>you know, actual HR professional or customer service representative. So

0:25:48.000 --> 0:25:51.280
<v Speaker 5>a big part, you know, what this is doing is

0:25:51.280 --> 0:25:53.840
<v Speaker 5>it's it's putting an interface. You know, when we think

0:25:53.880 --> 0:25:57.800
<v Speaker 5>of computer interfaces, we're usually thinking about UI user interface

0:25:57.880 --> 0:26:00.440
<v Speaker 5>elements where I click on menus and there's buts and

0:26:00.480 --> 0:26:04.600
<v Speaker 5>all this stuff. Increasingly, now we can just talk you know,

0:26:04.640 --> 0:26:06.920
<v Speaker 5>you just in words, you can describe what you want,

0:26:07.000 --> 0:26:09.719
<v Speaker 5>you want to answer, ask a question, you want to

0:26:09.880 --> 0:26:12.760
<v Speaker 5>sort of command the system to do something, rather than

0:26:12.800 --> 0:26:14.680
<v Speaker 5>having to learn how to do that clicking buttons, which

0:26:14.720 --> 0:26:16.800
<v Speaker 5>might be inefficient. Now we can just sort of spell

0:26:16.840 --> 0:26:17.200
<v Speaker 5>it out.

0:26:17.720 --> 0:26:20.720
<v Speaker 4>Interesting, right, the graphical user interface that we all sort

0:26:20.720 --> 0:26:24.040
<v Speaker 4>of default to, that's not like the state of nature, Right,

0:26:24.119 --> 0:26:26.600
<v Speaker 4>that's a thing that was invented and just came to

0:26:26.680 --> 0:26:29.040
<v Speaker 4>be the standard way that we interact with computers. And

0:26:29.080 --> 0:26:33.560
<v Speaker 4>so you could imagine, as you're saying, like chat essentially

0:26:33.720 --> 0:26:37.000
<v Speaker 4>chatting with the machine could could become a sort of

0:26:37.080 --> 0:26:40.320
<v Speaker 4>standard user interface, just like the graphical user interface, did

0:26:40.480 --> 0:26:41.879
<v Speaker 4>you know over the past several decades.

0:26:42.359 --> 0:26:45.800
<v Speaker 5>Absolutely, And I think those kinds of conversational interfaces are

0:26:45.800 --> 0:26:50.040
<v Speaker 5>going to be hugely important for increasing our productivity. It's

0:26:50.040 --> 0:26:51.919
<v Speaker 5>just a lot easier if I if I have to

0:26:51.960 --> 0:26:53.600
<v Speaker 5>learn how to use a tool or I don't have

0:26:53.640 --> 0:26:56.720
<v Speaker 5>to kind of have awkward, you know, interactions from the computer.

0:26:56.720 --> 0:26:57.920
<v Speaker 5>I can just tell it what I want and I

0:26:57.920 --> 0:27:00.520
<v Speaker 5>can understand it, could you know potentially and you ask

0:27:00.640 --> 0:27:03.720
<v Speaker 5>questions back to clarify and have those kinds of conversations

0:27:04.800 --> 0:27:07.840
<v Speaker 5>that can be extremely powerful, and in fact, one area

0:27:07.880 --> 0:27:10.600
<v Speaker 5>where that's going to I think be absolutely game changing

0:27:10.680 --> 0:27:14.400
<v Speaker 5>is in code. When we write code. You know, programming

0:27:14.480 --> 0:27:18.560
<v Speaker 5>languages are a way for us to sort of match

0:27:18.680 --> 0:27:22.800
<v Speaker 5>between our very sloppy way of talking and the very

0:27:22.840 --> 0:27:24.919
<v Speaker 5>exact way that you need to command a computer to

0:27:24.960 --> 0:27:27.880
<v Speaker 5>do what you wanted to do. They're cumbersome to learn,

0:27:27.960 --> 0:27:30.320
<v Speaker 5>they can you know, create very complex systems that are

0:27:30.400 --> 0:27:33.320
<v Speaker 5>very hard to reason about. And we're already starting to

0:27:33.320 --> 0:27:35.360
<v Speaker 5>see the ability to just write down what you want

0:27:35.440 --> 0:27:38.159
<v Speaker 5>and the AI will generate the code for you. And

0:27:38.200 --> 0:27:39.880
<v Speaker 5>I think we're just going to see a huge revolution

0:27:39.960 --> 0:27:41.920
<v Speaker 5>of like we just converse you and we can have

0:27:41.960 --> 0:27:44.480
<v Speaker 5>a conversation to say what we want, and then the

0:27:44.520 --> 0:27:48.199
<v Speaker 5>computer can actually not only do fixed actions and do

0:27:48.280 --> 0:27:50.359
<v Speaker 5>things for us, but it can actually even write code

0:27:50.400 --> 0:27:53.360
<v Speaker 5>to do new things, you know, and generate software itself.

0:27:53.680 --> 0:27:56.399
<v Speaker 5>Given how much software we have, how much craving we

0:27:56.440 --> 0:27:59.199
<v Speaker 5>have for software, like we'll never have enough software in

0:27:59.240 --> 0:28:02.200
<v Speaker 5>our world, uh, you know, the ability to have AI

0:28:02.280 --> 0:28:05.439
<v Speaker 5>systems as a helper in that, I think we're going

0:28:05.480 --> 0:28:07.280
<v Speaker 5>to see a lot of a lot of value there.

0:28:08.480 --> 0:28:11.120
<v Speaker 4>So if you if you think about the different ways

0:28:11.760 --> 0:28:13.960
<v Speaker 4>AI might be applied to business. I mean you've talked

0:28:13.960 --> 0:28:16.320
<v Speaker 4>about a number of the sort of classic use cases.

0:28:17.000 --> 0:28:20.359
<v Speaker 4>What are some of the more out there use cases.

0:28:20.359 --> 0:28:23.280
<v Speaker 4>What are some you know, unique ways you could imagine

0:28:23.320 --> 0:28:25.680
<v Speaker 4>AI being applied to business.

0:28:26.720 --> 0:28:29.439
<v Speaker 5>Yeah, there's really disguised the limit. I mean, we have

0:28:29.520 --> 0:28:31.719
<v Speaker 5>one project that I'm kind of a fan of where

0:28:32.359 --> 0:28:35.840
<v Speaker 5>we actually were working with a mechanical engineering professor at

0:28:35.920 --> 0:28:38.920
<v Speaker 5>MIT working on a classic problem, how do you build

0:28:39.280 --> 0:28:42.680
<v Speaker 5>linkage systems which are like, can imagine bars and joints

0:28:42.840 --> 0:28:45.160
<v Speaker 5>and ogres, you know the things that are.

0:28:45.120 --> 0:28:48.440
<v Speaker 4>Building a thing, building a physical machine of some kind of.

0:28:49.000 --> 0:28:54.160
<v Speaker 5>Like real like metal and you know nineteenth century just

0:28:54.360 --> 0:28:57.280
<v Speaker 5>old school Industrial revolution. Yeah yeah, yeah, but you know

0:28:57.320 --> 0:29:00.040
<v Speaker 5>the little arm that's that's holding up my microphone in

0:29:00.080 --> 0:29:02.560
<v Speaker 5>front of me. Cranes, get build your buildings, you know,

0:29:02.640 --> 0:29:05.160
<v Speaker 5>parts of your engines. This is like classical stuff. It

0:29:05.200 --> 0:29:07.480
<v Speaker 5>turns out that you know, humans, if you want to

0:29:07.480 --> 0:29:10.680
<v Speaker 5>build an advanced system, you decide what like curve you

0:29:10.720 --> 0:29:13.360
<v Speaker 5>want to create, and then a human together with a

0:29:13.400 --> 0:29:17.160
<v Speaker 5>computer program, can build a five or six bar linkage.

0:29:17.280 --> 0:29:18.800
<v Speaker 5>And then that's kind of where you top out it

0:29:18.800 --> 0:29:21.680
<v Speaker 5>because it gets too complicated to work more than that.

0:29:22.320 --> 0:29:24.800
<v Speaker 5>We built a generative AI system that can build twenty

0:29:24.840 --> 0:29:28.200
<v Speaker 5>bar linkages like arbitrarily complex. So these are machines that

0:29:28.200 --> 0:29:32.600
<v Speaker 5>are beyond the capability of a human to design themselves.

0:29:33.120 --> 0:29:36.080
<v Speaker 5>Another example, we have an AI system that can generate

0:29:36.240 --> 0:29:38.640
<v Speaker 5>electronic circuits. You know, we had a project where we're

0:29:38.640 --> 0:29:41.320
<v Speaker 5>working where we were building better power converters which allow

0:29:41.560 --> 0:29:45.680
<v Speaker 5>our computers and our devices to be more efficient, save energy,

0:29:46.480 --> 0:29:49.400
<v Speaker 5>you know, less less carbonet But I think the world

0:29:49.480 --> 0:29:52.400
<v Speaker 5>around us has always been shaped by technology. If you

0:29:52.440 --> 0:29:54.719
<v Speaker 5>look around, you know, just think about how many steps

0:29:54.720 --> 0:29:57.040
<v Speaker 5>and how many people, and how many designs went into

0:29:57.080 --> 0:30:00.600
<v Speaker 5>the table and the chair and the vamp. It's it's

0:30:00.640 --> 0:30:04.120
<v Speaker 5>really just astonishing. And that's already you know, the fruit

0:30:04.200 --> 0:30:07.400
<v Speaker 5>of automation and computers and those kinds of tools. But

0:30:07.400 --> 0:30:10.880
<v Speaker 5>we're going to see that increasingly be act also of AI.

0:30:10.960 --> 0:30:13.000
<v Speaker 5>It's just going to be everywhere around us, everything we

0:30:13.120 --> 0:30:15.760
<v Speaker 5>touch is going to have to you know, helped in

0:30:15.840 --> 0:30:19.240
<v Speaker 5>some way to get get to you by you know.

0:30:19.240 --> 0:30:22.160
<v Speaker 4>That is a pretty profound transformation that you're talking about

0:30:22.200 --> 0:30:25.040
<v Speaker 4>in business. How do you think about the implications of

0:30:25.080 --> 0:30:28.600
<v Speaker 4>that both for the sort of you know, business itself,

0:30:29.000 --> 0:30:30.760
<v Speaker 4>and also for for employees.

0:30:32.520 --> 0:30:36.640
<v Speaker 5>Yeah, so I think for businesses this is gonna cut costs,

0:30:36.920 --> 0:30:40.800
<v Speaker 5>make new opportunities to like customers, you know, like there's

0:30:40.840 --> 0:30:43.640
<v Speaker 5>just you know, it's sort of all upside right like

0:30:44.480 --> 0:30:46.480
<v Speaker 5>for the for the workers, I think the story is

0:30:46.520 --> 0:30:49.400
<v Speaker 5>mostly good too. You know, like how many things do

0:30:49.520 --> 0:30:53.320
<v Speaker 5>you do in your day that you'd really rather not right?

0:30:53.800 --> 0:30:55.800
<v Speaker 5>You know, and we're used to having things we don't

0:30:55.920 --> 0:30:59.200
<v Speaker 5>like automated away, you know, we we didn't you know,

0:30:59.160 --> 0:31:01.520
<v Speaker 5>if we didn't like walk getting many miles to work,

0:31:01.600 --> 0:31:03.480
<v Speaker 5>then you know, like you can have a car and

0:31:03.520 --> 0:31:05.720
<v Speaker 5>you can drive there. Or we used to have a

0:31:05.800 --> 0:31:08.720
<v Speaker 5>huge fraction over ninety percent of the US population engaged

0:31:08.720 --> 0:31:11.760
<v Speaker 5>in agriculture, and then we mechanized it. Now very few

0:31:11.760 --> 0:31:13.800
<v Speaker 5>people work in agriculture. A small number of people can

0:31:13.880 --> 0:31:16.120
<v Speaker 5>do the work of a large number of people. And

0:31:16.160 --> 0:31:18.680
<v Speaker 5>then you know, things like email, and you know, they've

0:31:18.720 --> 0:31:21.360
<v Speaker 5>led to huge productivity enhancements because I don't need to

0:31:21.400 --> 0:31:23.720
<v Speaker 5>be writing letters and sending them in the mail. I

0:31:23.760 --> 0:31:28.160
<v Speaker 5>can just instantly communicate with people. We just become more effective,

0:31:28.320 --> 0:31:32.400
<v Speaker 5>Like our jobs have transformed, whether it's a physical job

0:31:32.480 --> 0:31:35.360
<v Speaker 5>like agriculture, or whether it's a knowledge worker job where

0:31:35.360 --> 0:31:39.080
<v Speaker 5>you're sending emails and communicating with people and coordinating teams.

0:31:39.400 --> 0:31:42.040
<v Speaker 5>We've just gotten better and you know, the technology has

0:31:42.040 --> 0:31:45.560
<v Speaker 5>just made us more productive. And this is just another example. Now,

0:31:45.840 --> 0:31:47.960
<v Speaker 5>you know, there are people who worry that you know,

0:31:48.640 --> 0:31:51.040
<v Speaker 5>will be so good at that that maybe jobs will

0:31:51.080 --> 0:31:54.760
<v Speaker 5>be displaced, and that's that's a legitimate concern. But just

0:31:54.880 --> 0:31:58.360
<v Speaker 5>like how in agriculture, you know, it's not like suddenly

0:31:58.400 --> 0:32:01.280
<v Speaker 5>we had ninety percent of the population and unemployed. You know,

0:32:01.360 --> 0:32:05.640
<v Speaker 5>people transitioned to to other jobs. And the other thing

0:32:05.640 --> 0:32:09.200
<v Speaker 5>that we've found, too, is that our appetite for doing

0:32:09.240 --> 0:32:13.040
<v Speaker 5>more things is as humans is sort of insatiable. So

0:32:13.320 --> 0:32:16.400
<v Speaker 5>even if we can dramatically increase how much you know,

0:32:16.440 --> 0:32:19.560
<v Speaker 5>one human can do, that doesn't necessarily mean we're going

0:32:19.600 --> 0:32:21.920
<v Speaker 5>to do a fixed amount of stuff. There's an appetite

0:32:21.960 --> 0:32:23.479
<v Speaker 5>to have even more, so we're going to you can

0:32:23.480 --> 0:32:26.040
<v Speaker 5>continue to grow grow the pie. So I think at

0:32:26.160 --> 0:32:28.400
<v Speaker 5>least certainly in the near term, you know, we're going

0:32:28.440 --> 0:32:30.280
<v Speaker 5>to see a lot of drudgery go away from work.

0:32:30.320 --> 0:32:32.600
<v Speaker 5>We're going to see people to be able to be

0:32:32.640 --> 0:32:35.760
<v Speaker 5>more effective at their jobs. You know, we will see

0:32:35.760 --> 0:32:39.440
<v Speaker 5>some transformation in jobs and what like. But we've seen

0:32:39.480 --> 0:32:44.280
<v Speaker 5>that before and the technology a least has the potential

0:32:44.480 --> 0:32:45.800
<v Speaker 5>to make our lives a lot easier.

0:32:47.040 --> 0:32:52.000
<v Speaker 4>So IBM recently launched Watson X, which includes Watson x

0:32:52.120 --> 0:32:55.080
<v Speaker 4>dot AI. Tell me about that, tell me about you

0:32:55.080 --> 0:32:57.160
<v Speaker 4>know what it is and the new possibilities that it

0:32:57.200 --> 0:32:57.760
<v Speaker 4>opens up.

0:32:58.680 --> 0:33:02.240
<v Speaker 5>Yeah, So so Wat's the next is obviously a bit

0:33:02.280 --> 0:33:07.240
<v Speaker 5>of a new branding on the Watson brand. TJ. Watson

0:33:07.280 --> 0:33:11.120
<v Speaker 5>that was the founder of IBM and our EI technologies

0:33:11.160 --> 0:33:15.080
<v Speaker 5>have had the Watson brand. Watson X is a recognition

0:33:15.280 --> 0:33:18.600
<v Speaker 5>that there's something new, there's something that actually has changed

0:33:18.600 --> 0:33:22.920
<v Speaker 5>the game. We've gone from this old world of automation

0:33:23.120 --> 0:33:25.720
<v Speaker 5>is to labor intensive to this new world of possibilities

0:33:26.280 --> 0:33:30.200
<v Speaker 5>where it's much easier to use AI. And what Watson

0:33:30.400 --> 0:33:35.360
<v Speaker 5>X does it brings together tools for businesses to harness

0:33:35.400 --> 0:33:40.239
<v Speaker 5>that power. So whattsonex dot AI foundation models that our

0:33:40.280 --> 0:33:43.440
<v Speaker 5>customers can use. It includes tools that make it easy

0:33:43.520 --> 0:33:47.480
<v Speaker 5>to run, easy to deploy, easy to experiment. There's a

0:33:47.520 --> 0:33:51.440
<v Speaker 5>watsonex dot Data component which allows you to sort of

0:33:51.600 --> 0:33:54.240
<v Speaker 5>organize and access to your data. So what we're really

0:33:54.240 --> 0:33:58.000
<v Speaker 5>trying to do is give our customers a cohesive set

0:33:58.040 --> 0:34:02.640
<v Speaker 5>of tools to the value of these technologies and at

0:34:02.680 --> 0:34:05.280
<v Speaker 5>the same time be able to manage the risks and

0:34:05.480 --> 0:34:07.479
<v Speaker 5>other things that you have to keep an eye on

0:34:07.680 --> 0:34:08.960
<v Speaker 5>in an enterprise context.

0:34:10.640 --> 0:34:13.359
<v Speaker 4>So we talk about the guests on this show as

0:34:13.840 --> 0:34:17.960
<v Speaker 4>new creators, by which we mean people who are creatively

0:34:18.000 --> 0:34:22.880
<v Speaker 4>applying technology in business to drive change. And I'm curious

0:34:23.360 --> 0:34:28.080
<v Speaker 4>how creativity plays a role in the research that you do.

0:34:28.680 --> 0:34:33.279
<v Speaker 5>I honestly, I think the creative aspects of this job

0:34:33.719 --> 0:34:37.040
<v Speaker 5>this is what makes this work exciting. You know, I

0:34:37.040 --> 0:34:38.960
<v Speaker 5>should say, you know, the folks who work at my

0:34:39.080 --> 0:34:42.200
<v Speaker 5>organization are doing the creating, and I.

0:34:42.120 --> 0:34:45.680
<v Speaker 4>Guess you're doing the managing so that they could do

0:34:45.760 --> 0:34:46.680
<v Speaker 4>the creator.

0:34:47.120 --> 0:34:50.560
<v Speaker 5>I'm helping them be their best and I still get

0:34:50.560 --> 0:34:53.480
<v Speaker 5>to get involved in the weeds of the research as

0:34:53.560 --> 0:34:56.279
<v Speaker 5>much as I can. But you know, there's something really

0:34:56.320 --> 0:35:01.280
<v Speaker 5>exciting about inventing, you know, like nice things about doing

0:35:01.360 --> 0:35:05.360
<v Speaker 5>invention and doing research on AI and industries. It's usually

0:35:05.400 --> 0:35:07.960
<v Speaker 5>grounded and a real problem that somebody's having. You know,

0:35:08.000 --> 0:35:11.640
<v Speaker 5>a customer wants to solve this problem. It's losing money

0:35:11.719 --> 0:35:14.480
<v Speaker 5>or there there would be a new opportunity. You identify

0:35:14.560 --> 0:35:18.719
<v Speaker 5>that problem and then you build something that's never been

0:35:18.719 --> 0:35:21.680
<v Speaker 5>built before to do that. And I think that's honestly

0:35:21.760 --> 0:35:25.600
<v Speaker 5>the adrenaline rush that keeps all of us in this field.

0:35:25.760 --> 0:35:28.400
<v Speaker 5>How do you do something that nobody else on earth

0:35:28.560 --> 0:35:32.040
<v Speaker 5>has done before or tried before, So that that kind

0:35:32.040 --> 0:35:35.520
<v Speaker 5>of creativity, and there's also creativity as well, and identifying

0:35:35.520 --> 0:35:39.880
<v Speaker 5>what those problems are, being able to understand the places

0:35:40.520 --> 0:35:44.560
<v Speaker 5>where you know the technology is close enough to solving

0:35:44.560 --> 0:35:48.319
<v Speaker 5>a problem, and doing that matchmaking between problems that are

0:35:48.400 --> 0:35:51.080
<v Speaker 5>now solvable, you know, and an AI where the field

0:35:51.160 --> 0:35:55.279
<v Speaker 5>is moving so fast, this is constantly growing horizon of

0:35:55.400 --> 0:35:58.239
<v Speaker 5>things that we might be able to solve. So that matchmaking,

0:35:58.280 --> 0:36:02.000
<v Speaker 5>I think is also a really interesting creative problem. So

0:36:02.280 --> 0:36:04.440
<v Speaker 5>I think I think that's that's that's why it's so

0:36:04.520 --> 0:36:07.640
<v Speaker 5>much fun. And it's a fun environment we have here too.

0:36:07.840 --> 0:36:11.120
<v Speaker 5>It's you know, people drawing on whiteboards and writing on

0:36:11.239 --> 0:36:13.799
<v Speaker 5>pages of math and you.

0:36:13.719 --> 0:36:16.359
<v Speaker 4>Know, like in a movie, like in a movie, yeah,

0:36:16.440 --> 0:36:19.240
<v Speaker 4>straight from special casting drawing, the drawing on the window,

0:36:19.280 --> 0:36:24.640
<v Speaker 4>writing on the window in sharp absolutely, So, so let's

0:36:24.640 --> 0:36:29.640
<v Speaker 4>close with the really long view. How do you imagine

0:36:29.840 --> 0:36:34.080
<v Speaker 4>AI and people working together twenty years from now?

0:36:36.160 --> 0:36:40.799
<v Speaker 5>Yeah, it's really hard to make predictions. The vision that

0:36:41.360 --> 0:36:47.759
<v Speaker 5>I like, actually this came from an MIT economist named

0:36:47.800 --> 0:36:53.719
<v Speaker 5>David Ottur, which was imagine AI almost as a natural resource.

0:36:54.680 --> 0:36:57.640
<v Speaker 5>You know, we have we know how natural resources work, right,

0:36:57.760 --> 0:36:59.440
<v Speaker 5>Like there's an or we can dig up out of

0:36:59.480 --> 0:37:02.080
<v Speaker 5>the earth, comes from kind of springs from the earth,

0:37:02.200 --> 0:37:05.160
<v Speaker 5>or we usually think of that in terms of physical stuff.

0:37:05.800 --> 0:37:07.400
<v Speaker 5>With AI, you can almost think of it as like

0:37:07.440 --> 0:37:10.279
<v Speaker 5>there's a new kind of abundance potentially twenty years from

0:37:10.320 --> 0:37:12.960
<v Speaker 5>now where not only can we have things we can

0:37:13.000 --> 0:37:15.600
<v Speaker 5>build or eat or use or burn or whatever. Now

0:37:15.600 --> 0:37:18.160
<v Speaker 5>we have, you know, this ability to do things and

0:37:18.280 --> 0:37:21.520
<v Speaker 5>understand things and do intellectual work, and I think we

0:37:21.840 --> 0:37:25.880
<v Speaker 5>can get to a world where automating things is just seamless.

0:37:26.280 --> 0:37:31.520
<v Speaker 5>We're surrounded by capability to augment ourselves to get things done.

0:37:32.239 --> 0:37:35.000
<v Speaker 5>And you could think of that in terms of like, oh,

0:37:35.040 --> 0:37:37.200
<v Speaker 5>that's going to displace our jobs, because eventually the AI

0:37:37.239 --> 0:37:39.319
<v Speaker 5>system is going to do everything we can do. But

0:37:39.680 --> 0:37:41.839
<v Speaker 5>you could also think of it in terms of like, wow,

0:37:41.920 --> 0:37:44.239
<v Speaker 5>that's just so much abundance that we now have, and

0:37:44.280 --> 0:37:47.520
<v Speaker 5>really how we use that abundance is sort of up

0:37:47.560 --> 0:37:50.279
<v Speaker 5>to us, you know, like you can writing software is

0:37:50.280 --> 0:37:53.040
<v Speaker 5>super easy and fast, and anybody can do it. Just

0:37:53.080 --> 0:37:55.560
<v Speaker 5>think about all the things you can do now, think

0:37:55.560 --> 0:37:57.520
<v Speaker 5>about all the new activities, and go out all the

0:37:57.560 --> 0:38:00.319
<v Speaker 5>ways we could use that to enrich our lives. That's

0:38:00.320 --> 0:38:03.239
<v Speaker 5>where I'd like to see us in twenty years. You

0:38:03.239 --> 0:38:06.200
<v Speaker 5>know we can. We can do just so much more

0:38:06.560 --> 0:38:09.160
<v Speaker 5>than we were able to do before abundance.

0:38:09.960 --> 0:38:12.800
<v Speaker 4>Great, thank you so much for your time.

0:38:13.520 --> 0:38:15.560
<v Speaker 5>Yeah, it's been a pleasure. Thanks for inviting me.

0:38:17.080 --> 0:38:21.160
<v Speaker 3>What a far ranging, deep conversation. I'm mesmerized by the

0:38:21.200 --> 0:38:25.120
<v Speaker 3>vision David just described. A world where natural conversation between

0:38:25.120 --> 0:38:29.720
<v Speaker 3>mankind and machine can generate creative solutions to our most

0:38:29.760 --> 0:38:33.560
<v Speaker 3>complex problems. A world where we view AI not as

0:38:33.640 --> 0:38:37.680
<v Speaker 3>our replacements, but as a powerful resource we can tap

0:38:37.719 --> 0:38:43.200
<v Speaker 3>into and exponentially boost our innovation and productivity. Thanks so

0:38:43.280 --> 0:38:46.680
<v Speaker 3>much to doctor David Cox for joining us on smart Talks.

0:38:47.120 --> 0:38:50.839
<v Speaker 3>We deeply appreciate him sharing his huge breadth of AI

0:38:50.920 --> 0:38:54.960
<v Speaker 3>knowledge with us and for explaining the transformative potential of

0:38:55.000 --> 0:38:58.360
<v Speaker 3>foundation models in a way that even I can understand.

0:38:58.960 --> 0:39:03.440
<v Speaker 3>We eagerly await his next great breakthrough. Smart Talks with

0:39:03.480 --> 0:39:07.880
<v Speaker 3>IBM is produced by Matt Romano David jaw nishe Venkat

0:39:08.040 --> 0:39:12.480
<v Speaker 3>and Royston Preserve with Jacob Goldstein. We're edited by Lydia

0:39:12.520 --> 0:39:16.839
<v Speaker 3>Jean Kott. Our engineers are Jason Gambrel, Sarah Buguer and

0:39:16.920 --> 0:39:22.560
<v Speaker 3>Ben Holliday. Theme song by Gramoscope. Special thanks to Carli Megliori,

0:39:22.920 --> 0:39:27.040
<v Speaker 3>Andy Kelly, Kathy Callahan and the eight Bar and IBM teams,

0:39:27.520 --> 0:39:31.040
<v Speaker 3>as well as the Pushkin marketing team. Smart Talks with

0:39:31.120 --> 0:39:35.360
<v Speaker 3>IBM is a production of Pushkin Industries and iHeartMedia. To

0:39:35.440 --> 0:39:40.360
<v Speaker 3>find more Pushkin podcasts, listen on the iHeartRadio app, Apple Podcasts,

0:39:40.480 --> 0:39:45.000
<v Speaker 3>or wherever you listen to podcasts. Hi'm Malcolm Gladwell. This

0:39:45.320 --> 0:40:00.520
<v Speaker 3>is a paid advertisement from IBM.