WEBVTT - Smart Talks with IBM - Transformations in AI: why foundation models are the future

0:00:04.519 --> 0:00:10.480
<v Speaker 1>Welcome to Tech Stuff, a production from iHeartRadio.

0:00:12.240 --> 0:00:13.800
<v Speaker 2>Today, we are witnessed.

0:00:13.400 --> 0:00:16.640
<v Speaker 1>To one of those rare moments in history, the rise

0:00:16.760 --> 0:00:20.520
<v Speaker 1>of an innovative technology with the potential to radically transform

0:00:20.600 --> 0:00:26.320
<v Speaker 1>business in society forever. That technology, of course, is artificial intelligence,

0:00:26.520 --> 0:00:29.280
<v Speaker 1>and it's the central focus for this new season of

0:00:29.320 --> 0:00:33.400
<v Speaker 1>Smart Talks with IBM. Join hosts from your favorite Pushkin

0:00:33.479 --> 0:00:37.040
<v Speaker 1>podcasts as they talk with industry experts and leaders to

0:00:37.120 --> 0:00:40.879
<v Speaker 1>explore how businesses can integrate AI into their workflows and

0:00:41.000 --> 0:00:44.160
<v Speaker 1>help drive real change in this new era of AI,

0:00:44.720 --> 0:00:47.360
<v Speaker 1>and of course, host Malcolm Gladwell will be there to

0:00:47.400 --> 0:00:49.919
<v Speaker 1>guide you through the season and throw in his two

0:00:50.000 --> 0:00:53.080
<v Speaker 1>cents as well. Look out for new episodes of Smart

0:00:53.080 --> 0:00:56.240
<v Speaker 1>Talks with IBM every other week on the iHeartRadio app,

0:00:56.480 --> 0:00:59.960
<v Speaker 1>Apple Podcasts, or wherever you get your podcasts, and learn

0:01:00.160 --> 0:01:03.840
<v Speaker 1>more at IBM dot com slash smart Talks.

0:01:06.200 --> 0:01:09.680
<v Speaker 3>Hello, Hello, Welcome to Smart Talks with IBM, a podcast

0:01:09.680 --> 0:01:15.399
<v Speaker 3>from Pushkin Industries, iHeartRadio and IBM. I'm Malcolm Gabwell. This season,

0:01:15.440 --> 0:01:19.920
<v Speaker 3>we're continuing our conversation with new creators visionaries who are

0:01:19.959 --> 0:01:24.000
<v Speaker 3>creatively applying technology in business to drive change, but with

0:01:24.080 --> 0:01:28.480
<v Speaker 3>a focus on the transformative power of artificial intelligence and

0:01:28.560 --> 0:01:31.520
<v Speaker 3>what it means to leverage AI as a game changing

0:01:31.600 --> 0:01:36.800
<v Speaker 3>multiplier for your business. Our guest today is doctor David Cox,

0:01:37.440 --> 0:01:42.320
<v Speaker 3>VP of AI Models at IBM Research and IBM Director

0:01:42.360 --> 0:01:46.440
<v Speaker 3>of the MIT IBM Watson AI Lab, a first of

0:01:46.480 --> 0:01:52.360
<v Speaker 3>its kind industry academic collaboration between IBM and MIT focused

0:01:52.400 --> 0:01:57.080
<v Speaker 3>on the fundamental research of artificial intelligence. Over the course

0:01:57.120 --> 0:02:02.000
<v Speaker 3>of decades, David Cox watched as the AI revolution steadily

0:02:02.080 --> 0:02:05.120
<v Speaker 3>grew from the simmering ideas of a few academics and

0:02:05.160 --> 0:02:10.560
<v Speaker 3>technologists into the industrial boom we are experiencing today. Having

0:02:10.600 --> 0:02:13.720
<v Speaker 3>dedicated his life to pushing the field of AI towards

0:02:13.800 --> 0:02:18.040
<v Speaker 3>new horizons, David has both contributed to and presided over

0:02:18.400 --> 0:02:24.040
<v Speaker 3>many of the major breakthroughs in artificial intelligence. In today's episode,

0:02:24.280 --> 0:02:28.520
<v Speaker 3>you'll hear David explain some of the conceptual underpinnings of

0:02:28.560 --> 0:02:33.320
<v Speaker 3>the current AI landscape, things like foundation models, in surprisingly

0:02:33.440 --> 0:02:36.520
<v Speaker 3>comprehensible terms. I might add, we'll also get into some

0:02:36.560 --> 0:02:40.120
<v Speaker 3>of the amazing practical applications for AI in business, as

0:02:40.160 --> 0:02:43.280
<v Speaker 3>well as what implications AI will have for the future

0:02:43.320 --> 0:02:47.280
<v Speaker 3>of work and design. David spoke with Jacob Goldstein, host

0:02:47.320 --> 0:02:52.080
<v Speaker 3>of the Pushkin podcast What's Your Problem. A veteran business journalist,

0:02:52.400 --> 0:02:55.680
<v Speaker 3>Jacob has reported for The Wall Street Journal, the Miami Herald,

0:02:55.919 --> 0:03:01.080
<v Speaker 3>and was a longtime host of the NPR program Planet Money. Okay,

0:03:01.760 --> 0:03:02.920
<v Speaker 3>let's get to the interview.

0:03:09.360 --> 0:03:11.360
<v Speaker 2>Tell me about your job at IBM.

0:03:11.720 --> 0:03:15.040
<v Speaker 4>So. I wear two hats at IBM. So one, I'm

0:03:15.080 --> 0:03:17.919
<v Speaker 4>the IBM Director of the MI T IBM Watson AI Lab.

0:03:18.440 --> 0:03:21.480
<v Speaker 4>So that's a joint lab between IBM and MIT where

0:03:21.480 --> 0:03:24.080
<v Speaker 4>we try and invent what's next in AI. It's been

0:03:24.120 --> 0:03:27.000
<v Speaker 4>running for about five years, and then more recently I

0:03:27.040 --> 0:03:30.200
<v Speaker 4>started as the vice president for AI Models, and I'm

0:03:30.200 --> 0:03:34.760
<v Speaker 4>in charge of building IBM's foundation models, you know, building

0:03:34.800 --> 0:03:37.000
<v Speaker 4>these these big models, generative models that allow us to

0:03:37.040 --> 0:03:39.160
<v Speaker 4>have all kinds of new exciting capabilities in AI.

0:03:39.720 --> 0:03:41.640
<v Speaker 2>So, so I want to talk to you a lot

0:03:41.720 --> 0:03:45.120
<v Speaker 2>about foundation models, about genitive AI. But before we get

0:03:45.160 --> 0:03:47.240
<v Speaker 2>to that, let's just spend a minute on the on

0:03:47.320 --> 0:03:52.800
<v Speaker 2>the IBM MI T collaboration. Where where did that partnership start?

0:03:52.840 --> 0:03:53.800
<v Speaker 2>How did it originate?

0:03:54.960 --> 0:03:57.320
<v Speaker 4>Yeah, So, actually it turns out that MI T and

0:03:57.360 --> 0:04:01.000
<v Speaker 4>IBM have been collaborating for a very long time in

0:04:01.000 --> 0:04:05.040
<v Speaker 4>the area of AI. In fact, the term artificial intelligence

0:04:05.120 --> 0:04:08.720
<v Speaker 4>was coined in a nineteen fifty six workshop that was

0:04:08.720 --> 0:04:11.000
<v Speaker 4>held at Dartmouth. It was actually organized by an IBM

0:04:11.040 --> 0:04:14.200
<v Speaker 4>or Nathaniel Rochester, who led the development of the IBM

0:04:14.240 --> 0:04:17.400
<v Speaker 4>seven and one. So we've really been together in AIS

0:04:17.480 --> 0:04:22.360
<v Speaker 4>since the beginning, and as AI kept accelerating more and

0:04:22.400 --> 0:04:25.839
<v Speaker 4>more and more, I think there was a really interesting

0:04:25.880 --> 0:04:28.400
<v Speaker 4>decision to say, let's make this a formal partnership. So

0:04:28.600 --> 0:04:30.640
<v Speaker 4>IBM in twenty seventeen and AW so it'd be committing

0:04:30.640 --> 0:04:33.520
<v Speaker 4>close to a quarter billion dollars over ten years to

0:04:33.600 --> 0:04:37.640
<v Speaker 4>have this joint lab with MIT, and we located ourselves

0:04:37.720 --> 0:04:39.880
<v Speaker 4>right on the campus and we've been developing very, very

0:04:39.960 --> 0:04:42.760
<v Speaker 4>deep relationships where we can really get to know each other,

0:04:42.880 --> 0:04:46.240
<v Speaker 4>work shoulder to shoulder, conceiving what we should work on next,

0:04:46.240 --> 0:04:50.239
<v Speaker 4>and then executing the projects. And it's really very few

0:04:50.880 --> 0:04:54.520
<v Speaker 4>entities like this exist between academia industry. It's been really

0:04:54.560 --> 0:04:56.800
<v Speaker 4>fun the last five years to be a part of it.

0:04:57.440 --> 0:04:58.960
<v Speaker 2>And what do you think are some of the most

0:04:58.960 --> 0:05:02.480
<v Speaker 2>important outcomes of this collaboration between IBM and MIT.

0:05:03.880 --> 0:05:06.560
<v Speaker 4>Yeah, so we're really kind of the tip of the

0:05:06.600 --> 0:05:10.920
<v Speaker 4>sphere for for IBM's b I strategy. So we're we're

0:05:10.960 --> 0:05:14.039
<v Speaker 4>really looking what, you know, what's coming ahead, and you know,

0:05:14.080 --> 0:05:16.800
<v Speaker 4>in areas like Foundation models, you know, as the field

0:05:16.880 --> 0:05:20.080
<v Speaker 4>changes and I T people are interested in working on

0:05:20.279 --> 0:05:22.280
<v Speaker 4>you know, faculty, students and staff are interested in working

0:05:22.320 --> 0:05:24.039
<v Speaker 4>on what's the latest thing, what's the next thing. We

0:05:24.080 --> 0:05:27.440
<v Speaker 4>at IBM Research are very much interested in the same

0:05:27.720 --> 0:05:29.760
<v Speaker 4>So we can kind of put out feelers, you know,

0:05:29.839 --> 0:05:33.400
<v Speaker 4>interesting things that we're seeing in our research, interesting things

0:05:33.400 --> 0:05:34.960
<v Speaker 4>we're hearing in the field. We can go and chase

0:05:35.000 --> 0:05:38.120
<v Speaker 4>those opportunities. So when something big comes, like the big

0:05:38.200 --> 0:05:41.040
<v Speaker 4>change that's been happening lately with Foundation Models, we're ready

0:05:41.080 --> 0:05:43.479
<v Speaker 4>to jump on it. That's really the purpose, that's that's

0:05:43.520 --> 0:05:46.600
<v Speaker 4>the lab functioning the way it should. We're also really

0:05:46.600 --> 0:05:50.159
<v Speaker 4>interested in how do we advance you know AI that

0:05:50.200 --> 0:05:53.039
<v Speaker 4>can help with climate change or you know, build better

0:05:53.200 --> 0:05:55.760
<v Speaker 4>materials and all these kinds of things that are you know,

0:05:55.839 --> 0:05:58.960
<v Speaker 4>a broader aperture sometimes than than what we might consider.

0:05:59.120 --> 0:06:01.800
<v Speaker 4>Just looking at the product portfolio of IBM, and that

0:06:02.120 --> 0:06:04.440
<v Speaker 4>gives us again a breadth where we can see connections

0:06:04.480 --> 0:06:07.240
<v Speaker 4>that we might not have seen otherwise. We can you know,

0:06:07.400 --> 0:06:09.920
<v Speaker 4>think things that help out society and also help out

0:06:09.920 --> 0:06:10.560
<v Speaker 4>our customers.

0:06:11.720 --> 0:06:16.200
<v Speaker 2>So the last whatever six months, say, there has been

0:06:16.200 --> 0:06:21.920
<v Speaker 2>this wild rise in the public's interest in AI, right,

0:06:21.960 --> 0:06:25.400
<v Speaker 2>clearly coming out of these generative AI models that are

0:06:25.400 --> 0:06:29.520
<v Speaker 2>really accessible, you know, certainly chat GPT language models like that,

0:06:29.600 --> 0:06:32.640
<v Speaker 2>as well as models that generate images like mid Journey.

0:06:33.279 --> 0:06:36.040
<v Speaker 2>I mean, can you just sort of briefly talk about

0:06:36.160 --> 0:06:40.200
<v Speaker 2>the breakthroughs in AI that have made this moment feel

0:06:40.279 --> 0:06:43.760
<v Speaker 2>so exciting, so revolutionary for artificial intelligence?

0:06:44.800 --> 0:06:49.560
<v Speaker 4>Yeah. You know, I've been studying AI basically my entire

0:06:49.600 --> 0:06:51.680
<v Speaker 4>adult life. Before I came to IBM, I was a

0:06:51.680 --> 0:06:54.280
<v Speaker 4>professor at Harvard. I've been doing this a long time,

0:06:54.520 --> 0:06:56.760
<v Speaker 4>and I've gotten used to being surprised. It sounds like

0:06:56.800 --> 0:07:00.000
<v Speaker 4>a joke, but it's serious, like getting used to being

0:07:00.080 --> 0:07:04.400
<v Speaker 4>surprised at the acceleration of the pace again. It tracks

0:07:04.440 --> 0:07:06.960
<v Speaker 4>actually a long way back. You know, there's lots of

0:07:06.960 --> 0:07:10.240
<v Speaker 4>things where there was an idea that just simmered for

0:07:10.280 --> 0:07:14.480
<v Speaker 4>a really long time. Some of the key math behind

0:07:15.080 --> 0:07:18.400
<v Speaker 4>the stuff that we have today, which is amazing. There's

0:07:18.400 --> 0:07:21.560
<v Speaker 4>an algorithm called backpropagation, which is sort of key to

0:07:21.640 --> 0:07:24.240
<v Speaker 4>training neural networks that's been around you know since the

0:07:24.280 --> 0:07:28.400
<v Speaker 4>eighties in wide use, and really what happened was it

0:07:28.520 --> 0:07:32.360
<v Speaker 4>simmered for a long time and then enough data and

0:07:32.520 --> 0:07:36.720
<v Speaker 4>enough compute came. So we had enough data because you know,

0:07:36.760 --> 0:07:39.920
<v Speaker 4>we all started carrying multiple cameras around with us, our

0:07:39.960 --> 0:07:42.760
<v Speaker 4>mobile phones have all, you know, all these cameras and

0:07:42.800 --> 0:07:45.240
<v Speaker 4>this we put everything on the Internet, and there's all

0:07:45.240 --> 0:07:47.680
<v Speaker 4>this data out there. We called a lucky break that

0:07:47.720 --> 0:07:50.160
<v Speaker 4>there was something called a graphics processing unit, which you know,

0:07:50.200 --> 0:07:52.200
<v Speaker 4>turns out to be really useful for doing these kinds

0:07:52.240 --> 0:07:54.720
<v Speaker 4>of algorithms, maybe even more useful than it is for

0:07:54.840 --> 0:07:58.880
<v Speaker 4>doing graphics. They're great graphics too, And things just kept

0:07:58.960 --> 0:08:02.160
<v Speaker 4>kind of adding to the snowball. So we had deep learning,

0:08:02.600 --> 0:08:06.240
<v Speaker 4>which is sort of a rebrand of neural networks that

0:08:06.280 --> 0:08:08.880
<v Speaker 4>I mentioned from the eighties, and that was enabled again

0:08:08.920 --> 0:08:12.880
<v Speaker 4>by data because we digitalized the world and compute because

0:08:12.920 --> 0:08:15.600
<v Speaker 4>we kept building faster and faster and more powerful computers.

0:08:16.120 --> 0:08:18.800
<v Speaker 4>And then that allowed us to make this this big breakthrough.

0:08:19.080 --> 0:08:23.400
<v Speaker 4>And then you know, more recently, using the same building blocks,

0:08:23.880 --> 0:08:26.800
<v Speaker 4>that inexorable rise of more and more and more data

0:08:27.480 --> 0:08:32.400
<v Speaker 4>met the technology called self supervised learning. Where the key

0:08:32.679 --> 0:08:37.080
<v Speaker 4>difference there in traditional deep learning, you know, for classifying images,

0:08:37.160 --> 0:08:38.680
<v Speaker 4>you know, like is this a cat or is this

0:08:38.720 --> 0:08:43.079
<v Speaker 4>a dog? And a picture those technologies require super vision,

0:08:43.200 --> 0:08:45.920
<v Speaker 4>so you have to take what you have and then

0:08:45.920 --> 0:08:47.160
<v Speaker 4>you have to label it. So you have to take

0:08:47.160 --> 0:08:48.720
<v Speaker 4>a picture of a cat and then you label it

0:08:48.720 --> 0:08:51.199
<v Speaker 4>as a cat, and it turns out that, you know,

0:08:51.240 --> 0:08:53.600
<v Speaker 4>that's very powerful, but it takes a lot of time

0:08:53.640 --> 0:08:56.959
<v Speaker 4>to label gats and to label dogs, and there's only

0:08:57.040 --> 0:08:59.360
<v Speaker 4>so many labels that exist in the world. So what

0:08:59.520 --> 0:09:03.280
<v Speaker 4>really changed more recently is that we have self supervised

0:09:03.320 --> 0:09:05.120
<v Speaker 4>learning where you don't have to have the labels. We

0:09:05.120 --> 0:09:07.720
<v Speaker 4>can just take unannotated data. And what that does is

0:09:07.720 --> 0:09:11.000
<v Speaker 4>it lets you use even more data. And that's really

0:09:11.000 --> 0:09:15.120
<v Speaker 4>what drove this latest sort of rage. And then and

0:09:15.160 --> 0:09:17.040
<v Speaker 4>then all of a sudden we start getting these these

0:09:17.040 --> 0:09:23.120
<v Speaker 4>really powerful models. And then really, this has been simmering technologies, right,

0:09:23.600 --> 0:09:27.760
<v Speaker 4>This has been happening for a while and progressively getting

0:09:27.800 --> 0:09:30.400
<v Speaker 4>more and more powerful. One of the things that really

0:09:30.600 --> 0:09:35.760
<v Speaker 4>happened with CHATGBT and technologies like stable diffusion and mid

0:09:35.880 --> 0:09:39.000
<v Speaker 4>Journey was that they made it visible to the public.

0:09:39.640 --> 0:09:41.600
<v Speaker 4>You know, you put it out there the public can

0:09:41.640 --> 0:09:43.760
<v Speaker 4>touch and feel, and they're like, Wow, not only is

0:09:43.800 --> 0:09:47.000
<v Speaker 4>there palpable change and wow this you know, I can

0:09:47.080 --> 0:09:49.199
<v Speaker 4>talk to this thing. Wow, this thing can generate an image.

0:09:49.559 --> 0:09:52.160
<v Speaker 4>Not only that, but everyone can touch and feel and try.

0:09:53.000 --> 0:09:57.640
<v Speaker 4>My kids can use some of these AI art generation technologies.

0:09:58.200 --> 0:10:01.640
<v Speaker 4>And that's really just launched, you know. It's like a

0:10:02.080 --> 0:10:05.679
<v Speaker 4>propelled slingshot at us into a different regime in terms

0:10:05.720 --> 0:10:07.480
<v Speaker 4>of the public awareness of these technologies.

0:10:08.160 --> 0:10:11.320
<v Speaker 2>You mentioned earlier in the conversation foundation models, and I

0:10:11.360 --> 0:10:13.160
<v Speaker 2>want to talk a little bit about that. I mean,

0:10:13.200 --> 0:10:16.600
<v Speaker 2>can you just tell me, you know, what are foundation

0:10:16.840 --> 0:10:19.600
<v Speaker 2>models for AI and why are they a big deal?

0:10:20.800 --> 0:10:24.480
<v Speaker 4>Yeah? So this term foundation model was coined by a

0:10:24.520 --> 0:10:28.240
<v Speaker 4>group at Stanford, and I think it's actually a really

0:10:28.280 --> 0:10:31.800
<v Speaker 4>apt term because remember I said, you know, one of

0:10:31.840 --> 0:10:35.200
<v Speaker 4>the big things that unlocked this latest excitement was the

0:10:35.200 --> 0:10:38.800
<v Speaker 4>fact that we could use large amounts of unannotated data.

0:10:39.000 --> 0:10:40.680
<v Speaker 4>We could train a model. We don't have to go

0:10:40.720 --> 0:10:44.240
<v Speaker 4>through the painful effort of labeling each and every example.

0:10:44.840 --> 0:10:47.040
<v Speaker 4>You still need to have your model do something you

0:10:47.080 --> 0:10:49.240
<v Speaker 4>wanted to do. You still need to tell it what

0:10:49.280 --> 0:10:50.880
<v Speaker 4>you want to do. You can't just have a model

0:10:50.880 --> 0:10:53.440
<v Speaker 4>that doesn't, you know, have any purpose. But what a

0:10:53.440 --> 0:10:57.280
<v Speaker 4>foundation models that provides a foundation, like a literal foundation,

0:10:57.559 --> 0:10:59.560
<v Speaker 4>you can sort of stand on the shoulders of giants.

0:10:59.600 --> 0:11:02.160
<v Speaker 4>You can have one of these massively trained models and

0:11:02.200 --> 0:11:04.280
<v Speaker 4>then do a little bit on top. You know, you

0:11:04.280 --> 0:11:06.520
<v Speaker 4>could use just a few examples of what you're looking

0:11:06.559 --> 0:11:09.760
<v Speaker 4>for and you can get what you want from the model.

0:11:10.320 --> 0:11:12.240
<v Speaker 4>So just a little bit on top. Now it gets

0:11:12.280 --> 0:11:14.439
<v Speaker 4>to the results that a huge amount of effort used

0:11:14.480 --> 0:11:16.400
<v Speaker 4>to have to put in, you know, to get from

0:11:16.440 --> 0:11:18.640
<v Speaker 4>the ground up to that level.

0:11:18.880 --> 0:11:22.680
<v Speaker 2>I was trying to think of of an analogy for

0:11:22.960 --> 0:11:25.959
<v Speaker 2>sort of foundation models versus what came before, and I

0:11:26.000 --> 0:11:28.440
<v Speaker 2>don't know that I came up with a good one,

0:11:28.480 --> 0:11:30.199
<v Speaker 2>but the best I could do was this. I want

0:11:30.240 --> 0:11:33.160
<v Speaker 2>you to tell me if it's plausible. It's like before

0:11:33.240 --> 0:11:36.800
<v Speaker 2>foundation models, it was like you had these sort of

0:11:36.840 --> 0:11:40.040
<v Speaker 2>single use kitchen appliances. You could make a waffle iron

0:11:40.080 --> 0:11:42.600
<v Speaker 2>if you wanted waffles, or you could make a toaster

0:11:42.760 --> 0:11:45.319
<v Speaker 2>if you wanted to make toast. But a foundation model

0:11:45.400 --> 0:11:47.960
<v Speaker 2>is like like an oven with a range on top.

0:11:48.040 --> 0:11:49.800
<v Speaker 2>So it's like this machine and you could just cook

0:11:49.920 --> 0:11:51.760
<v Speaker 2>anything with this machine.

0:11:52.360 --> 0:11:56.880
<v Speaker 4>Yeah, that's a great analogy. They're very versatile. The other

0:11:57.000 --> 0:11:59.520
<v Speaker 4>piece of it too, is that they dramatically lowered the

0:11:59.600 --> 0:12:02.800
<v Speaker 4>effort that it takes to do something that you want

0:12:02.840 --> 0:12:06.120
<v Speaker 4>to do. And I used to say about the old

0:12:06.160 --> 0:12:07.960
<v Speaker 4>world of AI, would say, you know, the problem with

0:12:08.000 --> 0:12:11.760
<v Speaker 4>automation is that it's too labor intensive, which sounds like

0:12:11.800 --> 0:12:12.680
<v Speaker 4>I'm making a joke.

0:12:12.880 --> 0:12:17.440
<v Speaker 2>Indeed, famously, if automation does one thing, it substitutes machines

0:12:17.559 --> 0:12:20.760
<v Speaker 2>or computing power for labor, right, So what does that

0:12:20.840 --> 0:12:25.160
<v Speaker 2>mean to say AI is or automation is too labor intensive.

0:12:25.640 --> 0:12:27.600
<v Speaker 4>It sounds like I'm making a joke, but I'm actually serious.

0:12:27.840 --> 0:12:31.040
<v Speaker 4>What I mean is that the effort it took the

0:12:31.120 --> 0:12:34.960
<v Speaker 4>old regime to automate something was very, very high. So

0:12:35.160 --> 0:12:38.040
<v Speaker 4>if I need to go and curate all this data,

0:12:38.080 --> 0:12:41.319
<v Speaker 4>collect all this data, and then carefully label all these examples,

0:12:41.720 --> 0:12:46.120
<v Speaker 4>that labeling itself might be incredibly expensive and times, and

0:12:46.120 --> 0:12:48.760
<v Speaker 4>we estimate anywhere between eighty to ninety percent of the

0:12:48.800 --> 0:12:51.720
<v Speaker 4>effort it takes to feel an AI solution actually is

0:12:51.880 --> 0:12:55.199
<v Speaker 4>just spent on data so that that has some consequences,

0:12:55.480 --> 0:13:00.839
<v Speaker 4>which is the threshold for bothering. You know, if you're

0:13:00.880 --> 0:13:03.040
<v Speaker 4>going to only get a little bit of value back

0:13:03.320 --> 0:13:05.520
<v Speaker 4>from something, are you going to go through this huge

0:13:05.559 --> 0:13:09.000
<v Speaker 4>effort to curate all this data, and then when it

0:13:09.040 --> 0:13:11.480
<v Speaker 4>comes time to train the model, you need highly skilled

0:13:11.480 --> 0:13:15.160
<v Speaker 4>people defensive or hard to find in the labor market.

0:13:15.720 --> 0:13:17.240
<v Speaker 4>You know, are you really going to do something that's

0:13:17.240 --> 0:13:19.280
<v Speaker 4>just a tidal incremental thing? Now you're going to do

0:13:19.320 --> 0:13:23.280
<v Speaker 4>the only the highest value things that weren't right level

0:13:23.640 --> 0:13:24.199
<v Speaker 4>because you have.

0:13:24.240 --> 0:13:28.240
<v Speaker 2>To essentially build the whole machine from scratch, and there

0:13:28.240 --> 0:13:30.840
<v Speaker 2>aren't many things where it's worth that much work to

0:13:30.880 --> 0:13:33.840
<v Speaker 2>build a machine that's only going to do one narrow thing.

0:13:34.320 --> 0:13:37.240
<v Speaker 4>That's right, and then you tackle the next problem and

0:13:37.320 --> 0:13:39.640
<v Speaker 4>you basically have to start over. And you know, there

0:13:39.679 --> 0:13:42.480
<v Speaker 4>are some nuances here, like for images, you can pre

0:13:42.520 --> 0:13:45.000
<v Speaker 4>train a model on some other task and change it around.

0:13:45.080 --> 0:13:48.040
<v Speaker 4>So there are some examples of this, like non recurring

0:13:48.120 --> 0:13:50.719
<v Speaker 4>cost that we have in the old world too, But

0:13:50.760 --> 0:13:53.280
<v Speaker 4>by and large, it's just a lot of effort. It's hard.

0:13:53.559 --> 0:13:57.880
<v Speaker 4>It takes you know, a large level of skill to implement.

0:13:58.640 --> 0:14:01.439
<v Speaker 4>One analogy that I like is, you know, think about

0:14:01.440 --> 0:14:03.559
<v Speaker 4>it as you know, you have a river of data,

0:14:03.960 --> 0:14:07.280
<v Speaker 4>you know, running through your company or your institution. Traditional

0:14:07.360 --> 0:14:09.840
<v Speaker 4>AI solutions are kind of like building a dam on

0:14:09.880 --> 0:14:13.320
<v Speaker 4>that river. You know, dams are very expensive things to build.

0:14:13.679 --> 0:14:17.960
<v Speaker 4>They require highly specialized skills and lots of planning. And

0:14:18.120 --> 0:14:19.800
<v Speaker 4>you know, you're only going to put a dam on

0:14:20.240 --> 0:14:22.960
<v Speaker 4>a river that's big enough that you're gonna get enough

0:14:23.040 --> 0:14:24.920
<v Speaker 4>energy out of it that it was worth your trouble.

0:14:25.320 --> 0:14:26.600
<v Speaker 4>You're gonna get a lot of value out of that

0:14:26.680 --> 0:14:28.400
<v Speaker 4>dam if you have a river like that, you know,

0:14:28.480 --> 0:14:32.080
<v Speaker 4>a river of data, but it's actually the vast majority

0:14:32.080 --> 0:14:34.640
<v Speaker 4>of the water you know in your kingdom actually isn't

0:14:34.680 --> 0:14:38.800
<v Speaker 4>in that river. It's in puddles and greeks and babid brooks.

0:14:38.880 --> 0:14:42.360
<v Speaker 4>And you know, there's a lot of value left on

0:14:42.360 --> 0:14:44.960
<v Speaker 4>the table because it's like, well, I can't there's nothing

0:14:44.960 --> 0:14:46.760
<v Speaker 4>you can do about it. It's just that that's too

0:14:47.760 --> 0:14:50.880
<v Speaker 4>low value. So it takes too much effort, so I'm

0:14:50.880 --> 0:14:52.000
<v Speaker 4>just not going to do it. The return on the

0:14:52.000 --> 0:14:54.800
<v Speaker 4>investment just isn't there, so you just end up not

0:14:54.880 --> 0:14:58.120
<v Speaker 4>automating things. It's too much of a pain. Now what

0:14:58.280 --> 0:15:00.720
<v Speaker 4>foundation models do is they say, well, actually, no, we

0:15:00.760 --> 0:15:03.920
<v Speaker 4>can train a base model a foundation that you can

0:15:03.960 --> 0:15:06.240
<v Speaker 4>work on, don't We don't care. We're not specifying what

0:15:06.280 --> 0:15:07.800
<v Speaker 4>the task is ahead of time. We just need to

0:15:08.240 --> 0:15:10.920
<v Speaker 4>learn about the domain of data. So if we want

0:15:10.960 --> 0:15:14.240
<v Speaker 4>to build something that can understand English language, there's a

0:15:14.280 --> 0:15:17.640
<v Speaker 4>ton of English language text available out in the world.

0:15:17.760 --> 0:15:21.560
<v Speaker 4>We can now train models on huge quantities of it.

0:15:22.000 --> 0:15:25.400
<v Speaker 4>And then it learned the structure, It learned how language

0:15:25.600 --> 0:15:27.640
<v Speaker 4>you know, good part of how language works on all

0:15:27.640 --> 0:15:29.920
<v Speaker 4>that unlabeled data. And then when you roll up with

0:15:30.000 --> 0:15:33.760
<v Speaker 4>your task, you know, I want to solve this particular problem.

0:15:34.200 --> 0:15:36.560
<v Speaker 4>You don't have to start from scratch. You're starting from

0:15:36.640 --> 0:15:39.840
<v Speaker 4>a very very very high place. So that just gives

0:15:39.880 --> 0:15:42.440
<v Speaker 4>you the ability to you know, now all of a sudden,

0:15:42.480 --> 0:15:45.640
<v Speaker 4>everything is accessible. All the puddles and greeks and babbling

0:15:45.680 --> 0:15:49.840
<v Speaker 4>brooks and kilopons, you know, those are all accessible now,

0:15:50.360 --> 0:15:53.040
<v Speaker 4>and that's that's very exciting. But it just changes the

0:15:53.040 --> 0:15:55.560
<v Speaker 4>equation on what kinds of problems you could use AI

0:15:55.720 --> 0:15:56.080
<v Speaker 4>to solve.

0:15:56.200 --> 0:16:01.680
<v Speaker 2>And so foundation models basically mean that automating some new

0:16:01.760 --> 0:16:05.000
<v Speaker 2>task is much less labor intensive. The sort of marginal

0:16:05.080 --> 0:16:08.120
<v Speaker 2>effort to do some new automation thing is much lower

0:16:08.120 --> 0:16:11.400
<v Speaker 2>because you're building on top of the foundation model rather

0:16:11.440 --> 0:16:16.200
<v Speaker 2>than starting from scratch. Absolutely, so that is like the

0:16:16.560 --> 0:16:20.720
<v Speaker 2>exciting good news. I do feel like there's a little

0:16:20.760 --> 0:16:23.480
<v Speaker 2>bit of a countervailing idea that's worth talking about here,

0:16:23.520 --> 0:16:25.640
<v Speaker 2>and that is the idea that even though there are

0:16:25.680 --> 0:16:30.280
<v Speaker 2>these foundation models that are really powerful that are relatively

0:16:30.320 --> 0:16:32.880
<v Speaker 2>easy to build on top of, it's still the case

0:16:32.960 --> 0:16:36.240
<v Speaker 2>right that there is not some one size fits all

0:16:36.320 --> 0:16:39.960
<v Speaker 2>foundation model. So you know, what does that mean and

0:16:40.040 --> 0:16:42.560
<v Speaker 2>why is that important to think about in this context?

0:16:43.160 --> 0:16:46.920
<v Speaker 4>Yeah, So we believe very strongly that there isn't just

0:16:47.040 --> 0:16:49.960
<v Speaker 4>one model to rule them all. There's a number of

0:16:49.960 --> 0:16:52.960
<v Speaker 4>reasons why that could be true. One which I think

0:16:53.040 --> 0:16:57.080
<v Speaker 4>is important and very relevant today is how much energy

0:16:57.400 --> 0:17:02.160
<v Speaker 4>these models can consume. So these models, you know, can

0:17:02.200 --> 0:17:07.640
<v Speaker 4>get very, very large. So one thing that we're starting

0:17:07.640 --> 0:17:10.399
<v Speaker 4>to see or starting to believe, is that you probably

0:17:10.400 --> 0:17:15.560
<v Speaker 4>shouldn't use one giant sledgehammer model to solve every single problem,

0:17:15.720 --> 0:17:17.680
<v Speaker 4>you know, like we should pick the right size model

0:17:17.680 --> 0:17:20.480
<v Speaker 4>to solve the problem. We shouldn't necessarily assume that we

0:17:20.560 --> 0:17:25.119
<v Speaker 4>need the biggest, baddest model for every little use case.

0:17:25.560 --> 0:17:27.760
<v Speaker 4>And we're also seeing that, you know, small models that

0:17:27.800 --> 0:17:32.000
<v Speaker 4>are trained like to specialize on particular domains can actually

0:17:32.040 --> 0:17:35.920
<v Speaker 4>outperform much bigger models. So bigger isn't always even better.

0:17:35.960 --> 0:17:38.520
<v Speaker 2>So they're more efficient and they do the thing you

0:17:38.560 --> 0:17:40.200
<v Speaker 2>want them to do better as well.

0:17:40.760 --> 0:17:43.960
<v Speaker 4>That's right. So Stanford, for instance, a group of Stanford

0:17:44.040 --> 0:17:47.880
<v Speaker 4>trained a model was a two point seven billion parameter model,

0:17:47.880 --> 0:17:50.719
<v Speaker 4>which isn't terribly big by today's standards. They trained it

0:17:50.840 --> 0:17:52.760
<v Speaker 4>just on the biomedical literature, you know, this is the

0:17:52.800 --> 0:17:55.560
<v Speaker 4>kind of thing that universities do. And what they showed

0:17:55.680 --> 0:17:59.120
<v Speaker 4>was that this model was better at answering questions about

0:17:59.119 --> 0:18:01.800
<v Speaker 4>the biomedical literacy sure than some models that are one

0:18:01.880 --> 0:18:05.760
<v Speaker 4>hundred billion prouders, you know, many times larger. So it's

0:18:05.800 --> 0:18:08.679
<v Speaker 4>a little bit like you know, asking an expert for

0:18:08.760 --> 0:18:11.840
<v Speaker 4>help on something versus asking the smartest person, you know,

0:18:12.480 --> 0:18:15.280
<v Speaker 4>the smartest person you know, maybe very smart, but they're

0:18:15.320 --> 0:18:18.639
<v Speaker 4>not going to be expertise. And then as an added bonus,

0:18:18.680 --> 0:18:20.680
<v Speaker 4>you know, this is now a much smaller model, it's

0:18:20.760 --> 0:18:23.199
<v Speaker 4>much more efficient to run, we are you know, you know,

0:18:23.240 --> 0:18:27.040
<v Speaker 4>it's cheaper. So there's lots of different advantages there. So

0:18:27.280 --> 0:18:31.119
<v Speaker 4>I think we're going to see attention in the industry

0:18:31.480 --> 0:18:34.200
<v Speaker 4>between vendors that say, hey, this is the one, you know,

0:18:34.280 --> 0:18:36.879
<v Speaker 4>big model, and then others that say, well, actually, you know,

0:18:36.880 --> 0:18:39.439
<v Speaker 4>there's there's you know, lots of different tools we can

0:18:39.560 --> 0:18:41.840
<v Speaker 4>use that all have this nice quality that we outligned

0:18:41.880 --> 0:18:44.199
<v Speaker 4>at the beginning, and then we should really pick the

0:18:44.200 --> 0:18:46.520
<v Speaker 4>one that makes the most sense for the task at hand.

0:18:47.840 --> 0:18:52.199
<v Speaker 2>So there's sustainability basically efficiency. Another kind of set of

0:18:52.240 --> 0:18:54.520
<v Speaker 2>issues that come up a lot with AI A are

0:18:54.680 --> 0:18:58.479
<v Speaker 2>bias hallucination. Can you talk a little bit about bias

0:18:58.720 --> 0:19:01.000
<v Speaker 2>and hallucination, what they are and how you're working to

0:19:01.400 --> 0:19:02.480
<v Speaker 2>mitigate those problems.

0:19:02.920 --> 0:19:05.760
<v Speaker 4>Yeah, so there are lots of issues still as amazing

0:19:05.800 --> 0:19:08.960
<v Speaker 4>as these technologies are, and they are amazing, let's be

0:19:09.080 --> 0:19:11.600
<v Speaker 4>very clear, lots of great things we're going to enable

0:19:11.640 --> 0:19:15.160
<v Speaker 4>with these kinds of technologies. Bias isn't a new problem,

0:19:15.520 --> 0:19:20.119
<v Speaker 4>so you know, basically we've seen this since the beginning

0:19:20.119 --> 0:19:23.040
<v Speaker 4>of AI. If you train a model on data that

0:19:23.440 --> 0:19:25.560
<v Speaker 4>has a bias in it, the model is going to

0:19:25.600 --> 0:19:30.200
<v Speaker 4>recapitulate that bias and it provides its answers. So every time,

0:19:30.359 --> 0:19:32.919
<v Speaker 4>you know, if all the text you have says, you know,

0:19:32.920 --> 0:19:35.960
<v Speaker 4>it's more likely to refer to female nurses and male scientists,

0:19:36.080 --> 0:19:38.760
<v Speaker 4>then you're going to get models that you know. For instance,

0:19:39.080 --> 0:19:41.960
<v Speaker 4>there was an example where a machine learning based translation

0:19:42.040 --> 0:19:46.840
<v Speaker 4>system translated from Hungarian to English. Hungarian doesn't have gender pronouns.

0:19:46.960 --> 0:19:49.520
<v Speaker 4>English does, and when you ask them to translate, it

0:19:49.560 --> 0:19:52.560
<v Speaker 4>would translate they are a nurse to she is a nurse.

0:19:53.200 --> 0:19:55.680
<v Speaker 4>Translate they are a scientist, to he is a scientist.

0:19:55.920 --> 0:19:58.960
<v Speaker 4>And that's not because the people who wrote the algorithm

0:19:58.960 --> 0:20:01.520
<v Speaker 4>we're building in bio and coding in like, oh, it's

0:20:01.520 --> 0:20:03.320
<v Speaker 4>got to be this way. It's because the data was

0:20:03.359 --> 0:20:06.080
<v Speaker 4>like that. You know, we have biases in our society

0:20:06.560 --> 0:20:10.119
<v Speaker 4>and they're reflected in our data and our text and

0:20:10.200 --> 0:20:14.640
<v Speaker 4>our images everywhere. And then the models they're just mapping

0:20:14.680 --> 0:20:16.560
<v Speaker 4>from what they've what they've seen in their training data

0:20:16.640 --> 0:20:19.000
<v Speaker 4>to to the result that you're trying to get them

0:20:19.000 --> 0:20:21.880
<v Speaker 4>to do and to give, and then these biases come out.

0:20:22.000 --> 0:20:27.439
<v Speaker 4>So there's a very active program of research in you know,

0:20:27.480 --> 0:20:30.280
<v Speaker 4>we we do quite a bit at IBM research and I,

0:20:31.000 --> 0:20:34.240
<v Speaker 4>but also all over the community and industry in academia

0:20:34.280 --> 0:20:37.840
<v Speaker 4>trying to figure out how do we explicitly remove these biases,

0:20:37.880 --> 0:20:40.080
<v Speaker 4>how do we identify them, how do you know, how

0:20:40.080 --> 0:20:42.320
<v Speaker 4>do we build tools that allow people to audit their

0:20:42.359 --> 0:20:44.919
<v Speaker 4>systems to make sure they aren't biased. So this is

0:20:44.960 --> 0:20:47.040
<v Speaker 4>a really important thing, and you know, again this was

0:20:47.080 --> 0:20:51.600
<v Speaker 4>here since the beginning, you know, of machine learning and AI,

0:20:52.160 --> 0:20:55.640
<v Speaker 4>but foundation models and large language models and generative AI

0:20:56.600 --> 0:20:59.360
<v Speaker 4>just bring it into sharper even sharper focus because there's

0:20:59.359 --> 0:21:02.880
<v Speaker 4>just so much and it's sort of building in baking

0:21:02.920 --> 0:21:06.160
<v Speaker 4>in all these different biases we have. So that's that's

0:21:06.200 --> 0:21:10.000
<v Speaker 4>absolutely a problem that these models have. Another one that

0:21:10.040 --> 0:21:13.879
<v Speaker 4>you mentioned was hallucinations. So even the most impressive of

0:21:13.880 --> 0:21:17.960
<v Speaker 4>our models will often just make stuff up. You know,

0:21:18.000 --> 0:21:21.159
<v Speaker 4>the technical term that the heels chosen as is hallucination.

0:21:21.760 --> 0:21:24.719
<v Speaker 4>To give you an example, I asked chat tbt to

0:21:24.960 --> 0:21:28.760
<v Speaker 4>create a biography of David Cox IBM, and you know,

0:21:29.000 --> 0:21:31.560
<v Speaker 4>it started off really well. You know, they identified that

0:21:31.600 --> 0:21:34.040
<v Speaker 4>I was the director of the mt IBM Watson and

0:21:34.040 --> 0:21:36.440
<v Speaker 4>said a few words about that, and then it proceeded

0:21:36.480 --> 0:21:41.040
<v Speaker 4>to create an authoritative but completely fake biography of me.

0:21:41.080 --> 0:21:43.560
<v Speaker 4>Where I was British, I was born in the UK.

0:21:44.960 --> 0:21:47.880
<v Speaker 4>I went to British university, you know universities in the UK.

0:21:47.960 --> 0:21:51.359
<v Speaker 2>I was professor the authority, right, it's the certainty that

0:21:51.359 --> 0:21:54.600
<v Speaker 2>that is weird about it, right, It's it's dead certain

0:21:54.640 --> 0:21:56.520
<v Speaker 2>that you're from the UK, et cetera.

0:21:57.080 --> 0:22:00.119
<v Speaker 4>Absolutely, yeah, it has all kinds of flourishes like I

0:22:00.240 --> 0:22:04.920
<v Speaker 4>want words in the UK. So yeah, it's it's problematic

0:22:04.960 --> 0:22:07.800
<v Speaker 4>because it kind of pokes at a lot of weak spots

0:22:07.840 --> 0:22:13.040
<v Speaker 4>in our human psychology where if something sounds coherent, we're

0:22:13.119 --> 0:22:15.880
<v Speaker 4>likely to assume it's true. We're not used to interacting

0:22:15.880 --> 0:22:20.000
<v Speaker 4>with people who eloquently and authoritatively you know, emit complete

0:22:20.119 --> 0:22:23.080
<v Speaker 4>nonsense like yeah, you know, you know, we can debate

0:22:23.080 --> 0:22:23.399
<v Speaker 4>about that.

0:22:23.359 --> 0:22:25.399
<v Speaker 2>But yeah, we can debate about that, but yes it

0:22:25.800 --> 0:22:30.080
<v Speaker 2>the sort of blive confidence throws you off when you

0:22:30.080 --> 0:22:31.399
<v Speaker 2>realize it's completely wrong.

0:22:31.520 --> 0:22:34.120
<v Speaker 4>Right, that's right. And and we do have a little

0:22:34.119 --> 0:22:36.760
<v Speaker 4>bit of like a great and powerful AWS sort of

0:22:37.160 --> 0:22:39.560
<v Speaker 4>vibe going sometimes where we're like, well, you know, the

0:22:39.600 --> 0:22:43.320
<v Speaker 4>AI is all knowing and therefore whatever it says must

0:22:43.400 --> 0:22:45.679
<v Speaker 4>be true. But but these things will make up stuff,

0:22:45.920 --> 0:22:50.639
<v Speaker 4>you know, very aggressively, and you know, you everyone can

0:22:50.680 --> 0:22:53.440
<v Speaker 4>try asking it for their their bio. You you'll you'll

0:22:53.440 --> 0:22:55.680
<v Speaker 4>get something that you'll always get something that's of the

0:22:55.760 --> 0:22:58.560
<v Speaker 4>right form, that has the right tone. But you know,

0:22:58.600 --> 0:23:02.119
<v Speaker 4>the facts just aren't necessarily there. So that's obviously a problem.

0:23:02.240 --> 0:23:04.080
<v Speaker 4>We need to figure out how to close those gaps,

0:23:04.080 --> 0:23:06.919
<v Speaker 4>fix those problems there's lots of ways we could use

0:23:06.920 --> 0:23:08.320
<v Speaker 4>them much more easily.

0:23:08.840 --> 0:23:11.600
<v Speaker 3>I'd just like to say, faced with the awesome potential

0:23:11.640 --> 0:23:14.680
<v Speaker 3>of what these technologies might do, it's a bit encouraging

0:23:14.720 --> 0:23:18.199
<v Speaker 3>to hear that even chat GPT has a weakness for

0:23:18.359 --> 0:23:23.440
<v Speaker 3>inventing flamboyant, if fictional versions of people's lives. And while

0:23:23.600 --> 0:23:27.119
<v Speaker 3>entertaining ourselves with chat GPT and mid journey is important,

0:23:27.600 --> 0:23:31.679
<v Speaker 3>the way lay people use consumer facing chatbots and generative

0:23:31.760 --> 0:23:36.040
<v Speaker 3>AI is just fundamentally different from the way an enterprise

0:23:36.119 --> 0:23:39.600
<v Speaker 3>business uses AI. How can we harness the abilities of

0:23:39.720 --> 0:23:43.080
<v Speaker 3>artificial intelligence to help us solve the problems we face

0:23:43.160 --> 0:23:46.800
<v Speaker 3>in business and technology. Let's listen on as David and

0:23:46.880 --> 0:23:48.719
<v Speaker 3>Jacob continue their conversation.

0:23:49.480 --> 0:23:52.439
<v Speaker 2>We've been talking in a somewhat abstract way about AI

0:23:52.560 --> 0:23:55.280
<v Speaker 2>in the ways it can be used. Let's talk in

0:23:55.320 --> 0:23:58.040
<v Speaker 2>a little bit more of a specific way. Can you

0:23:58.680 --> 0:24:02.520
<v Speaker 2>just talk about some examples of business challenges that can

0:24:02.560 --> 0:24:05.960
<v Speaker 2>be solved with automation, with this kind of automation.

0:24:05.720 --> 0:24:09.640
<v Speaker 4>We're talking about, Yeah, so there really really this guy's

0:24:09.680 --> 0:24:13.600
<v Speaker 4>the limit. There's a whole set of different applications that

0:24:13.840 --> 0:24:16.479
<v Speaker 4>these models are really good at, and basically it's a

0:24:16.520 --> 0:24:18.800
<v Speaker 4>super set of everything we used to use AI for

0:24:19.080 --> 0:24:22.320
<v Speaker 4>in business. So you know, the simple kinds of things

0:24:22.320 --> 0:24:24.040
<v Speaker 4>are like, hey, if I have text and i'm you know,

0:24:24.040 --> 0:24:26.760
<v Speaker 4>I have product reviews and I want to be able

0:24:26.800 --> 0:24:28.679
<v Speaker 4>to tell if these are positive or negative. You know,

0:24:28.760 --> 0:24:30.440
<v Speaker 4>like let's look at all the negative reviews so we

0:24:30.480 --> 0:24:32.360
<v Speaker 4>can have a human look through them and see what

0:24:32.440 --> 0:24:36.080
<v Speaker 4>was up. Very common business use case. You can do

0:24:36.080 --> 0:24:40.080
<v Speaker 4>it with traditional deep learning based AI. So so there's

0:24:40.119 --> 0:24:42.240
<v Speaker 4>things like that that are you know, it's very prosaic,

0:24:42.280 --> 0:24:43.880
<v Speaker 4>so that we were already doing it. We've been doing

0:24:43.920 --> 0:24:47.320
<v Speaker 4>it for a long time. Then you get situations that

0:24:47.359 --> 0:24:50.240
<v Speaker 4>are that were harder for the old AI, like if

0:24:50.280 --> 0:24:53.240
<v Speaker 4>i'm I want to compress something like I want to

0:24:53.440 --> 0:24:55.679
<v Speaker 4>I have like I have a chat transcript, Like a

0:24:55.720 --> 0:25:00.800
<v Speaker 4>customer called in and they had a complaint. They called back. Okay,

0:25:00.880 --> 0:25:03.640
<v Speaker 4>now a new you know, a person on the line

0:25:03.640 --> 0:25:05.960
<v Speaker 4>needs to go read the old transcript to catch up.

0:25:06.359 --> 0:25:08.760
<v Speaker 4>Wouldn't it be better if we could just summarize that,

0:25:08.960 --> 0:25:11.280
<v Speaker 4>just condense it all down a quick little paragraph, you know,

0:25:11.359 --> 0:25:13.440
<v Speaker 4>customer call they were upset about this, rather than having

0:25:13.440 --> 0:25:15.760
<v Speaker 4>to read the blow by blow. There's just lots of

0:25:15.760 --> 0:25:18.919
<v Speaker 4>settings like that where summarization is really helpful. Hey, you

0:25:18.920 --> 0:25:22.720
<v Speaker 4>have a meeting and I'd like to just automatically, you know,

0:25:22.800 --> 0:25:25.240
<v Speaker 4>have have that meeting or that email or whatever. I'd

0:25:25.240 --> 0:25:26.680
<v Speaker 4>like to just have a condensed down so I can

0:25:26.720 --> 0:25:29.480
<v Speaker 4>really quickly get to the heart of the matter. These

0:25:29.480 --> 0:25:32.000
<v Speaker 4>models are are really good at doing that. They're also

0:25:32.000 --> 0:25:34.640
<v Speaker 4>a really good at question answering. So if I want

0:25:34.680 --> 0:25:37.040
<v Speaker 4>to find out what's how many vacation days do I have?

0:25:37.359 --> 0:25:41.760
<v Speaker 4>I can now interact in natural language with a system

0:25:41.840 --> 0:25:45.040
<v Speaker 4>that can go and that has access to our HR policies,

0:25:45.240 --> 0:25:47.040
<v Speaker 4>and I can actually have a you know, a multi

0:25:47.080 --> 0:25:49.520
<v Speaker 4>turn conversation where I can, you know, like I would

0:25:49.520 --> 0:25:53.480
<v Speaker 4>have with you know, somebody, you know, actual HR professional

0:25:53.640 --> 0:25:58.080
<v Speaker 4>or customer service representative. So a big part, you know,

0:25:58.520 --> 0:26:01.640
<v Speaker 4>of what this is doing is it's it's putting an interface.

0:26:01.720 --> 0:26:03.840
<v Speaker 4>You know, when we think of computer interfaces, we're usually

0:26:03.840 --> 0:26:07.359
<v Speaker 4>thinking about UI user interface elements where I click on

0:26:07.520 --> 0:26:10.920
<v Speaker 4>menus and there's buttons and all this stuff. Increasingly, now

0:26:11.160 --> 0:26:14.360
<v Speaker 4>we can just talk, you know, you just in words.

0:26:14.440 --> 0:26:16.320
<v Speaker 4>You can describe what you want you want to answer,

0:26:16.480 --> 0:26:19.080
<v Speaker 4>ask a question, you want to sort of command the

0:26:19.080 --> 0:26:21.959
<v Speaker 4>system to do something. Rather than having to learn how

0:26:21.960 --> 0:26:24.040
<v Speaker 4>to do that clicking buttons, which might be inefficient. Now

0:26:24.040 --> 0:26:25.680
<v Speaker 4>we can just sort of spell it out.

0:26:26.200 --> 0:26:29.200
<v Speaker 2>Interesting, right, the graphical user interface that we all sort

0:26:29.240 --> 0:26:32.560
<v Speaker 2>of default to, that's not like the state of nature, Right,

0:26:32.600 --> 0:26:35.119
<v Speaker 2>That's a thing that was invented and just came to

0:26:35.160 --> 0:26:37.560
<v Speaker 2>be the standard way that we interact with computers. And

0:26:37.600 --> 0:26:42.080
<v Speaker 2>so you could imagine, as you're saying, like chat essentially

0:26:42.240 --> 0:26:45.520
<v Speaker 2>chatting with the machine could could become a sort of

0:26:45.560 --> 0:26:48.640
<v Speaker 2>standard user interface, just like the graphical user interface, did

0:26:49.000 --> 0:26:50.400
<v Speaker 2>you know over the past several decades.

0:26:50.880 --> 0:26:54.280
<v Speaker 4>Absolutely, And I think those kinds of conversational interfaces are

0:26:54.320 --> 0:26:58.520
<v Speaker 4>going to be hugely important for increasing our productivity. It's

0:26:58.560 --> 0:27:00.359
<v Speaker 4>just a lot easier if I i kind I have

0:27:00.400 --> 0:27:02.000
<v Speaker 4>to learn how to use a tool, or I don't

0:27:02.040 --> 0:27:04.800
<v Speaker 4>have to kind of have awkward, you know, interactions for

0:27:04.880 --> 0:27:06.240
<v Speaker 4>the computer. I can just tell it what I want

0:27:06.240 --> 0:27:08.600
<v Speaker 4>and I can understand it. Could you know, potentially even

0:27:08.880 --> 0:27:11.480
<v Speaker 4>ask questions back to clarify and have those kinds of

0:27:11.480 --> 0:27:16.080
<v Speaker 4>conversations that can be extremely powerful. And in fact, one

0:27:16.119 --> 0:27:18.720
<v Speaker 4>area where that's going to I think be absolutely game

0:27:18.800 --> 0:27:21.879
<v Speaker 4>changing is in code. When we write code, you know,

0:27:22.400 --> 0:27:26.159
<v Speaker 4>programming languages are a way for us to sort of

0:27:26.760 --> 0:27:30.960
<v Speaker 4>match between our very sloppy way of talking and the

0:27:31.080 --> 0:27:33.199
<v Speaker 4>very exact way that you need to command a computer

0:27:33.320 --> 0:27:36.400
<v Speaker 4>to do what you wanted to do. They're cumbersome to learn,

0:27:36.480 --> 0:27:38.840
<v Speaker 4>they can you know, create very complex systems that are

0:27:38.920 --> 0:27:41.800
<v Speaker 4>very hard to reason about. And we're already starting to

0:27:41.840 --> 0:27:43.880
<v Speaker 4>see the ability to just write down what you want

0:27:43.960 --> 0:27:46.760
<v Speaker 4>and AI will generate the code for you. And I

0:27:46.760 --> 0:27:48.560
<v Speaker 4>think we're just going to see a huge revolution of

0:27:48.640 --> 0:27:50.520
<v Speaker 4>like we just converse, you know, we can have a

0:27:50.560 --> 0:27:53.400
<v Speaker 4>conversation to say what we want, and then the computer

0:27:53.440 --> 0:27:57.000
<v Speaker 4>can actually not only do fixed actions and do things

0:27:57.000 --> 0:27:58.960
<v Speaker 4>for us, but it can actually even write code to

0:27:59.000 --> 0:28:02.440
<v Speaker 4>do new things, you know, and generated software itself. Given

0:28:02.440 --> 0:28:05.040
<v Speaker 4>how much software we have, how much craving we have

0:28:05.119 --> 0:28:07.720
<v Speaker 4>for software, like we will never have enough software in

0:28:07.720 --> 0:28:10.520
<v Speaker 4>our world, uh, you know, the ability to have a

0:28:10.760 --> 0:28:13.879
<v Speaker 4>systems as a helper in that, I think we're going

0:28:13.960 --> 0:28:15.760
<v Speaker 4>to see a lot of a lot of value there.

0:28:17.000 --> 0:28:19.600
<v Speaker 2>So if you if you think about the different ways

0:28:20.240 --> 0:28:22.440
<v Speaker 2>AI might be applied to business, I mean you've talked

0:28:22.440 --> 0:28:24.800
<v Speaker 2>about a number of the sort of classic use cases.

0:28:25.480 --> 0:28:28.840
<v Speaker 2>What are some of the more out there use cases.

0:28:28.880 --> 0:28:31.760
<v Speaker 2>What are some you know, unique ways you could imagine

0:28:31.800 --> 0:28:33.560
<v Speaker 2>AI being applied to business.

0:28:35.240 --> 0:28:37.919
<v Speaker 4>Yeah, there's really disguised the limit. I mean, we have

0:28:38.000 --> 0:28:40.240
<v Speaker 4>one project that I'm kind of a fan of where

0:28:40.880 --> 0:28:44.720
<v Speaker 4>we actually were working with a mechanical engineering professor at MIT,

0:28:45.440 --> 0:28:48.160
<v Speaker 4>working on a classic problem, how do you build linkage

0:28:48.200 --> 0:28:52.440
<v Speaker 4>systems which like can imagine bars and joints and overs,

0:28:52.760 --> 0:28:53.720
<v Speaker 4>you know, the things that are.

0:28:53.600 --> 0:28:56.480
<v Speaker 2>Building a thing, building a physical machine of some.

0:28:56.640 --> 0:29:01.720
<v Speaker 4>Kind of like real like metal and you know nineteenth

0:29:01.800 --> 0:29:05.479
<v Speaker 4>century just old school industrial revolution. Yeah yeah, yeah, but

0:29:05.600 --> 0:29:07.920
<v Speaker 4>you know the little arm that's that's holding up my

0:29:07.920 --> 0:29:10.880
<v Speaker 4>microphone in front of me, cranes, get build your buildings,

0:29:10.880 --> 0:29:13.440
<v Speaker 4>you know, parts of your engines. This is like classical stuff.

0:29:13.600 --> 0:29:15.920
<v Speaker 4>It turns out that you know, humans, if you want

0:29:15.920 --> 0:29:19.040
<v Speaker 4>to build an advanced system, you decide what like curve

0:29:19.120 --> 0:29:21.800
<v Speaker 4>you want to create, and then a human together with

0:29:21.840 --> 0:29:25.720
<v Speaker 4>a computer program, can build a five or six bar linkage.

0:29:25.800 --> 0:29:27.240
<v Speaker 4>And then that's kind of where you top out it

0:29:27.280 --> 0:29:30.200
<v Speaker 4>because it gets too complicated to work more than that.

0:29:30.840 --> 0:29:33.320
<v Speaker 4>We built a generative AI system that can build twenty

0:29:33.320 --> 0:29:36.680
<v Speaker 4>bar linkages, like arbitrarily complex. So these are machines that

0:29:36.720 --> 0:29:41.080
<v Speaker 4>are beyond the capability of a human to design themselves.

0:29:41.600 --> 0:29:44.560
<v Speaker 4>Another example, we have an AI system that can generate

0:29:44.720 --> 0:29:47.120
<v Speaker 4>electronic circuits. You know, we had a project where we're

0:29:47.120 --> 0:29:49.840
<v Speaker 4>working where we were building better power converters which allow

0:29:50.080 --> 0:29:54.200
<v Speaker 4>our computers and our devices to be more efficient, save energy,

0:29:54.960 --> 0:29:57.680
<v Speaker 4>you know, less less carbon ote. But I think the

0:29:57.680 --> 0:30:00.920
<v Speaker 4>world around us has always been shaped technology. If you

0:30:00.960 --> 0:30:03.240
<v Speaker 4>look around, you know, just think about how many steps

0:30:03.240 --> 0:30:05.560
<v Speaker 4>and how many people, and how many designs went into

0:30:05.560 --> 0:30:09.120
<v Speaker 4>the table and the chair and the WAYMP It's it's

0:30:09.120 --> 0:30:12.600
<v Speaker 4>really just astonishing. And that's already you know, the fruit

0:30:12.680 --> 0:30:15.880
<v Speaker 4>of automation and computers and those kinds of tools. But

0:30:15.920 --> 0:30:19.360
<v Speaker 4>we're going to see that increasingly be product also of AI.

0:30:19.480 --> 0:30:21.480
<v Speaker 4>It's just going to be everywhere around us. Everything we

0:30:21.640 --> 0:30:24.280
<v Speaker 4>touch is going to have to you know, helped in

0:30:24.320 --> 0:30:26.480
<v Speaker 4>some way to get get to you by.

0:30:27.440 --> 0:30:30.000
<v Speaker 2>You know, that is a pretty profound transformation that you're

0:30:30.040 --> 0:30:32.720
<v Speaker 2>talking about in business. How do you think about the

0:30:32.760 --> 0:30:35.840
<v Speaker 2>implications of that both for the sort of you know,

0:30:36.120 --> 0:30:39.280
<v Speaker 2>business itself and also for employees.

0:30:41.000 --> 0:30:44.000
<v Speaker 4>Yeah, so I think for businesses, this is going to

0:30:44.400 --> 0:30:48.280
<v Speaker 4>cut costs, make new opportunities to like customers, you know,

0:30:48.360 --> 0:30:51.960
<v Speaker 4>like there's just you know, it's sort of all upside right,

0:30:52.040 --> 0:30:54.840
<v Speaker 4>like for the for the workers, I think the story

0:30:54.920 --> 0:30:57.880
<v Speaker 4>is mostly good too. You know, like how many things

0:30:57.880 --> 0:31:01.200
<v Speaker 4>do you do in your day that you'd really rather

0:31:01.400 --> 0:31:03.960
<v Speaker 4>not right, you know, and we're used to having things

0:31:03.960 --> 0:31:07.680
<v Speaker 4>we don't like automated away, you know, we didn't you know,

0:31:07.680 --> 0:31:10.280
<v Speaker 4>if you didn't like walking many miles to work, then

0:31:10.320 --> 0:31:12.080
<v Speaker 4>you know, like you can have a car and you

0:31:12.080 --> 0:31:14.560
<v Speaker 4>can drive there. Or we used to have a huge

0:31:14.560 --> 0:31:18.000
<v Speaker 4>traction over ninety percent of the US population engaged in agriculture,

0:31:18.040 --> 0:31:20.680
<v Speaker 4>and then we mechanized it. How very few people work

0:31:20.680 --> 0:31:22.600
<v Speaker 4>in agriculture. A small number of people can do the

0:31:22.640 --> 0:31:25.040
<v Speaker 4>work of a large number of people. And then you know,

0:31:25.120 --> 0:31:28.480
<v Speaker 4>things like email, and yeah, they've led to huge productivity

0:31:28.520 --> 0:31:31.040
<v Speaker 4>enhancements because I don't need to be writing letters and

0:31:31.080 --> 0:31:33.520
<v Speaker 4>sending them in the mail. I can just instantly communicate

0:31:33.520 --> 0:31:37.680
<v Speaker 4>with people. We just become more effective, Like our jobs

0:31:37.720 --> 0:31:41.920
<v Speaker 4>have transformed, whether it's a physical job like agriculture, or

0:31:42.120 --> 0:31:44.800
<v Speaker 4>whether it's a knowledge worker job where you're sending emails

0:31:44.880 --> 0:31:49.360
<v Speaker 4>and communicating with people and coordinating teams. We've just gotten better.

0:31:49.520 --> 0:31:51.880
<v Speaker 4>And you know, the technology has just made us more productive.

0:31:51.960 --> 0:31:54.920
<v Speaker 4>And this is just another example. Now, you know, there

0:31:54.920 --> 0:31:57.560
<v Speaker 4>are people who worry that you know, will be so

0:31:57.640 --> 0:32:01.440
<v Speaker 4>good at that that maybe jobs will be displayed, and

0:32:00.800 --> 0:32:05.800
<v Speaker 4>that's a legitimate concern. But just like how in agriculture,

0:32:05.880 --> 0:32:07.800
<v Speaker 4>you know, it's not like suddenly we had ninety percent

0:32:07.800 --> 0:32:12.560
<v Speaker 4>of the population unemployed. You know, people transitioned to other jobs.

0:32:13.160 --> 0:32:15.240
<v Speaker 4>And the other thing that we've found too, is that

0:32:15.840 --> 0:32:20.200
<v Speaker 4>our appetite for doing more things is as humans is

0:32:20.520 --> 0:32:24.160
<v Speaker 4>sort of insatiable. So even if we can dramatically increase

0:32:24.160 --> 0:32:27.080
<v Speaker 4>how much you know, one human can do, that doesn't

0:32:27.080 --> 0:32:29.560
<v Speaker 4>necessarily mean we're going to do a fixed amount of stuff.

0:32:29.720 --> 0:32:31.560
<v Speaker 4>There's an appetite to have even more, so we're going

0:32:31.560 --> 0:32:34.000
<v Speaker 4>to you can continue to grow grow the pie. So

0:32:34.160 --> 0:32:36.640
<v Speaker 4>I think at least certainly in the near term, you know,

0:32:36.640 --> 0:32:38.320
<v Speaker 4>we're going to see a lot of drudgery go away

0:32:38.320 --> 0:32:40.880
<v Speaker 4>from work. We're going to see people to be able

0:32:40.920 --> 0:32:43.880
<v Speaker 4>to be more effective at their jobs. You know, we

0:32:43.880 --> 0:32:47.400
<v Speaker 4>will see some transformation in jobs and what like. But

0:32:47.480 --> 0:32:52.200
<v Speaker 4>we've seen that before, and the technology a least has

0:32:52.240 --> 0:32:54.320
<v Speaker 4>the potential to make our lives a lot easier.

0:32:55.560 --> 0:33:01.360
<v Speaker 2>So IBEM recently launched Watson X which includessx dot AI.

0:33:01.920 --> 0:33:03.880
<v Speaker 2>Tell me about that, tell me about you know what

0:33:03.920 --> 0:33:06.239
<v Speaker 2>it is, and the new possibilities that it opens up.

0:33:07.160 --> 0:33:11.480
<v Speaker 4>Yeah, so Watson X is obviously a bit of a

0:33:11.760 --> 0:33:15.960
<v Speaker 4>new branding on the Watson brand. TJ. Watson that was

0:33:15.960 --> 0:33:20.160
<v Speaker 4>the founder of IBM and our EI technologies have had

0:33:20.200 --> 0:33:24.800
<v Speaker 4>the Watson brand. Watson X is a recognition that there's

0:33:24.840 --> 0:33:27.480
<v Speaker 4>something new, there's something that actually has changed the game.

0:33:28.080 --> 0:33:31.720
<v Speaker 4>We've gone from this old world of automation is to

0:33:31.880 --> 0:33:35.400
<v Speaker 4>labor intensive to this new world of possibilities where it's

0:33:35.480 --> 0:33:39.840
<v Speaker 4>much easier to use AI. And what Watson X does

0:33:40.000 --> 0:33:44.400
<v Speaker 4>it brings together tools for businesses to harness that power.

0:33:44.840 --> 0:33:49.720
<v Speaker 4>So whattsonex dot AI foundation models that our customers can use.

0:33:49.800 --> 0:33:52.840
<v Speaker 4>It includes tools that make it easy to run, easy

0:33:52.920 --> 0:33:57.280
<v Speaker 4>to deploy, easy to experiment. There's a watsonex dot Data

0:33:57.600 --> 0:34:01.080
<v Speaker 4>component which allows you to sort of organize and access

0:34:01.080 --> 0:34:03.160
<v Speaker 4>to your data. So what we're really trying to do

0:34:03.240 --> 0:34:08.200
<v Speaker 4>is give our customers a cohesive set of tools to

0:34:08.239 --> 0:34:11.439
<v Speaker 4>harness the value of these technologies and at the same

0:34:11.480 --> 0:34:14.479
<v Speaker 4>time be able to manage the risks and other things

0:34:14.520 --> 0:34:16.400
<v Speaker 4>that you have to keep an eye on in an

0:34:16.520 --> 0:34:17.480
<v Speaker 4>enterprise context.

0:34:19.160 --> 0:34:22.200
<v Speaker 2>So we talk about the guests on this show as

0:34:22.360 --> 0:34:26.440
<v Speaker 2>new creators, by which we mean people who are creatively

0:34:26.480 --> 0:34:31.360
<v Speaker 2>applying technology in business to drive change. And I'm curious

0:34:31.880 --> 0:34:36.560
<v Speaker 2>how creativity plays a role in the research that you do.

0:34:37.160 --> 0:34:41.759
<v Speaker 4>I honestly, I think the creative aspects of this job,

0:34:42.200 --> 0:34:45.520
<v Speaker 4>this is what makes this work exciting. You know, I

0:34:45.520 --> 0:34:47.480
<v Speaker 4>should say, you know, the folks who work at my

0:34:47.600 --> 0:34:50.680
<v Speaker 4>organization are doing the creating, and I.

0:34:50.640 --> 0:34:54.200
<v Speaker 2>Guess you're doing the managing so that they could do

0:34:54.239 --> 0:34:54.760
<v Speaker 2>the creator.

0:34:55.640 --> 0:34:59.040
<v Speaker 4>I'm helping them be their best and I still get

0:34:59.080 --> 0:35:01.960
<v Speaker 4>to get involved in the weeds of the research as

0:35:02.040 --> 0:35:04.839
<v Speaker 4>much as I can. But you know, there's something really

0:35:04.840 --> 0:35:08.719
<v Speaker 4>exciting about inventing. You know, like one of the nice

0:35:08.719 --> 0:35:12.279
<v Speaker 4>things about doing invention and doing research on AI in

0:35:12.400 --> 0:35:15.359
<v Speaker 4>industry is it's usually grounded and a real problem that

0:35:15.520 --> 0:35:18.480
<v Speaker 4>somebody's having. You know, a customer wants to solve this problem.

0:35:18.560 --> 0:35:22.080
<v Speaker 4>It's losing money or there would be a new opportunity.

0:35:22.360 --> 0:35:26.799
<v Speaker 4>You identify that problem and then you build something that's

0:35:26.840 --> 0:35:29.040
<v Speaker 4>never been built before to do that. And I think

0:35:29.080 --> 0:35:32.879
<v Speaker 4>that's honestly the adrenaline rush that keeps all of us

0:35:33.400 --> 0:35:35.880
<v Speaker 4>in this field. How do you do something that nobody

0:35:35.880 --> 0:35:39.799
<v Speaker 4>else on earth has done before or tried before, So

0:35:39.840 --> 0:35:43.279
<v Speaker 4>that that kind of creativity, and there's also creativity as well,

0:35:43.360 --> 0:35:46.600
<v Speaker 4>and identifying what those problems are, being able to understand

0:35:47.280 --> 0:35:52.040
<v Speaker 4>the places where you know, the technology is close enough

0:35:52.280 --> 0:35:56.560
<v Speaker 4>to solving a problem, and doing that matchmaking between problems

0:35:56.560 --> 0:35:59.279
<v Speaker 4>that are now solvable, you know, and in AI, where

0:35:59.280 --> 0:36:02.320
<v Speaker 4>the field is moving so fast, this is constantly growing

0:36:02.400 --> 0:36:05.440
<v Speaker 4>horizon of things that we might be able to solve.

0:36:05.760 --> 0:36:08.560
<v Speaker 4>So that matchmaking, I think, is also a really interesting

0:36:08.640 --> 0:36:12.279
<v Speaker 4>creative problem. So I think I think that's that's that's

0:36:12.280 --> 0:36:15.239
<v Speaker 4>why it's so much fun, and it's a fun environment

0:36:15.320 --> 0:36:17.719
<v Speaker 4>we have here too. It's you know, people drawing on

0:36:17.760 --> 0:36:22.279
<v Speaker 4>whiteboards and writing on pages of math and you.

0:36:22.239 --> 0:36:24.879
<v Speaker 2>Know, like in a movie, like in a movie, Yeah,

0:36:24.920 --> 0:36:27.720
<v Speaker 2>straight from special casting drawing, the drawing on the window,

0:36:27.760 --> 0:36:33.080
<v Speaker 2>writing on the window in sharp absolutely. So, so let's

0:36:33.160 --> 0:36:38.120
<v Speaker 2>close with the really long view. How do you imagine

0:36:38.360 --> 0:36:42.600
<v Speaker 2>AI and people working together twenty years from now?

0:36:44.680 --> 0:36:49.279
<v Speaker 4>Yeah, it's really hard to make predictions. The vision that

0:36:49.840 --> 0:36:56.239
<v Speaker 4>I like, actually this came from an MIT economist named

0:36:56.320 --> 0:37:01.279
<v Speaker 4>David Attar, which was imagine a I almost as a

0:37:01.360 --> 0:37:06.120
<v Speaker 4>natural resource you know, we know how natural resources work, right,

0:37:06.280 --> 0:37:08.000
<v Speaker 4>Like there's an ore we can dig up out of

0:37:08.000 --> 0:37:10.560
<v Speaker 4>the earth that comes from kind of springs from the earth.

0:37:10.680 --> 0:37:13.640
<v Speaker 4>Or we usually think of that in terms of physical stuff.

0:37:14.280 --> 0:37:15.880
<v Speaker 4>With AI, you can almost think of it as like

0:37:15.960 --> 0:37:18.799
<v Speaker 4>there's a new kind of abundance potentially twenty years from

0:37:18.840 --> 0:37:21.480
<v Speaker 4>now where not only can we have things we can

0:37:21.480 --> 0:37:24.080
<v Speaker 4>build or eat or use or burn or whatever. Now

0:37:24.120 --> 0:37:26.640
<v Speaker 4>we have, you know, this ability to do things and

0:37:26.760 --> 0:37:30.000
<v Speaker 4>understand things and do intellectual work. And I think we

0:37:30.320 --> 0:37:34.360
<v Speaker 4>can get to a world where automating things is just seamless.

0:37:34.800 --> 0:37:40.080
<v Speaker 4>We're surrounded by capability to augment ourselves to get things done.

0:37:40.719 --> 0:37:43.520
<v Speaker 4>And you could think of that in terms of like, oh,

0:37:43.520 --> 0:37:45.680
<v Speaker 4>that's going to displace our jobs, because eventually the AI

0:37:45.760 --> 0:37:47.799
<v Speaker 4>system is going to do everything we can do. But

0:37:48.200 --> 0:37:50.359
<v Speaker 4>you could also think of it in terms of like, wow,

0:37:50.400 --> 0:37:52.719
<v Speaker 4>that's just so much abundance that we now have, and

0:37:52.760 --> 0:37:56.000
<v Speaker 4>really how we use that abundance is sort of up

0:37:56.040 --> 0:37:58.640
<v Speaker 4>to us. You know, like when you can writing software

0:37:58.680 --> 0:38:01.040
<v Speaker 4>is super easy and fast and anybody can do it.

0:38:01.480 --> 0:38:03.239
<v Speaker 4>Just think about all the things you can do now,

0:38:03.880 --> 0:38:05.880
<v Speaker 4>Think about all the new activities and go out of

0:38:05.880 --> 0:38:08.280
<v Speaker 4>all the ways we could use that to enrich our lives.

0:38:08.600 --> 0:38:11.640
<v Speaker 4>That's where I'd like to see us in twenty years.

0:38:11.680 --> 0:38:14.239
<v Speaker 4>You know we can. We can do just so much

0:38:14.400 --> 0:38:17.680
<v Speaker 4>more than we were able to do before abundance.

0:38:18.480 --> 0:38:21.279
<v Speaker 2>Great, thank you so much for your time.

0:38:22.040 --> 0:38:24.040
<v Speaker 4>Yeah, it's been a pleasure. Thanks for inviting me.

0:38:25.560 --> 0:38:29.640
<v Speaker 3>What a far ranging, deep conversation. I'm mesmerized by the

0:38:29.719 --> 0:38:33.600
<v Speaker 3>vision David just described. A world where natural conversation between

0:38:33.640 --> 0:38:38.240
<v Speaker 3>mankind and machine can generate creative solutions to our most

0:38:38.280 --> 0:38:42.040
<v Speaker 3>complex problems. A world where we view AI not as

0:38:42.160 --> 0:38:46.160
<v Speaker 3>our replacements, but as a powerful resource we can tap

0:38:46.200 --> 0:38:51.719
<v Speaker 3>into and exponentially boost our innovation and productivity. Thanks so

0:38:51.800 --> 0:38:55.200
<v Speaker 3>much to doctor David Cox for joining us on smart Talks.

0:38:55.600 --> 0:38:59.319
<v Speaker 3>We deeply appreciate him sharing his huge breadth of AI

0:38:59.400 --> 0:39:03.440
<v Speaker 3>knowledge with us and for explaining the transformative potential of

0:39:03.520 --> 0:39:06.840
<v Speaker 3>foundation models in a way that even I can understand.

0:39:07.480 --> 0:39:11.920
<v Speaker 3>We eagerly await his next great breakthrough. Smart Talks with

0:39:12.000 --> 0:39:16.360
<v Speaker 3>IBM is produced by Matt Romano, David jaw nishe Venkat

0:39:16.520 --> 0:39:20.960
<v Speaker 3>and Royston Preserve with Jacob Goldstein. We're edited by Lydia

0:39:21.040 --> 0:39:25.319
<v Speaker 3>Jeane Kott. Our engineers are Jason Gambrel, Sarah Buguier and

0:39:25.440 --> 0:39:31.040
<v Speaker 3>Ben Holliday. Theme song by Gramoscope. Special thanks to Carli Megliori,

0:39:31.440 --> 0:39:35.560
<v Speaker 3>Andy Kelly, Kathy Callahan and the Eight Bar and IBM teams,

0:39:36.000 --> 0:39:39.560
<v Speaker 3>as well as the Pushkin marketing team. Smart Talks with

0:39:39.640 --> 0:39:43.840
<v Speaker 3>IBM is a production of Pushkin Industries and iHeartMedia. To

0:39:43.920 --> 0:39:48.880
<v Speaker 3>find more Pushkin podcasts, listen on the iHeartRadio app, Apple Podcasts,

0:39:48.960 --> 0:39:53.520
<v Speaker 3>or wherever you listen to podcasts. Him Malcolm Gladwell. This

0:39:53.800 --> 0:40:09.000
<v Speaker 3>is a paid advertisement from IBM.