WEBVTT - Smart Talks with IBM - Transformations in Al: why foundation models are the future

0:00:04.480 --> 0:00:10.440
<v Speaker 1>Welcome to Tech Stuff, a production from iHeartRadio.

0:00:12.200 --> 0:00:13.840
<v Speaker 2>Today, we are witnessed.

0:00:13.360 --> 0:00:16.600
<v Speaker 1>To one of those rare moments in history, the rise

0:00:16.720 --> 0:00:20.480
<v Speaker 1>of an innovative technology with the potential to radically transform

0:00:20.560 --> 0:00:26.280
<v Speaker 1>business in society forever. That technology, of course, is artificial intelligence,

0:00:26.480 --> 0:00:29.240
<v Speaker 1>and it's the central focus for this new season of

0:00:29.280 --> 0:00:33.360
<v Speaker 1>Smart Talks with IBM. Join hosts from your favorite Pushkin

0:00:33.440 --> 0:00:37.000
<v Speaker 1>podcasts as they talk with industry experts and leaders to

0:00:37.080 --> 0:00:40.839
<v Speaker 1>explore how businesses can integrate AI into their workflows and

0:00:41.000 --> 0:00:44.239
<v Speaker 1>help drive real change in this new era of AI.

0:00:44.680 --> 0:00:47.320
<v Speaker 1>And of course, host Malcolm Gladwell will be there to

0:00:47.360 --> 0:00:49.960
<v Speaker 1>guide you through the season and throw in his two

0:00:49.960 --> 0:00:53.040
<v Speaker 1>cents as well. Look out for new episodes of Smart

0:00:53.040 --> 0:00:56.240
<v Speaker 1>Talks with IBM every other week on the iHeartRadio app,

0:00:56.440 --> 0:00:59.920
<v Speaker 1>Apple Podcasts, or wherever you get your podcasts, and learn

0:01:00.000 --> 0:01:03.800
<v Speaker 1>more at IBM dot com slash smart Talks.

0:01:05.280 --> 0:01:09.280
<v Speaker 3>Hey, it's Jacob Goldstein for Smart Talks with IBM. Last year,

0:01:09.360 --> 0:01:12.399
<v Speaker 3>I had the pleasure of sitting down with doctor David Cox,

0:01:12.720 --> 0:01:17.120
<v Speaker 3>VP of AI Models at IBM Research. We explored the

0:01:17.160 --> 0:01:21.560
<v Speaker 3>fascinating world of AI foundation models and their revolutionary potential

0:01:21.640 --> 0:01:26.199
<v Speaker 3>for business automation and innovation. When we first aired this episode,

0:01:26.319 --> 0:01:29.440
<v Speaker 3>the concept of foundation models was just beginning to capture

0:01:29.480 --> 0:01:34.000
<v Speaker 3>our attention. Since then, this technology has evolved and redefined

0:01:34.000 --> 0:01:37.640
<v Speaker 3>the boundaries of what's possible. Businesses are becoming more savvy

0:01:37.680 --> 0:01:40.560
<v Speaker 3>about selecting the right models and understanding how they can

0:01:40.640 --> 0:01:44.640
<v Speaker 3>drive revenue and efficiency. As I listened back to the conversation,

0:01:44.880 --> 0:01:47.600
<v Speaker 3>it was interesting to reflect on some new developments and

0:01:47.680 --> 0:01:51.200
<v Speaker 3>ideas that have emerged, and many of these we will

0:01:51.240 --> 0:01:54.640
<v Speaker 3>continue to explore throughout the season, like how to play

0:01:54.680 --> 0:01:57.480
<v Speaker 3>an active role in choosing the best model for your needs.

0:01:58.240 --> 0:02:00.480
<v Speaker 3>Whether you're a longtime listener or two in for the

0:02:00.520 --> 0:02:03.760
<v Speaker 3>first time, I'm certain you'll find doctor Cox's insights as

0:02:03.800 --> 0:02:07.120
<v Speaker 3>thought provoking as ever. Thanks as always for joining us.

0:02:07.480 --> 0:02:08.880
<v Speaker 3>Now let's dive in.

0:02:10.600 --> 0:02:14.360
<v Speaker 4>Hello, Hello, Welcome to Smart Talks with IBM, a podcast

0:02:14.400 --> 0:02:19.880
<v Speaker 4>from Pushkin Industries, iHeartRadio and IBM. I'm Malcolm Glabwell. Our

0:02:19.919 --> 0:02:24.560
<v Speaker 4>guest today is doctor David Cox, VP of AI Models

0:02:24.600 --> 0:02:29.639
<v Speaker 4>at IBM Research and IBM Director of the MIT IBM

0:02:29.680 --> 0:02:33.960
<v Speaker 4>Watson AI Lab, a first of its kind industry academic

0:02:34.080 --> 0:02:39.120
<v Speaker 4>collaboration between IBM and MIT focused on the fundamental research

0:02:39.560 --> 0:02:44.280
<v Speaker 4>of artificial intelligence. Over the course of decades, David Cox

0:02:44.400 --> 0:02:49.040
<v Speaker 4>watched as the AI revolution steadily grew from the simmering

0:02:49.040 --> 0:02:52.919
<v Speaker 4>ideas of a few academics and technologists into the industrial

0:02:53.000 --> 0:02:57.280
<v Speaker 4>boom we are experiencing today. Having dedicated his life to

0:02:57.360 --> 0:03:00.800
<v Speaker 4>pushing the field of AI towards new horizons, David has

0:03:00.840 --> 0:03:04.760
<v Speaker 4>both contributed to and presided over many of the major

0:03:04.880 --> 0:03:10.440
<v Speaker 4>breakthroughs in artificial intelligence. In today's episode, you'll hear David

0:03:10.480 --> 0:03:15.600
<v Speaker 4>explain some of the conceptual underpinnings of the current AI landscape,

0:03:15.720 --> 0:03:20.680
<v Speaker 4>things like foundation models, in surprisingly comprehensible terms. I might add,

0:03:20.919 --> 0:03:24.080
<v Speaker 4>we'll also get into some of the amazing practical applications

0:03:24.080 --> 0:03:27.120
<v Speaker 4>for AI in business, as well as what implications AI

0:03:27.280 --> 0:03:30.799
<v Speaker 4>will have for the future of work and design. David

0:03:30.800 --> 0:03:34.520
<v Speaker 4>spoke with Jacob Goldstein, host of the Pushkin podcast What's

0:03:34.560 --> 0:03:38.880
<v Speaker 4>Your Problem. A veteran business journalist, Jacob has reported for

0:03:38.920 --> 0:03:41.680
<v Speaker 4>The Wall Street Journal, the Miami Herald, and was a

0:03:41.720 --> 0:03:47.360
<v Speaker 4>longtime host of the NPR program Planet Money. Okay, let's

0:03:47.400 --> 0:03:48.280
<v Speaker 4>get to the interview.

0:03:50.600 --> 0:03:53.800
<v Speaker 5>Tell me about your job at IBM SO. I wear

0:03:53.920 --> 0:03:57.080
<v Speaker 5>two hats at IBM SO one. I'm the IBM Doctor

0:03:57.120 --> 0:04:00.320
<v Speaker 5>of the MIT IBM Watson the Eye Lab. That's a

0:04:00.480 --> 0:04:03.640
<v Speaker 5>joint lab between IBM and MIT where we try and

0:04:03.680 --> 0:04:06.040
<v Speaker 5>invent what's next in AI. It's been running for about

0:04:06.040 --> 0:04:09.120
<v Speaker 5>five years, and then more recently I started as the

0:04:09.160 --> 0:04:12.040
<v Speaker 5>vice president for AI Models, and I'm in charge of

0:04:12.080 --> 0:04:16.960
<v Speaker 5>building IBM's foundation models, you know, building these these big models,

0:04:16.960 --> 0:04:18.800
<v Speaker 5>generative models that allow us to have all kinds of

0:04:18.880 --> 0:04:20.440
<v Speaker 5>new exciting capabilities in AI.

0:04:21.000 --> 0:04:22.880
<v Speaker 3>So so I want to talk to you a lot

0:04:22.960 --> 0:04:26.400
<v Speaker 3>about foundation models, about genitive AI. But before we get

0:04:26.400 --> 0:04:28.520
<v Speaker 3>to that, let's just spend a minute on the on

0:04:28.600 --> 0:04:31.240
<v Speaker 3>the IBM MIT collaboration.

0:04:32.240 --> 0:04:35.039
<v Speaker 2>Where did that partnership start, How did it originate?

0:04:36.200 --> 0:04:39.039
<v Speaker 5>Yeah, so, actually it turns out that MIT and IBM

0:04:39.240 --> 0:04:42.320
<v Speaker 5>have been collaborating for a very long time in the

0:04:42.360 --> 0:04:46.440
<v Speaker 5>area of AI. In fact, the term artificial intelligence was

0:04:46.480 --> 0:04:50.160
<v Speaker 5>coined in a nineteen fifty six workshop that was held

0:04:50.200 --> 0:04:52.360
<v Speaker 5>at Dartmouth. It was actually organized by an IBM or

0:04:52.400 --> 0:04:55.719
<v Speaker 5>Nathaniel Rochester, who led the development of the IBM seven

0:04:55.720 --> 0:04:59.000
<v Speaker 5>and one. So we've really been together in AI since

0:04:59.000 --> 0:05:03.839
<v Speaker 5>the beginning, and as AI kept accelerating more and more

0:05:03.880 --> 0:05:07.480
<v Speaker 5>and more, I think there was a really interesting decision

0:05:07.480 --> 0:05:10.239
<v Speaker 5>to say let's make this a formal partnership. So IBM

0:05:10.279 --> 0:05:12.080
<v Speaker 5>in twenty seventeen and now, so it'd be committing close

0:05:12.120 --> 0:05:14.960
<v Speaker 5>to a quarter billion dollars over ten years to have

0:05:15.040 --> 0:05:18.880
<v Speaker 5>this joint lab with MIT and we we located ourselves

0:05:18.960 --> 0:05:21.200
<v Speaker 5>right on the campus and we've been developing very very

0:05:21.200 --> 0:05:23.520
<v Speaker 5>deep relationships where we can you know, really get to

0:05:23.520 --> 0:05:26.640
<v Speaker 5>know each other, work shoulder to shoulder, conceiving what we

0:05:26.640 --> 0:05:29.279
<v Speaker 5>should work on next, and then executing the projects. And

0:05:29.320 --> 0:05:33.320
<v Speaker 5>it's really you know, very few entities like this exist

0:05:33.800 --> 0:05:36.800
<v Speaker 5>between academia industry. It's been really fun of the last

0:05:36.800 --> 0:05:38.080
<v Speaker 5>five years to be a part of it.

0:05:38.720 --> 0:05:40.160
<v Speaker 3>And what do you think are some of the most

0:05:40.200 --> 0:05:43.760
<v Speaker 3>important outcomes of this collaboration between IBM and MIT.

0:05:45.160 --> 0:05:47.800
<v Speaker 5>Yeah, so we're really kind of the tip of the

0:05:47.839 --> 0:05:52.799
<v Speaker 5>sphere for IBM's the I strategy. So we're really looking,

0:05:53.000 --> 0:05:55.800
<v Speaker 5>you know, what's coming ahead, and you know, in areas

0:05:55.839 --> 0:05:59.760
<v Speaker 5>like foundation models, you know, as the field changes, MiTV

0:06:00.120 --> 0:06:02.480
<v Speaker 5>are interested in working on you know, faculty, students and

0:06:02.520 --> 0:06:04.520
<v Speaker 5>staff are interested in working on what's the latest thing,

0:06:04.600 --> 0:06:07.680
<v Speaker 5>what's the next thing. We at IBM Research are very

0:06:07.720 --> 0:06:09.800
<v Speaker 5>much interested in the same. So we can kind of

0:06:09.800 --> 0:06:12.719
<v Speaker 5>put out feelers, you know, interesting things that we're seeing

0:06:12.760 --> 0:06:15.559
<v Speaker 5>in our research, interesting things we're hearing in the field.

0:06:15.560 --> 0:06:17.960
<v Speaker 5>We can go and chase those opportunities. So when something

0:06:18.000 --> 0:06:21.000
<v Speaker 5>big comes, like the big change that's been happening lately

0:06:21.040 --> 0:06:23.599
<v Speaker 5>with foundation models, we're ready to jump on it. That's

0:06:23.600 --> 0:06:26.600
<v Speaker 5>really the purpose, that's that's the lab functioning the way

0:06:26.600 --> 0:06:29.800
<v Speaker 5>it should. We're also really interested in how do we

0:06:29.839 --> 0:06:32.440
<v Speaker 5>advance you know, the AI that can help with climate

0:06:32.560 --> 0:06:35.840
<v Speaker 5>change or you know, build better materials and all these

0:06:35.920 --> 0:06:38.200
<v Speaker 5>kinds of things that are you know, a broader aperture

0:06:38.279 --> 0:06:40.960
<v Speaker 5>sometimes than than what we might consider just looking at

0:06:40.960 --> 0:06:43.719
<v Speaker 5>the product portfolio of IBM, and that that gives us

0:06:43.760 --> 0:06:45.920
<v Speaker 5>again a breadth where we can see connections that we

0:06:46.000 --> 0:06:48.880
<v Speaker 5>might not have seen otherwise. We can you know, think

0:06:48.880 --> 0:06:51.839
<v Speaker 5>things that help out society and also help out our customers.

0:06:52.600 --> 0:06:57.080
<v Speaker 3>So the last whatever six months, say, there has been

0:06:57.120 --> 0:07:02.800
<v Speaker 3>this wild rise in the public's interest in AI, right

0:07:02.839 --> 0:07:06.280
<v Speaker 3>clearly coming out of these generative AI models that are

0:07:06.320 --> 0:07:10.400
<v Speaker 3>really accessible, you know, certainly chat GPT language models like that,

0:07:10.480 --> 0:07:13.520
<v Speaker 3>as well as models that generate images like mid Journey.

0:07:14.160 --> 0:07:17.640
<v Speaker 3>I mean, can you just sort of briefly talk about

0:07:17.040 --> 0:07:21.080
<v Speaker 3>the breakthroughs in AI that have made this moment feel

0:07:21.160 --> 0:07:24.640
<v Speaker 3>so exciting, so revolutionary for artificial intelligence.

0:07:25.680 --> 0:07:30.440
<v Speaker 5>Yeah, you know, I've been studying AI basically my entire

0:07:30.480 --> 0:07:32.600
<v Speaker 5>adult life. Before I came to IBM, I was a

0:07:32.600 --> 0:07:35.160
<v Speaker 5>professor at Harvard. I've been doing this a long time,

0:07:35.400 --> 0:07:37.680
<v Speaker 5>and I've gotten used to being surprised. It sounds like

0:07:37.680 --> 0:07:40.960
<v Speaker 5>a joke, but it's serious, like getting used to being

0:07:41.000 --> 0:07:43.559
<v Speaker 5>surprised at the acceleration of the pace.

0:07:44.360 --> 0:07:44.600
<v Speaker 2>Again.

0:07:44.640 --> 0:07:47.400
<v Speaker 5>It tracks actually a long way back. You know, there's

0:07:47.480 --> 0:07:49.720
<v Speaker 5>lots of things where there was an idea that just

0:07:49.840 --> 0:07:53.800
<v Speaker 5>simmered for a really long time. Some of the key

0:07:54.000 --> 0:07:58.400
<v Speaker 5>math behind the stuff that we have today, which is amazing.

0:07:59.120 --> 0:08:01.960
<v Speaker 5>There's an algorithm call back propagation, which is sort of

0:08:02.080 --> 0:08:04.720
<v Speaker 5>key to training neural networks that's been around, you know,

0:08:04.760 --> 0:08:08.679
<v Speaker 5>since the eighties in wide use. And really what happened

0:08:08.800 --> 0:08:12.480
<v Speaker 5>was it simmered for a long time and then enough

0:08:12.640 --> 0:08:16.400
<v Speaker 5>data and enough compute came so we had enough data

0:08:16.440 --> 0:08:20.320
<v Speaker 5>because you know, we all started carrying multiple cameras around

0:08:20.360 --> 0:08:22.760
<v Speaker 5>with us. Our mobile phones have all you know, all

0:08:22.800 --> 0:08:25.600
<v Speaker 5>these cameras and this we put everything on the Internet

0:08:25.680 --> 0:08:27.920
<v Speaker 5>and there's all this data out there. We caught a

0:08:27.960 --> 0:08:30.480
<v Speaker 5>lucky break that there was something called a graphics processing unit,

0:08:30.520 --> 0:08:32.800
<v Speaker 5>which turns out to be really useful for doing these

0:08:32.880 --> 0:08:35.480
<v Speaker 5>kinds of algorithms, maybe even more useful than it is

0:08:35.559 --> 0:08:39.520
<v Speaker 5>for doing graphics. They're greater graphics too, And things just

0:08:39.600 --> 0:08:42.240
<v Speaker 5>kept kind of adding to the snowball. So we had

0:08:42.360 --> 0:08:46.600
<v Speaker 5>deep learning, which is sort of a rebrand of neural

0:08:46.600 --> 0:08:49.079
<v Speaker 5>networks that I mentioned from the eighties, and that was

0:08:49.200 --> 0:08:52.360
<v Speaker 5>enabled again by data because we digitalized the world and

0:08:52.800 --> 0:08:55.480
<v Speaker 5>compute because because we kept building faster and faster and

0:08:55.520 --> 0:08:58.679
<v Speaker 5>more powerful computers, and then that allowed us to make

0:08:58.720 --> 0:09:01.840
<v Speaker 5>this this big breakthrough. And then you know, more recently,

0:09:02.320 --> 0:09:06.600
<v Speaker 5>using the same building blocks, that inexorable rise of more

0:09:06.640 --> 0:09:10.280
<v Speaker 5>and more and more data that the technology called self

0:09:10.320 --> 0:09:16.240
<v Speaker 5>supervised learning. Where the key difference there in traditional deep learning,

0:09:16.320 --> 0:09:18.800
<v Speaker 5>you know, for classifying images, you know, like is this

0:09:18.880 --> 0:09:20.640
<v Speaker 5>a cat or is this a dog? And a picture

0:09:21.080 --> 0:09:26.040
<v Speaker 5>those technologies require supervision, so you have to take what

0:09:26.160 --> 0:09:27.560
<v Speaker 5>you have and then you have to label it. So

0:09:27.600 --> 0:09:28.920
<v Speaker 5>you have to take a picture of a cat and

0:09:28.920 --> 0:09:31.480
<v Speaker 5>then you label it as a cat, and it turns

0:09:31.520 --> 0:09:33.920
<v Speaker 5>out that you know, that's very powerful, but it takes

0:09:33.920 --> 0:09:37.000
<v Speaker 5>a lot of time to label gats and to label dogs,

0:09:37.000 --> 0:09:39.400
<v Speaker 5>and there's only so many labels that exist in the world.

0:09:39.840 --> 0:09:43.400
<v Speaker 5>So what really changed more recently is that we have

0:09:43.480 --> 0:09:45.960
<v Speaker 5>self supervised learning where you don't have to have the labels.

0:09:45.960 --> 0:09:48.480
<v Speaker 5>We can just take unannotated data. And what that does

0:09:48.559 --> 0:09:51.640
<v Speaker 5>is allows you to use even more data. And that's

0:09:51.640 --> 0:09:55.240
<v Speaker 5>really what drove this this latest sort of rage. And

0:09:55.280 --> 0:09:57.400
<v Speaker 5>then and then all of a sudden we start getting

0:09:57.440 --> 0:10:01.360
<v Speaker 5>these these really powerful models. And then really this has

0:10:01.400 --> 0:10:06.160
<v Speaker 5>been simmering technologies, right, this has been happening for a

0:10:06.240 --> 0:10:10.400
<v Speaker 5>while and progressively getting more and more powerful. One of

0:10:10.440 --> 0:10:14.720
<v Speaker 5>the things that really happened with CHATGBT and technologies like

0:10:15.160 --> 0:10:18.600
<v Speaker 5>stable Diffusion and mid Journey was that they made it

0:10:18.800 --> 0:10:21.480
<v Speaker 5>visible to the public. You know, you put it out

0:10:21.480 --> 0:10:23.760
<v Speaker 5>there the public can touch and feel and they're like, wow,

0:10:24.040 --> 0:10:27.640
<v Speaker 5>not only is there palpable change, and wow this you know,

0:10:27.679 --> 0:10:29.199
<v Speaker 5>I can talk to this thing. Wow, this thing can

0:10:29.240 --> 0:10:32.120
<v Speaker 5>generate an image. Not only that, but everyone can touch

0:10:32.120 --> 0:10:36.400
<v Speaker 5>and feel and try. My kids can use some of

0:10:36.440 --> 0:10:41.880
<v Speaker 5>these AI art generation technologies. And that's really just launched,

0:10:41.920 --> 0:10:45.160
<v Speaker 5>you know, it's like a propelled slingshot at us into

0:10:45.520 --> 0:10:47.520
<v Speaker 5>a different regime in terms of the public awareness of

0:10:47.559 --> 0:10:48.920
<v Speaker 5>these technologies.

0:10:49.040 --> 0:10:52.200
<v Speaker 3>You mentioned earlier in the conversation foundation models, and I

0:10:52.240 --> 0:10:54.040
<v Speaker 3>want to talk a little bit about that. I mean,

0:10:54.080 --> 0:10:57.520
<v Speaker 3>can you just tell me, you know, what are foundation

0:10:57.720 --> 0:11:00.480
<v Speaker 3>models for AI and why are they big deal?

0:11:01.679 --> 0:11:05.360
<v Speaker 5>Yeah, So this term foundation model was coined by a

0:11:05.400 --> 0:11:09.080
<v Speaker 5>group at Stanford, and I think it's actually a really

0:11:09.200 --> 0:11:12.679
<v Speaker 5>apt term because remember I said, you know, one of

0:11:12.720 --> 0:11:16.080
<v Speaker 5>the big things that unlocked this latest excitement was the

0:11:16.080 --> 0:11:19.680
<v Speaker 5>fact that we could use large amounts of unannotated data.

0:11:20.000 --> 0:11:21.480
<v Speaker 5>Could we could train a model. We don't have to

0:11:21.520 --> 0:11:25.120
<v Speaker 5>go through the painful effort of labeling each and every example.

0:11:25.720 --> 0:11:27.920
<v Speaker 5>You still need to have your model do something you

0:11:27.960 --> 0:11:30.120
<v Speaker 5>wanted to do. You still need to tell it what

0:11:30.160 --> 0:11:31.760
<v Speaker 5>you want to do. You can't just have a model

0:11:31.760 --> 0:11:33.720
<v Speaker 5>that doesn't, you know, have any purpose.

0:11:34.000 --> 0:11:34.400
<v Speaker 2>But what a.

0:11:34.320 --> 0:11:38.160
<v Speaker 5>Foundation model is that provides a foundation, like a literal foundation.

0:11:38.440 --> 0:11:40.520
<v Speaker 5>You can sort of stand on the shoulders of giants.

0:11:40.520 --> 0:11:43.040
<v Speaker 5>You can have one of these massively trained models and

0:11:43.080 --> 0:11:45.160
<v Speaker 5>then do a little bit on top. You know, you

0:11:45.160 --> 0:11:47.440
<v Speaker 5>could use just a few examples of what you're looking

0:11:47.480 --> 0:11:50.640
<v Speaker 5>for and you can get what you want from the model.

0:11:51.200 --> 0:11:53.199
<v Speaker 5>So just a little bit on top now gets to

0:11:53.360 --> 0:11:55.440
<v Speaker 5>the results that a huge amount of effort used to

0:11:55.440 --> 0:11:57.360
<v Speaker 5>have to put in, you know, to get from the

0:11:57.440 --> 0:11:59.520
<v Speaker 5>ground up to that level.

0:12:00.200 --> 0:12:04.160
<v Speaker 3>Trying to think of of an analogy for sort of

0:12:04.440 --> 0:12:07.240
<v Speaker 3>foundation models versus what came before, and I don't know

0:12:07.280 --> 0:12:09.679
<v Speaker 3>that I came up with a good one, but the

0:12:09.720 --> 0:12:11.320
<v Speaker 3>best I could do was this. I want you to

0:12:11.320 --> 0:12:15.240
<v Speaker 3>tell me if it's plausible. It's like before foundation models,

0:12:15.760 --> 0:12:18.400
<v Speaker 3>it was like you had these sort of single use

0:12:18.640 --> 0:12:21.160
<v Speaker 3>kitchen appliances. You could make a waffle iron if you

0:12:21.200 --> 0:12:23.199
<v Speaker 3>wanted waffles, or you could make a.

0:12:23.160 --> 0:12:24.840
<v Speaker 2>Toaster if you wanted to make toast.

0:12:25.160 --> 0:12:27.960
<v Speaker 3>But a foundation model is like like an oven with

0:12:28.040 --> 0:12:29.960
<v Speaker 3>a range on top. So it's like this machine and

0:12:30.000 --> 0:12:32.640
<v Speaker 3>you could just cook anything with this machine.

0:12:33.280 --> 0:12:37.760
<v Speaker 5>Yeah, that's a great analogy. They're very versatile. The other

0:12:37.880 --> 0:12:40.440
<v Speaker 5>piece of it, too, is that they dramatically lower the

0:12:40.520 --> 0:12:43.679
<v Speaker 5>effort that it takes to do something that you want

0:12:43.720 --> 0:12:46.760
<v Speaker 5>to do. And someone I used to say about the

0:12:46.800 --> 0:12:48.720
<v Speaker 5>old world of AI would say, you know, the problem

0:12:48.760 --> 0:12:52.200
<v Speaker 5>with automation is that it's too labor intensive. H It

0:12:52.240 --> 0:12:53.560
<v Speaker 5>sounds like I'm making a joke.

0:12:53.760 --> 0:12:58.320
<v Speaker 3>Indeed, famously, if automation does one thing, it substitutes machines

0:12:58.440 --> 0:13:01.679
<v Speaker 3>or computing power for labor. Right, So what does that

0:13:01.720 --> 0:13:06.040
<v Speaker 3>mean to say AI is or automation is too labor intensive.

0:13:06.520 --> 0:13:08.480
<v Speaker 5>It sounds like I'm making a joke, but I'm actually serious,

0:13:08.520 --> 0:13:11.240
<v Speaker 5>And what I mean is that the effort it took

0:13:11.840 --> 0:13:15.600
<v Speaker 5>the old regime to automate something was very, very high.

0:13:15.720 --> 0:13:18.920
<v Speaker 5>So if I need to go and curate all this data,

0:13:18.960 --> 0:13:22.199
<v Speaker 5>collect all this data, and then carefully label all these examples,

0:13:22.600 --> 0:13:26.559
<v Speaker 5>that labeling itself might be incredibly expensive and time. So

0:13:26.880 --> 0:13:29.520
<v Speaker 5>and we estimate anywhere between eighty to ninety percent of

0:13:29.559 --> 0:13:32.440
<v Speaker 5>the effort it takes to feel an AI solution actually

0:13:32.520 --> 0:13:36.079
<v Speaker 5>is just spent on data, so that that has some consequences,

0:13:36.400 --> 0:13:41.720
<v Speaker 5>which is the threshold for bothering. You know, if you're

0:13:41.760 --> 0:13:43.920
<v Speaker 5>going to only get a little bit of value back

0:13:44.200 --> 0:13:46.400
<v Speaker 5>from something, are you going to go through this huge

0:13:46.440 --> 0:13:49.960
<v Speaker 5>effort to curate all this data and then when it

0:13:49.960 --> 0:13:52.320
<v Speaker 5>comes time to train the model, you need highly skilled

0:13:52.400 --> 0:13:56.439
<v Speaker 5>people expensive or hard to find in the labor market.

0:13:56.600 --> 0:13:58.120
<v Speaker 5>You know, are you really going to do something that's

0:13:58.160 --> 0:14:00.000
<v Speaker 5>just a tiny little incremental thing. Now you're going to

0:14:00.080 --> 0:14:03.240
<v Speaker 5>do the only the highest value things that warn't right

0:14:03.920 --> 0:14:05.000
<v Speaker 5>level because you.

0:14:04.960 --> 0:14:08.559
<v Speaker 3>Have to essentially build the whole machine from scratch, and

0:14:08.960 --> 0:14:11.600
<v Speaker 3>there aren't many things where it's worth that much work

0:14:11.640 --> 0:14:13.760
<v Speaker 3>to build a machine that's only going to do one

0:14:13.880 --> 0:14:14.720
<v Speaker 3>narrow thing.

0:14:15.200 --> 0:14:18.120
<v Speaker 5>That's right, and then you tackle the next problem and

0:14:18.200 --> 0:14:20.560
<v Speaker 5>you basically have to start over. And you know, there

0:14:20.560 --> 0:14:23.360
<v Speaker 5>are some nuances here, like for images, you can pre

0:14:23.440 --> 0:14:25.880
<v Speaker 5>train a model on some other tasks and change it around.

0:14:25.960 --> 0:14:28.920
<v Speaker 5>So there are some examples of this, like non recurring

0:14:29.040 --> 0:14:31.600
<v Speaker 5>cost that we have in the old world too, But

0:14:31.640 --> 0:14:34.160
<v Speaker 5>by and large, it's just a lot of effort. It's hard,

0:14:34.480 --> 0:14:38.760
<v Speaker 5>it takes, you know, a large level of skill to implement.

0:14:39.520 --> 0:14:42.320
<v Speaker 5>One analogy that I like is, you know, think about

0:14:42.360 --> 0:14:44.480
<v Speaker 5>it as you know, you have a river of data,

0:14:44.840 --> 0:14:48.160
<v Speaker 5>you know, running through your company or your institution. Traditional

0:14:48.240 --> 0:14:50.720
<v Speaker 5>AI solutions are kind of like building a dam on

0:14:50.760 --> 0:14:54.240
<v Speaker 5>that river. You know, dams are very expensive things to build.

0:14:54.560 --> 0:14:58.800
<v Speaker 5>They require highly specialized skills and lots of planning. And

0:14:59.000 --> 0:15:00.680
<v Speaker 5>you know, you're only going to put a dam on

0:15:01.120 --> 0:15:03.640
<v Speaker 5>a river that's big enough that you're going to get

0:15:03.680 --> 0:15:05.800
<v Speaker 5>enough energy out of it that it was worth your trouble.

0:15:06.200 --> 0:15:07.720
<v Speaker 5>You're gonna get a lot of value out of that dam.

0:15:07.800 --> 0:15:09.400
<v Speaker 5>If you have a river like that, you know, a

0:15:09.520 --> 0:15:13.080
<v Speaker 5>river of data, but it's actually the vast majority of

0:15:13.280 --> 0:15:15.640
<v Speaker 5>the water you know in your kingdom actually isn't in

0:15:15.680 --> 0:15:19.720
<v Speaker 5>that river. It's in puddles and greeks and ballet bricks.

0:15:19.800 --> 0:15:23.240
<v Speaker 5>And you know, there's a lot of value left on

0:15:23.280 --> 0:15:25.840
<v Speaker 5>the table because it's like, well, I can't there's nothing

0:15:25.840 --> 0:15:27.640
<v Speaker 5>you can do about it. It's just that that's too

0:15:28.640 --> 0:15:31.760
<v Speaker 5>low value. So it takes too much effort, so I'm

0:15:31.760 --> 0:15:33.320
<v Speaker 5>just not going to do it. The return on investment

0:15:33.720 --> 0:15:36.280
<v Speaker 5>just isn't there, so you just end up not automating

0:15:36.320 --> 0:15:39.120
<v Speaker 5>things because it's too much of a pain. Now what

0:15:39.160 --> 0:15:41.600
<v Speaker 5>foundation models do is they say, well, actually, no, we

0:15:41.640 --> 0:15:44.800
<v Speaker 5>can train a base model, a foundation that you can

0:15:44.840 --> 0:15:46.560
<v Speaker 5>work on the don't We don't care. We have to

0:15:46.560 --> 0:15:48.400
<v Speaker 5>specify what the task is ahead of time. We just

0:15:48.400 --> 0:15:51.560
<v Speaker 5>need to learn about the domain of data. So if

0:15:51.560 --> 0:15:54.440
<v Speaker 5>we want to build something that can understand English language,

0:15:54.760 --> 0:15:58.080
<v Speaker 5>there's a ton of English language text available out in

0:15:58.120 --> 0:16:02.440
<v Speaker 5>the world. We can now train on huge quantities of it,

0:16:02.880 --> 0:16:06.680
<v Speaker 5>and then it learned the structure, learned how language you know,

0:16:06.800 --> 0:16:09.680
<v Speaker 5>good part of how language works on all that unlabeled data,

0:16:09.760 --> 0:16:11.880
<v Speaker 5>and then when you roll up with your task, you

0:16:11.880 --> 0:16:15.440
<v Speaker 5>know I want to solve this particular problem. You don't

0:16:15.480 --> 0:16:18.080
<v Speaker 5>have to start from scratch. You're starting from a very

0:16:18.200 --> 0:16:20.920
<v Speaker 5>very very high place. So that just gives you the

0:16:20.960 --> 0:16:23.320
<v Speaker 5>ability to just, you know, now, all of a sudden,

0:16:23.360 --> 0:16:26.560
<v Speaker 5>everything is accessible. All the puddles and greeks and babbling

0:16:26.560 --> 0:16:30.720
<v Speaker 5>brooks and kettlepons, you know, those are all accessible now.

0:16:31.240 --> 0:16:33.920
<v Speaker 5>And that's that's very exciting. But it just changes the

0:16:33.920 --> 0:16:36.440
<v Speaker 5>equation on what kinds of problems you could use AI

0:16:36.560 --> 0:16:36.960
<v Speaker 5>to solve.

0:16:37.080 --> 0:16:42.560
<v Speaker 3>And so foundation models basically mean that automating some new

0:16:42.640 --> 0:16:45.920
<v Speaker 3>task is much less labor intensive. The sort of marginal

0:16:45.960 --> 0:16:49.000
<v Speaker 3>effort to do some new automation thing is much lower

0:16:49.000 --> 0:16:52.280
<v Speaker 3>because you're building on top of the foundation model rather

0:16:52.320 --> 0:16:56.720
<v Speaker 3>than starting from scratch. Absolutely, So that is that is

0:16:56.800 --> 0:17:00.520
<v Speaker 3>like the exciting good news. I do feel like there's

0:17:01.200 --> 0:17:03.840
<v Speaker 3>a little bit of a countervailing idea that's worth talking

0:17:03.840 --> 0:17:06.200
<v Speaker 3>about here, and that is the idea that even though

0:17:06.240 --> 0:17:10.280
<v Speaker 3>there are these foundation models that are really powerful, that

0:17:10.320 --> 0:17:13.359
<v Speaker 3>are relatively easy to build on top of, it's still

0:17:13.359 --> 0:17:17.240
<v Speaker 3>the case right that there is not some one size fits.

0:17:16.960 --> 0:17:18.200
<v Speaker 2>All foundation model.

0:17:18.760 --> 0:17:21.320
<v Speaker 3>So you know, what does that mean and why is

0:17:21.359 --> 0:17:22.520
<v Speaker 3>that important to think about?

0:17:22.560 --> 0:17:23.800
<v Speaker 2>In this context.

0:17:24.040 --> 0:17:27.840
<v Speaker 5>Yeah, so we believe very strongly that there isn't just

0:17:27.920 --> 0:17:30.800
<v Speaker 5>one model to rule them all. There's a number of

0:17:30.840 --> 0:17:33.840
<v Speaker 5>reasons why that could be true. One which I think

0:17:33.920 --> 0:17:37.960
<v Speaker 5>is important and very relevant today is how much energy

0:17:38.280 --> 0:17:43.040
<v Speaker 5>these models can consume. So these models, you know, can

0:17:43.080 --> 0:17:48.520
<v Speaker 5>get very, very large. So one thing that we're starting

0:17:48.560 --> 0:17:51.280
<v Speaker 5>to see or starting to believe, is that you probably

0:17:51.280 --> 0:17:56.440
<v Speaker 5>shouldn't use one giant sledgehammer model to solve every single problem,

0:17:56.600 --> 0:17:58.560
<v Speaker 5>you know, like we should pick the right size model

0:17:58.560 --> 0:18:01.359
<v Speaker 5>to solve the problem. We shouldn't necessarily assume that we

0:18:01.440 --> 0:18:06.000
<v Speaker 5>need the biggest, baddest model for every little use case.

0:18:06.440 --> 0:18:08.639
<v Speaker 5>And we're also seeing that, you know, small models that

0:18:08.680 --> 0:18:12.880
<v Speaker 5>are trained like to specialize on particular domains can actually

0:18:12.920 --> 0:18:16.760
<v Speaker 5>outperform much bigger models. So bigger isn't always even better.

0:18:16.840 --> 0:18:19.439
<v Speaker 3>So they're more efficient and they do the thing you

0:18:19.440 --> 0:18:21.080
<v Speaker 3>want them to do better as well.

0:18:21.640 --> 0:18:22.120
<v Speaker 2>That's right.

0:18:22.240 --> 0:18:25.639
<v Speaker 5>So Stanford, for instance, a group of Stanford trained a model.

0:18:26.359 --> 0:18:28.920
<v Speaker 5>It is a two point seven billion parameter model, which

0:18:28.960 --> 0:18:31.800
<v Speaker 5>isn't terribly big by today's standards. They trained it just

0:18:31.880 --> 0:18:33.160
<v Speaker 5>on the biomedical literature.

0:18:33.200 --> 0:18:33.359
<v Speaker 2>You know.

0:18:33.400 --> 0:18:35.760
<v Speaker 5>This is the kind of thing that universities do and

0:18:35.840 --> 0:18:39.119
<v Speaker 5>what they showed was that this model was better at

0:18:39.119 --> 0:18:41.840
<v Speaker 5>answering questions about the biomedical literature than some models that

0:18:41.920 --> 0:18:45.639
<v Speaker 5>are one hundred billion parameters, you know, many times larger.

0:18:46.200 --> 0:18:48.560
<v Speaker 5>So it's a little bit like you know, asking an

0:18:48.560 --> 0:18:52.439
<v Speaker 5>expert for help on something versus asking the smartest person,

0:18:52.480 --> 0:18:55.240
<v Speaker 5>you know, the smartest person you know, maybe very smart,

0:18:55.680 --> 0:18:58.720
<v Speaker 5>but they're not going to be expertise. And then as

0:18:58.760 --> 0:19:00.520
<v Speaker 5>an added bonus, you know, this is now a much

0:19:00.520 --> 0:19:02.960
<v Speaker 5>smaller model, it's much more efficient to run, we are

0:19:03.119 --> 0:19:06.639
<v Speaker 5>you know, you know, it's cheaper. So there's lots of

0:19:06.640 --> 0:19:09.359
<v Speaker 5>different advantages there. So I think we're going to see

0:19:09.760 --> 0:19:14.280
<v Speaker 5>attension in the industry between vendors that say, hey, this

0:19:14.359 --> 0:19:16.480
<v Speaker 5>is the one, you know, big model, and then others

0:19:16.480 --> 0:19:18.800
<v Speaker 5>that say, well, actually, you know, there's there's you know,

0:19:19.160 --> 0:19:21.080
<v Speaker 5>lots of different tools we can use that all have

0:19:21.160 --> 0:19:24.119
<v Speaker 5>this nice quality that we outligned at the beginning, and

0:19:24.119 --> 0:19:25.600
<v Speaker 5>then we should really pick the one that makes the

0:19:25.680 --> 0:19:27.360
<v Speaker 5>most sense for the task at hand.

0:19:28.720 --> 0:19:33.080
<v Speaker 3>So there's sustainability basically efficiency. Another kind of set of

0:19:33.119 --> 0:19:37.000
<v Speaker 3>issues that come up a lot with ai A are bias, hallucination.

0:19:37.720 --> 0:19:40.359
<v Speaker 3>Can you talk a little bit about bias and hallucination

0:19:40.440 --> 0:19:43.360
<v Speaker 3>what they are, and how you're working to mitigate those problems.

0:19:43.800 --> 0:19:46.639
<v Speaker 5>Yeah, so there are lots of issues still. As amazing

0:19:46.680 --> 0:19:49.640
<v Speaker 5>as these technologies are, and they are amazing, let's let's

0:19:49.640 --> 0:19:52.119
<v Speaker 5>be very clear, lots of great things we're going to

0:19:52.200 --> 0:19:56.040
<v Speaker 5>enable with these kinds of technologies. Bias isn't a new problem.

0:19:56.400 --> 0:20:01.000
<v Speaker 5>So you know, basically we've seen this since the beginning

0:20:01.000 --> 0:20:03.919
<v Speaker 5>of AI. If you train a model on data that

0:20:04.320 --> 0:20:06.439
<v Speaker 5>has a bias in it, the model is going to

0:20:06.480 --> 0:20:11.080
<v Speaker 5>recapitulate that bias and it provides its answers. So every time,

0:20:11.240 --> 0:20:13.800
<v Speaker 5>you know, if all the text you have says, you know,

0:20:13.840 --> 0:20:16.919
<v Speaker 5>it's more likely to refer to female nurses and male scientists,

0:20:16.960 --> 0:20:19.040
<v Speaker 5>then you're going to you know, get models that you know.

0:20:19.080 --> 0:20:22.160
<v Speaker 5>For instance, there was an example where a machine learning

0:20:22.200 --> 0:20:26.600
<v Speaker 5>based translation system translated from Hungarian to English. Hungarian doesn't

0:20:26.600 --> 0:20:29.919
<v Speaker 5>have gender pronouns, English does, and when you ask them

0:20:29.920 --> 0:20:32.280
<v Speaker 5>to translate, it would translate they are a nurse to

0:20:32.680 --> 0:20:35.560
<v Speaker 5>she is a nurse, would translate they are a scientist

0:20:35.600 --> 0:20:37.800
<v Speaker 5>to he is a scientist. And that's not because the

0:20:38.600 --> 0:20:41.199
<v Speaker 5>people who wrote the algorithm were building in bias and

0:20:41.320 --> 0:20:43.040
<v Speaker 5>coding in like oh, it's got to be this way.

0:20:43.119 --> 0:20:45.359
<v Speaker 5>It's because the data was like that. You know, we

0:20:45.440 --> 0:20:49.719
<v Speaker 5>have biases in our society and they're reflected in our

0:20:49.800 --> 0:20:53.600
<v Speaker 5>data and our text and our images everywhere, and then

0:20:53.640 --> 0:20:56.760
<v Speaker 5>the models they're just mapping from what they've seen in

0:20:56.800 --> 0:20:59.560
<v Speaker 5>their training data to the result that you're trying to

0:20:59.560 --> 0:21:01.960
<v Speaker 5>get them to do and to give, and then these

0:21:01.960 --> 0:21:06.840
<v Speaker 5>biases come out. So there's a very active program of

0:21:06.920 --> 0:21:09.560
<v Speaker 5>research and you know, we we do quite a bit

0:21:09.600 --> 0:21:13.240
<v Speaker 5>at IBM research and i T but also all over

0:21:13.400 --> 0:21:16.000
<v Speaker 5>the community and industry and academia trying to figure out

0:21:16.040 --> 0:21:19.080
<v Speaker 5>how do we explicitly remove these biases, how do we

0:21:19.119 --> 0:21:21.480
<v Speaker 5>identify them, how do you know, how do we build

0:21:21.640 --> 0:21:23.959
<v Speaker 5>tools that allow people to audit their systems to make

0:21:23.960 --> 0:21:26.720
<v Speaker 5>sure they aren't biased. So this is a really important thing.

0:21:26.800 --> 0:21:29.920
<v Speaker 5>And you know, again this was here since the beginning,

0:21:30.560 --> 0:21:34.719
<v Speaker 5>you know, of machine learning and AI, but foundation models

0:21:34.720 --> 0:21:37.960
<v Speaker 5>and large language models and generative AI just bring it

0:21:38.000 --> 0:21:40.720
<v Speaker 5>into sharper even sharper focus because there's just so much

0:21:40.800 --> 0:21:44.119
<v Speaker 5>data and it's sort of building in, baking in all

0:21:44.200 --> 0:21:47.679
<v Speaker 5>these different biases we have, so that that's that's absolutely

0:21:48.200 --> 0:21:51.000
<v Speaker 5>a problem that these models have. Another one that you

0:21:51.040 --> 0:21:54.960
<v Speaker 5>mentioned was hallucinations. So even the most impressive of our

0:21:55.000 --> 0:21:58.920
<v Speaker 5>models will often just make stuff up, you know, the

0:21:59.160 --> 0:22:02.720
<v Speaker 5>technical term that heel has chosen as a hallucination. To

0:22:02.760 --> 0:22:06.119
<v Speaker 5>give you an example, I asked chat tbt to create

0:22:06.160 --> 0:22:09.920
<v Speaker 5>a biography of David Cox IBM, and you know, it

0:22:10.040 --> 0:22:12.560
<v Speaker 5>started off really well. You know, they identified that I

0:22:12.600 --> 0:22:15.000
<v Speaker 5>was the director of the mt IBM Watsonay and some

0:22:15.080 --> 0:22:17.439
<v Speaker 5>a few words about that, and then it proceeded to

0:22:17.480 --> 0:22:22.120
<v Speaker 5>create an authoritative but completely fake biography of me where

0:22:22.160 --> 0:22:25.919
<v Speaker 5>I was British, I was born in the UK. I

0:22:25.960 --> 0:22:28.760
<v Speaker 5>went to British university, you know, universities in the UK.

0:22:28.840 --> 0:22:31.800
<v Speaker 3>I was a professor, the authority, right, it's the certainty

0:22:31.920 --> 0:22:34.960
<v Speaker 3>that that is weird about it, right, It's it's dead

0:22:35.119 --> 0:22:37.399
<v Speaker 3>certain that you're from the UK, et cetera.

0:22:37.960 --> 0:22:41.000
<v Speaker 5>Absolutely, yeah, it has all kinds of flourishes, like I

0:22:41.080 --> 0:22:45.800
<v Speaker 5>want awards in the UK. So yeah, it's it's problematic

0:22:45.840 --> 0:22:48.680
<v Speaker 5>because it kind of pokes a lot of weak spots

0:22:48.720 --> 0:22:53.920
<v Speaker 5>in our human psychology where if something sounds coherent, we're

0:22:54.000 --> 0:22:56.760
<v Speaker 5>likely to assume it's true. We're not used to interacting

0:22:56.800 --> 0:23:01.520
<v Speaker 5>with people who eloquently and authoritatively, you know, emit complete nonsense,

0:23:01.600 --> 0:23:04.320
<v Speaker 5>like yeah, you know, you know, we get debate about that, but.

0:23:04.359 --> 0:23:07.840
<v Speaker 3>Yeah, we can debate about that, but yes, the sort

0:23:07.880 --> 0:23:11.480
<v Speaker 3>of blive confidence throws you off when you realize it's

0:23:11.520 --> 0:23:12.280
<v Speaker 3>completely wrong.

0:23:12.400 --> 0:23:15.160
<v Speaker 5>Right, that's right. And we do have a little bit

0:23:15.160 --> 0:23:18.399
<v Speaker 5>of like a great and powerful oz sort of vibe

0:23:18.400 --> 0:23:20.760
<v Speaker 5>going sometimes where we're like, well, you know, the AI

0:23:20.960 --> 0:23:24.720
<v Speaker 5>is all knowing and therefore whatever it says must be true.

0:23:24.920 --> 0:23:27.000
<v Speaker 5>But but these things will make up stuff, you know,

0:23:27.320 --> 0:23:32.119
<v Speaker 5>very aggressively, and you know, you everyone can try asking

0:23:32.119 --> 0:23:34.919
<v Speaker 5>it for their their bio. You'll you'll get something that

0:23:35.480 --> 0:23:37.879
<v Speaker 5>you always get, something that's of the right form, that

0:23:37.920 --> 0:23:40.119
<v Speaker 5>has the right tone. But you know, the facts just

0:23:40.160 --> 0:23:43.359
<v Speaker 5>aren't necessarily there. So that's obviously a problem. We need

0:23:43.400 --> 0:23:46.080
<v Speaker 5>to figure out how to close those gaps, fix those problems.

0:23:46.760 --> 0:23:49.199
<v Speaker 5>There's lots of ways we can use them much more easily.

0:23:49.720 --> 0:23:52.480
<v Speaker 4>I'd just like to say, faced with the awesome potential

0:23:52.520 --> 0:23:55.560
<v Speaker 4>of what these technologies might do, it's a bit encouraging

0:23:55.600 --> 0:23:59.080
<v Speaker 4>to hear that even chat GPT has a weakness for

0:23:59.240 --> 0:24:04.920
<v Speaker 4>inventing buoyant fictional versions of people's lives, and while entertaining

0:24:04.920 --> 0:24:08.560
<v Speaker 4>ourselves with chat GPT, and mid journey is important. The

0:24:08.600 --> 0:24:13.000
<v Speaker 4>way lay people use consumer facing chatbots and generative AI

0:24:13.480 --> 0:24:17.400
<v Speaker 4>is just fundamentally different from the way an enterprise business

0:24:17.480 --> 0:24:21.119
<v Speaker 4>uses AI. How can we harness the abilities of artificial

0:24:21.119 --> 0:24:24.159
<v Speaker 4>intelligence to help us solve the problems we face in

0:24:24.280 --> 0:24:28.119
<v Speaker 4>business and technology. Let's listen on as David and Jacob

0:24:28.240 --> 0:24:29.600
<v Speaker 4>continue their conversation.

0:24:30.359 --> 0:24:33.320
<v Speaker 3>We've been talking in a somewhat abstract way about AI

0:24:33.440 --> 0:24:35.119
<v Speaker 3>in the ways it can be used.

0:24:35.680 --> 0:24:37.160
<v Speaker 2>Let's talk in a little bit more.

0:24:37.000 --> 0:24:40.760
<v Speaker 3>Of a specific way. Can you just talk about some

0:24:40.920 --> 0:24:45.440
<v Speaker 3>examples of business challenges that can be solved with automation,

0:24:45.560 --> 0:24:47.399
<v Speaker 3>with this kind of automation we're talking about.

0:24:48.280 --> 0:24:51.760
<v Speaker 5>Yeah, so the really really guy's the limit. There's a

0:24:51.760 --> 0:24:55.880
<v Speaker 5>whole set of different applications that these models are really

0:24:55.880 --> 0:24:58.600
<v Speaker 5>good at. And basically it's a superset of everything we

0:24:58.720 --> 0:25:01.560
<v Speaker 5>used to use AI for in business. So, you know,

0:25:02.200 --> 0:25:03.879
<v Speaker 5>the simple kinds of things are like, hey, if I

0:25:03.920 --> 0:25:06.639
<v Speaker 5>have text and i'm you know, I have like product reviews,

0:25:06.960 --> 0:25:08.120
<v Speaker 5>and I want to be able to tell if these

0:25:08.119 --> 0:25:10.240
<v Speaker 5>are positive or negative. You know, like, let's look at

0:25:10.280 --> 0:25:11.920
<v Speaker 5>all the negative reviews so we can have a human

0:25:11.960 --> 0:25:15.199
<v Speaker 5>look through them and see what was up. Very common

0:25:15.560 --> 0:25:18.040
<v Speaker 5>business use case. You can do it with traditional deep

0:25:18.119 --> 0:25:21.560
<v Speaker 5>learning based AI. So so there's things like that that

0:25:21.600 --> 0:25:23.719
<v Speaker 5>are you know, it's very prosaic sort that we were

0:25:23.720 --> 0:25:25.560
<v Speaker 5>already doing it, We've been doing it for a long time.

0:25:26.440 --> 0:25:29.240
<v Speaker 5>Then you get situations that are that were harder for

0:25:29.359 --> 0:25:32.159
<v Speaker 5>the old day. I like, if I'm I want to

0:25:32.560 --> 0:25:35.160
<v Speaker 5>impress something like I want to I have, like say

0:25:35.200 --> 0:25:37.439
<v Speaker 5>I have a chat transcript, Like a customer called in

0:25:38.359 --> 0:25:41.920
<v Speaker 5>and they had a complaint. They call back, Okay, now

0:25:41.960 --> 0:25:44.720
<v Speaker 5>a new you know, a person on the line needs

0:25:44.800 --> 0:25:47.640
<v Speaker 5>to go read the old transcript to catch up. Wouldn't

0:25:47.640 --> 0:25:49.919
<v Speaker 5>it be better if we could just summarize that, just

0:25:49.920 --> 0:25:52.159
<v Speaker 5>condense it all down a quick little paragraph. You know,

0:25:52.240 --> 0:25:54.080
<v Speaker 5>customer called they were up said about this, rather than

0:25:54.119 --> 0:25:56.360
<v Speaker 5>having to read the blow by blow. There's just lots

0:25:56.400 --> 0:25:59.680
<v Speaker 5>of settings like that where summarization is really helpful. Hey,

0:25:59.680 --> 0:26:03.600
<v Speaker 5>you haven't meeting, and I'd like to just automatically, you know,

0:26:03.680 --> 0:26:06.120
<v Speaker 5>have have that meeting or that email or whatever. I'd

0:26:06.119 --> 0:26:07.560
<v Speaker 5>like to just have a condensed down so I can

0:26:07.640 --> 0:26:10.360
<v Speaker 5>really quickly get to the heart of the matter. These

0:26:10.400 --> 0:26:12.880
<v Speaker 5>models are are really good at doing that. They're also

0:26:12.960 --> 0:26:15.600
<v Speaker 5>really good at question answering. So if I want to

0:26:15.640 --> 0:26:17.920
<v Speaker 5>find out what's how many vacation days do I have?

0:26:18.280 --> 0:26:22.639
<v Speaker 5>I can now interact in natural language with a system

0:26:22.720 --> 0:26:25.000
<v Speaker 5>that can go and that it has access to our

0:26:25.119 --> 0:26:27.399
<v Speaker 5>HR policies, and I can actually have a you know,

0:26:27.480 --> 0:26:29.959
<v Speaker 5>a multi turn conversation where I can, you know, like

0:26:30.000 --> 0:26:32.520
<v Speaker 5>I would have with you know, somebody, you know, actual

0:26:33.480 --> 0:26:38.000
<v Speaker 5>HR professional or customer service representative. So a big part,

0:26:38.720 --> 0:26:41.879
<v Speaker 5>you know, what this is doing is it's it's putting

0:26:41.920 --> 0:26:44.280
<v Speaker 5>an interface. You know, when we think of computer interfaces,

0:26:44.280 --> 0:26:47.919
<v Speaker 5>we're usually thinking about UI user interface elements where I

0:26:47.920 --> 0:26:51.440
<v Speaker 5>click on menus and there's buttons and all this stuff. Increasingly,

0:26:51.520 --> 0:26:55.280
<v Speaker 5>now we can just talk, you know, you just in words.

0:26:55.359 --> 0:26:57.160
<v Speaker 5>You can describe what you want, you want to answer

0:26:57.359 --> 0:26:59.960
<v Speaker 5>ask a question you want to sort of command this

0:27:00.000 --> 0:27:02.840
<v Speaker 5>system to do something, rather than having to learn how

0:27:02.840 --> 0:27:04.919
<v Speaker 5>to do that clicking buttons, which might be inefficient. Now

0:27:04.920 --> 0:27:06.520
<v Speaker 5>we can just sort of spell it out.

0:27:07.080 --> 0:27:10.120
<v Speaker 3>Interesting, right, the graphical user interface that we all sort

0:27:10.119 --> 0:27:13.440
<v Speaker 3>of default to, that's not like the state of nature, right,

0:27:13.480 --> 0:27:16.000
<v Speaker 3>That's a thing that was invented and just came to

0:27:16.040 --> 0:27:18.439
<v Speaker 3>be the standard way that we interact with computers. And

0:27:18.480 --> 0:27:22.960
<v Speaker 3>so you could imagine, as you're saying, like chat essentially

0:27:23.119 --> 0:27:26.399
<v Speaker 3>chatting with the machine could could become a sort of

0:27:26.440 --> 0:27:29.720
<v Speaker 3>standard user interface, just like the graphical user interface, did

0:27:29.880 --> 0:27:31.280
<v Speaker 3>you know over the past several decades.

0:27:31.760 --> 0:27:35.160
<v Speaker 5>Absolutely, And I think those kinds of conversational interfaces are

0:27:35.200 --> 0:27:39.400
<v Speaker 5>going to be hugely important for increasing our productivity. It's

0:27:39.440 --> 0:27:41.600
<v Speaker 5>just a lot easier if I have to learn how

0:27:41.640 --> 0:27:43.600
<v Speaker 5>to use a tool or I have to kind of

0:27:43.600 --> 0:27:46.320
<v Speaker 5>have awkward, you know, interactions from the computer. I can

0:27:46.359 --> 0:27:48.000
<v Speaker 5>just tell it what I want and I can understand it.

0:27:48.040 --> 0:27:51.240
<v Speaker 5>Could you know, potentially even ask questions back to clarify

0:27:51.400 --> 0:27:56.280
<v Speaker 5>and have those kinds of conversations that can be extremely powerful.

0:27:56.400 --> 0:27:58.000
<v Speaker 5>And in fact, one area where that's going to I

0:27:58.000 --> 0:28:01.199
<v Speaker 5>think be absolutely game changing is in code. When we

0:28:01.240 --> 0:28:06.280
<v Speaker 5>write code. You know, programming languages are a way for

0:28:06.400 --> 0:28:10.280
<v Speaker 5>us to sort of match between our very sloppy way

0:28:10.280 --> 0:28:13.159
<v Speaker 5>of talking and the very exact way that you need

0:28:13.200 --> 0:28:15.560
<v Speaker 5>to command a computer to do what you wanted to do.

0:28:15.920 --> 0:28:18.640
<v Speaker 5>They're cumbersome to learn, they can, you know, create very

0:28:18.640 --> 0:28:21.800
<v Speaker 5>complex systems that are very hard to reason about. And

0:28:21.840 --> 0:28:24.120
<v Speaker 5>we're already starting to see the ability to just write

0:28:24.119 --> 0:28:26.680
<v Speaker 5>down what you want and AI will generate the code

0:28:26.720 --> 0:28:28.480
<v Speaker 5>for you. And I think we're just going to see

0:28:28.520 --> 0:28:30.960
<v Speaker 5>a huge revolution of like we just converse, you know,

0:28:31.040 --> 0:28:33.120
<v Speaker 5>we can have a conversation to say what we want,

0:28:33.200 --> 0:28:36.480
<v Speaker 5>and then the computer can actually not only do fixed

0:28:36.560 --> 0:28:38.720
<v Speaker 5>actions and do things for us, but it can actually

0:28:38.760 --> 0:28:40.960
<v Speaker 5>even write code to do new things, you know, and

0:28:41.640 --> 0:28:44.719
<v Speaker 5>generate software itself. Given how much software we have, how

0:28:44.760 --> 0:28:47.480
<v Speaker 5>much craving we have for software, like we'll never have

0:28:47.640 --> 0:28:51.040
<v Speaker 5>enough software in our world. Uh, you know, the ability

0:28:51.080 --> 0:28:54.360
<v Speaker 5>to have AI systems as a helper in that, I

0:28:54.360 --> 0:28:56.000
<v Speaker 5>think we're going to see a lot of a lot

0:28:56.040 --> 0:28:56.680
<v Speaker 5>of value there.

0:28:57.880 --> 0:29:00.480
<v Speaker 3>So if you if you think about the different ways

0:29:01.120 --> 0:29:03.360
<v Speaker 3>AI might be applied to business, I mean you've talked

0:29:03.360 --> 0:29:05.680
<v Speaker 3>about a number of the sort of classic use cases.

0:29:06.360 --> 0:29:09.760
<v Speaker 3>What are some of the more out there use cases.

0:29:09.760 --> 0:29:12.640
<v Speaker 3>What are some you know, unique ways you could imagine

0:29:12.680 --> 0:29:14.480
<v Speaker 3>AI being applied to business.

0:29:16.120 --> 0:29:18.840
<v Speaker 5>Yeah, there's really disguised the limit. I mean, we have

0:29:18.920 --> 0:29:21.120
<v Speaker 5>one project that I'm kind of a fan of where

0:29:21.760 --> 0:29:25.240
<v Speaker 5>we actually were working with a mechanical engineering professor at

0:29:25.320 --> 0:29:28.320
<v Speaker 5>MIT working on a classic problem, how do you build

0:29:28.640 --> 0:29:32.080
<v Speaker 5>linkage systems which are like you imagine bars and joints

0:29:32.200 --> 0:29:34.360
<v Speaker 5>and overs you know the things that.

0:29:34.320 --> 0:29:37.400
<v Speaker 3>Are building a thing, building a physical machine of some.

0:29:37.520 --> 0:29:43.200
<v Speaker 5>Kinda like real like metal, and you know nineteenth century

0:29:43.480 --> 0:29:46.560
<v Speaker 5>just old school industrial revolution. Yeah yeah, yeah, but you

0:29:46.560 --> 0:29:49.360
<v Speaker 5>know the little arm that's that's holding up my microphone

0:29:49.360 --> 0:29:51.960
<v Speaker 5>in front of me, Cranes that buld your buildings, you know,

0:29:52.040 --> 0:29:54.560
<v Speaker 5>parts of your engines. This is like classical stuff. It

0:29:54.600 --> 0:29:56.840
<v Speaker 5>turns out that you know, humans, if you want to

0:29:56.880 --> 0:30:00.120
<v Speaker 5>build an advanced system, you decide what like curve you

0:30:00.160 --> 0:30:02.800
<v Speaker 5>want to create, and then a human together with a

0:30:02.800 --> 0:30:06.600
<v Speaker 5>computer program, can build a five or six bar linkage,

0:30:06.680 --> 0:30:08.200
<v Speaker 5>and then that's kind of where you top out it

0:30:08.200 --> 0:30:11.040
<v Speaker 5>because it gets too complicated to work more than that.

0:30:11.720 --> 0:30:14.200
<v Speaker 5>We built a generative AI system that can build twenty

0:30:14.240 --> 0:30:17.560
<v Speaker 5>bar linkages, like arbitrarily complex. So these are machines that

0:30:17.600 --> 0:30:21.960
<v Speaker 5>are beyond the capability of a human to design themselves.

0:30:22.480 --> 0:30:25.440
<v Speaker 5>Another example, we have an AI system that can generate

0:30:25.640 --> 0:30:28.000
<v Speaker 5>electronic circuits. You know, we had a project where we're

0:30:28.000 --> 0:30:31.160
<v Speaker 5>working where we're building better power converters which allow our

0:30:31.920 --> 0:30:35.080
<v Speaker 5>computers and our devices to be more efficient, save energy,

0:30:35.880 --> 0:30:38.560
<v Speaker 5>you know, less less carbon ote. But I think the

0:30:38.600 --> 0:30:41.680
<v Speaker 5>world around us has always been shaped by technology. If

0:30:41.720 --> 0:30:43.720
<v Speaker 5>we look around you know, just think about how many

0:30:43.760 --> 0:30:46.200
<v Speaker 5>steps and how many people and how many designs went

0:30:46.200 --> 0:30:49.960
<v Speaker 5>into the table and the chair and the lamp. It's

0:30:50.000 --> 0:30:53.480
<v Speaker 5>really just astonishing. And that's already you know, the fruit

0:30:53.560 --> 0:30:56.800
<v Speaker 5>of automation and computers and those kinds of tools. But

0:30:56.800 --> 0:31:00.280
<v Speaker 5>we're going to see that increasingly be product also of AI.

0:31:00.360 --> 0:31:02.360
<v Speaker 5>It's just going to be everywhere around us. Everything we

0:31:02.520 --> 0:31:05.160
<v Speaker 5>touch is going to have been you know, helped in

0:31:05.200 --> 0:31:07.360
<v Speaker 5>some way to get to you by.

0:31:08.320 --> 0:31:10.880
<v Speaker 3>You know, that is a pretty profound transformation that you're

0:31:10.920 --> 0:31:13.600
<v Speaker 3>talking about in business. How do you think about the

0:31:13.640 --> 0:31:16.760
<v Speaker 3>implications of that, both for the sort of you know,

0:31:17.040 --> 0:31:20.160
<v Speaker 3>business itself and also for employees.

0:31:21.920 --> 0:31:24.880
<v Speaker 5>Yeah, so I think for businesses this is going to

0:31:25.280 --> 0:31:29.160
<v Speaker 5>cut costs, make new opportunities to like customers, you know,

0:31:29.240 --> 0:31:32.600
<v Speaker 5>like there's just you know, it's sort of all upside

0:31:32.640 --> 0:31:35.360
<v Speaker 5>right like for the for the workers, I think the

0:31:35.400 --> 0:31:38.440
<v Speaker 5>story is mostly good too. You know, like how many

0:31:38.480 --> 0:31:41.760
<v Speaker 5>things do you do in your day that you'd really

0:31:41.960 --> 0:31:44.600
<v Speaker 5>rather not right? You know, and we're used to having

0:31:44.600 --> 0:31:47.680
<v Speaker 5>things we don't like automated away, you know, we we

0:31:48.040 --> 0:31:50.520
<v Speaker 5>didn't you know, if you didn't like walking many miles

0:31:50.560 --> 0:31:52.200
<v Speaker 5>to work, then you know, like you can have a

0:31:52.240 --> 0:31:54.320
<v Speaker 5>car and you can drive there, or we used to

0:31:54.400 --> 0:31:57.200
<v Speaker 5>have a huge traction over ninety percent of the US

0:31:57.280 --> 0:32:01.000
<v Speaker 5>population engaged in agriculture. And then we mechanize how very

0:32:01.000 --> 0:32:03.040
<v Speaker 5>few people work in agriculture, a small number of people

0:32:03.040 --> 0:32:04.800
<v Speaker 5>can do the work of a large number of people.

0:32:05.440 --> 0:32:08.040
<v Speaker 5>And then you know, things like email, and yeah, they've

0:32:08.120 --> 0:32:10.760
<v Speaker 5>led to huge productivity enhancements because I don't need to

0:32:10.760 --> 0:32:13.120
<v Speaker 5>be writing letters and sending them in the mail. I

0:32:13.120 --> 0:32:17.560
<v Speaker 5>can just instantly communicate with people. We just become more effective,

0:32:17.720 --> 0:32:21.760
<v Speaker 5>Like our jobs have transformed, whether it's a physical job

0:32:21.840 --> 0:32:24.720
<v Speaker 5>like agriculture, or whether it's a knowledge worker job where

0:32:24.760 --> 0:32:28.480
<v Speaker 5>you're sending emails and communicating with people and coordinating teams.

0:32:28.760 --> 0:32:31.440
<v Speaker 5>We've just gotten better. And you know, the technology has

0:32:31.440 --> 0:32:34.920
<v Speaker 5>just made us more productive. And this is just another example. Now,

0:32:35.240 --> 0:32:37.360
<v Speaker 5>you know, there are people who worry that you know,

0:32:38.000 --> 0:32:40.440
<v Speaker 5>will be so good at that that maybe jobs will

0:32:40.480 --> 0:32:44.320
<v Speaker 5>be displaced, and that's a legitimate concern. But just like

0:32:45.720 --> 0:32:47.880
<v Speaker 5>how in agriculture, you know, it's not like suddenly we

0:32:47.920 --> 0:32:51.000
<v Speaker 5>had ninety percent of the population unemployed. You know, people

0:32:51.040 --> 0:32:55.280
<v Speaker 5>transitioned to other jobs. And the other thing that we've

0:32:55.280 --> 0:32:59.280
<v Speaker 5>found too is that our appetite for doing more things

0:32:59.840 --> 0:33:03.240
<v Speaker 5>is as humans is sort of insatiable. So even if

0:33:03.640 --> 0:33:06.360
<v Speaker 5>we can dramatically increase how much, you know, one human

0:33:06.440 --> 0:33:09.160
<v Speaker 5>can do, that doesn't necessarily mean you're going to do

0:33:09.160 --> 0:33:11.480
<v Speaker 5>a fixed amount of stuff. There's an appetite to have

0:33:11.560 --> 0:33:13.360
<v Speaker 5>even more. So we're going to you can continue to

0:33:13.360 --> 0:33:16.440
<v Speaker 5>grow grow the pie. So I think at least certainly

0:33:16.440 --> 0:33:18.080
<v Speaker 5>in the near term, you know, we're going to see

0:33:18.080 --> 0:33:19.960
<v Speaker 5>a lot of drudgery go away from work. We're going

0:33:20.000 --> 0:33:23.320
<v Speaker 5>to see people be able to be more effective at

0:33:23.320 --> 0:33:26.480
<v Speaker 5>their jobs. You know, we will see some transformation in

0:33:27.120 --> 0:33:32.200
<v Speaker 5>jobs and like. But we've seen that before, and the

0:33:32.240 --> 0:33:34.520
<v Speaker 5>technology a least has the potential to make our lives

0:33:34.520 --> 0:33:35.200
<v Speaker 5>a lot easier.

0:33:36.440 --> 0:33:41.400
<v Speaker 3>So IBM recently launched Watson X, which includes Watson x

0:33:41.520 --> 0:33:44.440
<v Speaker 3>dot AI. Tell me about that, Tell me about you

0:33:44.480 --> 0:33:46.520
<v Speaker 3>know what it is and the new possibilities that it

0:33:46.600 --> 0:33:47.160
<v Speaker 3>opens up.

0:33:48.040 --> 0:33:48.280
<v Speaker 2>Yeah.

0:33:48.360 --> 0:33:52.360
<v Speaker 5>So, so Watson X is obviously a bit of a

0:33:52.640 --> 0:33:56.640
<v Speaker 5>new branding on the Watson brand. You know TJ. Watson

0:33:56.680 --> 0:34:00.680
<v Speaker 5>that was the founder of IBM and our eechnologies of

0:34:01.080 --> 0:34:05.680
<v Speaker 5>the Watson brand. Watson X is a recognition that there's

0:34:05.720 --> 0:34:08.359
<v Speaker 5>something new, there's something that actually has changed the game.

0:34:09.000 --> 0:34:12.600
<v Speaker 5>We've gone from this old world of automation is to

0:34:12.760 --> 0:34:16.319
<v Speaker 5>labor intensive to this new world of possibilities where it's

0:34:16.360 --> 0:34:20.680
<v Speaker 5>much easier to use AI. And what watson x does

0:34:20.880 --> 0:34:25.280
<v Speaker 5>it brings together tools for businesses to harness that power.

0:34:25.719 --> 0:34:30.080
<v Speaker 5>So whattsonex dot AI so foundation models that our customers

0:34:30.120 --> 0:34:33.320
<v Speaker 5>can use. It includes tools that make it easy to run,

0:34:33.480 --> 0:34:37.759
<v Speaker 5>easy to deploy, easy to experiment. There's a watsonex dot

0:34:37.880 --> 0:34:41.560
<v Speaker 5>Data component which allows you to sort of organize and

0:34:41.600 --> 0:34:43.920
<v Speaker 5>access to your data. So what we're really trying to

0:34:43.960 --> 0:34:48.120
<v Speaker 5>do is give our customers a cohesive set of tools

0:34:48.360 --> 0:34:52.120
<v Speaker 5>to harness the value of these technologies and at the

0:34:52.120 --> 0:34:55.120
<v Speaker 5>same time be able to manage the risks and other

0:34:55.160 --> 0:34:57.120
<v Speaker 5>things that you have to keep an eye on in

0:34:57.200 --> 0:34:58.360
<v Speaker 5>an enterprise context.

0:35:00.080 --> 0:35:02.719
<v Speaker 3>So we talk about the guests on this show as

0:35:03.239 --> 0:35:07.319
<v Speaker 3>new creators, by which we mean people who are creatively

0:35:07.360 --> 0:35:12.239
<v Speaker 3>applying technology in business to drive change. And I'm curious

0:35:12.760 --> 0:35:17.480
<v Speaker 3>how creativity plays a role in the research that you do.

0:35:18.080 --> 0:35:22.640
<v Speaker 5>I honestly, I think the creative aspects of this job

0:35:23.120 --> 0:35:26.400
<v Speaker 5>this is what makes this work exciting. You know, I

0:35:26.400 --> 0:35:28.360
<v Speaker 5>should say, you know the folks who work in my

0:35:28.480 --> 0:35:31.560
<v Speaker 5>organization are doing the creating, and I.

0:35:31.520 --> 0:35:35.080
<v Speaker 3>Guess you're doing the managing so that they could do

0:35:35.160 --> 0:35:35.640
<v Speaker 3>the creator.

0:35:36.520 --> 0:35:39.960
<v Speaker 5>I'm helping them be their best, and I still get

0:35:39.960 --> 0:35:42.880
<v Speaker 5>to get involved in the weeds of the research as

0:35:42.920 --> 0:35:45.719
<v Speaker 5>much as I can. But you know, there's something really

0:35:45.719 --> 0:35:49.600
<v Speaker 5>exciting about inventing, you know, like one of the nice

0:35:49.600 --> 0:35:53.840
<v Speaker 5>things about doing invention and doing research on AI. In industries,

0:35:54.200 --> 0:35:57.160
<v Speaker 5>it's usually grounded and a real problem that somebody's having.

0:35:57.200 --> 0:35:59.600
<v Speaker 5>You know, a customer wants to solve this problem that's

0:36:00.400 --> 0:36:02.960
<v Speaker 5>losing money or there there would be a new opportunity.

0:36:03.280 --> 0:36:07.080
<v Speaker 5>You identify that problem and then you you build something

0:36:07.440 --> 0:36:09.640
<v Speaker 5>that's never been built before to do that. And I

0:36:09.719 --> 0:36:13.520
<v Speaker 5>think that's honestly the adrenaline rush that keeps all of

0:36:13.600 --> 0:36:16.200
<v Speaker 5>us in this field. How do you do something that

0:36:16.320 --> 0:36:20.120
<v Speaker 5>nobody else on earth has has done before or tried before,

0:36:20.560 --> 0:36:23.920
<v Speaker 5>So that that kind of creativity and there's also creativity

0:36:23.920 --> 0:36:26.719
<v Speaker 5>as well and identifying what those problems are, being able

0:36:26.719 --> 0:36:32.200
<v Speaker 5>to understand the places where you know the technology is

0:36:32.239 --> 0:36:35.520
<v Speaker 5>close enough to solving a problem, and doing that matchmaking

0:36:35.880 --> 0:36:39.560
<v Speaker 5>between problems that are now solvable, you know, and in

0:36:39.640 --> 0:36:43.200
<v Speaker 5>AI where the fields moving so fast, this constantly growing

0:36:43.280 --> 0:36:46.360
<v Speaker 5>horizon of things that we might be able to solve,

0:36:46.640 --> 0:36:49.480
<v Speaker 5>So that matchmaking, I think, is also a really interesting

0:36:49.520 --> 0:36:53.160
<v Speaker 5>creative problem. So I think I think that's that's that's

0:36:53.200 --> 0:36:56.120
<v Speaker 5>why it's so much fun. And it's a fun environment

0:36:56.200 --> 0:36:58.600
<v Speaker 5>we have here too. It's you know, people drawing on

0:36:58.640 --> 0:37:03.080
<v Speaker 5>whiteboards and writing on pages of math and you.

0:37:03.120 --> 0:37:05.640
<v Speaker 2>Know, like in a movie, like in a movie.

0:37:05.560 --> 0:37:08.400
<v Speaker 5>Yes, straight from sexual casting.

0:37:07.680 --> 0:37:09.680
<v Speaker 3>The drawing on the window, writing on the window in.

0:37:09.640 --> 0:37:13.520
<v Speaker 2>Sharp absolutely so.

0:37:13.520 --> 0:37:18.200
<v Speaker 3>So let's close with the really long view. How do

0:37:18.239 --> 0:37:23.480
<v Speaker 3>you imagine AI and people working together twenty years from now?

0:37:25.560 --> 0:37:28.680
<v Speaker 5>Yeah, it's really hard to make predictions.

0:37:28.960 --> 0:37:30.799
<v Speaker 2>The vision that I.

0:37:32.440 --> 0:37:38.279
<v Speaker 5>Like, actually this came from an MIT economist named David Ott,

0:37:38.520 --> 0:37:44.320
<v Speaker 5>which was imagine AI almost as a natural resource. Yeah,

0:37:44.719 --> 0:37:47.719
<v Speaker 5>we know how natural resources work, right, like this an

0:37:47.880 --> 0:37:49.480
<v Speaker 5>or we can dig up out of the earth. It

0:37:49.520 --> 0:37:52.799
<v Speaker 5>comes from springs from the earth. Or we usually think

0:37:52.840 --> 0:37:55.799
<v Speaker 5>of that in terms of physical stuff. With AI, you

0:37:55.800 --> 0:37:57.239
<v Speaker 5>can almost think of it as like there's a new

0:37:57.320 --> 0:38:00.560
<v Speaker 5>kind of abundance potentially twenty years from now or not

0:38:00.600 --> 0:38:02.920
<v Speaker 5>only can we have things we can build or eat

0:38:03.000 --> 0:38:05.839
<v Speaker 5>or use or burn or whatever. Now we have you know,

0:38:05.960 --> 0:38:08.520
<v Speaker 5>this ability to do things and understand things and do

0:38:08.600 --> 0:38:11.759
<v Speaker 5>intellectual work, and I think we can get to a

0:38:11.840 --> 0:38:17.040
<v Speaker 5>world where automating things is just seamless. We're surrounded by

0:38:17.320 --> 0:38:22.520
<v Speaker 5>capability to augment ourselves to get things done. And you

0:38:22.560 --> 0:38:24.560
<v Speaker 5>could think of that in terms of like, oh, that's

0:38:24.600 --> 0:38:26.960
<v Speaker 5>going to displace our jobs, because eventually the AI system

0:38:27.040 --> 0:38:29.160
<v Speaker 5>is going to do everything we can do. But you

0:38:29.200 --> 0:38:31.239
<v Speaker 5>could also think of it in terms of, like, wow,

0:38:31.320 --> 0:38:33.640
<v Speaker 5>that's just so much abundance that we now have, and

0:38:33.680 --> 0:38:36.879
<v Speaker 5>really how we use that abundance is sort of up

0:38:36.920 --> 0:38:39.520
<v Speaker 5>to us, you know, like when you can writing software

0:38:39.600 --> 0:38:41.920
<v Speaker 5>is super easy and fast and anybody can do it.

0:38:42.360 --> 0:38:44.160
<v Speaker 5>Just think about all the things you can do now,

0:38:44.760 --> 0:38:46.880
<v Speaker 5>think about all the new activities, and go out all

0:38:46.880 --> 0:38:49.040
<v Speaker 5>the ways we could use that to enrich our lives.

0:38:49.480 --> 0:38:52.520
<v Speaker 5>That's where I'd like to see us in twenty years.

0:38:52.560 --> 0:38:55.120
<v Speaker 5>You know, we can we can do just so much

0:38:55.280 --> 0:38:58.560
<v Speaker 5>more than we were able to do before abundance.

0:38:59.360 --> 0:39:02.160
<v Speaker 2>Great, thank you so much for your time.

0:39:02.920 --> 0:39:04.920
<v Speaker 5>Yeah, it's been a pleasure. Thanks for inviting me.

0:39:06.440 --> 0:39:10.839
<v Speaker 4>What a far ranging, deep conversation. I'm mesmerized by the vision.

0:39:10.880 --> 0:39:15.040
<v Speaker 4>David just described a world where natural conversation between mankind

0:39:15.080 --> 0:39:20.239
<v Speaker 4>and machine can generate creative solutions to our most complex problems.

0:39:20.560 --> 0:39:24.120
<v Speaker 4>A world where we view AI not as our replacements,

0:39:24.719 --> 0:39:27.799
<v Speaker 4>but as a powerful resource we can tap into and

0:39:27.920 --> 0:39:33.200
<v Speaker 4>exponentially boost our innovation and productivity. Thanks so much to

0:39:33.239 --> 0:39:36.600
<v Speaker 4>doctor David Cox for joining us on smart Talks. We

0:39:36.680 --> 0:39:40.640
<v Speaker 4>deeply appreciate him sharing his huge breadth of AI knowledge

0:39:40.680 --> 0:39:44.880
<v Speaker 4>with us and for explaining the transformative potential of foundation

0:39:45.040 --> 0:39:48.440
<v Speaker 4>models in a way that even I can understand. We

0:39:48.520 --> 0:39:53.240
<v Speaker 4>eagerly await his next great breakthrough. Smart Talks with IBM

0:39:53.320 --> 0:39:57.480
<v Speaker 4>is produced by Matt Romano, David jaw nishe Venkat and

0:39:57.600 --> 0:40:02.480
<v Speaker 4>Royston Preserve with Jacob Goldstein. We're edited by Lydia Jane Kott.

0:40:02.800 --> 0:40:07.160
<v Speaker 4>Our engineers are Jason Gambrel, Sarah Buguier and Ben Holliday.

0:40:07.719 --> 0:40:13.000
<v Speaker 4>Theme song by Gramosco. Special thanks to Carli Megliori, Andy Kelly,

0:40:13.080 --> 0:40:17.080
<v Speaker 4>Kathy Callahan and the eight Bar and IBM teams, as

0:40:17.080 --> 0:40:20.920
<v Speaker 4>well as the Pushkin marketing team. Smart Talks with IBM

0:40:21.239 --> 0:40:25.120
<v Speaker 4>is a production of Pushkin Industries and iHeartMedia. To find

0:40:25.320 --> 0:40:29.760
<v Speaker 4>more Pushkin podcasts, listen on the iHeartRadio app, Apple Podcasts,

0:40:29.880 --> 0:40:34.400
<v Speaker 4>or wherever you listen to podcasts him, Malcolm Gladwell. This

0:40:34.680 --> 0:40:49.960
<v Speaker 4>is a paid advertisement from IBM.