WEBVTT - Smart Talks with IBM: RE-AIR - Transformations in AI: Why Foundation Models Are the Future

0:00:00.160 --> 0:00:02.840
<v Speaker 1>Hey everyone, it's Robert and Joe here. Today we've got

0:00:02.880 --> 0:00:04.760
<v Speaker 1>something a little bit different to share with you. It

0:00:04.840 --> 0:00:08.039
<v Speaker 1>is a new season of the Smart Talks with IBM

0:00:08.160 --> 0:00:09.160
<v Speaker 1>podcast series.

0:00:09.640 --> 0:00:11.639
<v Speaker 2>Today we are witnessed to one of those rare moments

0:00:11.680 --> 0:00:14.400
<v Speaker 2>in history, the rise of an innovative technology with the

0:00:14.400 --> 0:00:18.680
<v Speaker 2>potential to radically transform business and society forever. The technology,

0:00:18.760 --> 0:00:22.240
<v Speaker 2>of course, is artificial intelligence, and it's the central focus

0:00:22.239 --> 0:00:24.840
<v Speaker 2>for this new season of Smart Talks with IBM.

0:00:25.320 --> 0:00:28.400
<v Speaker 1>Join hosts from your favorite Pushkin podcasts as they talk

0:00:28.480 --> 0:00:31.680
<v Speaker 1>with industry experts and leaders to explore how businesses can

0:00:31.720 --> 0:00:35.400
<v Speaker 1>integrate AI into their workflows and help drive real change

0:00:35.400 --> 0:00:38.240
<v Speaker 1>in this new era of AI. And of course, host

0:00:38.280 --> 0:00:40.479
<v Speaker 1>Malcolm Gladwell will be there to guide you through the

0:00:40.479 --> 0:00:42.640
<v Speaker 1>season and throw in his two cents as well.

0:00:43.120 --> 0:00:46.200
<v Speaker 2>Look out for new episodes of Smart Talks with IBM

0:00:46.400 --> 0:00:49.559
<v Speaker 2>every other week on the iHeartRadio app, Apple Podcasts, or

0:00:49.600 --> 0:00:53.360
<v Speaker 2>wherever you get your podcasts. And learn more at IBM

0:00:53.479 --> 0:00:55.480
<v Speaker 2>dot com slash smart Talks.

0:00:56.120 --> 0:00:59.760
<v Speaker 3>Hey, it's Jacob Goldstein for Smart Talks with IBM. Last

0:00:59.840 --> 0:01:02.280
<v Speaker 3>year I had the pleasure of sitting down with doctor

0:01:02.360 --> 0:01:07.399
<v Speaker 3>David Cox, VP of AI Models at IBM Research. We

0:01:07.480 --> 0:01:11.200
<v Speaker 3>explored the fascinating world of AI foundation models and their

0:01:11.240 --> 0:01:15.919
<v Speaker 3>revolutionary potential for business automation and innovation. When we first

0:01:16.000 --> 0:01:19.319
<v Speaker 3>aired this episode, the concept of foundation models was just

0:01:19.400 --> 0:01:23.280
<v Speaker 3>beginning to capture our attention. Since then, this technology has

0:01:23.400 --> 0:01:27.480
<v Speaker 3>evolved and redefined the boundaries of what's possible. Businesses are

0:01:27.520 --> 0:01:30.959
<v Speaker 3>becoming more savvy about selecting the right models and understanding

0:01:30.959 --> 0:01:34.360
<v Speaker 3>how they can drive revenue and efficiency. As I listened

0:01:34.360 --> 0:01:37.280
<v Speaker 3>back to the conversation, it was interesting to reflect on

0:01:37.319 --> 0:01:41.120
<v Speaker 3>some new developments and ideas that have emerged, and many

0:01:41.160 --> 0:01:44.320
<v Speaker 3>of these we will continue to explore throughout the season,

0:01:44.840 --> 0:01:46.920
<v Speaker 3>like how to play an active role in choosing the

0:01:46.920 --> 0:01:50.440
<v Speaker 3>best model for your needs. Whether you're a longtime listener

0:01:50.520 --> 0:01:52.920
<v Speaker 3>or tuning in for the first time, I'm certain you'll

0:01:52.920 --> 0:01:56.680
<v Speaker 3>find doctor Cox's insights as thought provoking as ever. Thanks

0:01:56.680 --> 0:01:59.760
<v Speaker 3>as always for joining us. Now let's dive in.

0:02:01.440 --> 0:02:05.240
<v Speaker 4>Hello, Hello, Welcome to Smart Talks with IBM, a podcast

0:02:05.240 --> 0:02:10.720
<v Speaker 4>from Pushkin Industries, iHeartRadio and IBM. I'm Malcolm Gladwell. Our

0:02:10.760 --> 0:02:15.440
<v Speaker 4>guest today is doctor David Cox, VP of AI Models

0:02:15.480 --> 0:02:20.480
<v Speaker 4>at IBM Research and IBM Director of the MIT IBM

0:02:20.560 --> 0:02:24.800
<v Speaker 4>Watson AI Lab, a first of its kind industry academic

0:02:24.919 --> 0:02:30.000
<v Speaker 4>collaboration between IBM and MIT focused on the fundamental research

0:02:30.400 --> 0:02:35.160
<v Speaker 4>of artificial intelligence. Over the course of decades, David Cox

0:02:35.240 --> 0:02:39.880
<v Speaker 4>watched as the AI revolution steadily grew from the simmering

0:02:39.919 --> 0:02:43.760
<v Speaker 4>ideas of a few academics and technologists into the industrial

0:02:43.840 --> 0:02:48.160
<v Speaker 4>boom we are experiencing today. Having dedicated his life to

0:02:48.240 --> 0:02:51.640
<v Speaker 4>pushing the field of AI towards new horizons, David has

0:02:51.680 --> 0:02:55.600
<v Speaker 4>both contributed to and presided over many of the major

0:02:55.720 --> 0:03:01.320
<v Speaker 4>breakthroughs in artificial intelligence. In today's episode, you'll hear David

0:03:01.320 --> 0:03:06.440
<v Speaker 4>explain some of the conceptual underpinnings of the current AI landscape,

0:03:06.600 --> 0:03:11.240
<v Speaker 4>things like foundation models, in surprisingly comprehensible terms, I might add,

0:03:11.800 --> 0:03:14.959
<v Speaker 4>we'll also get into some of the amazing practical applications

0:03:14.960 --> 0:03:18.000
<v Speaker 4>for AI in business, as well as what implications AI

0:03:18.120 --> 0:03:21.640
<v Speaker 4>will have for the future of work and design. David

0:03:21.680 --> 0:03:25.400
<v Speaker 4>spoke with Jacob Goldstein, host of the Pushkin podcast What's

0:03:25.440 --> 0:03:29.720
<v Speaker 4>Your Problem. A veteran business journalist, Jacob has reported for

0:03:29.760 --> 0:03:32.560
<v Speaker 4>The Wall Street Journal, the Miami Herald, and was a

0:03:32.560 --> 0:03:38.200
<v Speaker 4>longtime host of the NPR program Planet Money. Okay, let's

0:03:38.240 --> 0:03:39.120
<v Speaker 4>get to the interview.

0:03:41.480 --> 0:03:43.240
<v Speaker 3>Tell me about your job at IBM.

0:03:43.840 --> 0:03:47.160
<v Speaker 5>So I wear two hats at IBM. So one, I'm

0:03:47.200 --> 0:03:50.080
<v Speaker 5>the IBM Doctor of the MIT IBM Watson AI Lab.

0:03:50.560 --> 0:03:53.600
<v Speaker 5>So that's a joint lab between IBM and MIT where

0:03:53.640 --> 0:03:56.200
<v Speaker 5>we try and invent what's next in AI. It's been

0:03:56.280 --> 0:03:59.120
<v Speaker 5>running for about five years, and then more recently I

0:03:59.160 --> 0:04:02.400
<v Speaker 5>started as Vice president for AI Models, and I'm in

0:04:02.480 --> 0:04:07.040
<v Speaker 5>charge of building IBM's foundation models, you know, building these

0:04:07.120 --> 0:04:09.240
<v Speaker 5>these big models, generative models that allow us to have

0:04:09.240 --> 0:04:11.320
<v Speaker 5>all kinds of new exciting capabilities in AI.

0:04:11.840 --> 0:04:13.760
<v Speaker 3>So, so I want to talk to you a lot

0:04:13.840 --> 0:04:17.240
<v Speaker 3>about foundation models, about genitive AI. But before we get

0:04:17.279 --> 0:04:19.360
<v Speaker 3>to that, let's just spend a minute on the on

0:04:19.440 --> 0:04:25.080
<v Speaker 3>the IBM MIT collaboration. Where did that partnership start, How

0:04:25.080 --> 0:04:25.920
<v Speaker 3>did it originate?

0:04:27.080 --> 0:04:29.880
<v Speaker 5>Yeah, So, actually it turns out that MIT and IBM

0:04:30.120 --> 0:04:33.200
<v Speaker 5>have been collaborating for a very long time in the

0:04:33.240 --> 0:04:37.280
<v Speaker 5>area of AI. In fact, the term artificial intelligence was

0:04:37.360 --> 0:04:41.000
<v Speaker 5>coined in a nineteen fifty six workshop that was held

0:04:41.040 --> 0:04:43.200
<v Speaker 5>at Dartmouth. It was actually organized by an IBM or

0:04:43.279 --> 0:04:46.560
<v Speaker 5>Nathaniel Rochester, who led the development of the IBM seven

0:04:46.600 --> 0:04:49.840
<v Speaker 5>and one. So we've really been together in AIS since

0:04:49.880 --> 0:04:54.719
<v Speaker 5>the beginning and as AI kept accelerating more and more

0:04:54.760 --> 0:04:58.360
<v Speaker 5>and more, I think there was a really interesting decision

0:04:58.360 --> 0:05:01.080
<v Speaker 5>to say, let's make this a formal partnership. So IBM

0:05:01.120 --> 0:05:02.960
<v Speaker 5>in twenty seventeen and now, so it'll be committing close

0:05:02.960 --> 0:05:05.840
<v Speaker 5>to a quarter billion dollars over ten years to have

0:05:05.920 --> 0:05:09.799
<v Speaker 5>this joint lab with MIT, and we we located ourselves

0:05:09.839 --> 0:05:12.000
<v Speaker 5>right on the campus and we've been developing very very

0:05:12.080 --> 0:05:14.359
<v Speaker 5>deep relationships where we can you know, really get to

0:05:14.400 --> 0:05:17.479
<v Speaker 5>know each other, work shoulder to shoulder, conceiving what we

0:05:17.480 --> 0:05:20.120
<v Speaker 5>should work on next, and then executing the projects. And

0:05:20.200 --> 0:05:24.159
<v Speaker 5>it's really you know, very few entities like this exist

0:05:24.640 --> 0:05:27.640
<v Speaker 5>between academia industry. It's been really fun of the last

0:05:27.680 --> 0:05:28.920
<v Speaker 5>five years to be a part of it.

0:05:29.560 --> 0:05:31.080
<v Speaker 3>And what do you think are some of the most

0:05:31.080 --> 0:05:34.600
<v Speaker 3>important outcomes of this collaboration between IBM and MIT.

0:05:36.000 --> 0:05:38.680
<v Speaker 5>Yeah, so we're really kind of the tip of the

0:05:38.720 --> 0:05:43.640
<v Speaker 5>sphere for for IBM's the I strategy. So we're really looking,

0:05:43.880 --> 0:05:46.680
<v Speaker 5>you know, what's coming ahead, and you know, in areas

0:05:46.720 --> 0:05:50.480
<v Speaker 5>like foundation models, you know, as the field changes, MIT

0:05:50.640 --> 0:05:53.279
<v Speaker 5>people are interested in working on you know, faculty, students

0:05:53.279 --> 0:05:55.440
<v Speaker 5>and staff are interested in working on what's the latest thing,

0:05:55.480 --> 0:05:58.520
<v Speaker 5>what's the next thing. We at IBM Research are very

0:05:58.560 --> 0:06:00.839
<v Speaker 5>much interested in the same. We can kind of put

0:06:00.839 --> 0:06:03.719
<v Speaker 5>out feelers, you know, interesting things that we're seeing in

0:06:03.760 --> 0:06:06.480
<v Speaker 5>our research, interesting things we're hearing in the field. We

0:06:06.520 --> 0:06:09.640
<v Speaker 5>can go and chase those opportunities. So when something big comes,

0:06:09.760 --> 0:06:12.719
<v Speaker 5>like the big change that's been happening lately with foundation models,

0:06:12.720 --> 0:06:15.120
<v Speaker 5>we're ready to jump on it. That's really the purpose,

0:06:15.200 --> 0:06:18.160
<v Speaker 5>that's that's the lab functioning the way it should. We're

0:06:18.200 --> 0:06:21.680
<v Speaker 5>also really interested in how do we advance you know,

0:06:21.839 --> 0:06:24.560
<v Speaker 5>AI that can help with climate change or you know,

0:06:24.680 --> 0:06:27.320
<v Speaker 5>build better materials and all these kinds of things that

0:06:27.360 --> 0:06:30.320
<v Speaker 5>are you know, a broader aperture sometimes than what we

0:06:30.400 --> 0:06:33.400
<v Speaker 5>might consider just looking at the product portfolio of IBM,

0:06:33.720 --> 0:06:35.480
<v Speaker 5>and that that gives us again a breadth where we

0:06:35.520 --> 0:06:38.160
<v Speaker 5>can see connections that we might not have seen otherwise.

0:06:38.520 --> 0:06:41.080
<v Speaker 5>We can you know, think things that help out society

0:06:41.200 --> 0:06:42.680
<v Speaker 5>and also help out our customers.

0:06:43.480 --> 0:06:47.920
<v Speaker 3>So the last whatever six months, say, there has been

0:06:47.960 --> 0:06:53.680
<v Speaker 3>this wild rise in the public's interest in AI right

0:06:53.720 --> 0:06:57.160
<v Speaker 3>clearly coming out of these generative AI models that are

0:06:57.160 --> 0:07:01.279
<v Speaker 3>really accessible you know, certainly chat GPT language models like that,

0:07:01.320 --> 0:07:04.400
<v Speaker 3>as well as models that generate images like mid journey.

0:07:05.000 --> 0:07:08.480
<v Speaker 3>I mean, can you just sort of briefly talk about

0:07:07.920 --> 0:07:11.920
<v Speaker 3>the breakthroughs in AI that have made this moment feel

0:07:12.000 --> 0:07:15.520
<v Speaker 3>so exciting, so revolutionary for artificial intelligence.

0:07:16.560 --> 0:07:21.280
<v Speaker 5>Yeah. You know, I've been studying AI basically my entire

0:07:21.320 --> 0:07:23.440
<v Speaker 5>adult life. Before I came to IBM, I was a

0:07:23.440 --> 0:07:26.000
<v Speaker 5>professor at Harvard. I've been doing this a long time,

0:07:26.240 --> 0:07:28.520
<v Speaker 5>and I've gotten used to being surprised. It sounds like

0:07:28.560 --> 0:07:31.640
<v Speaker 5>a joke, but it's serious, Like I'm getting used to

0:07:31.680 --> 0:07:35.600
<v Speaker 5>being surprised at the acceleration of the pace again. It

0:07:35.680 --> 0:07:38.560
<v Speaker 5>tracks actually a long way back. You know, there's lots

0:07:38.600 --> 0:07:41.160
<v Speaker 5>of things where there was an idea that just simmered

0:07:41.840 --> 0:07:45.200
<v Speaker 5>for a really long time. Some of the key math

0:07:45.680 --> 0:07:49.280
<v Speaker 5>behind the stuff that we have today, which is amazing.

0:07:50.000 --> 0:07:52.880
<v Speaker 5>There's an algorithm called back propagation, which is sort of

0:07:52.960 --> 0:07:55.600
<v Speaker 5>key to training neural networks that's been around, you know,

0:07:55.640 --> 0:07:59.520
<v Speaker 5>since the eighties in wide use. And really what happened

0:07:59.640 --> 0:08:03.320
<v Speaker 5>was it simmered for a long time, and then enough

0:08:03.480 --> 0:08:07.239
<v Speaker 5>data and enough compute came so we had enough data

0:08:07.280 --> 0:08:11.240
<v Speaker 5>because you know, we all started carrying multiple cameras around

0:08:11.240 --> 0:08:13.640
<v Speaker 5>with us. Our mobile phones have all you know, all

0:08:13.640 --> 0:08:16.400
<v Speaker 5>these cameras and this we put everything on the Internet,

0:08:16.520 --> 0:08:18.760
<v Speaker 5>and there's all this data out there. We caught a

0:08:18.840 --> 0:08:21.320
<v Speaker 5>lucky break that there was something called the graphics processing unit,

0:08:21.400 --> 0:08:23.640
<v Speaker 5>which turns out to be really useful for doing these

0:08:23.720 --> 0:08:26.360
<v Speaker 5>kinds of algorithms, maybe even more useful than it is

0:08:26.400 --> 0:08:30.360
<v Speaker 5>for doing graphics. They're greater graphics too, And things just

0:08:30.480 --> 0:08:33.080
<v Speaker 5>kept kind of adding to the snowball. So we had

0:08:33.240 --> 0:08:37.439
<v Speaker 5>deep learning, which is sort of a rebrand of neural

0:08:37.480 --> 0:08:39.960
<v Speaker 5>networks that I mentioned from the eighties, and that was

0:08:40.040 --> 0:08:43.199
<v Speaker 5>enabled again by data because we digitalized the world and

0:08:43.679 --> 0:08:46.520
<v Speaker 5>compute because we kept building faster and faster and more

0:08:46.520 --> 0:08:49.760
<v Speaker 5>powerful computers, and then that allowed us to make this

0:08:49.760 --> 0:08:53.440
<v Speaker 5>this big breakthrough. And then, you know, more recently, using

0:08:53.520 --> 0:08:57.600
<v Speaker 5>the same building blocks, that inexorable rise of more and

0:08:57.640 --> 0:09:02.160
<v Speaker 5>more and more data, that technology called self supervised learning.

0:09:02.640 --> 0:09:07.360
<v Speaker 5>Where the key difference there in traditional deep learning, you know,

0:09:07.400 --> 0:09:10.040
<v Speaker 5>for classifying images, you know, like is this a cat

0:09:10.120 --> 0:09:13.320
<v Speaker 5>or is this a dog? And a picture those technologies

0:09:13.800 --> 0:09:17.120
<v Speaker 5>require supper visions, so you have to take what you

0:09:17.200 --> 0:09:18.560
<v Speaker 5>have and then you have to label it. So you

0:09:18.600 --> 0:09:19.920
<v Speaker 5>have to take a picture of a cat and then

0:09:19.960 --> 0:09:22.640
<v Speaker 5>you label it as a cat, and it turns out that,

0:09:22.800 --> 0:09:25.000
<v Speaker 5>you know, that's very powerful, but it takes a lot

0:09:25.000 --> 0:09:27.920
<v Speaker 5>of time to label gats and to label dogs, and

0:09:28.360 --> 0:09:30.280
<v Speaker 5>there's only so many labels that exist in the world.

0:09:30.679 --> 0:09:34.240
<v Speaker 5>So what really changed more recently is that we have

0:09:34.320 --> 0:09:36.800
<v Speaker 5>self supervised learning where you don't have to have the labels.

0:09:36.800 --> 0:09:39.360
<v Speaker 5>We can just take unannotated data. And what that does

0:09:39.400 --> 0:09:42.480
<v Speaker 5>is it allows you use even more data. And that's

0:09:42.520 --> 0:09:46.120
<v Speaker 5>really what drove this this latest sort of rage. And

0:09:46.120 --> 0:09:48.280
<v Speaker 5>then and then all of a sudden we start getting

0:09:48.320 --> 0:09:52.199
<v Speaker 5>these these really powerful models. And then really, this has

0:09:52.240 --> 0:09:57.040
<v Speaker 5>been simmering technologies, right, this has been happening for a

0:09:57.080 --> 0:10:01.280
<v Speaker 5>while and progressively getting more and more powerful. One of

0:10:01.280 --> 0:10:05.560
<v Speaker 5>the things that really happened with CHATGBT and technologies like

0:10:06.000 --> 0:10:09.079
<v Speaker 5>Stable Diffusion and mid Journey was that they made it

0:10:09.640 --> 0:10:12.319
<v Speaker 5>visible to the public. You know, you put it out

0:10:12.360 --> 0:10:14.600
<v Speaker 5>there the public can touch and feel and they're like, wow,

0:10:14.920 --> 0:10:18.480
<v Speaker 5>not only is there palpable change, and wow this you know,

0:10:18.520 --> 0:10:20.000
<v Speaker 5>I can talk to this thing. Wow, this thing can

0:10:20.080 --> 0:10:22.959
<v Speaker 5>generate an image. Not only that, but everyone can touch

0:10:23.000 --> 0:10:27.280
<v Speaker 5>and feel and try. My kids can use some of

0:10:27.280 --> 0:10:32.720
<v Speaker 5>these AI art generation technologies, and that's really just launched.

0:10:32.800 --> 0:10:36.040
<v Speaker 5>You know, it's like a propelled slingshot at us into

0:10:36.360 --> 0:10:38.400
<v Speaker 5>a different regime. In terms of the public awareness of

0:10:38.400 --> 0:10:39.239
<v Speaker 5>these technologies.

0:10:39.920 --> 0:10:43.040
<v Speaker 3>You mentioned earlier in the conversation foundation models, and I

0:10:43.080 --> 0:10:44.920
<v Speaker 3>want to talk a little bit about that. I mean,

0:10:44.960 --> 0:10:48.360
<v Speaker 3>can you just tell me, you know, what are foundation

0:10:48.600 --> 0:10:51.360
<v Speaker 3>models for AI and why are they a big deal?

0:10:52.520 --> 0:10:56.240
<v Speaker 5>Yeah, So this term foundation model was coined by a

0:10:56.280 --> 0:10:59.960
<v Speaker 5>group at Stanford, and I think it's actually a really

0:11:00.080 --> 0:11:03.600
<v Speaker 5>apt term because remember I said, you know, one of

0:11:03.600 --> 0:11:06.920
<v Speaker 5>the big things that unlocked this latest excitement was the

0:11:06.920 --> 0:11:10.520
<v Speaker 5>fact that we could use large amounts of unannotated data.

0:11:11.080 --> 0:11:12.440
<v Speaker 5>We could train a model. We don't have to go

0:11:12.480 --> 0:11:16.000
<v Speaker 5>through the painful effort of labeling each and every example.

0:11:16.559 --> 0:11:18.800
<v Speaker 5>You still need to have your model do something you

0:11:18.800 --> 0:11:21.000
<v Speaker 5>wanted to do. You still need to tell it what

0:11:21.040 --> 0:11:22.600
<v Speaker 5>you want to do. You can't just have a model

0:11:22.640 --> 0:11:25.160
<v Speaker 5>that doesn't, you know, have any purpose. But what a

0:11:25.200 --> 0:11:29.040
<v Speaker 5>foundation models that provides a foundation, like a literal foundation.

0:11:29.280 --> 0:11:31.320
<v Speaker 5>You can sort of stand on the shoulders of giants.

0:11:31.360 --> 0:11:34.079
<v Speaker 5>You can have them these massively trained models, and then

0:11:34.120 --> 0:11:36.280
<v Speaker 5>do a little bit on top. You know, you could

0:11:36.480 --> 0:11:38.560
<v Speaker 5>use just a few examples of what you're looking for

0:11:39.360 --> 0:11:41.520
<v Speaker 5>and you can get what you want from the model,

0:11:42.040 --> 0:11:44.080
<v Speaker 5>So just a little bit on top now gets to

0:11:44.240 --> 0:11:46.240
<v Speaker 5>the results that a huge amount of effort used to

0:11:46.280 --> 0:11:48.199
<v Speaker 5>have to put in, you know, to get from the

0:11:48.320 --> 0:11:50.360
<v Speaker 5>ground up to that level.

0:11:50.640 --> 0:11:54.560
<v Speaker 3>I was trying to think of of an analogy for

0:11:54.679 --> 0:11:57.720
<v Speaker 3>sort of foundation models versus what came before, and I

0:11:57.760 --> 0:12:00.240
<v Speaker 3>don't know that I came up with a good one,

0:12:00.280 --> 0:12:01.959
<v Speaker 3>but the best I could do was this. I want

0:12:01.960 --> 0:12:04.920
<v Speaker 3>you to tell me if it's plausible. It's like before

0:12:05.000 --> 0:12:08.520
<v Speaker 3>foundation models, it was like you had these sort of

0:12:08.600 --> 0:12:11.800
<v Speaker 3>single use kitchen appliances. You could make a waffle iron

0:12:11.840 --> 0:12:14.360
<v Speaker 3>if you wanted waffles, or you could make a toaster

0:12:14.520 --> 0:12:17.079
<v Speaker 3>if you wanted to make toast. But a foundation model

0:12:17.160 --> 0:12:19.720
<v Speaker 3>is like like an oven with a range on top.

0:12:19.800 --> 0:12:21.559
<v Speaker 3>So it's like this machine and you could just cook

0:12:21.640 --> 0:12:23.480
<v Speaker 3>anything with this machine.

0:12:24.120 --> 0:12:28.600
<v Speaker 5>Yeah, that's a great analogy. They're very versatile. The other

0:12:28.720 --> 0:12:31.280
<v Speaker 5>piece of it, too, is that they dramatically lower the

0:12:31.400 --> 0:12:34.560
<v Speaker 5>effort that it takes to do something that you want

0:12:34.600 --> 0:12:37.600
<v Speaker 5>to do. And sometimes I used to say about the

0:12:37.640 --> 0:12:39.600
<v Speaker 5>old world of AI, would say, you know, the problem

0:12:39.640 --> 0:12:43.400
<v Speaker 5>with automation is that it's too labor intensive. H sounds

0:12:43.440 --> 0:12:44.400
<v Speaker 5>like I'm making a joke.

0:12:44.640 --> 0:12:49.200
<v Speaker 3>Indeed, famously, if automation does one thing, it substitutes machines

0:12:49.320 --> 0:12:52.520
<v Speaker 3>or computing power for labor. Right, So what does that

0:12:52.600 --> 0:12:56.880
<v Speaker 3>mean to say AI is or automation is too labor intensive.

0:12:57.360 --> 0:12:59.320
<v Speaker 5>It sounds like I'm making a joke, but I'm actually serious.

0:12:59.559 --> 0:13:02.800
<v Speaker 5>What I mean is that the effort it took the

0:13:02.880 --> 0:13:06.719
<v Speaker 5>old regime to automate something was very, very high. So

0:13:06.920 --> 0:13:09.800
<v Speaker 5>if I need to go and curate all this data,

0:13:09.840 --> 0:13:13.040
<v Speaker 5>collect all this data, and then carefully label all these examples,

0:13:13.440 --> 0:13:17.360
<v Speaker 5>that labeling itself might be incredibly expensive and time. So

0:13:17.760 --> 0:13:20.360
<v Speaker 5>and we estimate anywhere between eighty to ninety percent of

0:13:20.400 --> 0:13:23.240
<v Speaker 5>the effort it takes to feel an AI solution actually

0:13:23.360 --> 0:13:26.959
<v Speaker 5>is just spent on data, so that that has some consequences,

0:13:27.240 --> 0:13:32.600
<v Speaker 5>which is the threshold for bothering. You know, if you're

0:13:32.600 --> 0:13:34.800
<v Speaker 5>going to only get a little bit of value back

0:13:35.040 --> 0:13:37.280
<v Speaker 5>from something, are you going to go through this huge

0:13:37.280 --> 0:13:40.800
<v Speaker 5>effort to curate all this data and then when it

0:13:40.800 --> 0:13:43.240
<v Speaker 5>comes time to train the model, you need highly skilled

0:13:43.240 --> 0:13:47.280
<v Speaker 5>people expensive or hard to find in the labor market.

0:13:47.440 --> 0:13:48.959
<v Speaker 5>You know, are you really going to do something that's

0:13:49.000 --> 0:13:50.920
<v Speaker 5>just a tiny, little incremental thing. Now, you're going to

0:13:50.960 --> 0:13:54.600
<v Speaker 5>do the only the highest value things that weren't at

0:13:54.800 --> 0:13:56.000
<v Speaker 5>level because.

0:13:55.679 --> 0:13:59.080
<v Speaker 3>You have to essentially build the whole machine from scratch,

0:13:59.280 --> 0:14:02.200
<v Speaker 3>and there aren't many things where it's worth that much

0:14:02.240 --> 0:14:04.280
<v Speaker 3>work to build a machine that's only going to do

0:14:04.400 --> 0:14:05.600
<v Speaker 3>one narrow thing.

0:14:06.040 --> 0:14:09.000
<v Speaker 5>That's right, and then you tackle the next problem and

0:14:09.080 --> 0:14:11.400
<v Speaker 5>you basically have to start over. And you know, there

0:14:11.440 --> 0:14:14.240
<v Speaker 5>are some nuances here, like for images, you can pre

0:14:14.280 --> 0:14:16.800
<v Speaker 5>train a model on some other tasks and change it around.

0:14:16.800 --> 0:14:19.760
<v Speaker 5>So there are some examples of this like non recurring

0:14:19.880 --> 0:14:22.480
<v Speaker 5>cost that we have in the old world too, But

0:14:22.520 --> 0:14:25.040
<v Speaker 5>by and large, it's just a lot of effort. It's hard.

0:14:25.320 --> 0:14:29.600
<v Speaker 5>It takes, you know, a large level of skill to implement.

0:14:30.400 --> 0:14:33.160
<v Speaker 5>One analogy that I like is, you know, think about

0:14:33.200 --> 0:14:35.320
<v Speaker 5>it as you know, you have a river of data,

0:14:35.720 --> 0:14:39.080
<v Speaker 5>you know, running through your company or your institution. Traditional

0:14:39.080 --> 0:14:41.600
<v Speaker 5>AI solutions are kind of like building a dam on

0:14:41.600 --> 0:14:45.080
<v Speaker 5>that river. You know, dams are very expensive things to build.

0:14:45.440 --> 0:14:49.680
<v Speaker 5>They require highly specialized skills and lots of planning. And

0:14:49.880 --> 0:14:51.560
<v Speaker 5>you know, you're only going to put a dam on

0:14:51.960 --> 0:14:54.680
<v Speaker 5>a river that's big enough that you're gonna get enough

0:14:54.800 --> 0:14:57.200
<v Speaker 5>energy out of it that it was worth trouble. You're

0:14:57.200 --> 0:14:58.640
<v Speaker 5>gonna get a lot of value out of that dam.

0:14:58.680 --> 0:15:00.320
<v Speaker 5>If you have a river like that, you know, a

0:15:00.400 --> 0:15:03.960
<v Speaker 5>river of data, but it's actually the vast majority of

0:15:04.160 --> 0:15:06.520
<v Speaker 5>the water you know in your kingdom actually isn't in

0:15:06.560 --> 0:15:10.560
<v Speaker 5>that river. It's in puddles and greeks and babid brooks,

0:15:10.640 --> 0:15:14.080
<v Speaker 5>And you know, there's a lot of value left on

0:15:14.120 --> 0:15:16.680
<v Speaker 5>the table because it's like, well, I can't there's nothing

0:15:16.720 --> 0:15:18.520
<v Speaker 5>you can do about it. It's just that that's too

0:15:19.480 --> 0:15:22.600
<v Speaker 5>low value. So it takes too much effort, so I'm

0:15:22.640 --> 0:15:24.200
<v Speaker 5>just not going to do it. The return around investment

0:15:24.560 --> 0:15:27.120
<v Speaker 5>just isn't there, so you just end up not automating

0:15:27.160 --> 0:15:29.960
<v Speaker 5>things because it's too much of a pain. Now what

0:15:30.000 --> 0:15:32.440
<v Speaker 5>foundation models do is they say, well, actually, no, we

0:15:32.480 --> 0:15:35.680
<v Speaker 5>can train a base model a foundation that you can

0:15:35.720 --> 0:15:37.360
<v Speaker 5>work on that we don't we don't care. We don't

0:15:37.400 --> 0:15:39.240
<v Speaker 5>specify what the task is ahead of time. We just

0:15:39.280 --> 0:15:42.440
<v Speaker 5>need to learn about the domain of data. So if

0:15:42.440 --> 0:15:45.320
<v Speaker 5>we want to build something that can understand English language,

0:15:45.640 --> 0:15:48.920
<v Speaker 5>there's a ton of English language text available out in

0:15:48.960 --> 0:15:53.040
<v Speaker 5>the world. We can now train models on huge quantities

0:15:53.040 --> 0:15:56.200
<v Speaker 5>of it, and then it learned the structure. It learned

0:15:56.280 --> 0:15:59.040
<v Speaker 5>how language, you know, good part of how language works

0:15:59.120 --> 0:16:01.400
<v Speaker 5>on all that unlabeled data. And then when you roll

0:16:01.480 --> 0:16:04.440
<v Speaker 5>up with your task, you know, I want to solve

0:16:04.440 --> 0:16:07.560
<v Speaker 5>this particular problem, you don't have to start from scratch.

0:16:07.600 --> 0:16:11.040
<v Speaker 5>You're starting from a very very very high place. So

0:16:11.080 --> 0:16:13.560
<v Speaker 5>that just gives you the ability to just you know, now,

0:16:13.600 --> 0:16:16.440
<v Speaker 5>all of a sudden, everything is accessible. All the puddles

0:16:16.440 --> 0:16:19.200
<v Speaker 5>and greeks and babbling brooks and kettlepons, you know, those

0:16:19.200 --> 0:16:23.960
<v Speaker 5>are all accessible now. And that's that's very exciting. But

0:16:24.040 --> 0:16:26.520
<v Speaker 5>it just changes the equation on what kinds of problems

0:16:26.640 --> 0:16:27.840
<v Speaker 5>you could use AI to solve.

0:16:27.960 --> 0:16:33.400
<v Speaker 3>And so foundation models basically mean that automating some new

0:16:33.520 --> 0:16:36.760
<v Speaker 3>task is much less labor intensive, The sort of marginal

0:16:36.840 --> 0:16:39.840
<v Speaker 3>effort to do some new automation thing is much lower

0:16:39.880 --> 0:16:43.120
<v Speaker 3>because you're building on top of the foundation model rather

0:16:43.200 --> 0:16:47.560
<v Speaker 3>than starting from scratch. Absolutely, so that is that is

0:16:47.680 --> 0:16:51.280
<v Speaker 3>like the exciting good news. I do feel like there's

0:16:52.080 --> 0:16:54.680
<v Speaker 3>a little bit of a countervailing idea that's worth talking

0:16:54.720 --> 0:16:57.080
<v Speaker 3>about here, and that is the idea that even though

0:16:57.080 --> 0:17:01.120
<v Speaker 3>there are these foundation models that are really powerful, that

0:17:01.160 --> 0:17:04.200
<v Speaker 3>are relatively easy to build on top of, it's still

0:17:04.240 --> 0:17:07.439
<v Speaker 3>the case, right that there is not some one size

0:17:07.480 --> 0:17:11.159
<v Speaker 3>fits all foundation model. So you know, what does that

0:17:11.320 --> 0:17:13.520
<v Speaker 3>mean and why is that important to think about in

0:17:13.560 --> 0:17:14.320
<v Speaker 3>this context?

0:17:14.880 --> 0:17:18.679
<v Speaker 5>Yeah, so we believe very strongly that there isn't just

0:17:18.800 --> 0:17:21.680
<v Speaker 5>one model to rule them all. There's a number of

0:17:21.720 --> 0:17:24.720
<v Speaker 5>reasons why that could be true. One which I think

0:17:24.800 --> 0:17:28.800
<v Speaker 5>is important and very relevant today is how much energy

0:17:29.119 --> 0:17:33.880
<v Speaker 5>these models can consume. So these models, you know, can

0:17:33.920 --> 0:17:39.360
<v Speaker 5>get very very large. So one thing that we're starting

0:17:39.400 --> 0:17:42.120
<v Speaker 5>to see or starting to believe, is that you probably

0:17:42.160 --> 0:17:47.280
<v Speaker 5>shouldn't use one giant sledgehammer model to solve every single problem,

0:17:47.480 --> 0:17:49.400
<v Speaker 5>you know, like we should pick the right size model

0:17:49.440 --> 0:17:52.239
<v Speaker 5>to solve the problem. We shouldn't necessarily assume that we

0:17:52.280 --> 0:17:56.840
<v Speaker 5>need the biggest, baddest model for every little use case.

0:17:57.320 --> 0:17:59.520
<v Speaker 5>And we're also seeing that, you know, small models that

0:17:59.520 --> 0:18:03.439
<v Speaker 5>are trained to like to specialize on particular domains can

0:18:03.480 --> 0:18:07.600
<v Speaker 5>actually outperform much bigger models. So bigger isn't always even better.

0:18:07.720 --> 0:18:10.280
<v Speaker 3>So they're more efficient and they do the thing you

0:18:10.320 --> 0:18:11.960
<v Speaker 3>want them to do better as well.

0:18:12.480 --> 0:18:15.760
<v Speaker 5>That's right. So Stanford, for instance, a group of Stanford

0:18:15.800 --> 0:18:18.919
<v Speaker 5>trained a model. It is a two point seven billion

0:18:18.960 --> 0:18:22.080
<v Speaker 5>parameter model, which isn't terribly big by today's standards. They

0:18:22.080 --> 0:18:24.359
<v Speaker 5>trained it just on the biomedical literature, you know, this

0:18:24.400 --> 0:18:26.800
<v Speaker 5>is the kind of thing that universities do, and what

0:18:26.840 --> 0:18:30.320
<v Speaker 5>they showed was that this model was better at answering

0:18:30.400 --> 0:18:32.920
<v Speaker 5>questions about the biomedical literature than some models that were

0:18:33.440 --> 0:18:37.159
<v Speaker 5>one hundred billion parameters, you know, many times larger. So

0:18:37.320 --> 0:18:39.880
<v Speaker 5>it's a little bit like you know, asking an expert

0:18:40.320 --> 0:18:43.600
<v Speaker 5>for help on something versus asking the smartest person. You know,

0:18:44.160 --> 0:18:46.720
<v Speaker 5>the smartest person you know may be very smart, but

0:18:46.800 --> 0:18:49.679
<v Speaker 5>they're not going to be expertise. And then as an

0:18:49.720 --> 0:18:52.199
<v Speaker 5>added bonus, you know, this is now a much smaller model,

0:18:52.280 --> 0:18:54.159
<v Speaker 5>it's much more efficient to run. We are you know,

0:18:54.760 --> 0:18:58.640
<v Speaker 5>you know, it's cheaper. So there's lots of different advantages there.

0:18:58.680 --> 0:19:02.280
<v Speaker 5>So I think we're going to see attention in the

0:19:02.320 --> 0:19:05.600
<v Speaker 5>industry between vendors that say, hey, this is the one,

0:19:05.800 --> 0:19:08.159
<v Speaker 5>you know, big model, and then others that say, well, actually,

0:19:08.440 --> 0:19:10.960
<v Speaker 5>you know, there's there's you know, lots of different tools

0:19:10.960 --> 0:19:13.000
<v Speaker 5>we can use that all have this nice quality that

0:19:13.040 --> 0:19:15.680
<v Speaker 5>we outlined at the beginning, and then we should really

0:19:15.680 --> 0:19:17.200
<v Speaker 5>pick the one that makes the most sense for the

0:19:17.560 --> 0:19:18.280
<v Speaker 5>task at hand.

0:19:19.560 --> 0:19:23.960
<v Speaker 3>So there's sustainability basically efficiency, another kind of set of

0:19:23.960 --> 0:19:27.880
<v Speaker 3>issues that come up a lot with AI A are bias, hallucination.

0:19:28.600 --> 0:19:31.200
<v Speaker 3>Can you talk a little bit about bias and hallucination,

0:19:31.320 --> 0:19:34.240
<v Speaker 3>what they are and how you're working to mitigate those problems.

0:19:34.640 --> 0:19:37.520
<v Speaker 5>Yeah, so there are lots of issues still as amazing

0:19:37.520 --> 0:19:40.440
<v Speaker 5>as these technologies are, and they are amazing, let's let's

0:19:40.480 --> 0:19:42.960
<v Speaker 5>be very clear, lots of great things we're going to

0:19:43.080 --> 0:19:46.920
<v Speaker 5>enable with these kinds of technologies. Bias isn't a new problem.

0:19:47.240 --> 0:19:51.840
<v Speaker 5>So you know, basically we've seen this since the beginning

0:19:51.880 --> 0:19:54.800
<v Speaker 5>of AI. If you train a model on data that

0:19:55.200 --> 0:19:57.320
<v Speaker 5>has a bias in it, the model is going to

0:19:57.359 --> 0:20:01.920
<v Speaker 5>recapitulate that bias when it provides it's answers. So every time,

0:20:02.119 --> 0:20:04.639
<v Speaker 5>you know, if all the text you have says, you know,

0:20:04.680 --> 0:20:07.760
<v Speaker 5>it's more likely to refer to female nurses and male scientists,

0:20:07.800 --> 0:20:09.879
<v Speaker 5>then you're going to you know, get models that you know.

0:20:09.960 --> 0:20:13.040
<v Speaker 5>For instance, there was an example where a machine learning

0:20:13.040 --> 0:20:17.480
<v Speaker 5>based translation system translated from Hungarian to English. Hungarian doesn't

0:20:17.480 --> 0:20:20.800
<v Speaker 5>have gendered pronouns. English does, and when you ask them

0:20:20.800 --> 0:20:23.119
<v Speaker 5>to translate, it would translate they are a nurse to

0:20:23.560 --> 0:20:26.520
<v Speaker 5>she is a nurse, translate they are a scientist. To

0:20:26.600 --> 0:20:29.720
<v Speaker 5>he is a scientist. And that's not because the people

0:20:29.720 --> 0:20:32.520
<v Speaker 5>who wrote the algorithm were building in bias and coding

0:20:32.560 --> 0:20:34.080
<v Speaker 5>in like, oh, it's got to be this way. It's

0:20:34.119 --> 0:20:36.359
<v Speaker 5>because the data was like that. You know, we have

0:20:36.480 --> 0:20:40.920
<v Speaker 5>biases in our society and they're reflected in our data

0:20:40.960 --> 0:20:44.600
<v Speaker 5>and our text and our images everywhere. And then the

0:20:44.640 --> 0:20:47.600
<v Speaker 5>models they're just mapping from what they what they've seen

0:20:47.600 --> 0:20:50.120
<v Speaker 5>in their training data to to the result that you're

0:20:50.200 --> 0:20:51.800
<v Speaker 5>trying to get them to do and to give, and

0:20:51.840 --> 0:20:56.280
<v Speaker 5>then these biases come out. So there's a very active

0:20:57.119 --> 0:21:00.320
<v Speaker 5>program of research and you know, we do quite a

0:21:00.320 --> 0:21:03.880
<v Speaker 5>bit at IBM research and my T but also all

0:21:03.920 --> 0:21:06.639
<v Speaker 5>over the community and industry and academia trying to figure

0:21:06.640 --> 0:21:09.840
<v Speaker 5>out how do we explicitly remove these biases, how do

0:21:09.840 --> 0:21:12.000
<v Speaker 5>we identify them, how do you know, how do we

0:21:12.040 --> 0:21:14.679
<v Speaker 5>build tools that allow people to audit their systems to

0:21:14.680 --> 0:21:17.000
<v Speaker 5>make sure they aren't biased. So this is a really

0:21:17.040 --> 0:21:20.200
<v Speaker 5>important thing. And you know, again this was here since

0:21:20.240 --> 0:21:24.000
<v Speaker 5>the beginning, you know, of machine learning and AI, but

0:21:24.680 --> 0:21:28.439
<v Speaker 5>foundation models and large language models and generative AI just

0:21:28.480 --> 0:21:31.199
<v Speaker 5>bring it into sharper even sharper focus because there's just

0:21:31.240 --> 0:21:34.600
<v Speaker 5>so much data and it's sort of building in baking

0:21:34.680 --> 0:21:37.560
<v Speaker 5>in all these different biases we have, so that that's

0:21:37.680 --> 0:21:41.680
<v Speaker 5>that's absolutely a problem that these models have. Another one

0:21:41.680 --> 0:21:45.480
<v Speaker 5>that you mentioned was hallucinations. So even the most impressive

0:21:45.520 --> 0:21:49.720
<v Speaker 5>of our models will often just make stuff up. You know,

0:21:49.920 --> 0:21:52.919
<v Speaker 5>the technical term that the field has chosen is hallucination.

0:21:53.520 --> 0:21:56.439
<v Speaker 5>To give you an example, I asked chat tbt to

0:21:56.720 --> 0:22:00.480
<v Speaker 5>create a biography of David Cox at IBM, and you know,

0:22:00.720 --> 0:22:03.320
<v Speaker 5>it started off really well. You know, the identified that

0:22:03.359 --> 0:22:05.800
<v Speaker 5>I was the director of the MNT IBM Watson and

0:22:05.800 --> 0:22:08.200
<v Speaker 5>said a few words about that, and then it proceeded

0:22:08.200 --> 0:22:12.760
<v Speaker 5>to create an authoritative but completely fake biography of me

0:22:12.800 --> 0:22:15.320
<v Speaker 5>where I was British, I was born in the UK,

0:22:16.680 --> 0:22:19.640
<v Speaker 5>I went to British university, you know universities in the UK.

0:22:19.720 --> 0:22:21.320
<v Speaker 5>I was professor the authority.

0:22:21.400 --> 0:22:24.960
<v Speaker 3>Right, it's the certainty that that is weird about it, right,

0:22:24.960 --> 0:22:28.240
<v Speaker 3>It's it's dead certain that you're from the UK, et cetera.

0:22:28.840 --> 0:22:31.879
<v Speaker 5>Absolutely, yeah, it has all kinds of flourishes like I

0:22:31.920 --> 0:22:36.639
<v Speaker 5>want awards in the UK. So yeah, it's it's problematic

0:22:36.720 --> 0:22:39.560
<v Speaker 5>because it kind of pokes a lot of weak spots

0:22:39.560 --> 0:22:44.760
<v Speaker 5>in our human psychology where if something sounds coherent, We're

0:22:44.880 --> 0:22:47.640
<v Speaker 5>likely to assume it's true. We're not used to interacting

0:22:47.640 --> 0:22:52.360
<v Speaker 5>with people who eloquently and authoritatively, you know, emit complete nonsense,

0:22:52.440 --> 0:22:55.280
<v Speaker 5>like yeah, you know, we can debate about that, but.

0:22:55.240 --> 0:22:57.600
<v Speaker 3>Yeah, we can debate about that. But yes, it's the

0:22:58.520 --> 0:23:02.159
<v Speaker 3>sort of blive confidence throws you off when you realize

0:23:02.200 --> 0:23:03.119
<v Speaker 3>it's completely wrong.

0:23:03.240 --> 0:23:06.000
<v Speaker 5>Right, that's right. And we do have a little bit

0:23:06.040 --> 0:23:09.240
<v Speaker 5>of like a great and powerful oz sort of vibe

0:23:09.280 --> 0:23:11.600
<v Speaker 5>going sometimes where we're like, well, you know, the AI

0:23:11.800 --> 0:23:15.560
<v Speaker 5>is all knowing and therefore whatever it says must be true.

0:23:15.800 --> 0:23:20.040
<v Speaker 5>But these things will make up stuff, you know, very aggressively,

0:23:20.760 --> 0:23:23.199
<v Speaker 5>and you know, you everyone can try asking it for

0:23:23.200 --> 0:23:26.720
<v Speaker 5>their their bio. You'll you'll get something that You'll always

0:23:26.720 --> 0:23:29.040
<v Speaker 5>get something that's of the right form, that has the

0:23:29.119 --> 0:23:32.040
<v Speaker 5>right tone. But you know, the facts just aren't necessarily there.

0:23:32.359 --> 0:23:34.760
<v Speaker 5>So that's obviously a problem. We need to figure out

0:23:34.760 --> 0:23:37.959
<v Speaker 5>how to close those gaps, fix those problems. There's lots

0:23:38.000 --> 0:23:40.080
<v Speaker 5>of ways we can use them much more easily.

0:23:40.600 --> 0:23:43.320
<v Speaker 4>I'd just like to say, faced with the awesome potential

0:23:43.359 --> 0:23:46.360
<v Speaker 4>of what these technologies might do, it's a bit encouraging

0:23:46.440 --> 0:23:49.960
<v Speaker 4>to hear that even chat GPT has a weakness for

0:23:50.080 --> 0:23:55.200
<v Speaker 4>inventing flamboyant, if fictional versions of people's lives, and while

0:23:55.320 --> 0:23:58.879
<v Speaker 4>entertaining ourselves with chat GPT and mid journey is important,

0:23:59.320 --> 0:24:03.840
<v Speaker 4>the way lpeople use consumer facing chatbots and generative AI

0:24:04.359 --> 0:24:08.240
<v Speaker 4>is just fundamentally different from the way an enterprise business

0:24:08.320 --> 0:24:11.960
<v Speaker 4>uses AI. How can we harness the abilities of artificial

0:24:12.000 --> 0:24:15.040
<v Speaker 4>intelligence to help us solve the problems we face in

0:24:15.119 --> 0:24:18.959
<v Speaker 4>business and technology. Let's listen on as David and Jacob

0:24:19.119 --> 0:24:20.440
<v Speaker 4>continue their conversation.

0:24:21.200 --> 0:24:24.160
<v Speaker 3>We've been talking in a somewhat abstract way about AI

0:24:24.280 --> 0:24:27.040
<v Speaker 3>in the ways it can be used. Let's talk in

0:24:27.040 --> 0:24:30.400
<v Speaker 3>a little bit more of a specific way. Can you

0:24:30.440 --> 0:24:34.240
<v Speaker 3>just talk about some examples of business challenges that can

0:24:34.280 --> 0:24:37.640
<v Speaker 3>be solved with automation, with this kind of automation we're

0:24:37.640 --> 0:24:38.560
<v Speaker 3>talking about.

0:24:39.119 --> 0:24:42.520
<v Speaker 5>Yeah, so the really really, this guy's the limit. There's

0:24:42.560 --> 0:24:46.480
<v Speaker 5>a whole set of different applications that these models are

0:24:46.520 --> 0:24:49.359
<v Speaker 5>really good at. And basically it's a superset of everything

0:24:49.400 --> 0:24:52.480
<v Speaker 5>we used to use AI for in business. So you know,

0:24:53.080 --> 0:24:54.760
<v Speaker 5>the simple kinds of things are like, hey, if I

0:24:54.760 --> 0:24:58.240
<v Speaker 5>have text and I have product reviews, and I want

0:24:58.240 --> 0:25:00.000
<v Speaker 5>to be able to tell if these are positive or negative.

0:25:00.240 --> 0:25:02.040
<v Speaker 5>You know, like let's look at all the negative reviews,

0:25:02.040 --> 0:25:03.399
<v Speaker 5>so we can have a human look through them and

0:25:03.720 --> 0:25:07.560
<v Speaker 5>see what was up. Very common business use case. You

0:25:07.600 --> 0:25:11.760
<v Speaker 5>can do it with traditional deep learning based AI. So so

0:25:11.640 --> 0:25:13.240
<v Speaker 5>there's things like that that are you know, it's very

0:25:13.440 --> 0:25:15.439
<v Speaker 5>prosaic sort of we were already doing it. We've been

0:25:15.440 --> 0:25:18.960
<v Speaker 5>doing it for a long time. Then you get situations

0:25:19.000 --> 0:25:21.159
<v Speaker 5>that are that were harder for the old day. I like,

0:25:21.600 --> 0:25:24.879
<v Speaker 5>if I'm I want to impress something like I want

0:25:24.920 --> 0:25:27.200
<v Speaker 5>to I have like say I have a chat transcript,

0:25:27.240 --> 0:25:30.200
<v Speaker 5>Like a customer called in and they had a complaint.

0:25:30.880 --> 0:25:34.359
<v Speaker 5>They called back. Okay, now a new you know, a

0:25:34.520 --> 0:25:36.480
<v Speaker 5>person on the line needs to go read the old

0:25:36.520 --> 0:25:39.320
<v Speaker 5>transcript to catch up. Wouldn't it be better if we

0:25:39.359 --> 0:25:41.800
<v Speaker 5>could just summarize that, just condense it all down a

0:25:41.920 --> 0:25:44.119
<v Speaker 5>quick little paragraph. You know, customer called they we upset

0:25:44.119 --> 0:25:46.160
<v Speaker 5>about this, rather than having to read the blow by blow.

0:25:46.600 --> 0:25:49.760
<v Speaker 5>There's just lots of settings like that where summarization is

0:25:49.800 --> 0:25:52.840
<v Speaker 5>really helpful. Hey, you have a meeting and I'd like

0:25:52.920 --> 0:25:55.600
<v Speaker 5>to just automatically you know, have had that meeting or

0:25:55.640 --> 0:25:57.480
<v Speaker 5>that email or whatever. I'd like to just have a

0:25:57.520 --> 0:25:59.679
<v Speaker 5>condensed down so I can really quickly get to the

0:25:59.680 --> 0:26:02.360
<v Speaker 5>heart of the matter. These models are are really good

0:26:02.359 --> 0:26:05.000
<v Speaker 5>at doing that. They're also really good at question answering.

0:26:05.320 --> 0:26:07.640
<v Speaker 5>So if I want to find out, what's how many

0:26:07.720 --> 0:26:11.320
<v Speaker 5>vacation days do I have? I can now interact in

0:26:11.440 --> 0:26:14.879
<v Speaker 5>natural language with a system that can go and that

0:26:14.920 --> 0:26:17.320
<v Speaker 5>it has access to our HR policies, and I can

0:26:17.359 --> 0:26:20.120
<v Speaker 5>actually have a you know, a multi turn conversation where

0:26:20.119 --> 0:26:22.600
<v Speaker 5>I can, you know, like I would have with you know, somebody,

0:26:22.760 --> 0:26:27.800
<v Speaker 5>you know, actual HR professional or customer service representative. So

0:26:28.240 --> 0:26:31.320
<v Speaker 5>a big part, you know, of what this is doing

0:26:31.440 --> 0:26:33.960
<v Speaker 5>is it's it's putting an interface. You know, when we

0:26:33.960 --> 0:26:37.600
<v Speaker 5>think of computer interfaces, we're usually thinking about UI user

0:26:37.640 --> 0:26:40.600
<v Speaker 5>interface elements where I click on menus and there's buttons

0:26:40.640 --> 0:26:44.520
<v Speaker 5>and all this stuff. Increasingly, now we can just talk,

0:26:44.680 --> 0:26:46.879
<v Speaker 5>you know, you just in words. You can describe what

0:26:46.920 --> 0:26:49.600
<v Speaker 5>you want, you want to answer, ask a question, you

0:26:49.640 --> 0:26:51.840
<v Speaker 5>want to sort of command the system to do something,

0:26:52.640 --> 0:26:54.720
<v Speaker 5>rather than having to learn how to do that clicking buttons,

0:26:54.760 --> 0:26:56.520
<v Speaker 5>which might be inefficient, Now we can just sort of

0:26:56.800 --> 0:26:57.400
<v Speaker 5>spell it out.

0:26:57.960 --> 0:27:00.960
<v Speaker 3>Interesting, right, the graphical user interface that we all sort

0:27:00.960 --> 0:27:04.280
<v Speaker 3>of default to, that's not like the state of nature, Right,

0:27:04.359 --> 0:27:06.879
<v Speaker 3>that's a thing that was invented and just came to

0:27:06.920 --> 0:27:09.320
<v Speaker 3>be the standard way that we interact with computers. And

0:27:09.359 --> 0:27:13.800
<v Speaker 3>so you could imagine, as you're saying, like chat essentially

0:27:14.000 --> 0:27:17.240
<v Speaker 3>chatting with the machine could could become a sort of

0:27:17.320 --> 0:27:20.560
<v Speaker 3>standard user interface, just like the graphical user interface, did

0:27:20.760 --> 0:27:22.160
<v Speaker 3>you know over the past several decades.

0:27:22.600 --> 0:27:26.040
<v Speaker 5>Absolutely, And I think those kinds of conversational interfaces are

0:27:26.040 --> 0:27:30.280
<v Speaker 5>going to be hugely important for increasing our productivity. It's

0:27:30.280 --> 0:27:32.159
<v Speaker 5>just a lot easier if I if I have to

0:27:32.200 --> 0:27:33.879
<v Speaker 5>learn how to use a tool, or I don't have

0:27:33.880 --> 0:27:36.960
<v Speaker 5>to kind of have awkward, you know, interactions from the computer.

0:27:36.960 --> 0:27:38.159
<v Speaker 5>I can just tell it what I want and I

0:27:38.200 --> 0:27:41.240
<v Speaker 5>can understand it. Could you know, potentially even ask questions

0:27:41.240 --> 0:27:45.200
<v Speaker 5>back to clarify and have those kinds of conversations that

0:27:45.240 --> 0:27:48.320
<v Speaker 5>can be extremely powerful. And in fact, one area where

0:27:48.320 --> 0:27:51.000
<v Speaker 5>that's going to I think be absolutely game changing is

0:27:51.040 --> 0:27:55.200
<v Speaker 5>in code. When we write code. You know, programming languages

0:27:55.680 --> 0:27:59.320
<v Speaker 5>are a way for us to sort of match between

0:28:00.000 --> 0:28:03.439
<v Speaker 5>a very sloppy way of talking and the very exact

0:28:03.480 --> 0:28:05.360
<v Speaker 5>way that you need to command a computer to do

0:28:05.560 --> 0:28:08.360
<v Speaker 5>what you wanted to do. They're cumbersome to learn, they

0:28:08.359 --> 0:28:10.800
<v Speaker 5>can you know, create very complex systems that are very

0:28:10.800 --> 0:28:13.800
<v Speaker 5>hard to reason about. And we're already starting to see

0:28:14.119 --> 0:28:15.840
<v Speaker 5>the ability to just write down what you want and

0:28:16.160 --> 0:28:18.680
<v Speaker 5>AI will generate the code for you. And I think

0:28:18.680 --> 0:28:20.439
<v Speaker 5>we're just going to see a huge revolution of like

0:28:20.680 --> 0:28:22.800
<v Speaker 5>we just converse you and we can have a conversation

0:28:23.160 --> 0:28:25.280
<v Speaker 5>to say what we want, and then the computer can

0:28:25.320 --> 0:28:29.000
<v Speaker 5>actually not only do fixed actions and do things for us,

0:28:29.000 --> 0:28:31.359
<v Speaker 5>but it can actually even write code to do new things,

0:28:31.359 --> 0:28:35.000
<v Speaker 5>you know, and generate software itself. Given how much software

0:28:35.040 --> 0:28:37.480
<v Speaker 5>we have, how much craving we have for software, like

0:28:37.520 --> 0:28:41.240
<v Speaker 5>we'll never have enough software in our world. Uh, you know,

0:28:41.360 --> 0:28:44.520
<v Speaker 5>the ability to have AI systems as a helper in that,

0:28:45.120 --> 0:28:46.680
<v Speaker 5>I think we're going to see a lot of a

0:28:46.720 --> 0:28:47.520
<v Speaker 5>lot of value there.

0:28:48.720 --> 0:28:51.360
<v Speaker 3>So if you if you think about the different ways

0:28:52.000 --> 0:28:54.200
<v Speaker 3>AI might be applied to business, I mean you've talked

0:28:54.200 --> 0:28:56.520
<v Speaker 3>about a number of the sort of classic use cases.

0:28:57.240 --> 0:29:00.600
<v Speaker 3>What are some of the more out there use cases.

0:29:00.640 --> 0:29:03.520
<v Speaker 3>What are some you know, unique ways you could imagine

0:29:03.560 --> 0:29:05.320
<v Speaker 3>AI being applied to business.

0:29:06.960 --> 0:29:09.520
<v Speaker 5>You know, there's really disguised the limit. I mean, we

0:29:09.600 --> 0:29:11.520
<v Speaker 5>have one project that I'm kind of a fan of

0:29:11.600 --> 0:29:15.760
<v Speaker 5>where we actually were working with a mechanical engineering professor

0:29:15.920 --> 0:29:18.720
<v Speaker 5>at MIT working on a classic problem, how do you

0:29:18.800 --> 0:29:22.200
<v Speaker 5>build linkage systems which are like you imagine bars and

0:29:22.480 --> 0:29:24.800
<v Speaker 5>joints and overs.

0:29:24.480 --> 0:29:26.800
<v Speaker 3>You know, the things that are building a thing, building

0:29:26.800 --> 0:29:28.640
<v Speaker 3>a physical machine of some kind.

0:29:29.240 --> 0:29:34.400
<v Speaker 5>Like real like metal and you know nineteenth century just

0:29:34.600 --> 0:29:37.520
<v Speaker 5>old school industrial revolution. Yeah yeah, yeah, but you know

0:29:37.560 --> 0:29:40.320
<v Speaker 5>the little arm that's that's holding up my microphone in

0:29:40.360 --> 0:29:42.800
<v Speaker 5>front of me. Cranes, get build your buildings, you know,

0:29:42.880 --> 0:29:45.400
<v Speaker 5>parts of your engines. This is like classical stuff. It

0:29:45.440 --> 0:29:47.720
<v Speaker 5>turns out that you know, humans, if you want to

0:29:47.720 --> 0:29:50.920
<v Speaker 5>build an advanced system, you decide what like curve you

0:29:50.960 --> 0:29:53.600
<v Speaker 5>want to create, and then a human together with a

0:29:53.640 --> 0:29:57.400
<v Speaker 5>computer program, can build a five or six bar linkage

0:29:57.560 --> 0:29:58.960
<v Speaker 5>and then that's kind of where you top out it

0:29:59.040 --> 0:30:01.920
<v Speaker 5>because it gets too comp replicated to work more than that.

0:30:02.560 --> 0:30:05.080
<v Speaker 5>We built a generative AI system that can build twenty

0:30:05.080 --> 0:30:08.440
<v Speaker 5>bar linkages, like arbitrarily complex. So these are machines that

0:30:08.480 --> 0:30:12.840
<v Speaker 5>are beyond the capability of a human to design themselves.

0:30:13.360 --> 0:30:16.320
<v Speaker 5>Another example, we have an AI system that can generate

0:30:16.480 --> 0:30:18.880
<v Speaker 5>electronic circuits. You know, we had a project where we're

0:30:18.880 --> 0:30:22.000
<v Speaker 5>working where we're building better power converters which allow our

0:30:22.800 --> 0:30:25.920
<v Speaker 5>computers and our devices to be more efficient, save energy,

0:30:26.720 --> 0:30:29.680
<v Speaker 5>you know, less less carbone. But I think the world

0:30:29.720 --> 0:30:32.640
<v Speaker 5>around us has always been shaped by technology. If we

0:30:32.680 --> 0:30:34.959
<v Speaker 5>look around, you know, just think about how many steps

0:30:35.000 --> 0:30:37.280
<v Speaker 5>and how many people and how many designs went into

0:30:37.320 --> 0:30:41.120
<v Speaker 5>the table and the chair and the lamp. It's really

0:30:41.160 --> 0:30:44.520
<v Speaker 5>just astonishing. And that's already you know, the fruit of

0:30:45.440 --> 0:30:47.800
<v Speaker 5>automation and computers and those kinds of tools. But we're

0:30:47.800 --> 0:30:51.120
<v Speaker 5>going to see that increasingly be product also of AI.

0:30:51.200 --> 0:30:53.240
<v Speaker 5>It's just going to be everywhere around us. Everything we

0:30:53.400 --> 0:30:56.560
<v Speaker 5>touch is going to have been helped in some way

0:30:56.640 --> 0:30:58.240
<v Speaker 5>to get to you by.

0:30:58.080 --> 0:31:01.560
<v Speaker 3>A you know, that is a pretty profound transformation that

0:31:01.600 --> 0:31:04.400
<v Speaker 3>you're talking about in business. How do you think about

0:31:04.400 --> 0:31:07.600
<v Speaker 3>the implications of that both for the sort of you know,

0:31:07.880 --> 0:31:11.080
<v Speaker 3>business itself and also for employees.

0:31:12.760 --> 0:31:16.880
<v Speaker 5>Yeah, so I think for businesses this is gonna cut costs,

0:31:17.160 --> 0:31:21.040
<v Speaker 5>make new opportunities to like customers, you know, like there's

0:31:21.080 --> 0:31:23.880
<v Speaker 5>just you know, it's sort of all upside right like

0:31:24.720 --> 0:31:26.720
<v Speaker 5>for the for the workers, I think the story is

0:31:26.760 --> 0:31:29.720
<v Speaker 5>mostly good too. You know, like how many things do

0:31:29.760 --> 0:31:33.239
<v Speaker 5>you do in your day that you'd really rather not?

0:31:33.440 --> 0:31:33.560
<v Speaker 2>Right?

0:31:34.080 --> 0:31:36.040
<v Speaker 5>You know, and we're used to having things we don't

0:31:36.160 --> 0:31:39.440
<v Speaker 5>like automated away. You know, we we didn't. You know,

0:31:39.440 --> 0:31:42.040
<v Speaker 5>if you didn't like walking many miles to work, then

0:31:42.080 --> 0:31:43.840
<v Speaker 5>you know, like you can have a car and you

0:31:43.840 --> 0:31:46.320
<v Speaker 5>can drive there. Or we used to have a huge

0:31:46.320 --> 0:31:49.720
<v Speaker 5>fraction over ninety percent of the US population engaged in agriculture,

0:31:49.800 --> 0:31:52.400
<v Speaker 5>and then we mechanized it. How very few people work

0:31:52.440 --> 0:31:54.360
<v Speaker 5>in agriculture, a small number of people can do the

0:31:54.360 --> 0:31:56.760
<v Speaker 5>work of a large number of people. And then you know,

0:31:56.880 --> 0:31:59.960
<v Speaker 5>things like email, and yeah, they've led to huge productor

0:32:00.160 --> 0:32:02.640
<v Speaker 5>the enhancements because I don't need to be writing letters

0:32:02.680 --> 0:32:04.720
<v Speaker 5>and sending them in the mail. I can just instantly

0:32:04.720 --> 0:32:09.000
<v Speaker 5>communicate with people. We just become more effective, Like our

0:32:09.080 --> 0:32:13.480
<v Speaker 5>jobs have transformed, whether it's a physical job like agriculture

0:32:13.560 --> 0:32:16.040
<v Speaker 5>or whether it's a knowledge worker job where you're sending

0:32:16.080 --> 0:32:19.960
<v Speaker 5>emails and communicating with people and coordinating teams. We've just

0:32:20.000 --> 0:32:22.680
<v Speaker 5>gotten better. And you know, the technology has just made

0:32:22.720 --> 0:32:25.800
<v Speaker 5>us more productive. And this is just another example. Now,

0:32:26.080 --> 0:32:28.200
<v Speaker 5>you know, there are people who worry that you know,

0:32:28.880 --> 0:32:31.320
<v Speaker 5>will be so good at that that maybe jobs will

0:32:31.320 --> 0:32:35.200
<v Speaker 5>be displaced, and that's a legitimate concern. But just like

0:32:36.560 --> 0:32:38.720
<v Speaker 5>how in agriculture, you know, it's not like suddenly we

0:32:38.800 --> 0:32:41.880
<v Speaker 5>had ninety percent of the population unemployed. You know, people

0:32:41.880 --> 0:32:46.040
<v Speaker 5>transitioned to other jobs. And the other thing that we

0:32:46.160 --> 0:32:50.160
<v Speaker 5>found too, is that our appetite for doing more things

0:32:50.800 --> 0:32:54.120
<v Speaker 5>is as humans is sort of insatiable. So even if

0:32:54.480 --> 0:32:57.680
<v Speaker 5>we can dramatically increase how much one human can do,

0:32:58.480 --> 0:33:00.720
<v Speaker 5>that doesn't necessarily mean we're going to do fixed amount

0:33:00.760 --> 0:33:02.960
<v Speaker 5>of stuff. There's an appetite to have even more, so

0:33:02.960 --> 0:33:05.200
<v Speaker 5>we're going to you can continue to grow, grow the pie.

0:33:05.640 --> 0:33:08.040
<v Speaker 5>So I think at least certainly in the near term,

0:33:08.280 --> 0:33:09.640
<v Speaker 5>you know, we're going to see a lot of drudgery

0:33:09.680 --> 0:33:12.400
<v Speaker 5>go away from work. We're going to see people be

0:33:12.400 --> 0:33:15.520
<v Speaker 5>able to be more effective at their jobs. You know,

0:33:15.520 --> 0:33:19.440
<v Speaker 5>we will see some transformation in jobs and like we've

0:33:19.480 --> 0:33:24.520
<v Speaker 5>seen that before, and the technology least has the potential

0:33:24.720 --> 0:33:26.040
<v Speaker 5>to make our lives a lot easier.

0:33:27.280 --> 0:33:32.280
<v Speaker 3>So IBM recently launched Watson X, which includes Watson X

0:33:32.360 --> 0:33:35.320
<v Speaker 3>dot AI. Tell me about that, Tell me about you

0:33:35.320 --> 0:33:37.400
<v Speaker 3>know what it is and the new possibilities that it

0:33:37.440 --> 0:33:38.000
<v Speaker 3>opens up.

0:33:38.920 --> 0:33:42.640
<v Speaker 5>Yeah. So so Watson X is obviously a bit of

0:33:43.160 --> 0:33:46.120
<v Speaker 5>a new branding on the Watson brand.

0:33:46.760 --> 0:33:46.960
<v Speaker 2>T J.

0:33:47.120 --> 0:33:50.720
<v Speaker 5>Watson that was the founder of IBM and our EI

0:33:50.800 --> 0:33:54.640
<v Speaker 5>technologies have had the Watson brand. Watson X is a

0:33:54.680 --> 0:33:58.280
<v Speaker 5>recognition that that there's something new, there's something that actually

0:33:58.280 --> 0:33:59.200
<v Speaker 5>has changed the game.

0:33:59.440 --> 0:33:59.560
<v Speaker 3>You know.

0:33:59.800 --> 0:34:03.480
<v Speaker 5>We gone from this old world of automation is to

0:34:03.600 --> 0:34:07.160
<v Speaker 5>labor intensive to this new world of possibilities where it's

0:34:07.240 --> 0:34:11.840
<v Speaker 5>much easier to use AI. And what watsonex does it

0:34:11.880 --> 0:34:16.799
<v Speaker 5>brings together tools for businesses to harness that power. So

0:34:17.120 --> 0:34:21.439
<v Speaker 5>whatsonex dot AI foundation models that our customers can use.

0:34:21.560 --> 0:34:24.600
<v Speaker 5>It includes tools that make it easy to run, easy

0:34:24.680 --> 0:34:29.040
<v Speaker 5>to deploy, easy to experiment. There's a watsonex dot Data

0:34:29.360 --> 0:34:32.800
<v Speaker 5>component which allows you to sort of organize and access

0:34:32.840 --> 0:34:34.920
<v Speaker 5>to your data. So what we're really trying to do

0:34:35.000 --> 0:34:39.919
<v Speaker 5>is give our customers a cohesive set of tools to

0:34:39.960 --> 0:34:43.200
<v Speaker 5>harness the value of these technologies and at the same

0:34:43.239 --> 0:34:46.239
<v Speaker 5>time be able to manage the risks and other things

0:34:46.280 --> 0:34:48.160
<v Speaker 5>that you have to keep an eye on in an

0:34:48.280 --> 0:34:49.239
<v Speaker 5>enterprise context.

0:34:50.880 --> 0:34:53.600
<v Speaker 3>So we talk about the guests on this show as

0:34:54.080 --> 0:34:58.200
<v Speaker 3>new creators, by which we mean people who are creatively

0:34:58.239 --> 0:35:03.120
<v Speaker 3>applying technology in businesiness to drive change. And I'm curious

0:35:03.640 --> 0:35:08.319
<v Speaker 3>how creativity plays a role in the research that you do.

0:35:08.920 --> 0:35:13.520
<v Speaker 5>I Honestly, I think the creative aspects of this job,

0:35:13.960 --> 0:35:17.279
<v Speaker 5>this is what makes this work exciting. You know, I

0:35:17.280 --> 0:35:19.200
<v Speaker 5>should say, you know, the folks who work in my

0:35:19.320 --> 0:35:24.160
<v Speaker 5>organization are doing the creating, and I guess you're.

0:35:24.000 --> 0:35:26.480
<v Speaker 3>Doing the managing so that they could do the creator.

0:35:27.400 --> 0:35:30.799
<v Speaker 5>I'm helping them be their best, and I still get

0:35:30.840 --> 0:35:33.719
<v Speaker 5>to get involved in the weeds of the research as

0:35:33.800 --> 0:35:36.560
<v Speaker 5>much as I can. But you know, there's something really

0:35:36.560 --> 0:35:40.480
<v Speaker 5>exciting about inventing, you know, like one of the nice

0:35:40.480 --> 0:35:44.680
<v Speaker 5>things about doing invention and doing research on AI. In industries,

0:35:45.040 --> 0:35:47.960
<v Speaker 5>it's usually grounded and a real problem that somebody is having.

0:35:48.040 --> 0:35:50.680
<v Speaker 5>You know, a customer wants to solve this problem. It's

0:35:51.239 --> 0:35:54.239
<v Speaker 5>losing money, or there wuld be a new opportunity. You

0:35:54.280 --> 0:35:58.799
<v Speaker 5>identify that problem and then you build something that's never

0:35:58.840 --> 0:36:01.120
<v Speaker 5>been built before to do that. And I think that's

0:36:01.400 --> 0:36:05.279
<v Speaker 5>honestly the adrenaline rush that keeps all of us in

0:36:05.320 --> 0:36:07.880
<v Speaker 5>this field. How do you do something that nobody else

0:36:08.120 --> 0:36:11.520
<v Speaker 5>on earth has has done before or tried before, so

0:36:11.560 --> 0:36:14.880
<v Speaker 5>that that kind of creativity and there's also creativity as

0:36:14.920 --> 0:36:17.680
<v Speaker 5>well and identifying what those problems are, being able to

0:36:17.760 --> 0:36:24.120
<v Speaker 5>understand the places where the technology is close enough to

0:36:24.400 --> 0:36:28.400
<v Speaker 5>solving a problem and doing that matchmaking between problems that

0:36:28.440 --> 0:36:31.120
<v Speaker 5>are now solvable, you know, and an AI where the

0:36:31.120 --> 0:36:34.759
<v Speaker 5>field is moving so fast, this is constantly growing horizon

0:36:35.480 --> 0:36:37.600
<v Speaker 5>of things that we might be able to solve. So

0:36:37.719 --> 0:36:41.920
<v Speaker 5>that matchmaking, I think, is also a really interesting creative problem.

0:36:42.120 --> 0:36:44.640
<v Speaker 5>So I think I think that's that's that's why it's

0:36:44.640 --> 0:36:47.359
<v Speaker 5>so much fun. And it's a fun environment we have

0:36:47.480 --> 0:36:50.520
<v Speaker 5>here too. It's you know, people drawing on whiteboards and

0:36:50.960 --> 0:36:54.960
<v Speaker 5>writing on pages of math and like in a movie,

0:36:55.400 --> 0:36:58.239
<v Speaker 5>like in a movie, yes, straight from sexual casting.

0:36:58.080 --> 0:37:00.320
<v Speaker 3>Drawing, the drawing on the window, writing on the window,

0:37:00.400 --> 0:37:06.160
<v Speaker 3>and sharp absolutely so, so let's close with the really

0:37:06.400 --> 0:37:12.080
<v Speaker 3>long view. How do you imagine AI and people working

0:37:12.120 --> 0:37:14.319
<v Speaker 3>together twenty years from now?

0:37:16.400 --> 0:37:21.040
<v Speaker 5>Yeah, it's really hard to make predictions. The vision that

0:37:21.600 --> 0:37:27.640
<v Speaker 5>I I like, actually this came from an MIT economist

0:37:27.760 --> 0:37:33.040
<v Speaker 5>named David Ottur, which was imagine AI almost as a

0:37:33.120 --> 0:37:37.880
<v Speaker 5>natural resource. You know, we know how natural resources work, right,

0:37:38.040 --> 0:37:39.719
<v Speaker 5>Like there's an or we can dig up out of

0:37:39.760 --> 0:37:42.320
<v Speaker 5>the earth that comes from kind of springs from the earth,

0:37:42.440 --> 0:37:45.400
<v Speaker 5>or we usually think of that in terms of physical stuff.

0:37:46.040 --> 0:37:47.640
<v Speaker 5>With AI, you can almost think of it as like

0:37:47.719 --> 0:37:50.719
<v Speaker 5>there's a new kind of abundance potentially twenty years from now,

0:37:50.800 --> 0:37:53.479
<v Speaker 5>where not only can we have things we can build

0:37:53.560 --> 0:37:56.160
<v Speaker 5>or eat or use or burn or whatever. Now we have,

0:37:56.400 --> 0:37:59.200
<v Speaker 5>you know, this ability to do things and understand things

0:37:59.200 --> 0:38:02.399
<v Speaker 5>and do intellectual work, and I think we can get

0:38:02.400 --> 0:38:06.719
<v Speaker 5>to a world where automating things is just seamless. We're

0:38:06.760 --> 0:38:11.759
<v Speaker 5>surrounded by capability to augment ourselves to get things done.

0:38:12.480 --> 0:38:15.239
<v Speaker 5>And you could think of that in terms of like, oh,

0:38:15.280 --> 0:38:17.399
<v Speaker 5>that's going to displace our jobs, because eventually the AI

0:38:17.480 --> 0:38:19.560
<v Speaker 5>system is going to do everything we can do. But

0:38:19.920 --> 0:38:22.120
<v Speaker 5>you could also think of it in terms of like, wow,

0:38:22.160 --> 0:38:24.480
<v Speaker 5>that's just so much abundance that we now have, and

0:38:24.520 --> 0:38:27.759
<v Speaker 5>really how we use that abundance is sort of up

0:38:27.800 --> 0:38:30.360
<v Speaker 5>to us, you know, like when you can writing software

0:38:30.440 --> 0:38:32.799
<v Speaker 5>is super easy and fast and anybody can do it.

0:38:33.200 --> 0:38:35.279
<v Speaker 5>Just think about all the things you can do now, like,

0:38:35.640 --> 0:38:37.719
<v Speaker 5>think about all the new activities and go about all

0:38:37.760 --> 0:38:39.880
<v Speaker 5>the ways we could use that to enrich our lives.

0:38:40.320 --> 0:38:43.359
<v Speaker 5>That's where I'd like to see us in twenty years.

0:38:43.400 --> 0:38:46.000
<v Speaker 5>You know, we can we can do just so much

0:38:46.160 --> 0:38:49.399
<v Speaker 5>more than we were able to do before abundance.

0:38:50.200 --> 0:38:53.040
<v Speaker 3>Great, thank you so much for your time.

0:38:53.800 --> 0:38:55.839
<v Speaker 5>Yeah, It's been a pleasure. Thanks for inviting me.

0:38:57.320 --> 0:39:01.400
<v Speaker 4>What a far ranging, deep conversation. I'm mesmerized by the

0:39:01.440 --> 0:39:05.360
<v Speaker 4>vision David just described. A world where natural conversation between

0:39:05.360 --> 0:39:09.960
<v Speaker 4>mankind and machine can generate creative solutions to our most

0:39:10.040 --> 0:39:13.799
<v Speaker 4>complex problems. A world where we view AI not as

0:39:13.880 --> 0:39:17.919
<v Speaker 4>our replacements, but as a powerful resource we can tap

0:39:17.960 --> 0:39:23.440
<v Speaker 4>into and exponentially boost our innovation and productivity. Thanks so

0:39:23.560 --> 0:39:26.920
<v Speaker 4>much to doctor David Cox for joining us on smart Talks.

0:39:27.360 --> 0:39:31.080
<v Speaker 4>We deeply appreciate him sharing his huge breadth of AI

0:39:31.160 --> 0:39:35.160
<v Speaker 4>knowledge with us and for explaining the transformative potential of

0:39:35.239 --> 0:39:38.600
<v Speaker 4>foundation models in a way that even I can understand.

0:39:39.200 --> 0:39:43.719
<v Speaker 4>We eagerly await his next great breakthrough. Smart Talks at

0:39:43.719 --> 0:39:48.239
<v Speaker 4>IBM is produced by Matt Romano, David jaw nishe Venkat

0:39:48.280 --> 0:39:52.720
<v Speaker 4>and Royston Preserve with Jacob Goldstein. We're edited by Lydia

0:39:52.760 --> 0:39:57.080
<v Speaker 4>Jean Kott. Our engineers are Jason Gambrel, Sarah Bouguer and

0:39:57.160 --> 0:40:01.799
<v Speaker 4>Ben Holliday. Theme song by Gramma's Scope. Special thanks to

0:40:01.880 --> 0:40:06.040
<v Speaker 4>Carli Megliori, Andy Kelly, Kathy Callahan and the Eight Bar

0:40:06.160 --> 0:40:10.200
<v Speaker 4>and IBM teams, as well as the Pushkin marketing team.

0:40:10.480 --> 0:40:13.800
<v Speaker 4>Smart Talks with IBM is a production of Pushkin Industries

0:40:14.040 --> 0:40:18.160
<v Speaker 4>and iHeartMedia. To find more Pushkin podcasts, listen on the

0:40:18.200 --> 0:40:23.359
<v Speaker 4>iHeartRadio app, Apple Podcasts, or wherever you listen to podcasts.

0:40:23.800 --> 0:40:40.840
<v Speaker 4>Him Malcolm Gladwell. This is a paid advertisement from IBM.