WEBVTT - Smart Talks with IBM: How open source can democratize AI

0:00:04.519 --> 0:00:12.719
<v Speaker 1>Welcome to Tech Stuff, a production from iHeartRadio. Today, we

0:00:12.760 --> 0:00:15.720
<v Speaker 1>are witnessed to one of those rare moments in history,

0:00:16.079 --> 0:00:19.319
<v Speaker 1>the rise of an innovative technology with the potential to

0:00:19.440 --> 0:00:24.160
<v Speaker 1>radically transform business in society forever. That technology, of course,

0:00:24.640 --> 0:00:28.200
<v Speaker 1>is artificial intelligence, and it's the central focus for this

0:00:28.360 --> 0:00:32.400
<v Speaker 1>new season of Smart Talks with IBM. Join hosts from

0:00:32.400 --> 0:00:36.159
<v Speaker 1>your favorite Pushkin podcasts as they talk with industry experts

0:00:36.159 --> 0:00:39.760
<v Speaker 1>and leaders to explore how businesses can integrate AI into

0:00:39.800 --> 0:00:43.120
<v Speaker 1>their workflows and help drive real change in this new

0:00:43.200 --> 0:00:46.879
<v Speaker 1>era of AI, and of course, host Malcolm Gladwell will

0:00:46.920 --> 0:00:49.200
<v Speaker 1>be there to guide you through the season and throw

0:00:49.320 --> 0:00:52.199
<v Speaker 1>in his two cents as well. Look out for new

0:00:52.240 --> 0:00:55.120
<v Speaker 1>episodes of Smart Talks with IBM every other week on

0:00:55.160 --> 0:00:59.400
<v Speaker 1>the iHeartRadio app, Apple Podcasts, wherever you get your podcasts,

0:00:59.600 --> 0:01:13.000
<v Speaker 1>and learn more at IBM dot com, slash smart Talks.

0:01:10.720 --> 0:01:19.880
<v Speaker 2>Pushkin Hello, Hello, Welcome to Smart Talks with IBM, a

0:01:19.959 --> 0:01:25.040
<v Speaker 2>podcast from Pushkin Industries, iHeartRadio and IBM. I'm Malcolm Gladwell.

0:01:25.560 --> 0:01:29.080
<v Speaker 2>This season, we're diving back into the world of artificial intelligence,

0:01:29.360 --> 0:01:32.720
<v Speaker 2>but with a focus on the powerful concept of open

0:01:33.120 --> 0:01:38.280
<v Speaker 2>its possibilities, implications, and misconceptions. We'll look at openness from

0:01:38.360 --> 0:01:41.360
<v Speaker 2>a variety of angles and explore how the concept is

0:01:41.440 --> 0:01:45.000
<v Speaker 2>already reshaping industries, ways of doing business, and a very

0:01:45.120 --> 0:01:48.880
<v Speaker 2>notion of what's possible. In today's episode, I sat down

0:01:48.880 --> 0:01:52.720
<v Speaker 2>with Mo Duffy, software engineering manager at red Hat, who

0:01:52.760 --> 0:01:56.440
<v Speaker 2>works on instruct Lab, a project co developed by red

0:01:56.440 --> 0:02:00.040
<v Speaker 2>Hat and IBM. Most shared with me how this a

0:02:00.080 --> 0:02:04.680
<v Speaker 2>new initiative, is revolutionizing AI training, making it not only

0:02:04.720 --> 0:02:09.400
<v Speaker 2>more accessible, but also more inclusive. This project, unique in

0:02:09.440 --> 0:02:14.280
<v Speaker 2>the industry, allows developers to submit incremental contributions to one

0:02:14.400 --> 0:02:18.760
<v Speaker 2>base AI model, creating a continuous loop of development, much

0:02:18.880 --> 0:02:24.200
<v Speaker 2>like normal open source software. By leveraging community contributions and

0:02:24.360 --> 0:02:27.560
<v Speaker 2>IBM's cutting edge granite models, Mo in the team of

0:02:27.600 --> 0:02:30.680
<v Speaker 2>ibmrs and red hatters are paving the way for a

0:02:30.720 --> 0:02:35.880
<v Speaker 2>future where AI development is a communal endeavor. Our insights

0:02:35.919 --> 0:02:40.400
<v Speaker 2>into open source software extend beyond technical proficiency to the

0:02:40.440 --> 0:02:44.800
<v Speaker 2>profound impact of collaborative effort. At the heart of Moe's

0:02:44.840 --> 0:02:49.400
<v Speaker 2>work is a belief in democratizing technology, ensuring that AI

0:02:49.480 --> 0:02:54.560
<v Speaker 2>becomes a tool accessible to all. So let's explore how Moe,

0:02:54.720 --> 0:02:58.680
<v Speaker 2>red Hat and IBM are empowering individuals and businesses alike

0:02:59.000 --> 0:03:03.400
<v Speaker 2>to reshape the fuel future of technology through collaboration and innovation.

0:03:08.440 --> 0:03:11.320
<v Speaker 2>We thank you for joining me today. Thank you so

0:03:11.400 --> 0:03:15.400
<v Speaker 2>much for I have just about the most Irish name ever.

0:03:16.240 --> 0:03:21.079
<v Speaker 2>I do very proud you weren't born in Ireland my grandparents,

0:03:21.160 --> 0:03:23.040
<v Speaker 2>Oh your grandparents, I see? Where did you grow up?

0:03:23.440 --> 0:03:24.480
<v Speaker 3>New York Queens?

0:03:24.800 --> 0:03:27.920
<v Speaker 2>Oh you're l a see. So tell me a little

0:03:27.919 --> 0:03:30.600
<v Speaker 2>bit about how how you got to red Hat. What

0:03:30.720 --> 0:03:31.320
<v Speaker 2>was your path?

0:03:31.800 --> 0:03:34.320
<v Speaker 3>When I was in high school? It was a chatty girl,

0:03:34.639 --> 0:03:37.160
<v Speaker 3>teenage girl on the phone. We had one phone line.

0:03:37.440 --> 0:03:40.280
<v Speaker 3>My older brother was studying at the local state college

0:03:40.280 --> 0:03:42.560
<v Speaker 3>computer science, and he had to tell that end to

0:03:42.880 --> 0:03:45.760
<v Speaker 3>compile his homework one phone line, and I'm on it

0:03:45.800 --> 0:03:48.760
<v Speaker 3>all the time. He got very frustrated and he needed

0:03:48.760 --> 0:03:51.320
<v Speaker 3>a compiler to do his homework. So he bought red

0:03:51.320 --> 0:03:55.760
<v Speaker 3>Hat Linux from a CompUSA, brought it home and that

0:03:55.880 --> 0:03:57.840
<v Speaker 3>was on the family computer. So I learned Linux and

0:03:57.840 --> 0:04:00.560
<v Speaker 3>I started playing around with it. I really liked it

0:04:00.640 --> 0:04:04.520
<v Speaker 3>because you could customize everything, like the entire user interface.

0:04:04.880 --> 0:04:07.040
<v Speaker 3>You could actually modify the code of the programs you

0:04:07.080 --> 0:04:09.320
<v Speaker 3>were using to do what you wanted. And for me,

0:04:09.400 --> 0:04:11.600
<v Speaker 3>It was really cool because especially when you're a kid

0:04:11.600 --> 0:04:13.360
<v Speaker 3>and like people tell you this is the way things

0:04:13.400 --> 0:04:15.120
<v Speaker 3>are and you just have to deal with it. It's

0:04:15.240 --> 0:04:16.760
<v Speaker 3>nice to be like I'm going to make things the

0:04:16.839 --> 0:04:20.120
<v Speaker 3>way I want, modify the code and playing. Yeah, it

0:04:20.200 --> 0:04:22.040
<v Speaker 3>was amazing and it was just such a time and

0:04:22.080 --> 0:04:25.080
<v Speaker 3>like before it was cool, I was doing it and

0:04:25.880 --> 0:04:27.839
<v Speaker 3>what I saw on that is sort of the potential

0:04:27.880 --> 0:04:30.920
<v Speaker 3>like number one of like a community of people working together.

0:04:30.920 --> 0:04:34.000
<v Speaker 3>And like the Internet existed, it was slow, it involved modems,

0:04:34.400 --> 0:04:36.320
<v Speaker 3>but there were people that you could talk to who

0:04:36.360 --> 0:04:40.120
<v Speaker 3>would give you tips and you'd share information, and this

0:04:40.240 --> 0:04:43.919
<v Speaker 3>collaborative building something together is really something special. Right. I

0:04:43.920 --> 0:04:47.760
<v Speaker 3>could file a complaint to whatever large software company made

0:04:47.760 --> 0:04:50.000
<v Speaker 3>whatever software I was into, or I could go to

0:04:50.000 --> 0:04:52.120
<v Speaker 3>an open source software community and be like, hey, guys,

0:04:52.160 --> 0:04:54.559
<v Speaker 3>I think we should do this, and like, yeah, okay,

0:04:54.560 --> 0:04:56.719
<v Speaker 3>I'll help, I'll pitch in. So you don't feel powerless,

0:04:56.720 --> 0:04:58.080
<v Speaker 3>you feel like you can have an impact, and that

0:04:58.160 --> 0:05:02.040
<v Speaker 3>was really exciting to me. However, open source software has

0:05:02.200 --> 0:05:05.440
<v Speaker 3>a reputation for not having the best user interface, not

0:05:05.480 --> 0:05:10.000
<v Speaker 3>the best user experience. So I ended up studying computer

0:05:10.040 --> 0:05:13.320
<v Speaker 3>science and Electronic Media dual major, and then I did

0:05:13.360 --> 0:05:17.280
<v Speaker 3>Human computeraction as my master's and my thought was, wouldn't

0:05:17.279 --> 0:05:20.960
<v Speaker 3>it be nice if this free software accessible to anybody,

0:05:21.279 --> 0:05:23.320
<v Speaker 3>if it was easier to use, some more people could

0:05:23.360 --> 0:05:26.000
<v Speaker 3>use it and take advantage of it. And so, long

0:05:26.040 --> 0:05:29.320
<v Speaker 3>story short, I ended up going to red Hat saying, Hey,

0:05:29.320 --> 0:05:30.720
<v Speaker 3>I want to learn how you guys work. Let me

0:05:30.760 --> 0:05:33.480
<v Speaker 3>embed in your team draft out of my graduate program.

0:05:33.600 --> 0:05:35.359
<v Speaker 3>And I'm like, I want to do this for a living.

0:05:35.480 --> 0:05:38.080
<v Speaker 3>This is cooler. So I thought this is the way

0:05:38.120 --> 0:05:39.719
<v Speaker 3>to go, and I've been there ever since. They haven't

0:05:39.760 --> 0:05:40.599
<v Speaker 3>been able to get rid of me.

0:05:42.640 --> 0:05:45.400
<v Speaker 2>To backtrack this a little bit, you were talking about

0:05:45.400 --> 0:05:49.120
<v Speaker 2>the sense of community that surrounds this way of thinking

0:05:49.160 --> 0:05:52.880
<v Speaker 2>about software. Talk a little bit more about what that

0:05:52.920 --> 0:05:55.680
<v Speaker 2>community is like, the benefits of that community, why it

0:05:55.760 --> 0:05:56.560
<v Speaker 2>appeals to you.

0:05:57.040 --> 0:05:59.359
<v Speaker 3>Sure, well, you know part of the reason I actually

0:05:59.440 --> 0:06:02.400
<v Speaker 3>ended up going going to the graduate school track. Suddenly

0:06:02.440 --> 0:06:05.560
<v Speaker 3>you're a peer of your professors and you're working side

0:06:05.600 --> 0:06:08.680
<v Speaker 3>by side with them. At some point they retire and

0:06:08.720 --> 0:06:12.320
<v Speaker 3>you're in the next generation. So it's sharing information, building

0:06:12.360 --> 0:06:14.920
<v Speaker 3>on the work of others in sort of this cycle

0:06:15.040 --> 0:06:19.520
<v Speaker 3>that extends past the human lifespan and in the same way,

0:06:19.640 --> 0:06:22.760
<v Speaker 3>Like the open source model is very similar, but you're

0:06:22.800 --> 0:06:25.720
<v Speaker 3>actually you're building something, and it's something in me. I'm

0:06:25.800 --> 0:06:28.640
<v Speaker 3>just really attracted. Like I don't like talking about stuff.

0:06:28.680 --> 0:06:32.520
<v Speaker 3>I like doing stuff with open source software. The software

0:06:32.520 --> 0:06:36.080
<v Speaker 3>doesn't cost anything, the code is out there, generally uses

0:06:36.160 --> 0:06:39.480
<v Speaker 3>open standards for the file formats. I can open up

0:06:39.480 --> 0:06:42.320
<v Speaker 3>files that I created and open source tools as a

0:06:42.360 --> 0:06:46.040
<v Speaker 3>high school student today because they were using open formats

0:06:46.040 --> 0:06:48.320
<v Speaker 3>and that software still exists. I can still compile the

0:06:48.320 --> 0:06:51.679
<v Speaker 3>code and it's an active community project. Like these things

0:06:51.720 --> 0:06:54.479
<v Speaker 3>can outlast any single company in the same way that

0:06:54.560 --> 0:06:56.920
<v Speaker 3>the academic community has been going on for so many

0:06:57.000 --> 0:06:59.680
<v Speaker 3>years and hopefully we'll continue moving on. So it's sort

0:06:59.680 --> 0:07:02.599
<v Speaker 3>of like not just the community around it, but just

0:07:02.680 --> 0:07:05.400
<v Speaker 3>the knowledge sharing and also bringing up the next generation

0:07:05.480 --> 0:07:07.520
<v Speaker 3>as well. Like all of that stuff really appealed to me.

0:07:07.560 --> 0:07:10.240
<v Speaker 3>And also at the center of it, the fact that

0:07:10.720 --> 0:07:13.800
<v Speaker 3>we could democratize it by following this open source process

0:07:14.160 --> 0:07:16.160
<v Speaker 3>and feel like we have some control, We're not at

0:07:16.200 --> 0:07:19.240
<v Speaker 3>the mercy of some faceless corporation making changes and we

0:07:19.280 --> 0:07:21.360
<v Speaker 3>have no impact. Like that really appealed to me too.

0:07:21.440 --> 0:07:26.240
<v Speaker 2>Yeah, for those of us who are not software phishonados,

0:07:27.040 --> 0:07:30.160
<v Speaker 2>take a step backwards and give me a kind of

0:07:31.160 --> 0:07:34.640
<v Speaker 2>description of terms. What's the opposite of open source proprietary?

0:07:34.720 --> 0:07:36.200
<v Speaker 3>Proprietary is what we say, So.

0:07:37.040 --> 0:07:40.840
<v Speaker 2>Specifically and practically, the difference would be what between something

0:07:40.840 --> 0:07:42.680
<v Speaker 2>that was opens us in something that was proprietary.

0:07:42.760 --> 0:07:45.360
<v Speaker 3>Sure, so there's a lot of difference. So with open

0:07:45.400 --> 0:07:49.360
<v Speaker 3>source software you get these rights when you're given the software,

0:07:49.440 --> 0:07:51.840
<v Speaker 3>you get the right to be able to share it.

0:07:51.920 --> 0:07:54.320
<v Speaker 3>And depending on the lot, different licenses that are considered

0:07:54.360 --> 0:07:57.360
<v Speaker 3>open source have different little things that you have to

0:07:57.360 --> 0:08:03.160
<v Speaker 3>be aware of. With proprietary code, it's one copyright the company.

0:08:03.200 --> 0:08:05.560
<v Speaker 3>Even a lot of times, when you sign your employment

0:08:05.560 --> 0:08:07.880
<v Speaker 3>contract for a software company and you write code for them,

0:08:08.200 --> 0:08:09.840
<v Speaker 3>you don't own it. You sign over your rights to

0:08:09.840 --> 0:08:11.760
<v Speaker 3>the company. So if you leave the company, the code

0:08:11.800 --> 0:08:14.120
<v Speaker 3>doesn't go with you. It stays in the ownership of

0:08:14.120 --> 0:08:16.760
<v Speaker 3>that company. So then one like one company buys out

0:08:16.800 --> 0:08:18.920
<v Speaker 3>another and kills a product, that code's gone.

0:08:19.040 --> 0:08:23.000
<v Speaker 2>It's gone. For a business, why would a business want

0:08:23.080 --> 0:08:26.880
<v Speaker 2>to be have open source code as opposed to proprietary.

0:08:26.240 --> 0:08:28.320
<v Speaker 3>Well, for the same reasons, Like, say you're a business,

0:08:28.800 --> 0:08:32.680
<v Speaker 3>You've invested all this money into this software platform right,

0:08:33.160 --> 0:08:36.280
<v Speaker 3>and you've upskilled your employees on it and it's a

0:08:36.320 --> 0:08:39.120
<v Speaker 3>core part of your business, and then a few years

0:08:39.200 --> 0:08:42.520
<v Speaker 3>later that company goes out of business or something happens,

0:08:42.679 --> 0:08:46.040
<v Speaker 3>or even something less drastic. You really need this future,

0:08:46.360 --> 0:08:49.480
<v Speaker 3>but for the company that makes the software, it's not

0:08:49.480 --> 0:08:52.320
<v Speaker 3>in their best interests. It's not worth the investment. They're

0:08:52.360 --> 0:08:54.280
<v Speaker 3>not going to do it. How do you get that future?

0:08:54.600 --> 0:08:57.160
<v Speaker 3>You either have to completely migrate to another solution, and

0:08:57.200 --> 0:08:59.120
<v Speaker 3>this is something that's core at your business that's going

0:08:59.160 --> 0:09:02.040
<v Speaker 3>to be a big deal to But if it's open source,

0:09:02.559 --> 0:09:05.640
<v Speaker 3>you could either hire a team of experts. You could

0:09:05.720 --> 0:09:08.160
<v Speaker 3>hire software engineers who are able to go do this

0:09:08.360 --> 0:09:12.200
<v Speaker 3>for you. Go in the upstream software community, implement the

0:09:12.240 --> 0:09:15.080
<v Speaker 3>feature that you want, and it'll be rolled into the

0:09:15.120 --> 0:09:17.520
<v Speaker 3>next version of that company software. So even if that

0:09:17.600 --> 0:09:20.800
<v Speaker 3>company didn't want to implement the future, if they did

0:09:20.800 --> 0:09:23.760
<v Speaker 3>it open source, they would inherit that feature from the

0:09:23.880 --> 0:09:26.400
<v Speaker 3>upstream community, is what we call it, So you have

0:09:26.480 --> 0:09:28.880
<v Speaker 3>some control over the situation. If it's open source, you

0:09:28.920 --> 0:09:32.960
<v Speaker 3>have an opportunity to actually affect change in the product,

0:09:33.200 --> 0:09:35.240
<v Speaker 3>and you could then pick it up or pay somebody

0:09:35.240 --> 0:09:37.360
<v Speaker 3>else to pick it up, or another company could form

0:09:37.400 --> 0:09:39.960
<v Speaker 3>and pick it up and keep it going, So there's

0:09:40.000 --> 0:09:42.440
<v Speaker 3>more possibilities. If it's open source, it's more like it's

0:09:42.480 --> 0:09:44.079
<v Speaker 3>like an insurance policy almost.

0:09:44.120 --> 0:09:47.680
<v Speaker 2>So innovation from the standpoint of the customer, innovation is

0:09:47.679 --> 0:09:50.679
<v Speaker 2>a lot easier when you're working in an open source environment.

0:09:50.880 --> 0:09:51.520
<v Speaker 3>Absolutely.

0:09:51.920 --> 0:09:55.680
<v Speaker 2>Yeah. So now at RedHat, you're working with something called

0:09:55.679 --> 0:09:58.440
<v Speaker 2>instruct lab. Tell us a little bit about what that is.

0:09:58.880 --> 0:10:01.040
<v Speaker 3>So the thing that really excites to me about getting

0:10:01.040 --> 0:10:03.400
<v Speaker 3>to work on this project is AI is sort of

0:10:03.440 --> 0:10:06.600
<v Speaker 3>that has been this scary thing for me because it's

0:10:06.600 --> 0:10:09.760
<v Speaker 3>one of those things like in order to be able

0:10:09.800 --> 0:10:15.040
<v Speaker 3>to pre train a model, you have to have unobtainium GPUs,

0:10:15.760 --> 0:10:20.040
<v Speaker 3>you have to have rich resources. It takes months, it

0:10:20.080 --> 0:10:25.040
<v Speaker 3>takes expertise. There's a small handful of companies that can

0:10:25.080 --> 0:10:29.079
<v Speaker 3>build a model from pre train to something usable, and

0:10:29.320 --> 0:10:31.640
<v Speaker 3>it kind of feels like those early days when I

0:10:31.760 --> 0:10:34.480
<v Speaker 3>was kind of delving in software and the same way.

0:10:34.520 --> 0:10:37.840
<v Speaker 3>I think if more people could contribute to AI models,

0:10:38.640 --> 0:10:42.240
<v Speaker 3>then it wouldn't be just influenced by whichever company had

0:10:42.280 --> 0:10:45.480
<v Speaker 3>the resources to build it. And there's been a lot

0:10:45.520 --> 0:10:48.880
<v Speaker 3>of emphasis on pre training models, so taking massive terabytes

0:10:49.000 --> 0:10:52.959
<v Speaker 3>data sets, throwing them through masses of GPUs over months

0:10:52.960 --> 0:10:56.240
<v Speaker 3>of time, spending hundreds of millions of dollars to build

0:10:56.320 --> 0:10:59.160
<v Speaker 3>a base model. But when instruct lab does is say, okay,

0:10:59.360 --> 0:11:02.280
<v Speaker 3>you have a base model. We're going to fine tune in.

0:11:02.320 --> 0:11:05.760
<v Speaker 3>On the other end, it takes less compute resources. The

0:11:05.800 --> 0:11:08.040
<v Speaker 3>way we've built in struck lab, you can play around

0:11:08.040 --> 0:11:10.600
<v Speaker 3>with the technology and learn it on it off the

0:11:10.600 --> 0:11:14.040
<v Speaker 3>shelf laptop that you can actually buy. So in this

0:11:14.160 --> 0:11:17.880
<v Speaker 3>way we're enabling a much broader set of people to

0:11:18.000 --> 0:11:20.679
<v Speaker 3>play with AI, to contribute it, to modify it. And

0:11:20.720 --> 0:11:23.680
<v Speaker 3>I'll tell you one story from red Hat. Succi, who

0:11:23.760 --> 0:11:28.199
<v Speaker 3>is our chief diversity officer, very interested in inclusive language

0:11:28.200 --> 0:11:31.200
<v Speaker 3>and open source software, doesn't have any experience with AI.

0:11:31.880 --> 0:11:33.920
<v Speaker 3>We have a community model that we have an upstream

0:11:33.920 --> 0:11:36.520
<v Speaker 3>project around for people to contribute knowledge and skills to

0:11:36.559 --> 0:11:38.680
<v Speaker 3>the model. She's like, I want to teach the model

0:11:39.000 --> 0:11:41.920
<v Speaker 3>how to use inclusive language, like replace this word with

0:11:42.000 --> 0:11:43.600
<v Speaker 3>this word or this word with this word. Oh my,

0:11:43.640 --> 0:11:46.520
<v Speaker 3>oh that's so cool. So she paired up with Nicholas

0:11:46.559 --> 0:11:49.079
<v Speaker 3>who is a technical guy at red Hat, and they

0:11:49.240 --> 0:11:52.560
<v Speaker 3>built and submitted a skill to the model that you

0:11:52.600 --> 0:11:54.440
<v Speaker 3>can just tell the model, can you please take this

0:11:54.559 --> 0:11:57.440
<v Speaker 3>document and translate this language to more inclusive language, and

0:11:57.480 --> 0:11:59.920
<v Speaker 3>it will do it. And they submitted it to the community.

0:12:00.080 --> 0:12:01.760
<v Speaker 3>They were so proud. It was like, that's the kind

0:12:01.800 --> 0:12:04.200
<v Speaker 3>of thing that, like, you know, maybe a company would

0:12:04.240 --> 0:12:06.680
<v Speaker 3>be incentivized to do that, but if you have some

0:12:06.800 --> 0:12:10.400
<v Speaker 3>tooling that's open source and something that anybody could access,

0:12:10.559 --> 0:12:13.120
<v Speaker 3>then those communities could actually get together and build that

0:12:13.240 --> 0:12:14.480
<v Speaker 3>knowledge into AI models.

0:12:14.720 --> 0:12:19.239
<v Speaker 2>Just so understand, what you guys have is the structure

0:12:19.320 --> 0:12:23.600
<v Speaker 2>for an AI system, and in other cases, individual companies

0:12:23.760 --> 0:12:27.720
<v Speaker 2>own and train their own AI systems. It takes enormous

0:12:27.720 --> 0:12:30.720
<v Speaker 2>amount of resources. They hoover up all kinds of information,

0:12:31.320 --> 0:12:34.480
<v Speaker 2>train it according to their own hidden set of rules,

0:12:34.559 --> 0:12:39.000
<v Speaker 2>and then a customer might use that for some price.

0:12:39.360 --> 0:12:41.280
<v Speaker 2>What you're saying is, in the same way that we

0:12:41.360 --> 0:12:45.200
<v Speaker 2>democratize the writing of software before, let's democratize the training

0:12:45.240 --> 0:12:48.880
<v Speaker 2>of an AI system. So anyone can contribute here and

0:12:49.320 --> 0:12:53.000
<v Speaker 2>teach the model the things that they're interested in teaching

0:12:53.000 --> 0:12:55.920
<v Speaker 2>the model. I'm guessing correct me. On the one hand,

0:12:56.400 --> 0:12:58.520
<v Speaker 2>this model, at least in the beginning, is going to

0:12:58.520 --> 0:13:01.680
<v Speaker 2>have a lot fewer resources available to it. But on

0:13:01.720 --> 0:13:03.480
<v Speaker 2>the other hand, it's going to have a much more

0:13:03.840 --> 0:13:05.760
<v Speaker 2>diverse set of inputs.

0:13:06.280 --> 0:13:09.360
<v Speaker 3>That's right. And the other thing is that IBM, basically

0:13:09.440 --> 0:13:12.120
<v Speaker 3>is part of this project, has something called the Granite

0:13:12.160 --> 0:13:15.320
<v Speaker 3>Model family, and they've donated some granite models. So these

0:13:15.360 --> 0:13:18.040
<v Speaker 3>are the ones that take the months and terabytes of

0:13:18.120 --> 0:13:21.280
<v Speaker 3>data and all the GPUs to train. So IBM has

0:13:21.320 --> 0:13:24.560
<v Speaker 3>created one of those, and they have listed out and

0:13:24.640 --> 0:13:26.800
<v Speaker 3>linked to the data sets that they used, and they

0:13:26.840 --> 0:13:29.760
<v Speaker 3>talk about the relative proportions they used when pre training,

0:13:30.120 --> 0:13:32.160
<v Speaker 3>so it's not just a black box. You know where

0:13:32.160 --> 0:13:35.000
<v Speaker 3>the data came from, which is a pretty open position

0:13:35.040 --> 0:13:37.600
<v Speaker 3>to take. That is what we recommend as the base.

0:13:37.679 --> 0:13:40.280
<v Speaker 3>So you use the instruct lab tuning. You take this

0:13:40.360 --> 0:13:43.280
<v Speaker 3>base granite model that IBM has provided, and you use

0:13:43.320 --> 0:13:45.760
<v Speaker 3>the instruct lab tooling that red Hat works on, and

0:13:45.800 --> 0:13:48.120
<v Speaker 3>you use that to fine tune the model to make

0:13:48.160 --> 0:13:49.800
<v Speaker 3>it whatever you want.

0:13:50.320 --> 0:13:53.240
<v Speaker 2>I want to go back to the partnership between IBM

0:13:53.320 --> 0:13:57.319
<v Speaker 2>and red Hat here with them providing the granite model

0:13:57.920 --> 0:14:00.480
<v Speaker 2>to your instruct lab. Is this the first ti I'm

0:14:00.960 --> 0:14:03.200
<v Speaker 2>red Hat and IBM have collaborated like this.

0:14:04.000 --> 0:14:06.559
<v Speaker 3>I think it's something that's been going on, Like another

0:14:06.840 --> 0:14:09.480
<v Speaker 3>a product within the red Hat family would be open

0:14:09.520 --> 0:14:12.840
<v Speaker 3>Shift AI, where they collaborate a lot with IBM Research team,

0:14:13.080 --> 0:14:15.679
<v Speaker 3>like BLM is one of the components of that product

0:14:15.679 --> 0:14:19.680
<v Speaker 3>that there's a nice kind of exchange and collaboration between

0:14:19.720 --> 0:14:21.000
<v Speaker 3>the two companies.

0:14:21.760 --> 0:14:24.400
<v Speaker 2>How large is the potential community of people who might

0:14:24.480 --> 0:14:27.560
<v Speaker 2>contribute to instruct lab It could.

0:14:27.320 --> 0:14:30.640
<v Speaker 3>Be thousands of people. I mean, we'll see, it's early days.

0:14:31.000 --> 0:14:34.320
<v Speaker 3>This is early technology that was invented at IBM Research

0:14:34.440 --> 0:14:36.480
<v Speaker 3>that they partnered with us at red Hat to kind

0:14:36.480 --> 0:14:39.480
<v Speaker 3>of build the software around it. There's still more to go,

0:14:39.600 --> 0:14:41.840
<v Speaker 3>Like right now, we have a team in the community

0:14:41.920 --> 0:14:44.240
<v Speaker 3>that's actually trying to build a web interface to make

0:14:44.280 --> 0:14:47.280
<v Speaker 3>it easier for anybody to contribute. So we have a

0:14:47.320 --> 0:14:50.440
<v Speaker 3>lot of those sort of user experience for the contributor

0:14:50.480 --> 0:14:52.560
<v Speaker 3>to the model stuff to work out that we're still

0:14:52.600 --> 0:14:55.800
<v Speaker 3>actively building on. But like my vision for it even

0:14:55.960 --> 0:14:58.840
<v Speaker 3>is I like going back to that academic model of

0:14:58.920 --> 0:15:01.440
<v Speaker 3>learning from what others and building upon it over time.

0:15:02.000 --> 0:15:04.280
<v Speaker 3>It would be very good for us to sort of

0:15:04.320 --> 0:15:08.640
<v Speaker 3>go out and try to collaborate with academics of all fields, like, hey,

0:15:08.680 --> 0:15:11.320
<v Speaker 3>you know, the model doesn't know about your field, would

0:15:11.400 --> 0:15:14.360
<v Speaker 3>you like to put something into the model about your

0:15:14.400 --> 0:15:17.000
<v Speaker 3>field so it knows about it, or even you know,

0:15:17.520 --> 0:15:20.400
<v Speaker 3>talk to the model it got it wrong, let's correct it.

0:15:20.440 --> 0:15:22.680
<v Speaker 3>Can we lean on your expertise to correct it and

0:15:22.760 --> 0:15:24.840
<v Speaker 3>make sure it gets it right and sort of use

0:15:24.960 --> 0:15:28.040
<v Speaker 3>that community model as a way for everybody to collaborate

0:15:28.080 --> 0:15:33.400
<v Speaker 3>because before instruct Lab, my understanding is if you wanted

0:15:33.440 --> 0:15:35.920
<v Speaker 3>to take a model that's open source license and play

0:15:35.960 --> 0:15:37.520
<v Speaker 3>with it, you could do that. You could take a

0:15:37.560 --> 0:15:40.200
<v Speaker 3>model kind of off the shelf from Hugging Face and

0:15:40.280 --> 0:15:42.360
<v Speaker 3>fine tune it yourself. But it's a bit of a

0:15:42.400 --> 0:15:45.000
<v Speaker 3>dead end because you made your contributions, but there's no

0:15:45.040 --> 0:15:48.400
<v Speaker 3>way for other people to collaborate with you. So the

0:15:48.400 --> 0:15:50.920
<v Speaker 3>way that we've built this is based on how the

0:15:50.960 --> 0:15:55.280
<v Speaker 3>technology works. Everybody can contribute to it. This is something

0:15:55.320 --> 0:15:57.320
<v Speaker 3>that you can keep growing and growing and growing over time.

0:15:57.680 --> 0:16:01.000
<v Speaker 2>Yeah. Yeah, what's the level of expert te is necessary

0:16:01.040 --> 0:16:02.000
<v Speaker 2>to be a contributor.

0:16:02.600 --> 0:16:04.480
<v Speaker 3>You don't need to be a data scientist, and you

0:16:04.520 --> 0:16:07.480
<v Speaker 3>don't need to have exotic hardware. Honestly, if you don't

0:16:07.480 --> 0:16:10.320
<v Speaker 3>even have laptop hardware that meets SUSPEC for doing instruct

0:16:10.400 --> 0:16:13.560
<v Speaker 3>Labs laptop version, you can submit it to the community

0:16:13.640 --> 0:16:16.160
<v Speaker 3>and then we'll actually build it for you. We have

0:16:16.240 --> 0:16:18.600
<v Speaker 3>bots and stuff that do that, and we're hoping over

0:16:18.640 --> 0:16:21.160
<v Speaker 3>time to make that more accessible, first by having a

0:16:21.280 --> 0:16:23.880
<v Speaker 3>user interface and then maybe later on having a web service.

0:16:24.200 --> 0:16:27.400
<v Speaker 2>Yeah, so give me an example of how a business

0:16:27.480 --> 0:16:29.560
<v Speaker 2>might make use of instruct lab.

0:16:30.120 --> 0:16:32.640
<v Speaker 3>One of the things that businesses are doing with AI

0:16:32.760 --> 0:16:36.440
<v Speaker 3>right now is using hosted API services. They're quite expensive,

0:16:36.720 --> 0:16:39.520
<v Speaker 3>but they're finding value, but it's hard given the amount

0:16:39.560 --> 0:16:41.840
<v Speaker 3>of money they're spending. And one of the things that's

0:16:41.840 --> 0:16:43.680
<v Speaker 3>a little scary about it too, is like you have

0:16:44.000 --> 0:16:48.440
<v Speaker 3>very sensitive internal documents and you have employees maybe not

0:16:48.600 --> 0:16:51.480
<v Speaker 3>understanding what they're actually doing because you know, how would

0:16:51.520 --> 0:16:54.920
<v Speaker 3>you if you're not technical enough when you're asking said

0:16:55.560 --> 0:17:00.800
<v Speaker 3>public web service AI model information about out you're copy

0:17:00.800 --> 0:17:05.120
<v Speaker 3>pasting internal company documents. It's going across the Internet into

0:17:05.160 --> 0:17:08.360
<v Speaker 3>another company's hands, and that company probably shouldn't have access

0:17:08.400 --> 0:17:11.600
<v Speaker 3>to that. So what both RedHat and IBM and the

0:17:11.640 --> 0:17:14.720
<v Speaker 3>space are looking at, like the instruct lab model is

0:17:14.840 --> 0:17:18.320
<v Speaker 3>very modest. It's seven billion parameter model, very small. It's

0:17:18.440 --> 0:17:21.560
<v Speaker 3>very cheap to serve inference on a seven billion parameter model.

0:17:22.240 --> 0:17:25.639
<v Speaker 3>It's competing with trillion parameter models that are hosted. You

0:17:25.760 --> 0:17:28.840
<v Speaker 3>take this small model that is cheap to run inference on,

0:17:29.480 --> 0:17:33.400
<v Speaker 3>you train it with your own company's proprietary data inside

0:17:33.400 --> 0:17:36.000
<v Speaker 3>the walls of your company, on your own hardware. You

0:17:36.040 --> 0:17:39.320
<v Speaker 3>can do all sorts of actual data analysis on your

0:17:39.320 --> 0:17:41.919
<v Speaker 3>most sensitive data and have the confidence that has not

0:17:42.000 --> 0:17:42.960
<v Speaker 3>left the premises.

0:17:43.760 --> 0:17:46.800
<v Speaker 2>In that use case, you're not actually training the model

0:17:46.840 --> 0:17:50.080
<v Speaker 2>for everyone. You're just taking it and doing some private

0:17:50.119 --> 0:17:52.800
<v Speaker 2>stuff on it. Exactly doesn't leave the building. But that's

0:17:52.840 --> 0:17:59.040
<v Speaker 2>separate from an interaction where you're doing something that contributes overall.

0:17:59.440 --> 0:18:02.280
<v Speaker 3>Right, That's something maybe that I should be more clear

0:18:02.280 --> 0:18:04.600
<v Speaker 3>about is there's sort of two tracks here, and this

0:18:04.680 --> 0:18:08.320
<v Speaker 3>is very red hat classic. You have your upstream community

0:18:08.359 --> 0:18:10.960
<v Speaker 3>track and you have your business product track. So the

0:18:11.040 --> 0:18:14.960
<v Speaker 3>upstream community track is just enabling anybody to contribute to

0:18:15.000 --> 0:18:16.840
<v Speaker 3>a model in a collaborative way and play with it.

0:18:17.280 --> 0:18:21.119
<v Speaker 3>The downstream product business oriented track is now take that

0:18:21.240 --> 0:18:25.840
<v Speaker 3>tech that we've honed and developed in the open community

0:18:26.520 --> 0:18:29.000
<v Speaker 3>and apply it to your business knowledge and skills.

0:18:30.040 --> 0:18:33.880
<v Speaker 2>This community driven approach marks a pivotal shift towards more

0:18:33.880 --> 0:18:39.679
<v Speaker 2>accessible AI solutions. The contrast between externally hosted AI services,

0:18:39.960 --> 0:18:43.159
<v Speaker 2>and the open model enhanced by instruct lab underscores the

0:18:43.160 --> 0:18:48.240
<v Speaker 2>potential for broader adoption of AI in diverse business contexts.

0:18:48.760 --> 0:18:52.160
<v Speaker 2>She envisions a future in which technological innovation is more

0:18:52.200 --> 0:18:56.520
<v Speaker 2>tailored to individual business needs, guided by principles of openness

0:18:56.640 --> 0:19:02.640
<v Speaker 2>and security. Seer imaginary case study. Sure, I'm a law firm,

0:19:03.200 --> 0:19:06.960
<v Speaker 2>I'm an entertainment law I have one hundred clients who

0:19:06.960 --> 0:19:11.800
<v Speaker 2>are big stars. They all have incredibly complicated contracts. I

0:19:11.880 --> 0:19:16.520
<v Speaker 2>feed a thousand of my company's contracts from the last

0:19:16.920 --> 0:19:20.119
<v Speaker 2>ten years into the model, and then every time I

0:19:20.160 --> 0:19:22.760
<v Speaker 2>have a new contract, I ask the model, am I

0:19:22.800 --> 0:19:25.479
<v Speaker 2>missing something? Can you go back and look through all

0:19:25.480 --> 0:19:28.119
<v Speaker 2>our own contracts and show me a contract that is

0:19:28.200 --> 0:19:32.240
<v Speaker 2>missing key components or exposes us to some liability. In

0:19:32.280 --> 0:19:37.240
<v Speaker 2>that case, the model would know my law firm contracts really,

0:19:37.440 --> 0:19:40.199
<v Speaker 2>really well. It's as if they've been working out my

0:19:40.280 --> 0:19:44.360
<v Speaker 2>law firm. They're not distracted by other people's particular styles

0:19:45.400 --> 0:19:49.320
<v Speaker 2>or a bunch of contracts from the utility industry, or

0:19:49.400 --> 0:19:53.880
<v Speaker 2>they know entertainment law contracts exactly.

0:19:54.000 --> 0:19:55.840
<v Speaker 3>Yeah, And you can train it in your own image,

0:19:55.880 --> 0:19:59.600
<v Speaker 3>your style of doing things. It's something that your company

0:19:59.680 --> 0:20:03.399
<v Speaker 3>can that is uniquely helpful to you. No third party

0:20:03.400 --> 0:20:05.639
<v Speaker 3>could do that because no third party understands how you

0:20:05.720 --> 0:20:09.439
<v Speaker 3>do business and understands your history and your documents. So

0:20:09.440 --> 0:20:12.080
<v Speaker 3>it's sort of a way of getting value out of

0:20:12.119 --> 0:20:14.679
<v Speaker 3>the stuff you already have sitting in a file cabinet somewhere.

0:20:14.800 --> 0:20:16.040
<v Speaker 3>It's it's very cool.

0:20:16.320 --> 0:20:19.320
<v Speaker 2>Yeah, give me a sort of a real world case

0:20:19.320 --> 0:20:22.560
<v Speaker 2>study where you think the business use case would be

0:20:22.560 --> 0:20:27.159
<v Speaker 2>really powerful. What's a business that really could see an

0:20:27.200 --> 0:20:31.200
<v Speaker 2>advantage to using instruct lab in its way.

0:20:31.680 --> 0:20:33.959
<v Speaker 3>The demo that I've given a couple of times at

0:20:33.960 --> 0:20:37.520
<v Speaker 3>different events used an imaginary insurance company. So you say,

0:20:37.560 --> 0:20:41.480
<v Speaker 3>you have this company, you have to recommend repairs for

0:20:41.560 --> 0:20:45.000
<v Speaker 3>various types of claims. You've been doing this for years,

0:20:45.040 --> 0:20:47.920
<v Speaker 3>you know. If you know the windshield's broken and you've

0:20:47.920 --> 0:20:50.760
<v Speaker 3>gotten this type of accident and it's this model car,

0:20:50.960 --> 0:20:52.720
<v Speaker 3>these are the kinds of things you want to look at.

0:20:53.400 --> 0:20:56.199
<v Speaker 3>So you could talk to any insurance agent in the

0:20:56.200 --> 0:20:58.920
<v Speaker 3>field and be like, oh, you know, it's a Tesla.

0:20:59.000 --> 0:21:01.480
<v Speaker 3>You might want to look at the or something. They'll

0:21:01.520 --> 0:21:04.840
<v Speaker 3>have some latent knowledge just so you can take that

0:21:04.960 --> 0:21:07.560
<v Speaker 3>and train it into a model. Honestly, I think these

0:21:07.640 --> 0:21:10.680
<v Speaker 3>kind of new technologies are better when they're less visible.

0:21:11.280 --> 0:21:13.719
<v Speaker 3>So say you have the claims agents in the field

0:21:13.760 --> 0:21:15.760
<v Speaker 3>and they have this tool and they're kind of entering

0:21:15.800 --> 0:21:18.760
<v Speaker 3>the claim data. They're on the scene at the car,

0:21:19.320 --> 0:21:22.000
<v Speaker 3>and it might say, oh, look, I see this is

0:21:22.040 --> 0:21:24.560
<v Speaker 3>a Ford fiesta. These are things you want to look

0:21:24.600 --> 0:21:27.800
<v Speaker 3>at for this type of accident. As you're entering the data,

0:21:28.200 --> 0:21:30.080
<v Speaker 3>it could be going through the knowledge you had loaded

0:21:30.119 --> 0:21:32.600
<v Speaker 3>into the model and be making these suggestions based on

0:21:32.600 --> 0:21:35.560
<v Speaker 3>your company's background, and hey, you know, let's not make

0:21:35.600 --> 0:21:38.120
<v Speaker 3>the same mistake twice. Let's make new mistakes, and let's

0:21:38.200 --> 0:21:41.080
<v Speaker 3>learn from the stuff we already did. So that's one example,

0:21:41.119 --> 0:21:43.679
<v Speaker 3>but there's so many different industries in ways that this

0:21:43.760 --> 0:21:46.520
<v Speaker 3>could help, and it could make those agents in the

0:21:46.560 --> 0:21:48.040
<v Speaker 3>field more efficient.

0:21:48.760 --> 0:21:51.160
<v Speaker 2>Have you had anyone talk to you about using instruct

0:21:51.240 --> 0:21:52.760
<v Speaker 2>lab in a way that surprised you.

0:21:54.800 --> 0:21:59.480
<v Speaker 3>I mean, some people have done funky things, but sort

0:21:59.480 --> 0:22:01.840
<v Speaker 3>of playing with the skills stuff, that's where I see

0:22:01.880 --> 0:22:04.840
<v Speaker 3>a lot of creativity. The difference between knowledge and skills

0:22:04.920 --> 0:22:08.240
<v Speaker 3>is that knowledge is pretty pretty understandable, right, like, oh,

0:22:08.280 --> 0:22:12.240
<v Speaker 3>historical insurance claims or you know, legal contracts. Skills are

0:22:12.280 --> 0:22:15.040
<v Speaker 3>a little different so whenever somebody submits a skill, sometimes

0:22:15.200 --> 0:22:17.200
<v Speaker 3>it tends to be really creative because it's not something

0:22:17.240 --> 0:22:20.520
<v Speaker 3>that's super intuitive. Somebody submitted a skill. I don't know

0:22:20.560 --> 0:22:23.560
<v Speaker 3>how well it worked, but it was like making ASKI art,

0:22:23.760 --> 0:22:25.960
<v Speaker 3>like draw me a I don't know, draw me a

0:22:26.000 --> 0:22:27.879
<v Speaker 3>dog I would do like an ASKI art dog. I mean,

0:22:27.920 --> 0:22:30.440
<v Speaker 3>it's stuff that you can do programmatically. One that was

0:22:30.480 --> 0:22:34.400
<v Speaker 3>actually very very helpful was you know, take this table

0:22:34.440 --> 0:22:37.640
<v Speaker 3>of data and convert it to this format. Like, oh,

0:22:37.680 --> 0:22:39.280
<v Speaker 3>that's nice. That actually saves me time.

0:22:39.840 --> 0:22:42.359
<v Speaker 2>How far away are we from the day when I

0:22:42.480 --> 0:22:47.160
<v Speaker 2>Malcolm Globwell technology ignore Amus can go home and easily

0:22:47.240 --> 0:22:52.280
<v Speaker 2>interact with instruct lab Maybe a few months, a few months,

0:22:53.400 --> 0:22:54.560
<v Speaker 2>you're gonna say a few years.

0:22:55.200 --> 0:22:56.920
<v Speaker 3>No, I think it could be a few months.

0:22:57.520 --> 0:22:59.040
<v Speaker 2>Wow, I hope.

0:23:00.240 --> 0:23:01.120
<v Speaker 3>Open source innovation.

0:23:01.520 --> 0:23:05.000
<v Speaker 2>Yeah, oh that's really interesting. Yeah. I'm always take it

0:23:05.000 --> 0:23:08.160
<v Speaker 2>by surprise. I'm still thinking in twentieth century terms about

0:23:08.160 --> 0:23:10.719
<v Speaker 2>how long things take, and you're in the twenty second

0:23:11.240 --> 0:23:12.080
<v Speaker 2>century as well as.

0:23:11.960 --> 0:23:16.400
<v Speaker 3>I can tell. The instruct lab core invention was invented

0:23:16.440 --> 0:23:19.400
<v Speaker 3>in a hotel room at an AI conference in December

0:23:19.520 --> 0:23:22.560
<v Speaker 3>with an amazing group of IBM research guys December of

0:23:22.560 --> 0:23:23.400
<v Speaker 3>twenty twenty three.

0:23:23.680 --> 0:23:26.399
<v Speaker 2>Wait back up, you have to tell the story.

0:23:26.600 --> 0:23:29.600
<v Speaker 3>This group of guys we've been working with, they were

0:23:29.640 --> 0:23:32.000
<v Speaker 3>at this conference together and it's a really funny story

0:23:32.040 --> 0:23:34.879
<v Speaker 3>because you know, it's hard to get access to GPUs

0:23:35.200 --> 0:23:37.040
<v Speaker 3>and like even you know, you're at IBM and it's

0:23:37.040 --> 0:23:39.800
<v Speaker 3>hard to get access because everybody wants access. They did

0:23:39.800 --> 0:23:42.760
<v Speaker 3>it over Christmas break because nobody was using the cluster

0:23:42.840 --> 0:23:44.959
<v Speaker 3>at the time, and they ran all of these experiments

0:23:44.960 --> 0:23:46.800
<v Speaker 3>and I'm like, whoa, this is really cool.

0:23:47.200 --> 0:23:51.440
<v Speaker 2>And their idea was we can do a stripped down

0:23:52.359 --> 0:23:56.639
<v Speaker 2>AI model, and was the idea and even back then

0:23:56.760 --> 0:23:59.200
<v Speaker 2>combine it with granted, what was the original idea?

0:23:59.240 --> 0:24:02.159
<v Speaker 3>The original idea, Yeah, it's sort of multi there's like

0:24:02.240 --> 0:24:05.080
<v Speaker 3>multiple aspects to it. So like one of the aspects

0:24:05.200 --> 0:24:07.280
<v Speaker 3>it actually came on later, but it starts at the

0:24:07.320 --> 0:24:10.919
<v Speaker 3>beginning of the workflow. Is you're using a taxonomy to

0:24:11.119 --> 0:24:13.840
<v Speaker 3>organize how you're fine tuning the model. So the old

0:24:13.840 --> 0:24:16.560
<v Speaker 3>approach they call it the blender approach, to just take

0:24:16.560 --> 0:24:19.160
<v Speaker 3>a bunch of data of roughly the type of data

0:24:19.200 --> 0:24:20.960
<v Speaker 3>that you'd like and you kind of throw it in

0:24:21.040 --> 0:24:23.560
<v Speaker 3>and then see what comes out, don't like it, Okay,

0:24:23.720 --> 0:24:26.760
<v Speaker 3>throw in more, try again, see what comes out. They

0:24:26.800 --> 0:24:30.199
<v Speaker 3>had used this taxonomy technique, so you actually build like

0:24:30.280 --> 0:24:33.600
<v Speaker 3>a taxonomy of like categories and subfolders of like this

0:24:33.680 --> 0:24:35.960
<v Speaker 3>is the knowledge and skills that we want to train

0:24:36.040 --> 0:24:39.600
<v Speaker 3>into the model. And that way you're sort of systematic

0:24:39.720 --> 0:24:42.720
<v Speaker 3>about what you're adding, and you can also identify gaps

0:24:42.720 --> 0:24:44.760
<v Speaker 3>pretty easily. Oh, I don't have a category for that.

0:24:44.880 --> 0:24:46.960
<v Speaker 3>Let me add that. So that's like one of the

0:24:47.560 --> 0:24:48.680
<v Speaker 3>parts of the invention here.

0:24:49.520 --> 0:24:54.439
<v Speaker 2>Point number one is let's be intentional and deliberate in

0:24:54.480 --> 0:24:55.720
<v Speaker 2>how we build and train this thing.

0:24:55.960 --> 0:24:59.239
<v Speaker 3>Yeah, and then the next component would be okay, so

0:24:59.600 --> 0:25:02.239
<v Speaker 3>is actually quite expensive. Part of the expense of like

0:25:02.920 --> 0:25:05.880
<v Speaker 3>tuning models and just training models in general is coming

0:25:05.920 --> 0:25:09.040
<v Speaker 3>up with the data. And what they wanted to do

0:25:09.160 --> 0:25:11.159
<v Speaker 3>is have a technique where you could have just a

0:25:11.200 --> 0:25:14.520
<v Speaker 3>little bit of data and expand it with something they're

0:25:14.560 --> 0:25:17.760
<v Speaker 3>calling synthetic data generation. And this is where it's sort

0:25:17.760 --> 0:25:22.280
<v Speaker 3>of like you have this student and teacher workflow. So

0:25:23.200 --> 0:25:26.960
<v Speaker 3>you have your taxonomy. The taxonomy has sort of the

0:25:27.000 --> 0:25:30.159
<v Speaker 3>knowledge like a business's knowledge documents, their insurance claims, and

0:25:30.240 --> 0:25:33.439
<v Speaker 3>it has these quizzes that you write, and that's to

0:25:33.480 --> 0:25:35.480
<v Speaker 3>teach the model. So I'm writing a quiz based just

0:25:35.520 --> 0:25:37.240
<v Speaker 3>like you do in school. You read the chapter all

0:25:37.280 --> 0:25:39.280
<v Speaker 3>in the American Revolution, and then you answer a ten

0:25:39.400 --> 0:25:42.680
<v Speaker 3>question quiz where you're giving the model quiz. You need

0:25:42.680 --> 0:25:45.880
<v Speaker 3>at least five questions and answers, and the answers need

0:25:45.920 --> 0:25:48.480
<v Speaker 3>to be taken from the context of the document, and

0:25:48.640 --> 0:25:52.120
<v Speaker 3>then you run it through a process called synthetic data generation,

0:25:52.400 --> 0:25:54.520
<v Speaker 3>and it looks at the documents or look at the

0:25:54.560 --> 0:25:57.680
<v Speaker 3>history chapter. It'll look at the questions and answers, and

0:25:57.720 --> 0:26:00.439
<v Speaker 3>then it'll look to that original document and come up

0:26:00.480 --> 0:26:02.879
<v Speaker 3>with more questions and answers based on the format of

0:26:02.880 --> 0:26:05.480
<v Speaker 3>the questions and answers you made. So you can take

0:26:05.560 --> 0:26:09.159
<v Speaker 3>five questions of answers amplify them into one hundred questions

0:26:09.160 --> 0:26:12.040
<v Speaker 3>and answers, two hundred questions and answers, and it's a

0:26:12.119 --> 0:26:15.280
<v Speaker 3>second model that is making the questions and answers. So

0:26:15.280 --> 0:26:18.360
<v Speaker 3>it's synthetic data generation using an AI model to make

0:26:18.400 --> 0:26:21.359
<v Speaker 3>the questions. We use an open source model to do that.

0:26:21.920 --> 0:26:24.600
<v Speaker 3>So that's the second part, and then the third part

0:26:24.680 --> 0:26:27.600
<v Speaker 3>is we have a multi phase tuning technique to actually

0:26:27.760 --> 0:26:31.280
<v Speaker 3>take the synthetic data and then basically bake it into

0:26:31.280 --> 0:26:34.520
<v Speaker 3>the model. So sort of that's the approach. A general

0:26:34.560 --> 0:26:37.280
<v Speaker 3>philosophy of the approach is using grantede because we know

0:26:37.320 --> 0:26:40.080
<v Speaker 3>where the data came from. Another approach is the fact

0:26:40.119 --> 0:26:42.480
<v Speaker 3>that we're using small models that are cheap to run

0:26:42.480 --> 0:26:45.040
<v Speaker 3>inference on. They're small enough that you can tune them

0:26:45.040 --> 0:26:47.880
<v Speaker 3>on laptop hardware. You don't need all the fancy expensive

0:26:47.920 --> 0:26:52.120
<v Speaker 3>GPU mania you're good. So sort of like a whole system,

0:26:52.160 --> 0:26:54.960
<v Speaker 3>it's like not any one component, but it's sort of

0:26:55.119 --> 0:26:57.639
<v Speaker 3>the approach they took with somewhat novel, and they were

0:26:57.720 --> 0:27:00.639
<v Speaker 3>very excited when they saw the experimental results. There was

0:27:00.680 --> 0:27:03.480
<v Speaker 3>a meeting between red hat and IBM. It was actually

0:27:03.480 --> 0:27:05.800
<v Speaker 3>an IBM research meeting that red Hatters were invited to,

0:27:06.560 --> 0:27:08.880
<v Speaker 3>and I think the red Hatters involves sort of saw

0:27:08.920 --> 0:27:13.480
<v Speaker 3>the potential, WHOA, we can make models open source finally,

0:27:13.600 --> 0:27:17.159
<v Speaker 3>rather than them just being these endless dead forks, we

0:27:17.200 --> 0:27:19.840
<v Speaker 3>could make it so people could contribute back and collaborate

0:27:19.880 --> 0:27:22.159
<v Speaker 3>around it. So that's when red hat became interested in

0:27:22.200 --> 0:27:25.679
<v Speaker 3>it and we sort of worked together and the research

0:27:25.720 --> 0:27:28.400
<v Speaker 3>engineers from IBM Research who came up with the technique

0:27:28.480 --> 0:27:31.080
<v Speaker 3>and then my team, the software engineers who know how

0:27:31.119 --> 0:27:36.080
<v Speaker 3>to take research code and productize it into actually runnable,

0:27:36.119 --> 0:27:41.040
<v Speaker 3>supportable software, kind of got together. We've been hanging out

0:27:41.040 --> 0:27:43.760
<v Speaker 3>in the Boston office at red Hat and building it out.

0:27:44.080 --> 0:27:47.320
<v Speaker 3>April eighteenth was when we went open source and we

0:27:47.359 --> 0:27:49.800
<v Speaker 3>made all of our repositories with all of the code public,

0:27:49.840 --> 0:27:52.119
<v Speaker 3>and right now we're working towards a product release, so

0:27:52.160 --> 0:27:53.120
<v Speaker 3>a supported product.

0:27:53.200 --> 0:27:55.320
<v Speaker 2>How long did it take you to be convinced of

0:27:56.280 --> 0:27:59.520
<v Speaker 2>the value of this idea? I mean, so people get

0:27:59.520 --> 0:28:03.760
<v Speaker 2>together in this hotel room. They're running these experiments over Christmas.

0:28:04.000 --> 0:28:06.199
<v Speaker 2>Are you aware of the experiments as they're running them?

0:28:07.280 --> 0:28:09.760
<v Speaker 3>I didn't find out till February.

0:28:09.800 --> 0:28:11.879
<v Speaker 2>So they come to you in February and they say, mo,

0:28:13.240 --> 0:28:15.320
<v Speaker 2>can you recreate that conversation?

0:28:16.359 --> 0:28:20.800
<v Speaker 3>Well, our CEO, Matt Hicks, and then Jeremy Eater, who's

0:28:20.800 --> 0:28:23.480
<v Speaker 3>one of our distinguished engineers, and Steve Watt, who's a VP,

0:28:23.680 --> 0:28:26.200
<v Speaker 3>were present I think at that meeting. So they kind

0:28:26.240 --> 0:28:28.480
<v Speaker 3>of brought it back to us and said, listen, we've

0:28:28.520 --> 0:28:32.920
<v Speaker 3>invited these IBM research folks to come visit in Boston,

0:28:33.680 --> 0:28:36.119
<v Speaker 3>you know, work with them, like, see, does this have

0:28:36.160 --> 0:28:38.360
<v Speaker 3>any merit could we build something from it, and so

0:28:38.440 --> 0:28:41.520
<v Speaker 3>they gave us some presentations. We were very excited when

0:28:41.520 --> 0:28:45.040
<v Speaker 3>they came to us. It only had support for Mac laptops.

0:28:45.640 --> 0:28:47.720
<v Speaker 3>Of course, you know Red Hat were Linux people, So

0:28:47.800 --> 0:28:49.600
<v Speaker 3>we're like, all right, we've got to fix that. So

0:28:49.800 --> 0:28:52.480
<v Speaker 3>a bunch of the junior engineers around the office kind

0:28:52.520 --> 0:28:53.800
<v Speaker 3>of came in and they're like, okay, we're going to

0:28:53.840 --> 0:28:56.000
<v Speaker 3>build Linux support. And they had it within like a

0:28:56.000 --> 0:28:58.880
<v Speaker 3>couple of days. It was crazy because this was just

0:28:58.920 --> 0:29:00.840
<v Speaker 3>meant to be. Hey, guys, you know now what these

0:29:00.880 --> 0:29:05.000
<v Speaker 3>are invited gas visiting our office, see what happens. And

0:29:05.000 --> 0:29:08.520
<v Speaker 3>we end up doing like weeks of hackfests and late

0:29:08.600 --> 0:29:11.320
<v Speaker 3>night pizzas in the conference room and like playing around

0:29:11.320 --> 0:29:14.400
<v Speaker 3>with it and learning, and it was It was very fun.

0:29:14.480 --> 0:29:15.200
<v Speaker 3>It's very cool.

0:29:15.320 --> 0:29:16.760
<v Speaker 2>Anyone else do anything like this.

0:29:18.160 --> 0:29:20.880
<v Speaker 3>Is not my understanding that anybody else is doing it yet,

0:29:21.480 --> 0:29:24.400
<v Speaker 3>maybe others will try. A lot of the focus has

0:29:24.480 --> 0:29:28.120
<v Speaker 3>been on that pre training phase, but for us, again

0:29:28.200 --> 0:29:31.800
<v Speaker 3>that fine tuning. It's more accessible because you don't need

0:29:31.840 --> 0:29:34.400
<v Speaker 3>all the exotic hardware, it doesn't take months. You can

0:29:34.480 --> 0:29:36.360
<v Speaker 3>do it on a laptop. You can do a smoke

0:29:36.400 --> 0:29:38.720
<v Speaker 3>test version of it in less than an hour.

0:29:39.280 --> 0:29:40.360
<v Speaker 2>What does the word smoke test.

0:29:40.600 --> 0:29:43.000
<v Speaker 3>Smoke test means you're not doing a full fine tuning

0:29:43.040 --> 0:29:46.080
<v Speaker 3>on the model. It's a different tuning process. It's like

0:29:46.160 --> 0:29:48.200
<v Speaker 3>kind of lower quality so to run on lower grade

0:29:48.200 --> 0:29:50.400
<v Speaker 3>hardware so you can kind of see them didn't move

0:29:50.400 --> 0:29:52.040
<v Speaker 3>the model or not, but it's not gonna give you

0:29:52.040 --> 0:29:54.640
<v Speaker 3>like the full picture. You need higher end hardware to

0:29:54.680 --> 0:29:56.720
<v Speaker 3>actually do the full thing. So that's what the product

0:29:56.720 --> 0:29:59.520
<v Speaker 3>will enable you to do once it's launched, is you're

0:29:59.520 --> 0:30:01.680
<v Speaker 3>gonna need GPUs, but when you have them, will help

0:30:01.720 --> 0:30:03.000
<v Speaker 3>you make the best usage of them.

0:30:03.280 --> 0:30:06.160
<v Speaker 2>Yeah. Yeah, And there's a little detail. I want to

0:30:06.200 --> 0:30:08.960
<v Speaker 2>go back to sure in order to run the tests

0:30:09.000 --> 0:30:14.640
<v Speaker 2>on this idea way back when they needed time on

0:30:14.720 --> 0:30:17.760
<v Speaker 2>the GPUs. So this this will be the in house

0:30:18.160 --> 0:30:22.320
<v Speaker 2>IBM and they were quiet at Christmas, So how much

0:30:22.400 --> 0:30:25.440
<v Speaker 2>time would you need on the GPUs to kind of

0:30:25.560 --> 0:30:26.560
<v Speaker 2>get proof of concept?

0:30:26.920 --> 0:30:29.320
<v Speaker 3>Well what happens is and it's sort of like a

0:30:29.320 --> 0:30:31.600
<v Speaker 3>lot of trial and error, right, And there's a lot

0:30:31.640 --> 0:30:35.240
<v Speaker 3>about this stuff that like you come up with a hypothesis,

0:30:35.320 --> 0:30:37.280
<v Speaker 3>you test it out, did it work or not? Okay,

0:30:37.400 --> 0:30:39.440
<v Speaker 3>it's just like you know, in the lab, you know,

0:30:39.600 --> 0:30:43.480
<v Speaker 3>buns and burners and beakers and whatever. So it really depends.

0:30:43.520 --> 0:30:46.880
<v Speaker 3>But it can be hours, it can be days. It

0:30:46.920 --> 0:30:48.960
<v Speaker 3>really depends on what they're trying to do. And then

0:30:49.040 --> 0:30:51.360
<v Speaker 3>sometimes they can cut the time down, you know, with

0:30:51.440 --> 0:30:53.080
<v Speaker 3>the number of GPUs you have, So like I have

0:30:53.080 --> 0:30:55.920
<v Speaker 3>a cluster of agpus, Okay, it might take a day,

0:30:56.000 --> 0:30:57.960
<v Speaker 3>but then if I can get thirty two, I can

0:30:57.960 --> 0:30:59.720
<v Speaker 3>pipeline it and make it go faster and get it

0:30:59.720 --> 0:31:01.800
<v Speaker 3>down a few hours. So it really depends, you know.

0:31:01.880 --> 0:31:04.960
<v Speaker 3>But it's like everybody's home for the holidays. It's a

0:31:05.000 --> 0:31:07.520
<v Speaker 3>lovely playground to kind of get that stuff going fast.

0:31:08.320 --> 0:31:11.880
<v Speaker 2>Let's jump forward one year. Tell me the status of

0:31:11.920 --> 0:31:15.400
<v Speaker 2>this project, Tell me who's using it, tell me how

0:31:15.440 --> 0:31:21.440
<v Speaker 2>big is it. Give me your optimistic but plausible prediction

0:31:21.760 --> 0:31:25.480
<v Speaker 2>about what instruct Lab looks like a year from now.

0:31:26.400 --> 0:31:29.800
<v Speaker 3>A year from now, I would like to see kind

0:31:29.800 --> 0:31:36.200
<v Speaker 3>of a vibrant community around not just building knowledge and

0:31:36.240 --> 0:31:39.960
<v Speaker 3>skills into a model, but coming up with better techniques

0:31:40.000 --> 0:31:42.560
<v Speaker 3>and innovation around how we do it. So I'd like

0:31:42.600 --> 0:31:45.600
<v Speaker 3>to see like the contributor experience as we grow more

0:31:45.640 --> 0:31:47.920
<v Speaker 3>and more contributors to be refined, So like a year

0:31:47.920 --> 0:31:51.160
<v Speaker 3>from now, Malcolm Gladwell could come impart some of his

0:31:51.200 --> 0:31:53.760
<v Speaker 3>wisdom into the model and it wouldn't be difficult, it

0:31:53.760 --> 0:31:55.920
<v Speaker 3>wouldn't be a big lift. I would love to see

0:31:56.000 --> 0:32:00.560
<v Speaker 3>the user interface tooling for doing that to be more soificated.

0:32:01.120 --> 0:32:04.400
<v Speaker 3>I would love to see more people taking this and

0:32:04.480 --> 0:32:07.120
<v Speaker 3>even using it. Maybe you're not sharing it with the community,

0:32:07.160 --> 0:32:10.080
<v Speaker 3>but you're using it for some private usage. Like I'll

0:32:10.080 --> 0:32:13.560
<v Speaker 3>give you an example. I'm in contact with a fellow

0:32:13.680 --> 0:32:16.360
<v Speaker 3>who is doing AI research and he's working with doctors.

0:32:16.400 --> 0:32:19.400
<v Speaker 3>They're GPS in an area of Canada where there's not

0:32:19.520 --> 0:32:22.200
<v Speaker 3>enough GPS for the number of patients, So you know,

0:32:22.320 --> 0:32:26.160
<v Speaker 3>anything you can do to save doctors time to get

0:32:26.200 --> 0:32:28.480
<v Speaker 3>to the next patient. It's like one of the things

0:32:28.480 --> 0:32:31.320
<v Speaker 3>that he has been doing experiments with is can we

0:32:31.480 --> 0:32:35.200
<v Speaker 3>use an open source, licensed model that the doctor can

0:32:35.320 --> 0:32:37.280
<v Speaker 3>run on their laptop so they don't have to worry

0:32:37.280 --> 0:32:39.800
<v Speaker 3>about all of the different privacy rules, Like it's privates

0:32:39.800 --> 0:32:43.880
<v Speaker 3>on the laptop right there, take his live transcription of

0:32:43.880 --> 0:32:47.560
<v Speaker 3>his conversation with the patient and then convert it automatically

0:32:47.600 --> 0:32:49.960
<v Speaker 3>to a SOAP format that can be entered in the database.

0:32:50.200 --> 0:32:52.800
<v Speaker 3>Typically this will take a doctor fifteen to twenty minutes

0:32:52.840 --> 0:32:56.560
<v Speaker 3>of paperwork. Why not save them the paperwork at least

0:32:56.600 --> 0:32:57.840
<v Speaker 3>have the model take a stab.

0:32:58.040 --> 0:33:00.120
<v Speaker 2>Does the model then hold on to that information? He

0:33:00.800 --> 0:33:03.440
<v Speaker 2>interacts with the model again when well, that's the.

0:33:03.400 --> 0:33:06.480
<v Speaker 3>Thing not withinstruct lab. Maybe that could be a future development.

0:33:06.560 --> 0:33:10.200
<v Speaker 3>It doesn't once you're doing inference, it's not ingesting that

0:33:10.280 --> 0:33:12.520
<v Speaker 3>what you're saying to it back in. It's only the

0:33:12.560 --> 0:33:14.800
<v Speaker 3>fine tuning phase. So the idea would be the doctor

0:33:14.840 --> 0:33:19.000
<v Speaker 3>could maybe load in past patient data as knowledge, and

0:33:19.000 --> 0:33:21.280
<v Speaker 3>then when he's trying to diagnose, maybe you know what

0:33:21.280 --> 0:33:24.440
<v Speaker 3>I'm saying. But the main idea is somebody might have

0:33:24.440 --> 0:33:28.160
<v Speaker 3>some private usage. I would love to see more usage

0:33:28.320 --> 0:33:31.400
<v Speaker 3>of this tool to enable people who otherwise never would

0:33:31.400 --> 0:33:34.280
<v Speaker 3>have had access to this type of technology, who never, like,

0:33:34.320 --> 0:33:37.680
<v Speaker 3>you know, a small country GP doctor is it doesn't

0:33:37.680 --> 0:33:40.520
<v Speaker 3>have GPUs. They're not going to hire some company to

0:33:40.520 --> 0:33:42.719
<v Speaker 3>custom build them a model. But maybe on the weekend,

0:33:42.720 --> 0:33:44.680
<v Speaker 3>if he's a techie guy, he could say with this.

0:33:45.000 --> 0:33:47.280
<v Speaker 2>Well, I mean, the more you talk, the more I'm

0:33:47.280 --> 0:33:51.440
<v Speaker 2>realizing that the simplicity of this model is the killer

0:33:51.480 --> 0:33:54.000
<v Speaker 2>app here. Once you know you can run it on

0:33:54.040 --> 0:33:57.920
<v Speaker 2>a laptop. You have democratized use in a way that's

0:33:57.920 --> 0:34:02.200
<v Speaker 2>inconceivable with some of these other much more complex. But

0:34:02.240 --> 0:34:05.880
<v Speaker 2>that's interesting because one would have thought intuitively that at

0:34:05.880 --> 0:34:08.360
<v Speaker 2>the beginning that the winner is going to be the

0:34:08.400 --> 0:34:13.880
<v Speaker 2>one with the biggest, most complex version. And you're saying, actually, no,

0:34:14.120 --> 0:34:18.000
<v Speaker 2>there's a whole series of uses where being lean and

0:34:18.920 --> 0:34:23.279
<v Speaker 2>focused focused is actually you know, it enables a whole

0:34:23.320 --> 0:34:26.279
<v Speaker 2>class of uses. Maybe another way of saying this is

0:34:26.840 --> 0:34:29.480
<v Speaker 2>who wouldn't be a potential instruct lab customer.

0:34:29.840 --> 0:34:33.000
<v Speaker 3>We don't know yet. It's so new, like we haven't

0:34:33.000 --> 0:34:35.319
<v Speaker 3>really had enough people experimenting and playing with it and

0:34:35.360 --> 0:34:38.160
<v Speaker 3>finding out all the things yet. But that's the thing

0:34:38.200 --> 0:34:40.120
<v Speaker 3>that's so exciting about it. It's like, I can't wait

0:34:40.160 --> 0:34:41.160
<v Speaker 3>to see what people do.

0:34:41.600 --> 0:34:43.360
<v Speaker 2>Is this the most exciting thing you've worked on in

0:34:43.360 --> 0:34:43.840
<v Speaker 2>your career?

0:34:44.120 --> 0:34:44.600
<v Speaker 3>I think so.

0:34:45.840 --> 0:34:49.640
<v Speaker 2>I think so. Yeah, Well, we are reaching the end

0:34:49.680 --> 0:34:52.480
<v Speaker 2>of our time, but before we finished, we can do

0:34:52.480 --> 0:34:56.800
<v Speaker 2>a little speed round. Sure, all right, complete the following sentence.

0:34:57.440 --> 0:34:59.680
<v Speaker 2>In five years, AI will.

0:35:00.160 --> 0:35:04.640
<v Speaker 3>Be boring, it will be integrated, It'll just work, and

0:35:04.640 --> 0:35:06.920
<v Speaker 3>there will be no now with AI thing. It'll just

0:35:06.960 --> 0:35:07.560
<v Speaker 3>be normal.

0:35:09.200 --> 0:35:12.360
<v Speaker 2>What's the number one thing? That people misunderstand about AI.

0:35:12.960 --> 0:35:16.480
<v Speaker 3>It's just matrix algebra. It's just numbers. It's not sentient.

0:35:16.719 --> 0:35:20.080
<v Speaker 3>It's not coming to take us over. It's just numbers.

0:35:20.280 --> 0:35:23.319
<v Speaker 2>You're on this side of you're on the team humanity. Yeah,

0:35:23.400 --> 0:35:28.239
<v Speaker 2>you're good. What advice would you give yourself ten years

0:35:28.280 --> 0:35:30.200
<v Speaker 2>ago to better prepare for today?

0:35:30.800 --> 0:35:34.600
<v Speaker 3>Learn Python for real. It's a programming language that's extensively

0:35:34.719 --> 0:35:37.520
<v Speaker 3>used in the community. I've always dabbled in it, but

0:35:37.680 --> 0:35:39.280
<v Speaker 3>I wish I had taken it more seriously.

0:35:39.520 --> 0:35:41.439
<v Speaker 2>Yeah, did you say, who had a daughter?

0:35:42.040 --> 0:35:43.040
<v Speaker 3>I have three daughters?

0:35:43.120 --> 0:35:45.840
<v Speaker 2>You have three daughters. I have two. You're if you

0:35:45.920 --> 0:35:49.719
<v Speaker 2>got three year you're you're on your own. What are

0:35:49.719 --> 0:35:51.120
<v Speaker 2>you making them study Python?

0:35:52.239 --> 0:35:55.279
<v Speaker 3>I am actually trying to do that. We're using a

0:35:55.320 --> 0:35:58.399
<v Speaker 3>microbit micro controller tool to do like a custom video

0:35:58.440 --> 0:36:01.800
<v Speaker 3>game controller. They prefer because it's a visual programming language,

0:36:01.800 --> 0:36:03.600
<v Speaker 3>but it has a Python interface too, and I'm like

0:36:03.719 --> 0:36:04.880
<v Speaker 3>pushing them towards Python.

0:36:05.239 --> 0:36:09.600
<v Speaker 2>Good. Chatbox and image generators are the biggest things in

0:36:09.640 --> 0:36:12.200
<v Speaker 2>consumer AI right now. What do you think is the

0:36:12.200 --> 0:36:14.000
<v Speaker 2>next big business application?

0:36:15.520 --> 0:36:20.880
<v Speaker 3>Private models? Small models fine tuned on your company's data

0:36:21.480 --> 0:36:23.160
<v Speaker 3>for you to use exclusively.

0:36:23.880 --> 0:36:27.240
<v Speaker 2>Are you using AI in your own personal life these days.

0:36:27.440 --> 0:36:29.279
<v Speaker 3>Honestly, I think a lot of us are using it

0:36:29.320 --> 0:36:31.680
<v Speaker 3>and we don't even realize it. Yeah, I mean, I'm

0:36:31.719 --> 0:36:35.719
<v Speaker 3>a ficiano of foreign languages. There's translation programs that are

0:36:35.719 --> 0:36:38.759
<v Speaker 3>built using machine learning underneath. One of the things I've

0:36:38.760 --> 0:36:41.799
<v Speaker 3>been dabbling with lately is using tech summarizations because I

0:36:41.840 --> 0:36:44.560
<v Speaker 3>tend to be very loquacious in my note taking and

0:36:44.600 --> 0:36:46.960
<v Speaker 3>that is not so useful for other people who would

0:36:47.000 --> 0:36:49.920
<v Speaker 3>just like a paragraph. So that's something I've been experimenting

0:36:49.920 --> 0:36:51.759
<v Speaker 3>with myself just to help my everyday work.

0:36:51.880 --> 0:36:56.160
<v Speaker 2>Yeah. We hear many definitions of open related to technology.

0:36:56.719 --> 0:36:59.879
<v Speaker 2>What's your definition of open and how does it help

0:36:59.880 --> 0:37:00.520
<v Speaker 2>you innovate?

0:37:00.880 --> 0:37:06.719
<v Speaker 3>My definition of open is basically sharing and being vulnerable,

0:37:06.840 --> 0:37:09.719
<v Speaker 3>like not just sharing in a have a cookie way,

0:37:09.840 --> 0:37:12.120
<v Speaker 3>but in a you know what, I don't actually know

0:37:12.160 --> 0:37:14.880
<v Speaker 3>how this works? Could you help me? And being open

0:37:15.120 --> 0:37:18.560
<v Speaker 3>to being wrong, being open to somebody helping you and

0:37:18.600 --> 0:37:20.920
<v Speaker 3>making that collaboration work. So it's not just about like

0:37:20.960 --> 0:37:24.400
<v Speaker 3>the artifact you're opening, it's your approach, like how you

0:37:24.440 --> 0:37:25.359
<v Speaker 3>do things being open?

0:37:25.600 --> 0:37:29.160
<v Speaker 2>Yeah, yeah, well I think that wraps us up. How

0:37:29.200 --> 0:37:32.520
<v Speaker 2>can listeners follow your work and learn more about Granted

0:37:32.600 --> 0:37:33.560
<v Speaker 2>and instruct lab.

0:37:33.840 --> 0:37:36.480
<v Speaker 3>Sure you can visit our project web page at instruct

0:37:36.520 --> 0:37:39.399
<v Speaker 3>lab dot ai, or you can visit our GitHub at

0:37:39.520 --> 0:37:42.600
<v Speaker 3>GitHub dot com slash instruct lab. We have lots of

0:37:42.640 --> 0:37:46.120
<v Speaker 3>instructions on how to get involved in an instruct lab wonderful.

0:37:46.440 --> 0:37:52.480
<v Speaker 2>Thank you so much, Thank you Malcolm. A big thank

0:37:52.520 --> 0:37:56.360
<v Speaker 2>you to Mo for the engaging discussion on the groundbreaking

0:37:56.680 --> 0:38:01.680
<v Speaker 2>possibilities of instruct lab Lord how this platform has the

0:38:01.680 --> 0:38:06.439
<v Speaker 2>potential to revolutionize industries from insurance to entertainment law by

0:38:06.520 --> 0:38:09.560
<v Speaker 2>using an open source community approach that makes it easier

0:38:09.600 --> 0:38:12.840
<v Speaker 2>for more people from all backgrounds to fine tune models

0:38:13.120 --> 0:38:19.680
<v Speaker 2>for specific purposes, ultimately making AI more accessible and impactful

0:38:20.280 --> 0:38:24.279
<v Speaker 2>than ever. Looking ahead, the future of AI isn't just

0:38:24.320 --> 0:38:29.400
<v Speaker 2>about technological efficiency. It's about enhancing our everyday experiences in

0:38:29.480 --> 0:38:33.440
<v Speaker 2>ways that were never possible before, like streamlining work for

0:38:33.520 --> 0:38:38.040
<v Speaker 2>doctors to improve the patient experience or assisting insurance agents

0:38:38.360 --> 0:38:43.120
<v Speaker 2>to improve the claims experience. Instruct Lab is paving the

0:38:43.120 --> 0:38:47.680
<v Speaker 2>way for more open, accessible AI future, one that's built

0:38:47.760 --> 0:38:54.320
<v Speaker 2>on collaboration and humanity. Smart Talks with IBM is produced

0:38:54.320 --> 0:38:59.160
<v Speaker 2>by Matt Romano, Joey Fishground and Jacob Goldstein were edited

0:38:59.280 --> 0:39:02.960
<v Speaker 2>by Lydia gy Caught. Our engineers are Sarah Bruguerer and

0:39:03.040 --> 0:39:07.480
<v Speaker 2>Ben Toliday. Theme song by Gramscow. Special thanks to the

0:39:07.480 --> 0:39:10.320
<v Speaker 2>eight Bar and IBM teams, as well as the Pushkin

0:39:10.440 --> 0:39:13.880
<v Speaker 2>marketing team. Smart Talks with IBM is a production of

0:39:13.920 --> 0:39:18.759
<v Speaker 2>Pushkin Industries and Ruby Studio at iHeartMedia. To find more

0:39:18.800 --> 0:39:24.040
<v Speaker 2>Pushkin podcasts, listen on the iHeartRadio app, Apple Podcasts, or

0:39:24.120 --> 0:39:28.880
<v Speaker 2>wherever you listen to podcasts. I'm Malcolm Gladwell. This is

0:39:28.920 --> 0:39:33.120
<v Speaker 2>a paid advertisement from IBM. The conversations on this podcast

0:39:33.360 --> 0:39:48.120
<v Speaker 2>don't necessarily represent IBM's positions, strategies, or opinions.