WEBVTT - Smart Talks with IBM: How open source can democratize AI

0:00:00.160 --> 0:00:02.840
<v Speaker 1>Hey everyone, it's Robert and Joe here. Today we've got

0:00:02.880 --> 0:00:04.760
<v Speaker 1>something a little bit different to share with you. It

0:00:04.840 --> 0:00:08.039
<v Speaker 1>is a new season of the Smart Talks with IBM

0:00:08.160 --> 0:00:09.160
<v Speaker 1>podcast series.

0:00:09.640 --> 0:00:11.639
<v Speaker 2>Today we are witnessed to one of those rare moments

0:00:11.680 --> 0:00:14.400
<v Speaker 2>in history, the rise of an innovative technology with the

0:00:14.400 --> 0:00:18.680
<v Speaker 2>potential to radically transform business and society forever. The technology,

0:00:18.760 --> 0:00:22.240
<v Speaker 2>of course, is artificial intelligence, and it's the central focus

0:00:22.239 --> 0:00:24.840
<v Speaker 2>for this new season of Smart Talks with IBM.

0:00:25.320 --> 0:00:28.400
<v Speaker 1>Join hosts from your favorite Pushkin podcasts as they talk

0:00:28.480 --> 0:00:31.680
<v Speaker 1>with industry experts and leaders to explore how businesses can

0:00:31.720 --> 0:00:35.400
<v Speaker 1>integrate AI into their workflows and help drive real change

0:00:35.400 --> 0:00:38.240
<v Speaker 1>in this new era of AI. And of course, host

0:00:38.280 --> 0:00:40.479
<v Speaker 1>Malcolm Gladwell will be there to guide you through the

0:00:40.479 --> 0:00:42.640
<v Speaker 1>season and throw in his two cents as well.

0:00:43.120 --> 0:00:46.200
<v Speaker 2>Look out for new episodes of Smart Talks with IBM

0:00:46.400 --> 0:00:49.559
<v Speaker 2>every other week on the iHeartRadio app, Apple Podcasts, or

0:00:49.600 --> 0:00:53.360
<v Speaker 2>wherever you get your podcasts. And learn more at IBM

0:00:53.479 --> 0:01:03.360
<v Speaker 2>dot com, slash smart Talks, Pushkin.

0:01:08.959 --> 0:01:12.600
<v Speaker 3>Hello, Hello, welcome to Smart Talks with IBM, a podcast

0:01:12.600 --> 0:01:17.840
<v Speaker 3>from Pushkin Industries, iHeartRadio and IBM. I'm Malcolm Gladwell. This

0:01:17.920 --> 0:01:21.240
<v Speaker 3>season We're diving back into the world of artificial intelligence,

0:01:21.520 --> 0:01:24.920
<v Speaker 3>but with a focus on the powerful concept of open

0:01:25.280 --> 0:01:30.440
<v Speaker 3>its possibilities, implications, and misconceptions. We'll look at openness from

0:01:30.520 --> 0:01:33.520
<v Speaker 3>a variety of angles and explore how the concept is

0:01:33.600 --> 0:01:37.200
<v Speaker 3>already reshaping industries, ways of doing business, and a very

0:01:37.280 --> 0:01:41.039
<v Speaker 3>notion of what's possible. In today's episode, I sat down

0:01:41.040 --> 0:01:44.880
<v Speaker 3>with mo Duffy, software engineering manager at red Hat, who

0:01:44.920 --> 0:01:48.600
<v Speaker 3>works on instruct Lab, a project co developed by red

0:01:48.600 --> 0:01:52.360
<v Speaker 3>Hat and IBM. Most shared with me how this new

0:01:52.440 --> 0:01:57.680
<v Speaker 3>initiative is revolutionizing AI training, making it not only more accessible,

0:01:57.880 --> 0:02:02.200
<v Speaker 3>but also more inclusive. In this project, unique in the industry,

0:02:02.400 --> 0:02:07.760
<v Speaker 3>allows developers to submit incremental contributions to one base AI model,

0:02:08.040 --> 0:02:12.240
<v Speaker 3>creating a continuous loop of development, much like normal open

0:02:12.280 --> 0:02:17.639
<v Speaker 3>source software. By leveraging community contributions and IBM's cutting edge

0:02:17.680 --> 0:02:20.840
<v Speaker 3>granite models, Mo in the team of ibmrs and red

0:02:20.840 --> 0:02:24.359
<v Speaker 3>hatters are paving the way for a future where AI

0:02:24.480 --> 0:02:28.880
<v Speaker 3>development is a communal endeavor. Our insights into open source

0:02:28.919 --> 0:02:33.959
<v Speaker 3>software extend beyond technical proficiency to the profound impact of

0:02:34.120 --> 0:02:37.720
<v Speaker 3>collaborative effort. At the heart of Moe's work is a

0:02:37.720 --> 0:02:42.519
<v Speaker 3>belief in democratizing technology, ensuring that AI becomes a tool

0:02:42.919 --> 0:02:47.320
<v Speaker 3>accessible to all. So let's explore how MOE, red Hat

0:02:47.360 --> 0:02:51.880
<v Speaker 3>and IBM are empowering individuals and businesses alike to reshape

0:02:51.880 --> 0:03:01.600
<v Speaker 3>the future of technology through collaboration and innovation. MO, thank

0:03:01.639 --> 0:03:03.800
<v Speaker 3>you for joining me today. Thank you so much, for

0:03:04.040 --> 0:03:08.480
<v Speaker 3>I have just about the most Irish name ever. I

0:03:08.560 --> 0:03:12.040
<v Speaker 3>do very proudure you weren't born in Ireland.

0:03:12.240 --> 0:03:13.239
<v Speaker 4>No, my grandparents.

0:03:13.320 --> 0:03:15.200
<v Speaker 3>Oh your grandparents, So I see, where did you grow up?

0:03:15.600 --> 0:03:16.639
<v Speaker 4>New York Queens?

0:03:16.960 --> 0:03:20.200
<v Speaker 3>Oh you're la see. So tell me a little bit

0:03:20.200 --> 0:03:22.960
<v Speaker 3>about how how you got to red hat. What was

0:03:22.960 --> 0:03:23.480
<v Speaker 3>your path?

0:03:23.960 --> 0:03:26.480
<v Speaker 4>When I was in high school, it was a chatty girl,

0:03:26.800 --> 0:03:29.280
<v Speaker 4>teenage girl on the phone. We had one phone line.

0:03:29.639 --> 0:03:32.440
<v Speaker 4>My older brother was studying at the local state college

0:03:32.440 --> 0:03:34.720
<v Speaker 4>computer science, and he had to tell that end to

0:03:35.040 --> 0:03:37.920
<v Speaker 4>compile his homework. One phone line, and I'm on it

0:03:37.960 --> 0:03:40.920
<v Speaker 4>all the time. He got very frustrated and he needed

0:03:40.920 --> 0:03:43.480
<v Speaker 4>a compiler to do his homework. So he bought red

0:03:43.480 --> 0:03:47.920
<v Speaker 4>Hat Linux from a CompUSA, brought it home and that

0:03:48.040 --> 0:03:49.960
<v Speaker 4>was on the family computer. So I learned Linux and

0:03:50.000 --> 0:03:52.720
<v Speaker 4>I started playing around with it. I really liked it

0:03:52.760 --> 0:03:56.680
<v Speaker 4>because you could customize everything, like the entire user interface.

0:03:57.040 --> 0:03:59.200
<v Speaker 4>You could actually modify the code of the programs you

0:03:59.240 --> 0:04:01.480
<v Speaker 4>were using to do what you wanted. And for me,

0:04:01.560 --> 0:04:03.760
<v Speaker 4>it was really cool because especially when you're a kid

0:04:03.800 --> 0:04:05.560
<v Speaker 4>and like people tell you this is the way things

0:04:05.560 --> 0:04:07.320
<v Speaker 4>are and you just have to deal with it, it's

0:04:07.400 --> 0:04:08.920
<v Speaker 4>nice to be like, I'm going to make things the

0:04:09.000 --> 0:04:12.280
<v Speaker 4>way I want, modify the code and playing. Yeah, it

0:04:12.360 --> 0:04:14.200
<v Speaker 4>was amazing and it was just such a time and

0:04:14.240 --> 0:04:17.240
<v Speaker 4>like before it was cool, I was doing it and

0:04:18.040 --> 0:04:20.000
<v Speaker 4>what I saw on that is sort of the potential

0:04:20.040 --> 0:04:23.080
<v Speaker 4>like number one of like a community of people working together.

0:04:23.080 --> 0:04:26.160
<v Speaker 4>And like the Internet existed, it was slow, it involved modems,

0:04:26.560 --> 0:04:28.480
<v Speaker 4>but there were people that you could talk to who

0:04:28.520 --> 0:04:32.279
<v Speaker 4>would give you tips and you'd share information, and this

0:04:32.400 --> 0:04:35.320
<v Speaker 4>collaborative building something together is really something special.

0:04:35.440 --> 0:04:35.640
<v Speaker 2>Right.

0:04:36.000 --> 0:04:39.640
<v Speaker 4>I could file a complaint to whatever large software company

0:04:39.680 --> 0:04:42.080
<v Speaker 4>made whatever software I was into, or I could go

0:04:42.080 --> 0:04:44.279
<v Speaker 4>to an open source software community and be like, hey, guys,

0:04:44.320 --> 0:04:46.719
<v Speaker 4>I think we should do this. I'm like, yeah, okay,

0:04:46.720 --> 0:04:48.880
<v Speaker 4>I'll help. I'll pitch in. So you don't feel powerless,

0:04:48.880 --> 0:04:50.240
<v Speaker 4>you feel like you can have an impact, and that

0:04:50.320 --> 0:04:54.200
<v Speaker 4>was really exciting to me. However, open source software has

0:04:54.360 --> 0:04:57.599
<v Speaker 4>a reputation for not having the best user interface, not

0:04:57.640 --> 0:05:02.159
<v Speaker 4>the best user experience. So I ended up studying Computer

0:05:02.200 --> 0:05:05.480
<v Speaker 4>science and Electronic media dual major, and then I did

0:05:05.560 --> 0:05:09.440
<v Speaker 4>human computeraction as my master's And my thought was, wouldn't

0:05:09.440 --> 0:05:13.120
<v Speaker 4>it be nice if this free software accessible to anybody,

0:05:13.440 --> 0:05:15.440
<v Speaker 4>if it was easier to use, some more people could

0:05:15.520 --> 0:05:18.160
<v Speaker 4>use it and take advantage of it. And so, long

0:05:18.200 --> 0:05:21.479
<v Speaker 4>story short, I ended up going to Red Hat saying, Hey,

0:05:21.480 --> 0:05:22.880
<v Speaker 4>I want to learn how you guys work. Let me

0:05:22.920 --> 0:05:25.719
<v Speaker 4>embed in your team draft out of my graduate program.

0:05:25.760 --> 0:05:27.520
<v Speaker 4>And I'm like, I want to do this for a living.

0:05:27.640 --> 0:05:30.240
<v Speaker 4>This is cooler. So I thought this is the way

0:05:30.279 --> 0:05:31.920
<v Speaker 4>to go, and I've been there ever since. They haven't

0:05:31.920 --> 0:05:32.760
<v Speaker 4>been able to get rid of me.

0:05:34.839 --> 0:05:37.560
<v Speaker 3>To backtrack just a little bit, you were talking about

0:05:37.600 --> 0:05:41.280
<v Speaker 3>the sense of community that surrounds this way of thinking

0:05:41.320 --> 0:05:45.039
<v Speaker 3>about software. Talk a little bit more about what that

0:05:45.080 --> 0:05:47.840
<v Speaker 3>community is like, the benefits of that community, why it

0:05:47.920 --> 0:05:48.720
<v Speaker 3>appeals to you.

0:05:49.200 --> 0:05:51.520
<v Speaker 4>Sure, well, you know part of the reason I actually

0:05:51.640 --> 0:05:54.720
<v Speaker 4>ended up going to the graduate school track. Suddenly you're

0:05:54.720 --> 0:05:57.920
<v Speaker 4>a peer of your professors and you're working side by

0:05:58.000 --> 0:06:01.040
<v Speaker 4>side with them. At some point they retire and you're

0:06:01.040 --> 0:06:04.599
<v Speaker 4>in the next generation. So it's sharing information, building on

0:06:04.640 --> 0:06:07.320
<v Speaker 4>the work of others in sort of this cycle that

0:06:07.520 --> 0:06:11.880
<v Speaker 4>extends past human lifespan and in the same way, like

0:06:11.960 --> 0:06:15.359
<v Speaker 4>the open source model is very similar, but you're actually

0:06:15.440 --> 0:06:18.080
<v Speaker 4>you're building something, and it's something in me. I'm just

0:06:18.440 --> 0:06:20.840
<v Speaker 4>really attracted, Like I don't like talking about stuff. I

0:06:21.000 --> 0:06:24.960
<v Speaker 4>like doing stuff with open source software. The software doesn't

0:06:24.960 --> 0:06:28.599
<v Speaker 4>cost anything, the code is out there, generally uses open

0:06:28.640 --> 0:06:32.039
<v Speaker 4>standards for the file formats. I can open up files

0:06:32.080 --> 0:06:34.680
<v Speaker 4>that I created and open source tools as a high

0:06:34.720 --> 0:06:38.279
<v Speaker 4>school student today because they were using open formats and

0:06:38.320 --> 0:06:40.800
<v Speaker 4>that software still exists. I can still compile the code

0:06:40.880 --> 0:06:44.080
<v Speaker 4>and it's an active community project. Like these things can

0:06:44.120 --> 0:06:46.800
<v Speaker 4>outlast any single company in the same way that the

0:06:46.839 --> 0:06:49.400
<v Speaker 4>academic community has been going on for so many years

0:06:49.440 --> 0:06:51.960
<v Speaker 4>and hopefully we'll continue moving on. So it's sort of

0:06:52.040 --> 0:06:54.919
<v Speaker 4>like not just the community around it, but just the

0:06:55.000 --> 0:06:58.119
<v Speaker 4>knowledge sharing and also bringing up the next generation as well.

0:06:58.160 --> 0:06:59.680
<v Speaker 4>Like all of that stuff really appealed to me. And

0:06:59.760 --> 0:07:02.440
<v Speaker 4>also so at the center of it the fact that

0:07:02.880 --> 0:07:05.960
<v Speaker 4>we could democratize it by following this open source process

0:07:06.320 --> 0:07:08.320
<v Speaker 4>and feel like we have some control. We're not at

0:07:08.360 --> 0:07:11.400
<v Speaker 4>the mercy of some faceless corporation making changes and we

0:07:11.440 --> 0:07:13.520
<v Speaker 4>have no impact. Like that really appealed to me too.

0:07:14.320 --> 0:07:19.400
<v Speaker 3>For those of us who are not software phisionados, take

0:07:19.400 --> 0:07:23.800
<v Speaker 3>a step backwards and give me a kind of description

0:07:23.920 --> 0:07:26.800
<v Speaker 3>of terms. What's the opposite of open source proprietary?

0:07:26.880 --> 0:07:28.360
<v Speaker 4>Proprietary is what we say.

0:07:28.280 --> 0:07:32.679
<v Speaker 3>So specifically and practically, the difference would be what between

0:07:32.720 --> 0:07:34.840
<v Speaker 3>something that was opens us in something that was proprietary.

0:07:34.920 --> 0:07:37.520
<v Speaker 4>Sure, so there's a lot of difference. So with open

0:07:37.560 --> 0:07:41.520
<v Speaker 4>source software you get these rights when you're given the software,

0:07:41.600 --> 0:07:44.000
<v Speaker 4>you get the right to be able to share it.

0:07:44.080 --> 0:07:46.520
<v Speaker 4>And depending on the lot, different licenses that are considered

0:07:46.520 --> 0:07:49.520
<v Speaker 4>open source have different little things that you have to

0:07:49.520 --> 0:07:55.320
<v Speaker 4>be aware of. With proprietary code, it's one copyright the company.

0:07:55.360 --> 0:07:57.720
<v Speaker 4>Even a lot of times, when you sign your employment

0:07:57.720 --> 0:08:00.160
<v Speaker 4>contract for a software company and you write code for them,

0:08:00.360 --> 0:08:02.000
<v Speaker 4>you don't own it. You sign over your rights to

0:08:02.040 --> 0:08:03.960
<v Speaker 4>the company. So if you leave the company, the code

0:08:03.960 --> 0:08:06.280
<v Speaker 4>doesn't go with you. It stays in the ownership of

0:08:06.280 --> 0:08:08.920
<v Speaker 4>that company. So then one like one company buys out

0:08:08.960 --> 0:08:11.080
<v Speaker 4>another and kills a product, that code's gone.

0:08:11.240 --> 0:08:15.160
<v Speaker 3>It's gone for a business, Why would a business want

0:08:15.240 --> 0:08:19.040
<v Speaker 3>to be have open source code as opposed to a proprietary.

0:08:18.400 --> 0:08:20.480
<v Speaker 4>Well for the same reasons. Like, say you're a business.

0:08:20.960 --> 0:08:24.840
<v Speaker 4>You've invested all this money into this software platform, right

0:08:25.320 --> 0:08:28.440
<v Speaker 4>and you've upskilled your employees on it, and it's a

0:08:28.480 --> 0:08:31.320
<v Speaker 4>core part of your business, and then a few years

0:08:31.360 --> 0:08:34.679
<v Speaker 4>later that company goes out of business or something happens,

0:08:34.840 --> 0:08:38.199
<v Speaker 4>or even something less drastic. You really need this future,

0:08:38.520 --> 0:08:41.640
<v Speaker 4>But for the company that makes the software, it's not

0:08:41.640 --> 0:08:44.480
<v Speaker 4>in their best interests. It's not worth the investment. They're

0:08:44.520 --> 0:08:46.440
<v Speaker 4>not going to do it. How do you get that future?

0:08:46.800 --> 0:08:49.320
<v Speaker 4>You either have to completely migrate to another solution, and

0:08:49.360 --> 0:08:51.280
<v Speaker 4>this is something it's core at your business, that's going

0:08:51.320 --> 0:08:53.439
<v Speaker 4>to be a big deal to migrate. But if it's

0:08:53.480 --> 0:08:57.240
<v Speaker 4>open source, you could either hire a team of experts.

0:08:57.440 --> 0:08:59.880
<v Speaker 4>You could hire software engineers who are able to go

0:09:00.120 --> 0:09:03.319
<v Speaker 4>do this for you. Go in the upstream software community,

0:09:03.800 --> 0:09:06.800
<v Speaker 4>implement the feature that you want, and it'll be rolled

0:09:06.840 --> 0:09:09.400
<v Speaker 4>into the next version of that company software. So even

0:09:09.440 --> 0:09:12.559
<v Speaker 4>if that company didn't want to implement the feature, if

0:09:12.600 --> 0:09:15.640
<v Speaker 4>they did it open source, they would inherit that feature

0:09:15.679 --> 0:09:18.160
<v Speaker 4>from the upstream community, is what we call it. So

0:09:18.240 --> 0:09:20.880
<v Speaker 4>you have some control over the situation. If it's open source,

0:09:20.960 --> 0:09:25.120
<v Speaker 4>you have an opportunity to actually affect change in the product,

0:09:25.400 --> 0:09:27.400
<v Speaker 4>and you could then pick it up or pay somebody

0:09:27.400 --> 0:09:29.520
<v Speaker 4>else to pick it up, or another company could form

0:09:29.559 --> 0:09:32.160
<v Speaker 4>and pick it up and keep it going. So there's

0:09:32.160 --> 0:09:34.640
<v Speaker 4>more possibilities. If it's open source, it's more like it's

0:09:34.679 --> 0:09:36.240
<v Speaker 4>like an insurance policy almost.

0:09:36.280 --> 0:09:39.840
<v Speaker 3>So innovation from the standpoint of the customer, innovation is

0:09:39.880 --> 0:09:42.880
<v Speaker 3>a lot easier when you're working in an open source environment.

0:09:43.040 --> 0:09:43.679
<v Speaker 4>Absolutely.

0:09:44.080 --> 0:09:47.840
<v Speaker 3>Yeah. So now at RedHat you're working with something called

0:09:47.840 --> 0:09:50.560
<v Speaker 3>instruct lab. Tell us a little bit about what that is.

0:09:51.040 --> 0:09:53.280
<v Speaker 4>So the thing that really excites me about getting to

0:09:53.320 --> 0:09:55.680
<v Speaker 4>work on this project is AI is sort of that

0:09:55.760 --> 0:09:58.920
<v Speaker 4>has been this scary thing for me because it's one

0:09:58.920 --> 0:10:02.040
<v Speaker 4>of those things like in order to be able to

0:10:02.120 --> 0:10:07.239
<v Speaker 4>pre train a model, you have to have unobtainium GPS,

0:10:07.920 --> 0:10:12.200
<v Speaker 4>you have to have rich resources, It takes months, it

0:10:12.240 --> 0:10:17.200
<v Speaker 4>takes expertise. There's a small handful of companies that can

0:10:17.240 --> 0:10:21.240
<v Speaker 4>build a model from pre train to something usable, and

0:10:21.480 --> 0:10:23.800
<v Speaker 4>it kind of feels like those early days when I

0:10:23.920 --> 0:10:26.640
<v Speaker 4>was kind of delving in software in the same way.

0:10:26.679 --> 0:10:30.000
<v Speaker 4>I think if more people could contribute to AI models,

0:10:30.800 --> 0:10:34.400
<v Speaker 4>then it wouldn't be just influenced by whichever company had

0:10:34.440 --> 0:10:37.679
<v Speaker 4>the resources to build it. And there's been a lot

0:10:37.679 --> 0:10:41.040
<v Speaker 4>of emphasis on pre training models, so taking massive terabytes

0:10:41.160 --> 0:10:45.120
<v Speaker 4>data sets, throwing them through masses of GPS over months

0:10:45.120 --> 0:10:48.440
<v Speaker 4>of time, spending hundreds of millions of dollars to build

0:10:48.480 --> 0:10:51.320
<v Speaker 4>a base model. But when instruct lab does is say okay,

0:10:51.559 --> 0:10:54.440
<v Speaker 4>you have a base model. We're going to fine tune in.

0:10:54.480 --> 0:10:57.920
<v Speaker 4>On the other end, it takes less compute resources. The

0:10:57.960 --> 0:11:00.240
<v Speaker 4>way we've built in struck lab, you can play around

0:11:00.280 --> 0:11:02.760
<v Speaker 4>with the technology and learn it on it off the

0:11:02.760 --> 0:11:06.200
<v Speaker 4>shelf laptop that you can actually buy. So in this

0:11:06.320 --> 0:11:10.040
<v Speaker 4>way we're enabling a much broader set of people to

0:11:10.160 --> 0:11:12.839
<v Speaker 4>play with AI, to contribute it, to modify it. And

0:11:12.880 --> 0:11:15.840
<v Speaker 4>I'll tell you one story from red Hat Succi, who

0:11:15.920 --> 0:11:20.360
<v Speaker 4>is our chief diversity officer, very interested in inclusive language

0:11:20.360 --> 0:11:23.280
<v Speaker 4>and open source software, doesn't have any experience with AI.

0:11:24.040 --> 0:11:26.079
<v Speaker 4>We have a community model that we have an upstream

0:11:26.080 --> 0:11:28.680
<v Speaker 4>project around for people to contribute knowledge and skills to

0:11:28.720 --> 0:11:30.840
<v Speaker 4>the model. She's like, I want to teach the model

0:11:31.160 --> 0:11:34.120
<v Speaker 4>how to use inclusive language, like replace this word with

0:11:34.160 --> 0:11:35.959
<v Speaker 4>this word or this word with this word. OHM Like, oh,

0:11:36.000 --> 0:11:38.880
<v Speaker 4>that's so cool. So She paired up with Nicholas who

0:11:38.960 --> 0:11:41.760
<v Speaker 4>is a technical guy at red Hat, and they built

0:11:41.880 --> 0:11:44.920
<v Speaker 4>and submitted a skill to the model that you can

0:11:45.000 --> 0:11:47.160
<v Speaker 4>just tell the model, can you please take this document

0:11:47.240 --> 0:11:49.679
<v Speaker 4>and translate this language to more inclusive language and it

0:11:49.720 --> 0:11:52.120
<v Speaker 4>will do it. And they submitted it to the community.

0:11:52.200 --> 0:11:53.920
<v Speaker 4>They were so proud. It was like, that's the kind

0:11:53.960 --> 0:11:56.360
<v Speaker 4>of thing that, like, you know, maybe a company would

0:11:56.400 --> 0:11:58.840
<v Speaker 4>be incentivized to do that, but if you have some

0:11:58.960 --> 0:12:02.560
<v Speaker 4>tooling that's open source and something that anybody could access,

0:12:02.720 --> 0:12:05.320
<v Speaker 4>than those communities could actually get together and build that

0:12:05.400 --> 0:12:06.640
<v Speaker 4>knowledge into AI models.

0:12:06.880 --> 0:12:11.400
<v Speaker 3>Just so understand, what you guys have is the structure

0:12:11.480 --> 0:12:15.760
<v Speaker 3>for an AI system, And in other cases, individual companies

0:12:15.920 --> 0:12:19.880
<v Speaker 3>own and train their own AI systems. It takes enormous

0:12:19.880 --> 0:12:22.920
<v Speaker 3>amount of resources. They hoover up all kinds of information,

0:12:23.480 --> 0:12:26.640
<v Speaker 3>train it according to their own hidden set of rules,

0:12:26.720 --> 0:12:31.160
<v Speaker 3>and then a customer might use that for some price.

0:12:31.520 --> 0:12:33.440
<v Speaker 3>What you're saying is, in the same way that we

0:12:33.520 --> 0:12:37.360
<v Speaker 3>democratize the writing of software before, let's democratize the training

0:12:37.400 --> 0:12:41.040
<v Speaker 3>of an AI system. So anyone can contribute here and

0:12:41.480 --> 0:12:45.160
<v Speaker 3>teach the model the things that they're interested in teaching

0:12:45.160 --> 0:12:48.079
<v Speaker 3>the model. I'm guessing correct me. On the one hand,

0:12:48.559 --> 0:12:50.680
<v Speaker 3>this model, at least in the beginning, is going to

0:12:50.679 --> 0:12:53.840
<v Speaker 3>have a lot fewer resources available to it. But on

0:12:53.880 --> 0:12:55.640
<v Speaker 3>the other hand, it's going to have a much more

0:12:56.000 --> 0:12:58.000
<v Speaker 3>diverse set of inputs.

0:12:58.440 --> 0:13:01.520
<v Speaker 4>That's right. And the other thing is that IBM, basically

0:13:01.600 --> 0:13:04.280
<v Speaker 4>is part of this project, has something called the Granite

0:13:04.320 --> 0:13:07.480
<v Speaker 4>Model family, and they've donated some granite models. So these

0:13:07.520 --> 0:13:10.200
<v Speaker 4>are the ones that take the months and terabytes of

0:13:10.280 --> 0:13:13.440
<v Speaker 4>data and all the GPUs to train. So IBM has

0:13:13.480 --> 0:13:16.760
<v Speaker 4>created one of those, and they have listed out and

0:13:16.800 --> 0:13:18.959
<v Speaker 4>linked to the data sets that they used, and they

0:13:19.000 --> 0:13:21.920
<v Speaker 4>talk about the relative proportions they used when pre training,

0:13:22.280 --> 0:13:24.280
<v Speaker 4>so it's not just the black box. You know where

0:13:24.320 --> 0:13:27.160
<v Speaker 4>the data came from, which is a pretty open position

0:13:27.200 --> 0:13:29.760
<v Speaker 4>to take. That is what we recommend as the base.

0:13:29.840 --> 0:13:32.440
<v Speaker 4>So you use the instruct lab tuning. You take this

0:13:32.520 --> 0:13:35.480
<v Speaker 4>base granite model that IBM has provided, and you use

0:13:35.480 --> 0:13:37.920
<v Speaker 4>the instruct lab tooling that red Hat works on, and

0:13:37.960 --> 0:13:40.280
<v Speaker 4>you use that to fine tune the model to make

0:13:40.320 --> 0:13:41.960
<v Speaker 4>it whatever you want.

0:13:42.480 --> 0:13:45.400
<v Speaker 3>I want to go back to the partnership between IBM

0:13:45.480 --> 0:13:49.480
<v Speaker 3>and red Hat here with them providing the granite model

0:13:50.080 --> 0:13:53.320
<v Speaker 3>to your instruct lab Is this the first time red

0:13:53.360 --> 0:13:56.760
<v Speaker 3>hat and IBM have collaborated like this, I think it's.

0:13:56.600 --> 0:13:59.920
<v Speaker 4>Something that's been going on. Like another a product within

0:14:00.080 --> 0:14:02.560
<v Speaker 4>the red hat family would be open Shift AI, where

0:14:02.559 --> 0:14:06.199
<v Speaker 4>they collaborate a lot with IBM Research team, Like BLM

0:14:06.280 --> 0:14:08.240
<v Speaker 4>is one of the components of that product that there's

0:14:08.440 --> 0:14:13.160
<v Speaker 4>a nice kind of exchange and collaboration between the two companies.

0:14:13.920 --> 0:14:16.559
<v Speaker 3>How large is the potential community of people who might

0:14:16.640 --> 0:14:18.240
<v Speaker 3>contribute to instruct lab.

0:14:19.160 --> 0:14:21.720
<v Speaker 4>It could be thousands of people. I mean, we'll see.

0:14:21.760 --> 0:14:25.520
<v Speaker 4>It's early days. This is early technology that was invented

0:14:25.520 --> 0:14:28.000
<v Speaker 4>at IBM Research that they partnered with us at red

0:14:28.000 --> 0:14:30.840
<v Speaker 4>Hat to kind of build the software around it. There's

0:14:30.880 --> 0:14:33.160
<v Speaker 4>still more to go, Like right now, we have a

0:14:33.240 --> 0:14:35.240
<v Speaker 4>team in the community that's actually trying to build a

0:14:35.280 --> 0:14:38.800
<v Speaker 4>web interface to make it easier for anybody to contribute.

0:14:38.960 --> 0:14:40.480
<v Speaker 4>So we have a lot of those sort of user

0:14:40.520 --> 0:14:43.960
<v Speaker 4>experience for the contributor to the model stuff to work

0:14:44.000 --> 0:14:46.680
<v Speaker 4>out that we're still actively building on. But like my

0:14:46.880 --> 0:14:49.520
<v Speaker 4>vision for it even is I like going back to

0:14:49.520 --> 0:14:52.600
<v Speaker 4>that academic model of learning from what others and building

0:14:52.680 --> 0:14:55.560
<v Speaker 4>upon it over time. It would be very good for

0:14:55.720 --> 0:14:58.560
<v Speaker 4>us to sort of go out and try to collaborate

0:14:58.720 --> 0:15:01.520
<v Speaker 4>with academics of all, like, hey, you know, the model

0:15:01.520 --> 0:15:05.000
<v Speaker 4>doesn't know about your field, would you like to put

0:15:05.040 --> 0:15:07.440
<v Speaker 4>something into the model about your field so it knows

0:15:07.480 --> 0:15:10.520
<v Speaker 4>about it, or even you know, talk to the model

0:15:10.760 --> 0:15:13.160
<v Speaker 4>it got it wrong, let's correct it. Can we lean

0:15:13.200 --> 0:15:15.320
<v Speaker 4>on your expertise to correct it and make sure it

0:15:15.320 --> 0:15:18.400
<v Speaker 4>gets it right and sort of use that community model

0:15:18.440 --> 0:15:22.480
<v Speaker 4>as a way for everybody to collaborate because before instruct lab,

0:15:23.240 --> 0:15:26.680
<v Speaker 4>my understanding is if you wanted to take a model

0:15:26.720 --> 0:15:28.800
<v Speaker 4>that's open source license and play with it, you could

0:15:28.840 --> 0:15:30.520
<v Speaker 4>do that. You could take a model kind of off

0:15:30.520 --> 0:15:33.560
<v Speaker 4>the shelf from Hugging Face and fine tune it yourself.

0:15:33.960 --> 0:15:35.520
<v Speaker 4>But it's a bit of a dead end because you

0:15:35.560 --> 0:15:38.120
<v Speaker 4>made your contributions, but there's no way for other people

0:15:38.600 --> 0:15:41.520
<v Speaker 4>to collaborate with you. So the way that we've built

0:15:41.560 --> 0:15:45.640
<v Speaker 4>this is based on how the technology works. Everybody can

0:15:45.680 --> 0:15:48.040
<v Speaker 4>contribute to it. This is something that you can keep

0:15:48.080 --> 0:15:49.520
<v Speaker 4>growing and growing and growing over time.

0:15:49.840 --> 0:15:53.400
<v Speaker 3>Yeah. Yeah, what's the level of expertise necessary to be

0:15:53.440 --> 0:15:54.160
<v Speaker 3>a contributor?

0:15:54.760 --> 0:15:56.640
<v Speaker 4>You don't need to be a data scientist and you

0:15:56.680 --> 0:15:59.600
<v Speaker 4>don't need to have exotic hardware. Honestly, if you don't

0:15:59.600 --> 0:16:02.520
<v Speaker 4>even have laptop hardware that meets SUSPEC for doing instruct

0:16:02.560 --> 0:16:05.720
<v Speaker 4>labs laptop version. You can submit it to the community

0:16:05.800 --> 0:16:08.320
<v Speaker 4>and then we'll actually build it for you. We have

0:16:08.400 --> 0:16:10.800
<v Speaker 4>bots and stuff that do that, and we're hoping over

0:16:10.840 --> 0:16:13.320
<v Speaker 4>time to make that more accessible, first by having a

0:16:13.440 --> 0:16:16.040
<v Speaker 4>user interface and then maybe later on having a web service.

0:16:16.360 --> 0:16:19.560
<v Speaker 3>Yeah, so give me an example of how a business

0:16:19.680 --> 0:16:21.720
<v Speaker 3>might make use of instruct lab.

0:16:22.280 --> 0:16:24.800
<v Speaker 4>One of the things that businesses are doing with AI

0:16:24.920 --> 0:16:28.600
<v Speaker 4>right now is using hosted API services. They're quite expensive,

0:16:28.880 --> 0:16:31.680
<v Speaker 4>but they're finding value, but it's hard given the amount

0:16:31.720 --> 0:16:34.000
<v Speaker 4>of money they're spending. And one of the things that's

0:16:34.000 --> 0:16:35.840
<v Speaker 4>a little scary about it too, is like you have

0:16:36.200 --> 0:16:40.560
<v Speaker 4>very sensitive internal documents and you have employees maybe not

0:16:40.760 --> 0:16:43.640
<v Speaker 4>understanding what they're actually doing because you know, how would

0:16:43.680 --> 0:16:47.080
<v Speaker 4>you if you're not technical enough when you're asking said

0:16:47.720 --> 0:16:53.440
<v Speaker 4>public web service AI model information about your copy pasting

0:16:53.560 --> 0:16:57.600
<v Speaker 4>internal company documents. It's going across the Internet into another

0:16:57.640 --> 0:17:00.800
<v Speaker 4>company's hands, and that company probably shouldn't have access to that.

0:17:01.280 --> 0:17:04.280
<v Speaker 4>So what both RedHat and IBM in the space are

0:17:04.280 --> 0:17:07.640
<v Speaker 4>looking at, like the instruct lab model is very modest.

0:17:07.640 --> 0:17:11.160
<v Speaker 4>It's seven billion parameter model very small. It's very cheap

0:17:11.200 --> 0:17:14.600
<v Speaker 4>to serve inference on a seven billion parameter model. It's

0:17:14.600 --> 0:17:18.120
<v Speaker 4>competing with trillion parameter models that are hosted. You take

0:17:18.160 --> 0:17:21.000
<v Speaker 4>this small model that is cheap to run inference on,

0:17:21.640 --> 0:17:25.560
<v Speaker 4>you train it with your own company's proprietary data inside

0:17:25.600 --> 0:17:28.159
<v Speaker 4>the walls of your company, on your own hardware. You

0:17:28.200 --> 0:17:31.480
<v Speaker 4>can do all sorts of actual data analysis on your

0:17:31.480 --> 0:17:34.080
<v Speaker 4>most sensitive data and have the confidence that has not

0:17:34.160 --> 0:17:35.080
<v Speaker 4>left the premises.

0:17:35.920 --> 0:17:38.960
<v Speaker 3>In that use case, you're not actually training the model

0:17:39.000 --> 0:17:42.240
<v Speaker 3>for everyone. You're just taking it and doing some private

0:17:42.280 --> 0:17:44.720
<v Speaker 3>stuff on it exactly, which doesn't leave the building. But

0:17:44.800 --> 0:17:49.320
<v Speaker 3>that's separate from an interaction where you're doing something that

0:17:50.000 --> 0:17:51.200
<v Speaker 3>contributes overall.

0:17:51.600 --> 0:17:54.200
<v Speaker 4>Right, And that's something maybe that I should be more

0:17:54.240 --> 0:17:56.480
<v Speaker 4>clear about is there's sort of two tracks here, and

0:17:56.560 --> 0:17:59.840
<v Speaker 4>this is very red hat classic. You have your upstream

0:18:00.040 --> 0:18:02.960
<v Speaker 4>community track and you have your business product tract. So

0:18:03.040 --> 0:18:07.040
<v Speaker 4>the upstream community track is just enabling anybody to contribute

0:18:07.080 --> 0:18:09.000
<v Speaker 4>to a model in a collaborative way and play with it.

0:18:09.440 --> 0:18:13.280
<v Speaker 4>The downstream product business oriented track is now take that

0:18:13.400 --> 0:18:18.000
<v Speaker 4>tech that we've honed and developed in the open community

0:18:18.680 --> 0:18:21.160
<v Speaker 4>and apply it to your business knowledge and skills.

0:18:22.200 --> 0:18:26.040
<v Speaker 3>This community driven approach marks a pivotal shift towards more

0:18:26.040 --> 0:18:31.840
<v Speaker 3>accessible AI solutions. The contrast between externally hosted AI services

0:18:32.119 --> 0:18:35.320
<v Speaker 3>and the open model enhanced by instruct lab underscores the

0:18:35.320 --> 0:18:40.440
<v Speaker 3>potential for broader adoption of AI in diverse business contexts.

0:18:40.920 --> 0:18:44.320
<v Speaker 3>She envisions a future in which technological innovation is more

0:18:44.359 --> 0:18:48.679
<v Speaker 3>tailored to individual business needs, guided by principles of openness

0:18:48.800 --> 0:18:53.679
<v Speaker 3>and security. To an imaginary case study, Sure, I'm a

0:18:53.760 --> 0:18:58.520
<v Speaker 3>law firm, I'm an entertainment law I have one hundred

0:18:58.560 --> 0:19:02.879
<v Speaker 3>clients who are big stars. They all have incredibly complicated contracts.

0:19:03.840 --> 0:19:08.280
<v Speaker 3>I feed a thousand of my company's contracts from the

0:19:08.359 --> 0:19:12.200
<v Speaker 3>last ten years into the model, and then every time

0:19:12.240 --> 0:19:14.800
<v Speaker 3>I have a new contract, I ask the model, am

0:19:14.840 --> 0:19:17.520
<v Speaker 3>I missing something? Can you go back and look through

0:19:17.560 --> 0:19:19.919
<v Speaker 3>all our own contracts and show me a contract that

0:19:20.200 --> 0:19:23.439
<v Speaker 3>is missing key components or exposes us to some liability.

0:19:24.320 --> 0:19:27.960
<v Speaker 3>In that case, the model would know my law firm

0:19:28.200 --> 0:19:31.560
<v Speaker 3>contracts really, really well. It's as if they've been working

0:19:32.080 --> 0:19:35.480
<v Speaker 3>out my law firm. They're not distracted by other people's

0:19:35.520 --> 0:19:41.240
<v Speaker 3>particular styles or a bunch of contracts from the utility industry,

0:19:41.400 --> 0:19:46.040
<v Speaker 3>or the They know entertainment law contracts exactly.

0:19:46.160 --> 0:19:48.000
<v Speaker 4>Yeah, and you can train it in your own image,

0:19:48.040 --> 0:19:51.800
<v Speaker 4>your style of doing things. It's something that your company

0:19:51.880 --> 0:19:55.200
<v Speaker 4>can produce that is uniquely helpful to you. No third

0:19:55.240 --> 0:19:57.720
<v Speaker 4>party could do that because no third party understands how

0:19:57.720 --> 0:20:01.200
<v Speaker 4>you do business and understands your his street in your documents.

0:20:01.520 --> 0:20:04.120
<v Speaker 4>So it's sort of a way of getting value out

0:20:04.119 --> 0:20:06.000
<v Speaker 4>of the stuff you already have sitting in a file

0:20:06.040 --> 0:20:08.200
<v Speaker 4>cabinet somewhere. It's very cool.

0:20:08.480 --> 0:20:11.480
<v Speaker 3>Yeah, give me a sort of a real world case

0:20:11.480 --> 0:20:14.720
<v Speaker 3>study where you think the business use case would be

0:20:14.720 --> 0:20:19.320
<v Speaker 3>really powerful. What's a business that really could see an

0:20:19.359 --> 0:20:23.360
<v Speaker 3>advantage to using instruct lab in its way.

0:20:23.840 --> 0:20:26.119
<v Speaker 4>The demo that I've given a couple of times at

0:20:26.119 --> 0:20:29.680
<v Speaker 4>different events used an imaginary insurance company. So you say,

0:20:29.720 --> 0:20:33.639
<v Speaker 4>you have this company, you have to recommend repairs for

0:20:33.720 --> 0:20:37.159
<v Speaker 4>various types of claims. You've been doing this for years,

0:20:37.200 --> 0:20:40.040
<v Speaker 4>you know. If you know the windshield's broken and you've

0:20:40.080 --> 0:20:42.920
<v Speaker 4>gotten this type of accident and it's this model car,

0:20:43.119 --> 0:20:44.880
<v Speaker 4>these are the kinds of things you want to look at.

0:20:45.560 --> 0:20:48.359
<v Speaker 4>So you could talk to any insurance agent in the

0:20:48.359 --> 0:20:51.080
<v Speaker 4>field and be like, oh, you know, it's a Tesla.

0:20:51.160 --> 0:20:53.280
<v Speaker 4>You might want to look at the battery or something like.

0:20:53.400 --> 0:20:56.800
<v Speaker 4>They'll have some latent knowledge just so you can take

0:20:56.840 --> 0:20:59.520
<v Speaker 4>that and train it into a model. Honestly, I think

0:20:59.560 --> 0:21:02.840
<v Speaker 4>these kind of new technologies are better when they're less visible.

0:21:03.440 --> 0:21:05.879
<v Speaker 4>So say you have the claims agents in the field

0:21:05.960 --> 0:21:07.960
<v Speaker 4>and they have this tool and they're kind of entering

0:21:07.960 --> 0:21:10.920
<v Speaker 4>the claim data. They're on the scene at the car,

0:21:11.520 --> 0:21:14.159
<v Speaker 4>and it might say, oh, look, I see this is

0:21:14.200 --> 0:21:16.720
<v Speaker 4>a Ford fiesta. These are things you want to look

0:21:16.760 --> 0:21:19.960
<v Speaker 4>at for this type of accident. As you're entering the data,

0:21:20.400 --> 0:21:22.280
<v Speaker 4>it could be going through the knowledge you had loaded

0:21:22.280 --> 0:21:24.760
<v Speaker 4>into the model and be making these suggestions based on

0:21:24.800 --> 0:21:27.719
<v Speaker 4>your company's background, and hey, you know, let's not make

0:21:27.760 --> 0:21:30.280
<v Speaker 4>the same mistake twice. Let's make new mistakes and let's

0:21:30.359 --> 0:21:33.240
<v Speaker 4>learn from the stuff we already did. So that's one example,

0:21:33.280 --> 0:21:35.840
<v Speaker 4>but there's so many different industries in ways that this

0:21:35.920 --> 0:21:38.679
<v Speaker 4>could help, and it could make those agents in the

0:21:38.720 --> 0:21:40.200
<v Speaker 4>field more efficient.

0:21:40.920 --> 0:21:43.320
<v Speaker 3>Have you had anyone talk to you about using instruct

0:21:43.400 --> 0:21:44.919
<v Speaker 3>lab in a way that surprised you.

0:21:46.960 --> 0:21:51.600
<v Speaker 4>I mean, some people have done funky things, but sort

0:21:51.640 --> 0:21:54.000
<v Speaker 4>of playing with the skills stuff, that's where I see

0:21:54.040 --> 0:21:57.040
<v Speaker 4>a lot of creativity. The difference between knowledge and skills

0:21:57.080 --> 0:22:00.960
<v Speaker 4>is that knowledge is pretty pretty understandable, right, oh, historical

0:22:00.960 --> 0:22:04.480
<v Speaker 4>insurance claims or you know, legal contracts. Skills are a

0:22:04.520 --> 0:22:07.439
<v Speaker 4>little different. So whenever somebody submits a skill, sometimes it

0:22:07.480 --> 0:22:09.520
<v Speaker 4>tends to be really creative because it's not something that's

0:22:09.560 --> 0:22:12.880
<v Speaker 4>super intuitive. Somebody submitted a skill. I don't know how

0:22:12.920 --> 0:22:15.720
<v Speaker 4>well it worked, but it was like making ASKI art,

0:22:15.920 --> 0:22:18.119
<v Speaker 4>like draw me a I don't know, draw me a

0:22:18.160 --> 0:22:20.040
<v Speaker 4>dog I would do like an ASKI art dog. I mean,

0:22:20.080 --> 0:22:22.600
<v Speaker 4>it's stuff that you can do programmatically. One that was

0:22:22.640 --> 0:22:26.560
<v Speaker 4>actually very very helpful was, you know, take this table

0:22:26.600 --> 0:22:29.720
<v Speaker 4>of data and convert it to this format, like, ooh,

0:22:29.840 --> 0:22:31.440
<v Speaker 4>that's nice. That actually saves me time.

0:22:32.000 --> 0:22:34.520
<v Speaker 3>How far away are we from the day when I

0:22:34.640 --> 0:22:39.320
<v Speaker 3>Malcolm Globwell technology ignore Amus can go home and easily

0:22:39.400 --> 0:22:44.440
<v Speaker 3>interact with instruct lab Maybe a few months, a few months,

0:22:45.560 --> 0:22:46.760
<v Speaker 3>you're gonna say a few years.

0:22:47.400 --> 0:22:49.080
<v Speaker 4>No, I think it'd be a few months.

0:22:49.680 --> 0:22:50.879
<v Speaker 3>Wow, I hope.

0:22:51.560 --> 0:22:53.280
<v Speaker 4>Hey it's power open source innovation.

0:22:53.680 --> 0:22:57.679
<v Speaker 3>Yeah, oh that's really interesting. Yeah, I'm always taken by surprise.

0:22:58.000 --> 0:23:00.640
<v Speaker 3>I'm still thinking in twentieth century terms about how long

0:23:00.680 --> 0:23:03.880
<v Speaker 3>things take, and you're in the twenty second century as

0:23:03.880 --> 0:23:04.240
<v Speaker 3>well as.

0:23:04.119 --> 0:23:04.719
<v Speaker 1>I could tell.

0:23:04.960 --> 0:23:09.240
<v Speaker 4>The instruct lab core invention was invented in a hotel

0:23:09.320 --> 0:23:12.400
<v Speaker 4>room at an AI conference in December with an amazing

0:23:12.440 --> 0:23:15.560
<v Speaker 4>group of IBM research guys December of twenty twenty three.

0:23:15.840 --> 0:23:18.560
<v Speaker 3>Wait, back up, you have to tell the story.

0:23:18.760 --> 0:23:21.760
<v Speaker 4>This group of guys we've been working with, they were

0:23:21.800 --> 0:23:24.159
<v Speaker 4>at this conference together, and it's a really funny story

0:23:24.200 --> 0:23:27.080
<v Speaker 4>because you know, it's hard to get access to GPUs

0:23:27.359 --> 0:23:29.200
<v Speaker 4>and like even you know, you're at IBM and it's

0:23:29.200 --> 0:23:31.960
<v Speaker 4>hard to get access because everybody wants access. They did

0:23:32.000 --> 0:23:34.919
<v Speaker 4>it over Christmas break because nobody was using the cluster

0:23:35.000 --> 0:23:37.119
<v Speaker 4>at the time, and they ran all of these experiments

0:23:37.119 --> 0:23:38.960
<v Speaker 4>and I'm like, whoa, this is really cool.

0:23:39.359 --> 0:23:43.320
<v Speaker 3>And wait. Their idea was we can do a stripped

0:23:43.320 --> 0:23:48.639
<v Speaker 3>down AI model, and was the idea and even back

0:23:48.680 --> 0:23:51.399
<v Speaker 3>then combine it with grantede, what was the original idea?

0:23:51.440 --> 0:23:54.880
<v Speaker 4>The original idea, it's sort of multi there's like multiple

0:23:54.920 --> 0:23:57.520
<v Speaker 4>aspects to it. So like one of the aspects it

0:23:57.560 --> 0:23:59.720
<v Speaker 4>actually came on later, but it starts at the beginning

0:23:59.760 --> 0:24:03.879
<v Speaker 4>of the workflow. Is you're using a taxonomy to organize

0:24:03.960 --> 0:24:06.440
<v Speaker 4>how you're fine tuning the model. So the old approach

0:24:06.560 --> 0:24:08.800
<v Speaker 4>they call it the blender approach, to just take a

0:24:08.800 --> 0:24:11.439
<v Speaker 4>bunch of data of roughly the type of data that

0:24:11.520 --> 0:24:13.320
<v Speaker 4>you'd like, and you kind of throw it in and

0:24:13.359 --> 0:24:16.080
<v Speaker 4>then see what comes out. Don't like it, Okay, throw

0:24:16.160 --> 0:24:19.160
<v Speaker 4>in more, try again, see what comes out. They had

0:24:19.400 --> 0:24:22.520
<v Speaker 4>used this taxonomy technique, so you actually build like a

0:24:22.560 --> 0:24:25.919
<v Speaker 4>taxonomy of like categories and subfolders of like this is

0:24:25.960 --> 0:24:28.400
<v Speaker 4>the knowledge and skills that we want to train into

0:24:28.440 --> 0:24:32.320
<v Speaker 4>the model. And that way you're sort of systematic about

0:24:32.440 --> 0:24:35.520
<v Speaker 4>what you're adding, and you can also identify gaps pretty easily.

0:24:35.560 --> 0:24:37.280
<v Speaker 4>Oh I don't have a category for that, let me

0:24:37.280 --> 0:24:40.080
<v Speaker 4>add that. So that's like one of the parts of

0:24:40.119 --> 0:24:40.840
<v Speaker 4>the invention here.

0:24:41.680 --> 0:24:46.600
<v Speaker 3>Point number one is let's be intentional and deliberate in

0:24:46.640 --> 0:24:47.880
<v Speaker 3>how we build and train this thing.

0:24:48.119 --> 0:24:51.359
<v Speaker 4>Yeah, and then the next component would be okay, so

0:24:51.680 --> 0:24:54.240
<v Speaker 4>it is actually quite expensive. Part of the expense of

0:24:54.280 --> 0:24:57.800
<v Speaker 4>like tuning models and just training models in general is

0:24:57.840 --> 0:25:01.040
<v Speaker 4>coming up with the data. And what they wanted to

0:25:01.080 --> 0:25:03.199
<v Speaker 4>do is have a technique where you could have just

0:25:03.240 --> 0:25:06.320
<v Speaker 4>a little bit of data and expand it with something

0:25:06.359 --> 0:25:09.760
<v Speaker 4>they're calling synthetic data generation. And this is where it's

0:25:09.760 --> 0:25:13.680
<v Speaker 4>sort of like you have this student and teacher workflow,

0:25:14.320 --> 0:25:19.040
<v Speaker 4>so you have your taxonomy. The taxonomy has sort of

0:25:19.040 --> 0:25:21.959
<v Speaker 4>the knowledge like a business's knowledge documents, their insurance claims,

0:25:22.240 --> 0:25:25.440
<v Speaker 4>and it has these quizzes that you write and that's

0:25:25.520 --> 0:25:27.480
<v Speaker 4>to teach the model. So I'm writing a quiz based

0:25:27.600 --> 0:25:29.200
<v Speaker 4>just like you do in school. You read the chapter

0:25:29.400 --> 0:25:31.480
<v Speaker 4>on the American Revolution, and then you answer a ten

0:25:31.560 --> 0:25:34.840
<v Speaker 4>question quiz where you're giving the model quiz. You need

0:25:34.840 --> 0:25:38.040
<v Speaker 4>at least five questions and answers, and the answers need

0:25:38.080 --> 0:25:40.679
<v Speaker 4>to be taken from the context of the document, and

0:25:40.800 --> 0:25:44.280
<v Speaker 4>then you run it through a process called synthetic data generation,

0:25:44.560 --> 0:25:46.679
<v Speaker 4>and it looks at the documents or look at the

0:25:46.720 --> 0:25:49.840
<v Speaker 4>history chapter. It'll look at the questions and answers, and

0:25:49.880 --> 0:25:52.600
<v Speaker 4>then it'll look to that original document and come up

0:25:52.600 --> 0:25:55.000
<v Speaker 4>with more questions and answers based on the format of

0:25:55.040 --> 0:25:57.640
<v Speaker 4>the questions and answers you made. So you can take

0:25:57.760 --> 0:26:01.320
<v Speaker 4>five questions of answers amplify them into one hundred questions

0:26:01.320 --> 0:26:04.199
<v Speaker 4>and answers, two hundred questions and answers, and it's a

0:26:04.280 --> 0:26:07.439
<v Speaker 4>second model that is making the questions and answers. So

0:26:07.440 --> 0:26:10.479
<v Speaker 4>it's synthetic data generation using an AI model to make

0:26:10.560 --> 0:26:13.520
<v Speaker 4>the questions. We use an open source model to do that.

0:26:14.119 --> 0:26:16.760
<v Speaker 4>So that's the second part. And then the third part

0:26:16.840 --> 0:26:19.760
<v Speaker 4>is we have a multi phase tuning technique to actually

0:26:19.920 --> 0:26:23.440
<v Speaker 4>take the synthetic data and then basically bake it into

0:26:23.480 --> 0:26:26.680
<v Speaker 4>the model. So sort of that's the approach. A general

0:26:26.720 --> 0:26:29.439
<v Speaker 4>philosophy of the approach is using granite because we know

0:26:29.480 --> 0:26:32.240
<v Speaker 4>where the data came from. Another approach is the fact

0:26:32.280 --> 0:26:34.640
<v Speaker 4>that we're using small models that are cheap to run

0:26:34.640 --> 0:26:37.199
<v Speaker 4>inference on. They're small enough that you can tune them

0:26:37.240 --> 0:26:40.040
<v Speaker 4>on laptop hardware. You don't need all the fancy expensive

0:26:40.080 --> 0:26:44.280
<v Speaker 4>GPU menia. You're good. So sort of like a whole system.

0:26:44.359 --> 0:26:47.159
<v Speaker 4>It's like not any one component. But it's sort of

0:26:47.280 --> 0:26:49.800
<v Speaker 4>the approach they took with somewhat novel, and they were

0:26:49.880 --> 0:26:52.800
<v Speaker 4>very excited when they saw the experimental results. There was

0:26:52.840 --> 0:26:55.639
<v Speaker 4>a meeting between red hat and IBM. It was actually

0:26:55.640 --> 0:26:57.960
<v Speaker 4>an IBM research meeting that red hatters were invited to,

0:26:58.720 --> 0:27:00.800
<v Speaker 4>and I think the red Hatter and Voves sort of

0:27:00.840 --> 0:27:05.640
<v Speaker 4>saw the potential, WHOA, we can make models open source finally,

0:27:05.760 --> 0:27:09.320
<v Speaker 4>rather than them just being these endless dead forks, we

0:27:09.359 --> 0:27:12.000
<v Speaker 4>could make it so people could contribute back and collaborate

0:27:12.040 --> 0:27:14.320
<v Speaker 4>around it. So that's when red Hat became interested in

0:27:14.359 --> 0:27:17.840
<v Speaker 4>it and we sort of worked together, and the research

0:27:17.880 --> 0:27:20.560
<v Speaker 4>engineers from IBM Research who came up with the technique,

0:27:20.640 --> 0:27:23.239
<v Speaker 4>and then my team, the software engineers who know how

0:27:23.280 --> 0:27:28.240
<v Speaker 4>to take research code and productize it into actually runnable,

0:27:28.280 --> 0:27:33.200
<v Speaker 4>supportable software, kind of got together. We've been hanging out

0:27:33.200 --> 0:27:35.880
<v Speaker 4>in the Boston office at red Hat and building it out.

0:27:36.240 --> 0:27:39.479
<v Speaker 4>April eighteenth was when we went open source and we

0:27:39.520 --> 0:27:41.959
<v Speaker 4>made all of our repositories with all of the code public,

0:27:42.000 --> 0:27:44.280
<v Speaker 4>and right now we're working towards a product release, so

0:27:44.320 --> 0:27:45.280
<v Speaker 4>a supported product.

0:27:45.400 --> 0:27:47.479
<v Speaker 3>How long did it take you to be convinced of

0:27:48.440 --> 0:27:51.720
<v Speaker 3>the value of this idea? I mean, so people get

0:27:51.720 --> 0:27:55.919
<v Speaker 3>together in this hotel room they're running these experiments over Christmas.

0:27:56.160 --> 0:27:58.440
<v Speaker 3>Are you aware of the experiments as they're running them?

0:27:59.080 --> 0:27:59.199
<v Speaker 2>They?

0:27:59.240 --> 0:28:00.760
<v Speaker 4>Oh, I didn't find out to February.

0:28:02.040 --> 0:28:05.560
<v Speaker 3>They come to you February and they say, MO, can

0:28:05.640 --> 0:28:07.480
<v Speaker 3>you recreate that conversation?

0:28:08.520 --> 0:28:12.960
<v Speaker 4>Well, our CEO, Matt Hicks, and then Jeremy Eater, who's

0:28:12.960 --> 0:28:15.640
<v Speaker 4>one of our distinguished engineers, and Steve Watt, who's a VP,

0:28:15.840 --> 0:28:18.360
<v Speaker 4>were present I think at that meeting. So they kind

0:28:18.400 --> 0:28:20.640
<v Speaker 4>of brought it back to us and said, listen, we've

0:28:20.680 --> 0:28:25.080
<v Speaker 4>invited these IBM research folks to come visit in Boston,

0:28:25.840 --> 0:28:28.280
<v Speaker 4>you know, work with them, like, see, does this have

0:28:28.320 --> 0:28:30.560
<v Speaker 4>any merit? Could we build something from it? And so

0:28:30.600 --> 0:28:33.840
<v Speaker 4>they gave us some presentations. We're very excited. When they

0:28:33.840 --> 0:28:37.200
<v Speaker 4>came to us. It only had support for Mac laptops.

0:28:37.800 --> 0:28:39.880
<v Speaker 4>Of course, you know Red Hat were Linux people, So

0:28:39.960 --> 0:28:41.800
<v Speaker 4>we're like, all right, we've got to fix that. So

0:28:41.960 --> 0:28:44.640
<v Speaker 4>a bunch of the junior engineers around the office kind

0:28:44.680 --> 0:28:46.240
<v Speaker 4>of came and they're like, okay, we're going to build

0:28:46.280 --> 0:28:48.400
<v Speaker 4>Linux support. And they had it within like a couple

0:28:48.400 --> 0:28:51.280
<v Speaker 4>of days. It was crazy because this was just meant

0:28:51.320 --> 0:28:53.840
<v Speaker 4>to be Hey, guys, you know what, these are invited

0:28:53.920 --> 0:28:57.440
<v Speaker 4>guests visiting our office. See what happens. And we ended

0:28:57.480 --> 0:29:00.920
<v Speaker 4>up doing like weeks of hack fe and late night

0:29:00.960 --> 0:29:03.600
<v Speaker 4>pizzas in the conference room and like playing around with

0:29:03.640 --> 0:29:06.560
<v Speaker 4>it and learning and it was it was very fun.

0:29:06.640 --> 0:29:07.360
<v Speaker 4>It's very cool.

0:29:07.480 --> 0:29:08.920
<v Speaker 3>Anyone else do anything like this.

0:29:10.320 --> 0:29:12.560
<v Speaker 4>Is not my understanding that anybody else is doing it,

0:29:12.800 --> 0:29:16.360
<v Speaker 4>yet maybe others will try a lot of the focus

0:29:16.400 --> 0:29:19.960
<v Speaker 4>has been on that pre training phase. But for us,

0:29:20.040 --> 0:29:23.200
<v Speaker 4>again that fine tuning. It's more accessible because you don't

0:29:23.760 --> 0:29:26.360
<v Speaker 4>need all the exotic hardware. It doesn't take months. You

0:29:26.400 --> 0:29:28.240
<v Speaker 4>can do it on a laptop. You can do a

0:29:28.280 --> 0:29:30.880
<v Speaker 4>smoke test version of it in less than an hour.

0:29:31.440 --> 0:29:32.560
<v Speaker 3>What is the word smoke test.

0:29:32.760 --> 0:29:35.160
<v Speaker 4>Smoke test means you're not doing a full fine tuning

0:29:35.200 --> 0:29:38.240
<v Speaker 4>on the model. It's a different tuning process. It's like

0:29:38.320 --> 0:29:40.880
<v Speaker 4>kind of lower quality, so to run on lower grade hardware,

0:29:41.040 --> 0:29:42.640
<v Speaker 4>so you can kind of see them didn't move the

0:29:42.680 --> 0:29:44.200
<v Speaker 4>model or not, but it's not going to give you,

0:29:44.200 --> 0:29:46.800
<v Speaker 4>like the full picture. You need higher end hardware to

0:29:46.840 --> 0:29:48.880
<v Speaker 4>actually do the full thing. So that's what the product

0:29:48.880 --> 0:29:51.720
<v Speaker 4>will enable you to do once it's launched, is you're

0:29:51.720 --> 0:29:53.520
<v Speaker 4>going to need the GPUs, but when you have them,

0:29:53.520 --> 0:29:55.160
<v Speaker 4>will help you make the best usage of them.

0:29:55.440 --> 0:29:58.239
<v Speaker 3>Yeah, yeah, and no, there's all the detail. I want

0:29:58.280 --> 0:30:01.320
<v Speaker 3>to go back to. Sure to run the tests on

0:30:01.360 --> 0:30:07.800
<v Speaker 3>this idea way back when they needed time on the GPUs,

0:30:08.240 --> 0:30:12.360
<v Speaker 3>So this will be the in house IBM and they

0:30:12.400 --> 0:30:15.320
<v Speaker 3>were quiet at Christmas, So how much time would you

0:30:15.400 --> 0:30:18.719
<v Speaker 3>need on the GPUs to kind of get proof of concept?

0:30:19.120 --> 0:30:21.480
<v Speaker 4>Well what happens is and it's sort of like a

0:30:21.480 --> 0:30:23.760
<v Speaker 4>lot of trial and error, right, And there's a lot

0:30:23.800 --> 0:30:27.400
<v Speaker 4>about this stuff that like you come up with the hypothesis,

0:30:27.480 --> 0:30:29.440
<v Speaker 4>you test it out, did it work or not? Okay,

0:30:29.560 --> 0:30:31.600
<v Speaker 4>it's just like you know in the lab, but you know,

0:30:31.760 --> 0:30:35.640
<v Speaker 4>buns and burners and beakers and whatever. So it really depends.

0:30:35.680 --> 0:30:39.040
<v Speaker 4>But it can be hours, it can be days. It

0:30:39.080 --> 0:30:41.120
<v Speaker 4>really depends on what they're trying to do. And then

0:30:41.200 --> 0:30:43.560
<v Speaker 4>sometimes they can cut the time down, you know, with

0:30:43.600 --> 0:30:45.240
<v Speaker 4>the number of GPUs you have. So like I have

0:30:45.240 --> 0:30:48.080
<v Speaker 4>a cluster of agpus, Okay, it might take a day,

0:30:48.160 --> 0:30:50.120
<v Speaker 4>but then if I can get thirty two, I can

0:30:50.120 --> 0:30:51.920
<v Speaker 4>pipeline it and make it go faster and get it

0:30:51.920 --> 0:30:53.959
<v Speaker 4>down to a few hours. So it really depends, you know.

0:30:54.040 --> 0:30:57.120
<v Speaker 4>But it's like everybody's home for the holidays. It's a

0:30:57.160 --> 0:30:59.719
<v Speaker 4>lovely playground to kind of get that stuff going fast.

0:31:00.480 --> 0:31:04.040
<v Speaker 3>Let's jump forward one year. Tell me the status of

0:31:04.080 --> 0:31:07.560
<v Speaker 3>this project, tell me who's using it, tell me how

0:31:07.600 --> 0:31:13.600
<v Speaker 3>big is it. Give me your optimistic but plausible prediction

0:31:13.920 --> 0:31:17.640
<v Speaker 3>about what instruct lab looks like a year from now.

0:31:18.560 --> 0:31:21.960
<v Speaker 4>A year from now, I would like to see kind

0:31:21.960 --> 0:31:28.360
<v Speaker 4>of a vibrant community around not just building knowledge and

0:31:28.400 --> 0:31:32.120
<v Speaker 4>skills into a model, but coming up with better techniques

0:31:32.160 --> 0:31:34.720
<v Speaker 4>and innovation around how we do it. So I'd like

0:31:34.760 --> 0:31:37.880
<v Speaker 4>to see the contributor experience as we grow more and

0:31:37.920 --> 0:31:40.640
<v Speaker 4>more contributors to be refined. So like a year from now,

0:31:40.840 --> 0:31:43.960
<v Speaker 4>Malcolm Gladwell could come impart some of his wisdom into

0:31:43.960 --> 0:31:46.320
<v Speaker 4>the model and it wouldn't be difficult, it wouldn't be

0:31:46.320 --> 0:31:49.240
<v Speaker 4>a big lift. I would love to see the user

0:31:49.240 --> 0:31:53.360
<v Speaker 4>interface tooling for doing that to be more sophisticated. I

0:31:53.360 --> 0:31:56.920
<v Speaker 4>would love to see more people taking this and even

0:31:57.040 --> 0:31:59.240
<v Speaker 4>using it. Maybe you're not sharing it with the community,

0:31:59.280 --> 0:32:02.240
<v Speaker 4>but you're using it for some private usage. Like I'll

0:32:02.240 --> 0:32:05.720
<v Speaker 4>give you an example. I'm in contact with a fellow

0:32:05.840 --> 0:32:08.560
<v Speaker 4>who is doing AI research and he's working with doctors.

0:32:08.600 --> 0:32:11.560
<v Speaker 4>They're GPS in an area of Canada where there's not

0:32:11.680 --> 0:32:14.360
<v Speaker 4>enough GPS for the number of patients, So you know,

0:32:14.480 --> 0:32:18.280
<v Speaker 4>anything you can do to save doctors time to get

0:32:18.360 --> 0:32:20.640
<v Speaker 4>to the next patient. It's like one of the things

0:32:20.640 --> 0:32:23.480
<v Speaker 4>that he has been doing experiments with is can we

0:32:23.640 --> 0:32:27.400
<v Speaker 4>use an open source, licensed model that the doctor can

0:32:27.480 --> 0:32:29.440
<v Speaker 4>run on their laptop so they don't have to worry

0:32:29.440 --> 0:32:31.960
<v Speaker 4>about all of the different privacy rules, Like it's privates

0:32:31.960 --> 0:32:36.040
<v Speaker 4>on the laptop right there, take his live transcription of

0:32:36.040 --> 0:32:39.720
<v Speaker 4>his conversation with the patient, and then convert it automatically

0:32:39.760 --> 0:32:42.120
<v Speaker 4>to a soap format that can be entered in the database.

0:32:42.360 --> 0:32:44.959
<v Speaker 4>Typically this will take a doctor fifteen to twenty minutes

0:32:45.000 --> 0:32:48.720
<v Speaker 4>of paperwork. Why not save them the paperwork at least

0:32:48.760 --> 0:32:50.000
<v Speaker 4>have the model take a stab.

0:32:50.200 --> 0:32:52.800
<v Speaker 3>Does the model then hold on to that information and

0:32:52.200 --> 0:32:54.760
<v Speaker 3>he interacts with the model again when.

0:32:55.040 --> 0:32:57.400
<v Speaker 4>Well, that's the thing not within struct lab. Maybe that

0:32:57.440 --> 0:33:00.440
<v Speaker 4>could be a future development. It doesn't once you're doing it, diference,

0:33:01.120 --> 0:33:03.840
<v Speaker 4>it's not ingesting that what you're saying to it back in.

0:33:04.160 --> 0:33:06.400
<v Speaker 4>It's only the fine tuning phase. So the idea would

0:33:06.440 --> 0:33:10.040
<v Speaker 4>be the doctor could maybe load in past patient data

0:33:10.320 --> 0:33:13.000
<v Speaker 4>as knowledge and then when he's trying to diagnose maybe

0:33:13.160 --> 0:33:15.640
<v Speaker 4>you know what I'm saying. Like, But the main idea

0:33:15.720 --> 0:33:18.120
<v Speaker 4>is somebody might have some private usage. I would love

0:33:18.200 --> 0:33:22.400
<v Speaker 4>to see more usage of this tool to enable people

0:33:22.400 --> 0:33:24.720
<v Speaker 4>who otherwise never would have had access to this type

0:33:24.760 --> 0:33:27.520
<v Speaker 4>of technology who never like you know, a small country

0:33:27.600 --> 0:33:31.760
<v Speaker 4>GP doctors, it doesn't have GPUs. They're not going to

0:33:31.840 --> 0:33:34.000
<v Speaker 4>hire some company to custom build them a model. But

0:33:34.040 --> 0:33:35.840
<v Speaker 4>maybe on the weekend, if he's a techie guy he

0:33:35.880 --> 0:33:37.000
<v Speaker 4>can say with us.

0:33:37.160 --> 0:33:39.440
<v Speaker 3>Well, I mean, the more you talk, the more I'm

0:33:39.480 --> 0:33:43.600
<v Speaker 3>realizing that the simplicity of this model is the killer

0:33:43.640 --> 0:33:46.160
<v Speaker 3>app here. Once you know you can run it on

0:33:46.200 --> 0:33:50.080
<v Speaker 3>a laptop, you have democratized use in a way that's

0:33:50.120 --> 0:33:54.360
<v Speaker 3>inconceivable with some of these other much more complex. But

0:33:54.400 --> 0:33:58.000
<v Speaker 3>that's interesting because one would have thought intuitively that at

0:33:58.040 --> 0:34:00.560
<v Speaker 3>the beginning that the winner is going to be the

0:34:00.560 --> 0:34:06.080
<v Speaker 3>one with the biggest, most complex version, And you're saying, actually, no,

0:34:06.280 --> 0:34:11.680
<v Speaker 3>there's a whole series of uses where being lean and focused,

0:34:11.960 --> 0:34:15.800
<v Speaker 3>focused is actually you know, it enables a whole class

0:34:15.800 --> 0:34:19.160
<v Speaker 3>of uses. Maybe another way of saying this is who

0:34:19.200 --> 0:34:21.640
<v Speaker 3>wouldn't be a potential instruct lab customer.

0:34:22.000 --> 0:34:25.160
<v Speaker 4>We don't know yet. It's so new, like we haven't

0:34:25.160 --> 0:34:27.480
<v Speaker 4>really had enough people experimenting and playing with it and

0:34:27.520 --> 0:34:30.160
<v Speaker 4>finding out all the things yet. But that's that's the

0:34:30.200 --> 0:34:32.080
<v Speaker 4>thing that's so exciting about it. It's like, I can't

0:34:32.080 --> 0:34:33.319
<v Speaker 4>wait to see what people do.

0:34:33.760 --> 0:34:35.520
<v Speaker 3>Is this the most exciting thing you've worked on in

0:34:35.560 --> 0:34:36.000
<v Speaker 3>your career?

0:34:36.320 --> 0:34:38.440
<v Speaker 4>I think so. I think so.

0:34:39.040 --> 0:34:42.360
<v Speaker 3>Yeah, Well, we are reaching the end of our time,

0:34:42.880 --> 0:34:46.080
<v Speaker 3>but before we finished, we can do a little speed round. Sure,

0:34:46.560 --> 0:34:50.840
<v Speaker 3>all right, complete the following sentence. In five years, AI

0:34:51.120 --> 0:34:52.400
<v Speaker 3>will be.

0:34:52.520 --> 0:34:56.920
<v Speaker 4>Boring, it will be integrated, It'll just work, and there

0:34:56.960 --> 0:34:59.720
<v Speaker 4>will be no now with AI thing. It'll just be normal.

0:35:01.360 --> 0:35:04.520
<v Speaker 3>What's the number one thing that people misunderstand about AI?

0:35:05.120 --> 0:35:08.640
<v Speaker 4>It's just matrix algebra. It's just numbers. It's not sentient.

0:35:08.880 --> 0:35:12.240
<v Speaker 4>It's not coming to take us over. It's just numbers.

0:35:12.440 --> 0:35:15.479
<v Speaker 3>You're on this side of You're on the team humanity. Yeah,

0:35:15.560 --> 0:35:20.080
<v Speaker 3>you're one good. What advice would you give yourself ten

0:35:20.160 --> 0:35:22.360
<v Speaker 3>years ago to better prepare for today?

0:35:22.960 --> 0:35:26.799
<v Speaker 4>Learn Python for real. It's a programming language that's extensively

0:35:26.880 --> 0:35:29.680
<v Speaker 4>used in the community. I've always dabbled in it, but

0:35:29.840 --> 0:35:31.440
<v Speaker 4>I wish I had taken it more seriously.

0:35:31.680 --> 0:35:33.640
<v Speaker 3>Yeah, did you say, who had a daughter?

0:35:34.200 --> 0:35:35.200
<v Speaker 4>I have three daughters?

0:35:35.280 --> 0:35:38.000
<v Speaker 3>You have three daughters. I have two. You're if you

0:35:38.080 --> 0:35:41.879
<v Speaker 3>got three year you're you're on your own. What are

0:35:41.880 --> 0:35:43.280
<v Speaker 3>you making them study Python?

0:35:44.400 --> 0:35:47.440
<v Speaker 4>I am actually trying to do that. We're using a

0:35:47.480 --> 0:35:50.560
<v Speaker 4>microbit micro controller tool to do like a custom video

0:35:50.600 --> 0:35:53.960
<v Speaker 4>game controller. They prefer Scratch because it's a visual programming language,

0:35:53.960 --> 0:35:55.759
<v Speaker 4>but it has a Python interface too, and I'm like

0:35:55.880 --> 0:35:57.040
<v Speaker 4>pushing them towards Python.

0:35:57.400 --> 0:36:01.680
<v Speaker 3>Good chat, bock and image generators are the biggest things

0:36:01.680 --> 0:36:04.200
<v Speaker 3>in consumer AI right now. What do you think is

0:36:04.239 --> 0:36:06.200
<v Speaker 3>the next big business application?

0:36:07.680 --> 0:36:13.040
<v Speaker 4>Private models, small models fine tuned on your company's data

0:36:13.640 --> 0:36:15.319
<v Speaker 4>for you to use exclusively.

0:36:16.040 --> 0:36:19.400
<v Speaker 3>Are you using AI in your own personal life these days?

0:36:19.600 --> 0:36:21.440
<v Speaker 4>Honestly, I think a lot of us are using it

0:36:21.480 --> 0:36:23.879
<v Speaker 4>and we don't even realize it. Yeah, I mean, I'm

0:36:23.880 --> 0:36:27.840
<v Speaker 4>a ficiano of foreign languages. There's translation programs that are

0:36:27.880 --> 0:36:30.920
<v Speaker 4>built using machine learning underneath. One of the things I've

0:36:30.960 --> 0:36:33.960
<v Speaker 4>been dabbling with lately is using tech summarizations because I

0:36:34.040 --> 0:36:36.719
<v Speaker 4>tend to be very loquacious in my note taking and

0:36:36.760 --> 0:36:39.120
<v Speaker 4>that is not so useful for other people who would

0:36:39.160 --> 0:36:42.080
<v Speaker 4>just like a paragraph. So that's something I've been experimenting

0:36:42.080 --> 0:36:43.919
<v Speaker 4>with myself just to help my everyday work.

0:36:44.040 --> 0:36:48.319
<v Speaker 3>Yeah. We hear many definitions of open related to technology.

0:36:48.880 --> 0:36:52.160
<v Speaker 3>What's your definition of open and how does it help

0:36:52.200 --> 0:36:52.680
<v Speaker 3>you innovate?

0:36:53.040 --> 0:36:58.920
<v Speaker 4>My definition of open is basically sharing and being vulnerable,

0:36:59.040 --> 0:37:02.080
<v Speaker 4>like not just sharing gonna have a cookie way, but

0:37:02.200 --> 0:37:04.520
<v Speaker 4>in a you know what, I don't actually know how

0:37:04.560 --> 0:37:07.360
<v Speaker 4>this works? Could you help me? And being open to

0:37:07.440 --> 0:37:11.080
<v Speaker 4>being wrong, being open to somebody helping you, and making

0:37:11.080 --> 0:37:13.200
<v Speaker 4>that collaboration work. So it's not just about like the

0:37:13.320 --> 0:37:16.680
<v Speaker 4>artifact or opening, it's your approach, like how you do

0:37:16.760 --> 0:37:17.520
<v Speaker 4>things being open.

0:37:17.800 --> 0:37:21.319
<v Speaker 3>Yeah yeah, well I think that wraps us up. How

0:37:21.360 --> 0:37:24.680
<v Speaker 3>can listeners follow your work and learn more about granted

0:37:24.760 --> 0:37:25.719
<v Speaker 3>and instruct lab.

0:37:26.000 --> 0:37:28.600
<v Speaker 4>Sure, you can visit our project web page at instruct

0:37:28.680 --> 0:37:31.600
<v Speaker 4>lab dot ai, or you can visit our GitHub at

0:37:31.680 --> 0:37:34.759
<v Speaker 4>GitHub dot com slash instruct lab. We have lots of

0:37:34.800 --> 0:37:38.280
<v Speaker 4>instructions on how to get involved in an instruct lab wonderful.

0:37:38.600 --> 0:37:44.600
<v Speaker 3>Thank you so much, Thank you, Malcolm. A big thank

0:37:44.680 --> 0:37:48.520
<v Speaker 3>you to Mow for the engaging discussion on the groundbreaking

0:37:48.840 --> 0:37:53.719
<v Speaker 3>possibilities of instruct lab. We've explored how this platform has

0:37:53.760 --> 0:37:58.120
<v Speaker 3>the potential to revolutionize industries from insurance to entertainment law

0:37:58.400 --> 0:38:01.200
<v Speaker 3>by using an open source community, the approach that makes

0:38:01.200 --> 0:38:04.200
<v Speaker 3>it easier for more people from all backgrounds to fine

0:38:04.239 --> 0:38:10.319
<v Speaker 3>tune models for specific purposes, ultimately making AI more accessible

0:38:10.920 --> 0:38:15.600
<v Speaker 3>and impactful than ever. Looking ahead, the future of AI

0:38:15.880 --> 0:38:20.440
<v Speaker 3>isn't just about technological efficiency. It's about enhancing our everyday

0:38:20.480 --> 0:38:25.279
<v Speaker 3>experiences in ways that were never possible before, like streamlining

0:38:25.320 --> 0:38:29.160
<v Speaker 3>work for doctors to improve the patient experience, or assisting

0:38:29.200 --> 0:38:34.680
<v Speaker 3>insurance agents to improve the claims experience. Instruct Lab is

0:38:34.800 --> 0:38:39.319
<v Speaker 3>paving the way for more open, accessible AI future, one

0:38:39.320 --> 0:38:45.840
<v Speaker 3>that's built on collaboration and humanity. Smart Talks with IBM

0:38:45.960 --> 0:38:50.120
<v Speaker 3>is produced by Matt Romano, Joey Fishground and Jacob Goldstein.

0:38:50.520 --> 0:38:54.320
<v Speaker 3>We're edited by Lydia jen Kott. Our engineers are Sarah

0:38:54.320 --> 0:38:59.440
<v Speaker 3>Bruger and Ben Tolliday. Theme song by Gramoscope Special thanks

0:38:59.440 --> 0:39:01.960
<v Speaker 3>to the Eight Bars and IBM teams, as well as

0:39:02.000 --> 0:39:05.520
<v Speaker 3>the Pushkin marketing team. Smart Talks with IBM is a

0:39:05.560 --> 0:39:10.319
<v Speaker 3>production of Pushkin Industries and Ruby Studio at iHeartMedia. To

0:39:10.400 --> 0:39:15.759
<v Speaker 3>find more Pushkin podcasts, listen on the iHeartRadio app, Apple Podcasts,

0:39:16.120 --> 0:39:20.880
<v Speaker 3>or wherever you listen to podcasts. I'm Malcolm Gladwell. This

0:39:21.000 --> 0:39:24.640
<v Speaker 3>is a paid advertisement from IBM. The conversations on this

0:39:24.719 --> 0:39:40.360
<v Speaker 3>podcast don't necessarily represent IBM's positions, strategies or opinions.