WEBVTT - Jack Morris on Finding the Next Big AI Breakthrough

0:00:02.720 --> 0:00:19.000
<v Speaker 1>Bloomberg Audio Studios, Podcasts, Radio News. Hello and welcome to

0:00:19.040 --> 0:00:21.080
<v Speaker 1>another episode of The Odd Laws podcast.

0:00:21.200 --> 0:00:23.520
<v Speaker 2>I'm Joe Wisenthal and I'm Tracy Alloway.

0:00:23.800 --> 0:00:26.319
<v Speaker 1>Tracy, have you played around with GPT five much?

0:00:26.920 --> 0:00:30.640
<v Speaker 2>Not really, I've been perplexity pills. Oh that's what your

0:00:30.720 --> 0:00:33.360
<v Speaker 2>main Yeah, that's my main one at the moment. But

0:00:33.479 --> 0:00:34.840
<v Speaker 2>is it good? I hear mixed.

0:00:35.080 --> 0:00:37.239
<v Speaker 1>I use it because I use GPT every day. It

0:00:37.280 --> 0:00:41.400
<v Speaker 1>does not strike me as like obviously better yeah for

0:00:41.600 --> 0:00:44.640
<v Speaker 1>my uses than like the three models, which I've been

0:00:44.720 --> 0:00:46.760
<v Speaker 1>very impressed by because you know, I want to establish them.

0:00:46.760 --> 0:00:47.960
<v Speaker 3>No hater or anything like that.

0:00:48.240 --> 0:00:50.040
<v Speaker 1>But like, it did not strike me as like, oh,

0:00:50.080 --> 0:00:50.720
<v Speaker 1>this is like an.

0:00:50.720 --> 0:00:52.839
<v Speaker 2>Amazing Yeah, this is the thing.

0:00:53.040 --> 0:00:54.280
<v Speaker 3>Step function or whatever.

0:00:54.280 --> 0:00:59.080
<v Speaker 2>It feels like the sort of breakthroughs awe inspiring breakthroughs

0:00:59.120 --> 0:01:00.680
<v Speaker 2>are kind of behind us, and a lot of the

0:01:00.720 --> 0:01:04.160
<v Speaker 2>progress on the models feels very incremental at this point,

0:01:04.160 --> 0:01:05.959
<v Speaker 2>even though people are spending a lot of time and

0:01:06.040 --> 0:01:07.240
<v Speaker 2>resources on doing it.

0:01:07.520 --> 0:01:09.880
<v Speaker 1>The one thing GPG five does prompt me and say, oh,

0:01:09.880 --> 0:01:11.560
<v Speaker 1>that's a great question. Would you like to follow up

0:01:11.560 --> 0:01:11.959
<v Speaker 1>more on that?

0:01:12.160 --> 0:01:13.160
<v Speaker 3>But it's like does it.

0:01:13.080 --> 0:01:16.320
<v Speaker 2>Say, o, Joe, you're so smart? That's such a smart question.

0:01:16.440 --> 0:01:17.559
<v Speaker 3>Say you know what it did? Say?

0:01:17.680 --> 0:01:19.560
<v Speaker 1>I asked to follow up, and it started an answer

0:01:19.560 --> 0:01:22.280
<v Speaker 1>with love it and then love it? Do you want

0:01:22.280 --> 0:01:23.200
<v Speaker 1>me to look into that?

0:01:23.720 --> 0:01:24.039
<v Speaker 4>Yes?

0:01:24.200 --> 0:01:26.600
<v Speaker 2>They are very flattering, aren't they. Actually, that's one thing

0:01:26.640 --> 0:01:29.399
<v Speaker 2>I like about perplexity is it doesn't really flatter you.

0:01:29.400 --> 0:01:30.600
<v Speaker 2>It just spits out an answer.

0:01:30.840 --> 0:01:33.800
<v Speaker 1>So anyway, there's so many questions I have about AI,

0:01:33.880 --> 0:01:36.039
<v Speaker 1>and we talk about the business old fair amount and

0:01:36.200 --> 0:01:38.160
<v Speaker 1>video and all that stuff. We actually don't really talk

0:01:38.200 --> 0:01:41.160
<v Speaker 1>that much about the pure research side as much. But

0:01:41.240 --> 0:01:43.160
<v Speaker 1>it's pretty important, I think, because I think a lot

0:01:43.200 --> 0:01:45.360
<v Speaker 1>of people would agree that if the skills are like

0:01:45.400 --> 0:01:47.840
<v Speaker 1>slowing down, or if there were a wall or something

0:01:47.880 --> 0:01:51.000
<v Speaker 1>like that, that might change some of these business model calculations,

0:01:51.040 --> 0:01:53.080
<v Speaker 1>et cetera. So I think it's good we need to

0:01:53.080 --> 0:01:55.280
<v Speaker 1>get an update on just sort of the state of

0:01:55.280 --> 0:01:56.800
<v Speaker 1>the art the science of AI.

0:01:57.080 --> 0:01:57.320
<v Speaker 4>Yeah.

0:01:57.400 --> 0:02:00.600
<v Speaker 2>Also, it would be nice just to understand what's possible

0:02:00.760 --> 0:02:03.080
<v Speaker 2>in terms of the AI models and what people are

0:02:03.080 --> 0:02:06.360
<v Speaker 2>actually researching, what they're working towards, work like, is it

0:02:06.480 --> 0:02:09.480
<v Speaker 2>mostly about price? Is it mostly about the output? Is

0:02:09.520 --> 0:02:12.000
<v Speaker 2>it mostly about energy use? All those things?

0:02:12.080 --> 0:02:13.760
<v Speaker 1>All those things, Well, I'm really excited to say we

0:02:13.800 --> 0:02:16.760
<v Speaker 1>have the perfect guest, someone who is an AI researcher.

0:02:17.000 --> 0:02:19.360
<v Speaker 1>We're gonna be speaking with Jack Morris. He's currently about

0:02:19.400 --> 0:02:21.280
<v Speaker 1>to finish his PhD.

0:02:20.880 --> 0:02:22.400
<v Speaker 3>At Cornell in AI.

0:02:22.440 --> 0:02:27.160
<v Speaker 1>He's been affiliated with Meta professionally, so presumably he already

0:02:27.200 --> 0:02:30.080
<v Speaker 1>has a one hundred million dollar pay package in the bank.

0:02:30.320 --> 0:02:32.840
<v Speaker 1>Now he's shaking his head, he's not that's a joke.

0:02:33.040 --> 0:02:36.079
<v Speaker 1>But Jack, thank you so much for coming on odd lots.

0:02:36.200 --> 0:02:38.040
<v Speaker 4>Yeah, thanks for having me. This is gonna be fun.

0:02:38.120 --> 0:02:39.560
<v Speaker 1>What do you explain to me, like what you're up to,

0:02:39.600 --> 0:02:41.239
<v Speaker 1>because I don't really understand how.

0:02:41.080 --> 0:02:42.040
<v Speaker 3>It works where people are.

0:02:42.040 --> 0:02:44.560
<v Speaker 1>They're at a university and they're also at a company,

0:02:44.639 --> 0:02:47.560
<v Speaker 1>and this isn't how it works. And much of the world, right,

0:02:47.600 --> 0:02:49.880
<v Speaker 1>people get their degree and then they get a job.

0:02:50.080 --> 0:02:52.200
<v Speaker 1>I get the impression that in the AI world it's

0:02:52.240 --> 0:02:56.440
<v Speaker 1>a little fuzzier in terms of one's affiliations between industry

0:02:56.600 --> 0:02:58.040
<v Speaker 1>and education and stuff like that.

0:02:58.280 --> 0:03:01.320
<v Speaker 4>Yeah, that's definitely true. I think might be on the

0:03:01.360 --> 0:03:03.200
<v Speaker 4>way out, but I can tell you about my situation.

0:03:03.400 --> 0:03:06.960
<v Speaker 4>So there's kind of a public research world and like

0:03:06.960 --> 0:03:11.240
<v Speaker 4>a private research world, and all the academic institutions do

0:03:11.280 --> 0:03:17.400
<v Speaker 4>public research, and the AI labs like Open Ai, Anthropic, Google,

0:03:17.440 --> 0:03:19.840
<v Speaker 4>deep Mind, they essentially do private research where they have

0:03:19.960 --> 0:03:23.079
<v Speaker 4>these people in house that are running experiments and learning

0:03:23.080 --> 0:03:25.960
<v Speaker 4>more about their systems, but they don't publish anything or

0:03:25.960 --> 0:03:28.280
<v Speaker 4>share any of their knowledge. And so a cool thing

0:03:28.280 --> 0:03:30.920
<v Speaker 4>about getting your PhD right now is you can do

0:03:31.080 --> 0:03:34.079
<v Speaker 4>research right about it and then publicize it like put

0:03:34.080 --> 0:03:36.400
<v Speaker 4>it online, I tweet about it. I kind of like

0:03:36.520 --> 0:03:38.920
<v Speaker 4>can talk to you about it. And there's a few

0:03:38.960 --> 0:03:42.000
<v Speaker 4>places left that will still kind of moment, we're never.

0:03:41.920 --> 0:03:42.720
<v Speaker 3>Going to hear from you again.

0:03:44.600 --> 0:03:46.640
<v Speaker 4>Yeah, I'll make sure they have a clause in my

0:03:46.720 --> 0:03:50.040
<v Speaker 4>contract that I can still talk to Joe and Tracy.

0:03:49.400 --> 0:03:52.320
<v Speaker 2>The all thoughts clause. Yes, that would be important. So

0:03:52.680 --> 0:03:56.440
<v Speaker 2>when we say AI research or an AI researcher, what

0:03:56.560 --> 0:04:01.480
<v Speaker 2>exactly does that entail? Can't the AI models just research themselves?

0:04:01.600 --> 0:04:02.440
<v Speaker 2>Just let them do it?

0:04:02.720 --> 0:04:06.000
<v Speaker 4>Yeah, that's actually a very smart idea, and like people

0:04:06.120 --> 0:04:08.720
<v Speaker 4>are really worried about that. Actually, Like if we get

0:04:08.720 --> 0:04:13.320
<v Speaker 4>to the point where the AI can improve itself into researching, yeah,

0:04:13.360 --> 0:04:15.120
<v Speaker 4>then it sort of gets smarter and then it improves

0:04:15.120 --> 0:04:17.120
<v Speaker 4>themself again and it ends up being this kind of

0:04:17.400 --> 0:04:21.160
<v Speaker 4>exponential improvement that ends up with all of our demise.

0:04:21.839 --> 0:04:24.840
<v Speaker 4>But I think right now it's not quite there yet.

0:04:25.080 --> 0:04:28.160
<v Speaker 4>Like maybe you can talk to CHGBT what good Yeah,

0:04:28.160 --> 0:04:29.960
<v Speaker 4>And good news for me too, because it means I

0:04:30.000 --> 0:04:33.599
<v Speaker 4>can still get a degree and be gainfully employed. But

0:04:34.200 --> 0:04:36.680
<v Speaker 4>I think it's it's still helpful, but we still need

0:04:36.720 --> 0:04:39.800
<v Speaker 4>like humans to make these improvements. And in terms of

0:04:39.839 --> 0:04:41.479
<v Speaker 4>what the actual day to day work looks like, I

0:04:41.480 --> 0:04:43.800
<v Speaker 4>think it really varies. Like there's some people working on

0:04:44.480 --> 0:04:47.279
<v Speaker 4>trying to make the models run faster, or trying to

0:04:47.279 --> 0:04:50.440
<v Speaker 4>make the hardware that runs the models run faster more efficiently.

0:04:50.880 --> 0:04:52.800
<v Speaker 4>There's people that try to work on the data, like

0:04:52.839 --> 0:04:55.640
<v Speaker 4>what should we train on more coding problems or more

0:04:55.880 --> 0:04:59.600
<v Speaker 4>textbooks or more Reddit posts, what works best to make

0:04:59.640 --> 0:05:01.920
<v Speaker 4>the model? And then there's a lot more people working

0:05:01.960 --> 0:05:04.680
<v Speaker 4>on different areas of the stack, like training algorithms. I

0:05:04.760 --> 0:05:08.240
<v Speaker 4>kind of have my own little niche and niche. There's

0:05:08.279 --> 0:05:11.760
<v Speaker 4>this old field of information theory from like the twentieth

0:05:11.760 --> 0:05:14.360
<v Speaker 4>century where they talk about bits like a zero or

0:05:14.400 --> 0:05:16.600
<v Speaker 4>a one is a bit and you can add them

0:05:16.680 --> 0:05:20.040
<v Speaker 4>up and have kilobytes and megabytes. And so I've been

0:05:20.080 --> 0:05:21.919
<v Speaker 4>trying to think about what that means in like the

0:05:22.000 --> 0:05:24.200
<v Speaker 4>chat GBT world, if you train a model on a

0:05:24.200 --> 0:05:26.880
<v Speaker 4>certain number of bits, how many bits does it actually learn?

0:05:27.200 --> 0:05:29.040
<v Speaker 4>And like can you look at the model and figure

0:05:29.040 --> 0:05:30.599
<v Speaker 4>out like if you have one slice of the model,

0:05:30.600 --> 0:05:32.600
<v Speaker 4>how many bits that is and stuff like that. So

0:05:32.640 --> 0:05:34.920
<v Speaker 4>maybe the easiest way to explain is if you had,

0:05:34.960 --> 0:05:37.640
<v Speaker 4>for some god forsaken reason to use chat GBT as

0:05:37.680 --> 0:05:40.560
<v Speaker 4>like a flash drive, like you had a certain set

0:05:40.600 --> 0:05:43.080
<v Speaker 4>of data and it had to memorize all that data,

0:05:43.200 --> 0:05:46.840
<v Speaker 4>Like how much data could it actually store? That's the

0:05:46.960 --> 0:05:48.800
<v Speaker 4>kind of area I've been working in. And then you know,

0:05:48.800 --> 0:05:50.880
<v Speaker 4>once you're there, you kind of realize we could do this,

0:05:51.040 --> 0:05:52.800
<v Speaker 4>or maybe next semester, if we have time, we could

0:05:53.120 --> 0:05:54.839
<v Speaker 4>try this other thing. And so there's it kind of

0:05:54.839 --> 0:05:56.880
<v Speaker 4>branches out and there's a lot of little problems that

0:05:56.920 --> 0:05:57.520
<v Speaker 4>you can try.

0:05:57.920 --> 0:06:01.800
<v Speaker 1>I mentioned GPT five fine to me, It does not

0:06:02.040 --> 0:06:05.479
<v Speaker 1>strike me as like you know, because actually so the

0:06:05.520 --> 0:06:08.360
<v Speaker 1>first time I use cha GPT is genuinely blown away

0:06:08.400 --> 0:06:10.559
<v Speaker 1>like most people. And then actually I was pretty blown

0:06:10.600 --> 0:06:13.600
<v Speaker 1>away by the three models, in part because of how

0:06:13.640 --> 0:06:16.640
<v Speaker 1>well they could do document search and superior to Google

0:06:16.680 --> 0:06:19.680
<v Speaker 1>Search in many respects and also just the organization of

0:06:19.720 --> 0:06:22.279
<v Speaker 1>a lot of unstructured data, et cetera. Like I didn't

0:06:22.279 --> 0:06:25.720
<v Speaker 1>have like some oh my god wow moment with GPT five.

0:06:25.760 --> 0:06:28.960
<v Speaker 1>It's like, this seems like, how do we measure whether

0:06:29.080 --> 0:06:31.359
<v Speaker 1>AI is getting better all the time.

0:06:32.680 --> 0:06:35.360
<v Speaker 4>Yeah, that's that's a huge question, right.

0:06:35.800 --> 0:06:37.200
<v Speaker 1>Well, let me ask you, Okay, let me ask you

0:06:37.320 --> 0:06:41.480
<v Speaker 1>actually a more specific question. How do the entities that

0:06:41.800 --> 0:06:46.320
<v Speaker 1>test AI models as their job or as their function?

0:06:46.800 --> 0:06:50.320
<v Speaker 1>What does the formal testing process look like to rank

0:06:50.400 --> 0:06:52.480
<v Speaker 1>the quality of AI models?

0:06:52.560 --> 0:06:55.240
<v Speaker 4>Okay, yeah, that's that's more tractable. We can we can

0:06:55.320 --> 0:06:57.120
<v Speaker 4>start there, and then we can talk about three and

0:06:57.320 --> 0:07:01.159
<v Speaker 4>GPT five. So there's essentially two ways people do this

0:07:01.240 --> 0:07:05.000
<v Speaker 4>kind of model evaluation. The main one is just by

0:07:05.120 --> 0:07:08.159
<v Speaker 4>testing them on different data sets. So, for example, there's

0:07:08.200 --> 0:07:10.680
<v Speaker 4>this data set called swee bench that's a bunch of

0:07:11.120 --> 0:07:14.680
<v Speaker 4>software engineering related coding problems and they all have a

0:07:14.760 --> 0:07:18.040
<v Speaker 4>human written solution and tests, and so you can ask

0:07:18.120 --> 0:07:20.280
<v Speaker 4>GPT five, can you write the code for this and

0:07:20.320 --> 0:07:22.320
<v Speaker 4>then run the tests and see if it's right? And

0:07:22.400 --> 0:07:24.360
<v Speaker 4>still the models are pretty bad at that. I think

0:07:24.360 --> 0:07:26.800
<v Speaker 4>they can do about half of them. They're very hard.

0:07:26.840 --> 0:07:30.640
<v Speaker 4>They're like entire days of work for professional software engineers.

0:07:30.880 --> 0:07:32.920
<v Speaker 4>But when a new model comes out, they can say, oh, look,

0:07:32.920 --> 0:07:35.360
<v Speaker 4>we actually got a higher score on sweet bench. And

0:07:35.400 --> 0:07:37.520
<v Speaker 4>there's a ton of different data sets like that. So

0:07:37.560 --> 0:07:39.920
<v Speaker 4>when GBT five comes out, they say, you know, it's

0:07:39.920 --> 0:07:42.880
<v Speaker 4>better at these types of coding tests. And a big

0:07:42.880 --> 0:07:46.760
<v Speaker 4>one that specifically open AI has been advocating for is math,

0:07:47.120 --> 0:07:50.200
<v Speaker 4>like they did the International Math Olympiad, and they said

0:07:50.760 --> 0:07:54.520
<v Speaker 4>essentially GBT five scored at the level of the best

0:07:54.640 --> 0:07:59.200
<v Speaker 4>high school mathematicians, which is pretty cool. But you raise

0:07:59.240 --> 0:08:00.960
<v Speaker 4>a good question of how is that actually map to

0:08:01.000 --> 0:08:03.200
<v Speaker 4>real world usage? And I think this is like a

0:08:03.240 --> 0:08:06.680
<v Speaker 4>really hard problem that people still haven't figured out.

0:08:06.960 --> 0:08:11.080
<v Speaker 2>Does anyone try to capture that sort of like genes sequah?

0:08:11.320 --> 0:08:13.640
<v Speaker 2>I guess when it comes to AI models, is one

0:08:13.680 --> 0:08:15.920
<v Speaker 2>of the tests asking it to I don't know, come

0:08:16.000 --> 0:08:17.680
<v Speaker 2>up with a stupid limerick or something.

0:08:18.160 --> 0:08:21.200
<v Speaker 4>Yeah, there are a lot of tests like that. There's

0:08:21.280 --> 0:08:25.200
<v Speaker 4>some creative writing benchmarks and some poetry related ones. But

0:08:25.480 --> 0:08:29.080
<v Speaker 4>I think you point out something interesting that for example,

0:08:29.120 --> 0:08:32.680
<v Speaker 4>I mostly use Claude from Anthropic and I think Claude

0:08:32.760 --> 0:08:36.520
<v Speaker 4>does have this something to it that's like a little

0:08:36.559 --> 0:08:38.960
<v Speaker 4>bit different, and it's very difficult to characterize. It's just

0:08:39.000 --> 0:08:40.400
<v Speaker 4>sort of the way it speaks to you and the

0:08:40.400 --> 0:08:42.760
<v Speaker 4>way it thinks of itself is I like it a

0:08:42.800 --> 0:08:44.960
<v Speaker 4>lot better, but I don't know how you would design

0:08:45.000 --> 0:08:47.320
<v Speaker 4>like a data set that can really capture that. The

0:08:47.360 --> 0:08:50.079
<v Speaker 4>second way they do the evaluation is by they call

0:08:50.080 --> 0:08:53.840
<v Speaker 4>it it's Elo scores, like in chess. So they, for example,

0:08:53.920 --> 0:08:56.440
<v Speaker 4>ask the two models to write a limerick, and then

0:08:56.480 --> 0:08:59.000
<v Speaker 4>they have humans rank which one is better, and they

0:08:59.040 --> 0:09:02.400
<v Speaker 4>make this kind of lat of Elo rankings for models.

0:09:02.640 --> 0:09:05.439
<v Speaker 4>So I think right now Claude or GPT five or

0:09:05.480 --> 0:09:09.440
<v Speaker 4>maybe the Google model is top on this ladder.

0:09:10.000 --> 0:09:12.680
<v Speaker 1>The algorithm made famous in the social network that Mark

0:09:12.760 --> 0:09:17.080
<v Speaker 1>Zuckerberg used to rate the of his colleagues still the

0:09:17.200 --> 0:09:19.400
<v Speaker 1>workhorse model for comp evaluation.

0:09:19.640 --> 0:09:22.800
<v Speaker 2>That's some good trivia, Joe, very good and no comment. Well,

0:09:22.880 --> 0:09:27.440
<v Speaker 2>I assume just on the hard number evaluation. People are

0:09:27.480 --> 0:09:31.839
<v Speaker 2>also ranking these on data usage, energy, that sort of.

0:09:31.760 --> 0:09:32.240
<v Speaker 4>Thing as well.

0:09:32.320 --> 0:09:35.760
<v Speaker 2>Right speed, speed would be a definitely.

0:09:35.960 --> 0:09:38.640
<v Speaker 4>The AI companies like to use price as a metric,

0:09:38.760 --> 0:09:41.120
<v Speaker 4>which is kind of interesting because there's a lot that

0:09:41.160 --> 0:09:43.440
<v Speaker 4>goes on behind the scenes, including just sort of like

0:09:44.280 --> 0:09:47.720
<v Speaker 4>free money that drives the prices down, but they also

0:09:47.760 --> 0:09:50.240
<v Speaker 4>do benchmark speed, and I think you make a good

0:09:50.280 --> 0:09:53.079
<v Speaker 4>point that the benchmarks can be pretty misleading, Like, for example,

0:09:53.080 --> 0:09:56.160
<v Speaker 4>there's a bunch of recent open source models that came

0:09:56.200 --> 0:09:59.000
<v Speaker 4>from different Chinese AI labs that have really, really high

0:09:59.080 --> 0:10:02.400
<v Speaker 4>scores on certain benchmarks, but people kind of think they're

0:10:02.400 --> 0:10:05.680
<v Speaker 4>not as good for real world usage for whatever reason.

0:10:06.360 --> 0:10:08.720
<v Speaker 1>I've seen people talk about this isn't part of the

0:10:08.840 --> 0:10:14.120
<v Speaker 1>problem with testing AI or evaluating AI. That a lot

0:10:14.160 --> 0:10:16.959
<v Speaker 1>of these problems exist in the real world already, right,

0:10:17.200 --> 0:10:19.720
<v Speaker 1>You see this a lot, and people are always finding this,

0:10:19.920 --> 0:10:23.400
<v Speaker 1>which is that here's an AI model that is amazing

0:10:23.559 --> 0:10:27.520
<v Speaker 1>at math on the math Olympiad, and yet it gets

0:10:27.520 --> 0:10:31.280
<v Speaker 1>tripped up by questions like which is heavier a pound

0:10:31.320 --> 0:10:33.880
<v Speaker 1>of steel or two pounds of feathers, And they'll say

0:10:33.920 --> 0:10:35.920
<v Speaker 1>that that's a trick question. A pound of steel weighs the

0:10:35.920 --> 0:10:38.520
<v Speaker 1>same as two pounds of feathers, which is clearly like

0:10:38.840 --> 0:10:41.760
<v Speaker 1>it was clearly then been trained in some sense to

0:10:42.280 --> 0:10:44.960
<v Speaker 1>recognize these steel versus feathers thing or whatever it is.

0:10:45.200 --> 0:10:47.920
<v Speaker 1>I forget if it's steel, But it also clearly can't

0:10:47.960 --> 0:10:49.719
<v Speaker 1>measure whether one or.

0:10:49.679 --> 0:10:50.480
<v Speaker 3>Two is bigger.

0:10:50.840 --> 0:10:54.960
<v Speaker 4>Yeah, that's a really good example. I think they kind

0:10:54.960 --> 0:10:58.720
<v Speaker 4>of successively include these kinds of things in more rounds

0:10:58.720 --> 0:11:00.760
<v Speaker 4>of training data, and so every time a new model

0:11:00.800 --> 0:11:03.640
<v Speaker 4>comes out, they kind of patch little holes that appeared

0:11:03.640 --> 0:11:06.040
<v Speaker 4>in the previous models. So you're pointing to this, like

0:11:06.080 --> 0:11:08.280
<v Speaker 4>they probably started with the classic riddle that's like a

0:11:08.320 --> 0:11:10.200
<v Speaker 4>pound of bricks or a pound of feathers bricks and

0:11:10.240 --> 0:11:13.120
<v Speaker 4>they're equal, but then like the models got that wrong,

0:11:13.160 --> 0:11:14.040
<v Speaker 4>and so they added to.

0:11:13.960 --> 0:11:19.080
<v Speaker 1>Something a very efficient way to achieve intelligence, like, oh yeah,

0:11:19.080 --> 0:11:19.960
<v Speaker 1>we should have included that.

0:11:20.000 --> 0:11:21.640
<v Speaker 3>Oh yeah, we got to include that trick. Oh yeah,

0:11:21.640 --> 0:11:22.320
<v Speaker 3>we gotta have right.

0:11:22.360 --> 0:11:26.480
<v Speaker 1>Like ever, like going that does not speak to me

0:11:26.880 --> 0:11:30.200
<v Speaker 1>of a line towards something that we would call anything

0:11:30.280 --> 0:11:32.280
<v Speaker 1>resembling human intelligence.

0:11:32.400 --> 0:11:35.760
<v Speaker 4>I definitely agree. I think one counter example is people

0:11:35.760 --> 0:11:37.880
<v Speaker 4>said this for a long time about self driving cars,

0:11:38.240 --> 0:11:40.480
<v Speaker 4>Like everyone was really excited about them for a long time,

0:11:40.520 --> 0:11:42.760
<v Speaker 4>and then they kind of didn't really work, like eight

0:11:42.880 --> 0:11:45.360
<v Speaker 4>or so years ago, and there was this period where

0:11:45.360 --> 0:11:47.959
<v Speaker 4>they were saying, oh, the models can't do green cones.

0:11:48.040 --> 0:11:50.400
<v Speaker 4>We're going out there trying to take videos of green cones,

0:11:50.440 --> 0:11:55.640
<v Speaker 4>and yeah, they can't do snow. I'm saying that it

0:11:55.720 --> 0:11:59.240
<v Speaker 4>worked for them, and so it might be possible. But

0:11:59.720 --> 0:12:01.960
<v Speaker 4>in the case of language models, there's something a little

0:12:02.000 --> 0:12:05.880
<v Speaker 4>more interesting happening, because we now have two ways to learn.

0:12:06.280 --> 0:12:07.760
<v Speaker 4>If you guys are ready, we could we could get

0:12:07.760 --> 0:12:10.040
<v Speaker 4>into something a little technical, which I think gives you

0:12:10.080 --> 0:12:13.280
<v Speaker 4>some insights. So there's essentially two ways you can teach

0:12:13.360 --> 0:12:16.680
<v Speaker 4>machines to learn from data. One is called supervised learning,

0:12:16.920 --> 0:12:19.640
<v Speaker 4>where the computer will copy what you did, which is

0:12:19.640 --> 0:12:22.040
<v Speaker 4>like basically what we're talking about now, and the other

0:12:22.160 --> 0:12:25.199
<v Speaker 4>is called reinforcement learning, where the computer just does something

0:12:25.280 --> 0:12:27.120
<v Speaker 4>and then you give it a reward if it does

0:12:27.160 --> 0:12:30.360
<v Speaker 4>something well. And so for a long time, like the

0:12:30.400 --> 0:12:34.640
<v Speaker 4>original chat GBT was mostly just trained with supervised learning,

0:12:34.720 --> 0:12:37.120
<v Speaker 4>like it would just copy the text from all of

0:12:37.160 --> 0:12:39.680
<v Speaker 4>the Internet, and so the best it could ever do

0:12:39.880 --> 0:12:44.280
<v Speaker 4>is emulate Reddit posts very well. And there was a

0:12:44.320 --> 0:12:47.439
<v Speaker 4>tiny bit of reinforcement learning, but people didn't know how

0:12:47.480 --> 0:12:50.040
<v Speaker 4>to do it right. And then you mentioned this three model,

0:12:50.040 --> 0:12:52.839
<v Speaker 4>which is kind of in some ways like a big jump,

0:12:52.960 --> 0:12:55.040
<v Speaker 4>like it made the models much better at math, much

0:12:55.040 --> 0:12:57.760
<v Speaker 4>better at certain things. And the way they did that

0:12:57.840 --> 0:13:00.760
<v Speaker 4>is actually through reinforcement learning. Found out a way to

0:13:00.840 --> 0:13:02.760
<v Speaker 4>kind of like let the model think for a while

0:13:03.240 --> 0:13:05.280
<v Speaker 4>and then give it a reward when it gets the

0:13:05.360 --> 0:13:07.600
<v Speaker 4>answer at the end. It's kind of scary.

0:13:07.840 --> 0:13:10.199
<v Speaker 2>Yeah, when you say give it a reward, is.

0:13:10.120 --> 0:13:13.680
<v Speaker 3>It like take a cookie paying robots?

0:13:13.920 --> 0:13:14.120
<v Speaker 1>Yeah?

0:13:14.240 --> 0:13:16.920
<v Speaker 2>Well no, genuinely, like what is the reward? You just

0:13:16.920 --> 0:13:18.080
<v Speaker 2>tell it it did a good job.

0:13:18.440 --> 0:13:20.199
<v Speaker 4>You just give it like a higher number. Okay, and

0:13:20.240 --> 0:13:21.559
<v Speaker 4>that makes you happy, all right.

0:13:22.120 --> 0:13:24.520
<v Speaker 2>I'd get a little bit worried when we're like giving

0:13:24.520 --> 0:13:27.520
<v Speaker 2>it cupcakes or something like here you go, good job.

0:13:28.440 --> 0:13:30.240
<v Speaker 2>Just going back to the intro, you know, we were

0:13:30.240 --> 0:13:32.880
<v Speaker 2>talking about how it feels like a lot of the

0:13:32.920 --> 0:13:36.520
<v Speaker 2>progress on AI models is a little bit more incremental,

0:13:36.960 --> 0:13:39.000
<v Speaker 2>and I guess it's hard to tell whether that's just

0:13:39.200 --> 0:13:41.840
<v Speaker 2>personal bias because now we're used to them and the

0:13:41.880 --> 0:13:44.720
<v Speaker 2>sort of wow moment has passed. But what does it

0:13:44.760 --> 0:13:47.440
<v Speaker 2>feel like to you in terms of improvements? Are we

0:13:47.600 --> 0:13:52.040
<v Speaker 2>seeing the improvement cycle accelerate or decelerate at this point?

0:13:52.240 --> 0:13:55.000
<v Speaker 4>I think it's kind of like the market, where it's

0:13:55.040 --> 0:13:57.679
<v Speaker 4>like always it gets faster for a little while, and

0:13:57.679 --> 0:14:00.560
<v Speaker 4>then it feels like things have slowed down and the

0:14:00.600 --> 0:14:02.920
<v Speaker 4>progress is never quite in the areas that you expect

0:14:03.000 --> 0:14:06.720
<v Speaker 4>as one example, people really thought this year was the

0:14:06.840 --> 0:14:10.680
<v Speaker 4>year when the assistance would start being able to act

0:14:10.720 --> 0:14:13.560
<v Speaker 4>like actual assistants, like the Year of agents. People actually

0:14:13.640 --> 0:14:15.800
<v Speaker 4>coined that term, I think, like the year of agents,

0:14:16.000 --> 0:14:19.080
<v Speaker 4>and it really it didn't happen for whatever reason. Maybe

0:14:19.080 --> 0:14:21.160
<v Speaker 4>it will in the next three months. But the agents

0:14:21.160 --> 0:14:23.200
<v Speaker 4>are still pretty bad the ones that you can use.

0:14:23.440 --> 0:14:25.920
<v Speaker 4>But they did get way better at competitive math, Like

0:14:25.960 --> 0:14:29.600
<v Speaker 4>now they can do these like world class proofs that

0:14:29.640 --> 0:14:33.080
<v Speaker 4>they couldn't do before. So it's almost unpredictable, like which

0:14:33.160 --> 0:14:36.120
<v Speaker 4>areas the AI will kind of conquer next, But it

0:14:36.160 --> 0:14:38.920
<v Speaker 4>does feel like progress is continuing.

0:14:39.320 --> 0:14:42.920
<v Speaker 1>Actually, what happened with agents? I've never had a successful

0:14:43.280 --> 0:14:45.840
<v Speaker 1>agent experience, even basic things like come up with a

0:14:45.880 --> 0:14:49.120
<v Speaker 1>list of every past odd Lots guests, yeah and put

0:14:49.160 --> 0:14:52.120
<v Speaker 1>it in a file or something like that, which just

0:14:52.760 --> 0:14:55.200
<v Speaker 1>there's an RSS feed that exists for odd Lots. This

0:14:55.200 --> 0:14:57.480
<v Speaker 1>should be ray stick for it all around, and then

0:14:58.040 --> 0:15:00.880
<v Speaker 1>something will happen or it'll get lazy. Here's like here's

0:15:00.920 --> 0:15:04.560
<v Speaker 1>fifteen and what is actually this is thought leaders love

0:15:04.600 --> 0:15:06.560
<v Speaker 1>this stuff. They love to talking about the agents. So

0:15:06.600 --> 0:15:09.720
<v Speaker 1>what actually happened with agents? Maybe they'll get there, but

0:15:09.800 --> 0:15:11.400
<v Speaker 1>what do you use to what is the roadblock there.

0:15:11.880 --> 0:15:14.680
<v Speaker 4>I don't think there's any conceptual roadblock, Like there's no

0:15:14.800 --> 0:15:17.400
<v Speaker 4>reason why you couldn't collect data for that and train

0:15:17.480 --> 0:15:20.600
<v Speaker 4>them either in a supervised way or using reinforcement learning.

0:15:20.920 --> 0:15:23.520
<v Speaker 4>It just hasn't happened yet. So I think maybe behind

0:15:23.520 --> 0:15:25.400
<v Speaker 4>the scenes it turned out that the problem was harder

0:15:25.400 --> 0:15:28.560
<v Speaker 4>than people thought, Like getting data from all those scenarios

0:15:28.640 --> 0:15:31.640
<v Speaker 4>is really hard. And there have been some stories from

0:15:31.800 --> 0:15:34.040
<v Speaker 4>like people that I've heard of that found these little

0:15:34.080 --> 0:15:37.840
<v Speaker 4>companies in San Francisco and they build these tiny environments

0:15:37.880 --> 0:15:41.240
<v Speaker 4>for the AI labs to do reinforcement learning on for agents,

0:15:41.320 --> 0:15:44.000
<v Speaker 4>like for example, doing a calendar. They'll build like a

0:15:44.000 --> 0:15:47.120
<v Speaker 4>little calendar app, but make it have rewards so you

0:15:47.120 --> 0:15:49.440
<v Speaker 4>can do reinforcement learning, and they can just sell that

0:15:49.520 --> 0:15:51.760
<v Speaker 4>for like hundreds of thousands of dollars. So I think

0:15:51.920 --> 0:15:54.600
<v Speaker 4>the progress is ongoing behind the scenes, Like there's a

0:15:54.600 --> 0:15:58.080
<v Speaker 4>whole ecosystem built around it. It just hasn't really manifested

0:15:58.080 --> 0:15:59.400
<v Speaker 4>in the products that we use.

0:16:00.000 --> 0:16:02.880
<v Speaker 2>I was going to ask, how much of the difficulty is,

0:16:03.360 --> 0:16:06.360
<v Speaker 2>you know, the actual development of the models, the thinking part,

0:16:06.440 --> 0:16:10.640
<v Speaker 2>versus just getting them to plug in seamlessly with other applications.

0:16:11.160 --> 0:16:15.440
<v Speaker 4>Yeah, I think the second thing is probably the biggest

0:16:15.440 --> 0:16:17.920
<v Speaker 4>barrier in terms of time, Like it just takes a

0:16:17.920 --> 0:16:20.520
<v Speaker 4>really long time to figure out what data you need

0:16:20.640 --> 0:16:23.160
<v Speaker 4>and collect it properly and actually train the models on

0:16:23.200 --> 0:16:25.680
<v Speaker 4>that data. But at the same time, there are people

0:16:26.120 --> 0:16:27.920
<v Speaker 4>like me who are trying to work on better like

0:16:28.000 --> 0:16:31.400
<v Speaker 4>conceptual frameworks for training the models. So to go back

0:16:31.400 --> 0:16:37.280
<v Speaker 4>to the three example, doing reinforcement learning on CHATGBT, like

0:16:37.320 --> 0:16:40.240
<v Speaker 4>that seems to me like a huge breakthrough, Like we

0:16:40.280 --> 0:16:42.760
<v Speaker 4>didn't know how to do that before. It unlocks all

0:16:42.800 --> 0:16:45.800
<v Speaker 4>sorts of doors and ways to train the models. So

0:16:45.960 --> 0:16:48.400
<v Speaker 4>even if maybe you don't think that model was that

0:16:48.480 --> 0:16:50.880
<v Speaker 4>much better than the previous one, it seems like it

0:16:50.960 --> 0:16:54.160
<v Speaker 4>will give us huge improvements in the future.

0:17:10.040 --> 0:17:14.879
<v Speaker 1>So you mentioned at the intro that it's possible, hopefully

0:17:14.920 --> 0:17:16.919
<v Speaker 1>you'll get a close but you might end up in

0:17:16.960 --> 0:17:19.879
<v Speaker 1>a situation which you go to work for some frontier

0:17:20.040 --> 0:17:22.800
<v Speaker 1>AI lab and we never hear from you again, or

0:17:22.840 --> 0:17:25.480
<v Speaker 1>you just post cryptic tweets like oh no idea, what's coming,

0:17:25.880 --> 0:17:26.680
<v Speaker 1>Oh it's gonna.

0:17:26.440 --> 0:17:29.760
<v Speaker 3>Be so over or whatever. Yeah, an the death Star, Yeah,

0:17:29.760 --> 0:17:30.560
<v Speaker 3>it's very annoying.

0:17:30.640 --> 0:17:33.320
<v Speaker 1>The way they all tweet, it's possible talk to us

0:17:33.359 --> 0:17:36.400
<v Speaker 1>about like why not work on an open source project?

0:17:36.880 --> 0:17:38.880
<v Speaker 1>And this is of course when people talk about deep

0:17:38.920 --> 0:17:40.680
<v Speaker 1>seek and a lot of the Chinese models that the

0:17:40.760 --> 0:17:43.520
<v Speaker 1>US competes with, a lot of those are open source.

0:17:43.840 --> 0:17:46.960
<v Speaker 1>Presumably you could keep coming on odd lads over and

0:17:47.000 --> 0:17:50.639
<v Speaker 1>over again, why like what is even the case for

0:17:50.800 --> 0:17:52.960
<v Speaker 1>the best and the brightest to work on a closed

0:17:52.960 --> 0:17:54.600
<v Speaker 1>source frontier models.

0:17:54.840 --> 0:17:58.080
<v Speaker 4>Yeah, it's a really hard question, Like I've I've struggled

0:17:58.119 --> 0:18:00.399
<v Speaker 4>with this in my own personal decision making. I was

0:18:00.560 --> 0:18:03.080
<v Speaker 4>originally thinking, Oh, I'd love to become a professor and

0:18:03.119 --> 0:18:07.479
<v Speaker 4>mentor younger students and get a whole like group of

0:18:07.520 --> 0:18:11.160
<v Speaker 4>these ideas going and start working on similar related problems

0:18:11.200 --> 0:18:12.920
<v Speaker 4>to the stuff I was talking about. And I still

0:18:12.920 --> 0:18:15.639
<v Speaker 4>think that would be fun. But there's a big gap

0:18:15.680 --> 0:18:18.520
<v Speaker 4>in terms of the things we can do at Cornell

0:18:18.640 --> 0:18:20.600
<v Speaker 4>and the things that you can do at open AI.

0:18:20.800 --> 0:18:24.919
<v Speaker 4>Like they just have like crazy infrastructure for training models

0:18:24.960 --> 0:18:29.480
<v Speaker 4>really easily and data and a ton of really good data.

0:18:29.960 --> 0:18:32.720
<v Speaker 4>And so I think as that gap has widened, I've

0:18:32.720 --> 0:18:34.760
<v Speaker 4>felt like a lot of what we're doing is like

0:18:35.080 --> 0:18:38.080
<v Speaker 4>kind of devising these toy scenarios where we can study

0:18:38.080 --> 0:18:41.240
<v Speaker 4>interesting things, but I feel a bit disconnected from the

0:18:41.280 --> 0:18:45.399
<v Speaker 4>real like progress of humanity. You know, like if you

0:18:45.480 --> 0:18:47.879
<v Speaker 4>really agree that this is like the biggest problem of

0:18:47.920 --> 0:18:49.760
<v Speaker 4>our time. I don't want to say it's like the

0:18:49.800 --> 0:18:52.439
<v Speaker 4>Manhattan Project, but like, what's more like trying to go

0:18:52.480 --> 0:18:54.680
<v Speaker 4>to the Moon in the sixties? The space race. It's

0:18:54.760 --> 0:18:56.720
<v Speaker 4>kind of like a space race going on in these

0:18:56.760 --> 0:18:58.840
<v Speaker 4>different private labs. You want to be a part of it.

0:18:58.880 --> 0:19:02.320
<v Speaker 4>Like there's crazy energy that it has huge implications for

0:19:02.359 --> 0:19:05.880
<v Speaker 4>the future of society. So I think I am interested

0:19:05.880 --> 0:19:09.760
<v Speaker 4>in participating in that. My big question is like, if

0:19:09.760 --> 0:19:13.560
<v Speaker 4>you think that the reinforcement learning thing was the most

0:19:13.600 --> 0:19:16.800
<v Speaker 4>recent big scientific breakthrough, like oh one, and then three,

0:19:17.240 --> 0:19:20.440
<v Speaker 4>what's next? And then like where will that actually be happening.

0:19:20.480 --> 0:19:22.800
<v Speaker 4>That's kind of what I'm thinking about right now.

0:19:22.880 --> 0:19:26.280
<v Speaker 2>Just on the data point. I was reading your excellent

0:19:26.480 --> 0:19:29.920
<v Speaker 2>substack and you argue that there's probably an upper bound

0:19:30.040 --> 0:19:33.320
<v Speaker 2>to what you can get out of a given data set,

0:19:34.160 --> 0:19:38.440
<v Speaker 2>and at some point, like the training starts to look

0:19:38.520 --> 0:19:42.520
<v Speaker 2>pretty similar, right, and the data becomes the differentiating factor.

0:19:43.520 --> 0:19:48.000
<v Speaker 2>How important are data sets to AI research? And I guess, like,

0:19:48.119 --> 0:19:50.359
<v Speaker 2>how do you go about finding really cool ones and

0:19:50.359 --> 0:19:53.520
<v Speaker 2>what's left. Because I feel like, you know, using the

0:19:53.560 --> 0:19:57.320
<v Speaker 2>space race analogy, everyone has been running so fast on this.

0:19:57.600 --> 0:19:59.800
<v Speaker 2>It feels like all the data sets must have been

0:19:59.840 --> 0:20:02.159
<v Speaker 2>a explored by now, but I guess they haven't.

0:20:02.520 --> 0:20:05.520
<v Speaker 4>Yeah, yeah, I think this is really getting to the

0:20:05.560 --> 0:20:08.280
<v Speaker 4>heart of what people are trying to figure out right

0:20:08.280 --> 0:20:11.880
<v Speaker 4>now in all these different labs. So I think you're

0:20:12.000 --> 0:20:15.800
<v Speaker 4>pretty much right that all of the public data sets

0:20:15.840 --> 0:20:21.159
<v Speaker 4>we have are pretty much used to TRAIN three or

0:20:21.520 --> 0:20:24.040
<v Speaker 4>GPT five or whatever. If there is a really good

0:20:24.400 --> 0:20:27.280
<v Speaker 4>website that should have been scraped and downloaded into the model,

0:20:27.320 --> 0:20:30.320
<v Speaker 4>it should probably be used. But there apparently is a

0:20:30.400 --> 0:20:33.679
<v Speaker 4>much larger amount of private data than public data. I mean,

0:20:33.760 --> 0:20:36.959
<v Speaker 4>you all work for Bloomberg, so you're probably intimately familiar

0:20:37.040 --> 0:20:39.119
<v Speaker 4>with this. But if you think about the different AI

0:20:39.200 --> 0:20:41.639
<v Speaker 4>labs that exist, they actually now do kind of have

0:20:41.760 --> 0:20:45.600
<v Speaker 4>different data related modes. Like XAI, they have all of

0:20:45.640 --> 0:20:49.280
<v Speaker 4>the Twitter data that's basically impossible to get elsewhere. CHADGBT

0:20:49.520 --> 0:20:52.480
<v Speaker 4>now has all of the user conversations with CHATGBT, which

0:20:52.520 --> 0:20:55.040
<v Speaker 4>are really useful. Claude has a ton of coding data

0:20:55.040 --> 0:20:57.720
<v Speaker 4>that other people don't have. Google has YouTube, which some

0:20:57.760 --> 0:21:00.760
<v Speaker 4>people think might be like the next source of making

0:21:00.800 --> 0:21:03.520
<v Speaker 4>really good models, and none of those things are really included,

0:21:03.880 --> 0:21:06.120
<v Speaker 4>at least not much in today's models.

0:21:06.560 --> 0:21:07.680
<v Speaker 3>This is really important.

0:21:07.800 --> 0:21:11.920
<v Speaker 1>Like once a lab builds some sort of base, whether

0:21:12.040 --> 0:21:16.160
<v Speaker 1>it's anthropic encoding or maybe cursor encoding, even though they're

0:21:16.200 --> 0:21:19.960
<v Speaker 1>not like a core lab, et cetera, like they become

0:21:20.119 --> 0:21:22.920
<v Speaker 1>a source of their own data that literally nobody else has.

0:21:23.240 --> 0:21:26.320
<v Speaker 4>Yeah, actually Cursor is a great example. So they are

0:21:26.440 --> 0:21:29.320
<v Speaker 4>very technical, they have really smart people. They're very small,

0:21:29.440 --> 0:21:32.760
<v Speaker 4>so they haven't quite scaled to at least in terms

0:21:32.760 --> 0:21:33.400
<v Speaker 4>of the number of people.

0:21:33.400 --> 0:21:34.960
<v Speaker 1>But I think about this like every time I was like,

0:21:35.440 --> 0:21:37.000
<v Speaker 1>when I've played with this is like this is good,

0:21:37.040 --> 0:21:40.040
<v Speaker 1>this is bad. I'm constantly teaching their model to get better,

0:21:40.119 --> 0:21:41.480
<v Speaker 1>right right, right right.

0:21:41.880 --> 0:21:43.479
<v Speaker 4>They're in a problem where they have the data. They

0:21:43.520 --> 0:21:46.000
<v Speaker 4>just have to take the right algorithms and scale it

0:21:46.080 --> 0:21:47.879
<v Speaker 4>up to train the model to be as good as

0:21:48.040 --> 0:21:50.720
<v Speaker 4>Claude is. But that actually seems a lot more feasible

0:21:50.800 --> 0:21:53.080
<v Speaker 4>than other companies that have no data and want to

0:21:53.080 --> 0:21:55.080
<v Speaker 4>train good models, even if they know how, it seems

0:21:55.200 --> 0:21:55.800
<v Speaker 4>very difficult.

0:21:56.640 --> 0:22:01.439
<v Speaker 2>How closely are AI researchers working or talking to I

0:22:01.440 --> 0:22:04.840
<v Speaker 2>guess other parts of the AI ecosystem, so you know,

0:22:05.080 --> 0:22:09.159
<v Speaker 2>chip makers, maybe cloud providers, that sort of thing. Is

0:22:09.160 --> 0:22:10.760
<v Speaker 2>there a lot of dialogue or not really.

0:22:11.000 --> 0:22:14.240
<v Speaker 4>I think certain people talk all the time to the

0:22:14.280 --> 0:22:17.720
<v Speaker 4>chip makers, Like there's a big community of people. You know,

0:22:17.800 --> 0:22:21.080
<v Speaker 4>the AI models all run on GPUs, and there are

0:22:21.160 --> 0:22:23.040
<v Speaker 4>a lot of people that are getting really good at

0:22:23.080 --> 0:22:26.760
<v Speaker 4>writing fast GPU code. It's called kernels, and all those

0:22:26.760 --> 0:22:29.280
<v Speaker 4>people who work on kernels talk to the chip makers

0:22:29.320 --> 0:22:32.040
<v Speaker 4>all the time. Like Amazon's making their own chips, Google

0:22:32.080 --> 0:22:35.000
<v Speaker 4>has their own chip. Now all the hyperscalers are making chips,

0:22:35.000 --> 0:22:36.919
<v Speaker 4>and I think they're all trying to talk to the

0:22:36.920 --> 0:22:39.240
<v Speaker 4>people that actually write the fast code that runs on

0:22:39.320 --> 0:22:41.480
<v Speaker 4>chips to figure out I think they call it hardware

0:22:41.520 --> 0:22:43.919
<v Speaker 4>software code design, Like everyone's kind of getting together and

0:22:43.960 --> 0:22:45.560
<v Speaker 4>trying to figure out what the best way is to

0:22:45.640 --> 0:22:47.639
<v Speaker 4>design the next round of GPUs.

0:22:48.359 --> 0:22:52.760
<v Speaker 1>So you mentioned, okay, Google might have an advantage because

0:22:52.960 --> 0:22:56.199
<v Speaker 1>it owns YouTube and there's just tons of obviously just

0:22:56.280 --> 0:22:56.840
<v Speaker 1>tons of.

0:22:57.119 --> 0:22:57.800
<v Speaker 3>Data in there.

0:22:57.840 --> 0:23:00.560
<v Speaker 1>So one way you could get access to the YouTube

0:23:00.640 --> 0:23:03.439
<v Speaker 1>data is to literally be Google and own it. But

0:23:03.560 --> 0:23:06.640
<v Speaker 1>another way that maybe you could get access to YouTube

0:23:06.720 --> 0:23:10.560
<v Speaker 1>data is operate in China where there are no laws

0:23:10.680 --> 0:23:13.320
<v Speaker 1>about this type of thing, or no, they're not beholding

0:23:13.320 --> 0:23:16.920
<v Speaker 1>the US copyright and just sort of scrape at all. Again,

0:23:17.080 --> 0:23:20.360
<v Speaker 1>since most of the Chinese AI labs are open to source,

0:23:21.040 --> 0:23:24.560
<v Speaker 1>why isn't this just a huge advantage for the Chinese

0:23:24.600 --> 0:23:27.359
<v Speaker 1>labs that they're really not going to be Hey, open

0:23:27.400 --> 0:23:29.080
<v Speaker 1>AI they get super at the New York Times all

0:23:29.119 --> 0:23:32.440
<v Speaker 1>these deepseek isn't having to deal with all these headaches?

0:23:33.000 --> 0:23:38.919
<v Speaker 4>Yeah, I think the American AI labs will probably do

0:23:39.080 --> 0:23:41.720
<v Speaker 4>things behind the scenes that they wouldn't tell you about

0:23:41.800 --> 0:23:45.439
<v Speaker 4>to get good data solution. Just don't so Yeah, Like

0:23:45.520 --> 0:23:48.680
<v Speaker 4>I think they wouldn't release the models that are potentially

0:23:48.720 --> 0:23:51.639
<v Speaker 4>trained on scraped or copyrighted data. But if that's the

0:23:51.680 --> 0:23:55.000
<v Speaker 4>way to get better math Olympiad scores, then people will

0:23:55.160 --> 0:23:57.560
<v Speaker 4>I think I would guess do that. But you're right

0:23:57.600 --> 0:24:00.159
<v Speaker 4>that like the Chinese, the Chinese model makers can to

0:24:00.160 --> 0:24:02.480
<v Speaker 4>sort of take all the books that they can pirate

0:24:02.560 --> 0:24:04.600
<v Speaker 4>from the Internet and train on them and they're not

0:24:04.680 --> 0:24:06.920
<v Speaker 4>violating any laws and they can release the model to

0:24:06.960 --> 0:24:09.520
<v Speaker 4>the public and it's all fine, which is honestly great

0:24:09.520 --> 0:24:12.800
<v Speaker 4>for us because then people like me could probably download

0:24:12.800 --> 0:24:15.240
<v Speaker 4>a model that's better than we would get otherwise.

0:24:15.640 --> 0:24:18.400
<v Speaker 2>What was your impression of deep Seek when it came out?

0:24:18.680 --> 0:24:19.080
<v Speaker 2>And now?

0:24:20.119 --> 0:24:24.000
<v Speaker 4>I was pretty surprised at how much of a splash

0:24:24.080 --> 0:24:26.920
<v Speaker 4>they made. The model is really good, and I think

0:24:26.960 --> 0:24:30.639
<v Speaker 4>a lot of people are building on it, including me,

0:24:30.840 --> 0:24:33.399
<v Speaker 4>and like most people that are at AI companies that

0:24:33.440 --> 0:24:36.639
<v Speaker 4>aren't super super big are building on deep Seek. But

0:24:37.840 --> 0:24:40.960
<v Speaker 4>it was surprising, like what a huge deal it was

0:24:41.000 --> 0:24:43.280
<v Speaker 4>to people, like my mom's asking me about deep Seek.

0:24:43.280 --> 0:24:45.439
<v Speaker 4>I think my grandma knew about deep Seek, and she

0:24:45.520 --> 0:24:47.040
<v Speaker 4>barely knew about chat GBT.

0:24:47.200 --> 0:24:50.800
<v Speaker 2>So that's when you know it's gone mainstream when starts.

0:24:50.480 --> 0:24:53.399
<v Speaker 4>Asking you and there was nothing else so far. I

0:24:53.480 --> 0:24:55.879
<v Speaker 4>think in the AI space that's made quite that much news.

0:24:55.960 --> 0:24:58.200
<v Speaker 1>But it sounds like what you're saying is that it's

0:24:58.240 --> 0:25:01.520
<v Speaker 1>a very good model, but that on the actual specs

0:25:02.359 --> 0:25:05.800
<v Speaker 1>from your perspective, it didn't quite deserve is much attention,

0:25:05.920 --> 0:25:08.680
<v Speaker 1>Like it was good, but like in your view, it's

0:25:08.720 --> 0:25:11.720
<v Speaker 1>not so good that everyone needed to be talking about it.

0:25:12.000 --> 0:25:16.360
<v Speaker 4>Yeah, I think it's really useful because they released all

0:25:16.400 --> 0:25:19.080
<v Speaker 4>their model weights and they said exactly what they did

0:25:19.080 --> 0:25:21.119
<v Speaker 4>to train it. Although they didn't say what the data was,

0:25:21.640 --> 0:25:24.240
<v Speaker 4>but it gave me the impression of there maybe six

0:25:24.280 --> 0:25:26.840
<v Speaker 4>to twelve months behind the American AI labs in terms

0:25:26.880 --> 0:25:29.040
<v Speaker 4>of how well they can do the training and stuff.

0:25:29.200 --> 0:25:31.560
<v Speaker 4>But it still was a pretty big update for me

0:25:31.680 --> 0:25:34.080
<v Speaker 4>to know that, Wow, there are one hundred people that

0:25:34.160 --> 0:25:36.520
<v Speaker 4>don't have PhDs working at a Chinese hedge fund that

0:25:36.560 --> 0:25:39.320
<v Speaker 4>are training these like cutting edge models. Like it is

0:25:39.359 --> 0:25:41.480
<v Speaker 4>incredible and they work very hard, they're very good.

0:25:57.600 --> 0:26:00.119
<v Speaker 2>Do you have pressure or do you feel pressure or

0:26:00.600 --> 0:26:05.280
<v Speaker 2>do AI researchers in general fuel pressure to consider monetization

0:26:05.520 --> 0:26:08.680
<v Speaker 2>when they're researching things or is it you know, mostly

0:26:09.080 --> 0:26:12.960
<v Speaker 2>still curiosity driven, that sort of old school Silicon Valley

0:26:13.000 --> 0:26:15.399
<v Speaker 2>we're improving the world kind of thing. Or is it

0:26:15.560 --> 0:26:19.280
<v Speaker 2>much more mercenary given that all of these big companies

0:26:19.320 --> 0:26:21.680
<v Speaker 2>seem to be competing in the same space.

0:26:22.359 --> 0:26:27.240
<v Speaker 4>Yeah. I think that over time it's gotten harder and

0:26:27.359 --> 0:26:30.400
<v Speaker 4>harder to do things that are just like cool ideas

0:26:30.560 --> 0:26:35.320
<v Speaker 4>or seem cute but don't have any necessary application, and

0:26:35.520 --> 0:26:37.639
<v Speaker 4>things are getting closer and closer to products, you know,

0:26:37.760 --> 0:26:40.720
<v Speaker 4>even like the language models that power CHAGBT. I was

0:26:40.760 --> 0:26:43.720
<v Speaker 4>working in those before CHAGBT, and they had some uses,

0:26:43.760 --> 0:26:47.439
<v Speaker 4>but also they're intellectually interesting and like fun to build.

0:26:47.960 --> 0:26:50.560
<v Speaker 4>But now if I came up with a better way

0:26:50.600 --> 0:26:54.360
<v Speaker 4>to train CHAGBT, that's like a multi billion dollar innovation.

0:26:54.560 --> 0:26:55.440
<v Speaker 2>The stakes are higher.

0:26:55.640 --> 0:26:57.760
<v Speaker 4>Yeah, I'd be like an asset to like the United

0:26:57.800 --> 0:27:00.120
<v Speaker 4>States government or something if I knew how to do that.

0:27:00.240 --> 0:27:02.800
<v Speaker 4>So I guess it depends on what kind of problems

0:27:02.840 --> 0:27:05.560
<v Speaker 4>you work on. Like, I'm more interested in understanding how

0:27:05.600 --> 0:27:10.080
<v Speaker 4>things work, so it becomes a bit less financially dire.

0:27:10.200 --> 0:27:14.440
<v Speaker 1>I think that six to twelve month gap between what

0:27:14.480 --> 0:27:17.640
<v Speaker 1>was that that was a January deep segment. Yeah, everyone

0:27:18.200 --> 0:27:20.320
<v Speaker 1>was in December that they first got at attention, then

0:27:20.359 --> 0:27:22.399
<v Speaker 1>for some reason really hit in January. Is that a

0:27:22.440 --> 0:27:26.160
<v Speaker 1>sustainable gap? Is there something either in access to data,

0:27:26.400 --> 0:27:30.240
<v Speaker 1>access to talent, access to compute, access to chips, whatever,

0:27:30.320 --> 0:27:34.800
<v Speaker 1>access to energy that in your view will allow us

0:27:34.840 --> 0:27:37.680
<v Speaker 1>frontier lebs to maintain some sort of six to twelve

0:27:37.720 --> 0:27:38.720
<v Speaker 1>month gap for a while.

0:27:39.119 --> 0:27:41.560
<v Speaker 4>It's pretty unclear to me. I think there are different

0:27:41.600 --> 0:27:43.920
<v Speaker 4>beliefs you can have. You can believe that the ideas

0:27:44.000 --> 0:27:46.959
<v Speaker 4>and the people are really the thing that differentiates the models,

0:27:47.040 --> 0:27:49.160
<v Speaker 4>and in that case, I think we haven't so far

0:27:49.240 --> 0:27:52.920
<v Speaker 4>seen a lot of like the top USAI researchers going

0:27:52.920 --> 0:27:56.800
<v Speaker 4>to work at Chinese labs, so that seems stable. You

0:27:56.800 --> 0:27:59.080
<v Speaker 4>could think that chips really matter, and in that case

0:27:59.600 --> 0:28:02.360
<v Speaker 4>the chip race is really happening between big American companies.

0:28:02.400 --> 0:28:04.960
<v Speaker 4>Like I think, actually China has a pretty big deficit

0:28:05.000 --> 0:28:08.440
<v Speaker 4>coming up in terms of like the GPUs we're exporting,

0:28:09.080 --> 0:28:10.919
<v Speaker 4>or you can think that the data matters, and I

0:28:10.920 --> 0:28:15.439
<v Speaker 4>guess actually any of these point in the favor of

0:28:15.480 --> 0:28:18.040
<v Speaker 4>the US. I think if you think the data really matters,

0:28:18.600 --> 0:28:21.520
<v Speaker 4>maybe the data they gather through like deepseek dot com

0:28:21.600 --> 0:28:23.359
<v Speaker 4>usage is really good and they can use it to

0:28:23.400 --> 0:28:26.119
<v Speaker 4>like bootstrap a better model. But I think the American

0:28:26.160 --> 0:28:28.639
<v Speaker 4>companies really do have an advantage. Like you all might

0:28:28.680 --> 0:28:31.600
<v Speaker 4>have heard this story just as an anecdote. Apparently at

0:28:31.640 --> 0:28:36.240
<v Speaker 4>Anthropic they've been buying and scanning thousands of old books

0:28:36.480 --> 0:28:38.520
<v Speaker 4>for several years, so they have this division. I think

0:28:38.520 --> 0:28:41.640
<v Speaker 4>they're based in New York that buys like shipping containers

0:28:41.680 --> 0:28:44.760
<v Speaker 4>full of old manuscripts, cuts off the spines and puts

0:28:44.800 --> 0:28:47.000
<v Speaker 4>them in these scanning machines and then they turn them

0:28:47.040 --> 0:28:49.840
<v Speaker 4>into like really high quality text. And so I'm noting

0:28:49.880 --> 0:28:53.040
<v Speaker 4>Claude has this like weird aspect to it. Maybe part

0:28:53.080 --> 0:28:57.160
<v Speaker 4>of the reason is they've gathered like trillions of words

0:28:57.280 --> 0:29:00.360
<v Speaker 4>worth of like old book data over many years, and

0:29:00.400 --> 0:29:03.080
<v Speaker 4>that's pretty hard to replicate elsewhere. So I think that

0:29:03.160 --> 0:29:05.040
<v Speaker 4>head start really does mean a lot.

0:29:06.160 --> 0:29:08.600
<v Speaker 2>What are you most excited about at the moment? The

0:29:08.640 --> 0:29:12.520
<v Speaker 2>book thing sounds very cool, but what is getting all

0:29:12.520 --> 0:29:13.960
<v Speaker 2>your attention right now?

0:29:14.160 --> 0:29:18.800
<v Speaker 4>Thanks for asking. I think I mentioned before I'm really

0:29:18.840 --> 0:29:21.560
<v Speaker 4>trying to figure out what's coming next. There are some

0:29:21.640 --> 0:29:24.800
<v Speaker 4>obvious things like we can get computer usage data and

0:29:25.160 --> 0:29:27.720
<v Speaker 4>train better agents, or we can get more coding data

0:29:27.800 --> 0:29:30.280
<v Speaker 4>and make them better coding or writing gp code or whatever,

0:29:30.560 --> 0:29:35.280
<v Speaker 4>But like, what are the non obvious advancements? And my

0:29:35.480 --> 0:29:39.640
<v Speaker 4>personal opinion is that the next round of improvements and

0:29:39.680 --> 0:29:44.240
<v Speaker 4>AI models will come from some type of personalization and

0:29:44.360 --> 0:29:48.560
<v Speaker 4>online learning, which means like models that one are trained

0:29:48.640 --> 0:29:50.880
<v Speaker 4>like per person or per company. So like you could

0:29:50.880 --> 0:29:54.280
<v Speaker 4>think of like CHADGBT is the same model that gets

0:29:54.280 --> 0:29:57.680
<v Speaker 4>served to everyone, so it has to store information about

0:29:57.920 --> 0:30:02.080
<v Speaker 4>random restaurants and like countries you never go to. But

0:30:02.200 --> 0:30:04.880
<v Speaker 4>instead if you had a CHAGBT that's specific to Bloomberg

0:30:04.960 --> 0:30:07.719
<v Speaker 4>or specific to your work, it might be able to

0:30:07.760 --> 0:30:10.280
<v Speaker 4>like use more of its brain to do work for you.

0:30:10.760 --> 0:30:13.040
<v Speaker 4>And then the second thing is if it was updated

0:30:13.120 --> 0:30:14.960
<v Speaker 4>every day, so like if you ask it to make

0:30:15.000 --> 0:30:19.160
<v Speaker 4>your odd lots calendar, yeah, or RSS feed and you're like, no,

0:30:19.360 --> 0:30:21.160
<v Speaker 4>that was wrong, Like you did it wrong for this

0:30:21.240 --> 0:30:23.960
<v Speaker 4>reason this reason, and you try again tomorrow, it'll still

0:30:24.000 --> 0:30:27.560
<v Speaker 4>break tomorrow because it doesn't like continuously improve its capabilities.

0:30:28.080 --> 0:30:31.520
<v Speaker 4>So oh yeah, I think that's the direction things are going.

0:30:31.640 --> 0:30:33.520
<v Speaker 3>I've heard people talk about this now.

0:30:33.560 --> 0:30:36.600
<v Speaker 1>Granted, models are getting better over time, but you know,

0:30:36.640 --> 0:30:40.880
<v Speaker 1>people might compare a coding model to a beginning software

0:30:41.000 --> 0:30:43.080
<v Speaker 1>engineer and say, the coding model is better, but that

0:30:43.200 --> 0:30:45.360
<v Speaker 1>software engineer is going to start getting better the next

0:30:45.400 --> 0:30:47.040
<v Speaker 1>day they're on the job, and every day for the

0:30:47.040 --> 0:30:49.240
<v Speaker 1>rest of their career, they're probably going to be a

0:30:49.280 --> 0:30:52.600
<v Speaker 1>better software engineer than they were the day before, whereas

0:30:52.640 --> 0:30:56.800
<v Speaker 1>at least that version of the model will not be better.

0:30:56.840 --> 0:30:58.200
<v Speaker 3>That is that right? Yeah?

0:30:58.240 --> 0:31:00.000
<v Speaker 1>Yeah, that seems like an issue that people talk about

0:31:00.280 --> 0:31:00.800
<v Speaker 1>in your world.

0:31:01.040 --> 0:31:03.160
<v Speaker 4>Yeah, yeah, I think this is a big problem. It's

0:31:03.200 --> 0:31:06.000
<v Speaker 4>like we have to wait six months for the chat

0:31:06.040 --> 0:31:09.000
<v Speaker 4>GPT five point one to come out, and then maybe

0:31:09.040 --> 0:31:11.640
<v Speaker 4>they'll include your problems as the training data, and so

0:31:11.680 --> 0:31:14.520
<v Speaker 4>maybe it'll get better, but it might not. And instead,

0:31:14.560 --> 0:31:17.280
<v Speaker 4>I think people need to think about ways to do

0:31:17.320 --> 0:31:20.360
<v Speaker 4>that update more dynamically, like every time you talk to it,

0:31:20.600 --> 0:31:22.360
<v Speaker 4>or maybe every night when you go to sleep, the

0:31:22.400 --> 0:31:24.880
<v Speaker 4>model kind of like gets to work and studies what

0:31:24.920 --> 0:31:27.040
<v Speaker 4>it was talking to you about and crafts better tests

0:31:27.040 --> 0:31:28.920
<v Speaker 4>for itself and then learns and then when you wake up,

0:31:29.000 --> 0:31:30.120
<v Speaker 4>the model's actually better.

0:31:30.600 --> 0:31:33.120
<v Speaker 1>The other big question that I have and is kind

0:31:33.160 --> 0:31:36.280
<v Speaker 1>of related to this, especially when we're talking about AI

0:31:36.440 --> 0:31:40.520
<v Speaker 1>replacing the humans in certain forms of labor, is that

0:31:40.720 --> 0:31:44.360
<v Speaker 1>like do we need really really advanced aid like in

0:31:44.400 --> 0:31:47.320
<v Speaker 1>other words, like there is a lot of again, the

0:31:47.440 --> 0:31:51.560
<v Speaker 1>existing models are extremely impressive, Like in your view, do

0:31:51.640 --> 0:31:54.880
<v Speaker 1>we need to get a lot better technically for them

0:31:54.920 --> 0:31:58.280
<v Speaker 1>to have economic impact? And since these are in many

0:31:58.280 --> 0:32:01.600
<v Speaker 1>cases businesses at the end of the day, is it

0:32:01.760 --> 0:32:05.400
<v Speaker 1>necessary that there's so much work being done towards advancing

0:32:05.880 --> 0:32:06.600
<v Speaker 1>the cutting edge?

0:32:06.920 --> 0:32:09.400
<v Speaker 4>Yeah, yeah, that's a great question, Like we could have

0:32:10.280 --> 0:32:13.720
<v Speaker 4>really good interns without ever getting better scores on the

0:32:13.720 --> 0:32:16.960
<v Speaker 4>Math Olympiad, Like that's not necessarily something that we ever

0:32:17.040 --> 0:32:19.680
<v Speaker 4>had to go after. I think part of the reason

0:32:19.720 --> 0:32:21.680
<v Speaker 4>for that is that AI labs are engaged in this

0:32:21.800 --> 0:32:24.360
<v Speaker 4>kind of neck and neck race to have the smartest model.

0:32:24.720 --> 0:32:28.640
<v Speaker 4>But I totally agree that AI could be economically transformative

0:32:29.040 --> 0:32:32.080
<v Speaker 4>without having a higher ceiling in terms of what it

0:32:32.080 --> 0:32:33.520
<v Speaker 4>can do. It's more like it needs to be more

0:32:33.560 --> 0:32:36.240
<v Speaker 4>consistent or like dependable than actually smarter.

0:32:37.320 --> 0:32:39.440
<v Speaker 2>This might be a weird question, but once you've made

0:32:39.640 --> 0:32:43.760
<v Speaker 2>a sort of foundational improvement to a particular model, how

0:32:43.800 --> 0:32:47.400
<v Speaker 2>easy or difficult is it to rewind if you need to.

0:32:47.760 --> 0:32:50.400
<v Speaker 2>And one of the reasons I ask is because you know,

0:32:50.480 --> 0:32:53.880
<v Speaker 2>some people have been complaining that they've been training chat

0:32:53.920 --> 0:32:56.760
<v Speaker 2>GPT to I don't know, be their boyfriend or whatever,

0:32:56.960 --> 0:33:01.040
<v Speaker 2>be their therapist topic. Yeah, and then it gets upgraded

0:33:01.360 --> 0:33:04.640
<v Speaker 2>and all of that training suddenly disappears and the personality

0:33:04.760 --> 0:33:06.280
<v Speaker 2>of the model changes.

0:33:07.160 --> 0:33:09.840
<v Speaker 4>Yeah, that was a really interesting story. So I think

0:33:09.880 --> 0:33:13.560
<v Speaker 4>the model before GPT five was four to zero. And

0:33:13.600 --> 0:33:17.280
<v Speaker 4>they said that they thought internally, like all the scientists

0:33:17.400 --> 0:33:20.600
<v Speaker 4>encoder people, that the new model was superior in every way.

0:33:20.640 --> 0:33:23.280
<v Speaker 4>It gives you shorter responses, it's a bit nicer, it's

0:33:23.360 --> 0:33:26.640
<v Speaker 4>much smarter. And then people got really upset because they

0:33:26.680 --> 0:33:28.720
<v Speaker 4>had spent so much time talking to the old model

0:33:28.760 --> 0:33:32.160
<v Speaker 4>that they felt like they'd experience like a serious loss

0:33:32.280 --> 0:33:33.080
<v Speaker 4>in their life.

0:33:33.240 --> 0:33:37.080
<v Speaker 2>Joe would miss the love it love it No.

0:33:37.160 --> 0:33:40.800
<v Speaker 1>But for real, this is un Ironically this strikes me

0:33:40.840 --> 0:33:44.440
<v Speaker 1>as another example for open source, which is that if

0:33:44.520 --> 0:33:47.160
<v Speaker 1>I'm going to form a I don't see it. I'm

0:33:47.160 --> 0:33:49.320
<v Speaker 1>forty five, I'm too old for that. But if someone

0:33:49.440 --> 0:33:51.800
<v Speaker 1>is going to form like some sort of friendship with

0:33:51.840 --> 0:33:54.480
<v Speaker 1>an AI model, I don't want it to be at

0:33:54.480 --> 0:33:57.400
<v Speaker 1>the whim of Sam Altman deciding it was like, oh

0:33:57.440 --> 0:34:00.160
<v Speaker 1>there's an upgrade. I would like to be friends, so

0:34:00.200 --> 0:34:02.120
<v Speaker 1>weird to be friends with the model that I know

0:34:02.200 --> 0:34:06.480
<v Speaker 1>that I can run in perpetuity and it will never change.

0:34:06.720 --> 0:34:09.359
<v Speaker 4>Yeah. I think that's definitely a good argument for why

0:34:09.400 --> 0:34:12.440
<v Speaker 4>open source is important, And if you ever fall in

0:34:12.480 --> 0:34:14.200
<v Speaker 4>love with a model, you should fall in love with

0:34:14.239 --> 0:34:14.880
<v Speaker 4>an openness.

0:34:16.280 --> 0:34:18.320
<v Speaker 2>That's good life advice, practical life.

0:34:18.120 --> 0:34:19.480
<v Speaker 3>Advice, really good life advice.

0:34:19.680 --> 0:34:22.399
<v Speaker 2>Well, speaking of open source, you know, I know programmers

0:34:22.719 --> 0:34:26.080
<v Speaker 2>tend to like open source for obvious reasons, but are

0:34:26.160 --> 0:34:30.680
<v Speaker 2>there any downsides to open source for AI specifically?

0:34:31.080 --> 0:34:33.080
<v Speaker 4>I think if you're running a company, there are a

0:34:33.120 --> 0:34:36.000
<v Speaker 4>lot of downsides potentially to open source. If you have

0:34:36.200 --> 0:34:41.120
<v Speaker 4>some brand new, fancy way of doing computation inside the

0:34:41.160 --> 0:34:43.319
<v Speaker 4>model that's actually better, you might want to keep that

0:34:43.360 --> 0:34:45.680
<v Speaker 4>information to yourself. And when you release the model, to

0:34:45.719 --> 0:34:47.920
<v Speaker 4>make it runnable, you have to release all the code

0:34:47.960 --> 0:34:50.600
<v Speaker 4>to run the model, which might contain like your secrets,

0:34:50.640 --> 0:34:52.520
<v Speaker 4>and so I think that's why people are hesitant to

0:34:52.520 --> 0:34:55.520
<v Speaker 4>do it. The other reason is because when you release

0:34:55.600 --> 0:34:59.480
<v Speaker 4>the model, it actually contains quite a lot of residual

0:34:59.480 --> 0:35:02.319
<v Speaker 4>information about how you actually trained it, Like you might

0:35:02.360 --> 0:35:04.400
<v Speaker 4>be able to infer what the data set was and

0:35:04.440 --> 0:35:08.080
<v Speaker 4>what the training process was, or even reconstruct the entire

0:35:08.200 --> 0:35:10.760
<v Speaker 4>training data set given just the weights of the model.

0:35:11.080 --> 0:35:15.160
<v Speaker 4>And so if you're worried about people finding out that

0:35:15.200 --> 0:35:17.400
<v Speaker 4>a certain thing was in your training data, you probably

0:35:17.440 --> 0:35:19.040
<v Speaker 4>can't release that model open source.

0:35:19.760 --> 0:35:23.520
<v Speaker 2>That reminds me how much of an AI researcher's day

0:35:23.520 --> 0:35:27.040
<v Speaker 2>to day life is just like looking at other model,

0:35:27.120 --> 0:35:30.359
<v Speaker 2>other people's models, and trying to, like I guess, pull

0:35:30.400 --> 0:35:32.799
<v Speaker 2>them apart and figure out how they were made and

0:35:32.800 --> 0:35:33.920
<v Speaker 2>sort of work backwards.

0:35:34.960 --> 0:35:38.080
<v Speaker 4>That definitely happens from time to time. I think usually

0:35:38.120 --> 0:35:41.000
<v Speaker 4>the scientific process is something like you start with other

0:35:41.040 --> 0:35:44.360
<v Speaker 4>people's models, and you run them and you see what happens,

0:35:44.400 --> 0:35:46.960
<v Speaker 4>and then you decide on some part of that process

0:35:47.000 --> 0:35:49.680
<v Speaker 4>that you think could be improved or could be explored further,

0:35:50.040 --> 0:35:51.960
<v Speaker 4>and you make some tiny changes to it, and then

0:35:52.000 --> 0:35:54.640
<v Speaker 4>you run it again and you compare like numbers, or

0:35:54.680 --> 0:35:57.320
<v Speaker 4>you make graphs of what happened before and what happens after.

0:35:57.680 --> 0:35:59.960
<v Speaker 4>So actually quite a bit of it, like, for example,

0:36:00.040 --> 0:36:02.520
<v Speaker 4>pull the GPT two model from open Ai, which was

0:36:03.280 --> 0:36:06.840
<v Speaker 4>twenty nineteen or something, their first kind of really larger

0:36:06.920 --> 0:36:10.279
<v Speaker 4>scale chatbot. Like I've spent hundreds of hours kind of

0:36:10.320 --> 0:36:12.520
<v Speaker 4>like playing with that code and talking to the model

0:36:12.560 --> 0:36:15.400
<v Speaker 4>and stuff like that. So thank goodness for open source.

0:36:15.440 --> 0:36:18.600
<v Speaker 1>For that reason, I joked in the beginning about you

0:36:18.680 --> 0:36:21.879
<v Speaker 1>having one hundred million dollar salary, but for real, as

0:36:21.920 --> 0:36:24.960
<v Speaker 1>you think about your career, and I hope you do

0:36:24.960 --> 0:36:28.040
<v Speaker 1>get a hundred million dollar salary, but as you think

0:36:28.080 --> 0:36:30.799
<v Speaker 1>about your career, what excites you?

0:36:30.880 --> 0:36:32.240
<v Speaker 3>And how much is it money?

0:36:32.440 --> 0:36:34.520
<v Speaker 1>But the reason I think about this is like they're

0:36:34.600 --> 0:36:38.000
<v Speaker 1>huge checks out there, but maybe some things are more.

0:36:38.040 --> 0:36:42.160
<v Speaker 1>Maybe achieving AGI is more excited than making an ad

0:36:42.200 --> 0:36:46.040
<v Speaker 1>network more efficient. Maybe something there's something more exciting than

0:36:46.640 --> 0:36:50.360
<v Speaker 1>shaving off a billionth of a second in terms of

0:36:50.360 --> 0:36:53.320
<v Speaker 1>a trade execution, all these things like how much is

0:36:53.360 --> 0:36:57.399
<v Speaker 1>it about exploring the frontiers of science, the new space race,

0:36:57.480 --> 0:36:59.840
<v Speaker 1>landing on the Moon versus the paycheck?

0:37:00.160 --> 0:37:03.000
<v Speaker 4>It's all about the paycheck. I'm just kidding. No, no,

0:37:03.160 --> 0:37:05.680
<v Speaker 4>not at all. Yeah, it's funny you ask. So this

0:37:05.719 --> 0:37:08.160
<v Speaker 4>hasn't happened to me, But just in the past two

0:37:08.200 --> 0:37:10.759
<v Speaker 4>weeks or so, a good friend of mine has been

0:37:10.800 --> 0:37:13.520
<v Speaker 4>dealing with this problem because she got an offer on

0:37:13.600 --> 0:37:15.839
<v Speaker 4>the order of like tens of millions of dollars per

0:37:15.920 --> 0:37:20.680
<v Speaker 4>year from a big AI company and she wasn't sure

0:37:20.680 --> 0:37:23.279
<v Speaker 4>if she wanted to work there, and I think originally

0:37:23.320 --> 0:37:26.799
<v Speaker 4>she said no, and then they doubled her offer, and

0:37:26.840 --> 0:37:28.920
<v Speaker 4>then like it's the exact same amount of cash, but

0:37:28.920 --> 0:37:31.120
<v Speaker 4>twice as much per year for certain number of years.

0:37:31.640 --> 0:37:34.600
<v Speaker 4>And you know, we were talking amongst ourselves like what

0:37:34.600 --> 0:37:37.120
<v Speaker 4>does this even mean at this point, Like you're, you know,

0:37:37.160 --> 0:37:40.279
<v Speaker 4>a twenty eight year old computer scientist that's been coming

0:37:40.320 --> 0:37:42.239
<v Speaker 4>from a PhD. So you make more on the order

0:37:42.280 --> 0:37:45.440
<v Speaker 4>of tens of thousands of dollars per year. I honestly

0:37:45.520 --> 0:37:49.120
<v Speaker 4>think personally, the marginal difference between having like ten and

0:37:49.160 --> 0:37:51.640
<v Speaker 4>twenty million dollars is like very low, Like I don't

0:37:51.640 --> 0:37:53.440
<v Speaker 4>even know what I would do with this.

0:37:53.520 --> 0:37:57.479
<v Speaker 1>Is this is my experience for me making ten million

0:37:57.520 --> 0:37:58.920
<v Speaker 1>twenty mine has basically.

0:37:58.520 --> 0:37:59.160
<v Speaker 3>Been the same to me.

0:37:59.360 --> 0:38:05.120
<v Speaker 4>Yeah, congratulations, but so yeah, I think there's more of

0:38:05.160 --> 0:38:08.040
<v Speaker 4>a desire to like be there the next time something

0:38:08.080 --> 0:38:12.040
<v Speaker 4>really interesting happens, and that kind of supersedes the money.

0:38:12.120 --> 0:38:14.120
<v Speaker 4>Like any of these places will pay you what's like

0:38:14.160 --> 0:38:16.319
<v Speaker 4>a really good salary to live on, and so it's

0:38:16.360 --> 0:38:19.399
<v Speaker 4>actually not a big consideration. It only becomes complicated when

0:38:19.400 --> 0:38:21.920
<v Speaker 4>you have like one option that's going to pay you

0:38:21.960 --> 0:38:24.319
<v Speaker 4>like forty times more than the other option, and then

0:38:24.760 --> 0:38:25.799
<v Speaker 4>things get confusing.

0:38:26.320 --> 0:38:28.879
<v Speaker 2>No, this isn't this should actually I was just thinking

0:38:28.920 --> 0:38:29.919
<v Speaker 2>about making twenty million.

0:38:30.000 --> 0:38:30.439
<v Speaker 3>No, I think.

0:38:32.080 --> 0:38:33.920
<v Speaker 1>Because I think about, Okay, what if you have this

0:38:33.960 --> 0:38:37.120
<v Speaker 1>great salary and you're like can live very easily in

0:38:37.120 --> 0:38:40.160
<v Speaker 1>New York City and have a really great life, or

0:38:40.600 --> 0:38:43.120
<v Speaker 1>you could make ten times that, which is a stupid

0:38:43.400 --> 0:38:45.800
<v Speaker 1>insane salary, right, but you don't write like your job.

0:38:46.040 --> 0:38:46.680
<v Speaker 3>But it's so.

0:38:46.800 --> 0:38:51.600
<v Speaker 1>Much money that strikes me is like not a trivial life.

0:38:52.400 --> 0:38:54.239
<v Speaker 1>You only live one time. There's like a different so

0:38:54.239 --> 0:38:55.480
<v Speaker 1>it could be a difficult question.

0:38:55.800 --> 0:38:58.400
<v Speaker 4>Yeah, yeah, but you can remind yourself that, like the

0:38:58.520 --> 0:39:02.120
<v Speaker 4>job you take once isn't the job that defines you forever.

0:39:02.280 --> 0:39:04.040
<v Speaker 4>Maybe maybe the right thing to do is to take

0:39:04.080 --> 0:39:05.520
<v Speaker 4>it for a few years but not the whole time,

0:39:05.520 --> 0:39:06.279
<v Speaker 4>and then go do something.

0:39:06.320 --> 0:39:09.680
<v Speaker 1>Everyone says they're going to do that and then.

0:39:09.560 --> 0:39:14.080
<v Speaker 2>They get locked in. Speaking of insanely large salaries, we

0:39:14.160 --> 0:39:16.520
<v Speaker 2>know that people are earning these salaries because they're like

0:39:16.719 --> 0:39:23.320
<v Speaker 2>star AI researchers. How much does personality play into where

0:39:23.360 --> 0:39:25.279
<v Speaker 2>you want to go work? Would you want to go

0:39:25.320 --> 0:39:30.399
<v Speaker 2>work somewhere specifically because there's an absolutely amazing researcher, or

0:39:30.520 --> 0:39:32.920
<v Speaker 2>does it tend to be again more about the paycheck,

0:39:32.960 --> 0:39:35.160
<v Speaker 2>maybe more about the data that's available to you, or

0:39:35.200 --> 0:39:37.440
<v Speaker 2>maybe more about the specific project that you're going to

0:39:37.480 --> 0:39:38.000
<v Speaker 2>be working on.

0:39:38.560 --> 0:39:42.520
<v Speaker 4>Yeah, I think different people assign different amounts of weight

0:39:42.600 --> 0:39:45.759
<v Speaker 4>to each of those things. In my experience, like most

0:39:45.760 --> 0:39:47.759
<v Speaker 4>of the people I know come from academia, which means

0:39:47.760 --> 0:39:49.920
<v Speaker 4>they already kind of gave up more of a salary

0:39:49.960 --> 0:39:52.800
<v Speaker 4>to do study things more deeply for several years. So

0:39:52.840 --> 0:39:55.239
<v Speaker 4>I think people that I know are more biased against money.

0:39:55.280 --> 0:39:58.040
<v Speaker 4>But like people do care about that. But I think

0:39:58.080 --> 0:40:00.640
<v Speaker 4>that the ego thing really matters. Some people want to

0:40:00.640 --> 0:40:02.640
<v Speaker 4>feel like they're very important and they're working on a

0:40:02.640 --> 0:40:05.840
<v Speaker 4>problem that matters. One way some companies are able to

0:40:05.840 --> 0:40:08.560
<v Speaker 4>pull researchers away from other companies is by saying, we'll

0:40:08.760 --> 0:40:11.200
<v Speaker 4>sign you more importance in your role and we'll give.

0:40:11.040 --> 0:40:12.440
<v Speaker 2>You we'll give you a really big title.

0:40:12.680 --> 0:40:16.400
<v Speaker 4>Yeah, exactly. Seriously, the title is like, Okay, maybe before

0:40:16.400 --> 0:40:17.960
<v Speaker 4>you were like a researcher or not. You get to

0:40:18.000 --> 0:40:19.840
<v Speaker 4>be like a head researcher. You get to have people

0:40:19.920 --> 0:40:22.000
<v Speaker 4>under you, or you're a chief scientist, and all these

0:40:22.000 --> 0:40:23.160
<v Speaker 4>things do matter to people.

0:40:23.719 --> 0:40:25.680
<v Speaker 3>It's a very good book about it.

0:40:26.000 --> 0:40:29.720
<v Speaker 1>Pursuing a mission in the realm of like a driven

0:40:29.840 --> 0:40:32.880
<v Speaker 1>visionary even when it's commercially.

0:40:32.600 --> 0:40:35.600
<v Speaker 2>Just say it, just say yeah, that's right. No.

0:40:35.719 --> 0:40:37.920
<v Speaker 1>I think about this all the time. Do you want

0:40:37.920 --> 0:40:39.120
<v Speaker 1>to work for Ilia or do you want to work

0:40:39.120 --> 0:40:40.919
<v Speaker 1>for Sam? And which one is the ahab and which

0:40:40.920 --> 0:40:44.160
<v Speaker 1>one is just trying to make an honest living selling ads.

0:40:44.239 --> 0:40:47.760
<v Speaker 1>I find this to be like a genuinely interesting, interesting

0:40:47.840 --> 0:40:50.680
<v Speaker 1>question for any individual to have to reckon with in

0:40:50.719 --> 0:40:51.200
<v Speaker 1>this career.

0:40:51.320 --> 0:40:52.960
<v Speaker 4>Oh. Absolutely, And sometimes it can be.

0:40:53.000 --> 0:40:55.719
<v Speaker 1>Very difficult to tell Jack Morris, thank you so much

0:40:55.719 --> 0:40:58.399
<v Speaker 1>for coming on. Please pursue a career that will allow

0:40:58.440 --> 0:40:59.840
<v Speaker 1>you to come back on a log.

0:41:00.200 --> 0:41:03.960
<v Speaker 2>Or insert the odd lots close when you're negotiating your

0:41:03.960 --> 0:41:06.239
<v Speaker 2>one hundred million dollar salary, or.

0:41:06.280 --> 0:41:08.560
<v Speaker 1>Take the fifty so you know what, fifty million, but

0:41:08.640 --> 0:41:10.880
<v Speaker 1>let me I don't need one hundred million, fifty million.

0:41:10.680 --> 0:41:11.359
<v Speaker 3>But keep the album.

0:41:11.880 --> 0:41:13.200
<v Speaker 4>Yeah, that would be fine with me.

0:41:13.360 --> 0:41:14.799
<v Speaker 3>All right, great, Well, thank you so much.

0:41:15.200 --> 0:41:16.680
<v Speaker 2>Yeah, thanks, thank you so much.

0:41:16.719 --> 0:41:17.239
<v Speaker 4>That was great.

0:41:29.680 --> 0:41:31.960
<v Speaker 1>Appreciate I think about that sometimes, like what if you

0:41:32.040 --> 0:41:35.479
<v Speaker 1>got like an insane salary like that, you just could

0:41:35.560 --> 0:41:37.480
<v Speaker 1>you would be insane to say no to But like

0:41:37.640 --> 0:41:39.000
<v Speaker 1>I don't know, that's I mean.

0:41:39.080 --> 0:41:41.960
<v Speaker 3>It's not our problem, but like, wouldn't it be fun?

0:41:42.200 --> 0:41:44.560
<v Speaker 1>You know? It's like, oh, but you're gonna be working

0:41:44.560 --> 0:41:47.600
<v Speaker 1>on ad optimization or whatever and you're not going to

0:41:47.680 --> 0:41:49.280
<v Speaker 1>be there when they land.

0:41:49.080 --> 0:41:51.640
<v Speaker 3>On the moon. But you got paid ten times.

0:41:51.360 --> 0:41:53.520
<v Speaker 1>More than the people at the Bay station working on

0:41:53.640 --> 0:41:55.560
<v Speaker 1>landing on the moon. That strins me as a kind

0:41:55.560 --> 0:41:56.520
<v Speaker 1>of a tough life choice.

0:41:56.520 --> 0:41:58.400
<v Speaker 2>I think you're using up a lot of brain power

0:41:58.400 --> 0:42:00.840
<v Speaker 2>and energy on a problem which will Jem said is

0:42:00.880 --> 0:42:01.239
<v Speaker 2>not you.

0:42:02.120 --> 0:42:03.000
<v Speaker 3>That's exactly right.

0:42:03.120 --> 0:42:06.600
<v Speaker 2>No, that conversation was really fun. Nice to talk to

0:42:06.640 --> 0:42:10.239
<v Speaker 2>an actual researcher just doing stuff in the space. One

0:42:10.239 --> 0:42:12.400
<v Speaker 2>thing I thought was very interesting was this idea that

0:42:12.480 --> 0:42:16.319
<v Speaker 2>everyone gets excited about a specific improvement in AI, and

0:42:16.360 --> 0:42:20.640
<v Speaker 2>then it seems like that particular one doesn't materialize and

0:42:20.760 --> 0:42:24.000
<v Speaker 2>instead something else emerges, as like the big breakthrough. So

0:42:24.360 --> 0:42:27.239
<v Speaker 2>instead of agents, we have math.

0:42:27.320 --> 0:42:29.640
<v Speaker 1>And math which none of us will ever. I would

0:42:29.840 --> 0:42:32.839
<v Speaker 1>really like for an agent to do something simple. I'm

0:42:32.880 --> 0:42:35.120
<v Speaker 1>going to a city book on the trip or whatever.

0:42:35.239 --> 0:42:37.000
<v Speaker 1>Or change my flight. Oh my god, I tried to.

0:42:37.160 --> 0:42:38.600
<v Speaker 2>That would be amazing.

0:42:38.080 --> 0:42:42.080
<v Speaker 1>Recently change my flight. Here's my information. I don't I

0:42:42.120 --> 0:42:45.239
<v Speaker 1>would like that. I do not need the math olympiad.

0:42:45.560 --> 0:42:46.440
<v Speaker 1>I am very impressed.

0:42:46.440 --> 0:42:47.160
<v Speaker 3>I don't need it.

0:42:47.640 --> 0:42:51.480
<v Speaker 2>Also, I am now very very intrigued by reinforced learning

0:42:51.880 --> 0:42:55.880
<v Speaker 2>and how you actually reward the computers for doing good stuff.

0:42:55.920 --> 0:42:58.440
<v Speaker 2>I feel like, actually that would be a really interesting

0:42:59.040 --> 0:43:03.200
<v Speaker 2>area to mine. Which is motivating motivating the models to

0:43:03.320 --> 0:43:03.880
<v Speaker 2>do better?

0:43:04.360 --> 0:43:06.680
<v Speaker 1>Yeah, I've thought about that, like in chess, like how

0:43:06.760 --> 0:43:08.359
<v Speaker 1>do how do the computers know.

0:43:08.360 --> 0:43:08.960
<v Speaker 3>They want to win?

0:43:09.160 --> 0:43:09.359
<v Speaker 4>Yeah?

0:43:09.400 --> 0:43:10.560
<v Speaker 3>You know, like why do they care?

0:43:10.760 --> 0:43:12.759
<v Speaker 2>You know, all they're saying anyway, why are they here?

0:43:12.920 --> 0:43:13.759
<v Speaker 2>Why are we here?

0:43:14.480 --> 0:43:16.240
<v Speaker 3>That's the thing with AI conversations.

0:43:16.400 --> 0:43:18.120
<v Speaker 2>That's existential fact, something.

0:43:17.880 --> 0:43:19.960
<v Speaker 1>We didn't talk about, which I am interested. No one

0:43:20.000 --> 0:43:22.600
<v Speaker 1>really talks about AI safety anymore. If you notice, like

0:43:22.640 --> 0:43:25.319
<v Speaker 1>they like very little, like for better or worse. You

0:43:25.360 --> 0:43:28.080
<v Speaker 1>don't hear people just all money and they don't really

0:43:28.120 --> 0:43:30.719
<v Speaker 1>talk about what the AI kill us all one day.

0:43:30.840 --> 0:43:32.879
<v Speaker 3>But one thing I did wonder about.

0:43:32.960 --> 0:43:35.440
<v Speaker 1>So when Deep Seat came out, one of its breakthroughs

0:43:35.520 --> 0:43:37.640
<v Speaker 1>was it showed the whole chain of thought, right, you

0:43:37.680 --> 0:43:40.200
<v Speaker 1>could see that, which prior to that open AI or

0:43:40.239 --> 0:43:42.319
<v Speaker 1>chatchybt's chain of thought model didn't show you.

0:43:42.280 --> 0:43:42.680
<v Speaker 4>That, right.

0:43:42.920 --> 0:43:44.920
<v Speaker 1>And it does strike me that if there are certain

0:43:45.000 --> 0:43:48.520
<v Speaker 1>things that are for safety reasons or whatever held back

0:43:48.600 --> 0:43:50.480
<v Speaker 1>or they don't want to do this, the nature of

0:43:50.480 --> 0:43:53.840
<v Speaker 1>competition means all the guardrails are coming off of Actually,

0:43:53.920 --> 0:43:56.279
<v Speaker 1>like that's if there's some guardrail you you have on

0:43:56.680 --> 0:43:59.480
<v Speaker 1>someone's going to open source whatever it is and they're

0:43:59.520 --> 0:44:00.520
<v Speaker 1>going to all give it up.

0:44:00.640 --> 0:44:04.080
<v Speaker 2>Yeah, both on the guardrails and on the data use ys.

0:44:04.600 --> 0:44:06.080
<v Speaker 2>All right, well shall we leave it there.

0:44:06.160 --> 0:44:06.879
<v Speaker 3>Let's leave it there.

0:44:07.040 --> 0:44:09.560
<v Speaker 2>This has been another episode of the aud Loots podcast.

0:44:09.600 --> 0:44:12.480
<v Speaker 2>I'm Tracy Alloway. You can follow me at Tracy Alloway.

0:44:12.640 --> 0:44:15.240
<v Speaker 1>And I'm Jill Wisenthal. You can follow me at the Stalwart.

0:44:15.360 --> 0:44:19.680
<v Speaker 1>Follow our guest Jack Morris, He's at j xmnop. Follow

0:44:19.760 --> 0:44:23.000
<v Speaker 1>our producers Kerman Rodriguez at Kerman armand dash O Bennett

0:44:23.000 --> 0:44:26.480
<v Speaker 1>at Dashbod and kil Brooks at Kilbrooks. More odd Loss content,

0:44:26.480 --> 0:44:28.520
<v Speaker 1>go to Bloomberg dot com slash od Lots with the

0:44:28.600 --> 0:44:31.319
<v Speaker 1>daily newsletter and all of our episodes, and you can

0:44:31.400 --> 0:44:33.359
<v Speaker 1>chat about all of these topics twenty four to seven

0:44:33.480 --> 0:44:36.520
<v Speaker 1>in our discord Discord dot gg slash.

0:44:36.200 --> 0:44:38.799
<v Speaker 2>Odd Lots And if you enjoy odd Lots, if you

0:44:38.960 --> 0:44:41.400
<v Speaker 2>like it when we talk about twenty million dollars salaries

0:44:41.440 --> 0:44:43.840
<v Speaker 2>that will never be ours, then please leave us a

0:44:43.920 --> 0:44:47.600
<v Speaker 2>positive review on your favorite podcast platform. And remember, if

0:44:47.640 --> 0:44:50.040
<v Speaker 2>you are a Bloomberg subscriber, you can listen to all

0:44:50.080 --> 0:44:52.960
<v Speaker 2>of our episodes absolutely ad free. All you need to

0:44:53.000 --> 0:44:55.520
<v Speaker 2>do is find the Bloomberg channel on Apple Podcasts and

0:44:55.600 --> 0:45:14.840
<v Speaker 2>follow the instructions there. Thanks for listening it