WEBVTT - Inside Hudson River Trading's Blistering Token Burn

0:00:02.720 --> 0:00:16.400
<v Speaker 1>Bloomberg Audio Studios, Podcasts, radio News.

0:00:18.520 --> 0:00:21.480
<v Speaker 2>Hello and welcome to another episode of the Odd Lots podcast.

0:00:21.560 --> 0:00:23.799
<v Speaker 3>I'm Joe Wisenthal and I'm Tracy Alway.

0:00:23.960 --> 0:00:26.400
<v Speaker 2>Tracy. We did another one of our live shows, this

0:00:26.480 --> 0:00:28.720
<v Speaker 2>time our biggest show ever, biggest show ever.

0:00:29.160 --> 0:00:31.720
<v Speaker 4>It was absolutely amazing. We did it at City Winery

0:00:31.800 --> 0:00:34.880
<v Speaker 4>in New York. I think we had over three hundred people.

0:00:35.080 --> 0:00:37.360
<v Speaker 2>Yeah, three hundred and fifty people were there.

0:00:37.560 --> 0:00:37.880
<v Speaker 3>Yeah.

0:00:37.920 --> 0:00:39.959
<v Speaker 4>And the crazy thing is, I think it was our

0:00:40.000 --> 0:00:43.440
<v Speaker 4>sort of first themed show, and we didn't really plan

0:00:43.560 --> 0:00:45.800
<v Speaker 4>it that way, but it just worked out right.

0:00:45.880 --> 0:00:48.400
<v Speaker 2>I guess like it themed an anti theme at the

0:00:48.440 --> 0:00:51.440
<v Speaker 2>same time, because we're in this moment in which everything

0:00:51.560 --> 0:00:55.240
<v Speaker 2>is just like AI markets, markets, AI, et cetera. But

0:00:55.440 --> 0:00:57.480
<v Speaker 2>you know, there's all kinds of new things to trade,

0:00:57.760 --> 0:01:00.680
<v Speaker 2>and people are fascinated by the trade itself, pro fascinated

0:01:00.680 --> 0:01:03.880
<v Speaker 2>by the way the technology development is affecting the trade.

0:01:04.000 --> 0:01:06.480
<v Speaker 2>So we really wanted to do a kind of future

0:01:06.480 --> 0:01:09.920
<v Speaker 2>of trading show, which is a very broad thing. But

0:01:10.000 --> 0:01:11.360
<v Speaker 2>it did sort of come out that way.

0:01:11.480 --> 0:01:15.200
<v Speaker 4>Yeah, it really did. And you finally fulfilled your longtime

0:01:15.280 --> 0:01:18.679
<v Speaker 4>dream of doing two part episodes with our guests. So

0:01:19.040 --> 0:01:21.720
<v Speaker 4>our first speaker of the evening was actually someone who's

0:01:21.720 --> 0:01:22.559
<v Speaker 4>been on the show before.

0:01:22.720 --> 0:01:25.559
<v Speaker 2>That's right, So we had him on the show last year,

0:01:25.840 --> 0:01:29.120
<v Speaker 2>and we had him on our live show. Listen to

0:01:29.160 --> 0:01:31.600
<v Speaker 2>our episode with Ian Dunning, he is the head of

0:01:31.720 --> 0:01:38.160
<v Speaker 2>AI at Hudson River Trading. Talked about all things implementing AI, GPUs,

0:01:38.400 --> 0:01:41.600
<v Speaker 2>all that stuff within the context of a trading shop.

0:01:41.640 --> 0:01:43.399
<v Speaker 3>Take a listen, Joe, this is.

0:01:43.360 --> 0:01:45.880
<v Speaker 4>Your dream, right, you finally got to do a two parts.

0:01:45.959 --> 0:01:48.080
<v Speaker 2>This is the thing I always think about, which is

0:01:48.080 --> 0:01:50.480
<v Speaker 2>that after every episode we do, I'm like, Oh, there's

0:01:50.480 --> 0:01:52.560
<v Speaker 2>a question I wish I had asked. So we had

0:01:52.640 --> 0:01:54.400
<v Speaker 2>Yan on sometime last.

0:01:54.200 --> 0:01:56.240
<v Speaker 4>Year, so the last round was easy, this one will

0:01:56.240 --> 0:01:56.480
<v Speaker 4>be right.

0:01:57.160 --> 0:01:58.800
<v Speaker 2>Worried about that, well, I was going to start first

0:01:58.960 --> 0:02:01.760
<v Speaker 2>before we you know, talk about what you do, et cetera.

0:02:01.880 --> 0:02:04.040
<v Speaker 2>So here's the question I wish I had asked last time. So,

0:02:04.120 --> 0:02:07.320
<v Speaker 2>Hudson River Trading Shop, you're involved in the AI stuff,

0:02:07.880 --> 0:02:11.440
<v Speaker 2>could you theoretically do what high Flyer did and launch

0:02:11.480 --> 0:02:14.800
<v Speaker 2>an LLM with this tech stack that you have and

0:02:14.880 --> 0:02:16.240
<v Speaker 2>launch a deep seat competitor?

0:02:16.639 --> 0:02:19.720
<v Speaker 3>I think so. I think we're good at training models.

0:02:19.720 --> 0:02:22.080
<v Speaker 3>We have a lot of compute and people are good

0:02:22.080 --> 0:02:24.360
<v Speaker 3>at doing the cycle of research which is required to

0:02:24.400 --> 0:02:28.079
<v Speaker 3>catch up to the sort of frontier. However, I guess

0:02:28.280 --> 0:02:31.680
<v Speaker 3>reaching the frontier is clearly a very daunting task. So

0:02:31.720 --> 0:02:35.120
<v Speaker 3>maybe it's with some effort deep seek, but beyond that,

0:02:35.160 --> 0:02:37.079
<v Speaker 3>it's not a claim I'd be willing to make. It's

0:02:37.120 --> 0:02:39.080
<v Speaker 3>a hugely capital intensive task, clearly.

0:02:39.120 --> 0:02:41.800
<v Speaker 2>And do you ever, like, do people ever chat about that?

0:02:41.840 --> 0:02:43.520
<v Speaker 3>It's we could we think about it?

0:02:43.639 --> 0:02:43.760
<v Speaker 4>You know?

0:02:44.280 --> 0:02:46.600
<v Speaker 3>I mean, I think perhaps we missed our moment to

0:02:46.639 --> 0:02:48.800
<v Speaker 3>do so. It is so many open models now coming

0:02:48.800 --> 0:02:51.240
<v Speaker 3>out from the US as well as like China, that

0:02:51.639 --> 0:02:53.880
<v Speaker 3>there's like a huge array of them. It's kind of

0:02:53.880 --> 0:02:56.040
<v Speaker 3>an interesting shift from that deep Seak moment where it

0:02:56.080 --> 0:02:57.720
<v Speaker 3>felt like it was the first bolt from the blue.

0:02:58.040 --> 0:03:00.600
<v Speaker 3>He's a competitive open model. So now I see so

0:03:00.639 --> 0:03:03.680
<v Speaker 3>many groups releasing them. I don't know what the future

0:03:03.680 --> 0:03:06.160
<v Speaker 3>of open models is. That they're all kind of a

0:03:06.240 --> 0:03:09.000
<v Speaker 3>serious step back in. The frontier is progressing so fast.

0:03:09.200 --> 0:03:11.440
<v Speaker 3>I don't know how you keep up with that, but

0:03:11.600 --> 0:03:13.639
<v Speaker 3>many people believe that it's possible. I'm not so sure.

0:03:13.800 --> 0:03:14.880
<v Speaker 3>I'm one of those people though.

0:03:15.040 --> 0:03:18.240
<v Speaker 4>Okay, so, speaking of things moving so fast, my first

0:03:18.320 --> 0:03:20.919
<v Speaker 4>question is slightly different. I looked up your Twitter feed.

0:03:21.040 --> 0:03:24.000
<v Speaker 4>Oh no, before you came on the show. Your last

0:03:24.000 --> 0:03:29.400
<v Speaker 4>tweet before today was and I quote feel this every day, worry.

0:03:29.440 --> 0:03:32.360
<v Speaker 4>It's some sort of AI induce delirium. But then again,

0:03:32.520 --> 0:03:35.600
<v Speaker 4>various empirical measures are exponential looking, so feels best to

0:03:35.640 --> 0:03:38.800
<v Speaker 4>assume we're hurtling towards some sort of endgame. So first

0:03:38.840 --> 0:03:42.760
<v Speaker 4>of all, please convince us all live on stage that

0:03:42.800 --> 0:03:45.600
<v Speaker 4>you are in fact not suffering from AI induce delarium.

0:03:45.680 --> 0:03:48.560
<v Speaker 4>But secondly, like, what is the endgame that you speak

0:03:48.600 --> 0:03:48.960
<v Speaker 4>of here?

0:03:49.480 --> 0:03:53.480
<v Speaker 3>God, now I saw like a San Francisco pressent I do. I'sist.

0:03:53.600 --> 0:03:57.600
<v Speaker 3>I've been doing AI stuff since around twenty sixteen, and

0:03:57.760 --> 0:04:00.480
<v Speaker 3>that stuted at Deep Mind, and it was a bit

0:04:00.480 --> 0:04:02.280
<v Speaker 3>of a culture shock for me, because there are true

0:04:02.320 --> 0:04:04.840
<v Speaker 3>believers then, and I was most certainly not a true believer,

0:04:05.480 --> 0:04:07.160
<v Speaker 3>and I resisted it, and it was kind of a

0:04:07.240 --> 0:04:11.760
<v Speaker 3>natural skeptic for a long time. But certain empirical measures

0:04:11.760 --> 0:04:14.040
<v Speaker 3>of the pace of progress in the outside world. And

0:04:14.040 --> 0:04:18.600
<v Speaker 3>I also look at our own business, which looks somewhat

0:04:18.640 --> 0:04:23.200
<v Speaker 3>exponentially the amount of compute I'll have next year versus

0:04:23.240 --> 0:04:26.880
<v Speaker 3>this year, and this year versus last year. Looks kind

0:04:26.920 --> 0:04:31.640
<v Speaker 3>of exponentially, and we're doing things today that I didn't

0:04:31.960 --> 0:04:34.600
<v Speaker 3>really I should have dreamt of. I wish I had

0:04:34.600 --> 0:04:36.560
<v Speaker 3>that kind of visionary say, I'm a visionary and I

0:04:36.600 --> 0:04:38.880
<v Speaker 3>can see the future and I'm building towards it. But no,

0:04:39.080 --> 0:04:41.479
<v Speaker 3>I'm I think I'm a pragmatic engineering archetype, and so

0:04:41.520 --> 0:04:44.919
<v Speaker 3>it's been very incremental and I'm like, wow, that happened

0:04:44.960 --> 0:04:46.960
<v Speaker 3>in a year. So what does this mean? It's some

0:04:46.960 --> 0:04:51.400
<v Speaker 3>sort of technological convergence, everything going faster all the time. Well,

0:04:51.400 --> 0:04:54.080
<v Speaker 3>give us any I'm probably, but you give us an.

0:04:53.960 --> 0:04:56.240
<v Speaker 2>Example then, because you know, obviously those of us using

0:04:56.360 --> 0:05:00.560
<v Speaker 2>just the regular models, obviously the improvements and capability from

0:05:00.560 --> 0:05:03.080
<v Speaker 2>one year to another our mind blowing. But from the

0:05:03.160 --> 0:05:06.800
<v Speaker 2>perspective of like, okay, the application of AI within the

0:05:06.839 --> 0:05:09.520
<v Speaker 2>trading context, what is something that you can do in

0:05:09.560 --> 0:05:12.520
<v Speaker 2>twenty twenty six that and say twenty twenty four you

0:05:12.560 --> 0:05:14.280
<v Speaker 2>would not have been able to anticipate.

0:05:14.400 --> 0:05:16.839
<v Speaker 3>Oh, I think it's one way to think is just

0:05:16.839 --> 0:05:19.760
<v Speaker 3>like the amount of compute going into both training a

0:05:19.800 --> 0:05:22.080
<v Speaker 3>model and running a model, and that it's the same

0:05:22.120 --> 0:05:25.880
<v Speaker 3>technology working across every equity, every future, every crypto market,

0:05:25.920 --> 0:05:28.000
<v Speaker 3>every option market across the world with a kind of

0:05:28.080 --> 0:05:31.800
<v Speaker 3>unified approach, and this is something that we're doing. But

0:05:31.960 --> 0:05:34.800
<v Speaker 3>even more interestingly, I would not claim that we are

0:05:34.880 --> 0:05:37.159
<v Speaker 3>some unique people who are only the ones who have

0:05:37.240 --> 0:05:40.000
<v Speaker 3>really made progress in AI and trading. I think many

0:05:40.040 --> 0:05:43.680
<v Speaker 3>of our peers are also investing massively and we're all

0:05:43.680 --> 0:05:45.120
<v Speaker 3>doing it all at the same time. And what does

0:05:45.160 --> 0:05:48.040
<v Speaker 3>that mean, Like, surely you can't just like keep getting

0:05:48.080 --> 0:05:50.919
<v Speaker 3>better at predicting markets Furvace. It's got to be some

0:05:50.960 --> 0:05:54.880
<v Speaker 3>sort of forcing function where you know your your margins

0:05:54.880 --> 0:05:56.240
<v Speaker 3>go to zero as you keep investing.

0:05:56.800 --> 0:05:59.680
<v Speaker 2>What you're saying is you are just getting better and

0:05:59.720 --> 0:06:02.279
<v Speaker 2>better at being able to predict where a market is

0:06:02.279 --> 0:06:04.240
<v Speaker 2>going to go further and further out in the.

0:06:04.160 --> 0:06:06.920
<v Speaker 3>Time frame basically, And we're not the only ones. So

0:06:07.040 --> 0:06:10.040
<v Speaker 3>in the end, can there be some highlander type thing

0:06:10.120 --> 0:06:12.240
<v Speaker 3>like what are we doing? And it's it's this is

0:06:12.240 --> 0:06:13.720
<v Speaker 3>like my scale, And I guess the other thing I

0:06:13.760 --> 0:06:16.280
<v Speaker 3>find interesting is, of course the scale that everyone can

0:06:16.320 --> 0:06:19.520
<v Speaker 3>see with the big labs and what they're doing of computeion.

0:06:19.600 --> 0:06:22.760
<v Speaker 3>It's like it looks awfully exponential to me. We just

0:06:22.800 --> 0:06:26.560
<v Speaker 3>had another model released today from Anthropic and the type

0:06:26.560 --> 0:06:30.560
<v Speaker 3>of spacing between them seems to be compressing. I do

0:06:30.600 --> 0:06:33.520
<v Speaker 3>sound hilarious. I sound feverish, and that's why.

0:06:32.760 --> 0:06:34.960
<v Speaker 2>It's literally everyone in this room.

0:06:35.000 --> 0:06:37.359
<v Speaker 3>Probably I feel everyone must feel the fever to some extent.

0:06:37.440 --> 0:06:40.480
<v Speaker 4>Yeah, I never understood the Highlander there can only be

0:06:40.520 --> 0:06:44.480
<v Speaker 4>one thing because they're already two. They could just coexist,

0:06:44.560 --> 0:06:47.640
<v Speaker 4>that's right. Anyway, Sorry, I'm just picking apart your analogy.

0:06:47.800 --> 0:06:50.840
<v Speaker 4>You mentioned a new model release. When a new model

0:06:50.880 --> 0:06:53.680
<v Speaker 4>gets released, like, what is the first thing you do

0:06:54.000 --> 0:06:56.440
<v Speaker 4>at Hudson River Trading to evaluate it? And how do

0:06:56.480 --> 0:06:58.440
<v Speaker 4>you actually compare them to the existing one?

0:06:58.839 --> 0:07:01.920
<v Speaker 3>So I mean our prime reuse cases as a trading

0:07:01.960 --> 0:07:05.039
<v Speaker 3>and definitely kind of just like accelerating your own research.

0:07:05.240 --> 0:07:07.840
<v Speaker 3>So that can be coding, but it can also be

0:07:08.040 --> 0:07:12.200
<v Speaker 3>coming up with experiment ideas, monitoring experiments. We had a

0:07:12.360 --> 0:07:15.040
<v Speaker 3>sort of a false start with AI I would say

0:07:15.280 --> 0:07:18.400
<v Speaker 3>sometime last year with the Opus four point zero models,

0:07:18.480 --> 0:07:23.240
<v Speaker 3>especially from Anthropic, where a cursory examination made us feel like, well,

0:07:23.280 --> 0:07:25.600
<v Speaker 3>this is the moment we've crossed the dividing line, and

0:07:25.640 --> 0:07:28.320
<v Speaker 3>we had a very feverish week where we felt the

0:07:28.360 --> 0:07:32.560
<v Speaker 3>AGI and we left feeling empty because we realized that

0:07:32.840 --> 0:07:35.120
<v Speaker 3>it was not there and was not able to meaningfully

0:07:35.240 --> 0:07:38.720
<v Speaker 3>augment human researchers. And then we had that same feeling

0:07:38.760 --> 0:07:41.560
<v Speaker 3>again when Opus four point five came out and suddenly

0:07:41.600 --> 0:07:43.560
<v Speaker 3>it was like, oh, wait, no, this is this is

0:07:43.560 --> 0:07:44.920
<v Speaker 3>actually what we thought it was going to be six

0:07:44.960 --> 0:07:48.600
<v Speaker 3>months ago. So in the most recent model releases, the

0:07:48.680 --> 0:07:52.000
<v Speaker 3>differences have been more subtle, but we see, I think

0:07:52.040 --> 0:07:54.600
<v Speaker 3>we have a much better sense of an ever reducing

0:07:54.600 --> 0:07:56.720
<v Speaker 3>set of errors they make, and so we're kind of

0:07:56.760 --> 0:07:59.360
<v Speaker 3>looking for those those of mistakes, and we are We

0:07:59.400 --> 0:08:00.960
<v Speaker 3>spent some time in the past couple of weeks trying

0:08:00.960 --> 0:08:03.240
<v Speaker 3>to come up with objective measures to index them against

0:08:03.440 --> 0:08:08.240
<v Speaker 3>humans in the active quant research ideating signals and things

0:08:08.720 --> 0:08:10.440
<v Speaker 3>quant research used to be, as we talked about a

0:08:10.480 --> 0:08:13.800
<v Speaker 3>little bit like hand crafting indicators and things, why not

0:08:13.840 --> 0:08:16.560
<v Speaker 3>ask AI agents to do that and compare them against

0:08:16.600 --> 0:08:20.200
<v Speaker 3>humans like a little sort of battle. And it's they're like,

0:08:20.320 --> 0:08:24.160
<v Speaker 3>I don't know, intern level AI. Perhaps the thing is like,

0:08:24.440 --> 0:08:25.720
<v Speaker 3>what do I think it'll be in a year? And

0:08:25.800 --> 0:08:27.200
<v Speaker 3>I would not want to make a bold claim, but

0:08:27.200 --> 0:08:27.760
<v Speaker 3>it will still be.

0:08:28.240 --> 0:08:32.160
<v Speaker 2>It'll be wild. So when we think about investing in general,

0:08:32.640 --> 0:08:36.319
<v Speaker 2>even within sort of like classical quant trade, and going

0:08:36.360 --> 0:08:40.240
<v Speaker 2>back decades, there is often it might be quant but

0:08:40.240 --> 0:08:43.560
<v Speaker 2>there's some intuition behind it. Right, Cheap stocks tend to

0:08:43.600 --> 0:08:46.760
<v Speaker 2>do better, and we don't actually totally have agreement why

0:08:46.800 --> 0:08:49.680
<v Speaker 2>they did for a while, but people aren't necessarily surprised

0:08:49.720 --> 0:08:52.760
<v Speaker 2>by that fact, right are we at the point where

0:08:52.760 --> 0:08:55.640
<v Speaker 2>it's like why even bother coming up with the human

0:08:55.720 --> 0:08:58.920
<v Speaker 2>intuitive story and you just skip the part of giving

0:08:58.920 --> 0:09:02.120
<v Speaker 2>an explanation that's owns logical to a person and it's

0:09:02.160 --> 0:09:07.360
<v Speaker 2>just basically pure like rigorous back testing. And then it's like, look,

0:09:07.480 --> 0:09:10.000
<v Speaker 2>here is something that seems to work, and we've back

0:09:10.040 --> 0:09:12.360
<v Speaker 2>tested it a million different ways and it seems to work,

0:09:12.480 --> 0:09:14.480
<v Speaker 2>and we don't even bother coming up with a story

0:09:14.480 --> 0:09:15.720
<v Speaker 2>for why, but we're going to trade it.

0:09:15.920 --> 0:09:17.920
<v Speaker 3>I feel like we're in that world today. It's seat

0:09:17.920 --> 0:09:21.760
<v Speaker 3>of post post post capitalism. When I see IPOs discussed

0:09:21.760 --> 0:09:24.080
<v Speaker 3>for this coming summer, at the valuations they are, I'm like,

0:09:24.240 --> 0:09:27.319
<v Speaker 3>what is a fundamental? Like what is anything? It feels

0:09:27.400 --> 0:09:30.800
<v Speaker 3>like markets are just the cynical thick is everything is gambling,

0:09:30.840 --> 0:09:33.280
<v Speaker 3>and so everything is some sort of like gambling market,

0:09:33.360 --> 0:09:36.640
<v Speaker 3>including public markets. But the joke is flows, it's buying

0:09:36.679 --> 0:09:38.720
<v Speaker 3>and selling, and it's just it's worth what it's worth,

0:09:38.920 --> 0:09:42.080
<v Speaker 3>and it's detached and more biased, and sellers price go up,

0:09:42.160 --> 0:09:45.480
<v Speaker 3>and models are excellent at like pulling that out of data.

0:09:45.559 --> 0:09:48.160
<v Speaker 2>But just like let's say, you know the classic example

0:09:48.200 --> 0:09:50.520
<v Speaker 2>of like a back test, it is like, oh, companies

0:09:50.559 --> 0:09:53.080
<v Speaker 2>with the ticker symbol it starts with P, they do

0:09:53.200 --> 0:09:55.640
<v Speaker 2>well on Tuesdays. And it's like, well, look the data

0:09:55.640 --> 0:09:57.080
<v Speaker 2>says that, but this makes no sense. We're not going

0:09:57.160 --> 0:09:58.839
<v Speaker 2>to trade that. Could it get to the point where

0:09:58.840 --> 0:10:01.240
<v Speaker 2>it's like, look, symbols that starts with P, you do

0:10:01.320 --> 0:10:03.480
<v Speaker 2>well on Tuesdays, And we've run this a bunch of

0:10:03.520 --> 0:10:05.120
<v Speaker 2>times and it seems to work, so we're gonna put

0:10:05.160 --> 0:10:05.760
<v Speaker 2>money beyond us.

0:10:06.280 --> 0:10:07.480
<v Speaker 4>AI Like I.

0:10:09.120 --> 0:10:12.000
<v Speaker 3>Feel like, yes, although it sounds crazy, it sounds like

0:10:12.000 --> 0:10:13.920
<v Speaker 3>AI delirium when I say it, but I feel like

0:10:13.960 --> 0:10:16.600
<v Speaker 3>there's some sense that that could be true at some

0:10:16.679 --> 0:10:20.400
<v Speaker 3>point I can't predict. So at the very short time scale,

0:10:20.400 --> 0:10:22.920
<v Speaker 3>people accept this already, right Like I can't tell you

0:10:22.960 --> 0:10:24.680
<v Speaker 3>the price of like a stock in a minute, and

0:10:24.679 --> 0:10:27.160
<v Speaker 3>no one would really reasonably expect any human to do so,

0:10:27.520 --> 0:10:29.080
<v Speaker 3>even if they had the uder book and spend all

0:10:29.120 --> 0:10:30.640
<v Speaker 3>the time in the world is staring at it. But

0:10:30.720 --> 0:10:33.920
<v Speaker 3>we accept that neural networks can do this, And then

0:10:34.520 --> 0:10:37.160
<v Speaker 3>when does that logic break down? Why should it break

0:10:37.200 --> 0:10:39.760
<v Speaker 3>down at some long time scale. If it's ingesting all

0:10:39.840 --> 0:10:41.800
<v Speaker 3>the data and has everything and it can keep it

0:10:41.840 --> 0:10:44.760
<v Speaker 3>all in the context in way human can't, why should

0:10:44.760 --> 0:10:48.040
<v Speaker 3>I be able to understand it? Yeah, and that is

0:10:48.040 --> 0:10:50.839
<v Speaker 3>a strange thought, a loss of control. It feels like

0:10:50.880 --> 0:10:54.240
<v Speaker 3>a loss of control. But it's like, you know, people

0:10:54.240 --> 0:10:56.280
<v Speaker 3>save us from math. Maybe humans are actually very bad

0:10:56.320 --> 0:10:58.920
<v Speaker 3>at math. So it's not surprising AI as much better

0:10:58.960 --> 0:11:02.200
<v Speaker 3>than humans that these like mass proofs. Humans probably would

0:11:02.200 --> 0:11:05.440
<v Speaker 3>be pretty bad at markets where thousands of tradable instruments

0:11:05.480 --> 0:11:08.000
<v Speaker 3>on like very long time scales. We just kind of

0:11:08.000 --> 0:11:10.480
<v Speaker 3>accepted that we were some people were good at this.

0:11:11.040 --> 0:11:14.520
<v Speaker 3>Maybe that's a temporary state of affairs.

0:11:14.640 --> 0:11:16.720
<v Speaker 4>Well, we talked about this the last time. You were

0:11:16.760 --> 0:11:20.040
<v Speaker 4>on the idea that the models themselves are not very interpretable,

0:11:20.200 --> 0:11:22.320
<v Speaker 4>I guess you would say, but you're comfortable with that

0:11:22.800 --> 0:11:25.960
<v Speaker 4>on a short trading timeframe, which is what you do.

0:11:26.520 --> 0:11:29.400
<v Speaker 4>And then we started joking about magic models, and magic

0:11:29.480 --> 0:11:31.920
<v Speaker 4>is a dangerous word to use on this podcast because

0:11:31.920 --> 0:11:36.000
<v Speaker 4>people start thinking about magic boxes. But anyway, now that

0:11:36.080 --> 0:11:38.360
<v Speaker 4>you've been doing this for another six months since we

0:11:38.480 --> 0:11:40.720
<v Speaker 4>last spoke to you, do you feel like you have

0:11:40.840 --> 0:11:44.199
<v Speaker 4>better insight into what the models are actually doing and

0:11:44.280 --> 0:11:46.880
<v Speaker 4>why they're able to succeed on short timeframes.

0:11:47.200 --> 0:11:50.520
<v Speaker 3>I do think there are diagnostics we have done where

0:11:51.000 --> 0:11:54.719
<v Speaker 3>when we can see things that we can understand. It's

0:11:54.760 --> 0:11:57.040
<v Speaker 3>like looking at some very very complex thing and you

0:11:57.040 --> 0:11:58.720
<v Speaker 3>can look at one facet of it and be like,

0:11:58.760 --> 0:12:00.680
<v Speaker 3>this is the fast that I understand, and that gives

0:12:00.720 --> 0:12:04.280
<v Speaker 3>you some confidence, but it might be illusory because it's

0:12:04.280 --> 0:12:06.800
<v Speaker 3>a very very complex object, and you can if you're

0:12:06.840 --> 0:12:10.080
<v Speaker 3>only taking slices through it and to understand aspects of it.

0:12:10.559 --> 0:12:13.480
<v Speaker 3>You know, we had this emergent phenomenon we saw where

0:12:13.600 --> 0:12:15.920
<v Speaker 3>it felt like the model kind of understood meme stocks

0:12:15.960 --> 0:12:21.400
<v Speaker 3>from first principles like quantum stocks and crypto stocks being

0:12:21.480 --> 0:12:24.920
<v Speaker 3>kind of adjacent in stock space, and of course from

0:12:24.960 --> 0:12:28.160
<v Speaker 3>a fundamental perspective that says, no, there's no meaning to it.

0:12:28.240 --> 0:12:30.160
<v Speaker 3>But we looked at the model and us under a

0:12:30.160 --> 0:12:33.280
<v Speaker 3>certain lens and it clearly felt like they knew they

0:12:33.320 --> 0:12:35.840
<v Speaker 3>were connected. There's some other actual companies that I probably

0:12:35.840 --> 0:12:38.080
<v Speaker 3>won't name. It feels like it's bad form, but you

0:12:38.120 --> 0:12:40.720
<v Speaker 3>know Wall Street BET's favorites, I guess, and they were

0:12:40.800 --> 0:12:43.400
<v Speaker 3>near the cluster too, And this was like just one

0:12:43.440 --> 0:12:45.880
<v Speaker 3>little window. But there were other slices we tried to

0:12:45.920 --> 0:12:47.920
<v Speaker 3>take which just didn't make sense to us. But again

0:12:48.000 --> 0:12:49.880
<v Speaker 3>it's like, who am I to say?

0:12:51.760 --> 0:12:55.280
<v Speaker 2>The model says they're in that vicinity of hyperdimensional space.

0:12:55.520 --> 0:12:58.040
<v Speaker 3>Yeah. The one thing for us though, is that when

0:12:58.080 --> 0:13:02.080
<v Speaker 3>we do have this magical model, it is in a

0:13:02.120 --> 0:13:04.679
<v Speaker 3>lot of safety around it. Because we're doing this higher

0:13:04.679 --> 0:13:07.280
<v Speaker 3>frequency trading. We're trading positions back and forth. There's a

0:13:07.320 --> 0:13:09.560
<v Speaker 3>lot of risk checks that are fully automated in things.

0:13:09.920 --> 0:13:12.480
<v Speaker 3>I don't know how you generalize this logic to long

0:13:12.600 --> 0:13:16.360
<v Speaker 3>term discretionary trading, where the idea of like risk checking

0:13:16.600 --> 0:13:19.559
<v Speaker 3>and that kind of layer of defense around it, it's

0:13:19.600 --> 0:13:21.200
<v Speaker 3>not so obvious to me how you apply that. We

0:13:21.200 --> 0:13:24.360
<v Speaker 3>can apply very strict controls around this model because it's

0:13:24.360 --> 0:13:28.000
<v Speaker 3>a well posed problem. We're not taking giant idiosyncratic risks

0:13:28.040 --> 0:13:30.440
<v Speaker 3>in like one name for months at a time. We

0:13:30.480 --> 0:13:32.160
<v Speaker 3>can sleep at night because of this. I don't know

0:13:32.200 --> 0:13:34.880
<v Speaker 3>how you apply the same thinking to like a fundamental long,

0:13:34.920 --> 0:13:36.240
<v Speaker 3>short thing where you have to put a trade on

0:13:36.280 --> 0:13:39.120
<v Speaker 3>and it's for three months, and you're intentionally taking a

0:13:39.280 --> 0:13:43.680
<v Speaker 3>very large risk in a very sudden direction. That's what's

0:13:43.720 --> 0:13:45.560
<v Speaker 3>the risk management story around the AI if you just

0:13:45.600 --> 0:13:59.760
<v Speaker 3>give up all control to just the magic prediction.

0:14:04.160 --> 0:14:06.720
<v Speaker 2>So you said something on the last time we interviewed you,

0:14:06.840 --> 0:14:08.559
<v Speaker 2>which is very important. First of all, I feel like

0:14:08.640 --> 0:14:11.400
<v Speaker 2>in the quote AI trade unquote, people are obsessed with like,

0:14:11.480 --> 0:14:14.640
<v Speaker 2>what's the bottleneck now, right? And because whatever the bottleneck is,

0:14:14.640 --> 0:14:16.840
<v Speaker 2>you probably solved for a lot more money. You said

0:14:17.320 --> 0:14:20.240
<v Speaker 2>the last time we talked to you, the chips themselves

0:14:20.640 --> 0:14:23.800
<v Speaker 2>are We're not actually a major constraint for you, and

0:14:23.800 --> 0:14:26.960
<v Speaker 2>that it was more like citing the chips and the

0:14:27.080 --> 0:14:29.840
<v Speaker 2>powering the chips, the access to electricity, talk about that.

0:14:29.880 --> 0:14:32.480
<v Speaker 2>What is the state right now? Let's say like I

0:14:32.560 --> 0:14:36.320
<v Speaker 2>assemble approach a bunch of people from Hudson River Trading. Yeah,

0:14:36.400 --> 0:14:39.320
<v Speaker 2>I get a bunch of GPUs. Is it then not

0:14:39.560 --> 0:14:41.440
<v Speaker 2>trivial to find a place to plug those in.

0:14:41.720 --> 0:14:45.560
<v Speaker 3>It's definitely hot to find sites and at shortly times.

0:14:45.560 --> 0:14:47.400
<v Speaker 3>If I went to the market and said I want,

0:14:47.520 --> 0:14:51.840
<v Speaker 3>you know, six thousand Blackwell GPUs in a box somewhere

0:14:51.840 --> 0:14:55.880
<v Speaker 3>in North America for delivery in Q four. I'm not

0:14:55.920 --> 0:14:59.440
<v Speaker 3>sure such an offering exists at any reasonable price. Like

0:14:59.480 --> 0:15:01.520
<v Speaker 3>if it from maybe someone will give up a lease

0:15:01.560 --> 0:15:02.960
<v Speaker 3>and I could snag it. But I think if I

0:15:03.000 --> 0:15:04.360
<v Speaker 3>went to the market and tried to get a.

0:15:04.400 --> 0:15:07.080
<v Speaker 2>Quia sorry just to be clear, the chips are available,

0:15:07.120 --> 0:15:08.280
<v Speaker 2>but not the competity.

0:15:08.280 --> 0:15:10.200
<v Speaker 3>I think if I had power, I could get the

0:15:10.280 --> 0:15:12.320
<v Speaker 3>chips Blackwell chips for delivery this yet, but I do

0:15:12.360 --> 0:15:14.800
<v Speaker 3>not think I could get the whole solution. And then

0:15:15.040 --> 0:15:17.240
<v Speaker 3>if you go into twenty twenty seven for the next

0:15:17.240 --> 0:15:20.160
<v Speaker 3>generation of GPUs, the Ruben GPUs, they at least for

0:15:20.200 --> 0:15:23.040
<v Speaker 3>the first like stretch, are going to be very much

0:15:23.160 --> 0:15:25.600
<v Speaker 3>assault out. And so I think that's like a maybe

0:15:25.600 --> 0:15:27.640
<v Speaker 3>you actually have on a twenty twenty seven delivery. You

0:15:27.680 --> 0:15:30.240
<v Speaker 3>have more like finding a data center shell by then.

0:15:30.800 --> 0:15:32.560
<v Speaker 3>But you need you need to be in cute now

0:15:32.680 --> 0:15:35.560
<v Speaker 3>for those GPUs if you want them early. So those

0:15:35.560 --> 0:15:37.840
<v Speaker 3>things are those things are in demand. I'll say that

0:15:37.880 --> 0:15:41.280
<v Speaker 3>for sure. And one of my greatest failures has been

0:15:41.760 --> 0:15:43.720
<v Speaker 3>you know, part of my skepticism has been predicting how

0:15:43.760 --> 0:15:46.000
<v Speaker 3>many GPUs we would need and a long enough horizon

0:15:46.440 --> 0:15:49.960
<v Speaker 3>and it's punishing because you're constantly playing catchup. And one

0:15:49.960 --> 0:15:52.880
<v Speaker 3>of our competitors put out a podcast this weekend and

0:15:53.240 --> 0:15:54.880
<v Speaker 3>they mentioned somebody along the lines of the fact they

0:15:55.120 --> 0:15:57.200
<v Speaker 3>had one data center and it was the data center,

0:15:57.240 --> 0:16:00.000
<v Speaker 3>and that was their data center, and then as they're hungering,

0:16:00.040 --> 0:16:01.520
<v Speaker 3>hungry for more compute, they had to go out and

0:16:01.520 --> 0:16:03.880
<v Speaker 3>find it wherever they could. And I would say, we

0:16:03.920 --> 0:16:06.280
<v Speaker 3>are in exactly the same boat. You just can't be picky.

0:16:06.360 --> 0:16:08.560
<v Speaker 3>It's like you've got like a mega watt there, I'll

0:16:08.600 --> 0:16:11.840
<v Speaker 3>take it, and it could be you know, not in

0:16:11.960 --> 0:16:13.520
<v Speaker 3>terms that are super favorable to you.

0:16:13.400 --> 0:16:15.640
<v Speaker 4>Because we'll say more about that. How are you actually

0:16:15.640 --> 0:16:17.920
<v Speaker 4>going out and sourcing this stuff, because as you say,

0:16:17.960 --> 0:16:21.040
<v Speaker 4>it seems to be exceptionally competitive. And at the same time,

0:16:21.080 --> 0:16:23.520
<v Speaker 4>don't you guys have an insane data center in like

0:16:23.600 --> 0:16:24.920
<v Speaker 4>Norway or something.

0:16:24.680 --> 0:16:27.400
<v Speaker 3>And it's not enough. Yeah, and it's not enough. Yes.

0:16:27.520 --> 0:16:32.520
<v Speaker 3>We go to the neo clouds, the hyperscalers, everyone, and

0:16:32.920 --> 0:16:35.680
<v Speaker 3>it's a constant dialogue and they're all in competition with

0:16:35.760 --> 0:16:38.520
<v Speaker 3>each other. But in some sense there must be some

0:16:38.640 --> 0:16:41.680
<v Speaker 3>much bigger, shadowy competition going on behind the scenes behind

0:16:41.680 --> 0:16:44.360
<v Speaker 3>these neoclouds because they are all looking for space and

0:16:44.440 --> 0:16:46.840
<v Speaker 3>power and I don't know if that's the true scarce resource,

0:16:47.200 --> 0:16:49.680
<v Speaker 3>and they're a kind of intermediory layer over it. I

0:16:49.680 --> 0:16:51.840
<v Speaker 3>don't know what their process is like for saucing it.

0:16:52.400 --> 0:16:55.040
<v Speaker 3>But yeah, they have come to us and said, this

0:16:55.160 --> 0:16:56.760
<v Speaker 3>lease opened up. Can you please get back to us

0:16:56.760 --> 0:16:59.240
<v Speaker 3>by the end of the day. And for commitments on

0:16:59.280 --> 0:17:02.240
<v Speaker 3>a long term contract, and our contracts are a long term.

0:17:02.480 --> 0:17:05.080
<v Speaker 3>This is not spot compute. This is like eight thousand

0:17:05.160 --> 0:17:08.760
<v Speaker 3>GPUs for three years, four years, five years payment? Do

0:17:08.760 --> 0:17:10.440
<v Speaker 3>you wanna pay half upfront? Do you want to pay

0:17:10.800 --> 0:17:12.960
<v Speaker 3>some per year? A lot of different commercial terms, credit

0:17:13.040 --> 0:17:15.359
<v Speaker 3>risk on both sides. It's complicated stuff.

0:17:15.560 --> 0:17:18.639
<v Speaker 2>Tell us more about the counterparty risk. So it's like

0:17:18.720 --> 0:17:21.760
<v Speaker 2>you come and you say you want capacity and some

0:17:21.920 --> 0:17:26.200
<v Speaker 2>data center. I'm from Hudson River Trading. Who yeah, well,

0:17:26.600 --> 0:17:28.159
<v Speaker 2>this is the kind of thing like this crowd. A

0:17:28.160 --> 0:17:30.439
<v Speaker 2>lot of people know what Hudgson River Trading is, but

0:17:30.520 --> 0:17:33.640
<v Speaker 2>maybe in San Francisco or whatever, that's not a household name,

0:17:33.720 --> 0:17:36.120
<v Speaker 2>et cetera. They want to know for sure that you're

0:17:36.119 --> 0:17:38.960
<v Speaker 2>gonna be good, You're gonna like pay your bills, et cetera.

0:17:39.200 --> 0:17:42.320
<v Speaker 2>How do you establish to the data center that you

0:17:42.359 --> 0:17:44.200
<v Speaker 2>were going to be a reliable I guess tenant.

0:17:44.280 --> 0:17:46.719
<v Speaker 3>Yeah, it's a it's a definitely being a dance. It's

0:17:46.720 --> 0:17:49.080
<v Speaker 3>getting better at this point. I think we've anted it

0:17:49.280 --> 0:17:51.480
<v Speaker 3>enough deals enough people that I think we have that,

0:17:51.640 --> 0:17:54.040
<v Speaker 3>But we've had everything from people being like, oh, you've

0:17:54.040 --> 0:17:57.280
<v Speaker 3>issued bonds, what's the rating on those too? Not wanting

0:17:57.359 --> 0:18:00.119
<v Speaker 3>us to sell too much of one site because if

0:18:00.119 --> 0:18:04.080
<v Speaker 3>we take all their power rights and then go bust,

0:18:04.320 --> 0:18:06.040
<v Speaker 3>they might have a long lead time where they can't

0:18:06.080 --> 0:18:08.640
<v Speaker 3>get into a tenant and fill that. And so there's

0:18:08.640 --> 0:18:10.800
<v Speaker 3>a kind of two potty problems of us where it's

0:18:10.840 --> 0:18:14.240
<v Speaker 3>like they want customers, but there's presumably a lot of customers,

0:18:14.280 --> 0:18:16.600
<v Speaker 3>but maybe not as many customers are willing to do

0:18:16.640 --> 0:18:19.920
<v Speaker 3>the big size and pay more upfront. But you know,

0:18:20.160 --> 0:18:22.960
<v Speaker 3>we're looking at their CDs is on some of these

0:18:23.119 --> 0:18:25.919
<v Speaker 3>ones and thinking about how that affects our You know,

0:18:25.960 --> 0:18:28.520
<v Speaker 3>maybe we should pay you three fifty an hour and

0:18:28.600 --> 0:18:31.040
<v Speaker 3>take out a CDs for ten cents for our equivalent

0:18:31.040 --> 0:18:35.680
<v Speaker 3>of insurance on your heavy leveraged neocloud. You're having a disruption,

0:18:36.160 --> 0:18:39.480
<v Speaker 3>no names, but you know it's I think is reason

0:18:39.600 --> 0:18:42.200
<v Speaker 3>to be cagy on both sides because this has all

0:18:42.320 --> 0:18:45.800
<v Speaker 3>come from nothing like a year ago. We weren't there

0:18:45.920 --> 0:18:48.959
<v Speaker 3>asking for it, and they didn't exist to sell it,

0:18:49.080 --> 0:18:51.919
<v Speaker 3>and so the only rock is Nvidio I guess, an

0:18:51.960 --> 0:18:56.119
<v Speaker 3>extremely well capitalized entity who is not going anywhere, and

0:18:56.160 --> 0:18:59.000
<v Speaker 3>it's making love GPUs and we have a very positive

0:18:59.000 --> 0:19:02.439
<v Speaker 3>relationship with them. I think that is also a material factor.

0:19:02.600 --> 0:19:05.480
<v Speaker 4>How much optionality do you actually have on GPUs now,

0:19:05.600 --> 0:19:08.960
<v Speaker 4>Like if you say you want to prioritize latency or throughput,

0:19:09.280 --> 0:19:11.919
<v Speaker 4>like can you get the chips that you need to

0:19:11.960 --> 0:19:14.119
<v Speaker 4>specify on one of those things? Or like do you

0:19:14.200 --> 0:19:15.440
<v Speaker 4>just take what you can get.

0:19:16.000 --> 0:19:19.960
<v Speaker 3>Or build your own? Yeah you can? You can. Well,

0:19:20.000 --> 0:19:22.560
<v Speaker 3>many people I guess now are working on building their

0:19:22.560 --> 0:19:26.280
<v Speaker 3>own chips for for inference, which is as strictly similar

0:19:26.359 --> 0:19:30.439
<v Speaker 3>technological problem, and ourselves and many of our peer trading

0:19:30.480 --> 0:19:35.199
<v Speaker 3>firms have hardware teams to tackle this and you can

0:19:35.240 --> 0:19:37.280
<v Speaker 3>outsource pots of it process, so it's not as daunting

0:19:37.320 --> 0:19:40.160
<v Speaker 3>as it seems. But it's definitely an active, very investigation

0:19:40.240 --> 0:19:42.200
<v Speaker 3>for us and now clearly everyone because I feel like

0:19:42.280 --> 0:19:45.359
<v Speaker 3>everyone's talking about their partnership with Broadcom or something like this.

0:19:45.400 --> 0:19:47.600
<v Speaker 3>And if someone says partnering your broadcom, it's like that

0:19:47.600 --> 0:19:48.760
<v Speaker 3>they're making an inference chip.

0:19:48.800 --> 0:19:51.480
<v Speaker 2>So it's interesting, right because you hear about like Amazon,

0:19:51.640 --> 0:19:57.280
<v Speaker 2>like them Google tp So we could be in a

0:19:57.280 --> 0:20:00.280
<v Speaker 2>world in which we hear of like a Hudson River

0:20:00.400 --> 0:20:02.960
<v Speaker 2>Trading branded chipise.

0:20:02.720 --> 0:20:04.840
<v Speaker 3>I don't think we'll sell it, but yeah, but yes,

0:20:04.880 --> 0:20:06.680
<v Speaker 3>you're right, that is that is definitely the right model.

0:20:06.800 --> 0:20:10.720
<v Speaker 3>And on the other hand, Jensen Never sleeps and Jensen

0:20:10.720 --> 0:20:14.639
<v Speaker 3>purchased Grock and you know they have got their new

0:20:15.000 --> 0:20:18.720
<v Speaker 3>product lineup from the groc acquisition, which is a very

0:20:18.800 --> 0:20:21.439
<v Speaker 3>compelling product as well. And there are others setups etched

0:20:21.480 --> 0:20:24.520
<v Speaker 3>comes to mind. So the inference space is a smaller

0:20:24.600 --> 0:20:27.640
<v Speaker 3>design space. It's not clear that inhouse solutions will be

0:20:27.680 --> 0:20:31.400
<v Speaker 3>a necessary thing in the future if physic enough people competing.

0:20:31.440 --> 0:20:34.959
<v Speaker 3>But on the training side, I mean, what a mote like,

0:20:35.080 --> 0:20:38.040
<v Speaker 3>there's just a video, it's just and I suppose Google.

0:20:38.200 --> 0:20:41.520
<v Speaker 3>But you know, if you're using TPUs, you're also kind

0:20:41.520 --> 0:20:43.960
<v Speaker 3>of entering a very close relationship with Google, some feeling

0:20:44.040 --> 0:20:46.840
<v Speaker 3>of vandor lockin. It's a complicated thing if you go

0:20:46.880 --> 0:20:48.600
<v Speaker 3>down that past. But if you're compute hungry, like the

0:20:48.640 --> 0:20:52.520
<v Speaker 3>neo labs. I mean, obviously the big labs. You'll take

0:20:52.520 --> 0:20:54.960
<v Speaker 3>what you can get. I think Anthropic takes TPUs, trainiums

0:20:54.960 --> 0:20:57.120
<v Speaker 3>and GPUs. You know they need them all.

0:20:58.040 --> 0:21:00.720
<v Speaker 2>So maybe we'll create a little bit of con controversy

0:21:00.840 --> 0:21:03.199
<v Speaker 2>here because later on in a little bit we're going

0:21:03.280 --> 0:21:06.320
<v Speaker 2>to be speaking with Carmen Lee, the CEU of Compute Exchange,

0:21:06.560 --> 0:21:10.400
<v Speaker 2>which is this, you know, one of the multiple entities

0:21:10.440 --> 0:21:15.160
<v Speaker 2>are trying to build financial markets for compute capacity right

0:21:15.280 --> 0:21:17.960
<v Speaker 2>trade it like oil, so it's like compute futures and

0:21:18.040 --> 0:21:21.560
<v Speaker 2>stuff that right now, could you see a use for

0:21:21.640 --> 0:21:26.280
<v Speaker 2>that of financial instrument that's like on some liquid tradable

0:21:26.320 --> 0:21:31.480
<v Speaker 2>exchange for H one hundred or whatever, some benchmark of

0:21:31.480 --> 0:21:33.720
<v Speaker 2>how much it costs run these chips. Could you see

0:21:33.720 --> 0:21:36.000
<v Speaker 2>that being a useful instrument for you at some point?

0:21:36.080 --> 0:21:38.640
<v Speaker 3>It's it's plausible. It highly relates back to my previously

0:21:38.720 --> 0:21:41.480
<v Speaker 3>stated failure to plan correctly for the future. If in

0:21:41.520 --> 0:21:44.199
<v Speaker 3>some sense I could lock in a price for some

0:21:44.320 --> 0:21:48.439
<v Speaker 3>future date for delivery or something of compute, something that

0:21:48.520 --> 0:21:51.399
<v Speaker 3>is connected to for prices compute in the long term future,

0:21:51.560 --> 0:21:53.640
<v Speaker 3>I think that could be value to that We could

0:21:53.680 --> 0:21:56.200
<v Speaker 3>basically hedge our risk that we wait too long to

0:21:56.200 --> 0:21:58.080
<v Speaker 3>put the order in and it price goes up. I mean,

0:21:58.200 --> 0:22:00.280
<v Speaker 3>in twenty twenty six, the priceive memory has gone up

0:22:01.040 --> 0:22:04.440
<v Speaker 3>so much that we do have concrete specific things. I

0:22:04.480 --> 0:22:06.639
<v Speaker 3>wish I put that order in a month earlier, so

0:22:06.680 --> 0:22:11.000
<v Speaker 3>it's a real real thing. Do I believe that there

0:22:11.040 --> 0:22:14.840
<v Speaker 3>would be a good market with left liquidity for long

0:22:14.920 --> 0:22:18.480
<v Speaker 3>dated compute futures? That I guess remains to be seen.

0:22:18.760 --> 0:22:20.040
<v Speaker 3>I don't know what I would do with a short

0:22:20.080 --> 0:22:23.760
<v Speaker 3>dated compute future. I do think defining what compute is

0:22:23.760 --> 0:22:26.080
<v Speaker 3>is pretty hard, and I have no idea what physical

0:22:26.119 --> 0:22:29.560
<v Speaker 3>delivery would be if that is indeed of interest because

0:22:29.760 --> 0:22:31.800
<v Speaker 3>you know, because of a long term contract and because

0:22:31.840 --> 0:22:35.720
<v Speaker 3>of how much work goes into every site, Like when

0:22:35.760 --> 0:22:38.159
<v Speaker 3>we connect to a neocloud site, we're thinking about how

0:22:38.160 --> 0:22:40.960
<v Speaker 3>to connect it back to our other sites. Everyone's got

0:22:40.960 --> 0:22:44.480
<v Speaker 3>a different networking system, the file system, Like you know,

0:22:44.560 --> 0:22:46.320
<v Speaker 3>visit of a GPUs, which was all the focus, but

0:22:46.359 --> 0:22:49.040
<v Speaker 3>this's also like you know, how is data stored of

0:22:49.080 --> 0:22:50.479
<v Speaker 3>a site or is it stored of itt site at all?

0:22:50.520 --> 0:22:52.720
<v Speaker 3>Is there an adjacent site that all the hard drives

0:22:52.760 --> 0:22:55.440
<v Speaker 3>are in and they're all idiosyncratic, and I can't do

0:22:55.480 --> 0:22:57.720
<v Speaker 3>anything of one hundred and twenty eight GPUs. I need

0:22:57.960 --> 0:23:00.800
<v Speaker 3>thousands of GPUs or bust. That's like my lot size,

0:23:01.200 --> 0:23:02.760
<v Speaker 3>And so it's very hot to see how you could

0:23:02.840 --> 0:23:06.040
<v Speaker 3>kind of break that down into useful units. But maybe

0:23:06.040 --> 0:23:08.760
<v Speaker 3>it's just a spot thing and if it's long dated,

0:23:08.800 --> 0:23:09.560
<v Speaker 3>I don't know, they could be.

0:23:09.720 --> 0:23:10.320
<v Speaker 2>We'll learn more.

0:23:11.520 --> 0:23:14.720
<v Speaker 4>Yeah, I did get a preview and it is pretty cool,

0:23:14.880 --> 0:23:17.119
<v Speaker 4>like the actual program where you can select like the

0:23:17.160 --> 0:23:19.879
<v Speaker 4>type of compute you need from a specific data center

0:23:19.920 --> 0:23:22.840
<v Speaker 4>that has like literally I think dozens, if not maybe

0:23:22.920 --> 0:23:25.439
<v Speaker 4>hundreds of parameters at the moment. So maybe we can

0:23:25.480 --> 0:23:29.120
<v Speaker 4>get a demonstration from Carmen. What is your token spend

0:23:30.000 --> 0:23:31.720
<v Speaker 4>at the moment? Is it bigger than Joe's?

0:23:31.760 --> 0:23:32.000
<v Speaker 1>I have?

0:23:33.480 --> 0:23:35.760
<v Speaker 3>I think I what is my average? I think it's

0:23:35.800 --> 0:23:37.879
<v Speaker 3>on the order of one hundred two hundred dollars a

0:23:37.960 --> 0:23:41.440
<v Speaker 3>day per employee's on my team. I feel like that's

0:23:41.520 --> 0:23:44.399
<v Speaker 3>kind of what I've been seeing lately, and some people

0:23:44.440 --> 0:23:48.040
<v Speaker 3>are more in a thousand a day range, burst bit

0:23:48.080 --> 0:23:48.760
<v Speaker 3>bursty for that.

0:23:49.040 --> 0:23:53.240
<v Speaker 4>Wait, do you like those people because they're supposedly more predictive?

0:23:54.320 --> 0:23:56.639
<v Speaker 3>Definitely not trying to encourage that. I mean, some people

0:23:56.720 --> 0:24:01.120
<v Speaker 3>go through sturges of experimentation slash Ai delarium, which is understandable.

0:24:01.480 --> 0:24:03.560
<v Speaker 3>And I think we are always trying to understand the

0:24:03.560 --> 0:24:05.919
<v Speaker 3>people who are using more like are they doing it

0:24:05.960 --> 0:24:09.080
<v Speaker 3>for something that you even't figured out yet? That's a

0:24:09.200 --> 0:24:12.399
<v Speaker 3>pretty profound new expense to have. It's it's it's not

0:24:12.600 --> 0:24:15.240
<v Speaker 3>at the level that concerns us. Well, it didn't exist

0:24:15.240 --> 0:24:16.760
<v Speaker 3>at all as an expense type. So that's kind of

0:24:16.760 --> 0:24:17.600
<v Speaker 3>interesting to think about.

0:24:17.600 --> 0:24:20.639
<v Speaker 2>Well, I'm curious, like you know, for the consumer models,

0:24:20.680 --> 0:24:23.879
<v Speaker 2>they talk about how psychophantic they are. Does that happen?

0:24:24.000 --> 0:24:26.880
<v Speaker 2>It's like, yes, you're close. This is really smart. You're

0:24:26.880 --> 0:24:29.960
<v Speaker 2>close to cracking the code of the market. Keep pursuing this.

0:24:30.200 --> 0:24:33.879
<v Speaker 2>They just one more this idea is doing or is

0:24:33.920 --> 0:24:36.200
<v Speaker 2>it Claude likes to say, this is doing some real

0:24:36.240 --> 0:24:37.880
<v Speaker 2>work here in this argument.

0:24:37.960 --> 0:24:38.520
<v Speaker 3>It's really good.

0:24:38.760 --> 0:24:40.639
<v Speaker 2>Do you get that in the engineering content?

0:24:40.760 --> 0:24:43.320
<v Speaker 3>I think we do. And it's interesting. We have we

0:24:43.359 --> 0:24:46.399
<v Speaker 3>decided a new internship for the summer, and uh, in

0:24:46.440 --> 0:24:49.399
<v Speaker 3>previous internships, we noticed that, you know, it's quite daunting

0:24:49.440 --> 0:24:51.320
<v Speaker 3>coming into this quent training context. You know, you have

0:24:51.400 --> 0:24:53.040
<v Speaker 3>no there's not much to like read a book, can't

0:24:53.040 --> 0:24:56.640
<v Speaker 3>read a tetbook about it. It's useful, So people ask AI,

0:24:57.400 --> 0:25:00.359
<v Speaker 3>and uh, you know, it would always mention some things

0:25:00.400 --> 0:25:03.439
<v Speaker 3>of like an unusual frequency that maybe an expert in

0:25:03.440 --> 0:25:06.159
<v Speaker 3>netfield wouldn't like focus us on some things. And we

0:25:06.280 --> 0:25:09.600
<v Speaker 3>noticed in our like winter internship program a lot of

0:25:09.720 --> 0:25:13.560
<v Speaker 3>sort of very technical quant finance research terms being mentioned

0:25:13.560 --> 0:25:15.680
<v Speaker 3>a lot by the interns that no full time are used.

0:25:16.000 --> 0:25:18.320
<v Speaker 3>And it's like the original seed of the mind virus

0:25:18.480 --> 0:25:21.080
<v Speaker 3>is AI. So there's a little stuff like that. But

0:25:22.080 --> 0:25:24.119
<v Speaker 3>our token spending is going to go up, but I

0:25:24.119 --> 0:25:28.520
<v Speaker 3>mean that's almost guaranteed, and yeah, we're getting value out

0:25:28.520 --> 0:25:32.440
<v Speaker 3>of it. Maybe not two x productivity, but I talked

0:25:32.480 --> 0:25:34.560
<v Speaker 3>to someone who said that team's fifty percent more productive.

0:25:35.359 --> 0:25:37.080
<v Speaker 3>That's pretty good. I mean, you'd definitely pay one hundred

0:25:37.119 --> 0:25:39.000
<v Speaker 3>dollars a day for that. I just don't understand how

0:25:39.000 --> 0:25:42.880
<v Speaker 3>people who are token poor could keep up with someone

0:25:42.920 --> 0:25:45.760
<v Speaker 3>who's token rich. And that's a gain goes to the

0:25:45.760 --> 0:25:48.960
<v Speaker 3>acceleration feeling. It's like if you have two people who

0:25:48.960 --> 0:25:52.320
<v Speaker 3>are sort of equally resourceful and smart, but someone who's

0:25:52.359 --> 0:25:54.800
<v Speaker 3>basically a co pilot with them, that's giving them a

0:25:54.840 --> 0:25:56.359
<v Speaker 3>fifty percent boost and all they have to do to

0:25:56.400 --> 0:25:59.640
<v Speaker 3>get that is essentially spend money. It creates a have

0:25:59.640 --> 0:26:03.639
<v Speaker 3>have no dynamic that possibly compounds. As you have more success,

0:26:03.680 --> 0:26:05.480
<v Speaker 3>you make more money, you're more willing to eat now

0:26:05.480 --> 0:26:07.160
<v Speaker 3>a thousand dollars a day per pres and for token

0:26:07.200 --> 0:26:10.000
<v Speaker 3>spend you go even faster. And this feeling again of

0:26:10.359 --> 0:26:14.199
<v Speaker 3>compounding acceleration, which might be delirium, but you could make

0:26:14.200 --> 0:26:16.159
<v Speaker 3>an argument for why it could be a real effect

0:26:16.200 --> 0:26:19.639
<v Speaker 3>instead of more winner. Take all contexts where speed of

0:26:19.680 --> 0:26:23.080
<v Speaker 3>improvement is like the key thing. There's a story there,

0:26:23.119 --> 0:26:25.480
<v Speaker 3>I think, or delirium matter well.

0:26:25.280 --> 0:26:27.280
<v Speaker 4>I mean, speaking of the haves and have nots. The

0:26:27.320 --> 0:26:31.480
<v Speaker 4>other big story in AI world is just competition for talent, right,

0:26:31.480 --> 0:26:35.400
<v Speaker 4>and everyone is sort of chasing the same genius engineer.

0:26:35.520 --> 0:26:37.760
<v Speaker 4>I guess how are you finding that At.

0:26:37.680 --> 0:26:40.320
<v Speaker 3>The moment, it's changed a bit. There's a lot of

0:26:40.400 --> 0:26:43.439
<v Speaker 3>dynamics going on. There's still a feeling that if you

0:26:43.680 --> 0:26:46.280
<v Speaker 3>are plucky enough, you can get a VC to fund

0:26:46.359 --> 0:26:49.880
<v Speaker 3>your idea based on very little. You have the right pedigree,

0:26:49.920 --> 0:26:51.840
<v Speaker 3>and that's always been true. I guess some sense this

0:26:51.960 --> 0:26:55.119
<v Speaker 3>is like the YC philosophy. Some sense you go and

0:26:55.280 --> 0:26:57.600
<v Speaker 3>it's just that some of the numbers and the foamo

0:26:57.680 --> 0:27:02.280
<v Speaker 3>feeling and is quite shocking. And so that's actually a

0:27:02.280 --> 0:27:04.080
<v Speaker 3>form of competition, just like why didn't I go create

0:27:04.119 --> 0:27:06.639
<v Speaker 3>a stetup. I don't have any ideas or anything. I'm

0:27:06.680 --> 0:27:09.240
<v Speaker 3>just going to make a startup for the big labs.

0:27:09.400 --> 0:27:13.400
<v Speaker 3>A question of upside remaining upside. You're at a trillian now,

0:27:13.520 --> 0:27:15.000
<v Speaker 3>I guess there are two big ones that are a

0:27:15.040 --> 0:27:18.000
<v Speaker 3>trillion dollar evaluation. Where do you go from there? I

0:27:18.000 --> 0:27:20.840
<v Speaker 3>think that's affecting people's level of like forward looking optimism

0:27:20.880 --> 0:27:23.280
<v Speaker 3>for people who are taking offers now and for people

0:27:23.280 --> 0:27:25.720
<v Speaker 3>who are at those places and looking to leave. Generally,

0:27:25.720 --> 0:27:28.120
<v Speaker 3>it's a question of like, well, they've become big tech,

0:27:28.760 --> 0:27:31.719
<v Speaker 3>They've added people at a vast rate, and the culture

0:27:31.720 --> 0:27:34.080
<v Speaker 3>has shifted, especially at some of the labs a lot,

0:27:34.240 --> 0:27:37.160
<v Speaker 3>and to our favor. For a while, it did feel

0:27:37.200 --> 0:27:39.320
<v Speaker 3>like we're in a very, very fierce competition. And now

0:27:39.359 --> 0:27:41.960
<v Speaker 3>maybe it's now it's maybe more even playing field. But

0:27:42.720 --> 0:27:44.879
<v Speaker 3>I don't know. I talk to a lot of undergrads

0:27:45.560 --> 0:27:50.360
<v Speaker 3>and they don't feel great about the future. They feel

0:27:50.440 --> 0:27:50.879
<v Speaker 3>very worried.

0:27:50.960 --> 0:27:54.360
<v Speaker 2>Basically, what I was gonna just went by way too fast.

0:27:54.440 --> 0:27:58.080
<v Speaker 2>But like you mentioned already, the models are like okay,

0:27:58.240 --> 0:28:02.280
<v Speaker 2>junior level, Yeah, what does talent look like at this point?

0:28:02.359 --> 0:28:04.359
<v Speaker 2>And what are like I've seen some of the anthropic

0:28:04.400 --> 0:28:08.119
<v Speaker 2>interview questions and it's like designing some GPU kernel or

0:28:08.160 --> 0:28:11.720
<v Speaker 2>like optimizing the configuration of GPS within the data center?

0:28:12.119 --> 0:28:14.119
<v Speaker 2>Did what do you want someone to bring to the

0:28:14.160 --> 0:28:15.080
<v Speaker 2>table at this point?

0:28:15.240 --> 0:28:17.240
<v Speaker 3>I mean, I think the first thing is just trying

0:28:17.280 --> 0:28:19.720
<v Speaker 3>to embrace an open book philosophy, like let the interviews

0:28:19.720 --> 0:28:22.520
<v Speaker 3>be done with the aid of AI is something we're

0:28:22.520 --> 0:28:24.320
<v Speaker 3>trying to aspire to do because it's just at some

0:28:24.359 --> 0:28:27.520
<v Speaker 3>point you become it becomes unrealistic to pretend anyone would

0:28:27.600 --> 0:28:30.040
<v Speaker 3>work without that. One of the big things in quant

0:28:30.119 --> 0:28:32.159
<v Speaker 3>is being like this is like archetype of like the

0:28:32.880 --> 0:28:36.200
<v Speaker 3>math theorist or the string theorist or something and they

0:28:36.240 --> 0:28:38.880
<v Speaker 3>go in to Long Island somewhere and they come out

0:28:38.880 --> 0:28:42.320
<v Speaker 3>with alpha. But you know, like our experience has been

0:28:42.360 --> 0:28:43.720
<v Speaker 3>a little bit more mixed because it's like, if you

0:28:43.760 --> 0:28:47.640
<v Speaker 3>can't implement your ideas, how do you how does that happen? Exactly? Well,

0:28:47.640 --> 0:28:50.000
<v Speaker 3>now Claude can presumably implement the ideas, So trying to

0:28:50.000 --> 0:28:53.160
<v Speaker 3>embrace that maybe we do accept more theorists, more dreamers,

0:28:53.200 --> 0:28:56.000
<v Speaker 3>people who can come up with ideas, trusting that the

0:28:56.040 --> 0:28:58.920
<v Speaker 3>implementation work can be done by AI. So I think

0:28:58.960 --> 0:29:01.320
<v Speaker 3>that's our shift. But I've been joking. It's like the

0:29:01.600 --> 0:29:04.600
<v Speaker 3>word cell versus shape rotator type, Like I feel like

0:29:04.640 --> 0:29:06.560
<v Speaker 3>the error of the word cell may be a bonus,

0:29:06.640 --> 0:29:10.040
<v Speaker 3>like if I did prompt. Yeah, I mean prompt engineering

0:29:10.080 --> 0:29:11.880
<v Speaker 3>is kind of a boomer term at this point, but

0:29:11.920 --> 0:29:14.680
<v Speaker 3>there is something to be said for like describing what

0:29:14.760 --> 0:29:19.800
<v Speaker 3>you want clearly and without confounding factors, and that is

0:29:19.880 --> 0:29:22.680
<v Speaker 3>a skill that can be learned, and that's not evenly

0:29:22.680 --> 0:29:25.080
<v Speaker 3>distributed in the population. And I would argue that as

0:29:25.440 --> 0:29:28.640
<v Speaker 3>shot up in value simply because of AI. So I

0:29:28.800 --> 0:29:30.600
<v Speaker 3>like to think of myself as one of these people though,

0:29:30.600 --> 0:29:32.000
<v Speaker 3>So that could be the Delarium'm talking.

0:29:32.040 --> 0:29:34.280
<v Speaker 2>I don't know, all right, Ian Dunning, we could talk

0:29:34.320 --> 0:29:37.680
<v Speaker 2>for two more hours. Thank you so much for joining

0:29:37.720 --> 0:29:56.640
<v Speaker 2>us at.

0:29:52.560 --> 0:29:55.959
<v Speaker 4>That was our conversation with Ian Dunning of Hudson River Trading,

0:29:56.040 --> 0:29:59.719
<v Speaker 4>recorded live at our New York show. I'm Tracy Alloway.

0:29:59.720 --> 0:30:01.720
<v Speaker 4>You can follow me at Tracy Alloway.

0:30:01.560 --> 0:30:04.480
<v Speaker 2>And I'm Joe Wisenthal. You can follow me at the Stalwart.

0:30:04.680 --> 0:30:08.560
<v Speaker 2>Follow Ian at Ian Dunning. Follow our producers Carmen Rodriguez

0:30:08.600 --> 0:30:12.720
<v Speaker 2>at Carmen Arman, Dashil Bennett at Dashbod, Calebrooks at Kalebrooks

0:30:12.920 --> 0:30:15.959
<v Speaker 2>and Kevin Lozano at Kevin Lloyd Lozano. And for more

0:30:15.960 --> 0:30:18.920
<v Speaker 2>odd Laws content, go to Bloomberg dot com slash odd Lots.

0:30:18.960 --> 0:30:21.320
<v Speaker 2>We have a daily newsletter and all of our episodes,

0:30:21.520 --> 0:30:23.600
<v Speaker 2>and you can chat about all these topics twenty four

0:30:23.600 --> 0:30:27.400
<v Speaker 2>to seven in our discord Discord dot gig slash odd Lots.

0:30:27.520 --> 0:30:29.400
<v Speaker 4>And if you enjoy odd Lots, if you like it

0:30:29.440 --> 0:30:31.800
<v Speaker 4>when we do these live shows and talk about how

0:30:31.840 --> 0:30:34.800
<v Speaker 4>trading firms are actually using AI, then please leave us

0:30:34.840 --> 0:30:38.120
<v Speaker 4>a positive review on your favorite podcast platform. And remember,

0:30:38.160 --> 0:30:40.560
<v Speaker 4>if you are a Bloomberg subscriber, you can listen to

0:30:40.720 --> 0:30:43.560
<v Speaker 4>all of our episodes absolutely ad free. All you need

0:30:43.640 --> 0:30:46.120
<v Speaker 4>to do is find the Bloomberg channel on Apple Podcasts

0:30:46.160 --> 0:31:10.360
<v Speaker 4>and follow the instructions there. Thanks for listening. In behead