WEBVTT - How CoreWeave Sees the Market for Compute Right Now

0:00:02.720 --> 0:00:15.840
<v Speaker 1>Bloomberg Audio Studios, Podcasts, radio News.

0:00:18.520 --> 0:00:21.560
<v Speaker 2>Hello and welcome to another episode of the Odd Lots podcast.

0:00:21.640 --> 0:00:24.040
<v Speaker 3>I'm Joe Wisenthal and I'm Tracy Alloway.

0:00:24.280 --> 0:00:29.680
<v Speaker 2>Tracy, I'm envisioning this future where like we have to

0:00:29.720 --> 0:00:33.680
<v Speaker 2>do a state of the sort of AI inference market episode,

0:00:33.720 --> 0:00:36.600
<v Speaker 2>like once a month, you know, where it's like things

0:00:36.600 --> 0:00:39.480
<v Speaker 2>are moving so rapidly and there's so much change either

0:00:39.479 --> 0:00:42.120
<v Speaker 2>in terms of what models are using or what they're

0:00:42.120 --> 0:00:44.720
<v Speaker 2>being used for, et cetera, that in the same way

0:00:44.760 --> 0:00:47.440
<v Speaker 2>we would do, like you know, the occasional regular stock

0:00:47.479 --> 0:00:50.839
<v Speaker 2>market episode or whatever, we would just do, Okay, what

0:00:50.920 --> 0:00:53.880
<v Speaker 2>are we seeing right now in a inference trends because

0:00:53.920 --> 0:00:56.120
<v Speaker 2>it just feels like the moment we do an episode,

0:00:56.120 --> 0:00:58.000
<v Speaker 2>a few weeks later it may be out of date.

0:00:58.240 --> 0:01:00.600
<v Speaker 3>We should just buy the bullet and do a weekly episode,

0:01:00.680 --> 0:01:04.800
<v Speaker 3>transform lots more into a market update on compute.

0:01:04.840 --> 0:01:07.840
<v Speaker 2>We could do inference in I don't know, we'll have

0:01:07.880 --> 0:01:12.160
<v Speaker 2>to workshop inference. No, no, we'd have to. But anyway,

0:01:12.360 --> 0:01:15.640
<v Speaker 2>this is lots of lots of inference. This is like

0:01:15.959 --> 0:01:19.360
<v Speaker 2>the story of the moment, and we know that, you know,

0:01:19.400 --> 0:01:22.640
<v Speaker 2>a couple of years ago, everyone was sort of dabbling

0:01:22.680 --> 0:01:28.000
<v Speaker 2>around with various things and experimenting and using AI, like

0:01:27.680 --> 0:01:30.240
<v Speaker 2>oh like write a poem for me about this, etcetera.

0:01:30.400 --> 0:01:32.440
<v Speaker 2>That phase of AI is long over, and we know

0:01:32.560 --> 0:01:36.800
<v Speaker 2>that companies specifically are spending a ton on compute, so

0:01:37.040 --> 0:01:40.839
<v Speaker 2>much so that CFOs around the world are getting sticker

0:01:40.880 --> 0:01:43.520
<v Speaker 2>shock about their compute budgets. And there was even a

0:01:43.560 --> 0:01:47.240
<v Speaker 2>headline of like Uber saying like, okay, like fifteen hundred

0:01:47.240 --> 0:01:50.760
<v Speaker 2>dollars of max per employee, like don't spend more than

0:01:50.800 --> 0:01:53.000
<v Speaker 2>that in a month on token, So like this is

0:01:53.040 --> 0:01:54.360
<v Speaker 2>a very fast moving area.

0:01:54.600 --> 0:01:56.880
<v Speaker 3>Yeah, you're starting to get headlines about, I guess a

0:01:57.000 --> 0:02:01.400
<v Speaker 3>corporate reckoning AI as more people experiment and spend money

0:02:01.440 --> 0:02:04.840
<v Speaker 3>on it. The Uber headline that you mentioned apparently Uber

0:02:04.880 --> 0:02:08.160
<v Speaker 3>burned through its entire twenty twenty six AI budget in

0:02:08.240 --> 0:02:12.160
<v Speaker 3>four months basically, and like what's more important is the

0:02:12.160 --> 0:02:15.639
<v Speaker 3>COEO was actually asking whether or not that was worth it,

0:02:15.680 --> 0:02:18.800
<v Speaker 3>like whether they saw productivity gains or whatever as a

0:02:18.800 --> 0:02:22.720
<v Speaker 3>result of that. The other very amusing headline that I saw,

0:02:22.800 --> 0:02:25.679
<v Speaker 3>and it was citing an unnamed source. It's from Axios,

0:02:25.720 --> 0:02:28.480
<v Speaker 3>so you know, oh yeah, not entirely sure it's true,

0:02:28.680 --> 0:02:31.480
<v Speaker 3>but reportedly great headline. It was a great headline. An

0:02:31.520 --> 0:02:34.760
<v Speaker 3>AI consultant told Axios that one of their clients recently

0:02:34.760 --> 0:02:38.120
<v Speaker 3>spent half a billion dollars in a single month after

0:02:38.160 --> 0:02:39.959
<v Speaker 3>failing to put usage limits on this.

0:02:40.080 --> 0:02:43.080
<v Speaker 2>Yeah, it's because there's everyone that's like, oh, I just

0:02:43.120 --> 0:02:45.320
<v Speaker 2>have a simple question. I want to look up our

0:02:45.320 --> 0:02:48.320
<v Speaker 2>guests title. I'm going to use the most advanced model

0:02:48.480 --> 0:02:50.720
<v Speaker 2>to do that, et cetera. I have a theory and

0:02:50.760 --> 0:02:53.200
<v Speaker 2>we'll get into this with our guests that one of

0:02:53.240 --> 0:02:55.400
<v Speaker 2>the things that will and we've talked about this with

0:02:55.480 --> 0:02:58.639
<v Speaker 2>a Goldenman's Marco Ardenti, but one of the things I

0:02:58.680 --> 0:03:02.280
<v Speaker 2>predict is that companies are like, clearly you know, they're

0:03:02.280 --> 0:03:04.080
<v Speaker 2>going to keep using it more and more would be

0:03:04.080 --> 0:03:06.520
<v Speaker 2>my guess. But there are probably a lot of investment

0:03:06.600 --> 0:03:09.840
<v Speaker 2>made in sort of like optimal model routing. Because some

0:03:09.919 --> 0:03:12.680
<v Speaker 2>models are like one hundred per query of what a

0:03:12.680 --> 0:03:15.320
<v Speaker 2>frontier model is, probably a lot of people don't know

0:03:15.360 --> 0:03:18.519
<v Speaker 2>like what is the sort of like efficient frontier model usage,

0:03:18.720 --> 0:03:21.640
<v Speaker 2>and so actually routing the query to the sort of

0:03:21.720 --> 0:03:25.679
<v Speaker 2>most efficient model. I have a feeling we're going to

0:03:25.720 --> 0:03:27.280
<v Speaker 2>see a lot of investment in that area.

0:03:27.120 --> 0:03:29.560
<v Speaker 3>Specifically, well, there's also just the question of whether or

0:03:29.560 --> 0:03:32.560
<v Speaker 3>not the models get cheaper overall as they advance, right,

0:03:32.600 --> 0:03:34.800
<v Speaker 3>and we have seen some I think Nvidia has a

0:03:34.840 --> 0:03:37.880
<v Speaker 3>new system or chip out or something that is supposed

0:03:37.920 --> 0:03:41.160
<v Speaker 3>to reduce token usage. We can get into that as well.

0:03:41.360 --> 0:03:44.480
<v Speaker 2>And you know, we did that live episode recently with

0:03:44.600 --> 0:03:47.520
<v Speaker 2>In Dunning of Hudson River Trading and he said a

0:03:47.520 --> 0:03:49.760
<v Speaker 2>lot of interesting things in that, But one of the

0:03:49.800 --> 0:03:54.160
<v Speaker 2>things he said is that the scarcity is increasingly like

0:03:54.440 --> 0:03:58.560
<v Speaker 2>just the real estate component. Finding a suitable place to

0:03:58.640 --> 0:04:02.280
<v Speaker 2>plug in your GPU, at least from his perspective, right now,

0:04:02.800 --> 0:04:05.320
<v Speaker 2>is as much, if not more so, of a challenge

0:04:05.360 --> 0:04:07.840
<v Speaker 2>than securing GPUs themselves, so like.

0:04:08.040 --> 0:04:10.400
<v Speaker 3>Which is different to what it was like three years ago.

0:04:10.480 --> 0:04:12.320
<v Speaker 2>Yeah, yeah, so just like where you plug it in.

0:04:12.400 --> 0:04:14.680
<v Speaker 2>We know there's all the like the anti data center

0:04:14.720 --> 0:04:16.680
<v Speaker 2>politics out there, so it's like, yeah, we got to

0:04:16.680 --> 0:04:17.279
<v Speaker 2>take the pulse of.

0:04:17.279 --> 0:04:20.240
<v Speaker 3>This market, all right, consider this our inference update.

0:04:20.440 --> 0:04:22.480
<v Speaker 2>Yeah, well, I'm really excited to say we really do

0:04:22.560 --> 0:04:25.880
<v Speaker 2>have the perfect guests. Someone we spoke to like truly

0:04:25.920 --> 0:04:28.280
<v Speaker 2>feels like eons a good I think the first thing

0:04:28.360 --> 0:04:30.680
<v Speaker 2>we ever connected with this company, They've always had a

0:04:30.680 --> 0:04:32.400
<v Speaker 2>lot of chips. But I think the first time we

0:04:32.440 --> 0:04:34.120
<v Speaker 2>ever linked up with this company was still in the

0:04:34.160 --> 0:04:36.680
<v Speaker 2>era where people were excited about in video chips being

0:04:36.760 --> 0:04:39.560
<v Speaker 2>used for like cryptomining and stuff like that. But we

0:04:39.560 --> 0:04:41.599
<v Speaker 2>are now in this very different era and this is

0:04:41.640 --> 0:04:44.200
<v Speaker 2>truly like one of the companies of the moment, and

0:04:44.279 --> 0:04:46.360
<v Speaker 2>that is, of course core weave, one of the so

0:04:46.440 --> 0:04:51.000
<v Speaker 2>called neo clouds, offering both training and inference services for

0:04:51.040 --> 0:04:54.120
<v Speaker 2>all sorts of different AI workloads. I'm very excited to

0:04:54.160 --> 0:04:57.560
<v Speaker 2>say back on the show, we have Brandon McBee, Coreweav's

0:04:57.560 --> 0:05:00.960
<v Speaker 2>co founder and chief development officer, So thank you so

0:05:01.040 --> 0:05:02.440
<v Speaker 2>much for coming on ALTS.

0:05:02.880 --> 0:05:06.920
<v Speaker 4>Appreciate being invited back, guys, and that was a fantastic intro.

0:05:07.000 --> 0:05:08.559
<v Speaker 4>We look forward to hitting these topics today.

0:05:08.720 --> 0:05:11.440
<v Speaker 2>All right, here's my question. So we know that like

0:05:11.640 --> 0:05:14.400
<v Speaker 2>at the tail end of last year and then in

0:05:14.440 --> 0:05:17.799
<v Speaker 2>the first quarter of this year, it's everyone started using

0:05:17.880 --> 0:05:22.320
<v Speaker 2>clog code and just there's clearly a key inflection moment

0:05:22.720 --> 0:05:25.920
<v Speaker 2>for sort of like overall AI demand. And then we

0:05:25.960 --> 0:05:29.719
<v Speaker 2>get into Q two and suddenly the CFO is, oh

0:05:29.800 --> 0:05:32.800
<v Speaker 2>my gosh, we're spending this much on inference. We got

0:05:32.800 --> 0:05:36.600
<v Speaker 2>to like figure things out just straight up like in

0:05:36.720 --> 0:05:41.359
<v Speaker 2>the last month whatever. Do you see any signs of

0:05:41.400 --> 0:05:44.839
<v Speaker 2>that happening yet of these companies which are all like

0:05:45.440 --> 0:05:51.200
<v Speaker 2>still AI eager AI adopters trying to get a little

0:05:51.200 --> 0:05:55.159
<v Speaker 2>bit of a handle and maybe slowing the rate of

0:05:55.200 --> 0:05:57.320
<v Speaker 2>the rate of growth. Is that happening yet?

0:05:57.680 --> 0:06:01.360
<v Speaker 4>Yeah, I think you see head lines there that there

0:06:01.400 --> 0:06:06.640
<v Speaker 4>are surprises of spend et cetera. I'd say our interpretation

0:06:07.000 --> 0:06:12.280
<v Speaker 4>of it is entirely look at the authentic and foundational

0:06:12.360 --> 0:06:14.680
<v Speaker 4>demand that is out there right, Like, all we're really

0:06:14.680 --> 0:06:19.119
<v Speaker 4>doing is talking about how much consumption there is of

0:06:19.400 --> 0:06:22.080
<v Speaker 4>AI and use for it. And I think that that

0:06:22.200 --> 0:06:24.880
<v Speaker 4>was a real question in the market twelve eighteen, twenty

0:06:24.920 --> 0:06:27.840
<v Speaker 4>four months ago, is will there be demand FEI? Where

0:06:27.920 --> 0:06:30.640
<v Speaker 4>is this inference demand that everyone's been talking about. And

0:06:31.240 --> 0:06:34.880
<v Speaker 4>I think you're absolutely correct January or so with this

0:06:35.080 --> 0:06:38.039
<v Speaker 4>kind of like next group of models that were coming out,

0:06:38.720 --> 0:06:41.880
<v Speaker 4>everyone all of a sudden and all at once said

0:06:42.279 --> 0:06:44.279
<v Speaker 4>this is what we've needed, like this is the real

0:06:44.360 --> 0:06:47.640
<v Speaker 4>product breakthrough. But I think we're keeping in mind that

0:06:47.760 --> 0:06:51.680
<v Speaker 4>product breakthrough was like for a limited set of people

0:06:51.960 --> 0:06:54.159
<v Speaker 4>at the end of the day, right, we're talking like

0:06:54.520 --> 0:06:59.280
<v Speaker 4>coding professionals, finance professionals, but it's a relatively small group

0:06:59.320 --> 0:07:03.679
<v Speaker 4>of people that are using infrastructure at this normal scale.

0:07:03.720 --> 0:07:07.080
<v Speaker 4>And so where we see this moving towards next is

0:07:07.240 --> 0:07:11.480
<v Speaker 4>broader enterprise use, like likely not seeing this whole to

0:07:11.600 --> 0:07:16.840
<v Speaker 4>connecting approach, and I think that that is unsustainable. But

0:07:17.560 --> 0:07:20.560
<v Speaker 4>do we see adoption in other sectors and how this

0:07:20.640 --> 0:07:24.200
<v Speaker 4>can continue to spread out? Absolutely? I mean, you know,

0:07:24.520 --> 0:07:27.920
<v Speaker 4>on our end, I think we have ten over one

0:07:27.960 --> 0:07:33.840
<v Speaker 4>billion dollar clients at this point, and our financial services

0:07:33.880 --> 0:07:37.840
<v Speaker 4>client backlog is into tens of billions of dollars at

0:07:37.840 --> 0:07:41.040
<v Speaker 4>this point. And so we're now talking about things outside

0:07:41.040 --> 0:07:45.080
<v Speaker 4>of AI labs, outside of hyperscalers. And look, as you

0:07:45.120 --> 0:07:51.080
<v Speaker 4>guys know, we support nine of the top ten AI

0:07:51.200 --> 0:07:54.080
<v Speaker 4>labs on the planet, and if you exclude China and

0:07:54.120 --> 0:07:57.280
<v Speaker 4>everything that's going on over there, Like, we have a

0:07:57.360 --> 0:08:01.000
<v Speaker 4>lot of visibility into what people are doing, and we're

0:08:01.040 --> 0:08:05.720
<v Speaker 4>not seeing any pullback on what they're doing on inference today.

0:08:05.720 --> 0:08:10.440
<v Speaker 4>If anything, it just remains this unrelenting demand for access

0:08:10.480 --> 0:08:15.360
<v Speaker 4>to the best technology solution in the market for running

0:08:15.520 --> 0:08:19.280
<v Speaker 4>artificial intelligence, and that's core week solution in the market.

0:08:19.640 --> 0:08:22.800
<v Speaker 3>Wait, say more about the customer mix now versus say

0:08:23.320 --> 0:08:26.600
<v Speaker 3>three years ago. So you have hyper scalers, you've got startups,

0:08:26.600 --> 0:08:30.640
<v Speaker 3>you've got various businesses. How has that, I guess composition

0:08:30.760 --> 0:08:31.760
<v Speaker 3>shifted over time.

0:08:32.120 --> 0:08:38.800
<v Speaker 4>Yeah, it's shifted enormously towards a more diverse customer base. Right.

0:08:38.840 --> 0:08:41.160
<v Speaker 4>We got a lot of flat for this. In our IPO. Right,

0:08:41.320 --> 0:08:45.760
<v Speaker 4>people were noting that we only had a handful of

0:08:45.840 --> 0:08:50.040
<v Speaker 4>large clients, that our clients were like just the hyperscalers

0:08:50.080 --> 0:08:53.560
<v Speaker 4>and AI lab or two. And I think that we

0:08:53.600 --> 0:08:57.840
<v Speaker 4>have made tremendous progress in driving diversifications. So I'd say

0:08:57.920 --> 0:09:01.880
<v Speaker 4>it's broadly cross three buckets today. Right. We had hyperscaleup

0:09:01.880 --> 0:09:05.440
<v Speaker 4>clients who continue to grow with us. We have AI

0:09:05.600 --> 0:09:09.000
<v Speaker 4>lab clients. As I said, nine of the top ten

0:09:09.120 --> 0:09:13.199
<v Speaker 4>AI labs on the planet choose core REEF. And then

0:09:13.200 --> 0:09:17.200
<v Speaker 4>we have this enterprise base. And the enterprise base just

0:09:17.240 --> 0:09:20.440
<v Speaker 4>doesn't grab as many headlines as you would expect because

0:09:20.440 --> 0:09:24.480
<v Speaker 4>it's not these massive, multi billion dollar contracts that are

0:09:24.720 --> 0:09:28.800
<v Speaker 4>being signed. But I think in Q four alone, we

0:09:28.920 --> 0:09:33.600
<v Speaker 4>added twice as many logos to our client base as

0:09:33.640 --> 0:09:37.280
<v Speaker 4>we had ever done versus any previous court. Right, And

0:09:37.320 --> 0:09:40.320
<v Speaker 4>that enterprise base is one that's growing so much. And

0:09:40.880 --> 0:09:43.280
<v Speaker 4>there was a point you guys hit on in the

0:09:43.320 --> 0:09:45.880
<v Speaker 4>intro that I think is really worth acknowledging, and it

0:09:46.000 --> 0:09:50.520
<v Speaker 4>was this concept of model routing and the idea that like,

0:09:50.880 --> 0:09:53.960
<v Speaker 4>not everyone needs just the latest model, that it's different

0:09:54.040 --> 0:09:57.000
<v Speaker 4>types of models I can hit different use cases. And

0:09:57.600 --> 0:10:00.000
<v Speaker 4>this is something we've been talking about for a while

0:10:00.480 --> 0:10:03.280
<v Speaker 4>right as it relates to the infrastructure side of things

0:10:03.440 --> 0:10:07.160
<v Speaker 4>as well, right, because you don't need that latest model

0:10:07.200 --> 0:10:10.440
<v Speaker 4>for everything, and accordingly you don't need the latest piece

0:10:10.480 --> 0:10:14.760
<v Speaker 4>of infrastructure to support every single inference or training query

0:10:14.800 --> 0:10:18.680
<v Speaker 4>that's out there. You can kind of conceptualize this matrix

0:10:18.760 --> 0:10:21.600
<v Speaker 4>of different sizes of workloads well to the different sizes

0:10:21.640 --> 0:10:24.320
<v Speaker 4>of GPUs, and all of a sudden that tells you,

0:10:24.679 --> 0:10:28.160
<v Speaker 4>my god, like h one hundreds could last six, seven,

0:10:28.280 --> 0:10:30.960
<v Speaker 4>eight years, a one hundreds are going to last longer.

0:10:31.000 --> 0:10:35.439
<v Speaker 4>And it totally changes the entire conversation around depreci full

0:10:35.520 --> 0:10:39.920
<v Speaker 4>life of infrastructure, as that was a really popular topic

0:10:40.280 --> 0:10:43.400
<v Speaker 4>during twenty twenty five. People were saying like, oh, this

0:10:43.440 --> 0:10:46.680
<v Speaker 4>stuff will last two years, it's worth zero afterwards, in

0:10:46.800 --> 0:10:50.720
<v Speaker 4>like we've never seen any semblance of that. Because of

0:10:50.760 --> 0:10:55.560
<v Speaker 4>the point you guys are accurately making, which is users

0:10:55.600 --> 0:10:59.839
<v Speaker 4>are going to need to find the way to use

0:11:00.120 --> 0:11:03.640
<v Speaker 4>the appropriate model for their prompts, and that'll be solved

0:11:03.640 --> 0:11:06.680
<v Speaker 4>by model round to your point, but that just further

0:11:06.800 --> 0:11:11.560
<v Speaker 4>enables this concept that infrastructure is going to be used longer,

0:11:11.640 --> 0:11:15.080
<v Speaker 4>and we see that every day in our portfolio, extending

0:11:15.080 --> 0:11:16.880
<v Speaker 4>all the way back to A one hundreds.

0:11:17.080 --> 0:11:19.480
<v Speaker 2>I just want to ask a specific question about the

0:11:19.520 --> 0:11:22.760
<v Speaker 2>broadening out of the customer base. And you mentioned, for example,

0:11:23.120 --> 0:11:26.960
<v Speaker 2>financial services clients. When you talk about, say a financial

0:11:27.000 --> 0:11:31.960
<v Speaker 2>services client as being distinct client from one of the

0:11:32.000 --> 0:11:34.960
<v Speaker 2>major AI labs, does that mean, well, you're saying so

0:11:35.000 --> 0:11:37.200
<v Speaker 2>it's like I'm just making it up. Let's just say

0:11:37.440 --> 0:11:39.720
<v Speaker 2>I don't know if these relationships exist. Let's say a

0:11:39.760 --> 0:11:45.680
<v Speaker 2>city group has an enterprise licensed with an ANTHROPIC. Does

0:11:45.720 --> 0:11:48.720
<v Speaker 2>that count as Anthropic as a customer or city as

0:11:48.760 --> 0:11:50.920
<v Speaker 2>a customer. And when you talk about this broadening out,

0:11:51.240 --> 0:11:55.280
<v Speaker 2>are there essentially more types of entities who are building

0:11:55.400 --> 0:11:59.559
<v Speaker 2>some type of model, not necessarily an LLM per se,

0:11:59.840 --> 0:12:03.480
<v Speaker 2>but it's some type of internal house specific model from

0:12:03.559 --> 0:12:05.040
<v Speaker 2>which they want to run inference.

0:12:05.320 --> 0:12:08.480
<v Speaker 4>It's a great question. The scenario you presented ANTHROPIC would

0:12:08.480 --> 0:12:09.040
<v Speaker 4>be our clients.

0:12:09.040 --> 0:12:09.839
<v Speaker 2>Okay, got it.

0:12:09.920 --> 0:12:11.880
<v Speaker 4>So what I'm highlighting I want to correct a number

0:12:11.880 --> 0:12:15.240
<v Speaker 4>I said earlier are financial service clients and this is

0:12:15.400 --> 0:12:21.079
<v Speaker 4>direct to those financial services. They're approaching ten billion in backlog. Okay,

0:12:21.120 --> 0:12:23.080
<v Speaker 4>so this would be you know, a good example of this,

0:12:23.120 --> 0:12:25.600
<v Speaker 4>and that's when we made recently is with Jane Street. Okay, right.

0:12:25.640 --> 0:12:28.160
<v Speaker 4>That's not Jane Street coming through Open AI or Anthropic

0:12:28.280 --> 0:12:31.280
<v Speaker 4>to get to us. That is Jane Street coming directly

0:12:31.400 --> 0:12:34.200
<v Speaker 4>to us and using our platform, and that for.

0:12:34.280 --> 0:12:39.319
<v Speaker 2>A model that they're building. So it's a Jane Street No, no, no,

0:12:39.400 --> 0:12:41.640
<v Speaker 2>I'm not saying it's setting inside training, but it would

0:12:41.640 --> 0:12:46.080
<v Speaker 2>be inference of a model that it's the Jane Street's

0:12:46.200 --> 0:12:50.920
<v Speaker 2>model of something rather than Jane Street's contract and enterprise

0:12:51.000 --> 0:12:53.000
<v Speaker 2>relationship with one of the major labs.

0:12:53.160 --> 0:12:55.800
<v Speaker 4>At the end of the day, we don't know what

0:12:55.960 --> 0:13:00.319
<v Speaker 4>exact workloads these entities are running, especially for and he's

0:13:00.320 --> 0:13:03.320
<v Speaker 4>like Jane Street would imagine that's highly secretive. Yeah. But

0:13:03.440 --> 0:13:06.439
<v Speaker 4>the point I would say is more that this is

0:13:06.480 --> 0:13:10.040
<v Speaker 4>not them coming through an AI lab yus. They are

0:13:10.440 --> 0:13:15.319
<v Speaker 4>interfacing with and managing the infrastructure directly on our platform.

0:13:15.440 --> 0:13:19.600
<v Speaker 4>And that's a really important distinction as we grow this

0:13:19.720 --> 0:13:23.560
<v Speaker 4>diversified client base. And I again, I think that we've

0:13:23.600 --> 0:13:26.720
<v Speaker 4>just done a wonderful job of executing.

0:13:26.400 --> 0:13:45.040
<v Speaker 2>That over the past, as you've talked about, including in

0:13:45.160 --> 0:13:47.520
<v Speaker 2>earnings releases, and as you can just tell from these

0:13:47.559 --> 0:13:52.360
<v Speaker 2>huge token budgets, inference demand is booming, but model training

0:13:52.559 --> 0:13:56.480
<v Speaker 2>is still important. But in addition to model training, to say, okay,

0:13:57.000 --> 0:14:00.280
<v Speaker 2>relative if you have a pie chart, the part that's

0:14:00.360 --> 0:14:03.800
<v Speaker 2>inference is getting bigger. But I assume the training is

0:14:03.840 --> 0:14:07.680
<v Speaker 2>also growing as well. But I'm curious from the perspective

0:14:07.760 --> 0:14:12.600
<v Speaker 2>of like say, the AI labs when they think about growth,

0:14:13.000 --> 0:14:16.440
<v Speaker 2>has there been a subtle shift from investing to push

0:14:16.520 --> 0:14:19.760
<v Speaker 2>the pure model frontier, having the absolute best state of

0:14:19.800 --> 0:14:24.520
<v Speaker 2>the art model, versus investing in, say, better harnesses. Because

0:14:24.520 --> 0:14:27.280
<v Speaker 2>a big reason we're excited, and i'll talk about AI

0:14:27.520 --> 0:14:31.560
<v Speaker 2>right now, is really the excitement that happened over with

0:14:31.680 --> 0:14:34.920
<v Speaker 2>Claude Code in the final quarter of twenty twenty five,

0:14:35.200 --> 0:14:38.680
<v Speaker 2>and it's like, oh, this harness has really unlocked a

0:14:38.680 --> 0:14:41.680
<v Speaker 2>bunch of capabilities. Has there been a shift in investment

0:14:41.720 --> 0:14:45.960
<v Speaker 2>from rather than just the purest, most advanced model to

0:14:46.560 --> 0:14:50.880
<v Speaker 2>let's invest more in tooling capacity and other things that

0:14:50.960 --> 0:14:54.840
<v Speaker 2>allow companies and clients to get more juice from an

0:14:54.880 --> 0:14:55.680
<v Speaker 2>advanced model.

0:14:55.920 --> 0:15:00.400
<v Speaker 4>I don't think that we're exposed to that decision making

0:15:01.040 --> 0:15:05.320
<v Speaker 4>with the AI labs as counterparties to us. The observation

0:15:05.520 --> 0:15:09.000
<v Speaker 4>I would make in a behavior change for the AI

0:15:09.160 --> 0:15:16.560
<v Speaker 4>labs is they want access to more infrastructure for longer duration, right,

0:15:16.640 --> 0:15:19.560
<v Speaker 4>And I'll qualify it a little bit, which is a year.

0:15:20.080 --> 0:15:23.760
<v Speaker 4>Two years ago, we were signing three year committed contracts.

0:15:23.880 --> 0:15:25.960
<v Speaker 4>The type of contracts we sign are basically like take

0:15:25.960 --> 0:15:29.760
<v Speaker 4>our pay contracts, which is the best way to finance

0:15:29.840 --> 0:15:34.240
<v Speaker 4>the infrastructure that we are building for our clients. Last

0:15:34.320 --> 0:15:37.400
<v Speaker 4>year it was four year contracts. Right. They were saying,

0:15:37.440 --> 0:15:42.440
<v Speaker 4>we want explicit access to Hopper for four years or

0:15:42.480 --> 0:15:46.720
<v Speaker 4>Blackwell for four years. Now they're coming and saying, well,

0:15:46.720 --> 0:15:49.000
<v Speaker 4>actually we want it for five years. We don't want

0:15:49.120 --> 0:15:52.880
<v Speaker 4>any interruption of use. We'll commit to the exact same

0:15:52.920 --> 0:15:56.440
<v Speaker 4>economics throughout the full duration of the contract. You can't

0:15:56.480 --> 0:15:59.080
<v Speaker 4>upgrade or change the infrastructure within it. You cannot cancel

0:15:59.120 --> 0:16:02.240
<v Speaker 4>the contract. It for five years, and they want it

0:16:02.280 --> 0:16:06.280
<v Speaker 4>at more scale. Right. The deployments are getting larger and larger,

0:16:06.440 --> 0:16:10.800
<v Speaker 4>so that's probably the best characterization we can offer on

0:16:11.000 --> 0:16:13.960
<v Speaker 4>decision making that AI labs are going through right now

0:16:14.000 --> 0:16:17.200
<v Speaker 4>as they look from an infrastructure perspective, it absolutely seems

0:16:17.280 --> 0:16:21.800
<v Speaker 4>like tooling is important, but scaling laws are still holding, right,

0:16:21.920 --> 0:16:27.040
<v Speaker 4>Like your ability to advance your frontier model through accessing

0:16:27.160 --> 0:16:30.960
<v Speaker 4>more infrastructure, its scale holds and that will hold through

0:16:31.120 --> 0:16:36.240
<v Speaker 4>Vera Ruben we expect and seemingly it's not stopping anytime soon.

0:16:36.400 --> 0:16:38.480
<v Speaker 3>Oh yeah, what's the deal with Arah Rubin? Can you

0:16:38.480 --> 0:16:39.320
<v Speaker 3>explain that to us?

0:16:40.720 --> 0:16:41.680
<v Speaker 4>Which aspect of it?

0:16:41.880 --> 0:16:42.480
<v Speaker 3>What is it?

0:16:42.640 --> 0:16:42.840
<v Speaker 4>Oh?

0:16:42.960 --> 0:16:45.040
<v Speaker 2>Yes, basically yeah.

0:16:44.800 --> 0:16:48.120
<v Speaker 4>So it's just Nvidio's next architecture that's coming out, right,

0:16:48.160 --> 0:16:51.920
<v Speaker 4>Like the current architecture that we're deploying today is Blackwell.

0:16:52.200 --> 0:16:56.640
<v Speaker 4>Blackwall comes. We deplay phronomenally in a NBL seventy two configuration,

0:16:56.760 --> 0:17:00.680
<v Speaker 4>which was an entire architecture change from the right. If

0:17:00.680 --> 0:17:04.879
<v Speaker 4>you recall Hopper game before Blackwell Hopper, you could deploy

0:17:05.000 --> 0:17:07.880
<v Speaker 4>these forty two U racks, which was typically like eight

0:17:07.960 --> 0:17:10.520
<v Speaker 4>GPUs in a server case. You would take it, plug

0:17:10.560 --> 0:17:14.480
<v Speaker 4>it in largely air cooled as well. Right, we ran

0:17:14.560 --> 0:17:17.919
<v Speaker 4>some liquid cooling just so we understood the requirements of

0:17:17.960 --> 0:17:22.800
<v Speaker 4>liquid cooling because Blackwell for our deployments is overwhelmingly liquid

0:17:22.840 --> 0:17:27.480
<v Speaker 4>cooled and its deployment configuration, and instead of eight GPUs

0:17:27.520 --> 0:17:30.879
<v Speaker 4>and a forty two U configuration, it's in this larger

0:17:30.920 --> 0:17:35.800
<v Speaker 4>seventy two GPU rack. It's like an entire chassis that's

0:17:35.840 --> 0:17:39.840
<v Speaker 4>being brought in and it just looks entirely different in

0:17:39.880 --> 0:17:42.479
<v Speaker 4>the data center. It's like this giant tower thing that

0:17:42.560 --> 0:17:47.120
<v Speaker 4>you've seen in pictures floating around on x so. Vera

0:17:47.200 --> 0:17:53.560
<v Speaker 4>Rubin will be the next architecture that comes out, and

0:17:53.840 --> 0:17:57.480
<v Speaker 4>we've started receiving testing racks for Vera Rubin.

0:17:57.560 --> 0:18:01.840
<v Speaker 3>But the basic idea is like the new configureuration makes

0:18:01.880 --> 0:18:05.760
<v Speaker 3>the whole system more efficient, like more tokens per energy

0:18:05.880 --> 0:18:06.920
<v Speaker 3>use and that sort of thing.

0:18:07.480 --> 0:18:10.560
<v Speaker 4>Yes, yeah, I think that's kind of where you're getting

0:18:10.560 --> 0:18:13.320
<v Speaker 4>to with it. But that doesn't necessarily mean going back

0:18:13.320 --> 0:18:16.520
<v Speaker 4>to the point earlier that everyone only wants the latest

0:18:16.520 --> 0:18:23.160
<v Speaker 4>generation of GPU. Right, we have massive demand for Ampeer, Hopper, Blackwell,

0:18:23.280 --> 0:18:28.120
<v Speaker 4>et cetera. And it just varies by US case model

0:18:28.800 --> 0:18:31.120
<v Speaker 4>and type of client as well. Like I would qualify

0:18:31.440 --> 0:18:35.119
<v Speaker 4>that AI labs are probably the ones who are lining

0:18:35.200 --> 0:18:40.359
<v Speaker 4>up first to secure access to the latest generation GPUs,

0:18:40.920 --> 0:18:46.720
<v Speaker 4>whereas enterprise clients might be probably very focused on current generation,

0:18:47.400 --> 0:18:50.359
<v Speaker 4>right like Hopper and Blackwell.

0:18:50.040 --> 0:18:52.760
<v Speaker 2>Right now, I'm going to be honest for a second.

0:18:52.880 --> 0:18:54.720
<v Speaker 2>You know, I try to keep up on a lot

0:18:54.760 --> 0:18:57.480
<v Speaker 2>of things AI related, I really do, and every single

0:18:57.520 --> 0:19:00.320
<v Speaker 2>day the ones that I got to do not keep

0:19:00.520 --> 0:19:03.320
<v Speaker 2>like in my mind. If you asked me, like I

0:19:03.440 --> 0:19:05.560
<v Speaker 2>liked it in the old days, when it was like

0:19:05.960 --> 0:19:08.320
<v Speaker 2>one eighty six, two eighty six, three eighty six, four

0:19:08.359 --> 0:19:11.440
<v Speaker 2>eighty six, Penniem and then like Penniam two, et cetera.

0:19:11.840 --> 0:19:14.159
<v Speaker 2>There was just this numerical sequence that I could keep

0:19:14.200 --> 0:19:16.359
<v Speaker 2>track of in my head. And so if someone asked

0:19:16.359 --> 0:19:19.879
<v Speaker 2>me like Joe, like Vera, Ruben Hopper, Blackwell, what was

0:19:19.960 --> 0:19:22.600
<v Speaker 2>the sequence, I'd be like, I gotta be honest with you.

0:19:22.640 --> 0:19:26.359
<v Speaker 2>I like, don't exactly remember, and I will prioritize that

0:19:26.400 --> 0:19:30.760
<v Speaker 2>at some point. But speaking of Silicon, so yesterday Microsoft

0:19:30.800 --> 0:19:32.560
<v Speaker 2>came out with a big They're really they want to

0:19:32.600 --> 0:19:34.119
<v Speaker 2>be in the game too. They don't want to just

0:19:34.160 --> 0:19:37.720
<v Speaker 2>be connected the labs. They want to have advanced models too,

0:19:37.760 --> 0:19:40.520
<v Speaker 2>and apparently it's a good model. And they announced the

0:19:40.880 --> 0:19:45.040
<v Speaker 2>MAI Thinking one model, but they said it's optimized on

0:19:45.200 --> 0:19:47.800
<v Speaker 2>the maya two hundred chip, which is their own chip.

0:19:48.280 --> 0:19:51.000
<v Speaker 2>And this is a thing which is even again going

0:19:51.040 --> 0:19:53.720
<v Speaker 2>back to our recent conversation we had even a place

0:19:53.760 --> 0:19:57.760
<v Speaker 2>like Hudson River Trading is thinking about getting into the

0:19:57.800 --> 0:20:02.760
<v Speaker 2>customized hardware game. How much juice for the squeeze is

0:20:02.840 --> 0:20:07.440
<v Speaker 2>there of aligning the model with custom silicon From your

0:20:07.600 --> 0:20:08.800
<v Speaker 2>vantage point.

0:20:09.080 --> 0:20:11.680
<v Speaker 4>What we could offer is what we hear from our clients. Yeah,

0:20:12.040 --> 0:20:15.760
<v Speaker 4>on that, And it's important to keep in mind we

0:20:15.840 --> 0:20:19.600
<v Speaker 4>can run any type of silicon on our platform. Right.

0:20:19.800 --> 0:20:23.000
<v Speaker 4>We are entirely customer led in what we build. Like,

0:20:23.040 --> 0:20:26.879
<v Speaker 4>we don't go commit to CAPEX and specuatively hope people

0:20:26.920 --> 0:20:30.040
<v Speaker 4>come and use infrastructure, right like, We wait until a

0:20:30.119 --> 0:20:33.280
<v Speaker 4>client says, we want you to go do this specific build,

0:20:33.359 --> 0:20:35.600
<v Speaker 4>here's what we want it to look like, and then

0:20:35.640 --> 0:20:37.399
<v Speaker 4>we go commit to that CAPEX right more like a

0:20:37.400 --> 0:20:42.679
<v Speaker 4>success based CAPEX approach. And the client isn't asking for

0:20:42.840 --> 0:20:47.800
<v Speaker 4>anything but in nvidio infrastructure. And I think a large

0:20:47.840 --> 0:20:50.840
<v Speaker 4>contributor to that is I mean, they built this incredible

0:20:50.880 --> 0:20:55.280
<v Speaker 4>ecosystem around their chipset. They have been dedicated to that

0:20:55.359 --> 0:20:57.880
<v Speaker 4>for I think over fifteen years at this point. Through

0:20:57.960 --> 0:21:02.600
<v Speaker 4>the Kuda architecture and in video. From what we hear

0:21:02.640 --> 0:21:07.160
<v Speaker 4>from more clients, that platform just remains the most efficient,

0:21:07.200 --> 0:21:11.719
<v Speaker 4>the most scalable, the most reliable set of infrastructure that

0:21:11.800 --> 0:21:16.800
<v Speaker 4>is in the market. Right, So I think others there's

0:21:16.880 --> 0:21:19.000
<v Speaker 4>always been you think over the past few years, right,

0:21:19.000 --> 0:21:21.200
<v Speaker 4>there's always been talking like what it is, but yeah,

0:21:21.240 --> 0:21:24.320
<v Speaker 4>this other again and these other chips, and at the

0:21:24.400 --> 0:21:27.879
<v Speaker 4>end of the day, like people are still using in

0:21:28.000 --> 0:21:32.280
<v Speaker 4>video infrastructure, they're committing to in Vidia infrastructure for five

0:21:32.440 --> 0:21:37.840
<v Speaker 4>plus year contracts in these billion, multi billion dollar commitments

0:21:38.600 --> 0:21:40.520
<v Speaker 4>because they know that that is going to be a

0:21:40.600 --> 0:21:44.760
<v Speaker 4>critical part of how they scale their business. We really

0:21:44.760 --> 0:21:49.439
<v Speaker 4>don't see demands on a material basis for anything but

0:21:49.880 --> 0:21:53.240
<v Speaker 4>that in Vidia compute and that's what we are building today.

0:21:53.640 --> 0:21:56.400
<v Speaker 2>Obviously, just to push back on this a little bit,

0:21:56.520 --> 0:21:59.040
<v Speaker 2>and I'm not really in any position to push back,

0:21:59.080 --> 0:22:02.240
<v Speaker 2>I can only relate what past guests have said and

0:22:02.440 --> 0:22:05.760
<v Speaker 2>my own reading. So what one of our guests said

0:22:05.920 --> 0:22:11.359
<v Speaker 2>is that absolutely in Nvidia has the lock on model training,

0:22:11.400 --> 0:22:14.359
<v Speaker 2>that if you want to train a model, that yes,

0:22:14.400 --> 0:22:16.399
<v Speaker 2>in video chips are the older game in talent, but

0:22:16.400 --> 0:22:19.879
<v Speaker 2>that for inference they're really his view, this is in

0:22:20.040 --> 0:22:23.080
<v Speaker 2>Donning again his views, there really were options. And then

0:22:23.119 --> 0:22:25.080
<v Speaker 2>of course we had someone who was much more biased.

0:22:25.119 --> 0:22:27.960
<v Speaker 2>We interviewed the CEU of Sarah Brash, the company that

0:22:27.960 --> 0:22:32.119
<v Speaker 2>makes the Gigantic plate and or sorry, the Gigantic pichanic,

0:22:32.480 --> 0:22:34.560
<v Speaker 2>and of course he did but I mean, of course

0:22:34.640 --> 0:22:37.320
<v Speaker 2>he was gonna say, yeah, the Kudo mote is vastly

0:22:37.359 --> 0:22:41.119
<v Speaker 2>overrated for inference. It barely exists now. Of course, of

0:22:41.119 --> 0:22:43.240
<v Speaker 2>course he's going to say that, so, like, you know,

0:22:43.320 --> 0:22:45.960
<v Speaker 2>he's in a competitor, but we've also heard it from

0:22:46.000 --> 0:22:50.080
<v Speaker 2>a user of inference, and intuitively it makes sense, like

0:22:50.200 --> 0:22:54.640
<v Speaker 2>training is very complicated, all that stuff. But what you're

0:22:54.680 --> 0:22:58.320
<v Speaker 2>saying is that from the customer standpoint, you see the

0:22:58.560 --> 0:23:02.360
<v Speaker 2>demand for in Vidia on both the training and the

0:23:02.400 --> 0:23:06.719
<v Speaker 2>inference as being steady, and that you perceive that advantage

0:23:06.760 --> 0:23:09.120
<v Speaker 2>to be consistent through both aspects.

0:23:09.280 --> 0:23:12.959
<v Speaker 4>So I believe in our last quarterly report, our CEO

0:23:13.119 --> 0:23:18.639
<v Speaker 4>might qualify that inference workloads represent well in access of

0:23:18.800 --> 0:23:23.800
<v Speaker 4>fifty percent of infrastructure utilization on our platform. Exact same

0:23:23.800 --> 0:23:26.480
<v Speaker 4>infrastructure we do is for training. Yeah. Right, When going

0:23:26.480 --> 0:23:28.680
<v Speaker 4>back to my commentive like it's very fungible between those

0:23:28.760 --> 0:23:33.920
<v Speaker 4>different types of workloads those customers are choosing in Vidia

0:23:33.960 --> 0:23:37.240
<v Speaker 4>to work with. Okay, I think what you're going to

0:23:37.320 --> 0:23:41.760
<v Speaker 4>see is people will want to try at small scale

0:23:42.280 --> 0:23:49.800
<v Speaker 4>other types of silicon, but the reliable, proven, and remains

0:23:49.840 --> 0:23:55.199
<v Speaker 4>from our perspective, most efficient infrastructure to use is in

0:23:55.320 --> 0:23:58.840
<v Speaker 4>video today. Does that change over time? Who really knows,

0:23:59.200 --> 0:24:03.639
<v Speaker 4>But I think we've seen in video battling this concept

0:24:03.960 --> 0:24:06.000
<v Speaker 4>for years, and every year they show up and like

0:24:06.040 --> 0:24:10.639
<v Speaker 4>they remain the de facto choice for AI infrastructure. I

0:24:10.640 --> 0:24:11.920
<v Speaker 4>think we're going to be one of the first people

0:24:11.920 --> 0:24:14.919
<v Speaker 4>in the market to see it because that will be

0:24:15.040 --> 0:24:18.320
<v Speaker 4>a tone shift change for our clients asking us to

0:24:18.480 --> 0:24:21.320
<v Speaker 4>run something else that hasn't happened.

0:24:21.400 --> 0:24:25.240
<v Speaker 3>Okay, So have the constraints on your business changed at all?

0:24:25.320 --> 0:24:28.560
<v Speaker 3>So three years ago we were talking about GPUs and

0:24:28.560 --> 0:24:31.640
<v Speaker 3>how hard they were to actually get. I imagine GPU

0:24:31.840 --> 0:24:35.520
<v Speaker 3>securing GPUs is still competitive to say the least. But

0:24:35.600 --> 0:24:38.720
<v Speaker 3>are you seeing other constraints emerge? Like Joe mentioned in

0:24:38.760 --> 0:24:43.320
<v Speaker 3>the intro, just land usage just places to actually build

0:24:43.520 --> 0:24:44.200
<v Speaker 3>data centers.

0:24:44.880 --> 0:24:48.439
<v Speaker 4>Landed usage specifically, I wouldn't say is as much of

0:24:48.480 --> 0:24:53.480
<v Speaker 4>a concern. Having a powered shell is the battleneck? Okay

0:24:53.480 --> 0:24:57.600
<v Speaker 4>today and let me qualify ball poweredshell. PowerShell is effectively

0:24:57.640 --> 0:25:02.040
<v Speaker 4>an empty data center that is energized. Right. It has

0:25:02.119 --> 0:25:05.800
<v Speaker 4>all the power and associated components. I can come into

0:25:05.880 --> 0:25:10.800
<v Speaker 4>it and deliver electrons into a reck, has the cooling

0:25:10.800 --> 0:25:13.160
<v Speaker 4>system built within it, like it has a whole thing. Right.

0:25:13.520 --> 0:25:17.960
<v Speaker 4>Poweredshell is the industry termed for it. That is the

0:25:18.000 --> 0:25:22.000
<v Speaker 4>bottleneck because of all of the supply chains that come

0:25:22.040 --> 0:25:24.159
<v Speaker 4>into that, right Like, not only you have electricity, do

0:25:24.200 --> 0:25:27.320
<v Speaker 4>you have the land, et cetera, But you have the

0:25:27.400 --> 0:25:32.400
<v Speaker 4>backup batteries, supplies, you have the transformers, you have personnel.

0:25:32.480 --> 0:25:35.119
<v Speaker 4>Right they just think about the electricians for these sites

0:25:35.160 --> 0:25:39.919
<v Speaker 4>and getting the accreditation on the electrician side to be

0:25:39.960 --> 0:25:41.639
<v Speaker 4>able to participate in these builds. I mean, I think

0:25:41.680 --> 0:25:44.560
<v Speaker 4>it's a five year plus apprenticeship to be able to

0:25:44.600 --> 0:25:47.360
<v Speaker 4>go through that program. Right. We can't just make new

0:25:47.400 --> 0:25:50.960
<v Speaker 4>electricians leveraging a supply chain, right like that that's a

0:25:51.040 --> 0:25:56.119
<v Speaker 4>trade that you can't really scale efficiently. So that is

0:25:56.400 --> 0:25:59.640
<v Speaker 4>absolutely the bottleneck for us. And I think our peer

0:25:59.720 --> 0:26:04.560
<v Speaker 4>set that's out there right now, access to chips, I

0:26:04.560 --> 0:26:07.440
<v Speaker 4>think we have a phenomenal relationship within video where we've

0:26:07.480 --> 0:26:11.400
<v Speaker 4>just proven to be the best operator of this infrastructure

0:26:11.680 --> 0:26:15.760
<v Speaker 4>on the planet. You know, a bottleneck that existed for

0:26:15.840 --> 0:26:18.120
<v Speaker 4>us previously I think was access to financing.

0:26:18.320 --> 0:26:20.720
<v Speaker 3>Yeah right, we an't know, doesn't seem to be an

0:26:20.760 --> 0:26:21.480
<v Speaker 3>issue anymore.

0:26:22.880 --> 0:26:28.760
<v Speaker 4>I would agree with that broadly, But that's years of

0:26:28.920 --> 0:26:33.000
<v Speaker 4>work in execution that has delivered that ability for us.

0:26:33.040 --> 0:26:35.800
<v Speaker 4>I mean, year to date, we've raised over twenty one

0:26:36.200 --> 0:26:40.480
<v Speaker 4>billion dollars of financing for our business. Like you don't

0:26:40.720 --> 0:26:42.479
<v Speaker 4>get to do that and just go from you know,

0:26:42.640 --> 0:26:46.000
<v Speaker 4>zero to twenty right out of nowhere. And I think

0:26:46.000 --> 0:26:50.880
<v Speaker 4>that's largely driven by our track record of execution. Right,

0:26:50.880 --> 0:26:56.320
<v Speaker 4>our investors, our creditors can see this deep set of

0:26:56.400 --> 0:27:00.879
<v Speaker 4>experience over the years of consistently delivering on these builds.

0:27:00.920 --> 0:27:05.959
<v Speaker 4>I mean, we have over a gigawant in active power

0:27:06.359 --> 0:27:08.639
<v Speaker 4>at this point, right, like a gigawat like at the

0:27:08.680 --> 0:27:13.239
<v Speaker 4>data center level with GPUs delivering into clients. And I

0:27:13.280 --> 0:27:16.439
<v Speaker 4>think that there's been kind of a misunderstanding of the

0:27:16.480 --> 0:27:20.000
<v Speaker 4>market where people are conflating the concept that like, you know,

0:27:20.520 --> 0:27:24.560
<v Speaker 4>something on paper is equivalent to being physically done and delivered.

0:27:24.680 --> 0:27:27.320
<v Speaker 4>And all I can say is there's an enormous gap

0:27:27.760 --> 0:27:31.560
<v Speaker 4>between you know, signing for power for delivery in twenty

0:27:31.760 --> 0:27:37.320
<v Speaker 4>thirty versus actually delivering that into billable GPU hours, and

0:27:37.400 --> 0:27:42.720
<v Speaker 4>that gap of execution is what has driven down our

0:27:42.840 --> 0:27:47.760
<v Speaker 4>costly capital so aggressively. That gap is where our business

0:27:48.200 --> 0:27:50.720
<v Speaker 4>sits and why it has been so successful. I mean,

0:27:50.720 --> 0:27:55.200
<v Speaker 4>that's the secret Sauce is our ability to take these

0:27:56.000 --> 0:28:00.240
<v Speaker 4>data center deployments and these customer relationships and deliver were

0:28:00.400 --> 0:28:02.560
<v Speaker 4>billable GPU hours into them.

0:28:03.160 --> 0:28:05.840
<v Speaker 2>You know, speaking of financing, I just want to say,

0:28:05.840 --> 0:28:09.359
<v Speaker 2>you know, during last year, like maybe six months ago,

0:28:09.840 --> 0:28:12.520
<v Speaker 2>that might have been the sort of near peak of

0:28:12.560 --> 0:28:15.320
<v Speaker 2>the Michael Burry inspired These chips are like in the

0:28:15.400 --> 0:28:19.640
<v Speaker 2>last two years stuff. And one of the viral charts

0:28:19.680 --> 0:28:23.120
<v Speaker 2>that you would see on Twitter was the core We've

0:28:23.200 --> 0:28:26.240
<v Speaker 2>CDs chart. Those have come way in, So it is

0:28:26.480 --> 0:28:30.320
<v Speaker 2>it is you know, while yeah, that's right, that's the

0:28:30.359 --> 0:28:33.480
<v Speaker 2>thing about CDs. No one never posts charts of credit

0:28:33.560 --> 0:28:36.439
<v Speaker 2>default swaps when they're coming in. They people love to

0:28:36.440 --> 0:28:38.680
<v Speaker 2>post them when they're blowing out. They have come in,

0:28:39.160 --> 0:28:41.520
<v Speaker 2>So you know, that does speak to some of this

0:28:41.600 --> 0:28:45.360
<v Speaker 2>point about these anxieties having been a leaved at least

0:28:45.800 --> 0:28:48.440
<v Speaker 2>somewhat since the start of the year. You know, it

0:28:48.520 --> 0:28:52.240
<v Speaker 2>occurred to me, like we're talking about credit default swaps,

0:28:52.240 --> 0:28:55.920
<v Speaker 2>we're talking about financing. I'm sort of gearing up to

0:28:55.960 --> 0:28:58.080
<v Speaker 2>write a big thing maybe, but I'm writing it in

0:28:58.080 --> 0:29:00.400
<v Speaker 2>my head currently that there really are a lot of

0:29:00.440 --> 0:29:04.200
<v Speaker 2>analogies between the business of data centers and the business

0:29:04.280 --> 0:29:06.720
<v Speaker 2>of banking. And one of the things in banking is

0:29:06.760 --> 0:29:11.240
<v Speaker 2>we all learned from SVB was the risk of industry

0:29:11.240 --> 0:29:14.160
<v Speaker 2>and depositor concentration that you if you have all your

0:29:14.240 --> 0:29:17.480
<v Speaker 2>depositors are either in like one depositor gets too big,

0:29:17.800 --> 0:29:20.560
<v Speaker 2>or all your depositors are in the same industry, then

0:29:20.600 --> 0:29:24.080
<v Speaker 2>you have this risk of like correlated withdrawals. And that's

0:29:24.080 --> 0:29:28.040
<v Speaker 2>what obviously did in SVB. When you think about planning

0:29:28.640 --> 0:29:31.120
<v Speaker 2>and you think about, okay, here's a investment, et cetera,

0:29:31.320 --> 0:29:34.480
<v Speaker 2>how much does this come up sort of like thinking

0:29:34.520 --> 0:29:40.480
<v Speaker 2>about I guess tenant uh diversification. Yeah, tenant diversification as

0:29:40.560 --> 0:29:43.040
<v Speaker 2>something that you think about in your multi your planning.

0:29:43.400 --> 0:29:46.480
<v Speaker 4>It's a critical aspect of it. Right, As I said

0:29:46.480 --> 0:29:48.920
<v Speaker 4>earlier too, like this was a key criticism of us

0:29:48.960 --> 0:29:52.040
<v Speaker 4>coming into our im right where we had that customer

0:29:52.080 --> 0:29:57.280
<v Speaker 4>concentration in our revenue, and we have made enormous progress there,

0:29:57.480 --> 0:29:59.000
<v Speaker 4>and I think the best way to think about it

0:29:59.040 --> 0:30:03.720
<v Speaker 4>is we could take all of our unallocated capacity. And

0:30:03.760 --> 0:30:07.720
<v Speaker 4>I say that very specific it's not unsold capacity and

0:30:08.000 --> 0:30:10.480
<v Speaker 4>implying that there's no demand for it. It's unallocated. There's

0:30:10.520 --> 0:30:14.160
<v Speaker 4>intense demand for we're figuring out where it should go,

0:30:14.640 --> 0:30:18.800
<v Speaker 4>and that customer piece of it, I think, honestly, like

0:30:18.880 --> 0:30:23.520
<v Speaker 4>we could allocate all that capacity to like single name clients, right, Like,

0:30:23.560 --> 0:30:26.800
<v Speaker 4>there is a pretty significant number of single name clients

0:30:26.800 --> 0:30:30.480
<v Speaker 4>we can go allocated out into. But I don't think

0:30:30.560 --> 0:30:33.200
<v Speaker 4>that is the business we are supposed to be building here.

0:30:33.280 --> 0:30:35.320
<v Speaker 4>I think the business we are supposed to be building

0:30:35.760 --> 0:30:43.000
<v Speaker 4>is a diversified cloud that is supporting the leading AI

0:30:43.280 --> 0:30:46.479
<v Speaker 4>consumers and producers on the planet. I don't think we're

0:30:46.520 --> 0:30:48.240
<v Speaker 4>supposed to be supported in just one or two companies.

0:31:04.360 --> 0:31:06.960
<v Speaker 3>When it comes to financing, Can you say a little

0:31:06.960 --> 0:31:10.160
<v Speaker 3>bit more about what changed to make the market more

0:31:10.240 --> 0:31:13.280
<v Speaker 3>comfortable with this, because like this is the big story

0:31:13.360 --> 0:31:17.200
<v Speaker 3>in markets, just how much AI is now being issued

0:31:17.240 --> 0:31:20.640
<v Speaker 3>through the corporate bond market. The equity market as we

0:31:20.680 --> 0:31:23.440
<v Speaker 3>know is basically all big tech at the moment. Like,

0:31:23.720 --> 0:31:26.280
<v Speaker 3>what changed on the part of investors was Was it

0:31:26.360 --> 0:31:29.880
<v Speaker 3>just pure return and performance or were there I guess

0:31:29.920 --> 0:31:34.320
<v Speaker 3>efforts to like make the contracts more robust or increase

0:31:34.480 --> 0:31:36.800
<v Speaker 3>visibility into demand and that sort of thing.

0:31:37.880 --> 0:31:44.040
<v Speaker 4>Seeing the inference aspect of it really emboldened investors, But

0:31:44.640 --> 0:31:48.400
<v Speaker 4>like that was really just January, right, or maybe late

0:31:48.520 --> 0:31:51.880
<v Speaker 4>Q four where you started seeing this just massive inflection

0:31:52.240 --> 0:31:58.760
<v Speaker 4>of demand driven by inference for us, right, It's tough

0:31:58.760 --> 0:32:01.000
<v Speaker 4>for me to speak about you know, other companies, but

0:32:01.080 --> 0:32:04.560
<v Speaker 4>for us, like why why have we been underwritten at

0:32:04.680 --> 0:32:08.280
<v Speaker 4>such scale and at a decreasing costly capital? I think

0:32:08.280 --> 0:32:10.280
<v Speaker 4>it goes back to that track record of execution, right,

0:32:10.440 --> 0:32:12.960
<v Speaker 4>is just the market has watched us execute and watch

0:32:13.000 --> 0:32:16.520
<v Speaker 4>us deliver on these contracts. And the way then tell

0:32:16.520 --> 0:32:18.600
<v Speaker 4>me if I'm going into too much detail here, but

0:32:18.600 --> 0:32:21.720
<v Speaker 4>but the way that we finance our business. You kind

0:32:21.720 --> 0:32:24.160
<v Speaker 4>of break it into two broad buckets, right, you have

0:32:24.440 --> 0:32:28.200
<v Speaker 4>parent co financing and acid co financing. And acid co

0:32:28.320 --> 0:32:34.560
<v Speaker 4>financing is where all of the GPUs you get financed. Right,

0:32:34.600 --> 0:32:38.400
<v Speaker 4>It's where all of our client contracts sit. And we

0:32:38.440 --> 0:32:43.000
<v Speaker 4>can take these financings and put them into SPDs or

0:32:43.040 --> 0:32:46.440
<v Speaker 4>we'll just call it a box so to say. And you.

0:32:48.200 --> 0:32:54.480
<v Speaker 2>Keep going lots of connotations, but keep going, keep we.

0:32:54.440 --> 0:32:58.840
<v Speaker 4>Put them into SPDs UH and these these stvs, they

0:32:58.920 --> 0:33:02.040
<v Speaker 4>have the emphas structure, they have the data center costs,

0:33:02.240 --> 0:33:06.760
<v Speaker 4>and they have the debt agreements within them. And so

0:33:07.080 --> 0:33:10.080
<v Speaker 4>you're able to pair this like five year take or

0:33:10.120 --> 0:33:15.200
<v Speaker 4>pay contract to an amortization schedule on the debt and

0:33:15.560 --> 0:33:17.479
<v Speaker 4>you have the revenue come into the box, pay down

0:33:17.520 --> 0:33:21.680
<v Speaker 4>the amortization schedule, pay down the operating costs of the

0:33:21.800 --> 0:33:25.720
<v Speaker 4>data center, and it still contributes a It has a

0:33:25.880 --> 0:33:29.760
<v Speaker 4>twenty five percent contribution margin of profit up to the

0:33:29.800 --> 0:33:33.920
<v Speaker 4>parent code. Right, Like these are highly profitable agreements down

0:33:33.920 --> 0:33:36.640
<v Speaker 4>to the SPB stack. And so you take that SPB

0:33:36.760 --> 0:33:40.520
<v Speaker 4>out to the credit market and say, look at this instrument.

0:33:40.640 --> 0:33:44.520
<v Speaker 4>It's a discrete set of contracts with counterparties like any

0:33:44.520 --> 0:33:47.400
<v Speaker 4>ease you want to consume GPU compute, you have the

0:33:47.480 --> 0:33:50.760
<v Speaker 4>data centers within it, et cetera. And you know, one

0:33:50.760 --> 0:33:53.000
<v Speaker 4>of the latest ones we did was as we call

0:33:53.120 --> 0:33:58.400
<v Speaker 4>ddtail four. This was a investment grade rated first of

0:33:58.400 --> 0:34:01.760
<v Speaker 4>its class. No one had done this before for GPU financing,

0:34:02.520 --> 0:34:06.440
<v Speaker 4>non recourse HPC infrastructure financing, and got done it. So

0:34:06.600 --> 0:34:11.120
<v Speaker 4>for plus two twenty five like that is a phenomenal

0:34:11.239 --> 0:34:15.040
<v Speaker 4>cost of capital for us, And importantly, we were able

0:34:15.080 --> 0:34:18.600
<v Speaker 4>to bring in the insurance charge of capital, which is

0:34:18.600 --> 0:34:20.680
<v Speaker 4>a massive change of capital out there that is looking

0:34:20.760 --> 0:34:23.920
<v Speaker 4>to do allocations into the space. So we're kind of

0:34:23.960 --> 0:34:28.880
<v Speaker 4>continuously making progress through these different stacks of capital, unlocking

0:34:29.000 --> 0:34:32.000
<v Speaker 4>access to more and more types of investors. It's why

0:34:32.000 --> 0:34:34.040
<v Speaker 4>you've seen this move into the convertible note market, into

0:34:34.040 --> 0:34:39.680
<v Speaker 4>the unsecured market as well, along with taking direct strategic

0:34:39.719 --> 0:34:44.799
<v Speaker 4>equity investments. But for us, it's really important for the

0:34:45.040 --> 0:34:49.880
<v Speaker 4>entire investor space to understand this business because this business

0:34:49.920 --> 0:34:54.000
<v Speaker 4>largely didn't exist before, right, Like people weren't making loans

0:34:54.000 --> 0:34:57.000
<v Speaker 4>into the hyper scale just to go credit these buildouts.

0:34:57.520 --> 0:35:01.879
<v Speaker 4>Right it's on core weave honestly to be building this

0:35:02.000 --> 0:35:08.040
<v Speaker 4>path into how do you finance the AI hyperscaler effectively?

0:35:08.080 --> 0:35:11.120
<v Speaker 4>And I think we've just done a terrific job of

0:35:11.160 --> 0:35:12.640
<v Speaker 4>it over the past few years.

0:35:12.920 --> 0:35:15.480
<v Speaker 2>You used to be, in a prior lifetime a trader.

0:35:15.200 --> 0:35:17.360
<v Speaker 4>Right, yes, as a commodity trader.

0:35:17.600 --> 0:35:19.799
<v Speaker 2>So I'm curious, like you know, there's a lot of

0:35:19.880 --> 0:35:22.120
<v Speaker 2>interest in and I don't know if it's going to

0:35:22.200 --> 0:35:26.520
<v Speaker 2>materialize in GPU capacity trading and there's gonna be a

0:35:26.560 --> 0:35:31.840
<v Speaker 2>new contract. We recently interviewed the CEO of Compute Exchange

0:35:31.840 --> 0:35:34.719
<v Speaker 2>and they're very close to having something listed on the CME.

0:35:35.400 --> 0:35:39.120
<v Speaker 2>From your perspective, because I don't have a view on

0:35:39.160 --> 0:35:41.680
<v Speaker 2>this yet, the one like you see, like okay, a

0:35:41.719 --> 0:35:45.480
<v Speaker 2>big AI company does a five year contract. As you say,

0:35:45.520 --> 0:35:48.279
<v Speaker 2>the duration is lengthening, we're gonna lock this in. I

0:35:48.320 --> 0:35:51.040
<v Speaker 2>don't know, like what the need is for tradeable compute

0:35:51.080 --> 0:35:54.120
<v Speaker 2>in that environment, etc. What's your guest, like, do you

0:35:55.040 --> 0:36:00.560
<v Speaker 2>anticipate that there will be a sufficient ecology of hedgers

0:36:00.680 --> 0:36:04.440
<v Speaker 2>and speculators such that there will be a liquid market

0:36:04.480 --> 0:36:05.640
<v Speaker 2>for tradable compute.

0:36:05.920 --> 0:36:07.960
<v Speaker 4>I think it's a it's a very much a timeline

0:36:08.040 --> 0:36:13.440
<v Speaker 4>question that's out there. Short term No, let me offer

0:36:14.000 --> 0:36:16.120
<v Speaker 4>why no short term? And then I'd say maybe in

0:36:16.160 --> 0:36:21.480
<v Speaker 4>the long term. And it all comes back to fungibility. Right.

0:36:21.520 --> 0:36:23.239
<v Speaker 4>If you think about gold, the gold is defined by

0:36:23.239 --> 0:36:27.160
<v Speaker 4>its chemical composition, right, and there's no question of of

0:36:27.680 --> 0:36:32.840
<v Speaker 4>what is gold and not gold, et cetera. Compute really isn't, right,

0:36:33.239 --> 0:36:39.160
<v Speaker 4>especially GPU compute GP compute today is not fungible. And

0:36:39.400 --> 0:36:43.920
<v Speaker 4>I think that this is well understood by our client base,

0:36:44.400 --> 0:36:47.920
<v Speaker 4>by our suppliers, by you know, third party consultants like

0:36:47.920 --> 0:36:52.480
<v Speaker 4>syny Analysis, And it's it's this idea that an H

0:36:52.520 --> 0:36:55.680
<v Speaker 4>one hundred deployed and one cloud doesn't have the same

0:36:55.719 --> 0:36:59.000
<v Speaker 4>performance of an eight one in another cloud. And the

0:36:59.280 --> 0:37:01.719
<v Speaker 4>metrics that people use are things like good put or

0:37:02.000 --> 0:37:05.759
<v Speaker 4>model flop utilization MFUS, and there are these measurements of

0:37:06.239 --> 0:37:09.840
<v Speaker 4>like how much more performance is one the exact same

0:37:09.960 --> 0:37:14.920
<v Speaker 4>GPU by the way, versus another GPU deployed in another facility.

0:37:15.080 --> 0:37:18.920
<v Speaker 4>And so in order for something to be commoditized, it

0:37:19.000 --> 0:37:23.280
<v Speaker 4>has to be fungible, right, Otherwise there's just too much

0:37:23.560 --> 0:37:27.080
<v Speaker 4>you know, murkiness, and there isn't like an exact data

0:37:27.120 --> 0:37:27.640
<v Speaker 4>point in there.

0:37:27.680 --> 0:37:30.000
<v Speaker 2>Can I push on that a little bit further? So,

0:37:30.440 --> 0:37:33.960
<v Speaker 2>I mean, I think that seems like a reasonable view.

0:37:34.760 --> 0:37:39.280
<v Speaker 2>Is the non fungibility related to configuration of like how

0:37:39.440 --> 0:37:44.120
<v Speaker 2>they literally like the configuration of the GPUs within physically,

0:37:44.600 --> 0:37:47.560
<v Speaker 2>Like what is it? Is it about power? I mean,

0:37:47.560 --> 0:37:49.600
<v Speaker 2>I think they're all like, you know, there are plenty

0:37:49.600 --> 0:37:51.719
<v Speaker 2>of places that will say you only have nine nines

0:37:51.800 --> 0:37:54.160
<v Speaker 2>or however many nines you need in your industry or whatever.

0:37:54.400 --> 0:37:58.080
<v Speaker 2>What is it in your view that would cause significant

0:37:58.200 --> 0:38:01.480
<v Speaker 2>changes in the performance of an H one hundred in

0:38:01.560 --> 0:38:02.760
<v Speaker 2>one cloud verse another.

0:38:03.120 --> 0:38:06.400
<v Speaker 4>It could be in some part configuration. Right, we build

0:38:06.400 --> 0:38:11.120
<v Speaker 4>everything the DJX reference back, which is the most outlined

0:38:11.160 --> 0:38:14.600
<v Speaker 4>by video. It's the most performance way to build, operate,

0:38:14.600 --> 0:38:18.640
<v Speaker 4>and deliver GPUs. But the rest of it, honestly is

0:38:18.719 --> 0:38:22.840
<v Speaker 4>just how you operate the GPUs, and that is the

0:38:22.880 --> 0:38:27.279
<v Speaker 4>core weave software stack. That is, how do you keep

0:38:27.320 --> 0:38:30.080
<v Speaker 4>these GPUs online? Right? Like what happens if the GPU flails?

0:38:30.280 --> 0:38:33.080
<v Speaker 4>Can you predict if a GPU is about to fail

0:38:33.320 --> 0:38:36.720
<v Speaker 4>and swap in other infrastructure so that the client doesn't

0:38:37.320 --> 0:38:41.360
<v Speaker 4>have downtime on that component. And there's an immense suite

0:38:41.400 --> 0:38:45.880
<v Speaker 4>of software solutions that and infrastuctual management solutions that we

0:38:46.000 --> 0:38:48.719
<v Speaker 4>have built to have the best good put to have

0:38:48.800 --> 0:38:52.880
<v Speaker 4>the best MFUS in the industry. That's none of that

0:38:53.040 --> 0:38:57.520
<v Speaker 4>is off the shelf, right, And so I wouldn't say

0:38:57.520 --> 0:38:59.719
<v Speaker 4>it comes down to the strict components. That's kind of

0:38:59.719 --> 0:39:02.200
<v Speaker 4>like a minimum starting point, right, Like you have to

0:39:02.239 --> 0:39:06.680
<v Speaker 4>start in DJ's referencepect But where's differentiation come from there?

0:39:06.719 --> 0:39:09.600
<v Speaker 4>I mean, that's that's the core re product you're describing

0:39:09.680 --> 0:39:10.040
<v Speaker 4>right there.

0:39:10.200 --> 0:39:12.680
<v Speaker 2>By the way, Tracy, I'm just looking up in terms

0:39:12.680 --> 0:39:16.440
<v Speaker 2>of art. Goodput measures the fraction of peak hardware performance

0:39:16.480 --> 0:39:18.920
<v Speaker 2>that the training job can extract. This is according to

0:39:18.960 --> 0:39:24.319
<v Speaker 2>Google and mfu's model FLOPS utilization Hardware metric for evaluating

0:39:24.440 --> 0:39:27.600
<v Speaker 2>real world efficiency of LEMA training. So two new terms.

0:39:27.640 --> 0:39:30.239
<v Speaker 2>I actually hadn't heard of MFUS or good put before this.

0:39:30.800 --> 0:39:32.439
<v Speaker 2>I just learned two new terms today.

0:39:32.480 --> 0:39:36.400
<v Speaker 3>We got to create a gloss AI glossary. Yeah, I do, Brandon.

0:39:36.920 --> 0:39:40.360
<v Speaker 3>When Joe asked you that question about compute markets earlier,

0:39:40.400 --> 0:39:43.120
<v Speaker 3>you said it was a timeline question, which in my

0:39:43.239 --> 0:39:47.000
<v Speaker 3>mind implies that it's inevitable, like it's just a question

0:39:47.040 --> 0:39:49.080
<v Speaker 3>of how long it takes. But then when you describe

0:39:49.080 --> 0:39:53.080
<v Speaker 3>the fungibility problem, it seems like this is an actual

0:39:53.160 --> 0:39:55.240
<v Speaker 3>issue that will be very difficult to solve.

0:39:55.560 --> 0:39:59.400
<v Speaker 4>Yes, I think that characterization is absolutely correct. Right, Like

0:39:59.480 --> 0:40:03.520
<v Speaker 4>if you just general commodity theory and I traded natural

0:40:03.560 --> 0:40:08.880
<v Speaker 4>as electricity agriculture products for over a decade, like it

0:40:08.960 --> 0:40:12.560
<v Speaker 4>suggests that it should become that at some point, But

0:40:12.760 --> 0:40:15.720
<v Speaker 4>what is the reality today? The reality is this stuff

0:40:15.760 --> 0:40:19.279
<v Speaker 4>isn't getting easier to operate, right. We've moved from these

0:40:19.360 --> 0:40:22.920
<v Speaker 4>kind of relatively simple forty two to uter air cold

0:40:23.560 --> 0:40:30.960
<v Speaker 4>racks of Hopper to these immensely complex Blackwell deployments moving

0:40:31.000 --> 0:40:33.960
<v Speaker 4>into Vera Rubin following that, like, it's not getting easier

0:40:34.280 --> 0:40:40.000
<v Speaker 4>to build, operate, provision, deliver these reputs. It's getting more difficult,

0:40:40.160 --> 0:40:44.200
<v Speaker 4>and I think until it starts becoming easier, you don't

0:40:44.239 --> 0:40:47.840
<v Speaker 4>really have a path to commoditization. You will have to

0:40:47.920 --> 0:40:53.160
<v Speaker 4>continue to prioritize working with the world class and world

0:40:53.280 --> 0:40:56.799
<v Speaker 4>leading operators of infrastructure. That's where we sit.

0:40:57.239 --> 0:40:59.080
<v Speaker 2>First of all, this is helpful, and I like that

0:40:59.080 --> 0:41:01.360
<v Speaker 2>we're getting multiple respectives because I do think this is

0:41:01.360 --> 0:41:03.840
<v Speaker 2>gonna be like one of the big questions for financial markets,

0:41:03.920 --> 0:41:06.560
<v Speaker 2>because let's say if they took off, then you could

0:41:06.560 --> 0:41:09.719
<v Speaker 2>imagine that might even improve financing conditions because then the

0:41:09.840 --> 0:41:13.359
<v Speaker 2>lender can hedge against that. Yeah, so like there would

0:41:13.360 --> 0:41:16.120
<v Speaker 2>probably be some good things for the industry if this

0:41:16.200 --> 0:41:19.279
<v Speaker 2>took off. So I appreciate it's good to have your

0:41:19.880 --> 0:41:23.440
<v Speaker 2>perspective on this. Why is it you know, I'm a

0:41:23.600 --> 0:41:26.600
<v Speaker 2>I'm an inference I am an inference user, by the way,

0:41:26.640 --> 0:41:29.399
<v Speaker 2>So I made a little machine learning model in one

0:41:29.400 --> 0:41:34.480
<v Speaker 2>of my hobby projects, and I provide inference O, Havelock,

0:41:34.680 --> 0:41:37.320
<v Speaker 2>dot AI or I'm a user of inference or whatever,

0:41:37.560 --> 0:41:38.680
<v Speaker 2>I have a model whatever.

0:41:38.960 --> 0:41:41.920
<v Speaker 3>Why is it that impressive if you were providing I'm

0:41:41.920 --> 0:41:42.319
<v Speaker 3>trying to.

0:41:42.480 --> 0:41:45.080
<v Speaker 2>I guess I have a consumer of inference. I use

0:41:45.120 --> 0:41:49.680
<v Speaker 2>a anyway, Why is it that I am actually very

0:41:49.760 --> 0:41:54.440
<v Speaker 2>easily able to get not a huge allocation of you like,

0:41:55.040 --> 0:41:57.319
<v Speaker 2>GPU access? So I was like, how do I train

0:41:57.400 --> 0:41:59.920
<v Speaker 2>this model? It's a model called bert the Google really

0:42:00.040 --> 0:42:03.040
<v Speaker 2>east in twenty eighteen or twenty nineteen. I fine tuned

0:42:03.040 --> 0:42:07.160
<v Speaker 2>it for my purposes and then literally using claud code,

0:42:07.400 --> 0:42:09.680
<v Speaker 2>I was able to in ten minutes sign up. I

0:42:09.680 --> 0:42:12.360
<v Speaker 2>started using this company called Modal, and I was able

0:42:12.400 --> 0:42:16.920
<v Speaker 2>to start training a model. I was surprised that there's

0:42:17.040 --> 0:42:18.239
<v Speaker 2>like and it didn't cost me very.

0:42:18.239 --> 0:42:19.360
<v Speaker 4>Much and I have like no volume.

0:42:19.400 --> 0:42:23.080
<v Speaker 2>But nonetheless, evidently there was a little GPU capacity out

0:42:23.080 --> 0:42:24.719
<v Speaker 2>there that I could get and it cost me like

0:42:24.800 --> 0:42:27.920
<v Speaker 2>five dollars or something for the whole thing. Given what

0:42:28.080 --> 0:42:31.759
<v Speaker 2>you always hear about like a utilization is slammed, why

0:42:32.080 --> 0:42:36.000
<v Speaker 2>is it actually not that hard to find GPU capacity

0:42:36.000 --> 0:42:36.960
<v Speaker 2>for someone like myself?

0:42:37.160 --> 0:42:40.560
<v Speaker 4>You know, I think it's the skill O difference right there,

0:42:40.600 --> 0:42:44.720
<v Speaker 4>finding ones or tens of GPUs. I think that's way

0:42:44.800 --> 0:42:48.239
<v Speaker 4>more accessible out there. Okay, our clients are focused on

0:42:48.320 --> 0:42:51.320
<v Speaker 4>the hundreds of thousands of gps.

0:42:51.360 --> 0:42:53.839
<v Speaker 2>I'm not there yet, but I'm not there yet, not yet.

0:42:54.120 --> 0:42:57.480
<v Speaker 4>I'm sure you'll get there, yes, And that's where it

0:42:57.560 --> 0:43:01.239
<v Speaker 4>kind of decommodetizes itself with scale as well. Right, Like,

0:43:01.480 --> 0:43:05.080
<v Speaker 4>as you're in the hundreds of thousands component, there's just

0:43:05.120 --> 0:43:09.200
<v Speaker 4>not that many deployments, right, It's handfuls of deployments at

0:43:09.200 --> 0:43:13.440
<v Speaker 4>that size, But getting access to ones of GPUs, I

0:43:13.480 --> 0:43:16.880
<v Speaker 4>think that there is a lot more ability to go

0:43:17.000 --> 0:43:19.040
<v Speaker 4>secure that sizing in the market.

0:43:19.520 --> 0:43:22.279
<v Speaker 3>So Joe and I are heading to Hong Kong very soon,

0:43:22.400 --> 0:43:24.880
<v Speaker 3>and I expect that AI in China is going to

0:43:24.880 --> 0:43:28.040
<v Speaker 3>be a big topic of conversation. How would you characterize

0:43:28.080 --> 0:43:31.200
<v Speaker 3>I guess the difference between the US and the Chinese

0:43:31.200 --> 0:43:33.759
<v Speaker 3>market at the moment. I'm sure this is something you

0:43:33.840 --> 0:43:36.960
<v Speaker 3>think about, even though you don't participate in the Chinese

0:43:37.000 --> 0:43:37.720
<v Speaker 3>market directly.

0:43:38.239 --> 0:43:40.440
<v Speaker 2>Yeah, that's Tracy asking for questions.

0:43:40.480 --> 0:43:41.680
<v Speaker 3>Yeahs Basically it's.

0:43:41.480 --> 0:43:43.960
<v Speaker 2>Like questions that we can ask people when we're over there.

0:43:44.040 --> 0:43:46.920
<v Speaker 4>Yeah, that's likely going to be my response, Tracy is like,

0:43:47.160 --> 0:43:50.759
<v Speaker 4>we just do not participate in that market, I think

0:43:50.760 --> 0:43:53.520
<v Speaker 4>that there's opportunity for us to be expanding. As you

0:43:53.520 --> 0:43:58.120
<v Speaker 4>guys know, we operate in Canada, Europe. I think moving

0:43:58.840 --> 0:44:01.560
<v Speaker 4>further east makes a lot of sense for us, But

0:44:01.600 --> 0:44:04.120
<v Speaker 4>we're trying to be very methodical in the way that

0:44:04.160 --> 0:44:07.560
<v Speaker 4>we expand, So unfortunately, I'm not gonna be able to

0:44:07.560 --> 0:44:10.640
<v Speaker 4>healthy with specific questions in that market. But I would

0:44:10.680 --> 0:44:12.960
<v Speaker 4>imagine you're going to encounter a lot of the same

0:44:13.000 --> 0:44:16.840
<v Speaker 4>things that you're seeing in the US, which is just insatiable,

0:44:17.160 --> 0:44:19.880
<v Speaker 4>unrelenting demand for AI And like you know, we just

0:44:20.000 --> 0:44:22.719
<v Speaker 4>kind of keep coming back to this. It's like there

0:44:22.920 --> 0:44:30.239
<v Speaker 4>is no solution in sight for being able to satiate demand, right,

0:44:30.280 --> 0:44:34.319
<v Speaker 4>There's just too many supply chain there's no path to

0:44:34.400 --> 0:44:36.960
<v Speaker 4>solving demand in the near term or even the medium term.

0:44:37.000 --> 0:44:39.799
<v Speaker 2>Frankly, you mentioned sir Tracy has two about land use.

0:44:39.880 --> 0:44:42.560
<v Speaker 2>You said really was an issue. But like the first

0:44:42.640 --> 0:44:45.440
<v Speaker 2>time we talked to you in twenty twenty three or

0:44:45.480 --> 0:44:50.240
<v Speaker 2>whenever that was, there was not a major growing movement

0:44:50.440 --> 0:44:53.320
<v Speaker 2>of people who were just like anti data centers in America.

0:44:53.360 --> 0:44:55.640
<v Speaker 2>Maybe there were a few fringed people, but it was

0:44:55.680 --> 0:44:58.239
<v Speaker 2>not something that was on the mind of politicians and

0:44:58.640 --> 0:45:02.000
<v Speaker 2>activists and so forth. You do see these headlines, you

0:45:02.040 --> 0:45:06.080
<v Speaker 2>know about some projects really having been shelved. It was

0:45:06.120 --> 0:45:08.680
<v Speaker 2>like a big one. Northern Virginia is a huge hotspot

0:45:08.719 --> 0:45:10.440
<v Speaker 2>for it and there was a big project that was

0:45:10.640 --> 0:45:12.720
<v Speaker 2>they pulled the plug on due to them they couldn't

0:45:12.719 --> 0:45:15.640
<v Speaker 2>get an agreement with the local government. That must have

0:45:15.800 --> 0:45:17.600
<v Speaker 2>affect you. What are you seeing in terms of like

0:45:17.800 --> 0:45:22.560
<v Speaker 2>your capacity to build? How has it changed specifically in

0:45:22.719 --> 0:45:25.880
<v Speaker 2>light of or have you seen a change? Would you

0:45:25.880 --> 0:45:28.120
<v Speaker 2>be able to build faster in a world where this

0:45:28.239 --> 0:45:31.280
<v Speaker 2>had never become a political hot button issue.

0:45:31.400 --> 0:45:35.960
<v Speaker 4>I believe it has become that hot button issue. It's

0:45:35.960 --> 0:45:41.080
<v Speaker 4>something that we're quite proactive about in market and I

0:45:41.080 --> 0:45:42.960
<v Speaker 4>think you just kind of go through the checks on

0:45:42.960 --> 0:45:45.600
<v Speaker 4>the diligence process to make sure you're going through it correctly.

0:45:45.960 --> 0:45:49.840
<v Speaker 4>I think that there's misconceptions out there, like water usage.

0:45:49.960 --> 0:45:52.640
<v Speaker 2>Yeah, setting aside, the misconcept like setting aside, I know,

0:45:52.800 --> 0:45:55.560
<v Speaker 2>setting aside, the whole debate about but just in terms of,

0:45:55.600 --> 0:45:59.280
<v Speaker 2>like operationally, what's it changed for you in terms.

0:45:59.040 --> 0:46:03.160
<v Speaker 4>Of Yeah, no, I would say our greatest challenge is

0:46:03.239 --> 0:46:07.200
<v Speaker 4>still just getting that delivery of our like the construction

0:46:07.400 --> 0:46:10.120
<v Speaker 4>and all the goings and getting everything in there like

0:46:10.160 --> 0:46:13.080
<v Speaker 4>that is truly more of the ball back that's in

0:46:13.080 --> 0:46:14.080
<v Speaker 4>the market today.

0:46:14.440 --> 0:46:17.239
<v Speaker 2>Brandon, thank you so much for coming back on odd lugs.

0:46:17.600 --> 0:46:20.400
<v Speaker 2>I'll have you back next month for another market about it. No,

0:46:20.480 --> 0:46:23.440
<v Speaker 2>or at least or maybe in three years, not three years,

0:46:24.120 --> 0:46:24.680
<v Speaker 2>not three years.

0:46:24.680 --> 0:46:26.600
<v Speaker 4>But really that's an eternity.

0:46:26.760 --> 0:46:29.040
<v Speaker 2>Yeah, I know, thank you so much.

0:46:29.160 --> 0:46:31.000
<v Speaker 4>Thanks guys, appreciate it.

0:46:43.239 --> 0:46:47.440
<v Speaker 2>I'm very excited about whether compute features will take off.

0:46:47.480 --> 0:46:49.680
<v Speaker 2>I think this is an exciting like story. You know,

0:46:49.960 --> 0:46:51.520
<v Speaker 2>it's not the biggest story in the world, but it

0:46:51.600 --> 0:46:52.920
<v Speaker 2>is actually a very exciting story.

0:46:52.960 --> 0:46:55.560
<v Speaker 3>I've said this before. Even if you're not that interested

0:46:55.680 --> 0:46:59.520
<v Speaker 3>in AI, this is a really interesting market structure story, right.

0:46:59.560 --> 0:47:02.360
<v Speaker 3>It's based the creation of a brand new market and

0:47:02.640 --> 0:47:06.799
<v Speaker 3>poses all these interesting philosophical questions about how you do that.

0:47:06.920 --> 0:47:10.680
<v Speaker 3>And I thought Brandon's point about fungibility. I mean, that

0:47:10.800 --> 0:47:14.279
<v Speaker 3>is a real issue, and it does seem like it's

0:47:14.320 --> 0:47:18.239
<v Speaker 3>a challenging one to fix at the moment. I don't

0:47:18.239 --> 0:47:22.040
<v Speaker 3>know if it's inevitable in the future, but who knows.

0:47:22.160 --> 0:47:23.560
<v Speaker 2>No, No, I mean it makes a lot of sense.

0:47:23.560 --> 0:47:26.920
<v Speaker 2>This was also Lewis Hart's point that it's like the

0:47:27.000 --> 0:47:30.040
<v Speaker 2>fun you know, it's in the word commodity, right if

0:47:30.080 --> 0:47:31.960
<v Speaker 2>it's if it's not a commodity, you're not going to

0:47:32.000 --> 0:47:34.640
<v Speaker 2>get a commodity market for it. And of course a

0:47:34.719 --> 0:47:38.240
<v Speaker 2>number of entities are betting that it will be commoditized.

0:47:38.640 --> 0:47:42.440
<v Speaker 2>But if the if it's true that like you know,

0:47:42.600 --> 0:47:46.719
<v Speaker 2>they're getting more difficult to work, that the technical demands

0:47:46.920 --> 0:47:50.719
<v Speaker 2>on the influence provider, on the data center company are

0:47:50.719 --> 0:47:54.840
<v Speaker 2>getting greater in order to get the maximum you know, juice,

0:47:55.040 --> 0:47:57.719
<v Speaker 2>then maybe it doesn't become commoditized. But I think that's

0:47:57.719 --> 0:47:58.920
<v Speaker 2>like a fascinating thing.

0:47:59.520 --> 0:48:03.080
<v Speaker 3>Like if you do see those efficiency improvements and new

0:48:03.160 --> 0:48:05.799
<v Speaker 3>designs and things like that, you could imagine that, like

0:48:05.880 --> 0:48:08.839
<v Speaker 3>the demand is there for standardized GPU as well.

0:48:09.040 --> 0:48:09.960
<v Speaker 2>Yeah, I don't know.

0:48:10.000 --> 0:48:11.719
<v Speaker 3>Like I'm really torn. It feels like they should go

0:48:11.760 --> 0:48:12.200
<v Speaker 3>either way.

0:48:12.320 --> 0:48:15.160
<v Speaker 2>Well, and even in his answer, he talked about how

0:48:15.160 --> 0:48:20.560
<v Speaker 2>they can figure their own GPUs to expect largely that

0:48:20.800 --> 0:48:25.520
<v Speaker 2>Nvidia itself has come up with, so in theory, like

0:48:25.640 --> 0:48:30.000
<v Speaker 2>there is aspect that everyone can match to. So that's

0:48:30.040 --> 0:48:33.239
<v Speaker 2>like a really interesting that's a really interesting question. I

0:48:33.280 --> 0:48:37.360
<v Speaker 2>also really want to do more on all of these,

0:48:37.719 --> 0:48:42.560
<v Speaker 2>so Google his TPUs, Amazon his tranium. Microsoft has its

0:48:42.560 --> 0:48:46.920
<v Speaker 2>own hardware. Maybe even Jane Streets in the Hudson River

0:48:47.000 --> 0:48:50.000
<v Speaker 2>tradings will have their own hardware if they're not, like

0:48:50.360 --> 0:48:53.920
<v Speaker 2>I want to understand better why, right, because like they

0:48:53.960 --> 0:48:56.319
<v Speaker 2>presumably have some reason and they at least like the

0:48:56.320 --> 0:48:58.400
<v Speaker 2>Microsoft will say, well, this will run better on our

0:48:58.440 --> 0:49:02.480
<v Speaker 2>customers hardware. I want to understand why that would be.

0:49:02.800 --> 0:49:06.400
<v Speaker 2>How much difference in performance is there? And then the

0:49:06.480 --> 0:49:12.160
<v Speaker 2>degree to which demand materializes from users for non in

0:49:12.239 --> 0:49:14.040
<v Speaker 2>video silica, And there's like a really big question.

0:49:14.200 --> 0:49:15.720
<v Speaker 3>Yeah, why custom chips?

0:49:15.800 --> 0:49:18.200
<v Speaker 2>Yeah, and what can you get out of that if

0:49:18.200 --> 0:49:21.840
<v Speaker 2>you align model and chip to optimally work together. I

0:49:21.880 --> 0:49:24.160
<v Speaker 2>have no idea, but I feel like it's an episode

0:49:24.160 --> 0:49:24.759
<v Speaker 2>I would like to do.

0:49:24.880 --> 0:49:26.839
<v Speaker 3>Yeah, we should, all right? Shall we leave it there

0:49:26.840 --> 0:49:28.040
<v Speaker 3>in the meantime, Let's leave it there?

0:49:28.040 --> 0:49:28.359
<v Speaker 4>All right?

0:49:28.400 --> 0:49:30.800
<v Speaker 3>This has been another episode of the All Thoughts podcast.

0:49:30.920 --> 0:49:34.120
<v Speaker 3>I'm Tracy Alloway. You can follow me at Tracy Alloway.

0:49:33.800 --> 0:49:36.560
<v Speaker 2>And I'm Joe Wisenthal. You can follow me at the Stalwart.

0:49:36.719 --> 0:49:40.200
<v Speaker 2>Follow our guest Brandon McBee at Brandon McBee. Follow our

0:49:40.200 --> 0:49:43.920
<v Speaker 2>producers Carmen Rodriguez at Carmen armand Dash'll Bennett at Dashbod,

0:49:44.000 --> 0:49:47.560
<v Speaker 2>kelb Brooks at Kelbrooks and Kevin Lozano at Kevin Lloyd Lozano.

0:49:48.040 --> 0:49:50.319
<v Speaker 2>From our odd Laws content, go to Bloomberg dot com

0:49:50.320 --> 0:49:52.440
<v Speaker 2>slash odd lots or have a daily newsletter and all

0:49:52.480 --> 0:49:54.719
<v Speaker 2>of our episodes and you can shout about all these

0:49:54.760 --> 0:49:58.120
<v Speaker 2>topics twenty four to seven in our discord Discord dot

0:49:58.160 --> 0:49:59.960
<v Speaker 2>gg slash oddlines.

0:49:59.520 --> 0:50:01.200
<v Speaker 3>And if you and enjoy all thoughts, if you want

0:50:01.239 --> 0:50:03.600
<v Speaker 3>us to do an episode on custom chips, then please

0:50:03.680 --> 0:50:06.600
<v Speaker 3>leave us a positive review on your favorite podcast platform.

0:50:06.920 --> 0:50:09.520
<v Speaker 3>And remember, if you are a Bloomberg subscriber, you can

0:50:09.560 --> 0:50:12.839
<v Speaker 3>listen to all of our episodes absolutely ad free. All

0:50:12.840 --> 0:50:14.880
<v Speaker 3>you need to do is find the Bloomberg channel on

0:50:14.960 --> 0:50:18.520
<v Speaker 3>Apple Podcasts and follow the instructions there. Thanks for listening.