WEBVTT - Josh Wolfe: The ChatGPT of Robotics is Coming

0:00:03.120 --> 0:00:08.360
<v Speaker 1>Bloomberg Audio Studios, Podcasts, Radio News.

0:00:20.200 --> 0:00:23.640
<v Speaker 2>Hello and welcome to another episode of the Odd Lots Podcast.

0:00:23.720 --> 0:00:26.040
<v Speaker 3>I'm Joe Wisenthal and I'm Tracy Alloway.

0:00:26.200 --> 0:00:27.880
<v Speaker 2>Tracy, let's talk about AI some more.

0:00:29.960 --> 0:00:32.760
<v Speaker 1>Okay, Well, we could just have AI write the script

0:00:32.760 --> 0:00:34.360
<v Speaker 1>for us. We could give ourselves some time.

0:00:34.479 --> 0:00:36.360
<v Speaker 2>No, I don't think the technology is there yet, you

0:00:36.400 --> 0:00:39.680
<v Speaker 2>know what, so I can't say who. Maybe I was

0:00:39.720 --> 0:00:43.000
<v Speaker 2>trying to a professor recently and she said something really

0:00:43.040 --> 0:00:45.720
<v Speaker 2>interesting to me. I'm not supposed I'm not sure it's fine,

0:00:45.720 --> 0:00:48.279
<v Speaker 2>I think, And she said, like, you know, there's all

0:00:48.280 --> 0:00:51.879
<v Speaker 2>this anxiety about you know, kids cheating on their essays

0:00:51.960 --> 0:00:55.279
<v Speaker 2>or having Chad g GBT write their essays for them,

0:00:55.320 --> 0:00:58.120
<v Speaker 2>and like, you know, supposedly professors are like tearing their

0:00:58.120 --> 0:01:00.400
<v Speaker 2>hair out trying to figure out what to do about this,

0:01:00.480 --> 0:01:03.200
<v Speaker 2>and the AI detector don't really work all that well particularly,

0:01:03.560 --> 0:01:06.399
<v Speaker 2>but apparently like it seems like the solution is to

0:01:06.560 --> 0:01:09.800
<v Speaker 2>just grade them as regular essays no matter what. And

0:01:09.840 --> 0:01:12.600
<v Speaker 2>it sounds like at this point all the chat GPT

0:01:13.040 --> 0:01:16.160
<v Speaker 2>essays are basically solid c essays, and so if you

0:01:16.200 --> 0:01:18.039
<v Speaker 2>just sort of like take them at phase value, even

0:01:18.080 --> 0:01:20.920
<v Speaker 2>if you think they might be AI generated at least

0:01:20.959 --> 0:01:23.160
<v Speaker 2>at this point, it doesn't seem like a way at

0:01:23.200 --> 0:01:25.960
<v Speaker 2>least for college students to write good essays.

0:01:25.760 --> 0:01:29.520
<v Speaker 1>Yet, so our baseline, our average is chat GPT.

0:01:29.680 --> 0:01:32.080
<v Speaker 3>Now, yeah, that's basically can you beat the bot?

0:01:32.240 --> 0:01:35.560
<v Speaker 1>Did you see the thing someone was tweeting about one

0:01:35.640 --> 0:01:39.200
<v Speaker 1>of the tells for AI generated words is if you

0:01:39.400 --> 0:01:42.119
<v Speaker 1>use the word delve. Yes, I saw that, which, as

0:01:42.160 --> 0:01:45.600
<v Speaker 1>someone who I'm sure has used delve numerous times on

0:01:45.640 --> 0:01:47.640
<v Speaker 1>this podcast and in my writing, I thought was a

0:01:47.680 --> 0:01:50.320
<v Speaker 1>little unfair. It is a little it's a cliche, but

0:01:50.520 --> 0:01:52.520
<v Speaker 1>that doesn't mean it's from AI.

0:01:52.720 --> 0:01:55.960
<v Speaker 2>Although being said, there's more to AI than just chat

0:01:56.000 --> 0:01:59.160
<v Speaker 2>GPT obviously in chatbots, and this sort of has come

0:01:59.240 --> 0:02:02.840
<v Speaker 2>up on a couple episodes recently, but only like very tangentially.

0:02:03.240 --> 0:02:06.840
<v Speaker 2>People talk about the use of like AI and industrial applications,

0:02:06.880 --> 0:02:08.280
<v Speaker 2>and I've seen a lot of stuff. There have been

0:02:08.280 --> 0:02:12.760
<v Speaker 2>a couple Bloomberg articles about some startups that sort of say, like, okay, well,

0:02:12.800 --> 0:02:15.800
<v Speaker 2>like what if we trained robots the same way we

0:02:15.880 --> 0:02:18.480
<v Speaker 2>trade large language models, where you feed them just like

0:02:18.680 --> 0:02:21.280
<v Speaker 2>tons and tons and tons of real world data, and

0:02:21.360 --> 0:02:23.360
<v Speaker 2>so that yeah, like you sure you still have to

0:02:23.360 --> 0:02:26.600
<v Speaker 2>solve the mechanical engineering part. But then what if that

0:02:26.760 --> 0:02:30.080
<v Speaker 2>allows them, like all this training data to do more

0:02:30.280 --> 0:02:34.760
<v Speaker 2>advanced industrial things like I don't know, like make a

0:02:34.800 --> 0:02:38.560
<v Speaker 2>pizza or you know, be a more powerful humanless assembly

0:02:38.600 --> 0:02:39.079
<v Speaker 2>line or.

0:02:39.040 --> 0:02:39.840
<v Speaker 3>Something like that.

0:02:40.240 --> 0:02:42.800
<v Speaker 2>Where it's like we see all these oppressive robots and

0:02:42.880 --> 0:02:45.680
<v Speaker 2>videos like Boston Dynamics, but I never know like if

0:02:45.720 --> 0:02:48.000
<v Speaker 2>any of this is like quite there yet in terms

0:02:48.000 --> 0:02:49.000
<v Speaker 2>of having its value.

0:02:49.160 --> 0:02:52.679
<v Speaker 1>Yeah. So the robotics aspect of AI is something that's

0:02:52.800 --> 0:02:55.799
<v Speaker 1>incredibly interesting to me. It kind of makes me think

0:02:55.960 --> 0:02:58.760
<v Speaker 1>about the world that we want to see. So it

0:02:58.800 --> 0:03:02.280
<v Speaker 1>would be great if if we had physical robots who

0:03:02.320 --> 0:03:04.799
<v Speaker 1>are able to do stuff like clean up a house

0:03:05.120 --> 0:03:08.480
<v Speaker 1>or take care of an elderly family member or something

0:03:08.560 --> 0:03:11.320
<v Speaker 1>like that. It's not so great if all of our

0:03:11.320 --> 0:03:15.959
<v Speaker 1>technological prowess basically goes into writing satirical lyrics. Yes, via

0:03:16.080 --> 0:03:19.000
<v Speaker 1>chat GPT, like that's fun. I can do that myself,

0:03:19.400 --> 0:03:22.600
<v Speaker 1>But what I really need is someone to vacuum or

0:03:22.680 --> 0:03:23.600
<v Speaker 1>dust the house.

0:03:23.520 --> 0:03:24.320
<v Speaker 3>Do the laundry.

0:03:24.440 --> 0:03:26.760
<v Speaker 2>Yeah, exactly, really nice. All right, Well, I just want

0:03:26.760 --> 0:03:29.400
<v Speaker 2>to jump right into it because we really do have

0:03:29.680 --> 0:03:31.800
<v Speaker 2>the perfect guest. We're going to be speaking to someone

0:03:31.800 --> 0:03:34.960
<v Speaker 2>who has been investing in AI for a long time.

0:03:35.280 --> 0:03:39.040
<v Speaker 2>A lot of vcs like started investing in AI last

0:03:39.120 --> 0:03:42.560
<v Speaker 2>year obviously, but this is someone who has been investing

0:03:42.560 --> 0:03:45.360
<v Speaker 2>in AI for quite some time before it became the

0:03:45.360 --> 0:03:48.360
<v Speaker 2>hot new thing. We spoke to him last July and

0:03:48.440 --> 0:03:50.560
<v Speaker 2>had a great conversation about what he was seeing in

0:03:50.600 --> 0:03:53.400
<v Speaker 2>the space. So I'm really pleased to welcome back on

0:03:53.440 --> 0:03:55.240
<v Speaker 2>the show. We're going to be speaking with Josh Wolfe,

0:03:55.560 --> 0:03:59.360
<v Speaker 2>co founder and managing partner of Lux Capital. Josh, thank

0:03:59.360 --> 0:04:01.120
<v Speaker 2>you so much for coming back on ODLTS.

0:04:01.120 --> 0:04:03.440
<v Speaker 4>Great to be on. I feel like I should say

0:04:03.480 --> 0:04:04.920
<v Speaker 4>hello and like a robot voice.

0:04:06.560 --> 0:04:08.840
<v Speaker 3>So what's interesting to you these days? What are you

0:04:08.840 --> 0:04:10.360
<v Speaker 3>seeing out there that gets you excited?

0:04:10.720 --> 0:04:12.440
<v Speaker 4>Well, you know, you guys started this off with AI

0:04:12.720 --> 0:04:15.960
<v Speaker 4>and with an AI. Look, we've had what I would

0:04:15.960 --> 0:04:19.120
<v Speaker 4>call maybe a little bit pejorative, a little bit looting crude.

0:04:19.440 --> 0:04:21.880
<v Speaker 4>We've had chips everybody knows that we talked last time

0:04:21.880 --> 0:04:25.320
<v Speaker 4>about in AMD. We've got chatbots. You already have some

0:04:25.360 --> 0:04:27.479
<v Speaker 4>of these guys that are starting to fail. They've raised

0:04:27.480 --> 0:04:30.680
<v Speaker 4>billions of dollars and in some cases just you know,

0:04:30.800 --> 0:04:35.720
<v Speaker 4>relatively undifferentiated, big debate between open source that is approaching

0:04:35.760 --> 0:04:39.480
<v Speaker 4>the ASSYMP tote of achievement that the big private models

0:04:39.520 --> 0:04:42.799
<v Speaker 4>have and then being a little bit lout you've got chicks.

0:04:42.880 --> 0:04:44.719
<v Speaker 4>What do I mean by that? Most of the applications

0:04:44.720 --> 0:04:47.760
<v Speaker 4>in AI are the mundane and passe on the one side,

0:04:47.760 --> 0:04:51.520
<v Speaker 4>which would be like customer service and basic call center

0:04:52.000 --> 0:04:55.400
<v Speaker 4>supplementation or substitution. And then at the other end you

0:04:55.440 --> 0:04:57.400
<v Speaker 4>have people that are spending tens of thousands of dollars

0:04:57.440 --> 0:05:00.599
<v Speaker 4>a month in some case on AI girlfriends and people

0:05:00.640 --> 0:05:02.800
<v Speaker 4>that are doing what they often do with technology which

0:05:02.800 --> 0:05:05.840
<v Speaker 4>is used for prurient interests. So that to me the

0:05:05.880 --> 0:05:08.400
<v Speaker 4>two barbel extremes in AI, where people are actually making

0:05:08.440 --> 0:05:12.640
<v Speaker 4>money and profits serving demand for basic human instincts I

0:05:12.640 --> 0:05:16.680
<v Speaker 4>guess and needs. That's interesting. Overall, you're seeing a big

0:05:16.680 --> 0:05:19.719
<v Speaker 4>shift from the compute piece to the energy piece, meaning

0:05:19.960 --> 0:05:24.400
<v Speaker 4>people now recognize this bottleneck in AI is not going

0:05:24.440 --> 0:05:25.880
<v Speaker 4>to be so much about the chips. We also talked

0:05:25.880 --> 0:05:28.800
<v Speaker 4>about this months ago when we said, look, you don't

0:05:28.839 --> 0:05:31.960
<v Speaker 4>necessarily need these in video chips for inference, the part

0:05:32.000 --> 0:05:33.960
<v Speaker 4>that most people do when they're querying all these models.

0:05:33.960 --> 0:05:37.000
<v Speaker 4>You do need them for training, but the power levels

0:05:37.040 --> 0:05:39.599
<v Speaker 4>on these things are just enormous. There was a Dell

0:05:39.720 --> 0:05:42.960
<v Speaker 4>earnings call that they think sort of accidentally leaked that

0:05:43.040 --> 0:05:45.720
<v Speaker 4>this in VideA B one hundred Blackwell chip is going

0:05:45.760 --> 0:05:48.320
<v Speaker 4>to be a thousand wide draw of power, which is

0:05:48.360 --> 0:05:51.039
<v Speaker 4>like forty fifty percent more than these eight one hundred chips.

0:05:51.560 --> 0:05:53.840
<v Speaker 4>Why does that matter because now you got to figure

0:05:53.880 --> 0:05:56.279
<v Speaker 4>out how do you supply the energy for that. And

0:05:56.320 --> 0:05:59.880
<v Speaker 4>a tidbit which I thought was interesting was Amazon acquired

0:06:00.120 --> 0:06:03.520
<v Speaker 4>a nuclear powered data center in Pennsylvania. They've spent six

0:06:03.560 --> 0:06:06.080
<v Speaker 4>hundred and fifty million dollars, they got about a gig

0:06:06.200 --> 0:06:08.680
<v Speaker 4>lot of power. And I think that that is going

0:06:08.720 --> 0:06:09.880
<v Speaker 4>to be a trend. I think it's actually going to

0:06:09.960 --> 0:06:11.840
<v Speaker 4>usher in a wave of what I like to call

0:06:11.880 --> 0:06:14.880
<v Speaker 4>out elemental energy, but demand for nuclear power to power

0:06:14.920 --> 0:06:16.240
<v Speaker 4>these big AI data centers.

0:06:16.480 --> 0:06:18.160
<v Speaker 1>Yeah, it's kind of funny when you think about it.

0:06:18.200 --> 0:06:20.920
<v Speaker 1>I don't think anyone expected uranium to end up being

0:06:20.960 --> 0:06:23.680
<v Speaker 1>an AI play, but here we are. I want to

0:06:23.720 --> 0:06:26.280
<v Speaker 1>go back to something you said, and you actually brought

0:06:26.279 --> 0:06:28.960
<v Speaker 1>it up the last time we spoke to you last year,

0:06:29.240 --> 0:06:31.800
<v Speaker 1>and it was the idea of I guess the novelty

0:06:32.160 --> 0:06:36.200
<v Speaker 1>of some of these more public facing AI projects, And

0:06:36.360 --> 0:06:39.119
<v Speaker 1>I think you pointed out last year it's really fun

0:06:39.200 --> 0:06:42.080
<v Speaker 1>to use some of these things, like generate a bunch

0:06:42.080 --> 0:06:45.160
<v Speaker 1>of cartoon versions of yourself or whatever, but it might

0:06:45.200 --> 0:06:47.480
<v Speaker 1>not be a sustainable business model, and it might end

0:06:47.560 --> 0:06:51.920
<v Speaker 1>up being a functionality that is eventually incorporated into another

0:06:52.000 --> 0:06:55.520
<v Speaker 1>platform or a different project. Have you seen any example,

0:06:55.680 --> 0:07:01.080
<v Speaker 1>specific examples of more public facing novelty AI start to

0:07:01.320 --> 0:07:04.039
<v Speaker 1>like go away? I think you mentioned a few failures recently.

0:07:04.400 --> 0:07:06.280
<v Speaker 4>Yes, You've had a whole bunch of companies that basically

0:07:06.320 --> 0:07:09.200
<v Speaker 4>took chat, GPTA, GPT three or four and put a

0:07:09.200 --> 0:07:12.440
<v Speaker 4>wrapper around it, basically meaning give the average user who

0:07:12.480 --> 0:07:14.720
<v Speaker 4>doesn't know how to use these or even do prompts,

0:07:14.760 --> 0:07:17.080
<v Speaker 4>some means to interact with it. And those things raised

0:07:17.120 --> 0:07:18.720
<v Speaker 4>a bunch of money. They made these things accessible, and

0:07:18.720 --> 0:07:21.400
<v Speaker 4>they've sort of gone away. The foundation models themselves that

0:07:21.440 --> 0:07:24.560
<v Speaker 4>are undergirding all of this themselves are also starting to

0:07:24.560 --> 0:07:27.640
<v Speaker 4>be relatively commoditized away. And some of these things have

0:07:27.760 --> 0:07:29.560
<v Speaker 4>prominent people and have raised a lot of money, but

0:07:29.600 --> 0:07:32.640
<v Speaker 4>they are what I would consider failing. Take inflection. You've

0:07:32.600 --> 0:07:35.360
<v Speaker 4>got Mustapha Solimon, super smart guy, co founder of deep

0:07:35.400 --> 0:07:37.640
<v Speaker 4>mind raised I think a billion and a half for

0:07:37.680 --> 0:07:39.480
<v Speaker 4>that company. You know, I'm gonna be careful here because

0:07:39.520 --> 0:07:43.000
<v Speaker 4>Microsoft has been probably the savviest actor in this entire game,

0:07:43.320 --> 0:07:45.880
<v Speaker 4>figuring out that they can acquire things by doing things

0:07:45.920 --> 0:07:48.480
<v Speaker 4>in a clever way, skirting FTC and DJ oversight. They

0:07:48.480 --> 0:07:51.000
<v Speaker 4>effectively control open ai. As I think we also talked

0:07:51.000 --> 0:07:53.480
<v Speaker 4>about heard, particularly going into the end of last year

0:07:53.520 --> 0:07:55.960
<v Speaker 4>when there was all the drama around open Ai. Sadia said, look,

0:07:56.160 --> 0:07:58.120
<v Speaker 4>if open ai went out of business, we own it,

0:07:58.200 --> 0:08:00.600
<v Speaker 4>control it. We've got all the data up left, right center,

0:08:00.680 --> 0:08:03.320
<v Speaker 4>you know, all around them. Same thing with this company Inflection.

0:08:03.400 --> 0:08:04.840
<v Speaker 4>They did I think a six hundred and fifty six

0:08:04.920 --> 0:08:07.840
<v Speaker 4>hundred and seventy five million dollar license for the technology

0:08:08.240 --> 0:08:10.480
<v Speaker 4>that basically was a payment above and beyond what the

0:08:10.640 --> 0:08:13.360
<v Speaker 4>venture investors had made. Venture investors made a little bit

0:08:13.400 --> 0:08:16.320
<v Speaker 4>of money, not a lot. Key management went over to Microsoft.

0:08:16.360 --> 0:08:19.000
<v Speaker 4>But Microsoft has been very clever. So back to your question,

0:08:19.440 --> 0:08:20.920
<v Speaker 4>I think the big are going to get bigger and

0:08:20.960 --> 0:08:25.120
<v Speaker 4>are going to be most of the beneficiaries here, Microsoft, Adobe, Amazon,

0:08:25.280 --> 0:08:28.000
<v Speaker 4>Amazon themselves. Coming up on the one year anniversary of Bedrock,

0:08:28.040 --> 0:08:30.240
<v Speaker 4>they're going to announce that they've got the best performing

0:08:30.360 --> 0:08:33.320
<v Speaker 4>model with Anthropic, which they've made billions of dollars of

0:08:33.360 --> 0:08:36.200
<v Speaker 4>investments in now sort of competitor to Opening Eye CHATGPT.

0:08:36.280 --> 0:08:38.400
<v Speaker 4>They're also going to renounce something with one of our

0:08:38.440 --> 0:08:41.240
<v Speaker 4>companies that hasn't yet been publicly disclosed in biology. That's

0:08:41.280 --> 0:08:44.000
<v Speaker 4>going to be one of the two biggest waves next

0:08:44.040 --> 0:08:47.719
<v Speaker 4>biology and what you started the conversation with robotics. If

0:08:47.720 --> 0:08:51.319
<v Speaker 4>you just take robotics as an example, I think in

0:08:51.960 --> 0:08:53.839
<v Speaker 4>one of our companies, Hugging Face, which is one of

0:08:53.880 --> 0:08:56.880
<v Speaker 4>the main repositories for all these open source models, there's

0:08:56.920 --> 0:09:01.440
<v Speaker 4>something like sixty thousand text generations models. You know, it's

0:09:01.440 --> 0:09:03.720
<v Speaker 4>like fifty nine thousand, seven hundred something now, but just

0:09:03.760 --> 0:09:06.040
<v Speaker 4>an enormous number of text generation models. This is in

0:09:06.120 --> 0:09:08.520
<v Speaker 4>like two or three. It's like everybody's doing this. Everybody's

0:09:08.520 --> 0:09:10.280
<v Speaker 4>trying to do it all, basically trying to predict the

0:09:10.280 --> 0:09:13.760
<v Speaker 4>next word based on the prior one. What this transformer technology,

0:09:14.160 --> 0:09:17.520
<v Speaker 4>which was invented at Google, ended up parlaying into guess

0:09:17.520 --> 0:09:20.240
<v Speaker 4>how many robotic models there are? Fifty nine thousand in

0:09:20.280 --> 0:09:22.200
<v Speaker 4>text generation? Guess how many robotic models there are?

0:09:22.520 --> 0:09:23.120
<v Speaker 1>A fraction.

0:09:24.559 --> 0:09:27.600
<v Speaker 4>I mean, I'm obviously leading with my phone are but

0:09:28.360 --> 0:09:31.319
<v Speaker 4>nineteen robotic models. Okay, so you got that to me.

0:09:31.960 --> 0:09:34.040
<v Speaker 4>As a venture investor, we're just always looking at where's

0:09:34.040 --> 0:09:39.200
<v Speaker 4>their abundance, where's their scarcity? Their scarcity of robotic models? Now, why, Well,

0:09:39.240 --> 0:09:42.920
<v Speaker 4>it's relatively easy to train on the open Internet. You've

0:09:42.920 --> 0:09:45.400
<v Speaker 4>got Wikipedia, you've got YouTube videos. You know, whether you're

0:09:45.400 --> 0:09:47.120
<v Speaker 4>not you're supposed to be doing that or not. Like

0:09:47.120 --> 0:09:50.000
<v Speaker 4>the woman who was asked it, soa hey, how did

0:09:50.040 --> 0:09:51.560
<v Speaker 4>you train these things? Was it on YouTube? And she

0:09:51.600 --> 0:09:54.280
<v Speaker 4>gave it and you probably saw that, So there's gonna

0:09:54.280 --> 0:09:56.400
<v Speaker 4>be all kinds of copyright stuff on that. Robots are hard.

0:09:56.440 --> 0:09:59.400
<v Speaker 4>Why Most of the robotic stuff that has been out

0:09:59.400 --> 0:10:01.480
<v Speaker 4>in the world, like you talked about in your intro,

0:10:02.200 --> 0:10:06.320
<v Speaker 4>is constrained in manufacturing facilities, in work cells on an

0:10:06.320 --> 0:10:11.600
<v Speaker 4>assembly line, very specific, parametrically constrained, so very few degrees

0:10:11.600 --> 0:10:13.960
<v Speaker 4>of freedom of what they're actually doing. The robots themselves

0:10:14.040 --> 0:10:16.520
<v Speaker 4>might have multi access scripts and controllers, but they're not

0:10:16.559 --> 0:10:19.880
<v Speaker 4>moving around very freely. You have exceptions with like Amazon

0:10:20.120 --> 0:10:24.200
<v Speaker 4>which acquired Kiva, moving the warehouse inventory stuff around, but

0:10:24.280 --> 0:10:30.360
<v Speaker 4>again relatively xyz access not unstructured environments. You and I

0:10:30.840 --> 0:10:33.880
<v Speaker 4>and our listeners. We all thrive every day in unstructured

0:10:33.960 --> 0:10:38.240
<v Speaker 4>environments and that is where you need enormous training data.

0:10:38.440 --> 0:10:40.280
<v Speaker 4>You can't search the internet for that, So how do

0:10:40.320 --> 0:10:43.079
<v Speaker 4>you do it. There's a few things that have emerged,

0:10:43.080 --> 0:10:45.920
<v Speaker 4>and you mentioned some articles. We funded a company that

0:10:45.960 --> 0:10:48.400
<v Speaker 4>recently came out of STELL called Physical Intelligence stead of

0:10:48.480 --> 0:10:51.559
<v Speaker 4>artificial intelligence, Physical Intelligence, and it is the krem Dela

0:10:51.600 --> 0:10:54.320
<v Speaker 4>Creme team from Stanford and Berkeley. You've got some open

0:10:54.360 --> 0:10:57.240
<v Speaker 4>ai folks, You've got Google Deep Mind folks. They took

0:10:57.240 --> 0:10:59.600
<v Speaker 4>investment from open ai US and a bunch of other

0:10:59.679 --> 0:11:03.560
<v Speaker 4>vcs and they are just twenty four to seven training

0:11:03.640 --> 0:11:07.360
<v Speaker 4>robots doing all kinds of crazy things like folding, laundry, pouring.

0:11:07.440 --> 0:11:12.240
<v Speaker 4>Determined but to let these robots encounter unstructured environments and

0:11:12.280 --> 0:11:15.480
<v Speaker 4>then be able to thrive in them. The next thing

0:11:15.480 --> 0:11:18.480
<v Speaker 4>that you're going to see are visual models where you're

0:11:18.520 --> 0:11:21.760
<v Speaker 4>effectively giving, like an ikea sketch or you're drawing something,

0:11:21.840 --> 0:11:24.800
<v Speaker 4>and you're being able to instruct the robot to have

0:11:24.840 --> 0:11:27.439
<v Speaker 4>a sense of intuitive physics of how the world works

0:11:27.440 --> 0:11:29.559
<v Speaker 4>and how things might connect to each other and then

0:11:30.120 --> 0:11:33.360
<v Speaker 4>learn from that. And then we're also training these robots

0:11:33.360 --> 0:11:36.560
<v Speaker 4>with simple verbal cues. So there's a video you can

0:11:36.600 --> 0:11:39.440
<v Speaker 4>see online from some of the researchers where they are

0:11:39.480 --> 0:11:42.560
<v Speaker 4>picking nuts and m and ms and separating them, you know,

0:11:42.640 --> 0:11:44.440
<v Speaker 4>just as a task of being able to sort and

0:11:44.480 --> 0:11:47.480
<v Speaker 4>filter with precision and dexterity, and if they picked a

0:11:47.480 --> 0:11:50.120
<v Speaker 4>wrong one, you can actually instead of physically grabbing the things,

0:11:50.120 --> 0:11:53.760
<v Speaker 4>say stop grab the Eminem's not the nuts. And now

0:11:53.800 --> 0:11:57.040
<v Speaker 4>it knows that. So I think that we're about to

0:11:57.120 --> 0:12:01.000
<v Speaker 4>unleash in robotics what will become a chat GPT like

0:12:01.080 --> 0:12:03.439
<v Speaker 4>moment where people are so used to seeing robots and

0:12:03.480 --> 0:12:05.679
<v Speaker 4>they see the arms, and they've seen West World and

0:12:05.720 --> 0:12:08.600
<v Speaker 4>this kind of stuff, and suddenly something happens that just

0:12:08.679 --> 0:12:11.240
<v Speaker 4>blows your mind. And I think that's coming soon.

0:12:27.120 --> 0:12:29.960
<v Speaker 2>That's pretty exciting because, like I said, you know, for

0:12:30.120 --> 0:12:33.000
<v Speaker 2>like at least a decade, I've been watching those Boston

0:12:33.480 --> 0:12:36.440
<v Speaker 2>whatever online, those youtubes, and at this point I'm sort

0:12:36.440 --> 0:12:38.960
<v Speaker 2>of convinced that it's like basically a content generator because

0:12:38.960 --> 0:12:41.280
<v Speaker 2>it never seems like they're like crazy robot dogs or

0:12:41.320 --> 0:12:44.040
<v Speaker 2>anything like ever become commercial. But maybe this is the

0:12:44.040 --> 0:12:46.880
<v Speaker 2>missing link. But you've brought up like three different avenues

0:12:46.920 --> 0:12:48.400
<v Speaker 2>we could go on and I want to sort of

0:12:48.400 --> 0:12:51.120
<v Speaker 2>eventually hit on any of them. Here's a specific question,

0:12:51.120 --> 0:12:53.920
<v Speaker 2>and then we can maybe get back to robotics. This

0:12:54.080 --> 0:12:57.760
<v Speaker 2>element where there is such a shortage of advanced, cutting

0:12:57.880 --> 0:12:59.840
<v Speaker 2>edge talent, people who really know how to do this.

0:13:00.200 --> 0:13:02.960
<v Speaker 2>And you mentioned that guy that got hired by Microsoft

0:13:03.000 --> 0:13:07.440
<v Speaker 2>from his other companies as an investor in AI or

0:13:07.559 --> 0:13:11.160
<v Speaker 2>robotics startups. Is this a dynamic that's different than in

0:13:11.280 --> 0:13:15.400
<v Speaker 2>other software or other tech investing. Basically, this sort of

0:13:15.440 --> 0:13:18.520
<v Speaker 2>like highly skilled tech qman risk.

0:13:18.600 --> 0:13:22.640
<v Speaker 4>Basically yes, in that you always are looking for what's

0:13:22.640 --> 0:13:25.040
<v Speaker 4>scarce and you want scarce talent. If anybody could do this,

0:13:25.120 --> 0:13:27.920
<v Speaker 4>it's just not that valuable companies would get funded. Vcs

0:13:27.920 --> 0:13:30.440
<v Speaker 4>would fund forty of them at the same time, or

0:13:30.520 --> 0:13:33.120
<v Speaker 4>maybe four hundred of them. Contrast to things where it's

0:13:33.240 --> 0:13:35.840
<v Speaker 4>very web based or the old groupons of yesterday. This

0:13:35.920 --> 0:13:40.160
<v Speaker 4>is highly technical, often PhD scientists. The vast majority of

0:13:40.200 --> 0:13:43.080
<v Speaker 4>the founding teams that we've backed at companies like Covariant

0:13:43.360 --> 0:13:46.760
<v Speaker 4>or format or this new company Physical Intelligence, they're all

0:13:46.840 --> 0:13:50.319
<v Speaker 4>PhDs that are coming out of Stanford, Carnegie, Mel and MIT.

0:13:50.800 --> 0:13:53.640
<v Speaker 4>Some of the best robotics programs in the world, and

0:13:53.840 --> 0:13:57.160
<v Speaker 4>there's lineage of these great professors. Many of them have passed,

0:13:57.160 --> 0:13:59.320
<v Speaker 4>but for example, there's this one guy, Hans Morvec used

0:13:59.320 --> 0:14:00.679
<v Speaker 4>to be a carnegiemail and I got to meet him

0:14:00.679 --> 0:14:02.199
<v Speaker 4>when he was still alive, but he was one of

0:14:02.240 --> 0:14:04.880
<v Speaker 4>the very early pioneers in robotics. And he's got this

0:14:05.000 --> 0:14:09.199
<v Speaker 4>paradox that insiders in the robotics world called the Morvek paradox,

0:14:09.520 --> 0:14:12.320
<v Speaker 4>which is this weird counterintuitive phenomenon which is basically like

0:14:12.679 --> 0:14:15.120
<v Speaker 4>all the stuff that we think is really hard is

0:14:15.120 --> 0:14:17.920
<v Speaker 4>actually pretty easy for AI, and all the stuff that

0:14:17.960 --> 0:14:20.760
<v Speaker 4>we find totally intuitive and easy, like riding a bike,

0:14:21.000 --> 0:14:24.440
<v Speaker 4>that's really hard for robots. So there's this great paradox

0:14:24.480 --> 0:14:26.560
<v Speaker 4>that some of the most brilliant researchers are working on,

0:14:27.400 --> 0:14:29.040
<v Speaker 4>which is how do we do the kind of stuff

0:14:29.040 --> 0:14:31.280
<v Speaker 4>that a four year old can do very intuitively with

0:14:31.360 --> 0:14:33.560
<v Speaker 4>these very complex, expensive machines. And there's all kinds of

0:14:33.560 --> 0:14:35.760
<v Speaker 4>considerations we could talk about about where are these arms

0:14:35.840 --> 0:14:39.200
<v Speaker 4>coming from? The acquisitions that China has been making from

0:14:39.240 --> 0:14:41.520
<v Speaker 4>what historically was a lot of German companies. I mean

0:14:41.520 --> 0:14:43.760
<v Speaker 4>when I say arms meeting the robotic arms they can

0:14:43.840 --> 0:14:46.760
<v Speaker 4>move thing. And then there's this great philosophical debate that

0:14:46.760 --> 0:14:48.600
<v Speaker 4>it hasn't yet come to four. But I believe will

0:14:49.000 --> 0:14:51.120
<v Speaker 4>and investors are sort of lining up on this. I'm

0:14:51.160 --> 0:14:55.680
<v Speaker 4>on the opposite side of some people are funding humanoid robots.

0:14:56.120 --> 0:14:57.600
<v Speaker 4>And the reason that I say I'm on the opposite

0:14:57.600 --> 0:14:59.640
<v Speaker 4>side of it is I don't really believe in them. Yes,

0:15:00.040 --> 0:15:01.480
<v Speaker 4>you would want somebody to help take care of your

0:15:01.520 --> 0:15:04.920
<v Speaker 4>grandma and maybe provide some companionship. But this idea of

0:15:04.960 --> 0:15:08.960
<v Speaker 4>the movies of these ex machina kind of robots that

0:15:09.160 --> 0:15:13.800
<v Speaker 4>embody a human form. We know that engineering is better

0:15:13.920 --> 0:15:16.840
<v Speaker 4>than evolution. If we were inventing a car tomorrow, it

0:15:16.840 --> 0:15:19.400
<v Speaker 4>would be a terrible idea to take Fred Flintstone and

0:15:19.520 --> 0:15:22.280
<v Speaker 4>use his feet, you know, to power these stone wheels.

0:15:22.320 --> 0:15:24.640
<v Speaker 4>We know that an actuator and the axle and an

0:15:24.640 --> 0:15:27.680
<v Speaker 4>engine or just better, and evolution didn't create that. Why

0:15:27.680 --> 0:15:31.480
<v Speaker 4>would we create these humanoid hands where if I'm twisting

0:15:31.520 --> 0:15:33.640
<v Speaker 4>the cap off of a bottle, you know I can

0:15:33.720 --> 0:15:35.440
<v Speaker 4>only I have to turn my hand like seven times

0:15:35.440 --> 0:15:38.600
<v Speaker 4>to do that. Whereas if I was just designed the

0:15:38.600 --> 0:15:40.800
<v Speaker 4>perfect robot, I'd have a little suction cup that would

0:15:40.800 --> 0:15:42.640
<v Speaker 4>go on top. It'd have like a drill bit mechanism.

0:15:42.680 --> 0:15:44.720
<v Speaker 4>It would quickly twist it off and then it would

0:15:44.920 --> 0:15:47.360
<v Speaker 4>Swiss army knife, you know, swap out for the next

0:15:48.280 --> 0:15:52.280
<v Speaker 4>technical gripper capability. So I think that people are misguided

0:15:52.800 --> 0:15:54.600
<v Speaker 4>and they're basically going to end up doing things for

0:15:54.680 --> 0:15:59.040
<v Speaker 4>like prosthetics or you know, something that's sort of Westworld like.

0:15:59.480 --> 0:16:02.120
<v Speaker 4>But I think the practical robots that we're going to

0:16:02.200 --> 0:16:03.680
<v Speaker 4>all be using in our homes are going to look

0:16:03.680 --> 0:16:05.160
<v Speaker 4>nothing like these humanoid robots.

0:16:05.360 --> 0:16:08.320
<v Speaker 1>This is funny. This is very reminiscent of a weird

0:16:08.400 --> 0:16:10.920
<v Speaker 1>conversation I used to have with my dad. He had

0:16:11.040 --> 0:16:16.040
<v Speaker 1>like some sort of bugbear around the shape of aliens hands,

0:16:16.400 --> 0:16:18.640
<v Speaker 1>and he was like, why are they always shown or

0:16:18.720 --> 0:16:24.040
<v Speaker 1>depicted in these illustrations as having like human like hands

0:16:24.160 --> 0:16:26.760
<v Speaker 1>or sometimes even three fingers, Like why wouldn't they have

0:16:26.960 --> 0:16:32.520
<v Speaker 1>just evolved to the next level of very very efficient physiology. Anyway,

0:16:32.800 --> 0:16:34.920
<v Speaker 1>One thing I wanted to ask, and I'm trying to

0:16:34.920 --> 0:16:37.560
<v Speaker 1>think how to frame this question or what the right

0:16:37.600 --> 0:16:43.320
<v Speaker 1>word is, but how open source is robotics in the

0:16:43.360 --> 0:16:46.720
<v Speaker 1>sense of, like how much of the technology is shareable

0:16:47.000 --> 0:16:49.600
<v Speaker 1>or replicable? Because I feel like one of the reasons,

0:16:49.640 --> 0:16:51.360
<v Speaker 1>and you touched on it earlier, but one of the

0:16:51.360 --> 0:16:55.280
<v Speaker 1>reasons we have seen this boom in AI is because

0:16:55.320 --> 0:16:58.880
<v Speaker 1>you can go to places like hugging Face and download

0:16:59.000 --> 0:17:02.160
<v Speaker 1>a bunch of code, open source code, and build off

0:17:02.200 --> 0:17:05.919
<v Speaker 1>of it and it sort of multiplies around itself. But

0:17:06.080 --> 0:17:09.159
<v Speaker 1>is there any aspect of that all in robotics or

0:17:09.200 --> 0:17:11.119
<v Speaker 1>is it just much more proprietary.

0:17:11.520 --> 0:17:15.040
<v Speaker 4>The hardware piece has historically been very proprietary, although there's

0:17:15.080 --> 0:17:18.120
<v Speaker 4>lots of knockoff things. There is a Chinese company which

0:17:18.160 --> 0:17:20.480
<v Speaker 4>is increasingly dominating the field. A lot of people don't

0:17:20.480 --> 0:17:23.960
<v Speaker 4>know the name called Unitry Uni Tree Unitry that is

0:17:24.000 --> 0:17:27.200
<v Speaker 4>sort of copying the Boston Dynamics robots that Joe was

0:17:27.240 --> 0:17:29.720
<v Speaker 4>talking about and that you see in Black Mirror episodes

0:17:29.880 --> 0:17:33.560
<v Speaker 4>and those kinds of things. On the software side, it

0:17:33.720 --> 0:17:38.000
<v Speaker 4>really because it has the same kernel, the same origins

0:17:38.400 --> 0:17:41.840
<v Speaker 4>as many of the AI software things that led to

0:17:41.920 --> 0:17:45.600
<v Speaker 4>large language models and came from transformers academic roots and

0:17:45.640 --> 0:17:48.080
<v Speaker 4>academics like to share and publish, and of course you

0:17:48.119 --> 0:17:50.880
<v Speaker 4>can patent certain things, but by and large, the early

0:17:51.720 --> 0:17:55.560
<v Speaker 4>systems something called ROSS, as you might guess, robot operating system.

0:17:55.840 --> 0:17:58.159
<v Speaker 4>People that were doing something called r duino, which is

0:17:58.520 --> 0:18:02.119
<v Speaker 4>sort of for hobby programmers with hardware and software at

0:18:02.160 --> 0:18:06.119
<v Speaker 4>the intersection Poppy. There's a handful of these things. But again,

0:18:06.200 --> 0:18:08.480
<v Speaker 4>now you're in this mode where you need to find

0:18:08.560 --> 0:18:11.840
<v Speaker 4>training data and you need to do the work and

0:18:11.920 --> 0:18:14.560
<v Speaker 4>spend the time, and that costs money. So you will

0:18:14.600 --> 0:18:17.280
<v Speaker 4>have a mix of open and closed models. And if

0:18:17.320 --> 0:18:19.760
<v Speaker 4>you take a company like Physical Intelligence, their mo their

0:18:19.800 --> 0:18:23.200
<v Speaker 4>motive is is we want to build the operating system

0:18:23.200 --> 0:18:25.639
<v Speaker 4>that any robot can basically use to navigate the world.

0:18:25.640 --> 0:18:28.800
<v Speaker 4>They want to build the brains for the robots as

0:18:28.800 --> 0:18:32.800
<v Speaker 4>opposed to the robots themselves. And there's an interesting philosophical

0:18:32.840 --> 0:18:36.800
<v Speaker 4>and scientific tangent which Barbara Tavski, who is a friend

0:18:36.840 --> 0:18:39.879
<v Speaker 4>and she was the partner of now the late Danny Connoman,

0:18:39.920 --> 0:18:42.439
<v Speaker 4>who was also a friend, her work and she was

0:18:42.600 --> 0:18:44.880
<v Speaker 4>far less famous, but I think actually more important work

0:18:45.240 --> 0:18:49.359
<v Speaker 4>is all about motor function, and her hypothesis is that

0:18:49.440 --> 0:18:52.520
<v Speaker 4>the entire reason for the existence of the brain, like

0:18:52.560 --> 0:18:56.640
<v Speaker 4>the sole purpose of it is to actually produce movement,

0:18:56.880 --> 0:19:00.239
<v Speaker 4>movement towards food or a mate, or to run from

0:19:00.280 --> 0:19:02.119
<v Speaker 4>a prey or predator, which in turn is doing the

0:19:02.160 --> 0:19:04.800
<v Speaker 4>same kind of thing, and I think that some of

0:19:04.840 --> 0:19:11.680
<v Speaker 4>the most interesting philosophical questions about consciousness and memory, spatial perception,

0:19:12.160 --> 0:19:16.600
<v Speaker 4>embodied cognition, gesturing. I mean, I'm wildly gesticulating while I'm

0:19:16.600 --> 0:19:18.600
<v Speaker 4>talking now with my hands. It's just an innate thing

0:19:18.760 --> 0:19:21.040
<v Speaker 4>being able to do mental simulations, like when we think

0:19:21.080 --> 0:19:24.640
<v Speaker 4>about human brains and machine brains merging. I actually think

0:19:24.680 --> 0:19:28.800
<v Speaker 4>it's really going to be very revelatory as these robotic

0:19:28.800 --> 0:19:31.320
<v Speaker 4>systems advance and us understanding that's a lot of the

0:19:31.400 --> 0:19:34.879
<v Speaker 4>purpose of thinking and intelligence is actually just about moving.

0:19:35.359 --> 0:19:37.720
<v Speaker 4>And so that's a pretty cool side effect that I

0:19:37.720 --> 0:19:39.400
<v Speaker 4>think is going to come from all the commercial and

0:19:39.640 --> 0:19:42.400
<v Speaker 4>speculative ventures stuff that we do and funding these things.

0:19:42.720 --> 0:19:46.080
<v Speaker 2>You keep saying things that like jog me onto something

0:19:46.080 --> 0:19:47.560
<v Speaker 2>else that I was meaning to ask you, but you

0:19:47.680 --> 0:19:49.639
<v Speaker 2>mentioned this impulse.

0:19:49.240 --> 0:19:52.360
<v Speaker 3>That academics like to share their work, and.

0:19:52.320 --> 0:19:54.479
<v Speaker 2>That reminded me of something that someone was telling me.

0:19:54.640 --> 0:19:57.280
<v Speaker 2>So you know, maybe I'm sure you are on this

0:19:57.320 --> 0:19:59.679
<v Speaker 2>site a lot. But for listeners, there's this site archived

0:19:59.720 --> 0:20:03.320
<v Speaker 2>dot org where people like publish research and sort of

0:20:03.359 --> 0:20:06.560
<v Speaker 2>like an open source, sort of ungated manner about all

0:20:06.640 --> 0:20:09.760
<v Speaker 2>kinds of scientific and computer things and just today on

0:20:09.840 --> 0:20:12.879
<v Speaker 2>the artificial intelligence page, there's like fifteen new papers and

0:20:12.880 --> 0:20:16.520
<v Speaker 2>they have headlines like autonomous evaluation and Refinement of digital

0:20:16.560 --> 0:20:20.520
<v Speaker 2>agents or a modular benchmark framework to measure progress and

0:20:20.640 --> 0:20:23.760
<v Speaker 2>improve LLM agents. And this guy was who I was

0:20:23.760 --> 0:20:25.920
<v Speaker 2>talking to, made the contention that's like all this stuff

0:20:26.000 --> 0:20:29.400
<v Speaker 2>is being published and investors, a lot of vcs don't

0:20:29.440 --> 0:20:32.640
<v Speaker 2>really have these sort of like technical chops to judge

0:20:32.680 --> 0:20:33.720
<v Speaker 2>a lot of this research.

0:20:34.359 --> 0:20:35.200
<v Speaker 3>It's like the hear.

0:20:35.080 --> 0:20:39.320
<v Speaker 2>Take my money meme, I'm curious, like what you see

0:20:39.440 --> 0:20:42.040
<v Speaker 2>in this space where it's like there must be a

0:20:42.040 --> 0:20:44.720
<v Speaker 2>lot of investors like yourself who are like wowed by

0:20:44.960 --> 0:20:47.200
<v Speaker 2>PhDs like doing all kinds of stuff. It's like, we

0:20:47.280 --> 0:20:50.879
<v Speaker 2>got a one hundred x breakthrough in the energy efficiency

0:20:50.920 --> 0:20:53.760
<v Speaker 2>of this in video chip by training the model differently,

0:20:53.840 --> 0:20:56.560
<v Speaker 2>like how do you evaluate the science and how risky

0:20:56.640 --> 0:20:59.680
<v Speaker 2>is that for investors with a sort of like plethora

0:20:59.800 --> 0:21:02.240
<v Speaker 2>of ostensible breakthrough is happening left and right.

0:21:02.240 --> 0:21:04.480
<v Speaker 4>Well, you're right, there's endless I mean it is why

0:21:04.520 --> 0:21:06.320
<v Speaker 4>there's this endless frontier and sort of the batter for

0:21:06.400 --> 0:21:08.959
<v Speaker 4>bush sense, and it's always very exciting and you need

0:21:09.000 --> 0:21:11.040
<v Speaker 4>to have a very high filter for these things. Not

0:21:11.080 --> 0:21:13.720
<v Speaker 4>everything is commercent. Sometimes there's just a breakthrough, but maybe

0:21:13.760 --> 0:21:15.879
<v Speaker 4>that breakthrough can be licensed to a company. And so

0:21:15.920 --> 0:21:18.440
<v Speaker 4>the people who are actually commercializing this and thinking about

0:21:18.440 --> 0:21:22.399
<v Speaker 4>capital allocation and recruiting teams and then deciding these are

0:21:22.400 --> 0:21:24.920
<v Speaker 4>our top three priorities, and even though there's forty other

0:21:24.960 --> 0:21:27.080
<v Speaker 4>really exciting things that we could and maybe should be doing,

0:21:27.200 --> 0:21:28.880
<v Speaker 4>we're just not going to do that. Now. That's really

0:21:28.960 --> 0:21:32.280
<v Speaker 4>what company building is about. And so oftentimes we might

0:21:32.280 --> 0:21:34.440
<v Speaker 4>have a brilliant scientist, but maybe that person just isn't

0:21:34.480 --> 0:21:36.960
<v Speaker 4>a good salesperson. They can't tell a narrative and convince

0:21:37.000 --> 0:21:39.159
<v Speaker 4>people to move across the country and join them, they

0:21:39.160 --> 0:21:40.879
<v Speaker 4>can't raise capital, and therefore they're not going to be

0:21:40.920 --> 0:21:43.360
<v Speaker 4>a great entrepreneur and they're probably better off as a scientist.

0:21:43.640 --> 0:21:47.480
<v Speaker 4>But when we're making evaluations, it's how much money is

0:21:47.480 --> 0:21:49.600
<v Speaker 4>going to accomplish what in what period of time? And

0:21:49.600 --> 0:21:51.480
<v Speaker 4>who's going to care. It's like if you were playing poker.

0:21:51.520 --> 0:21:53.120
<v Speaker 4>You're looking at your hand, You're figuring out how much

0:21:53.119 --> 0:21:54.760
<v Speaker 4>money do you have to anti up for the next round,

0:21:55.000 --> 0:21:57.800
<v Speaker 4>And then what's the exogenous what's the outside view, what's

0:21:57.800 --> 0:21:59.880
<v Speaker 4>the market going to say, and will they care. That's why,

0:22:00.440 --> 0:22:01.719
<v Speaker 4>you know, we talked a little bit about this, but

0:22:01.800 --> 0:22:04.120
<v Speaker 4>very skeptical about other fields that people are funding right now,

0:22:04.720 --> 0:22:08.520
<v Speaker 4>in fusion or in quantum computing. I'm so skeptical about

0:22:08.560 --> 0:22:11.960
<v Speaker 4>them in part from hard and cynicism of twenty years

0:22:12.000 --> 0:22:14.800
<v Speaker 4>of people pitching us things that are always about unbreakable

0:22:14.840 --> 0:22:19.119
<v Speaker 4>cryptography and femtosecond and kneeling of this quantity, and I'm like, so,

0:22:19.280 --> 0:22:22.040
<v Speaker 4>what most of the things that people promised, you know,

0:22:22.119 --> 0:22:26.320
<v Speaker 4>about unbreakable cryptography or molecular modeling for people are doing.

0:22:26.359 --> 0:22:29.040
<v Speaker 4>They're just not using quantum computers. They're using GPUs, they're

0:22:29.080 --> 0:22:31.760
<v Speaker 4>using in video chips, they're using new algorithms. So I've

0:22:31.760 --> 0:22:34.320
<v Speaker 4>been very skeptical about that, also been very skeptical about

0:22:34.320 --> 0:22:37.200
<v Speaker 4>fusion for the same sort of reason that to your point,

0:22:37.240 --> 0:22:40.760
<v Speaker 4>there's like this ignorance arbitrage that people take advantage of.

0:22:40.800 --> 0:22:43.280
<v Speaker 4>They take advantage of that Investors don't fully understand something.

0:22:43.520 --> 0:22:45.440
<v Speaker 4>It's hot, it's on the front page of a newspaper

0:22:45.520 --> 0:22:48.679
<v Speaker 4>or magazine or you know whatever. It's a buzz and

0:22:48.720 --> 0:22:50.280
<v Speaker 4>they want to play in it, and so they invest

0:22:50.359 --> 0:22:52.760
<v Speaker 4>in it. And that's how you get frauds. So we're

0:22:52.760 --> 0:22:55.720
<v Speaker 4>always looking and basically trying to say, is this academic

0:22:55.800 --> 0:22:59.040
<v Speaker 4>practitioner commercially minded. They're probably not going to leave their jobs,

0:22:59.040 --> 0:23:00.760
<v Speaker 4>so we're only getting them twenty percent of their time.

0:23:00.880 --> 0:23:03.399
<v Speaker 4>Is the intellectual property reel. And then oftentimes to you're

0:23:03.400 --> 0:23:06.440
<v Speaker 4>pointing about the papers, there is a high reference ability

0:23:06.440 --> 0:23:09.680
<v Speaker 4>in papers. A paper that is cited enormously or immensely

0:23:09.960 --> 0:23:13.640
<v Speaker 4>has a lot more credibility because you have the vainglorious

0:23:13.960 --> 0:23:17.880
<v Speaker 4>error detection and correction of other scientists who are trying

0:23:17.880 --> 0:23:19.800
<v Speaker 4>to shoot that person down who has all the fame

0:23:19.840 --> 0:23:22.720
<v Speaker 4>for doing it, trying to seize that mantle. And so

0:23:23.240 --> 0:23:26.080
<v Speaker 4>scientists are not a benevelent bunch. They're just as competitive

0:23:26.080 --> 0:23:29.119
<v Speaker 4>as an investigative journalist trying to break the story or investor,

0:23:29.440 --> 0:23:31.359
<v Speaker 4>or an A and R REP for a music band

0:23:31.400 --> 0:23:34.080
<v Speaker 4>trying to get there before everybody else. And we're no different.

0:23:34.119 --> 0:23:36.800
<v Speaker 4>They're no different, But that's how all this stuff progresses.

0:23:37.960 --> 0:23:40.199
<v Speaker 1>Talk to us about the robotic arm. Yeah, I'm going

0:23:40.280 --> 0:23:40.840
<v Speaker 1>to take the bait.

0:23:41.040 --> 0:23:41.560
<v Speaker 3>Yeah.

0:23:41.640 --> 0:23:43.160
<v Speaker 4>The first point I would make is I don't believe

0:23:43.160 --> 0:23:45.760
<v Speaker 4>in these humanoid robot arms with like fingers and you know,

0:23:45.840 --> 0:23:48.879
<v Speaker 4>high dexterity. I think it's cool Parler trick, but I

0:23:48.920 --> 0:23:51.240
<v Speaker 4>just I think we should have robotic arms that look

0:23:51.320 --> 0:23:53.960
<v Speaker 4>more like Swiss army knives that are swapping and moving stuff.

0:23:53.960 --> 0:23:55.920
<v Speaker 4>And you can see a bunch of these things online.

0:23:56.000 --> 0:23:57.960
<v Speaker 4>You know, different tools for different tasks and be able

0:23:57.960 --> 0:24:01.280
<v Speaker 4>to do them instantaneously when you out to the industry structure.

0:24:01.600 --> 0:24:04.760
<v Speaker 4>You've got Fanik, which is a major Japanese player. They

0:24:04.800 --> 0:24:08.200
<v Speaker 4>started with industrial robots. They do factory automation, but they've

0:24:08.200 --> 0:24:10.720
<v Speaker 4>been a key player, probably a twenty five thirty billion

0:24:10.760 --> 0:24:15.000
<v Speaker 4>dollar enterprise value company. You've got ABB, which is Swiss

0:24:15.000 --> 0:24:19.240
<v Speaker 4>Swedish multinational industrial robot arms. Those are most of the

0:24:19.240 --> 0:24:22.200
<v Speaker 4>things that you would see in even a Tesla gigafactory

0:24:22.280 --> 0:24:24.520
<v Speaker 4>or something where Elon's talking about all their automated things

0:24:24.520 --> 0:24:26.919
<v Speaker 4>and they've got some of these ABB arms or the

0:24:26.960 --> 0:24:30.160
<v Speaker 4>other one is Kuka. Kuka which is a German company

0:24:30.520 --> 0:24:32.320
<v Speaker 4>and they were one of the great leaders. They got

0:24:32.320 --> 0:24:34.399
<v Speaker 4>bought by a Chinese company, I want to say twenty

0:24:34.440 --> 0:24:37.200
<v Speaker 4>sixteen or twenty seventeen, and China's made a bunch of

0:24:37.520 --> 0:24:41.440
<v Speaker 4>I think very smart investments acquiring technologies that were a

0:24:41.480 --> 0:24:43.840
<v Speaker 4>little bit before their time, And I think it presents

0:24:43.840 --> 0:24:46.640
<v Speaker 4>some geopolitical things that probably in five or ten years

0:24:46.680 --> 0:24:48.800
<v Speaker 4>will be looking at and saying, my gosh, how is

0:24:48.840 --> 0:24:53.399
<v Speaker 4>the dominant robot arms supplier or robot body supplier be

0:24:53.480 --> 0:24:57.439
<v Speaker 4>akin to like being dependent on TSMC, a Chinese company.

0:24:57.720 --> 0:25:02.879
<v Speaker 4>So I think there's going to be nash robot companies

0:25:03.000 --> 0:25:04.959
<v Speaker 4>that form in the same way you're starting to see

0:25:05.320 --> 0:25:07.000
<v Speaker 4>national AI companies form.

0:25:07.359 --> 0:25:11.160
<v Speaker 2>How does a company like Physical Intelligence, how are they

0:25:11.200 --> 0:25:14.440
<v Speaker 2>solving the data problem? Because like you said, there isn't

0:25:14.480 --> 0:25:17.000
<v Speaker 2>just the equivalent of a robot Internet where they can

0:25:17.040 --> 0:25:19.640
<v Speaker 2>watch millions of hours of a robot or I'm trying

0:25:19.680 --> 0:25:22.760
<v Speaker 2>to do something, or humans doing something or whatever. What

0:25:22.880 --> 0:25:24.919
<v Speaker 2>is the approach by which the sort of the token

0:25:25.000 --> 0:25:26.080
<v Speaker 2>problem is being solved?

0:25:26.359 --> 0:25:28.240
<v Speaker 4>I would call it the easy way to do it,

0:25:28.280 --> 0:25:30.959
<v Speaker 4>which is actually quite trivial and hard, is doing what

0:25:31.000 --> 0:25:33.720
<v Speaker 4>people originally did with robotic surgery. So we had a

0:25:33.720 --> 0:25:36.359
<v Speaker 4>company called Ors Surgical Robotics. We saw that JJ for

0:25:36.359 --> 0:25:40.880
<v Speaker 4>six billion dollars. It started with surgeons operating these things

0:25:41.240 --> 0:25:43.560
<v Speaker 4>in like a telerobot, so they had little pinchers on

0:25:43.600 --> 0:25:46.280
<v Speaker 4>their fingers and you know, from five feet away or

0:25:46.320 --> 0:25:49.280
<v Speaker 4>in a totally clean operating room. They were operating, but

0:25:49.320 --> 0:25:53.119
<v Speaker 4>it was their hands being teletransmitted to the device. And

0:25:53.200 --> 0:25:55.320
<v Speaker 4>so that is the first way which has come up

0:25:55.359 --> 0:25:59.600
<v Speaker 4>with one hundred different tasks, maybe the highest frequency things

0:25:59.760 --> 0:26:04.080
<v Speaker 4>like washing dishes, folding clothes, again unstructured environments, being able

0:26:04.080 --> 0:26:06.520
<v Speaker 4>to do them in multiple different houses, multiple different heights,

0:26:07.160 --> 0:26:09.920
<v Speaker 4>multiple different you know, wet clothes, dry clothes. Being able

0:26:09.960 --> 0:26:13.040
<v Speaker 4>to pour coffee, being able to have the dexterity to

0:26:13.080 --> 0:26:15.560
<v Speaker 4>open up a ca cup. I actually don't drink those.

0:26:15.560 --> 0:26:17.640
<v Speaker 4>I think they're disgusting, but you know, put it into

0:26:17.640 --> 0:26:18.480
<v Speaker 4>a coffee machine.

0:26:18.560 --> 0:26:19.520
<v Speaker 3>They don't drink any of that.

0:26:19.640 --> 0:26:23.760
<v Speaker 4>Slow and there it's an engineer that is operating them

0:26:24.280 --> 0:26:29.919
<v Speaker 4>and the movement of compensating for gravity, how much force,

0:26:30.040 --> 0:26:32.760
<v Speaker 4>how much tension, how much pressure. That's all information. It's

0:26:32.760 --> 0:26:35.800
<v Speaker 4>information that historically has not been captured and some of

0:26:35.840 --> 0:26:38.239
<v Speaker 4>that is then extensible. And this is a really cool thing.

0:26:38.240 --> 0:26:39.719
<v Speaker 4>You can see some of these robots. You might have

0:26:39.880 --> 0:26:42.560
<v Speaker 4>five different robots, but they have this amazing thing called

0:26:42.600 --> 0:26:45.960
<v Speaker 4>transfer learning. You teach one robot a thing, and suddenly

0:26:46.000 --> 0:26:48.840
<v Speaker 4>the other robot, which is disconnected, you know, to or

0:26:48.920 --> 0:26:51.560
<v Speaker 4>it's connected through the Internet through it, but it can

0:26:51.600 --> 0:26:54.680
<v Speaker 4>actually learn what that robot just learned and perform the task.

0:26:54.760 --> 0:26:57.359
<v Speaker 4>So that's actually pretty eerie and pretty cool. It's the

0:26:57.359 --> 0:26:59.919
<v Speaker 4>same sort of thing like if I had one roll

0:27:00.680 --> 0:27:03.160
<v Speaker 4>that saw where I tossed a ball in a room,

0:27:03.680 --> 0:27:06.040
<v Speaker 4>but three other robots didn't know, they would instantly know

0:27:06.440 --> 0:27:08.520
<v Speaker 4>because they have the eyes of the first robot. So

0:27:08.640 --> 0:27:11.320
<v Speaker 4>there's all kinds of training like that. Then there's schematics

0:27:11.320 --> 0:27:13.000
<v Speaker 4>and drawings. So I was alluding to this before of

0:27:13.040 --> 0:27:16.439
<v Speaker 4>like Ikea drawings, but once you can take diagrams and

0:27:16.440 --> 0:27:20.879
<v Speaker 4>schematics and actually use visual language models, it's something that

0:27:21.240 --> 0:27:23.439
<v Speaker 4>Opening Eyes a partner here with physical intelligence, and they

0:27:23.440 --> 0:27:25.719
<v Speaker 4>have pioneered some of that. That's going to be wild too,

0:27:25.760 --> 0:27:29.199
<v Speaker 4>where you can literally just show an Ikea drawing and

0:27:29.240 --> 0:27:32.960
<v Speaker 4>the robot goes with a constraint set of limited pieces

0:27:32.960 --> 0:27:35.200
<v Speaker 4>that they have to put together, a set of screws

0:27:35.520 --> 0:27:38.840
<v Speaker 4>and wrenches and whatnot, and they can completely assemble whatever

0:27:38.880 --> 0:27:41.720
<v Speaker 4>it is, a nursery thing or furniture or desk, which

0:27:41.720 --> 0:27:43.080
<v Speaker 4>I think is going to be pretty wild for people

0:27:43.080 --> 0:27:43.479
<v Speaker 4>to see too.

0:27:43.720 --> 0:27:46.040
<v Speaker 1>That would actually be a major quality of life cure,

0:27:46.720 --> 0:27:49.840
<v Speaker 1>not having to put together ikea furniture. That'd be amazing.

0:27:50.359 --> 0:27:53.040
<v Speaker 1>Could you talk a little bit more about how robots

0:27:53.200 --> 0:27:55.880
<v Speaker 1>are learning and I guess, like what the different types

0:27:56.200 --> 0:27:59.880
<v Speaker 1>of learning are or different patterns of learning, and what

0:28:00.119 --> 0:28:02.360
<v Speaker 1>you've seen that's been most promising so far.

0:28:02.920 --> 0:28:05.680
<v Speaker 4>You have two main categories or maybe three. You've got

0:28:05.680 --> 0:28:11.320
<v Speaker 4>supervised learning, so there you've got input data. The robot

0:28:11.400 --> 0:28:14.959
<v Speaker 4>is learning, it's being corrected, it's being told in some

0:28:15.000 --> 0:28:18.040
<v Speaker 4>cases like I described before, whether it's voice or a

0:28:18.119 --> 0:28:22.240
<v Speaker 4>gesture or nudge. Then you have unsupervised learning, where the

0:28:22.320 --> 0:28:26.080
<v Speaker 4>robots are basically training on unstructured data. They get to

0:28:26.080 --> 0:28:30.200
<v Speaker 4>discover patterns, they encounter the world, they encounter boundaries, gravity,

0:28:30.400 --> 0:28:32.960
<v Speaker 4>those kinds of things, and it might be slower in

0:28:33.000 --> 0:28:36.880
<v Speaker 4>that case, but they're reducing the dimensions for error. There's

0:28:36.920 --> 0:28:40.160
<v Speaker 4>something that I coined this term. I call it MBTFU,

0:28:40.240 --> 0:28:43.680
<v Speaker 4>which is meantime between f ups. But you want that

0:28:43.720 --> 0:28:45.600
<v Speaker 4>to be as long as possible. If you go back

0:28:45.640 --> 0:28:48.720
<v Speaker 4>to like the early rumbas, you know, a roomba would

0:28:48.760 --> 0:28:51.280
<v Speaker 4>not know if it was cleaning up a spill of

0:28:51.320 --> 0:28:53.760
<v Speaker 4>chocolate milk, or if your dog made a mess and

0:28:53.840 --> 0:28:56.320
<v Speaker 4>it smears. It like to tell all over your floor, right.

0:28:56.520 --> 0:28:59.520
<v Speaker 4>You want to increase as long as possible the meantime

0:28:59.560 --> 0:29:02.840
<v Speaker 4>between me, you know, basically error reduction. Men, you've got

0:29:02.880 --> 0:29:06.520
<v Speaker 4>reinforcement learning, imitation learning, where you might be controlling the

0:29:06.600 --> 0:29:09.720
<v Speaker 4>robot or having it mimic you. There's this idea of

0:29:09.720 --> 0:29:12.040
<v Speaker 4>transfer learning, where a single robot learned something, but it

0:29:12.040 --> 0:29:15.720
<v Speaker 4>can transfer it to different robots or from a different domain.

0:29:16.240 --> 0:29:19.000
<v Speaker 4>So people are trying lots of different approaches, and the

0:29:19.000 --> 0:29:22.920
<v Speaker 4>more different mechanisms you have, people are then going to

0:29:22.960 --> 0:29:25.280
<v Speaker 4>figure out, okay, which is the least data intensive, or

0:29:25.280 --> 0:29:27.840
<v Speaker 4>which has the lowest latency and is the quickest, or

0:29:28.120 --> 0:29:31.560
<v Speaker 4>which is the best system for training a robot that

0:29:32.400 --> 0:29:36.160
<v Speaker 4>you put it in a totally unstructured environment and without any training,

0:29:36.200 --> 0:29:38.000
<v Speaker 4>sort of what they call it zero shot learning. It's

0:29:38.000 --> 0:29:41.400
<v Speaker 4>able to figure out from prior knowledge. I know that

0:29:41.440 --> 0:29:44.560
<v Speaker 4>I can go through that chair. I know that it swivels,

0:29:44.600 --> 0:29:46.640
<v Speaker 4>I have to turn it this way. I know how

0:29:46.720 --> 0:29:49.200
<v Speaker 4>much force I need to pick up a general coke

0:29:49.280 --> 0:29:54.120
<v Speaker 4>can and those kinds of things. And I think, again,

0:29:54.160 --> 0:29:57.520
<v Speaker 4>it's going to be enlightening of how much we as

0:29:57.560 --> 0:30:01.880
<v Speaker 4>we navigate any given minute in our life. Take for granted,

0:30:02.040 --> 0:30:05.680
<v Speaker 4>all this intuitive tacit knowledge that we have about the

0:30:05.680 --> 0:30:09.480
<v Speaker 4>physical world, and there really is this intuitive physics of

0:30:09.800 --> 0:30:11.800
<v Speaker 4>how we move around. Robots are going to learn.

0:30:11.680 --> 0:30:15.440
<v Speaker 2>That is really simple question. In the next five years,

0:30:15.880 --> 0:30:18.400
<v Speaker 2>is it plausible that I'll have a robot in my

0:30:18.480 --> 0:30:20.320
<v Speaker 2>house where I could take all the clothes out of

0:30:20.360 --> 0:30:23.120
<v Speaker 2>the dryer, drop it into something and have it turned

0:30:23.160 --> 0:30:27.480
<v Speaker 2>into folded clothes. And or if not that, what could

0:30:27.560 --> 0:30:30.920
<v Speaker 2>be that chat gpt of robots that's right around the corner.

0:30:31.840 --> 0:30:34.240
<v Speaker 4>Well, one thing which I posited to the team because

0:30:34.280 --> 0:30:36.720
<v Speaker 4>I was thinking about exactly that. You know, what would

0:30:36.720 --> 0:30:38.920
<v Speaker 4>be a really cool thing. I lose stuff in our

0:30:38.960 --> 0:30:41.280
<v Speaker 4>apartment all the time. We have a bunch of different rooms.

0:30:41.920 --> 0:30:44.960
<v Speaker 4>I would love to basically say, has anybody seen which

0:30:45.000 --> 0:30:46.720
<v Speaker 4>is often what I do to my wife and three kids?

0:30:46.760 --> 0:30:49.920
<v Speaker 4>Has anybody seen my wallet? Has anybody seen my glasses? Okay,

0:30:50.360 --> 0:30:53.840
<v Speaker 4>just announcing that you can see that a robot with

0:30:53.920 --> 0:30:55.760
<v Speaker 4>a series of things in your home that would have

0:30:55.840 --> 0:30:58.720
<v Speaker 4>visual identifiers and visual learning machine learning to be able

0:30:58.760 --> 0:31:03.440
<v Speaker 4>to spot object in a video frame could say yes, Josh,

0:31:03.480 --> 0:31:06.120
<v Speaker 4>I know exactly where they are and go and retrieve them.

0:31:06.360 --> 0:31:09.040
<v Speaker 4>So fetch and retrieving objects in the home to me

0:31:09.640 --> 0:31:11.480
<v Speaker 4>would be a pretty cool thing. Where'd I put that

0:31:11.520 --> 0:31:14.160
<v Speaker 4>remote or whatever? And the robot basically knows because they

0:31:14.200 --> 0:31:16.520
<v Speaker 4>can go through the DVR of the home and they

0:31:16.520 --> 0:31:18.040
<v Speaker 4>basically know where it is and they can fetch it

0:31:18.040 --> 0:31:20.720
<v Speaker 4>and retrieve it with the right physics. Folding laundry. I

0:31:20.720 --> 0:31:23.160
<v Speaker 4>don't know how to handicap that, and we can do

0:31:23.240 --> 0:31:24.520
<v Speaker 4>that now pretty.

0:31:24.160 --> 0:31:26.960
<v Speaker 2>Crudely through That's the one I want because I have

0:31:27.040 --> 0:31:28.960
<v Speaker 2>a swall in New York City apartment. I don't lose

0:31:29.000 --> 0:31:31.680
<v Speaker 2>things like too much. Basically, like, I need that fold

0:31:31.760 --> 0:31:32.880
<v Speaker 2>I need that folding robot.

0:31:32.960 --> 0:31:34.520
<v Speaker 4>What are you gonna pay for that? Though? I don't know.

0:31:34.600 --> 0:31:37.320
<v Speaker 4>You know, would you pay five grand ten grand for

0:31:37.640 --> 0:31:40.840
<v Speaker 4>a robot that folded your clothes? Probably not. So it's

0:31:40.920 --> 0:31:42.640
<v Speaker 4>it's gonna have a high. That's why most of these

0:31:42.680 --> 0:31:46.880
<v Speaker 4>things have found their way into industrial uses first, and

0:31:46.960 --> 0:31:48.880
<v Speaker 4>over time they'll get cheaper and cheaper, they'll get better

0:31:48.960 --> 0:31:51.120
<v Speaker 4>and better. But you know, look, I'm one of the

0:31:51.160 --> 0:31:54.280
<v Speaker 4>few people that have an Amazon astro It's a robot

0:31:54.320 --> 0:31:56.840
<v Speaker 4>that you know, rolls around the house and you can

0:31:56.840 --> 0:31:58.480
<v Speaker 4>tell to go into a room. You can put something

0:31:58.520 --> 0:32:00.160
<v Speaker 4>in the back of it and take it to It

0:32:00.200 --> 0:32:02.640
<v Speaker 4>can do facial identification for my family, so I can

0:32:02.720 --> 0:32:06.040
<v Speaker 4>say where's quinner body, and they'll find my younger kids

0:32:06.080 --> 0:32:09.760
<v Speaker 4>and I can send a message noise the hell out

0:32:09.800 --> 0:32:11.400
<v Speaker 4>of my wife. But I think it's sort of cool,

0:32:11.400 --> 0:32:13.160
<v Speaker 4>and yeah, every robot that comes out, I'll be an

0:32:13.160 --> 0:32:16.240
<v Speaker 4>early adoptor, and yeah, we like funny things.

0:32:33.560 --> 0:32:36.440
<v Speaker 1>So one of the interesting things about the current tech

0:32:36.560 --> 0:32:39.840
<v Speaker 1>cycle and all this enthusiasm for AI is that so

0:32:40.040 --> 0:32:43.680
<v Speaker 1>far it's been the big incumbents who seem to be

0:32:43.720 --> 0:32:47.040
<v Speaker 1>winning here. And part of that is because the capital

0:32:47.080 --> 0:32:50.440
<v Speaker 1>investment needed is so large, the amount of data needed

0:32:50.680 --> 0:32:53.360
<v Speaker 1>is so large when it comes to robotics, would you

0:32:53.400 --> 0:32:56.280
<v Speaker 1>expect to see a similar thing? And then added on

0:32:56.360 --> 0:32:59.280
<v Speaker 1>to that, could you have a situation where, you know,

0:32:59.360 --> 0:33:03.120
<v Speaker 1>if you are a large manufacturer, or perhaps you are

0:33:03.560 --> 0:33:07.040
<v Speaker 1>a company that has a lot of proprietary data, like

0:33:07.160 --> 0:33:11.480
<v Speaker 1>an insurance company or a financial company or something like that,

0:33:11.840 --> 0:33:14.240
<v Speaker 1>could you develop your own robotics Like would that be

0:33:14.320 --> 0:33:17.840
<v Speaker 1>your edge here? And could you potentially just do it yourself?

0:33:18.480 --> 0:33:21.880
<v Speaker 4>On the first case, if I look at the current landscape,

0:33:22.320 --> 0:33:24.800
<v Speaker 4>the one company in sort of the big magnificent seven

0:33:25.080 --> 0:33:27.760
<v Speaker 4>would be Amazon, just because they have already been investing

0:33:27.800 --> 0:33:31.200
<v Speaker 4>in this and for a long time. Jeff Bezos is

0:33:31.280 --> 0:33:34.440
<v Speaker 4>very passionate about robots. So I think there's a DNA

0:33:34.560 --> 0:33:38.520
<v Speaker 4>there where doing things that enable them to do the

0:33:38.520 --> 0:33:41.240
<v Speaker 4>three things that Jeff loves to do historically, which is

0:33:41.560 --> 0:33:46.120
<v Speaker 4>increased choice and availability, increase convenience for customers and lower prices,

0:33:46.160 --> 0:33:51.000
<v Speaker 4>and factory automation, warehouses delivery. Even when they bought our

0:33:51.000 --> 0:33:53.400
<v Speaker 4>company for a little over a billion dollars called Zookes

0:33:53.440 --> 0:33:55.800
<v Speaker 4>that was doing autonomous driving, they have a long term

0:33:55.800 --> 0:33:57.800
<v Speaker 4>intention to be able to do twenty four to seven

0:33:57.880 --> 0:34:00.520
<v Speaker 4>right hand turn lanes, navigating around the city with a

0:34:00.600 --> 0:34:03.200
<v Speaker 4>human that's basically delivering last miles kind of stuff. And

0:34:04.000 --> 0:34:07.720
<v Speaker 4>so so I could see Amazon doing that. Microsoft, I

0:34:07.720 --> 0:34:10.000
<v Speaker 4>don't really see them getting into robotics in a significant way.

0:34:10.000 --> 0:34:12.360
<v Speaker 4>They have some R and D efforts Google and we

0:34:12.400 --> 0:34:14.400
<v Speaker 4>talked a little bit about this phenomenon, but they are

0:34:14.480 --> 0:34:17.279
<v Speaker 4>the third or fourth group where we have taken a

0:34:17.400 --> 0:34:19.560
<v Speaker 4>team out of a big tech We did it with

0:34:19.680 --> 0:34:23.320
<v Speaker 4>Google with a company called Osmo to create basically Shazam

0:34:23.400 --> 0:34:29.000
<v Speaker 4>for smell. We did it with a BIOAI company called

0:34:29.040 --> 0:34:31.240
<v Speaker 4>Evolutionary Scale out of meta that's going to be announced

0:34:31.239 --> 0:34:33.480
<v Speaker 4>more publicly soon. And then we did it with this

0:34:33.520 --> 0:34:36.040
<v Speaker 4>team out of Google that became a physical intelligence between

0:34:36.040 --> 0:34:38.800
<v Speaker 4>Google deep Mind and open AI and Stanford and Berkeley.

0:34:39.120 --> 0:34:41.080
<v Speaker 4>So I think that this is going to be more

0:34:41.200 --> 0:34:44.120
<v Speaker 4>the startups. The beneficiaries of the money will still be

0:34:44.280 --> 0:34:46.200
<v Speaker 4>Nvidia and some of the chip players, some of the

0:34:46.239 --> 0:34:48.319
<v Speaker 4>hardware providers, because we need the hardware to be able

0:34:48.320 --> 0:34:50.400
<v Speaker 4>to train the robots. But I really think that this

0:34:50.560 --> 0:34:52.719
<v Speaker 4>is an open field. And again just going back to

0:34:52.760 --> 0:34:54.719
<v Speaker 4>that stat of how many large language model there are

0:34:54.800 --> 0:34:57.880
<v Speaker 4>for chatbots and how few there are for robots. I

0:34:57.920 --> 0:35:00.239
<v Speaker 4>think it's a big opportunity. Now five years from now

0:35:00.280 --> 0:35:02.160
<v Speaker 4>we might have a bubble in robots, but today I

0:35:02.200 --> 0:35:05.439
<v Speaker 4>think it's it's it's a really exciting field, just.

0:35:05.360 --> 0:35:09.560
<v Speaker 2>An existing AI and competitive advantage. So when I first

0:35:09.560 --> 0:35:13.640
<v Speaker 2>in the year two thousand, I think encountered Google, I

0:35:13.760 --> 0:35:16.640
<v Speaker 2>used it. I was like, this is way better than

0:35:16.719 --> 0:35:19.239
<v Speaker 2>Yahoo or anything else. I never like stopped after that.

0:35:19.320 --> 0:35:22.040
<v Speaker 2>Again with chat bots, just going back now, I even

0:35:22.040 --> 0:35:24.520
<v Speaker 2>talking about robotics here. It's like I and I like

0:35:24.680 --> 0:35:27.960
<v Speaker 2>got a chet GPT progount or whatever early on, and

0:35:28.000 --> 0:35:30.040
<v Speaker 2>I thought it was pretty cool and I like use

0:35:30.120 --> 0:35:32.640
<v Speaker 2>it for some stuff. And then like the new version

0:35:32.680 --> 0:35:35.040
<v Speaker 2>of Claude came out from Anthropic and I was like, oh,

0:35:35.080 --> 0:35:36.600
<v Speaker 2>this is actually kind of cool and I kind of

0:35:36.640 --> 0:35:38.080
<v Speaker 2>like it more. I don't really know why I like

0:35:38.120 --> 0:35:39.719
<v Speaker 2>it more, but for some reason, I like it more

0:35:39.920 --> 0:35:42.600
<v Speaker 2>and I had no problem switching over. Could it be

0:35:42.719 --> 0:35:45.840
<v Speaker 2>possible that some of these like core like models do

0:35:46.000 --> 0:35:49.000
<v Speaker 2>not prove to be as sticky or have as deep

0:35:49.080 --> 0:35:50.600
<v Speaker 2>moat as people expect.

0:35:50.880 --> 0:35:54.200
<v Speaker 4>Totally Inflection, which launched there is called PI just wasn't

0:35:54.239 --> 0:35:57.560
<v Speaker 4>that good. They've now gone to Microsoft Anthropic. Yeah, at

0:35:57.560 --> 0:36:00.000
<v Speaker 4>first was a little bit behind. They are the most

0:36:00.080 --> 0:36:02.120
<v Speaker 4>formative model. They're the best performing one. They're the one

0:36:02.120 --> 0:36:05.080
<v Speaker 4>that I also like, you use the most. Why it's fastest,

0:36:05.440 --> 0:36:07.440
<v Speaker 4>it has a little bit more.

0:36:08.160 --> 0:36:09.640
<v Speaker 3>It seems to talk a little bitter.

0:36:10.560 --> 0:36:13.240
<v Speaker 4>Yeah, but here's an interesting thing to your point about

0:36:13.239 --> 0:36:16.760
<v Speaker 4>you know, sort of motes and Google early on most

0:36:16.800 --> 0:36:20.080
<v Speaker 4>searches for Google, and remember Google's two hundred and eighty

0:36:20.160 --> 0:36:25.719
<v Speaker 4>billion dollars add revenue generated, you know, enormous Google. Most

0:36:25.719 --> 0:36:28.160
<v Speaker 4>of those are like five to ten word searches, right,

0:36:28.200 --> 0:36:29.600
<v Speaker 4>so you put something in your like, I don't know,

0:36:29.719 --> 0:36:34.759
<v Speaker 4>restaurant in West Village or you know whatever. Okay, Perplexity,

0:36:34.800 --> 0:36:36.839
<v Speaker 4>which I don't know if you've used, and we met

0:36:36.840 --> 0:36:39.560
<v Speaker 4>the founder early on and we ended up not investing

0:36:39.560 --> 0:36:41.279
<v Speaker 4>and it was probably a miss for us, but the

0:36:41.400 --> 0:36:43.160
<v Speaker 4>founder they are focused. You know what I'm going to

0:36:43.239 --> 0:36:45.600
<v Speaker 4>do the point two percent of searches or the one

0:36:45.680 --> 0:36:48.160
<v Speaker 4>percent of searches that are longer formed where people really

0:36:48.160 --> 0:36:50.840
<v Speaker 4>want to ask a full question. And I see a

0:36:50.840 --> 0:36:52.520
<v Speaker 4>lot of people that are using it and it's not

0:36:52.560 --> 0:36:55.759
<v Speaker 4>coming up with some Wolke answer or some pithy, you know,

0:36:55.960 --> 0:37:00.480
<v Speaker 4>rock like response. It's actually a well researched, footnoted, sources,

0:37:00.880 --> 0:37:04.239
<v Speaker 4>cited response. So to me, the two things that I

0:37:04.320 --> 0:37:06.640
<v Speaker 4>use most on the chap outside right now are clawed

0:37:06.880 --> 0:37:10.440
<v Speaker 4>from Anthropic and Perplexity. But in six months there might

0:37:10.480 --> 0:37:15.120
<v Speaker 4>be something totally different. And the sheer number of models

0:37:15.520 --> 0:37:18.239
<v Speaker 4>that are available on open source. Who knows what Apple

0:37:18.320 --> 0:37:21.040
<v Speaker 4>ends up doing here and that being integrated. You know,

0:37:21.120 --> 0:37:25.000
<v Speaker 4>Series sucked, but so did Alexa. Amazon's making moves Apple

0:37:25.000 --> 0:37:28.839
<v Speaker 4>will too. So yeah, but you know, use you mount

0:37:28.840 --> 0:37:31.280
<v Speaker 4>for a second. All of this stuff and Venture Venture

0:37:31.360 --> 0:37:35.080
<v Speaker 4>is going through this downturn and the one area where

0:37:35.160 --> 0:37:38.160
<v Speaker 4>there are very high valuations and a lot of money

0:37:38.160 --> 0:37:40.960
<v Speaker 4>flowing and a lot of talent is been in AI,

0:37:41.160 --> 0:37:43.520
<v Speaker 4>and therefore the future returns on these things are going

0:37:43.560 --> 0:37:45.840
<v Speaker 4>to be lower. And that's why we at Lucks decided,

0:37:45.960 --> 0:37:47.840
<v Speaker 4>you know, we're going to focus on AI in the

0:37:47.840 --> 0:37:50.040
<v Speaker 4>physical world. Having done all the stuff over the past

0:37:50.080 --> 0:37:52.759
<v Speaker 4>five years, I would say there's this five year psychological bias.

0:37:52.760 --> 0:37:54.600
<v Speaker 4>Everybody wants to be invested today where you should have

0:37:54.600 --> 0:37:57.600
<v Speaker 4>been five years ago, So hugging face and Mosaic and

0:37:57.760 --> 0:37:59.400
<v Speaker 4>which we sold the data bricks. We were in that

0:37:59.400 --> 0:38:02.640
<v Speaker 4>five years ago. Today we're really interested in biology and

0:38:02.760 --> 0:38:05.400
<v Speaker 4>robotics and AIS use in those. And then I have

0:38:05.440 --> 0:38:08.319
<v Speaker 4>a really weird theme which is so sexy because it's

0:38:08.440 --> 0:38:11.680
<v Speaker 4>unsexy to me and to others. You understand accounting, vast

0:38:11.719 --> 0:38:14.279
<v Speaker 4>majority of venture investors and make startup people don't. But

0:38:14.560 --> 0:38:17.880
<v Speaker 4>do you take capex. Capex is made up of two pieces,

0:38:18.040 --> 0:38:22.080
<v Speaker 4>growth and maintenance. And everybody's been funding growth, growth, growth, growth, growth,

0:38:22.120 --> 0:38:26.280
<v Speaker 4>investor growth. So I got interested in maintenance. Why because

0:38:26.320 --> 0:38:31.120
<v Speaker 4>you have trillions of dollars of assets infrastructure, hospital systems,

0:38:31.320 --> 0:38:35.480
<v Speaker 4>energy systems, buildings that need to be maintained and every

0:38:35.560 --> 0:38:38.480
<v Speaker 4>generation and every new startup and every new investor always

0:38:38.480 --> 0:38:40.120
<v Speaker 4>wants to do the new new thing. It's why we

0:38:40.120 --> 0:38:41.640
<v Speaker 4>get new music, and we get new food, and we

0:38:41.640 --> 0:38:44.920
<v Speaker 4>get new fashion. But there's all these neglected assets, and

0:38:44.960 --> 0:38:48.800
<v Speaker 4>I think you can apply new technology to maintaining these systems.

0:38:48.840 --> 0:38:52.160
<v Speaker 4>And so I've become obsessed with this unsexy theme of maintenance,

0:38:52.640 --> 0:38:54.160
<v Speaker 4>which I think is going to become a hot area

0:38:54.200 --> 0:38:55.000
<v Speaker 4>over the next few years.

0:38:55.120 --> 0:38:58.400
<v Speaker 1>Well, you mean maintenance of physical infrastructure. So the idea

0:38:58.440 --> 0:38:59.920
<v Speaker 1>that you could have, I don't know, a little row

0:39:00.080 --> 0:39:02.920
<v Speaker 1>bought that goes around your factory or a bunch of

0:39:03.040 --> 0:39:06.680
<v Speaker 1>highways and sort of surveys it for cracks or things

0:39:06.719 --> 0:39:08.839
<v Speaker 1>that it thinks needs to be fixed.

0:39:08.680 --> 0:39:12.880
<v Speaker 4>Totally, could be infrastructure for transportation, it could be inside

0:39:12.920 --> 0:39:15.719
<v Speaker 4>of hospitals where there are road routine things. And it's

0:39:15.760 --> 0:39:18.799
<v Speaker 4>also oddly, and I know you guys have covered this,

0:39:18.920 --> 0:39:22.200
<v Speaker 4>but the idea that AI is really coming for the

0:39:22.239 --> 0:39:25.040
<v Speaker 4>white collar workers. You know, you joke that you could

0:39:25.040 --> 0:39:27.040
<v Speaker 4>talk about AI and generate a script, you know, based

0:39:27.040 --> 0:39:27.520
<v Speaker 4>on AI.

0:39:27.880 --> 0:39:30.280
<v Speaker 1>But oh no, it wasn't a joke, very serious.

0:39:31.360 --> 0:39:34.480
<v Speaker 4>They always thought that they were relatively insulated and it

0:39:34.520 --> 0:39:36.480
<v Speaker 4>was the blue collar workers. But let me tell you

0:39:36.560 --> 0:39:38.480
<v Speaker 4>the guy that put me in business, Bill Conway, the

0:39:38.880 --> 0:39:41.680
<v Speaker 4>founder of the Carlile Group. He's spending all his philanthropic money,

0:39:41.960 --> 0:39:44.640
<v Speaker 4>or a significant portion of it, funding nursing schools. Why

0:39:44.680 --> 0:39:48.440
<v Speaker 4>because he identified a very high magnitude impact. Because we

0:39:48.480 --> 0:39:51.359
<v Speaker 4>have such a shortage of nurses in this country. That's

0:39:51.360 --> 0:39:54.680
<v Speaker 4>an opportunity for maintenance where robots and technology can play

0:39:54.680 --> 0:39:58.040
<v Speaker 4>a role. How do you augment and help nurses plummers.

0:39:58.080 --> 0:40:00.920
<v Speaker 4>We have a massive shortage of plumbers this country, and

0:40:01.000 --> 0:40:03.520
<v Speaker 4>so I actually think that the blue collar workers, empowered

0:40:03.520 --> 0:40:07.000
<v Speaker 4>by technology and maintaining all of these systems around us,

0:40:07.400 --> 0:40:09.360
<v Speaker 4>are actually going to be a winning combination.

0:40:09.520 --> 0:40:12.040
<v Speaker 2>I want to talk about another aspect of I guess

0:40:12.160 --> 0:40:15.640
<v Speaker 2>AI investing, which is that in this sort of the

0:40:15.760 --> 0:40:19.360
<v Speaker 2>SaaS wave, the twenty tens a decade like compute was

0:40:19.520 --> 0:40:22.359
<v Speaker 2>very cheap, right, and so basically like that part, you're

0:40:22.360 --> 0:40:25.320
<v Speaker 2>like plug into aws and it's sort of yeah, I know,

0:40:25.360 --> 0:40:27.640
<v Speaker 2>it probably costs some money, but is not a big

0:40:27.800 --> 0:40:29.520
<v Speaker 2>lineup or a line item.

0:40:29.640 --> 0:40:32.719
<v Speaker 3>Ultimately for a lot of these companies, How does that change.

0:40:32.400 --> 0:40:34.719
<v Speaker 2>In twenty twenty four when you're dealing with an AI

0:40:34.840 --> 0:40:39.319
<v Speaker 2>company and electricity bills exist or hardware accumulation, depending on

0:40:39.560 --> 0:40:42.560
<v Speaker 2>where they are in the stack. How is an investor

0:40:42.880 --> 0:40:45.080
<v Speaker 2>do you think about like the changing I guess people

0:40:45.080 --> 0:40:47.799
<v Speaker 2>will talk about like you know, shifting, you know, having

0:40:47.800 --> 0:40:50.319
<v Speaker 2>to spend more on capex versus op X relative to

0:40:50.360 --> 0:40:53.720
<v Speaker 2>the prior generation of tech startups. How does that play

0:40:53.719 --> 0:40:55.920
<v Speaker 2>out in the investments you choose?

0:40:56.239 --> 0:40:58.480
<v Speaker 4>It's a great question in the AI world, and then

0:40:58.520 --> 0:41:00.799
<v Speaker 4>I'll give you the biology world. On the AI side,

0:41:00.800 --> 0:41:02.920
<v Speaker 4>you know, take opening Eye. These are all rumored numbers,

0:41:02.960 --> 0:41:05.799
<v Speaker 4>not you know, nothing's fully confirmed, but two billion, maybe

0:41:05.840 --> 0:41:08.520
<v Speaker 4>three billion of revenue. I think about ten million people

0:41:08.640 --> 0:41:11.799
<v Speaker 4>paying twenty bucks a month or thereabout, and probably one

0:41:11.840 --> 0:41:13.640
<v Speaker 4>hundred million users. You know, I don't know how many

0:41:13.640 --> 0:41:15.960
<v Speaker 4>of those are unique, but they're not making money on that.

0:41:15.960 --> 0:41:18.440
<v Speaker 4>They're losing several billion dollars today because you have these

0:41:18.520 --> 0:41:21.120
<v Speaker 4>upfront costs, big cap X, a lot of training, you know,

0:41:21.120 --> 0:41:23.280
<v Speaker 4>and then you try to maybe do some big enterprise deals.

0:41:23.719 --> 0:41:26.520
<v Speaker 4>A company like hugging Face is profitable because they're not

0:41:26.560 --> 0:41:28.560
<v Speaker 4>doing they're just hosting it and you know, letting people

0:41:28.640 --> 0:41:30.960
<v Speaker 4>run in frints and then charging and making margin on

0:41:31.000 --> 0:41:34.120
<v Speaker 4>that kind of stuff. So that to me is interesting

0:41:34.160 --> 0:41:36.800
<v Speaker 4>of the people that spend a ton of money and

0:41:36.960 --> 0:41:39.319
<v Speaker 4>they've got to earn it back. And can you get

0:41:39.360 --> 0:41:42.200
<v Speaker 4>pricing power by going from twenty bucks a month to

0:41:42.200 --> 0:41:44.239
<v Speaker 4>thirty bucks a month? And maybe you get that because

0:41:44.280 --> 0:41:47.000
<v Speaker 4>now you have open AI premium where you have access

0:41:47.040 --> 0:41:49.399
<v Speaker 4>to say Sora for video generation or something like that.

0:41:49.640 --> 0:41:51.359
<v Speaker 4>So that's going to be a big question on are

0:41:51.400 --> 0:41:53.719
<v Speaker 4>these profitable investments? Not? Are they cool? Not? Are they

0:41:53.760 --> 0:41:57.240
<v Speaker 4>world changing? Absolutely, but are they profitable investments? And look,

0:41:57.520 --> 0:41:59.520
<v Speaker 4>the market may not care if they're profitable. Market funds

0:41:59.520 --> 0:42:01.279
<v Speaker 4>all kinds of profitable thing that they believe in the

0:42:01.360 --> 0:42:04.360
<v Speaker 4>narrative in the story. But thinking about fundamental businesses and

0:42:04.400 --> 0:42:07.279
<v Speaker 4>the economic changes between Capeck and op X, I think

0:42:07.280 --> 0:42:09.799
<v Speaker 4>in AI it's very hard if you are building out

0:42:09.800 --> 0:42:11.759
<v Speaker 4>your data centers, trying to do your own training, your

0:42:11.800 --> 0:42:15.400
<v Speaker 4>own inference, hosting these models, it's very hard. Biology we

0:42:15.480 --> 0:42:19.200
<v Speaker 4>will see an AWS moment where instead of you having

0:42:19.239 --> 0:42:22.160
<v Speaker 4>to be a biotech firm that opens your own wet

0:42:22.239 --> 0:42:26.239
<v Speaker 4>lab or moves into Alexandria real estate which is specialized

0:42:26.280 --> 0:42:29.560
<v Speaker 4>in hosting biotech companies in all these different regions, approximate

0:42:29.680 --> 0:42:32.799
<v Speaker 4>to academic research centers. You will be able to just

0:42:32.840 --> 0:42:35.719
<v Speaker 4>take your experiment and upload it to the cloud, where

0:42:35.760 --> 0:42:39.920
<v Speaker 4>there are cloud based robotic labs. We funded some of these.

0:42:40.400 --> 0:42:43.120
<v Speaker 4>There's one company called Stratios. There's a ton that are

0:42:43.120 --> 0:42:45.080
<v Speaker 4>going to come on wave. And this is exciting because

0:42:45.120 --> 0:42:47.440
<v Speaker 4>you can be a scientist on the beach in the Bahamas,

0:42:47.480 --> 0:42:50.400
<v Speaker 4>pull up your iPad, run an experiment. The robots are

0:42:50.400 --> 0:42:53.920
<v Speaker 4>performing ninety percent of the activity of pouring something from

0:42:53.960 --> 0:42:57.160
<v Speaker 4>a beaker into another, running a centrifuge, and then the

0:42:57.239 --> 0:42:58.759
<v Speaker 4>data that comes off of that, and this is the

0:42:58.800 --> 0:43:02.040
<v Speaker 4>really cool part. Then the robot and the machines will

0:43:02.040 --> 0:43:04.600
<v Speaker 4>actually say to you, hey, do you want to run

0:43:04.600 --> 0:43:07.680
<v Speaker 4>this experiment, but change these four parameters or these variables,

0:43:07.760 --> 0:43:09.600
<v Speaker 4>and you just click a button yes, as though it's

0:43:09.640 --> 0:43:13.120
<v Speaker 4>reverse prompting you, and then you run another experiment. So

0:43:13.200 --> 0:43:17.920
<v Speaker 4>the implication here is that the boost in productivity for science,

0:43:18.280 --> 0:43:22.120
<v Speaker 4>for generation of truth, of new information, of new knowledge,

0:43:22.360 --> 0:43:24.279
<v Speaker 4>that to me is the most exciting thing. And the

0:43:24.320 --> 0:43:27.680
<v Speaker 4>companies that capture that, forget about the societal dividend, I think,

0:43:27.680 --> 0:43:28.800
<v Speaker 4>are going to make a lot of money.

0:43:29.040 --> 0:43:32.040
<v Speaker 1>Yeah, this actually reminds me of the conversation that we

0:43:32.200 --> 0:43:36.200
<v Speaker 1>had regarding snack food innovation and this idea that you

0:43:36.239 --> 0:43:39.799
<v Speaker 1>can use a sort of factorio like simulation just to

0:43:39.880 --> 0:43:43.680
<v Speaker 1>run new processes through your factory and see how they

0:43:43.680 --> 0:43:46.000
<v Speaker 1>would actually work out and what the supply chain might

0:43:46.040 --> 0:43:48.480
<v Speaker 1>look like. But not to give in to my five

0:43:48.560 --> 0:43:52.719
<v Speaker 1>year bias too much and overly focus on chat GPT.

0:43:53.200 --> 0:43:56.160
<v Speaker 1>But where are we in terms of context window expansion,

0:43:56.200 --> 0:43:58.879
<v Speaker 1>because this is something we spoke about with you last year,

0:43:59.160 --> 0:44:01.120
<v Speaker 1>and I think for a lot of people it's probably

0:44:01.120 --> 0:44:04.680
<v Speaker 1>one of the overriding annoyances with something like chat GPT,

0:44:04.840 --> 0:44:07.440
<v Speaker 1>the fact that you can't actually copy and paste that

0:44:07.560 --> 0:44:10.080
<v Speaker 1>much text into it, and that you are limited in

0:44:10.160 --> 0:44:12.960
<v Speaker 1>terms of the output that it actually gives you. Have

0:44:13.040 --> 0:44:15.680
<v Speaker 1>there been major advancements since we last spoke to you.

0:44:15.960 --> 0:44:18.719
<v Speaker 4>Well, the CLAW three is one of the largest, and

0:44:18.760 --> 0:44:22.040
<v Speaker 4>then you've got all kinds of interesting collaborations. You've got

0:44:22.320 --> 0:44:25.880
<v Speaker 4>Nvidia and Microsoft died one with a huge number of tokens.

0:44:26.120 --> 0:44:29.200
<v Speaker 4>You've got a one to one labs that has this

0:44:29.239 --> 0:44:32.280
<v Speaker 4>thing called Jurassic Again, a lot of people are making

0:44:32.520 --> 0:44:35.400
<v Speaker 4>headway here, but we are I think a year away

0:44:35.920 --> 0:44:40.440
<v Speaker 4>from you being able to upload hundreds of PDFs thousands

0:44:40.440 --> 0:44:44.360
<v Speaker 4>of books if they're not already immediately referenceable and be

0:44:44.480 --> 0:44:50.680
<v Speaker 4>able to detect pattern change amongst documents, summarize and unearth,

0:44:51.160 --> 0:44:54.480
<v Speaker 4>you know, the entirety of key concepts, and then I

0:44:54.480 --> 0:44:57.440
<v Speaker 4>think the most valuable thing will be it prompting you

0:44:57.520 --> 0:45:00.160
<v Speaker 4>to say, here's a question you didn't ask about all

0:45:00.200 --> 0:45:03.000
<v Speaker 4>these documents that you just uploaded. So yeah, I think

0:45:03.320 --> 0:45:07.480
<v Speaker 4>we're just keep increasing the context window. But that said,

0:45:08.200 --> 0:45:11.879
<v Speaker 4>most of the history of innovation is just like, keep

0:45:11.880 --> 0:45:15.400
<v Speaker 4>increasing this factor, and then somebody else comes along and

0:45:15.400 --> 0:45:17.719
<v Speaker 4>invent something. It's like that factor doesn't matter anymore. You know.

0:45:17.760 --> 0:45:21.000
<v Speaker 4>My favorite iconic example of this is like sailboats. You

0:45:21.040 --> 0:45:23.800
<v Speaker 4>find these sailing ships back in the day, they just

0:45:23.840 --> 0:45:25.799
<v Speaker 4>kept adding more and more sales, Like these things started

0:45:25.800 --> 0:45:28.440
<v Speaker 4>to look ridiculous, you know, and then somebody invents the

0:45:28.480 --> 0:45:31.800
<v Speaker 4>electric motor, and you have a motor both, so I

0:45:31.840 --> 0:45:33.719
<v Speaker 4>think we'll have the same sort of thing. And then

0:45:33.719 --> 0:45:37.239
<v Speaker 4>people figure out, hey, there's a better architecture here than

0:45:37.320 --> 0:45:39.880
<v Speaker 4>just constantly increasing the context window. And some of that

0:45:39.960 --> 0:45:43.880
<v Speaker 4>might be with memory retrieval and being able to reference

0:45:43.880 --> 0:45:46.319
<v Speaker 4>other models and just go into the archive of what

0:45:46.360 --> 0:45:48.840
<v Speaker 4>they have. So yeah, that's going to keep expanding.

0:45:48.960 --> 0:45:52.080
<v Speaker 2>Josh woolf Lux Capital, thank you so much for coming

0:45:52.120 --> 0:45:53.160
<v Speaker 2>back on odd lots.

0:45:53.200 --> 0:45:55.719
<v Speaker 3>Always great to get an update on what you're interested in.

0:45:56.040 --> 0:45:57.520
<v Speaker 4>It was great to be with you guys.

0:45:57.680 --> 0:46:02.600
<v Speaker 2>Yeah good, you're prepared already to ingratiate yourself and to

0:46:03.120 --> 0:46:05.160
<v Speaker 2>blend in with the human art robot.

0:46:06.280 --> 0:46:07.920
<v Speaker 3>Thank you so much. That was fantastic.

0:46:22.600 --> 0:46:24.839
<v Speaker 2>First of all, Tracy, I really like talking to Josh

0:46:24.960 --> 0:46:27.680
<v Speaker 2>and always like getting an update. I really do want

0:46:27.680 --> 0:46:30.200
<v Speaker 2>the folding the clothes folding robot though, Like I actually

0:46:30.280 --> 0:46:33.080
<v Speaker 2>think that's a really big deal and would make almost

0:46:33.120 --> 0:46:36.839
<v Speaker 2>everyone's life better if they didn't have to worry about

0:46:36.880 --> 0:46:37.600
<v Speaker 2>folding clothes.

0:46:37.680 --> 0:46:40.200
<v Speaker 1>I agree, it would be far more useful to have

0:46:40.280 --> 0:46:43.799
<v Speaker 1>something doing physical tasks like folding laundry versus telling you

0:46:43.880 --> 0:46:46.799
<v Speaker 1>where you're lost, Yeah, is in your desk.

0:46:47.040 --> 0:46:48.840
<v Speaker 3>I want I need that folding robot.

0:46:48.880 --> 0:46:50.920
<v Speaker 1>I mean, I will say, I know everyone likes to

0:46:51.160 --> 0:46:54.399
<v Speaker 1>make fun of Alexa as well, but our house we've

0:46:54.480 --> 0:46:57.920
<v Speaker 1>kitted out all the lights on. They're all those smart

0:46:57.960 --> 0:47:01.040
<v Speaker 1>bulbs because we don't have any overhead wiring, so like

0:47:01.080 --> 0:47:04.359
<v Speaker 1>everything has to be lamps. So if you didn't have

0:47:04.880 --> 0:47:08.040
<v Speaker 1>a robot, that was able to turn on all of

0:47:08.080 --> 0:47:10.480
<v Speaker 1>your appliances at once in a room. It would be

0:47:10.640 --> 0:47:13.160
<v Speaker 1>incredibly annoying because you would be going from lamp to

0:47:13.239 --> 0:47:15.840
<v Speaker 1>lamp to lamp. So it does make a difference in

0:47:15.880 --> 0:47:18.520
<v Speaker 1>my daily life at least, I mean, there's so much

0:47:18.920 --> 0:47:20.759
<v Speaker 1>to pull out from that. So one thing that I

0:47:20.800 --> 0:47:25.120
<v Speaker 1>thought was interesting from an industrial policy perspective was Josh's

0:47:25.560 --> 0:47:30.319
<v Speaker 1>discussion of some of the robotic capabilities being developed in

0:47:30.480 --> 0:47:33.399
<v Speaker 1>places like China and the idea that we might have

0:47:33.440 --> 0:47:37.080
<v Speaker 1>another Chips Semiconductor like situation on our hands where we

0:47:37.120 --> 0:47:40.400
<v Speaker 1>wake up in ten years and realize that a primary

0:47:40.520 --> 0:47:44.840
<v Speaker 1>component of robotics is being built much more efficiently and

0:47:44.920 --> 0:47:48.160
<v Speaker 1>cheaply elsewhere outside of the US or the West. And

0:47:48.160 --> 0:47:50.000
<v Speaker 1>then the other thing I thought was interesting was the

0:47:50.040 --> 0:47:52.879
<v Speaker 1>idea of leap frogging, right, So, I think a lot

0:47:52.920 --> 0:47:56.480
<v Speaker 1>of people, myself included, when we think of technological advances,

0:47:56.560 --> 0:48:00.239
<v Speaker 1>it's like, can this do this thing slightly faster? Can

0:48:00.280 --> 0:48:02.759
<v Speaker 1>it do it on a slightly larger scale, to the

0:48:02.760 --> 0:48:06.759
<v Speaker 1>point about the context window and expansion there. But you

0:48:06.840 --> 0:48:10.600
<v Speaker 1>can leap frog in technology, as Josh was saying, and

0:48:10.640 --> 0:48:13.520
<v Speaker 1>you can go from the sailboat to the motor boat,

0:48:13.640 --> 0:48:19.000
<v Speaker 1>or you could bypass a human evolution, for instance, and

0:48:19.120 --> 0:48:22.920
<v Speaker 1>instead of having humanoid robots, you could have a Edward

0:48:23.000 --> 0:48:26.200
<v Speaker 1>scissorhands like thing with a Swiss army knife on the

0:48:26.280 --> 0:48:26.799
<v Speaker 1>end of his arm.

0:48:26.960 --> 0:48:27.200
<v Speaker 4>Yeah.

0:48:27.200 --> 0:48:29.239
<v Speaker 2>That made a ton of sense to me, which is like,

0:48:29.280 --> 0:48:32.440
<v Speaker 2>if you're like starting from scratch, like it's not obvious

0:48:32.480 --> 0:48:37.040
<v Speaker 2>that the human form that was developed over millions of

0:48:37.120 --> 0:48:41.040
<v Speaker 2>years through evolution is necessarily the thing you want to

0:48:41.120 --> 0:48:45.400
<v Speaker 2>create or recreate to do various tasks that you need.

0:48:45.640 --> 0:48:47.040
<v Speaker 2>There was a lot in there that I liked. The

0:48:47.160 --> 0:48:48.400
<v Speaker 2>thing that he would talk about at the end. It

0:48:48.480 --> 0:48:52.040
<v Speaker 2>sort of sounded like cloud kitchens, but for biology les. Yeah,

0:48:52.120 --> 0:48:53.560
<v Speaker 2>so if you just have all the robots do it

0:48:53.600 --> 0:48:56.640
<v Speaker 2>and then they can prompt you for other ideas, that's interesting.

0:48:56.800 --> 0:49:00.480
<v Speaker 3>It does seem exciting. The idea of ways to.

0:49:00.480 --> 0:49:03.799
<v Speaker 2>Accumulate training data for these sort of like you know

0:49:04.040 --> 0:49:07.160
<v Speaker 2>that you could maybe solve the mechanical engineering, but without

0:49:07.280 --> 0:49:09.560
<v Speaker 2>you know, there's no equivalent of like all of the

0:49:09.600 --> 0:49:12.839
<v Speaker 2>text on Reddit or Wikipedia or whatever that change or

0:49:12.880 --> 0:49:16.319
<v Speaker 2>you know, Google books or YouTube. So like having to

0:49:16.360 --> 0:49:20.640
<v Speaker 2>recreate that as a bottleneck for building robots was really interesting.

0:49:21.000 --> 0:49:24.319
<v Speaker 2>I love the term he used I think was ignorance arbitrage, Yeah,

0:49:24.360 --> 0:49:26.399
<v Speaker 2>which is a really great term. So it's like, yeah,

0:49:26.480 --> 0:49:29.640
<v Speaker 2>like in a lot of like pure science spaces, you're

0:49:29.680 --> 0:49:32.279
<v Speaker 2>going to get investors who are willing to throw money

0:49:32.320 --> 0:49:34.320
<v Speaker 2>at someone who just like has a really good idea

0:49:34.360 --> 0:49:35.880
<v Speaker 2>on paper because that person is smart.

0:49:35.920 --> 0:49:38.359
<v Speaker 1>Well, I think this is also the really unusual thing

0:49:38.480 --> 0:49:41.520
<v Speaker 1>about this particular cycle, which is the dominance of the

0:49:41.560 --> 0:49:44.640
<v Speaker 1>incumbents and the fact that on the one hand, you

0:49:44.719 --> 0:49:47.239
<v Speaker 1>do have a bunch of open source software and to

0:49:47.280 --> 0:49:51.440
<v Speaker 1>some extent you can take something off of a repository

0:49:51.600 --> 0:49:53.879
<v Speaker 1>and you can pitch it to investors and say this

0:49:53.920 --> 0:49:56.319
<v Speaker 1>is the next big thing, and they might not have

0:49:56.360 --> 0:50:01.200
<v Speaker 1>the technological expertise to actually evaluate that. But when it

0:50:01.239 --> 0:50:05.759
<v Speaker 1>comes to making you know, actual advancements in something like robotics,

0:50:05.800 --> 0:50:07.880
<v Speaker 1>it does feel like you have to have an edge

0:50:07.920 --> 0:50:09.799
<v Speaker 1>in one respect or another. You either have to have

0:50:09.920 --> 0:50:12.840
<v Speaker 1>the capital to deploy or you have to have access

0:50:13.080 --> 0:50:13.840
<v Speaker 1>to that data.

0:50:14.160 --> 0:50:14.840
<v Speaker 4>So I don't know.

0:50:14.880 --> 0:50:16.080
<v Speaker 1>I guess we'll see how it shakes out.

0:50:16.200 --> 0:50:19.080
<v Speaker 2>I guess we'll have Josh back next year, yeah, next, next,

0:50:19.400 --> 0:50:21.640
<v Speaker 2>next Springer summer to see what the next big thing is.

0:50:21.640 --> 0:50:24.400
<v Speaker 1>Then hopefully he can bring a robot with him of some.

0:50:24.320 --> 0:50:25.600
<v Speaker 3>Sort or folding robot.

0:50:25.719 --> 0:50:27.080
<v Speaker 1>Yeah, all right, shall we leave it there.

0:50:27.160 --> 0:50:27.879
<v Speaker 3>Let's leave it there.

0:50:28.040 --> 0:50:30.840
<v Speaker 1>This has been another episode of the All Thoughts podcast.

0:50:30.920 --> 0:50:34.200
<v Speaker 1>I'm Tracy Alloway. You can follow me at Tracy Alloway and.

0:50:34.160 --> 0:50:36.640
<v Speaker 2>I'm joll Wisenthal. You can follow me at the Stalwart.

0:50:36.840 --> 0:50:40.240
<v Speaker 2>Follow our guest Josh Wolf. He's at Wolf Josh. Follow

0:50:40.280 --> 0:50:44.040
<v Speaker 2>our producers Carmen Rodriguez at Carman Erman dash, Ol Bennett

0:50:44.040 --> 0:50:46.960
<v Speaker 2>at Dashbot and kel Brooks at Kelbrooks. Thank you to

0:50:47.000 --> 0:50:49.760
<v Speaker 2>our producer Moses on Them. For more odd Lags content,

0:50:49.840 --> 0:50:51.920
<v Speaker 2>go to Bloomberg dot com slash odd Lots, where we

0:50:51.960 --> 0:50:55.080
<v Speaker 2>have transcripts of blog and a newsletter. And if you

0:50:55.080 --> 0:50:57.479
<v Speaker 2>want to chat about all of these topics, including AI

0:50:57.560 --> 0:51:00.279
<v Speaker 2>and robotics, there's a room in that in the lot

0:51:00.360 --> 0:51:03.800
<v Speaker 2>Discord chatroom Discord dot gg odd lots.

0:51:03.880 --> 0:51:04.600
<v Speaker 3>Go check it out.

0:51:04.880 --> 0:51:07.359
<v Speaker 1>And if you enjoy odd Thoughts, if you want us

0:51:07.400 --> 0:51:11.520
<v Speaker 1>to crowdsource buying a unitary humanoid robot or.

0:51:11.480 --> 0:51:13.040
<v Speaker 3>Something similar, I'm Ellie Express.

0:51:13.160 --> 0:51:16.239
<v Speaker 1>That's right, then please leave us a positive review on

0:51:16.320 --> 0:51:20.000
<v Speaker 1>your favorite podcast platform. And remember, if you are a

0:51:20.040 --> 0:51:23.120
<v Speaker 1>Bloomberg subscriber, you can listen to all of our episodes

0:51:23.239 --> 0:51:26.080
<v Speaker 1>absolutely ad free. All you need to do is connect

0:51:26.120 --> 0:51:30.120
<v Speaker 1>your Bloomberg subscription with Apple Podcasts. Thanks for listening

0:52:00.040 --> 0:52:01.719
<v Speaker 2>And