WEBVTT - Building a Robot That Can Walk the Walk

0:00:15.356 --> 0:00:23.156
<v Speaker 1>Pushkin. So how long have you been trying to make

0:00:23.196 --> 0:00:23.916
<v Speaker 1>a robot walk?

0:00:25.396 --> 0:00:27.796
<v Speaker 2>It's been my entire career, starting from why I went

0:00:27.836 --> 0:00:29.396
<v Speaker 2>to college in the first place.

0:00:29.636 --> 0:00:34.996
<v Speaker 1>Why why that particular problem? Why is that your life's work?

0:00:35.556 --> 0:00:38.436
<v Speaker 2>You know, there's few things more interesting and more dynamically

0:00:38.436 --> 0:00:41.436
<v Speaker 2>complex and more elegant than the way animals move in

0:00:41.476 --> 0:00:43.876
<v Speaker 2>the world. And to be able to get machines that

0:00:43.996 --> 0:00:46.436
<v Speaker 2>can move that way, they can interact physically with the

0:00:46.436 --> 0:00:49.996
<v Speaker 2>world the way humans and animals do. What a fun

0:00:50.076 --> 0:00:51.796
<v Speaker 2>and interesting thing to work on for a career.

0:00:57.756 --> 0:01:00.076
<v Speaker 1>I'm Jacob Goldstein and this is What's Your Problem? The

0:01:00.116 --> 0:01:02.036
<v Speaker 1>show where I talk to people who are trying to

0:01:02.116 --> 0:01:06.796
<v Speaker 1>make technological progress. My guest today is Jonathan Hurst. He's

0:01:06.796 --> 0:01:10.236
<v Speaker 1>a professor at Oregon State University and founder and chief

0:01:10.436 --> 0:01:15.596
<v Speaker 1>robot officer at Agility Robotics. Agility Robotics has built a

0:01:15.716 --> 0:01:19.196
<v Speaker 1>robot called Digit. Digit looks kind of like a person.

0:01:19.316 --> 0:01:21.996
<v Speaker 1>It walks around on two legs. It's got this flat,

0:01:21.996 --> 0:01:24.596
<v Speaker 1>rectangular head, and it has two arms that it can

0:01:24.676 --> 0:01:28.836
<v Speaker 1>use to pick stuff up. Jonathan's problem is this, how

0:01:28.876 --> 0:01:31.636
<v Speaker 1>can you make a walking robot that can do useful

0:01:31.676 --> 0:01:36.316
<v Speaker 1>work and that companies will actually pay for. Jonathan says

0:01:36.356 --> 0:01:40.276
<v Speaker 1>that robot Digit is already being tested out in warehouses

0:01:40.356 --> 0:01:41.276
<v Speaker 1>in the real world.

0:01:43.956 --> 0:01:46.676
<v Speaker 2>We are deploying robots with cost We have two announced.

0:01:46.676 --> 0:01:49.596
<v Speaker 2>We've announced a couple with Amazon and with GXO. You know,

0:01:50.036 --> 0:01:52.996
<v Speaker 2>you place an order and Digit handles that order as

0:01:53.036 --> 0:01:56.516
<v Speaker 2>part of the workflow that has happened is happening right now, and.

0:01:56.436 --> 0:02:02.876
<v Speaker 1>So specifically, what are your robots doing well? First at Amazon, Yeah,

0:02:02.876 --> 0:02:05.636
<v Speaker 1>the first use case or the first class of use

0:02:05.676 --> 0:02:08.836
<v Speaker 1>cases that works for us is picking up these plastic

0:02:08.876 --> 0:02:11.996
<v Speaker 1>tote plastic bins and putting them somewhere else. And then

0:02:12.556 --> 0:02:18.316
<v Speaker 1>in warehouses, Yeah, but anywhere warehouses, in logistics, in manufacturing,

0:02:18.436 --> 0:02:20.796
<v Speaker 1>you know, the whole environments like that that are a

0:02:20.796 --> 0:02:24.236
<v Speaker 1>bit structured. They're kind of like there's these islands of

0:02:24.276 --> 0:02:27.156
<v Speaker 1>automation they call them, you know, where one machine is

0:02:27.196 --> 0:02:30.076
<v Speaker 1>putting things in a bin, another machine sorts bins and

0:02:30.116 --> 0:02:33.596
<v Speaker 1>sends them different parts of the warehouse. Right now, sometimes

0:02:33.636 --> 0:02:36.156
<v Speaker 1>a person will stand there, the robot will tell them

0:02:36.156 --> 0:02:38.036
<v Speaker 1>which bin to pick up, and then all they are

0:02:38.036 --> 0:02:41.156
<v Speaker 1>basically is a manipulator for the robots system. They pick

0:02:41.236 --> 0:02:42.596
<v Speaker 1>up the bin and put it on the conveyor belt

0:02:42.636 --> 0:02:44.076
<v Speaker 1>and wait for the robot to tell them the next

0:02:44.076 --> 0:02:46.836
<v Speaker 1>thing to do. And it's really hard to hire people

0:02:46.876 --> 0:02:50.116
<v Speaker 1>for that. There are a lot of open jobs in that,

0:02:50.276 --> 0:02:52.596
<v Speaker 1>and so it's kind of a perfect place for Digit

0:02:52.676 --> 0:02:56.996
<v Speaker 1>to walk in in this relatively structured first use case. Now,

0:02:57.116 --> 0:03:01.236
<v Speaker 1>digits of course going to evolve towards picking up boxes, depalletizing,

0:03:01.636 --> 0:03:04.876
<v Speaker 1>loading and unloading you know, tractor trailers and eventually getting

0:03:04.876 --> 0:03:08.636
<v Speaker 1>out to things like retail and stocking shelves and you

0:03:08.676 --> 0:03:12.036
<v Speaker 1>know sticks in hospitals carrying things around and eventually become

0:03:12.036 --> 0:03:15.876
<v Speaker 1>a consumer product. So that's where you are today and

0:03:15.916 --> 0:03:18.356
<v Speaker 1>where you want to get. I want to talk now

0:03:18.436 --> 0:03:23.916
<v Speaker 1>about how you got here. Sure, and there's this really

0:03:24.356 --> 0:03:27.836
<v Speaker 1>basic set of things you had to figure out just

0:03:28.036 --> 0:03:33.876
<v Speaker 1>around how locomotion works, right, how people walk, also interestingly

0:03:33.916 --> 0:03:37.676
<v Speaker 1>and sort of surprisingly, how how birds like ostriches walk.

0:03:37.996 --> 0:03:42.076
<v Speaker 1>So I know there is this series of robots that

0:03:42.116 --> 0:03:45.036
<v Speaker 1>you built on the way on the way to Digit, right,

0:03:45.196 --> 0:03:47.756
<v Speaker 1>they were two before and then Digit and it seems

0:03:47.796 --> 0:03:50.396
<v Speaker 1>like going through those and what you figured out the

0:03:50.476 --> 0:03:53.716
<v Speaker 1>sort of key insight on each one is a really

0:03:53.836 --> 0:03:57.236
<v Speaker 1>nice way to get a kind of deeper insight into

0:03:57.796 --> 0:03:59.716
<v Speaker 1>how it works and what you had to understand to

0:03:59.756 --> 0:04:02.356
<v Speaker 1>make a robot that can walk great, just being a

0:04:02.396 --> 0:04:05.556
<v Speaker 1>really hard problem, right, Like there's like lots of robots

0:04:05.556 --> 0:04:08.116
<v Speaker 1>that we don't even sort of think of as robots arms,

0:04:08.156 --> 0:04:11.396
<v Speaker 1>and you know, self driving cars are arguably a kind

0:04:11.396 --> 0:04:13.716
<v Speaker 1>of robot and whatever, but like getting a robot to

0:04:13.756 --> 0:04:16.556
<v Speaker 1>walk is clearly a very hard problem.

0:04:16.716 --> 0:04:20.836
<v Speaker 2>Yeah, Okay, So where I started in trying to understand

0:04:20.916 --> 0:04:24.516
<v Speaker 2>how to make a machine work is on the biomechanics,

0:04:24.756 --> 0:04:27.156
<v Speaker 2>try to understand how animals work. And a lot of

0:04:27.156 --> 0:04:30.596
<v Speaker 2>biomechanics is about specific muscles and muscle groups and joints,

0:04:30.636 --> 0:04:33.836
<v Speaker 2>and we're instead looking at holistically from a big picture,

0:04:34.196 --> 0:04:36.396
<v Speaker 2>what is the center of mass of an animal doing?

0:04:36.436 --> 0:04:38.956
<v Speaker 2>What are the forces happening on the ground. I did

0:04:38.956 --> 0:04:42.356
<v Speaker 2>this collaboration with Monica Daily at the Royal Veterinary College

0:04:42.436 --> 0:04:45.356
<v Speaker 2>and we looked at guinea fowl and ostriches and turkeys

0:04:45.396 --> 0:04:47.636
<v Speaker 2>and you know, a whole bunch of different sizes.

0:04:47.876 --> 0:04:51.036
<v Speaker 1>Why birds. It's really interesting to me that you did that.

0:04:51.196 --> 0:04:55.156
<v Speaker 2>Why because humans are weird? How many other bipeds are there?

0:04:55.236 --> 0:04:58.436
<v Speaker 2>You know, all the evolved bipeds, all of the extant

0:04:58.556 --> 0:05:00.836
<v Speaker 2>existing bipeds in the world.

0:05:00.756 --> 0:05:02.836
<v Speaker 1>Everything that walks on two legs. Yeah, huh.

0:05:03.236 --> 0:05:07.276
<v Speaker 2>Those are all theropods. They're all kind of more like

0:05:07.316 --> 0:05:10.756
<v Speaker 2>a bird than they are like or a monkey. They

0:05:10.796 --> 0:05:14.076
<v Speaker 2>have evolved far longer than we have. You know, we've

0:05:14.076 --> 0:05:16.996
<v Speaker 2>been out of trees for a very short period of time.

0:05:17.036 --> 0:05:19.756
<v Speaker 2>Compared to all of the other existing bipeds in the world.

0:05:20.116 --> 0:05:22.636
<v Speaker 2>They can all run much faster than we can for

0:05:22.956 --> 0:05:27.836
<v Speaker 2>you know, very efficiently using less energy insul So.

0:05:27.756 --> 0:05:31.316
<v Speaker 1>Like an ostrich is a better model for just abstract

0:05:31.396 --> 0:05:34.836
<v Speaker 1>walking on two legs. Oh sure, I mean, yeah, I

0:05:34.876 --> 0:05:36.516
<v Speaker 1>love that, But I want to be clear.

0:05:36.556 --> 0:05:38.636
<v Speaker 2>What we're not trying to do is study how it

0:05:38.636 --> 0:05:40.636
<v Speaker 2>does an Ostrich run versus how a human runs. What

0:05:40.636 --> 0:05:43.196
<v Speaker 2>we're trying to do is study what is a fundamental

0:05:43.236 --> 0:05:46.476
<v Speaker 2>truth between all animals in how they run, so we

0:05:46.516 --> 0:05:49.316
<v Speaker 2>can try and weed out things that have to do

0:05:49.396 --> 0:05:51.596
<v Speaker 2>with the size of the animal, or things that have

0:05:51.676 --> 0:05:54.036
<v Speaker 2>to do with where exactly the ankle joint is or

0:05:54.036 --> 0:05:56.916
<v Speaker 2>how long this joint is or that one. We want

0:05:56.956 --> 0:06:00.476
<v Speaker 2>to know what is the same amongst all animals to

0:06:00.636 --> 0:06:01.156
<v Speaker 2>walk and run.

0:06:01.196 --> 0:06:04.556
<v Speaker 1>It's like the Platonic ideal of bipedalism. The sort of

0:06:04.556 --> 0:06:06.956
<v Speaker 1>the fundrationing theory of it. Yeah, that's right.

0:06:07.076 --> 0:06:09.636
<v Speaker 2>Yeah, And so biomechanists have been looking at this since

0:06:09.676 --> 0:06:11.996
<v Speaker 2>the seventies and thinking about this in terms of spring

0:06:12.036 --> 0:06:14.116
<v Speaker 2>mass models of locomotion. And it was only in like

0:06:14.156 --> 0:06:16.356
<v Speaker 2>the two thousands sometimes that I think it was Hart

0:06:16.396 --> 0:06:19.596
<v Speaker 2>McGuire and Andre Seifarth put together a paper that showed, hey,

0:06:19.956 --> 0:06:23.876
<v Speaker 2>this spring mass model reproduces all of the behavior we

0:06:23.956 --> 0:06:27.116
<v Speaker 2>see for walking and running and transitions between these gates.

0:06:27.756 --> 0:06:30.396
<v Speaker 1>When you say, when you say spring mass model, that

0:06:30.436 --> 0:06:32.876
<v Speaker 1>sounds big and exciting, But just to be clear, what

0:06:32.916 --> 0:06:35.036
<v Speaker 1>do you mean when you say spring mass model.

0:06:35.036 --> 0:06:41.236
<v Speaker 2>I mean a mathematical representation of a pogo stick. Go on, Okay,

0:06:41.436 --> 0:06:43.796
<v Speaker 2>So a pogo stick is basically the simplest thing that

0:06:43.876 --> 0:06:46.756
<v Speaker 2>can run, and a kangaroo looks a lot like a

0:06:46.756 --> 0:06:49.756
<v Speaker 2>pogo sticky. Now, if you just stick a pogo stick

0:06:49.796 --> 0:06:52.276
<v Speaker 2>on each leg, now you're bipedal running, you know.

0:06:52.356 --> 0:06:52.636
<v Speaker 1>Okay.

0:06:53.036 --> 0:06:54.836
<v Speaker 2>And then if you add a whole bunch of complexity

0:06:54.836 --> 0:06:57.516
<v Speaker 2>to it, you have heel toe and you know, knee

0:06:57.556 --> 0:06:59.836
<v Speaker 2>joints and all this other stuff. But if you really

0:06:59.876 --> 0:07:01.556
<v Speaker 2>boil it down and try and make it as simple

0:07:01.556 --> 0:07:04.036
<v Speaker 2>as possible, you get to some pretty basic math models

0:07:04.036 --> 0:07:07.356
<v Speaker 2>that do represent how the progression of the center of

0:07:07.436 --> 0:07:10.196
<v Speaker 2>mass of the animal moves, and how the ground reaction

0:07:10.356 --> 0:07:11.796
<v Speaker 2>forces progress and so on.

0:07:11.916 --> 0:07:14.956
<v Speaker 1>Right, Okay, so if I picture just like a lump

0:07:14.956 --> 0:07:17.476
<v Speaker 1>of mass on top of two pogo sticks, you got it.

0:07:17.516 --> 0:07:18.276
<v Speaker 1>I'm kind of in the right.

0:07:18.356 --> 0:07:21.556
<v Speaker 2>Okay, you're absolutely okay. So that's a roughly a math

0:07:21.596 --> 0:07:23.716
<v Speaker 2>model that at least gives you a very good concept

0:07:23.756 --> 0:07:28.036
<v Speaker 2>of how do all animals run, horses, ghost, crabs, humans, ostriches, whatever.

0:07:28.356 --> 0:07:31.316
<v Speaker 1>Right, So this paper comes out, this paper that says,

0:07:32.036 --> 0:07:33.836
<v Speaker 1>think of a lump of mass on top of two

0:07:33.876 --> 0:07:37.436
<v Speaker 1>pogo sticks. You you know about it because you're in

0:07:37.476 --> 0:07:39.876
<v Speaker 1>this world? What do you What effect does it have

0:07:39.916 --> 0:07:41.516
<v Speaker 1>on you? What do you do as a result of this.

0:07:42.116 --> 0:07:44.036
<v Speaker 2>What I was looking at? And here's the argument at

0:07:44.036 --> 0:07:50.036
<v Speaker 2>the time. Do these spring mass models simply seem to

0:07:50.076 --> 0:07:53.316
<v Speaker 2>describe the things we're observing, you know? Or is it

0:07:53.396 --> 0:07:56.476
<v Speaker 2>describing core physics of how it works? Like, in other words,

0:07:56.676 --> 0:07:58.956
<v Speaker 2>if you build a spring mass model and a policy

0:07:58.996 --> 0:08:00.916
<v Speaker 2>that works, is it going to make a robot stabilize?

0:08:01.116 --> 0:08:04.156
<v Speaker 2>Or is it simply like a picture that kind of

0:08:04.196 --> 0:08:05.796
<v Speaker 2>looks like what the animals are actually doing.

0:08:05.876 --> 0:08:08.316
<v Speaker 1>So like if we actually do the lump of mass

0:08:08.316 --> 0:08:10.636
<v Speaker 1>on top of two pot sticks as a robot, will

0:08:10.676 --> 0:08:11.036
<v Speaker 1>it work?

0:08:11.436 --> 0:08:13.596
<v Speaker 2>Yeah? And so look the question is, you know, how

0:08:13.636 --> 0:08:16.036
<v Speaker 2>do you control these things? And then how does that

0:08:16.076 --> 0:08:18.716
<v Speaker 2>translate into walking and running which had never really been

0:08:18.716 --> 0:08:21.476
<v Speaker 2>done before that way, you haven't done this continuous transition

0:08:21.556 --> 0:08:23.996
<v Speaker 2>between walking and running and changing the speed and everything else,

0:08:24.356 --> 0:08:26.676
<v Speaker 2>and it's unknown how to stabilize that over all kinds

0:08:26.716 --> 0:08:29.436
<v Speaker 2>of terrain. So that's why we built Atreus. That's why

0:08:29.476 --> 0:08:30.676
<v Speaker 2>we built the robot Atreus.

0:08:30.836 --> 0:08:31.996
<v Speaker 1>And so what is Atreus?

0:08:32.396 --> 0:08:35.076
<v Speaker 2>So Atreus is it is a bipedal robot.

0:08:35.196 --> 0:08:37.076
<v Speaker 1>Okay, what does Atrius look like?

0:08:38.636 --> 0:08:41.556
<v Speaker 2>So Atrius was on the Late Night with Stephen Colbert

0:08:41.796 --> 0:08:44.036
<v Speaker 2>and he described it as a microwave on stilts.

0:08:44.676 --> 0:08:51.076
<v Speaker 1>Okay, that's good. So it doesn't it doesn't look humanoid

0:08:51.076 --> 0:08:53.836
<v Speaker 1>at all. It looks maybe like a dancing alien or

0:08:53.916 --> 0:08:57.996
<v Speaker 1>like a moon lander or something, but not humanoid.

0:08:58.436 --> 0:09:00.916
<v Speaker 2>Yeah, absolutely not. So it doesn't look like an animal

0:09:00.996 --> 0:09:03.756
<v Speaker 2>in any way. But it's designed entirely to be the

0:09:03.756 --> 0:09:06.276
<v Speaker 2>math model of what we see an animal running. And

0:09:06.516 --> 0:09:09.436
<v Speaker 2>the name Atrius is an acronym for assume the robot

0:09:09.516 --> 0:09:12.476
<v Speaker 2>is a sphere, right. The whole idea is the robot

0:09:12.556 --> 0:09:13.796
<v Speaker 2>is this simple math model.

0:09:14.316 --> 0:09:16.356
<v Speaker 1>Where it's just some mass in the middle and then

0:09:16.396 --> 0:09:18.356
<v Speaker 1>some very light springy legs.

0:09:18.676 --> 0:09:21.796
<v Speaker 2>That's it. And so what we ultimately showed with Atreus

0:09:21.836 --> 0:09:25.916
<v Speaker 2>it's the first robot ever to reproduce human walking gate dynamics.

0:09:25.996 --> 0:09:28.756
<v Speaker 2>The robot walks across the force plate, a graduate student

0:09:28.756 --> 0:09:31.036
<v Speaker 2>walks across the forest plate. Looking at the data, you

0:09:31.076 --> 0:09:32.156
<v Speaker 2>can't actually tell a difference.

0:09:32.276 --> 0:09:35.596
<v Speaker 1>Huh. So, like from the plate's point of view, it

0:09:35.676 --> 0:09:38.236
<v Speaker 1>feels the same whether you're robot or a human is

0:09:38.276 --> 0:09:38.916
<v Speaker 1>walking across.

0:09:38.916 --> 0:09:39.476
<v Speaker 2>That's correct.

0:09:39.676 --> 0:09:41.956
<v Speaker 1>So it's not true for earlier robots that looked like

0:09:41.996 --> 0:09:43.356
<v Speaker 1>they were walking like humans.

0:09:43.076 --> 0:09:44.956
<v Speaker 2>They would look the same. But if you looked at

0:09:44.996 --> 0:09:47.236
<v Speaker 2>the dynamics of it, if you looked at the ground

0:09:47.276 --> 0:09:49.276
<v Speaker 2>reaction forces, they differed quite a lot.

0:09:49.796 --> 0:09:52.196
<v Speaker 1>Why is that important? Why is that important?

0:09:52.356 --> 0:09:55.396
<v Speaker 2>It's just one symptom, right to show that, hey, we've

0:09:55.436 --> 0:09:58.316
<v Speaker 2>actually captured the physics here. But the other symptom that's

0:09:58.356 --> 0:10:01.116
<v Speaker 2>important is we are able to walk and run outdoors

0:10:01.196 --> 0:10:04.156
<v Speaker 2>over all kinds of terrain without any sort of perception.

0:10:04.556 --> 0:10:07.476
<v Speaker 2>The robot can handle amazing obstacles and it would just

0:10:07.516 --> 0:10:10.236
<v Speaker 2>soak them up, you know, going over potholes, going from

0:10:10.276 --> 0:10:13.556
<v Speaker 2>grass to pavement, going over big pieces of plywood we

0:10:13.596 --> 0:10:14.556
<v Speaker 2>would throw in its way.

0:10:15.076 --> 0:10:16.756
<v Speaker 1>And you're saying it didn't do this because it had

0:10:16.796 --> 0:10:18.956
<v Speaker 1>like a clever brain. You're saying it was just the

0:10:18.956 --> 0:10:22.396
<v Speaker 1>physics of the brainless machine was able to deal.

0:10:23.756 --> 0:10:26.596
<v Speaker 2>No cameras on it, It had no awareness of the environment.

0:10:26.756 --> 0:10:29.756
<v Speaker 2>It was a very simple spring mass model, very very

0:10:29.756 --> 0:10:32.276
<v Speaker 2>simple control that did nothing but try to balance that.

0:10:32.596 --> 0:10:34.596
<v Speaker 2>And it was able to just absorb all these kinds

0:10:34.596 --> 0:10:36.276
<v Speaker 2>of disturbances and just keep on going.

0:10:36.716 --> 0:10:38.796
<v Speaker 1>That sounds like a big deal. I agree.

0:10:39.356 --> 0:10:42.476
<v Speaker 2>I'm very excited about this, right. That is the point

0:10:42.476 --> 0:10:45.036
<v Speaker 2>that we decided we were going to found Agility Robotics.

0:10:45.476 --> 0:10:47.356
<v Speaker 2>We said, you know, this was a mission for like

0:10:47.796 --> 0:10:50.556
<v Speaker 2>years and years and years. That is why I became

0:10:50.596 --> 0:10:54.716
<v Speaker 2>a professor, is to say, my goal here in academia

0:10:55.276 --> 0:10:58.916
<v Speaker 2>is to show that this spring mass physics is real

0:10:59.316 --> 0:11:02.236
<v Speaker 2>and really make sure we understand that well. And if

0:11:02.276 --> 0:11:04.156
<v Speaker 2>we can show that and prove that and with this

0:11:04.276 --> 0:11:08.476
<v Speaker 2>ATREUS project, we then can take the next steps. Right,

0:11:08.596 --> 0:11:12.636
<v Speaker 2>So that's what we did. It was a scientific kind

0:11:12.636 --> 0:11:15.636
<v Speaker 2>of breakthrough. But the machine could only walk and run.

0:11:15.956 --> 0:11:20.236
<v Speaker 2>It couldn't stand, it couldn't turn. You know, it breaks often.

0:11:20.596 --> 0:11:23.156
<v Speaker 2>If it ever fell, it would just be completely destroyed.

0:11:23.316 --> 0:11:26.236
<v Speaker 2>So it's not a productive, useful machine. It's a science demonstrator.

0:11:27.596 --> 0:11:33.196
<v Speaker 1>Okay, So you have a TRIOS that is academically it

0:11:33.356 --> 0:11:37.036
<v Speaker 1>kind of intellectually a breakthrough, right, but nobody's going to

0:11:37.116 --> 0:11:40.356
<v Speaker 1>buy it to do anything. It's not useful in a

0:11:40.516 --> 0:11:44.076
<v Speaker 1>practical sense. What do you do next?

0:11:44.436 --> 0:11:46.996
<v Speaker 2>Okay? So at that point we say, hey, you know,

0:11:47.316 --> 0:11:51.276
<v Speaker 2>we understand a really significant portion of what does it

0:11:51.316 --> 0:11:53.516
<v Speaker 2>take to make a robot that can go where people go?

0:11:54.436 --> 0:11:56.756
<v Speaker 2>And what an opportunity? This is? What are all the

0:11:56.796 --> 0:11:58.316
<v Speaker 2>things that robots are going to be able to do

0:11:58.756 --> 0:12:03.356
<v Speaker 2>when they can be coexisting with humans? Right? And the

0:12:03.516 --> 0:12:06.436
<v Speaker 2>barrier to moving forward on that now is more about

0:12:06.676 --> 0:12:11.036
<v Speaker 2>execution on engineering and execution on building a use case

0:12:11.036 --> 0:12:12.516
<v Speaker 2>in a business around it.

0:12:12.596 --> 0:12:15.196
<v Speaker 1>Because you feel like you've solved the sort of core

0:12:15.356 --> 0:12:19.116
<v Speaker 1>threshold technical problem of a robot that can walk in

0:12:19.276 --> 0:12:21.276
<v Speaker 1>unfamiliar environments and not fall down.

0:12:21.476 --> 0:12:25.316
<v Speaker 2>Correct, we have the foundation layer of leged locomotion. Now,

0:12:25.396 --> 0:12:27.676
<v Speaker 2>sure we'd like to add on perception to it so

0:12:27.676 --> 0:12:30.036
<v Speaker 2>that it can handle stairs effectively and things like that

0:12:30.116 --> 0:12:33.516
<v Speaker 2>and intentionally handle obstacles. We need to do the engineering

0:12:33.556 --> 0:12:35.796
<v Speaker 2>of the electrical system and the hardware system that still

0:12:35.796 --> 0:12:38.716
<v Speaker 2>captures the same physics but can take a beating, can stand,

0:12:38.796 --> 0:12:41.076
<v Speaker 2>can steer, can start to do things right. So we

0:12:41.156 --> 0:12:43.236
<v Speaker 2>know we're going to need to build a robot with legs,

0:12:43.236 --> 0:12:46.996
<v Speaker 2>with manipulators, with sensors. Okay, because so we're kind of

0:12:47.036 --> 0:12:49.956
<v Speaker 2>going down a path now where we want to take

0:12:49.996 --> 0:12:54.116
<v Speaker 2>the exact same first principles approach to how do we

0:12:54.156 --> 0:12:56.716
<v Speaker 2>build a machine that can manipulate things in a human

0:12:56.756 --> 0:13:00.276
<v Speaker 2>world and get around in it and interact with people.

0:13:00.316 --> 0:13:03.436
<v Speaker 2>Build a human centric machine. So the first step to

0:13:03.476 --> 0:13:06.276
<v Speaker 2>doing that was designing a robot that we could sell

0:13:06.316 --> 0:13:09.116
<v Speaker 2>to other researchers to continue the work on the leg

0:13:09.156 --> 0:13:12.516
<v Speaker 2>leggs as we then worked towards you know, arms and manipulators,

0:13:12.556 --> 0:13:15.996
<v Speaker 2>and that was Cassie, our first robot that Agilia Robotics sold.

0:13:17.476 --> 0:13:20.676
<v Speaker 2>Cassie added the ability to stand in place because it

0:13:20.716 --> 0:13:23.356
<v Speaker 2>had ankles, and it had the ability to steer because

0:13:23.396 --> 0:13:26.556
<v Speaker 2>you could turn the legs. But more importantly, it was

0:13:26.836 --> 0:13:30.716
<v Speaker 2>much more compact and extremely robust, so this robot can

0:13:30.836 --> 0:13:32.996
<v Speaker 2>fall hard on concrete and you just pick it back

0:13:33.076 --> 0:13:34.596
<v Speaker 2>up and it can get going again.

0:13:34.956 --> 0:13:37.436
<v Speaker 1>And so, Cassie, I'm looking at a picture of it now.

0:13:37.796 --> 0:13:42.316
<v Speaker 1>It basically looks like a pair of Ostrich legs. Yeah,

0:13:42.316 --> 0:13:44.796
<v Speaker 1>it looks like an Ostrich. Does an Ostrich have a wasist?

0:13:45.116 --> 0:13:47.596
<v Speaker 1>I don't know an ostrich from the waist down, No.

0:13:47.596 --> 0:13:51.476
<v Speaker 2>It doesn't know. Yeah, the pelvis is stationary and a

0:13:51.476 --> 0:13:54.116
<v Speaker 2>bird fixed rather than a human pelvis, which is mobile.

0:13:54.476 --> 0:13:57.756
<v Speaker 1>So I know it's not technically correct to say that

0:13:57.796 --> 0:14:00.756
<v Speaker 1>an Ostrich legs bend backwards, but it looks that way, right.

0:14:00.836 --> 0:14:03.396
<v Speaker 2>Yeah, Their thigh is very short, their knee is up

0:14:03.436 --> 0:14:05.676
<v Speaker 2>next to their body, and what you perceive as their

0:14:05.756 --> 0:14:06.956
<v Speaker 2>knee is actually their ankle.

0:14:07.236 --> 0:14:11.636
<v Speaker 1>In designing this robot, how do you get to legs

0:14:11.716 --> 0:14:15.316
<v Speaker 1>that look like Ostrich legs? Like, it's like convergent evolution,

0:14:15.516 --> 0:14:18.636
<v Speaker 1>you know, convergent evolution maybe hopefully. Isn't it the case

0:14:18.676 --> 0:14:23.036
<v Speaker 1>that like cephalopods that like whatever squids have eyes like

0:14:23.076 --> 0:14:26.076
<v Speaker 1>our eyes, but they evolve totally independently. Is this like that?

0:14:26.516 --> 0:14:29.036
<v Speaker 2>There's a lot of examples of convergent evolution. We can

0:14:29.076 --> 0:14:31.876
<v Speaker 2>only guess, right because we don't necessarily know, and a

0:14:31.916 --> 0:14:35.276
<v Speaker 2>scientists only hypothesize the evolutionary pressures that cause animals to

0:14:35.316 --> 0:14:36.076
<v Speaker 2>be the shape that they are.

0:14:36.316 --> 0:14:38.876
<v Speaker 1>But the pressure that led you you didn't say let's

0:14:38.916 --> 0:14:41.116
<v Speaker 1>make legs that look like Ostrich eggs. You just did

0:14:41.156 --> 0:14:42.876
<v Speaker 1>a bunch of math and you wound up with legs

0:14:42.876 --> 0:14:44.156
<v Speaker 1>that look like Ostrich legs.

0:14:44.156 --> 0:14:46.316
<v Speaker 2>That is correct, and there are a bunch of features

0:14:46.356 --> 0:14:48.836
<v Speaker 2>on our robot that have gone down a similar path.

0:14:49.036 --> 0:14:51.476
<v Speaker 2>And I actually love that because when we end up

0:14:52.636 --> 0:14:55.676
<v Speaker 2>you know, blank sheet, pursue all of the different configuration

0:14:55.716 --> 0:14:57.516
<v Speaker 2>options and say okay, here's what we think is the

0:14:57.516 --> 0:14:59.996
<v Speaker 2>optimal choice, and we say, wow, that looks just like

0:15:00.036 --> 0:15:01.956
<v Speaker 2>a person, or that just looks like a bird or something.

0:15:02.436 --> 0:15:04.556
<v Speaker 2>It's actually really good. It means we're probably on the

0:15:04.596 --> 0:15:05.076
<v Speaker 2>right path.

0:15:05.516 --> 0:15:09.076
<v Speaker 1>Yeah, it's it's exciting in a way, right, like like

0:15:09.196 --> 0:15:11.276
<v Speaker 1>you do a bunch of bath and then suddenly you

0:15:11.316 --> 0:15:12.876
<v Speaker 1>look up and you see an Ostrich.

0:15:13.156 --> 0:15:15.116
<v Speaker 2>But it won't always be that way, right because we're

0:15:15.156 --> 0:15:18.756
<v Speaker 2>not using muscle and bone. We're using aluminum and you know, electricity,

0:15:18.796 --> 0:15:19.596
<v Speaker 2>it's a whole different thing.

0:15:19.636 --> 0:15:21.956
<v Speaker 1>In a way, it's surprising that it is right, like

0:15:21.996 --> 0:15:24.276
<v Speaker 1>you would expect that it wouldn't look at all.

0:15:24.196 --> 0:15:26.316
<v Speaker 2>Familiar, but there are clear differences.

0:15:26.596 --> 0:15:29.676
<v Speaker 1>I suppose I'm projecting this. It's like, whatever is the

0:15:29.716 --> 0:15:34.356
<v Speaker 1>Ostrich version of anthropomorphizing, right, I'm Ostrich pomorphizing?

0:15:34.596 --> 0:15:36.316
<v Speaker 2>Yeah, you got it. And like you said, it's like

0:15:36.356 --> 0:15:38.436
<v Speaker 2>a cartoon version of an Ostrich leg maybe.

0:15:38.556 --> 0:15:42.436
<v Speaker 1>Yeah. Okay, So you've got this robot that is looks

0:15:42.476 --> 0:15:44.596
<v Speaker 1>to my little brain like a pair of Ostrich legs.

0:15:44.636 --> 0:15:47.036
<v Speaker 1>It's just a coincidence because it just turns out to

0:15:47.036 --> 0:15:49.036
<v Speaker 1>be the best way to build a couple legs. And

0:15:49.076 --> 0:15:52.116
<v Speaker 1>do you sell it to other academics? What do you

0:15:52.116 --> 0:15:52.516
<v Speaker 1>do it?

0:15:52.756 --> 0:15:54.716
<v Speaker 2>Yeah? We sold it to some of the top universities

0:15:54.716 --> 0:15:56.156
<v Speaker 2>in the country and the kind in the world.

0:15:59.396 --> 0:16:02.436
<v Speaker 1>So Cassie is a robot that academics can experiment with

0:16:02.636 --> 0:16:05.756
<v Speaker 1>and learn from. But it doesn't have arms. It isn't

0:16:05.756 --> 0:16:09.556
<v Speaker 1>built to do useful tasks. In a minute, Nathan and

0:16:09.636 --> 0:16:12.316
<v Speaker 1>his colleagues build a robot that does have farms, that

0:16:12.476 --> 0:16:16.316
<v Speaker 1>can do useful tasks, and that companies like Amazon are

0:16:16.356 --> 0:16:30.116
<v Speaker 1>testing out in the real world right now. The latest

0:16:30.196 --> 0:16:34.036
<v Speaker 1>robot from Jonathan's company is called Digit. It has two legs,

0:16:34.116 --> 0:16:37.316
<v Speaker 1>two arms, It walks around, it picks stuff up, and

0:16:37.396 --> 0:16:40.436
<v Speaker 1>it looks kind of like a person. But Jonathan says

0:16:40.556 --> 0:16:43.156
<v Speaker 1>he and his colleagues didn't set out to build a

0:16:43.236 --> 0:16:44.516
<v Speaker 1>robot that looks like a person.

0:16:44.916 --> 0:16:47.436
<v Speaker 2>So yeah, so like, I'll take you down one of

0:16:47.476 --> 0:16:50.476
<v Speaker 2>these thought processes that ended up looking like a person. Okay,

0:16:51.636 --> 0:16:54.796
<v Speaker 2>We said, okay, we need to do some sort of

0:16:54.796 --> 0:16:58.556
<v Speaker 2>inertial control of this thing because the robot can't turn

0:16:58.716 --> 0:17:01.036
<v Speaker 2>very well. It's got little feet, and so when you

0:17:01.076 --> 0:17:04.196
<v Speaker 2>try to turn aggressively, its skids right, okay, and if

0:17:04.196 --> 0:17:06.836
<v Speaker 2>you look at any other bipads, this is one of

0:17:06.836 --> 0:17:09.356
<v Speaker 2>the reasons wings evolved. It's because they're running in the stick,

0:17:09.396 --> 0:17:11.676
<v Speaker 2>going out to catch the air to help them in

0:17:11.836 --> 0:17:14.636
<v Speaker 2>maneuvering and turn it. You can go in a straight line,

0:17:14.876 --> 0:17:17.396
<v Speaker 2>but figuring out how to maneuver quickly as hard when

0:17:17.396 --> 0:17:19.236
<v Speaker 2>you've only got a little foot on the ground. You know,

0:17:19.676 --> 0:17:22.116
<v Speaker 2>udropeds can really plant all four feet and twist and

0:17:22.156 --> 0:17:24.676
<v Speaker 2>apply big torchs on the ground, and a biped not

0:17:24.716 --> 0:17:26.396
<v Speaker 2>so much. You've got one foot at a time. How

0:17:26.396 --> 0:17:27.756
<v Speaker 2>do you change your orientation?

0:17:27.916 --> 0:17:28.116
<v Speaker 1>Right?

0:17:28.516 --> 0:17:31.036
<v Speaker 2>So we looked at like putting a gyro on board

0:17:31.396 --> 0:17:35.076
<v Speaker 2>reaction wheels or tails or things like that, because you know,

0:17:35.116 --> 0:17:37.076
<v Speaker 2>we ruled out reaction wheels because that's just a big

0:17:37.116 --> 0:17:39.036
<v Speaker 2>thing of brass that has to be mass you don't

0:17:39.036 --> 0:17:41.756
<v Speaker 2>want in a robot like this. We thought about tails,

0:17:41.916 --> 0:17:45.596
<v Speaker 2>and you look at any animals with tails, bipeds in particular.

0:17:46.116 --> 0:17:48.876
<v Speaker 2>Typically that's to control the pitch. In other words, you'd

0:17:48.956 --> 0:17:51.476
<v Speaker 2>leap off the ground, and then you want to reorient

0:17:51.556 --> 0:17:53.956
<v Speaker 2>your body and your feet so that your feet land

0:17:54.196 --> 0:17:57.116
<v Speaker 2>forward rather than you know, just tumbling in the air. Okay,

0:17:57.636 --> 0:17:59.276
<v Speaker 2>but we don't want to control pitch. We want to

0:17:59.316 --> 0:18:02.116
<v Speaker 2>control y'all. We want to steer the robot. So what

0:18:02.156 --> 0:18:04.476
<v Speaker 2>we kind of came up with is that the best

0:18:04.516 --> 0:18:07.796
<v Speaker 2>way to do that would be a pair of tails

0:18:07.836 --> 0:18:10.556
<v Speaker 2>that are symmetrical on the front of the back or

0:18:10.596 --> 0:18:12.276
<v Speaker 2>the side of the side of the robot, so that

0:18:12.316 --> 0:18:14.916
<v Speaker 2>when you swing those tails, you're controlling exactly down the

0:18:14.916 --> 0:18:18.996
<v Speaker 2>center line of your yaw, and so that just happens

0:18:19.036 --> 0:18:21.116
<v Speaker 2>to be where our arms and shoulders are. You know,

0:18:21.676 --> 0:18:24.996
<v Speaker 2>this bilaterally symmetrical pair of tails that can inertially actuate

0:18:25.076 --> 0:18:27.076
<v Speaker 2>you around the center and allow you to steer.

0:18:27.836 --> 0:18:31.756
<v Speaker 1>So that's the sort of intellectually elegant version of how

0:18:31.796 --> 0:18:35.796
<v Speaker 1>you get to a robot that looks like it has arms.

0:18:35.956 --> 0:18:37.596
<v Speaker 1>You're saying, in fact, in terms of the way you

0:18:37.636 --> 0:18:40.156
<v Speaker 1>thought of it, it's a pair of tails that happened

0:18:40.156 --> 0:18:43.796
<v Speaker 1>to sit where our arms sit. I mean, presumably there's

0:18:43.796 --> 0:18:45.676
<v Speaker 1>a simpler version, which is, you want to build a

0:18:45.756 --> 0:18:47.796
<v Speaker 1>robot that can do stuff in a world that is

0:18:47.836 --> 0:18:50.756
<v Speaker 1>built for humans, and having arms would be useful in

0:18:50.796 --> 0:18:52.996
<v Speaker 1>that context as well. Or no, was it true?

0:18:53.036 --> 0:18:55.516
<v Speaker 2>Member? Yeah, what we're not trying to do is make

0:18:55.556 --> 0:18:57.476
<v Speaker 2>a humanoid robot that looks like a person. What we're

0:18:57.476 --> 0:19:00.516
<v Speaker 2>trying to do is on first principles, understand exactly why

0:19:00.516 --> 0:19:01.196
<v Speaker 2>we do each thing.

0:19:01.516 --> 0:19:03.676
<v Speaker 1>But are you really just adding the arm so that

0:19:03.756 --> 0:19:05.876
<v Speaker 1>it can turn when it's walking, Like I feel like

0:19:05.956 --> 0:19:07.636
<v Speaker 1>that's what you're saying, and I'm skeptical.

0:19:08.156 --> 0:19:11.196
<v Speaker 2>Yeah. It's also that there were three other reasons why

0:19:11.236 --> 0:19:13.916
<v Speaker 2>the arms should be there that were all aligned. They

0:19:13.916 --> 0:19:16.716
<v Speaker 2>were not compromises where you know, putting the arms here

0:19:16.796 --> 0:19:18.396
<v Speaker 2>is better for one thing or another.

0:19:18.676 --> 0:19:18.836
<v Speaker 1>Huh.

0:19:19.076 --> 0:19:21.356
<v Speaker 2>It's that. Hey, if you go on just the just

0:19:21.476 --> 0:19:24.716
<v Speaker 2>the path of I want to improve my locomotion capability,

0:19:25.076 --> 0:19:26.996
<v Speaker 2>you land at the solution of where the arms are.

0:19:27.356 --> 0:19:29.476
<v Speaker 2>If you go down the path of this robot's going

0:19:29.516 --> 0:19:32.236
<v Speaker 2>to fall, and we know that it can't just fall

0:19:32.276 --> 0:19:34.796
<v Speaker 2>and land on its torso it'll break things. How do

0:19:34.876 --> 0:19:37.516
<v Speaker 2>we put manipulator's arms something on it so that it's

0:19:37.516 --> 0:19:38.956
<v Speaker 2>going to be able to catch itself when.

0:19:38.876 --> 0:19:41.196
<v Speaker 1>It falls oriented? Okay.

0:19:41.676 --> 0:19:43.836
<v Speaker 2>And then the third one, of course, is picking up

0:19:43.876 --> 0:19:46.636
<v Speaker 2>things right manipulation in the world and being by manual

0:19:46.676 --> 0:19:49.796
<v Speaker 2>in your manipulation so that you can basically a giant

0:19:49.836 --> 0:19:51.916
<v Speaker 2>pincher grass. That's how you pick up boxes and tots

0:19:51.916 --> 0:19:53.916
<v Speaker 2>and all these things you want to move. That's also

0:19:53.956 --> 0:19:57.476
<v Speaker 2>the best place for them. So basically, you just set

0:19:57.516 --> 0:20:00.596
<v Speaker 2>out to build a machine that could go where humans

0:20:00.636 --> 0:20:03.316
<v Speaker 2>go and pick up things that are the size that

0:20:03.436 --> 0:20:07.436
<v Speaker 2>humans pick up, and from first principles, with your eyes closed,

0:20:07.436 --> 0:20:09.916
<v Speaker 2>you wound up with a thing that looks like a guy.

0:20:10.556 --> 0:20:13.476
<v Speaker 1>Absolutely so okay, so this is how you get to

0:20:13.876 --> 0:20:16.996
<v Speaker 1>digit the robot that you're now building and selling to people.

0:20:17.196 --> 0:20:20.516
<v Speaker 2>That's right. So now we're taking this transition right now

0:20:20.516 --> 0:20:23.716
<v Speaker 2>as a company from that very intellectual and first principles

0:20:23.716 --> 0:20:26.596
<v Speaker 2>approach that I shared with you to now working with

0:20:26.596 --> 0:20:29.756
<v Speaker 2>the customer understanding what their use case is, writing down

0:20:29.796 --> 0:20:33.356
<v Speaker 2>the sets of requirements, like you know, the temperature ranges,

0:20:33.476 --> 0:20:35.716
<v Speaker 2>the you know, weights of all the things that you're

0:20:35.756 --> 0:20:39.156
<v Speaker 2>going to be able to pick up, the safety requirements,

0:20:39.196 --> 0:20:41.916
<v Speaker 2>you know. And it's a massive list, hundreds of things

0:20:41.916 --> 0:20:44.436
<v Speaker 2>in a list to write down the requirements documents so

0:20:44.476 --> 0:20:47.156
<v Speaker 2>that we can engineer a system that is a product. Yeah,

0:20:47.236 --> 0:20:50.036
<v Speaker 2>very different from designing a robot that can do cool things.

0:20:50.356 --> 0:20:53.116
<v Speaker 2>We're engineering a product, and that's the pivot that our

0:20:53.156 --> 0:20:54.316
<v Speaker 2>company is in right the second.

0:20:54.356 --> 0:20:56.876
<v Speaker 1>And like I imagine for you personally, that must be

0:20:56.916 --> 0:20:59.636
<v Speaker 1>a significant shift, right if you spent whatever twenty some

0:20:59.836 --> 0:21:04.876
<v Speaker 1>years in the kind of abstract academic world of like

0:21:06.076 --> 0:21:09.876
<v Speaker 1>let's build the thing that works and know think deep

0:21:09.916 --> 0:21:14.076
<v Speaker 1>thoughts to like let's mass produce a product that people

0:21:14.076 --> 0:21:15.516
<v Speaker 1>will pay us for. That's quite different.

0:21:15.836 --> 0:21:18.436
<v Speaker 2>Oh, it's fundamentally different. It's a whole different way of thinking.

0:21:19.196 --> 0:21:23.676
<v Speaker 2>In fact, I changed my title to chief robot Officer, right,

0:21:23.876 --> 0:21:27.396
<v Speaker 2>and we hired Melanie Wise as our new chief technology officer.

0:21:27.756 --> 0:21:30.316
<v Speaker 2>She comes out of FET Robotics. She was the founder

0:21:30.356 --> 0:21:33.116
<v Speaker 2>there and recently sold that company and they were deploying

0:21:33.156 --> 0:21:36.156
<v Speaker 2>thousands of robots in logistics squarehouses. And she is an

0:21:36.196 --> 0:21:40.716
<v Speaker 2>absolute expert on understanding customers and product and creating a product.

0:21:40.956 --> 0:21:43.156
<v Speaker 2>And what we've done is we've shifted our organization. So

0:21:43.596 --> 0:21:46.556
<v Speaker 2>you know, Melanie is in charge span of that whole

0:21:46.556 --> 0:21:49.236
<v Speaker 2>product side of the organization and the engineering to make

0:21:49.276 --> 0:21:52.116
<v Speaker 2>a product. I'm now leading the innovation team.

0:21:52.596 --> 0:21:55.076
<v Speaker 1>So you get to keep doing kind of the stuff

0:21:55.116 --> 0:21:56.036
<v Speaker 1>you've been doing.

0:21:56.236 --> 0:21:57.156
<v Speaker 2>The things I'm good at.

0:21:57.356 --> 0:22:00.556
<v Speaker 1>Yeah, so what is the frontier on the innovation side.

0:22:00.556 --> 0:22:02.076
<v Speaker 1>What are you trying to figure out next?

0:22:02.676 --> 0:22:07.556
<v Speaker 2>It is fundamentally hardware that enables the kinds of physics

0:22:07.556 --> 0:22:10.316
<v Speaker 2>that we want to achieve, powered by some of these

0:22:10.356 --> 0:22:13.196
<v Speaker 2>new AI tools. You know, we're getting to a point

0:22:13.196 --> 0:22:16.796
<v Speaker 2>now where some of these tools will allow us to

0:22:16.876 --> 0:22:20.516
<v Speaker 2>create behaviors and create things that as an engineer we

0:22:20.556 --> 0:22:24.396
<v Speaker 2>don't know how to model. Huh, And that's super interesting.

0:22:24.596 --> 0:22:27.196
<v Speaker 2>So instead you're describing the symptoms of it, and then

0:22:27.196 --> 0:22:30.476
<v Speaker 2>the system, the learning system, figures out how to make

0:22:30.516 --> 0:22:31.036
<v Speaker 2>that happen.

0:22:31.196 --> 0:22:34.436
<v Speaker 1>Amazingly different than what you've been describing. You've been describing

0:22:34.436 --> 0:22:37.636
<v Speaker 1>of like, let's think of you know, first principles, just

0:22:37.796 --> 0:22:41.196
<v Speaker 1>the physics of the universe, and from that build a machine.

0:22:41.596 --> 0:22:44.916
<v Speaker 1>And now you're talking about you know, an era when

0:22:45.036 --> 0:22:47.596
<v Speaker 1>possibly you'll be able to ignore all of that and

0:22:47.636 --> 0:22:49.676
<v Speaker 1>say to the AI, you figure it out, here's what

0:22:49.756 --> 0:22:50.356
<v Speaker 1>I want to do.

0:22:50.756 --> 0:22:52.916
<v Speaker 2>Well, let me put some caveats on that.

0:22:52.996 --> 0:22:54.916
<v Speaker 1>Yeah, that sounds ridiculous when I say it that way.

0:22:55.036 --> 0:22:56.836
<v Speaker 2>Well, remember that the AI has to operate on a

0:22:56.836 --> 0:22:59.436
<v Speaker 2>piece of hardware. Yeah right, And so that piece of

0:22:59.436 --> 0:23:02.796
<v Speaker 2>hardware we still have to engineer and design to be

0:23:02.996 --> 0:23:05.276
<v Speaker 2>able to achieve the physics that we want to achieve.

0:23:05.556 --> 0:23:07.636
<v Speaker 1>Though you could say to an AI, here's what we

0:23:07.676 --> 0:23:10.356
<v Speaker 1>want to do. What should the hardware look like? Maybe

0:23:10.396 --> 0:23:13.116
<v Speaker 1>in one hundred years you think one hundred, one hundred,

0:23:13.116 --> 0:23:13.916
<v Speaker 1>who knows one hundred.

0:23:14.516 --> 0:23:18.316
<v Speaker 2>It's fine, fine, three years, you know, future, Not today.

0:23:18.076 --> 0:23:21.316
<v Speaker 1>Not certainly not. So what specifically are you doing today?

0:23:21.316 --> 0:23:23.356
<v Speaker 1>Are you taking this robot that you have digit and

0:23:23.396 --> 0:23:26.516
<v Speaker 1>seeing if you sort of put an AI layer in it,

0:23:26.596 --> 0:23:27.876
<v Speaker 1>on it near it? What can you do?

0:23:27.956 --> 0:23:30.596
<v Speaker 2>Is that what's happening all of the above, So, you know,

0:23:30.636 --> 0:23:32.156
<v Speaker 2>on the hardware side, So there's a lot on the

0:23:32.156 --> 0:23:34.796
<v Speaker 2>hardware you do just to make it even possible for

0:23:34.836 --> 0:23:38.116
<v Speaker 2>the AI to learn. We're building then a whole architecture

0:23:38.156 --> 0:23:40.236
<v Speaker 2>of a digital twin so that you can learn things

0:23:40.276 --> 0:23:42.876
<v Speaker 2>in simulation first, and then you know, transfer from sim

0:23:42.916 --> 0:23:43.196
<v Speaker 2>to reel.

0:23:43.796 --> 0:23:46.556
<v Speaker 1>A digital twin is basically making a version of the

0:23:46.636 --> 0:23:49.716
<v Speaker 1>robot that exists as software that exists virtually.

0:23:49.476 --> 0:23:52.596
<v Speaker 2>Version of the environment as well that the robot operates in,

0:23:52.796 --> 0:23:55.836
<v Speaker 2>so that everything can be done, you know, decades of

0:23:55.876 --> 0:23:58.476
<v Speaker 2>experience on the robot can be done in hours of

0:23:58.516 --> 0:24:00.316
<v Speaker 2>time through the you know, et cetera.

0:24:00.796 --> 0:24:03.156
<v Speaker 1>So the digital twin is allowing you to try and

0:24:03.236 --> 0:24:06.396
<v Speaker 1>generate data to train the AI. Is that that's what's the.

0:24:06.316 --> 0:24:08.676
<v Speaker 2>Source of the data, right, A lot of language models

0:24:08.676 --> 0:24:11.316
<v Speaker 2>and things like that are based on data from the internet. Well,

0:24:11.756 --> 0:24:14.396
<v Speaker 2>I don't think that that's feasible for robot control because

0:24:14.436 --> 0:24:16.556
<v Speaker 2>the physics of the hardware is so unique. So even

0:24:16.596 --> 0:24:19.196
<v Speaker 2>if you're trying to teleoperate this thing, you've got this

0:24:19.276 --> 0:24:22.516
<v Speaker 2>weird translation between what a person would do to then

0:24:22.556 --> 0:24:25.076
<v Speaker 2>the robot trying to mimic that, which is probably not

0:24:25.236 --> 0:24:25.956
<v Speaker 2>the dime of the right.

0:24:25.876 --> 0:24:28.756
<v Speaker 1>Dynamic robot doesn't actually walk like a person, even if

0:24:28.796 --> 0:24:29.796
<v Speaker 1>it looks like it's.

0:24:29.596 --> 0:24:32.476
<v Speaker 2>Right, that's right, everything's sort of different internally about it.

0:24:32.476 --> 0:24:33.476
<v Speaker 2>How it would control itself.

0:24:34.076 --> 0:24:40.596
<v Speaker 1>What what what are you worried about? Like you what

0:24:40.636 --> 0:24:42.316
<v Speaker 1>could go wrong and how are you trying to get

0:24:42.316 --> 0:24:43.076
<v Speaker 1>it not to go wrong?

0:24:43.156 --> 0:24:45.036
<v Speaker 2>So before I talk about what I'm worried about, let

0:24:45.036 --> 0:24:47.476
<v Speaker 2>me tell you what I'm excited about. Fair We had

0:24:47.516 --> 0:24:49.876
<v Speaker 2>ten of these Cassie robots out in the world, and

0:24:49.956 --> 0:24:51.756
<v Speaker 2>so researchers all over the place for for years and

0:24:51.836 --> 0:24:54.676
<v Speaker 2>years are working on various kinds of controllers. When we

0:24:54.676 --> 0:24:58.196
<v Speaker 2>were able to successfully get a learned policy working on Cassie,

0:24:58.556 --> 0:25:01.316
<v Speaker 2>were to run a five k across campus, we were

0:25:01.356 --> 0:25:04.356
<v Speaker 2>able to the world record in the one hundred meter dash.

0:25:04.476 --> 0:25:05.396
<v Speaker 2>We were able to.

0:25:05.316 --> 0:25:08.636
<v Speaker 1>Do for a robot. Yes, and when you say learned,

0:25:08.636 --> 0:25:11.076
<v Speaker 1>you mean developed with machine learning as opposed to in

0:25:11.076 --> 0:25:12.036
<v Speaker 1>the old way. Is that what you mean?

0:25:12.156 --> 0:25:15.796
<v Speaker 2>Correct? It's an entirely machine learned policy that was learned

0:25:15.796 --> 0:25:18.396
<v Speaker 2>in simulation and then put on the thing.

0:25:18.916 --> 0:25:22.836
<v Speaker 1>What's a definition of a policy? As you're using the word, I.

0:25:22.796 --> 0:25:26.636
<v Speaker 2>Mean, a policy is just a bunch of math that

0:25:27.076 --> 0:25:30.196
<v Speaker 2>takes as input all of the sensors and then spits

0:25:30.236 --> 0:25:33.796
<v Speaker 2>out numbers that describe the torques you should apply to

0:25:33.836 --> 0:25:34.396
<v Speaker 2>the motors.

0:25:34.596 --> 0:25:37.836
<v Speaker 1>Okay, so it's sort of if this happens in the world, robot,

0:25:37.916 --> 0:25:39.636
<v Speaker 1>you should do this set of things.

0:25:40.356 --> 0:25:42.356
<v Speaker 2>Yeah, you're based on you know, but you know it's

0:25:42.396 --> 0:25:45.996
<v Speaker 2>like the input from thirty encoders or one hundred different sensors,

0:25:46.036 --> 0:25:47.356
<v Speaker 2>all of that complex input.

0:25:47.476 --> 0:25:51.316
<v Speaker 1>But if this is complicated and the then that are complicated, Yeah.

0:25:51.196 --> 0:25:53.396
<v Speaker 2>No, you're rreat you're right about that. Yeah, it's an equation.

0:25:53.676 --> 0:25:57.916
<v Speaker 1>Good. So the machine learning basically made the robot work

0:25:57.956 --> 0:25:58.516
<v Speaker 1>way better.

0:25:58.836 --> 0:26:02.596
<v Speaker 2>It made the robot work much better, and even more importantly,

0:26:02.836 --> 0:26:06.116
<v Speaker 2>once the pipeline was in place, we can learn new

0:26:06.116 --> 0:26:09.756
<v Speaker 2>skills and learn new policies much faster with much less

0:26:09.796 --> 0:26:13.356
<v Speaker 2>engineer time. How we can get there faster and have

0:26:13.436 --> 0:26:16.436
<v Speaker 2>higher performance using learning approaches to control.

0:26:16.436 --> 0:26:19.196
<v Speaker 1>It's like a productivity like supercharger. It just makes everything

0:26:19.236 --> 0:26:20.796
<v Speaker 1>go much faster and more efficiently.

0:26:21.036 --> 0:26:25.156
<v Speaker 2>Absolutely, it's a new tool. It's amazing, that's right. Okay,

0:26:25.236 --> 0:26:29.076
<v Speaker 2>So what am I worried about? Right? What am I

0:26:29.156 --> 0:26:32.316
<v Speaker 2>worried about? I'm worried that this is one of those

0:26:32.396 --> 0:26:34.396
<v Speaker 2>kind of black Swan events. Right, this is one of

0:26:34.396 --> 0:26:38.476
<v Speaker 2>those things that changes everything, and everybody doesn't exactly know

0:26:38.556 --> 0:26:40.996
<v Speaker 2>what all the implications are yet and what are the

0:26:41.316 --> 0:26:45.236
<v Speaker 2>you know, the right paths forward, and so everybody's trying everything.

0:26:45.116 --> 0:26:50.396
<v Speaker 1>And this being basically a useful bipedal robot like it,

0:26:50.636 --> 0:26:53.316
<v Speaker 1>it could be hugely important in ways that we don't understand,

0:26:53.356 --> 0:26:55.156
<v Speaker 1>and there could be unintended consequences.

0:26:55.196 --> 0:26:59.756
<v Speaker 2>What I actually mean is that this new realization that

0:26:59.836 --> 0:27:05.956
<v Speaker 2>we can use learning policies to control dynamic robots and machines, yeah,

0:27:06.396 --> 0:27:11.356
<v Speaker 2>means that the entire way it all robotics controls people

0:27:11.396 --> 0:27:16.156
<v Speaker 2>have been doing robot controls before is not as relevant.

0:27:17.076 --> 0:27:20.516
<v Speaker 2>And this new tool that nobody really understands that well

0:27:20.596 --> 0:27:23.836
<v Speaker 2>yet is clearly the future of how it's going to work.

0:27:25.036 --> 0:27:29.676
<v Speaker 1>So what are I mean? I understand that if it's

0:27:29.676 --> 0:27:31.956
<v Speaker 1>really a black swan, you don't know what's going to happen,

0:27:31.956 --> 0:27:33.596
<v Speaker 1>because if you knew, it wouldn't be a black swan,

0:27:33.676 --> 0:27:36.796
<v Speaker 1>But like, what are you thinking of? What could it mean?

0:27:37.276 --> 0:27:41.156
<v Speaker 1>You know, plainly, bipedal robots are a very powerful tool,

0:27:41.236 --> 0:27:46.556
<v Speaker 1>and you could imagine malevolent uses of them, right, I guess, So,

0:27:46.716 --> 0:27:48.956
<v Speaker 1>I guess we've already got drones. Right, It doesn't matter

0:27:48.956 --> 0:27:51.156
<v Speaker 1>whether the robot that kills you looks like a dude, right,

0:27:51.196 --> 0:27:52.276
<v Speaker 1>those things already exist.

0:27:52.516 --> 0:27:54.756
<v Speaker 2>In fact, a humanoid robot is probably the least effective

0:27:54.756 --> 0:27:58.076
<v Speaker 2>way to do that I think, honestly, my biggest worry

0:27:58.116 --> 0:28:00.196
<v Speaker 2>about AI in general has a lot more to do

0:28:00.276 --> 0:28:03.956
<v Speaker 2>with its ability to influence people, its ability to model

0:28:03.996 --> 0:28:06.636
<v Speaker 2>people's feelings and that kind of thing.

0:28:06.836 --> 0:28:09.476
<v Speaker 1>So that's more like using large language model for kind

0:28:09.516 --> 0:28:11.396
<v Speaker 1>of personalized misinformation or that.

0:28:11.436 --> 0:28:15.036
<v Speaker 2>Stat it that's the biggest threat building robots that are

0:28:15.036 --> 0:28:17.956
<v Speaker 2>going to like take people. I don't know, I just

0:28:17.996 --> 0:28:19.436
<v Speaker 2>don't see it fair.

0:28:19.876 --> 0:28:22.156
<v Speaker 1>I mean, I'm kind of tired of talking about technological

0:28:22.236 --> 0:28:25.156
<v Speaker 1>unemployment because the robot looks like a person. It makes

0:28:25.196 --> 0:28:26.836
<v Speaker 1>me feel like we should touch on it. Do you

0:28:26.876 --> 0:28:28.716
<v Speaker 1>want to just speak for a moment to the prospect

0:28:28.756 --> 0:28:30.156
<v Speaker 1>of technological unemployment?

0:28:30.356 --> 0:28:33.316
<v Speaker 2>Sure, I'll say our entire business model is centered around

0:28:33.356 --> 0:28:36.556
<v Speaker 2>the number of unfilled roles that exist in the logistics environment.

0:28:37.276 --> 0:28:39.316
<v Speaker 2>It's not centered around how it's going to be less

0:28:39.316 --> 0:28:44.036
<v Speaker 2>expensive than human Labor's centered around how they actually in

0:28:44.076 --> 0:28:48.156
<v Speaker 2>geographic locations do not have enough people to provide the

0:28:48.196 --> 0:28:51.156
<v Speaker 2>service that they're providing, you know, the way they're doing

0:28:51.156 --> 0:28:55.556
<v Speaker 2>it now, there's no way forward for improved logistics and

0:28:55.636 --> 0:28:58.196
<v Speaker 2>getting your things in one day and you know, all

0:28:58.236 --> 0:29:00.916
<v Speaker 2>the stuff that people really really want. There's no path

0:29:00.956 --> 0:29:03.556
<v Speaker 2>forward to do that with more human labor doing it,

0:29:03.556 --> 0:29:07.756
<v Speaker 2>it must be automated in a significant way. So you know,

0:29:07.796 --> 0:29:10.036
<v Speaker 2>that's our whole business model is based on unfilled roles.

0:29:11.356 --> 0:29:14.916
<v Speaker 1>What is your like happy version of the future in

0:29:14.996 --> 0:29:17.116
<v Speaker 1>whatever number of years seems like the right number of.

0:29:17.996 --> 0:29:21.996
<v Speaker 2>Man These robots are actually safe and smart enough in

0:29:22.076 --> 0:29:23.796
<v Speaker 2>order to do a lot of useful things in the world.

0:29:23.876 --> 0:29:25.836
<v Speaker 2>And the relationship with people is kind of like a

0:29:25.836 --> 0:29:29.036
<v Speaker 2>service animal. And you know, these robots are everywhere and

0:29:29.636 --> 0:29:32.316
<v Speaker 2>delivering all the packages to your door, and you know,

0:29:32.396 --> 0:29:34.916
<v Speaker 2>being a telepresence device that you can easily log into

0:29:34.956 --> 0:29:37.796
<v Speaker 2>and keep in touch with people, and you know, in

0:29:38.196 --> 0:29:41.436
<v Speaker 2>a lot of warehousing and stocking and you know, doing

0:29:41.476 --> 0:29:44.116
<v Speaker 2>the dull, dirty, dangerous to classic three d's of robotics.

0:29:45.436 --> 0:29:49.556
<v Speaker 2>It's we've always wanted that. It's all about improving the

0:29:49.596 --> 0:29:52.756
<v Speaker 2>quality of life. It's all about letting enabling humans to

0:29:52.796 --> 0:29:55.196
<v Speaker 2>be more human, letting people do the things that are

0:29:55.436 --> 0:29:58.116
<v Speaker 2>that they want to do that involve the social interaction

0:29:58.196 --> 0:30:00.716
<v Speaker 2>and the creativity and the variety that people are so

0:30:00.756 --> 0:30:02.956
<v Speaker 2>good at, and having robots that can pick up all

0:30:02.996 --> 0:30:05.956
<v Speaker 2>of the tasks that we'd rather not do, and having

0:30:06.036 --> 0:30:08.996
<v Speaker 2>robots that can be in environments that are designed around us.

0:30:09.716 --> 0:30:12.436
<v Speaker 2>Is a really important step to being able to achieve

0:30:12.476 --> 0:30:13.316
<v Speaker 2>that in a really great way.

0:30:14.436 --> 0:30:15.956
<v Speaker 1>Is there anything else you want to talk about?

0:30:17.396 --> 0:30:22.076
<v Speaker 2>MM. One thing I want to make sure that we

0:30:22.116 --> 0:30:26.796
<v Speaker 2>get across is kind of the clear argument that the

0:30:26.956 --> 0:30:30.516
<v Speaker 2>fastest path to general purpose machine that does use full

0:30:30.516 --> 0:30:33.196
<v Speaker 2>work in human spaces is to do one thing first

0:30:33.436 --> 0:30:35.316
<v Speaker 2>and then the second thing, and to do it for

0:30:35.396 --> 0:30:38.356
<v Speaker 2>customers and to get it deployed and to figure out

0:30:38.396 --> 0:30:41.036
<v Speaker 2>the reliability and the safety and so on. That's the path.

0:30:41.836 --> 0:30:43.556
<v Speaker 2>There is no good way to just like jump to

0:30:43.596 --> 0:30:44.036
<v Speaker 2>the answer.

0:30:44.356 --> 0:30:46.916
<v Speaker 1>So basically you're saying, you can't just build a robot

0:30:46.956 --> 0:30:48.636
<v Speaker 1>that does everything. You have to build a robot that

0:30:48.636 --> 0:30:51.036
<v Speaker 1>does one thing and then figure out how to make

0:30:51.076 --> 0:30:51.796
<v Speaker 1>it to a second thing.

0:30:52.276 --> 0:30:53.836
<v Speaker 2>Yeah, but you want to stay on that vision. You

0:30:53.836 --> 0:30:55.956
<v Speaker 2>want to have your guess of what the everything robot

0:30:55.956 --> 0:30:58.356
<v Speaker 2>looks like, but you know you're going to be wrong,

0:30:58.956 --> 0:31:00.876
<v Speaker 2>and so then you start on what's the match to

0:31:00.916 --> 0:31:02.756
<v Speaker 2>the first thing that it should be able to do,

0:31:03.356 --> 0:31:05.836
<v Speaker 2>and then you keep on revising and iterating on that path.

0:31:06.076 --> 0:31:08.396
<v Speaker 2>So I'll give an example. This This is through our

0:31:08.516 --> 0:31:11.156
<v Speaker 2>entire est then of building the function first and the

0:31:11.156 --> 0:31:13.636
<v Speaker 2>physics first and the first principles approach to figuring out

0:31:13.636 --> 0:31:16.116
<v Speaker 2>things like the legs. Right, yeah, I want to point out,

0:31:16.156 --> 0:31:20.676
<v Speaker 2>like hands, how do you produce a dextrous manipulator? By dexterous,

0:31:20.756 --> 0:31:22.596
<v Speaker 2>I mean something that can pick up pens and pick

0:31:22.676 --> 0:31:25.476
<v Speaker 2>up objects and do useful things with the hand, open doorknobs,

0:31:25.516 --> 0:31:28.396
<v Speaker 2>all the stuff. Right, I don't care how it looks,

0:31:28.596 --> 0:31:32.396
<v Speaker 2>I care what it does. So what we're doing for

0:31:32.436 --> 0:31:35.476
<v Speaker 2>our first use case is picking up totes that can

0:31:35.516 --> 0:31:38.516
<v Speaker 2>have a twenty kilogram bowling ball rolling around in it.

0:31:38.996 --> 0:31:41.276
<v Speaker 2>That's a very hard thing to pick up. And so

0:31:41.396 --> 0:31:43.956
<v Speaker 2>now our grippers are these big graspers that can grasp

0:31:43.996 --> 0:31:46.436
<v Speaker 2>the side of this tote and pick it up. A

0:31:46.476 --> 0:31:48.796
<v Speaker 2>lot of groups are sort of have these five figured

0:31:48.836 --> 0:31:52.556
<v Speaker 2>hands on their robots which look like a human's hand,

0:31:53.156 --> 0:31:55.476
<v Speaker 2>but certainly couldn't pick up those twenty kilogram totes.

0:31:55.676 --> 0:31:59.036
<v Speaker 1>What do you have just done? Pincer sort of basically yeah.

0:31:58.676 --> 0:32:01.236
<v Speaker 2>You know, effectively it's a big sort of four fingered

0:32:01.276 --> 0:32:03.956
<v Speaker 2>pincher thing that can have leverage on it and grasp

0:32:04.036 --> 0:32:04.956
<v Speaker 2>the sides of the tote.

0:32:05.036 --> 0:32:07.436
<v Speaker 1>I mean does that mean that it can't do sort

0:32:07.476 --> 0:32:10.476
<v Speaker 1>of fine motor things. The trade off that you're making this.

0:32:10.596 --> 0:32:12.996
<v Speaker 2>I would say the current pincher design, Yeah, it's not

0:32:13.116 --> 0:32:15.636
<v Speaker 2>designed to pick up small objects. But you know, we

0:32:15.676 --> 0:32:18.996
<v Speaker 2>see a vision where that's actually a tool, not a hand.

0:32:19.516 --> 0:32:22.276
<v Speaker 2>And just like people use tools, robots are gonna use tools.

0:32:22.636 --> 0:32:26.916
<v Speaker 2>But there's such an opportunity to have the tool be

0:32:26.996 --> 0:32:29.556
<v Speaker 2>attached at the wrist or the fingers or the elbow

0:32:29.636 --> 0:32:31.876
<v Speaker 2>or the forearm or wherever. And we don't know exactly

0:32:31.876 --> 0:32:34.636
<v Speaker 2>how that should be, and nobody does. But one thing

0:32:34.636 --> 0:32:38.636
<v Speaker 2>I can say with confidence is that these five fingered

0:32:38.636 --> 0:32:42.356
<v Speaker 2>hands cannot do dexterous manipulation. It's not just a controls problem.

0:32:42.356 --> 0:32:47.116
<v Speaker 2>They're making the same mistake of creating something that has

0:32:47.156 --> 0:32:50.876
<v Speaker 2>the same morphology. It looks like a person. It looks

0:32:50.916 --> 0:32:52.836
<v Speaker 2>like a person, but that doesn't mean it can apply

0:32:52.916 --> 0:32:54.996
<v Speaker 2>the right forces, or have the right kinematics, or have

0:32:55.036 --> 0:32:57.996
<v Speaker 2>the right dynamics or the right compliance or anything that.

0:32:58.076 --> 0:32:59.476
<v Speaker 2>We don't know how to do that. It's one of

0:32:59.476 --> 0:33:02.556
<v Speaker 2>the grand challenges in robotics. So the fastest path to

0:33:02.596 --> 0:33:05.636
<v Speaker 2>get there is to start manipulating things and do stuff

0:33:05.636 --> 0:33:08.836
<v Speaker 2>that has metrics right measures, a.

0:33:09.396 --> 0:33:11.716
<v Speaker 1>Job that someone is actually willing to pay for that's right.

0:33:11.796 --> 0:33:15.156
<v Speaker 1>So right now it's moving tots around as basically, what's

0:33:15.196 --> 0:33:15.996
<v Speaker 1>the second job?

0:33:16.876 --> 0:33:19.916
<v Speaker 2>Boxes? Cardboard boxes, huh? And then the third job is

0:33:19.956 --> 0:33:21.916
<v Speaker 2>starting we want to start doing each picking, you know,

0:33:22.196 --> 0:33:24.116
<v Speaker 2>picking up things and putting them in the boxes and

0:33:24.156 --> 0:33:25.556
<v Speaker 2>in the So those are much.

0:33:25.436 --> 0:33:30.396
<v Speaker 1>Smaller things, different than picking up a big punt.

0:33:30.236 --> 0:33:32.476
<v Speaker 2>That's right. And so the question is, and nobody knows

0:33:32.476 --> 0:33:35.156
<v Speaker 2>the answer to this, is that two different tools or

0:33:35.236 --> 0:33:38.036
<v Speaker 2>is that one general purpose manipulator that can do both

0:33:38.036 --> 0:33:38.476
<v Speaker 2>of those things.

0:33:38.476 --> 0:33:40.196
<v Speaker 1>They need to put a different hand on the robot

0:33:40.196 --> 0:33:42.156
<v Speaker 1>to pick up a little thing versus pick up a

0:33:42.156 --> 0:33:43.756
<v Speaker 1>big sure, or is it optimal?

0:33:44.556 --> 0:33:46.396
<v Speaker 2>And you know this will be the this gets into

0:33:46.756 --> 0:33:51.316
<v Speaker 2>customer requirements question rather than fundamental science question. Yeah, that's

0:33:51.516 --> 0:33:53.676
<v Speaker 2>your that's for your product person to figure out. Yeah,

0:33:53.716 --> 0:33:55.716
<v Speaker 2>but that's how this is going to evolve. That's how

0:33:55.716 --> 0:33:57.956
<v Speaker 2>we're going to get to dexterous manipulation in the world

0:33:58.076 --> 0:33:59.356
<v Speaker 2>in a way that really is meaningful.

0:33:59.396 --> 0:34:01.836
<v Speaker 1>Investors, one job at a ton, Yeah.

0:34:01.756 --> 0:34:05.196
<v Speaker 2>Figure out if you've got the first like you know,

0:34:05.236 --> 0:34:07.596
<v Speaker 2>the first four or five jobs, and now you've got

0:34:07.596 --> 0:34:11.076
<v Speaker 2>your requirements, yours of requirements and your measurements and your

0:34:11.076 --> 0:34:13.236
<v Speaker 2>metrics and now engineers are going to be able to

0:34:13.236 --> 0:34:16.436
<v Speaker 2>iterate on that really make something work. But just trying

0:34:16.436 --> 0:34:17.916
<v Speaker 2>to copy a five figure in hand and say now

0:34:17.956 --> 0:34:20.556
<v Speaker 2>now it's an AI problem completely false, that's just not

0:34:20.676 --> 0:34:21.196
<v Speaker 2>going to happen.

0:34:24.956 --> 0:34:27.196
<v Speaker 1>We'll be back in a minute with the lightning round.

0:34:38.076 --> 0:34:42.556
<v Speaker 1>Let's finish with the lightning round. I've read that you

0:34:42.676 --> 0:34:45.196
<v Speaker 1>like to jog at night for work, but like, are

0:34:45.236 --> 0:34:47.396
<v Speaker 1>there specific things that have happened to you or that

0:34:47.476 --> 0:34:50.756
<v Speaker 1>you've done to try and sort of you know, put

0:34:50.796 --> 0:34:54.396
<v Speaker 1>yourself in hard walking or running settings and where you've

0:34:54.436 --> 0:34:57.076
<v Speaker 1>actually had a thing happened and thought, oh, we need

0:34:57.116 --> 0:34:59.996
<v Speaker 1>to make sure the robot doesn't fall over when X happens.

0:35:00.916 --> 0:35:03.796
<v Speaker 2>Honestly, it's more like when you're taking a hike, you know,

0:35:03.836 --> 0:35:05.316
<v Speaker 2>and you kind of get into the mode of you're

0:35:05.356 --> 0:35:08.596
<v Speaker 2>just your long day and a long hike and you know,

0:35:08.676 --> 0:35:12.836
<v Speaker 2>watching and being mindful, I guess, and thinking about what

0:35:12.876 --> 0:35:16.076
<v Speaker 2>are my feet doing and how is the contact progressing

0:35:16.156 --> 0:35:18.636
<v Speaker 2>with the ground and how does that feel and why

0:35:18.756 --> 0:35:21.796
<v Speaker 2>is that happening? And sort of daydreaming while you're thinking

0:35:21.796 --> 0:35:24.476
<v Speaker 2>that through is a really good way then to start

0:35:24.476 --> 0:35:29.596
<v Speaker 2>to recognize connections to research papers and connections to things

0:35:29.596 --> 0:35:32.196
<v Speaker 2>that people have found scientifically, and then start to pull

0:35:32.236 --> 0:35:35.556
<v Speaker 2>together hypotheses about how you would implement something like this

0:35:35.636 --> 0:35:38.956
<v Speaker 2>on a robot, or what is necessary because we're human,

0:35:39.196 --> 0:35:42.716
<v Speaker 2>and what is necessary because it's fundamental to look emotion

0:35:43.236 --> 0:35:45.876
<v Speaker 2>like why do we have feet? That's very different from

0:35:45.916 --> 0:35:49.596
<v Speaker 2>bird feet atreus didn't have feet, you know, And then

0:35:49.636 --> 0:35:51.916
<v Speaker 2>what exactly do we get anyway? Thinking through all that

0:35:51.996 --> 0:35:54.516
<v Speaker 2>kind of stuff.

0:35:56.836 --> 0:35:58.596
<v Speaker 1>C three po or R two D two.

0:36:01.876 --> 0:36:04.476
<v Speaker 2>Why that is a tough one. I'm going to say

0:36:04.556 --> 0:36:05.316
<v Speaker 2>R two D two.

0:36:05.836 --> 0:36:09.836
<v Speaker 1>Not the bipedal one, not because that walks on two legs.

0:36:10.116 --> 0:36:12.476
<v Speaker 2>C three, Well, no, I'm going to change my mind.

0:36:13.436 --> 0:36:15.036
<v Speaker 1>I buyased it. Wait, why were you going to say

0:36:15.076 --> 0:36:16.396
<v Speaker 1>or two I was going to say to.

0:36:16.596 --> 0:36:19.796
<v Speaker 2>You because C three pos a protocol droid And if

0:36:19.836 --> 0:36:22.676
<v Speaker 2>it's just about language, why do you have legs? Was

0:36:22.716 --> 0:36:23.676
<v Speaker 2>On the other hand, you.

0:36:24.076 --> 0:36:25.756
<v Speaker 1>Just you don't need a robot at all, right, you

0:36:25.836 --> 0:36:27.556
<v Speaker 1>just need a little phone or whatever.

0:36:27.676 --> 0:36:30.756
<v Speaker 2>On the other hand, right, it's a human centric robot.

0:36:30.836 --> 0:36:32.956
<v Speaker 2>It's a robot that's meant to be a translator and

0:36:33.076 --> 0:36:35.196
<v Speaker 2>be existing with people in the room and so on,

0:36:35.436 --> 0:36:38.316
<v Speaker 2>and so actually making something that's more of a humanoid

0:36:38.436 --> 0:36:40.836
<v Speaker 2>makes more sense if it's basically a social robot.

0:36:41.196 --> 0:36:45.316
<v Speaker 1>Yeah, okay, Uh, what's something you wish that a robot

0:36:45.356 --> 0:36:48.196
<v Speaker 1>could do for you, you know, outside of work.

0:36:50.396 --> 0:36:54.716
<v Speaker 2>Honestly, what I want is for robots to make everything cheaper.

0:36:55.196 --> 0:36:58.436
<v Speaker 1>Uh huh. I love it when technology makes things cheaper. Well,

0:36:58.476 --> 0:37:00.716
<v Speaker 1>and that's what's been happening for the past couple hundred years,

0:37:00.756 --> 0:37:04.876
<v Speaker 1>and all of automation has effectively made everybody richer effectively

0:37:04.916 --> 0:37:08.396
<v Speaker 1>and improved everybody's quality of life because everything is cheaper.

0:37:09.196 --> 0:37:12.036
<v Speaker 1>And I want to be able to book my vacation

0:37:12.156 --> 0:37:15.116
<v Speaker 1>to the moon. I want to be able to not

0:37:15.196 --> 0:37:17.716
<v Speaker 1>really worry about you know, as it is, I don't

0:37:17.716 --> 0:37:20.316
<v Speaker 1>really worry about how much my phone costs or whatever.

0:37:20.396 --> 0:37:23.396
<v Speaker 1>If it breaks, it's not an expensive phone, it's fine,

0:37:23.436 --> 0:37:25.436
<v Speaker 1>it works, and all my software to transfers over. I

0:37:25.436 --> 0:37:28.196
<v Speaker 1>want everything to be like that. I want my lifestyle

0:37:28.276 --> 0:37:32.556
<v Speaker 1>to be supported by things that don't really cost a lot. Yes, well,

0:37:32.596 --> 0:37:36.036
<v Speaker 1>and I mean in the very long run, the sort

0:37:36.036 --> 0:37:40.956
<v Speaker 1>of technology driven productivity gains lift people out of poverty. Right,

0:37:40.996 --> 0:37:43.596
<v Speaker 1>everybody cares about our phones, but there are a lot

0:37:43.636 --> 0:37:45.676
<v Speaker 1>of people who have three dollars a day right now,

0:37:45.716 --> 0:37:46.996
<v Speaker 1>and it would be great if they could get to

0:37:47.076 --> 0:37:48.356
<v Speaker 1>three hundred dollars a day.

0:37:48.436 --> 0:37:50.876
<v Speaker 2>Or beyond that. It's just that, you know, all the

0:37:50.916 --> 0:37:54.716
<v Speaker 2>things that we need become very easy to acquire. Food

0:37:54.796 --> 0:37:57.876
<v Speaker 2>is no longer, labor on farms is done in an

0:37:57.876 --> 0:38:01.116
<v Speaker 2>automated way, Labor in terms of logistics and transporting things

0:38:01.116 --> 0:38:03.156
<v Speaker 2>has done in an automated way, and so all of

0:38:03.156 --> 0:38:05.796
<v Speaker 2>this stuff becomes so affordable that it's easy to uplift

0:38:05.796 --> 0:38:08.876
<v Speaker 2>the quality of life of everybody on earth. That is

0:38:08.916 --> 0:38:10.076
<v Speaker 2>what a lot from Robotics.

0:38:15.956 --> 0:38:19.116
<v Speaker 1>Jonathan Hurst is the co founder and chief robot officer

0:38:19.236 --> 0:38:24.716
<v Speaker 1>at Agility Robotics. Today's show was produced by Gabriel Hunter Chang.

0:38:24.996 --> 0:38:28.316
<v Speaker 1>It was edited by Lydia Jane Kott and engineered by

0:38:28.356 --> 0:38:31.956
<v Speaker 1>Sarah Bruguer. You can email us at problem at Pushkin

0:38:32.036 --> 0:38:35.236
<v Speaker 1>dot FM. I'm Jacob Goldstein, and we'll be back next

0:38:35.236 --> 0:38:45.556
<v Speaker 1>week with another episode of What's Your Problem