WEBVTT - Fei-Fei Li Helped Create AI, Now She Feels the Responsibility

0:00:02.759 --> 0:00:07.240
<v Speaker 1>Bloomberg Audio Studios, podcasts, radio news.

0:00:09.200 --> 0:00:12.680
<v Speaker 2>I was always a curious kid. I wasn't curious about

0:00:13.119 --> 0:00:17.360
<v Speaker 2>nightclubs or her or other things. I was an avid

0:00:17.480 --> 0:00:19.400
<v Speaker 2>lover of science.

0:00:20.640 --> 0:00:26.560
<v Speaker 1>Faith a Lee Tech CEO, Stanford professor and godmother of AI.

0:00:27.000 --> 0:00:30.720
<v Speaker 1>Are you conscious of the power that you have?

0:00:31.040 --> 0:00:36.159
<v Speaker 2>I'm conscious of my responsibility. I understand. I'm one of

0:00:36.200 --> 0:00:40.479
<v Speaker 2>the people who brought this technology to our world. The

0:00:40.560 --> 0:00:43.919
<v Speaker 2>agency is not the machines, It's ours.

0:00:45.040 --> 0:00:50.120
<v Speaker 1>From Bloomberg Weekend. This is the Michelle Hussein Show. I'm

0:00:50.120 --> 0:00:59.440
<v Speaker 1>Michelle Hussein. Have you ever thought about where AI came from?

0:00:59.640 --> 0:01:04.920
<v Speaker 1>The origin story of tools like Chat, GPT, Claude, or Gemini,

0:01:04.959 --> 0:01:08.679
<v Speaker 1>which are part of many people's everyday lives. I don't

0:01:08.720 --> 0:01:13.040
<v Speaker 1>mean the principle of training large language models on data,

0:01:13.480 --> 0:01:17.240
<v Speaker 1>but the long, often lonely work of the scientists who

0:01:17.280 --> 0:01:23.440
<v Speaker 1>believed years ago that you could teach machines to be intelligent. Well,

0:01:23.480 --> 0:01:26.800
<v Speaker 1>today's guest has lived that story because she was at

0:01:26.800 --> 0:01:31.440
<v Speaker 1>the forefront of unlocking this world for us all, and

0:01:31.520 --> 0:01:36.199
<v Speaker 1>she has a deeply resonant personal story. She was fifteen

0:01:36.280 --> 0:01:39.200
<v Speaker 1>when she came to the US from China with her parents.

0:01:39.640 --> 0:01:44.120
<v Speaker 1>She had little English life was hard, money incredibly tight,

0:01:44.920 --> 0:01:49.200
<v Speaker 1>but she excelled at school and set herself audacious problems

0:01:49.240 --> 0:01:53.160
<v Speaker 1>to solve. Her name is fay fe Lee, and by

0:01:53.160 --> 0:01:56.440
<v Speaker 1>the end of our conversation, I was asking myself, why

0:01:56.440 --> 0:01:59.840
<v Speaker 1>do most people know the name Sam Aortman and not hers.

0:02:00.560 --> 0:02:04.840
<v Speaker 1>She's the one whose breakthrough was critical, No wonder they

0:02:04.880 --> 0:02:07.120
<v Speaker 1>call her the godmother of AI.

0:02:08.440 --> 0:02:09.160
<v Speaker 2>We ended up.

0:02:09.080 --> 0:02:13.400
<v Speaker 1>Talking about everything from her own childhood to yes, whether

0:02:13.480 --> 0:02:16.600
<v Speaker 1>AI will do away with us humans, and what we

0:02:16.600 --> 0:02:20.320
<v Speaker 1>should teach the kids of today. Her answer on that

0:02:20.760 --> 0:02:24.320
<v Speaker 1>might just surprise you, but I hope you enjoy our

0:02:24.400 --> 0:02:30.720
<v Speaker 1>conversation as much as I did. Doctor fay fe Lee,

0:02:30.720 --> 0:02:34.280
<v Speaker 1>thank you for coming on the show. I'd love to

0:02:34.360 --> 0:02:39.000
<v Speaker 1>start with this remarkable period for your industry. It's three

0:02:39.080 --> 0:02:41.520
<v Speaker 1>years since chat GPT was released to the public, and

0:02:41.639 --> 0:02:45.040
<v Speaker 1>since then there have been new vehicles, new apps, huge

0:02:45.080 --> 0:02:49.040
<v Speaker 1>amounts of investment flowing towards the industry. How does this

0:02:49.160 --> 0:02:50.360
<v Speaker 1>moment feel to you?

0:02:50.960 --> 0:02:53.280
<v Speaker 2>Well, first of all, Michelle, thank you for inviting me

0:02:53.400 --> 0:02:57.200
<v Speaker 2>to this show. I'm very excited. It's a good question

0:02:57.560 --> 0:03:00.040
<v Speaker 2>because AI is not new to me. I've been in

0:03:00.120 --> 0:03:04.080
<v Speaker 2>this field for twenty five years. I live and breathe

0:03:04.120 --> 0:03:08.639
<v Speaker 2>it every day since the beginning of my career. Yet

0:03:08.880 --> 0:03:13.440
<v Speaker 2>this moment is still as daunting and almost surreal to

0:03:13.560 --> 0:03:18.760
<v Speaker 2>me in terms of its massive, profound impact. This is

0:03:18.800 --> 0:03:24.760
<v Speaker 2>a civilizational technology, and it's surreal personally to me because

0:03:25.240 --> 0:03:28.880
<v Speaker 2>I'm part of the group of scientists that made this happen,

0:03:29.680 --> 0:03:34.079
<v Speaker 2>and clearly I did not expect it'll be this massive.

0:03:34.600 --> 0:03:36.520
<v Speaker 1>When was the moment it changed? Because I know you've

0:03:36.560 --> 0:03:39.880
<v Speaker 1>talked about the years when it was like an AI

0:03:40.040 --> 0:03:44.200
<v Speaker 1>winter and what you're describing now, how the moment feels

0:03:44.240 --> 0:03:46.640
<v Speaker 1>extraordinary even to you. Is it because of the pace

0:03:46.680 --> 0:03:49.040
<v Speaker 1>of developments or is it because the world has woken

0:03:49.120 --> 0:03:51.800
<v Speaker 1>up to it and therefore turned the spotlight on people

0:03:51.840 --> 0:03:52.160
<v Speaker 1>like you.

0:03:52.680 --> 0:03:57.240
<v Speaker 2>I think it's intertwined, right, But for me to define

0:03:57.280 --> 0:04:01.480
<v Speaker 2>this as a civilizational technology is not about spotlight. It's

0:04:01.560 --> 0:04:06.080
<v Speaker 2>not even about how powerful it is, is about how

0:04:06.120 --> 0:04:12.320
<v Speaker 2>many people it impacts. Everyone's life work, while being future

0:04:13.000 --> 0:04:18.760
<v Speaker 2>will somehow be touched by or impacted by AI.

0:04:18.520 --> 0:04:20.120
<v Speaker 1>In bad ways as well as good.

0:04:20.480 --> 0:04:22.320
<v Speaker 2>What technology is a dubba sword?

0:04:22.520 --> 0:04:22.719
<v Speaker 1>Right?

0:04:22.960 --> 0:04:27.160
<v Speaker 2>Yes? I think both ways, Because since the dawn of

0:04:27.240 --> 0:04:32.440
<v Speaker 2>human civilization, we create tools we call technologies and these

0:04:32.480 --> 0:04:36.600
<v Speaker 2>tools are meant in general for doing good things, but

0:04:36.720 --> 0:04:39.800
<v Speaker 2>along the way we might intentionally use it in the

0:04:39.839 --> 0:04:43.120
<v Speaker 2>wrong way, or we might have unintended consequences.

0:04:43.320 --> 0:04:45.760
<v Speaker 1>And strands of this I know will emerge in different

0:04:45.800 --> 0:04:48.320
<v Speaker 1>ways over the course of this conversation. But you said

0:04:48.360 --> 0:04:52.240
<v Speaker 1>the word power, and I'm struck by the fact that

0:04:52.920 --> 0:04:56.640
<v Speaker 1>the power of this technology is in the hands of

0:04:57.320 --> 0:05:00.680
<v Speaker 1>a very small number of companies, most of them American.

0:05:01.040 --> 0:05:03.120
<v Speaker 1>How does that sit with you?

0:05:03.120 --> 0:05:10.000
<v Speaker 2>You're right, the major tech companies, through their massively reaching products,

0:05:10.760 --> 0:05:15.960
<v Speaker 2>they are impacting globally our society the most. I would

0:05:16.000 --> 0:05:20.160
<v Speaker 2>personally like to see this technology being much more democratized.

0:05:20.920 --> 0:05:24.640
<v Speaker 2>I would like to see no matter who builds or

0:05:24.680 --> 0:05:28.800
<v Speaker 2>holds the profound impact of this technology, do it in

0:05:28.839 --> 0:05:33.480
<v Speaker 2>a responsible way. And I also do believe every individual

0:05:34.320 --> 0:05:38.159
<v Speaker 2>in this era should feel they have the agency to

0:05:38.320 --> 0:05:39.720
<v Speaker 2>impact this technology.

0:05:39.920 --> 0:05:41.680
<v Speaker 1>We'll talk a bit more in a moment. I think

0:05:41.720 --> 0:05:45.240
<v Speaker 1>about the democratization and how that might be achieved. But you,

0:05:45.320 --> 0:05:47.760
<v Speaker 1>of course, in terms of companies in this field, you're

0:05:47.839 --> 0:05:50.240
<v Speaker 1>part of that because you are a tech CEO as

0:05:50.279 --> 0:05:52.920
<v Speaker 1>well as an academic. In fact, I think you're very

0:05:52.960 --> 0:05:55.720
<v Speaker 1>young company, little more than a year old is reportedly

0:05:55.720 --> 0:05:57.320
<v Speaker 1>already worth a billion dollars.

0:05:57.880 --> 0:06:03.279
<v Speaker 2>Yes, co founder CEO of war Laps, and we are

0:06:03.320 --> 0:06:08.240
<v Speaker 2>building the next frontier of AI, which is spatial intelligence,

0:06:08.320 --> 0:06:11.839
<v Speaker 2>which people don't hear too much about today because we're

0:06:11.880 --> 0:06:16.000
<v Speaker 2>all about large language models. Yet I believe spatial intelligence

0:06:16.080 --> 0:06:20.320
<v Speaker 2>is as critical and complimentary to language intelligence.

0:06:20.760 --> 0:06:24.520
<v Speaker 1>Releasing this idea of virtual world, which again we'll dig into.

0:06:24.600 --> 0:06:27.240
<v Speaker 1>But before we do that, I want you to take

0:06:27.320 --> 0:06:29.320
<v Speaker 1>us back again to the fact that you've seen the

0:06:29.360 --> 0:06:32.159
<v Speaker 1>whole trajectory of this industry. You've been in it for

0:06:32.200 --> 0:06:35.360
<v Speaker 1>twenty five years. I know that your first academic love

0:06:35.440 --> 0:06:38.359
<v Speaker 1>was physics. Yes, what was it in the life or

0:06:38.400 --> 0:06:40.960
<v Speaker 1>work of the physicists you most admire that made you

0:06:41.120 --> 0:06:42.839
<v Speaker 1>think beyond that particular field.

0:06:43.279 --> 0:06:46.159
<v Speaker 2>I grew up in a it's not a small town,

0:06:46.240 --> 0:06:50.279
<v Speaker 2>but it's a less well known city in China, and

0:06:50.400 --> 0:06:54.560
<v Speaker 2>I come from a small family, so you could say

0:06:54.720 --> 0:06:58.000
<v Speaker 2>life was small in the sense it was in the

0:06:58.240 --> 0:07:02.720
<v Speaker 2>eighties my childhood, which is fairly simple and isolated. You're

0:07:02.720 --> 0:07:08.880
<v Speaker 2>a child I was, and my family in general is

0:07:08.960 --> 0:07:16.240
<v Speaker 2>just small. Physics is almost the opposite. It's vast, it's audacious,

0:07:16.760 --> 0:07:20.600
<v Speaker 2>The imagination is unbounded. You look up in the sky,

0:07:21.280 --> 0:07:25.080
<v Speaker 2>you can ponder about the beginning of the universe. You

0:07:25.160 --> 0:07:28.240
<v Speaker 2>look at a piece of a snowflake, you can zoom

0:07:28.280 --> 0:07:32.640
<v Speaker 2>into the molecular structure of the matters you think about time,

0:07:32.760 --> 0:07:37.520
<v Speaker 2>you think about magnetic field, you think about nuclear It

0:07:37.600 --> 0:07:41.280
<v Speaker 2>takes my imagination to places that you can never be

0:07:41.920 --> 0:07:45.960
<v Speaker 2>in this world. And what really fundamentally fascinates me to

0:07:46.040 --> 0:07:49.480
<v Speaker 2>this day about physics is not be afraid of asking

0:07:49.560 --> 0:07:56.000
<v Speaker 2>the most boldest audacious question about our physical world, of

0:07:56.080 --> 0:07:58.560
<v Speaker 2>our universe, of where we come from.

0:07:58.720 --> 0:08:02.280
<v Speaker 1>But your audacious question, I think, was about what is intelligence?

0:08:02.440 --> 0:08:06.240
<v Speaker 2>Yes, And I think it's rooted in my love for

0:08:06.320 --> 0:08:10.760
<v Speaker 2>physics and my training in physics that I look at

0:08:10.960 --> 0:08:15.360
<v Speaker 2>each physicist I admire, look at their audacious question, right

0:08:15.400 --> 0:08:23.040
<v Speaker 2>from Newton to Maxwell, to Schrodinger to i Instell, my

0:08:23.160 --> 0:08:27.760
<v Speaker 2>favorite physicist, and I wanted to find my own audacious question.

0:08:28.160 --> 0:08:32.840
<v Speaker 2>And somewhere in the middle of college, my audacious question

0:08:33.200 --> 0:08:37.360
<v Speaker 2>shifted from physical matters to intelligence.

0:08:38.120 --> 0:08:38.679
<v Speaker 3>What is it?

0:08:38.760 --> 0:08:43.160
<v Speaker 2>How does it come about? And most fascinatingly, how do

0:08:43.240 --> 0:08:48.440
<v Speaker 2>we build intelligent machines? And that became my quest Minor Star.

0:08:48.520 --> 0:08:53.520
<v Speaker 1>And that's quantumly because from machines that were doing calculations

0:08:53.520 --> 0:08:56.760
<v Speaker 1>and computations, you're really talking about machines that learn and

0:08:56.800 --> 0:08:58.040
<v Speaker 1>that are constantly learning.

0:08:58.280 --> 0:09:01.960
<v Speaker 2>I like you use the physics pun the quantum leap.

0:09:03.760 --> 0:09:07.120
<v Speaker 1>Crossing over into into ordinary parlance. But I think, didn't

0:09:07.120 --> 0:09:09.120
<v Speaker 1>you have this light bulb moment as you're thinking about

0:09:09.160 --> 0:09:12.840
<v Speaker 1>intelligence you realize that a critical part of that is

0:09:12.880 --> 0:09:17.000
<v Speaker 1>our ability to recognize objects. The fact that around us

0:09:17.120 --> 0:09:19.559
<v Speaker 1>right now there are multiple objects, we know what they are.

0:09:19.679 --> 0:09:24.200
<v Speaker 2>The ability that humans have about recognizing objects in the

0:09:24.240 --> 0:09:28.240
<v Speaker 2>world is foundational. So I decided that was my first

0:09:28.360 --> 0:09:33.479
<v Speaker 2>love start my PhD dissertation is to build machine algorithms

0:09:33.559 --> 0:09:37.400
<v Speaker 2>to recognize as meaning objects as possible.

0:09:37.720 --> 0:09:39.920
<v Speaker 1>I think the key question is how right how do

0:09:40.000 --> 0:09:42.280
<v Speaker 1>you teach the machines? And what I found really interesting

0:09:42.360 --> 0:09:46.360
<v Speaker 1>about your background is that you are reading very widely

0:09:46.679 --> 0:09:52.040
<v Speaker 1>and there are two key breakthroughs that make your ultimate

0:09:52.080 --> 0:09:55.800
<v Speaker 1>breakthrough possible. And one is that you start thinking and

0:09:55.920 --> 0:09:59.720
<v Speaker 1>learning about what psychologists and linguists are saying that it's

0:09:59.760 --> 0:10:01.000
<v Speaker 1>really to your field.

0:10:01.240 --> 0:10:04.319
<v Speaker 2>Well, that's the beauty of doing science at the forefront,

0:10:04.520 --> 0:10:06.880
<v Speaker 2>because it's you. No one knows how to do it.

0:10:07.559 --> 0:10:11.040
<v Speaker 2>As an AI scientist, you look for possible answers, and

0:10:11.080 --> 0:10:14.840
<v Speaker 2>it's pretty natural to look at human brain and human

0:10:14.880 --> 0:10:19.880
<v Speaker 2>mind and try to understand or be inspired by what

0:10:20.080 --> 0:10:23.760
<v Speaker 2>humans can do. And one of the inspirations I got

0:10:23.880 --> 0:10:27.559
<v Speaker 2>in my early days of studying or trying to unlock

0:10:27.640 --> 0:10:31.600
<v Speaker 2>this visual intelligence problem is look at how our visual

0:10:31.920 --> 0:10:36.360
<v Speaker 2>semantic space is structured. There are so many tens and

0:10:36.480 --> 0:10:39.880
<v Speaker 2>thousands and millions of objects in the world. How are

0:10:39.920 --> 0:10:44.280
<v Speaker 2>they organized? Are they organized by alphabets or are they

0:10:44.440 --> 0:10:47.199
<v Speaker 2>organized by size or colors?

0:10:47.480 --> 0:10:49.880
<v Speaker 1>And you're asking that because you have to understand how

0:10:49.920 --> 0:10:53.640
<v Speaker 1>our brains organize in order to have something to teach

0:10:53.720 --> 0:10:54.439
<v Speaker 1>the computers.

0:10:54.679 --> 0:10:57.280
<v Speaker 2>That's one way to think about it. And I come

0:10:57.400 --> 0:11:02.440
<v Speaker 2>upon this linguistic work call word net word that is

0:11:02.480 --> 0:11:07.320
<v Speaker 2>a way to organize semantic concepts, not visual, just semantic

0:11:08.000 --> 0:11:11.200
<v Speaker 2>or words in a particular taxonomy.

0:11:11.360 --> 0:11:12.160
<v Speaker 1>Give me an example.

0:11:12.280 --> 0:11:16.760
<v Speaker 2>An example is in a dictionary, an apple and an

0:11:16.800 --> 0:11:21.080
<v Speaker 2>appliance are very close together, but in real life, an

0:11:21.120 --> 0:11:24.600
<v Speaker 2>apple and a pair a much closer to each other,

0:11:25.040 --> 0:11:29.800
<v Speaker 2>and an apple and pair both belong to fruits, whereas

0:11:29.880 --> 0:11:34.400
<v Speaker 2>an appliance belong to a whole different family of objects

0:11:34.800 --> 0:11:38.560
<v Speaker 2>and I came upon this taxonomy called word net that

0:11:39.320 --> 0:11:44.000
<v Speaker 2>describes this organization of objects, and I made a connection

0:11:44.360 --> 0:11:47.640
<v Speaker 2>in my head. Is that two things is very clear.

0:11:48.280 --> 0:11:51.800
<v Speaker 2>One is that this could be the way visual concepts

0:11:51.920 --> 0:11:55.880
<v Speaker 2>are organized, because apples and pairs are much more connected

0:11:55.960 --> 0:12:01.040
<v Speaker 2>than apples and wash machines, for example. But also even

0:12:01.120 --> 0:12:05.800
<v Speaker 2>more importantly is the scale. Is that if you look

0:12:05.880 --> 0:12:11.280
<v Speaker 2>at the number of objects described by language, realize how

0:12:11.520 --> 0:12:16.080
<v Speaker 2>vast it is, how big it is. And that was

0:12:16.240 --> 0:12:21.920
<v Speaker 2>a particular epiphany for me. We as intelligent animals, as humans,

0:12:22.480 --> 0:12:26.360
<v Speaker 2>we experienced the world with massive amount of data, and

0:12:26.400 --> 0:12:28.760
<v Speaker 2>we need to endow machines with that ability.

0:12:28.960 --> 0:12:32.120
<v Speaker 1>Hence the big data sets needed. And actually it's worth

0:12:32.200 --> 0:12:34.959
<v Speaker 1>noting that at that time, I think this is early

0:12:35.040 --> 0:12:37.719
<v Speaker 1>part of the century, the idea of big data two

0:12:37.760 --> 0:12:39.520
<v Speaker 1>thousand and six exists.

0:12:40.040 --> 0:12:44.960
<v Speaker 2>No big data was not this phrase did not even exist.

0:12:45.280 --> 0:12:48.800
<v Speaker 2>The kind of scientific data sets we were playing with

0:12:49.600 --> 0:12:54.760
<v Speaker 2>were tiny. For example, in images, most of the graduate

0:12:54.800 --> 0:12:59.160
<v Speaker 2>students in my era were playing with data sets of

0:12:59.600 --> 0:13:04.439
<v Speaker 2>four or six or at most twenty object classes each

0:13:04.640 --> 0:13:08.400
<v Speaker 2>class there were a couple of hundred examples and most

0:13:08.600 --> 0:13:12.040
<v Speaker 2>and that's how tiny it is. Whereas fast forward three

0:13:12.120 --> 0:13:17.400
<v Speaker 2>years later, after we created image that it had twenty

0:13:17.440 --> 0:13:23.720
<v Speaker 2>two thousand classes of objects and fifteen million annotated image.

0:13:23.440 --> 0:13:27.000
<v Speaker 1>And image net was a huge breakthrough. It's the reason

0:13:27.120 --> 0:13:30.000
<v Speaker 1>that you've been called the godmother of AI. I'd love

0:13:30.080 --> 0:13:34.200
<v Speaker 1>to understand more about what it is about you that

0:13:34.480 --> 0:13:37.240
<v Speaker 1>enabled you to make these connections and see things that

0:13:37.280 --> 0:13:40.400
<v Speaker 1>other scientists didn't. And one that immediately comes to mind

0:13:40.600 --> 0:13:44.040
<v Speaker 1>is that you acquired English as a second language after

0:13:44.080 --> 0:13:46.840
<v Speaker 1>you moved to the United States. Is there something in

0:13:46.880 --> 0:13:50.880
<v Speaker 1>that that led to what you're describing.

0:13:51.600 --> 0:13:54.760
<v Speaker 2>Michelle, that's a great question. I've been asked this question,

0:13:54.840 --> 0:13:59.160
<v Speaker 2>and my true answers I don't know. Human creativity is

0:13:59.280 --> 0:14:02.840
<v Speaker 2>still such a mystery. People talk about AI doing everything,

0:14:02.880 --> 0:14:05.960
<v Speaker 2>and I disagree. There's so much about the mystery of

0:14:06.040 --> 0:14:09.000
<v Speaker 2>human mind that we don't know. So I can only

0:14:09.160 --> 0:14:14.760
<v Speaker 2>conjecture that a combination of my interest and my experiences

0:14:14.840 --> 0:14:19.320
<v Speaker 2>led to this, including the curiosity and the hunger for

0:14:19.560 --> 0:14:24.440
<v Speaker 2>audacious problem definition I find myself. I'm not afraid of

0:14:24.640 --> 0:14:31.840
<v Speaker 2>asking crazy questions in science, not afraid of seeking solutions

0:14:31.880 --> 0:14:37.360
<v Speaker 2>that's out of box. And maybe my appreciation for the

0:14:37.480 --> 0:14:42.520
<v Speaker 2>linguistic link with vision might be accenterated by my own

0:14:42.600 --> 0:14:45.200
<v Speaker 2>journey of learning the different language. But I don't know

0:14:45.280 --> 0:14:45.800
<v Speaker 2>the answer.

0:14:46.560 --> 0:14:49.920
<v Speaker 1>What was it like coming to the United States as

0:14:49.920 --> 0:14:54.280
<v Speaker 1>a teenager, because I think that's a particularly difficult stage

0:14:54.320 --> 0:14:57.400
<v Speaker 1>in life to move and to have to make new friends,

0:14:57.480 --> 0:15:00.200
<v Speaker 1>even if you weren't battling a language barrier.

0:15:00.680 --> 0:15:06.400
<v Speaker 2>That was hard, I think being an immigrant. And I

0:15:06.440 --> 0:15:11.160
<v Speaker 2>came to this country when I was fifteen in New Jersey. Parcipitately,

0:15:11.200 --> 0:15:14.600
<v Speaker 2>in New Jersey, none of us, my parents nor I

0:15:14.880 --> 0:15:19.040
<v Speaker 2>spoke much English at all. I was young enough to

0:15:19.120 --> 0:15:22.800
<v Speaker 2>learn more quickly. It was very hard for my parents,

0:15:23.000 --> 0:15:26.200
<v Speaker 2>and we were not financially very well at all. My

0:15:26.280 --> 0:15:29.600
<v Speaker 2>parents were doing cashier jobs and I was doing Chinese

0:15:29.640 --> 0:15:34.840
<v Speaker 2>restaurant jobs. Eventually, by the time I entered college, also,

0:15:35.120 --> 0:15:38.320
<v Speaker 2>my mom's health was not so good. So my family

0:15:38.360 --> 0:15:41.240
<v Speaker 2>and I decided we have to run this little dry

0:15:41.320 --> 0:15:45.800
<v Speaker 2>cleaner shop in New Jersey to make some money to survive, right,

0:15:46.000 --> 0:15:49.520
<v Speaker 2>And you got involved yourself, I joked, I was the CEO.

0:15:50.800 --> 0:15:54.120
<v Speaker 2>I ran the dry cleaner shop for seven years from

0:15:54.160 --> 0:15:58.480
<v Speaker 2>what age I was eighteen nineteen to the middle of

0:15:58.520 --> 0:16:01.680
<v Speaker 2>my graduate school, so all the way through college and

0:16:01.720 --> 0:16:04.640
<v Speaker 2>graduate school, seven years, even when you're at a distance

0:16:04.680 --> 0:16:07.960
<v Speaker 2>you're running your parents strike cleaning business. Yes, because I

0:16:08.080 --> 0:16:10.000
<v Speaker 2>was the one who speak English, so I take all

0:16:10.040 --> 0:16:12.240
<v Speaker 2>the customer phone calls, I deal with the building, I

0:16:12.360 --> 0:16:16.560
<v Speaker 2>deal with inspections or the business. What did it teach

0:16:16.600 --> 0:16:21.280
<v Speaker 2>you resilience? As a scientist, you have to be resilient

0:16:21.360 --> 0:16:27.720
<v Speaker 2>because science is a nonlinear journey. Nobody has one conjecture

0:16:27.840 --> 0:16:30.240
<v Speaker 2>and all the solutions in front of you. You have

0:16:30.320 --> 0:16:34.040
<v Speaker 2>to go through such a challenge to arrive and find

0:16:34.080 --> 0:16:38.640
<v Speaker 2>an answer. And as an immigrant you learn to be resilient.

0:16:39.000 --> 0:16:42.680
<v Speaker 1>Were your parents pushing you because they clearly wanted a

0:16:42.880 --> 0:16:46.760
<v Speaker 1>better life for you, That's why they left China for

0:16:46.800 --> 0:16:49.600
<v Speaker 1>the United States. So how much of this was coming

0:16:49.640 --> 0:16:51.480
<v Speaker 1>from them and how much from your own sense of

0:16:51.560 --> 0:16:53.280
<v Speaker 1>responsibility to them.

0:16:53.720 --> 0:16:57.440
<v Speaker 2>To their credit, they did not push me much. They're

0:16:57.480 --> 0:17:01.640
<v Speaker 2>not tiger parents in today's language. I think part of

0:17:01.680 --> 0:17:04.600
<v Speaker 2>it is they're just trying to survive, to be honest,

0:17:05.440 --> 0:17:10.160
<v Speaker 2>and especially my mom, she is an intellectual at heart,

0:17:10.359 --> 0:17:16.800
<v Speaker 2>she loves reading, but the combination of a tough, survival

0:17:16.920 --> 0:17:21.400
<v Speaker 2>driven immigration life plus her health issues, she was not

0:17:21.400 --> 0:17:24.960
<v Speaker 2>pushing me at all. As a teenager, I kind of

0:17:25.040 --> 0:17:27.560
<v Speaker 2>had no choice. I either have to make it or

0:17:27.640 --> 0:17:30.920
<v Speaker 2>not make it, and the stakes are pretty high. So

0:17:31.359 --> 0:17:34.919
<v Speaker 2>I was pretty self motivated. I was just curious. I

0:17:35.040 --> 0:17:39.560
<v Speaker 2>was always a curious kid, and my curiosity had an outlet,

0:17:40.000 --> 0:17:44.000
<v Speaker 2>which was science, and that really grounded me. I wasn't

0:17:44.080 --> 0:17:48.320
<v Speaker 2>curious about nightclubs or other things. I was an avid

0:17:48.480 --> 0:17:49.880
<v Speaker 2>lover of science.

0:17:50.040 --> 0:17:52.479
<v Speaker 1>But I also going to avid reader that from your

0:17:52.520 --> 0:17:57.040
<v Speaker 1>early childhood, you're putting away the children's books, tanning towards

0:17:57.080 --> 0:17:59.919
<v Speaker 1>the classics, the drawn up books.

0:18:00.480 --> 0:18:05.240
<v Speaker 2>My mom liked reading, and she probably thought that you're

0:18:05.240 --> 0:18:09.399
<v Speaker 2>talking about when I was nine ten. At that time,

0:18:09.480 --> 0:18:13.640
<v Speaker 2>she probably did have influence more influenced. She thought I

0:18:13.800 --> 0:18:18.160
<v Speaker 2>was precocious enough that just read some grown up books.

0:18:19.520 --> 0:18:22.119
<v Speaker 1>You also had a teacher who is really important in

0:18:22.160 --> 0:18:23.280
<v Speaker 1>your life. Tell me about him.

0:18:23.760 --> 0:18:28.879
<v Speaker 2>I excelled in math, and I liked math, and I

0:18:28.960 --> 0:18:34.840
<v Speaker 2>befriended the head teacher, mister Bob Sabella. And the math teacher,

0:18:35.400 --> 0:18:41.119
<v Speaker 2>we became friends through the mutual love of science fiction reading, and.

0:18:41.080 --> 0:18:42.440
<v Speaker 1>We start read in Chinese.

0:18:43.359 --> 0:18:46.240
<v Speaker 2>At that time, I was reading in Chinese. Eventually I

0:18:46.280 --> 0:18:52.240
<v Speaker 2>started reading English. He's a remarkable person because he probably

0:18:52.280 --> 0:18:56.399
<v Speaker 2>saw in me this desire to learn, so he went

0:18:57.080 --> 0:19:00.399
<v Speaker 2>out of his way to create opportunity for me to

0:19:00.960 --> 0:19:06.480
<v Speaker 2>continue to excel in math. I remembered I placed out

0:19:06.520 --> 0:19:11.040
<v Speaker 2>of the highest curriculum of math, and so there was

0:19:11.080 --> 0:19:14.439
<v Speaker 2>no more courses for me. He would use his lunch

0:19:14.480 --> 0:19:18.880
<v Speaker 2>hours to create a one one class for me to do.

0:19:19.600 --> 0:19:22.000
<v Speaker 2>Now that I'm grown up, I know he was not

0:19:22.200 --> 0:19:26.200
<v Speaker 2>paid extra. It was really out of a teacher's love

0:19:26.359 --> 0:19:31.800
<v Speaker 2>and sense of responsibility. And he really became such an

0:19:32.200 --> 0:19:33.960
<v Speaker 2>important person in my.

0:19:33.880 --> 0:19:35.960
<v Speaker 1>Life because he still alive mister Sabella.

0:19:36.119 --> 0:19:40.159
<v Speaker 2>Mister Isabella passed away when I was assistant professor at Stanford,

0:19:40.320 --> 0:19:43.800
<v Speaker 2>but his family, his two sons, his wife, and I

0:19:43.800 --> 0:19:45.680
<v Speaker 2>I think they are my new Jersey family.

0:19:45.760 --> 0:19:48.479
<v Speaker 1>At this point, you used the word love about what

0:19:48.600 --> 0:19:51.800
<v Speaker 1>he did for you, and I wonder if they also

0:19:51.880 --> 0:19:55.320
<v Speaker 1>He and his family introduced you to American society with

0:19:55.880 --> 0:19:59.119
<v Speaker 1>your first friends and an entrance to the whole world

0:19:59.160 --> 0:20:00.679
<v Speaker 1>of America beyond your school.

0:20:00.880 --> 0:20:08.720
<v Speaker 2>Absolutely, they introduced me to the quintessential middle class American family.

0:20:09.720 --> 0:20:13.480
<v Speaker 2>They live in a suburban house, they have two lovely kids.

0:20:13.960 --> 0:20:17.040
<v Speaker 2>Of course they're all married and have the grand children

0:20:17.160 --> 0:20:20.240
<v Speaker 2>generation now, but it was a great window for me

0:20:20.400 --> 0:20:23.760
<v Speaker 2>to know the society, to be grounded, to have friends

0:20:23.800 --> 0:20:25.200
<v Speaker 2>and have a teacher who cared.

0:20:46.760 --> 0:20:49.359
<v Speaker 1>Do you think you could have had the career that

0:20:49.440 --> 0:20:53.239
<v Speaker 1>you have had in China because now there are like

0:20:53.320 --> 0:20:56.760
<v Speaker 1>significant advances happening in AI in China.

0:20:57.440 --> 0:20:59.520
<v Speaker 2>I don't think I'm able to answer this question because

0:20:59.520 --> 0:21:03.600
<v Speaker 2>I think life is so serendipitous, right the journey would

0:21:03.600 --> 0:21:08.080
<v Speaker 2>be very different, and in a way we could have

0:21:08.240 --> 0:21:13.360
<v Speaker 2>simulated all possibilities. But what is timeless or what is

0:21:13.520 --> 0:21:19.480
<v Speaker 2>invariant for anyone is the sense of curiosity, the pursuit

0:21:19.520 --> 0:21:23.760
<v Speaker 2>of north stars. So if I were me, I will

0:21:23.800 --> 0:21:26.240
<v Speaker 2>still be doing AI somehow, I believe.

0:21:27.840 --> 0:21:29.480
<v Speaker 1>Do you still feel connected to China?

0:21:30.040 --> 0:21:33.280
<v Speaker 2>It's part of my heritage. I feel very lucky that

0:21:33.520 --> 0:21:36.800
<v Speaker 2>my career has been in America, higher education and in

0:21:36.920 --> 0:21:40.560
<v Speaker 2>Silicon Valley then as also in and out of industry

0:21:41.160 --> 0:21:45.560
<v Speaker 2>and being in tech. The combination of all these ingredients

0:21:46.240 --> 0:21:50.919
<v Speaker 2>is very global. The environment my family right now is in,

0:21:51.080 --> 0:21:55.840
<v Speaker 2>which is Stanford, San Francisco, Silicon Valley, is very international,

0:21:55.920 --> 0:21:59.680
<v Speaker 2>so I feel very connected. And the discipline in which

0:21:59.720 --> 0:22:04.399
<v Speaker 2>is A is so horizontal it touches people everywhere that

0:22:04.520 --> 0:22:09.120
<v Speaker 2>I do feel much more like a global citizen at

0:22:09.119 --> 0:22:09.960
<v Speaker 2>this point.

0:22:09.800 --> 0:22:11.520
<v Speaker 1>And of course it is a global industry. But there

0:22:11.520 --> 0:22:14.040
<v Speaker 1>are some really striking advances in China, not least the

0:22:14.119 --> 0:22:17.480
<v Speaker 1>number of patents, in the number of AI public papers

0:22:17.520 --> 0:22:19.880
<v Speaker 1>coming out the Deep Seek moment earlier this year. As

0:22:19.880 --> 0:22:22.639
<v Speaker 1>you look ahead in this century, do you think China

0:22:22.680 --> 0:22:25.520
<v Speaker 1>will catch up with the US in the way that

0:22:25.600 --> 0:22:28.399
<v Speaker 1>it has in other fields like manufacturing.

0:22:28.920 --> 0:22:32.280
<v Speaker 2>I do think China is a powerhouse in AI. In

0:22:32.320 --> 0:22:36.960
<v Speaker 2>this moment, most people would recognize the two leading countries

0:22:37.119 --> 0:22:42.320
<v Speaker 2>in AI are China and US. The excitement, the energy,

0:22:42.920 --> 0:22:46.440
<v Speaker 2>and also frankly, the ambition of many regions and many

0:22:46.520 --> 0:22:49.960
<v Speaker 2>countries in the world of wanting to have a roll

0:22:50.040 --> 0:22:54.560
<v Speaker 2>in AI, wanting to catch up or even come ahead

0:22:54.600 --> 0:22:58.320
<v Speaker 2>in certain areas AI. That is pretty universal.

0:22:58.160 --> 0:23:02.840
<v Speaker 1>And your own next frontier spatial intelligence. Tell me what

0:23:03.160 --> 0:23:06.040
<v Speaker 1>you mean when you use the word spatial intelligence. What

0:23:06.080 --> 0:23:07.359
<v Speaker 1>are you working on right now?

0:23:07.560 --> 0:23:13.960
<v Speaker 2>Yeah? So, spatial intelligence is the ability for AI or frankly,

0:23:14.119 --> 0:23:20.640
<v Speaker 2>any intelligence to understand, perceive, reason and interact and also

0:23:20.800 --> 0:23:28.800
<v Speaker 2>create spaces worlds. It comes from a continuation of visual intelligence.

0:23:29.200 --> 0:23:32.040
<v Speaker 2>The first half of my career around the image that

0:23:32.040 --> 0:23:35.160
<v Speaker 2>at time was trying to solve the fundamental problem of

0:23:35.760 --> 0:23:39.520
<v Speaker 2>understanding what we are seeing, and that was a very

0:23:39.560 --> 0:23:44.040
<v Speaker 2>important problem. But it's not enough because that's a very

0:23:44.080 --> 0:23:47.800
<v Speaker 2>passive act. It's just receiving information and being able to

0:23:48.560 --> 0:23:51.879
<v Speaker 2>understand this is a cop, this is a beautiful lady,

0:23:52.280 --> 0:23:55.520
<v Speaker 2>this is a microphone. But if you look at evolution,

0:23:55.720 --> 0:24:01.840
<v Speaker 2>you look at human intelligence, perception is profound linked to action.

0:24:02.720 --> 0:24:07.040
<v Speaker 2>We see because we move, we move, therefore we need

0:24:07.080 --> 0:24:10.640
<v Speaker 2>to see better. And how do you create that connection

0:24:10.840 --> 0:24:14.320
<v Speaker 2>has a lot to do with space, because you need

0:24:14.359 --> 0:24:17.119
<v Speaker 2>to see the three D space. You need to understand

0:24:17.160 --> 0:24:20.720
<v Speaker 2>how things move. You need to understand when I touch

0:24:20.840 --> 0:24:26.600
<v Speaker 2>this cup, how do I mathematically organize my fingers so

0:24:26.680 --> 0:24:30.600
<v Speaker 2>that it creates a space that would allow me to

0:24:30.800 --> 0:24:36.080
<v Speaker 2>grab this cup. All this intricacy is center around this

0:24:36.240 --> 0:24:38.520
<v Speaker 2>capability of spatial intelligence.

0:24:38.560 --> 0:24:40.880
<v Speaker 1>And I've looked on your website and I've seen the

0:24:40.920 --> 0:24:46.399
<v Speaker 1>preview that you've released Marble. You can see essentially a

0:24:46.480 --> 0:24:51.080
<v Speaker 1>virtual world which one goes into and from room to room,

0:24:51.119 --> 0:24:53.200
<v Speaker 1>and one door opens and you go from one place

0:24:53.240 --> 0:24:56.640
<v Speaker 1>to another. But I'm not sure how you use it.

0:24:56.680 --> 0:25:00.439
<v Speaker 1>Is it essentially to you a tool for training AI.

0:25:00.480 --> 0:25:04.840
<v Speaker 1>It's a different way to train AI, rather than for example,

0:25:04.920 --> 0:25:07.359
<v Speaker 1>meta saying this is in the metaverse. This is a

0:25:07.400 --> 0:25:09.439
<v Speaker 1>world you can go into and spend time and as

0:25:09.480 --> 0:25:10.080
<v Speaker 1>a human being.

0:25:10.320 --> 0:25:13.640
<v Speaker 2>Right, So let's just be clear with the definition. Marble

0:25:14.080 --> 0:25:18.680
<v Speaker 2>is a frontier model. It's a frontier world model. What

0:25:18.720 --> 0:25:21.080
<v Speaker 2>it does is not just to see and go into

0:25:21.119 --> 0:25:26.200
<v Speaker 2>a world. What's really remarkable is it generates a three

0:25:26.320 --> 0:25:31.120
<v Speaker 2>D world at a simple prompt. A prompt could be

0:25:32.080 --> 0:25:36.080
<v Speaker 2>give me a modern looking kitchen, or the prompt could

0:25:36.160 --> 0:25:39.880
<v Speaker 2>be here's a picture of a modern looking kitchen. Make

0:25:39.960 --> 0:25:43.920
<v Speaker 2>it a three D world. The ability to create a

0:25:44.040 --> 0:25:48.880
<v Speaker 2>three D world is a fundamental ability. It's fundamental to humans,

0:25:48.880 --> 0:25:51.800
<v Speaker 2>and it's I hope one day, is fundamental to AI.

0:25:52.359 --> 0:25:57.920
<v Speaker 2>If you are a designer or architect, you can use

0:25:58.000 --> 0:26:02.439
<v Speaker 2>this three D world to ida to design. If you

0:26:02.520 --> 0:26:06.200
<v Speaker 2>are a game developer, you can use it to obtain

0:26:06.280 --> 0:26:09.720
<v Speaker 2>these three D worlds so that you can design games.

0:26:09.920 --> 0:26:13.440
<v Speaker 2>If you want to do robotics simulation, these worlds will

0:26:13.480 --> 0:26:17.320
<v Speaker 2>become very useful as training data for robots. If you

0:26:17.480 --> 0:26:23.800
<v Speaker 2>want to create immersive educational experiences in ARVR, this model

0:26:23.840 --> 0:26:25.040
<v Speaker 2>would help you to do that.

0:26:25.119 --> 0:26:27.959
<v Speaker 1>Interesting I'm imagining girls in Afghanistan, Maybe you could do

0:26:28.480 --> 0:26:32.240
<v Speaker 1>virtual classrooms in a very challenged place.

0:26:32.200 --> 0:26:35.800
<v Speaker 2>Yes, Or I'm imagining, for example, how do you explain

0:26:35.880 --> 0:26:40.920
<v Speaker 2>to an eight year old what is a cell? One day,

0:26:41.119 --> 0:26:45.080
<v Speaker 2>we'll create a world that's inside of a cell, and

0:26:45.119 --> 0:26:49.320
<v Speaker 2>then the student can walk into that cell and understand

0:26:49.440 --> 0:26:54.399
<v Speaker 2>the nuclears, enzymes, the membranes. So you can use this

0:26:54.560 --> 0:26:59.359
<v Speaker 2>for so many possibilities. So that's your next frontier, can

0:26:59.400 --> 0:27:03.000
<v Speaker 2>we conscious? Yours is a very big, complex industry, but

0:27:03.200 --> 0:27:07.159
<v Speaker 2>there are some immediate pressing issues, and I wonder if

0:27:07.160 --> 0:27:09.760
<v Speaker 2>I could put a selection of those to you for

0:27:09.880 --> 0:27:12.720
<v Speaker 2>you to give us an instinctive or even a nutshell

0:27:13.320 --> 0:27:16.800
<v Speaker 2>response on how you see them. For example, and you'll

0:27:16.800 --> 0:27:19.440
<v Speaker 2>have heard this many times before. Number one, is AI

0:27:19.520 --> 0:27:26.600
<v Speaker 2>going to destroy jobs? Large numbers of jobs? Technology do

0:27:26.760 --> 0:27:31.520
<v Speaker 2>change the landscape of labor. A technology as impactful and

0:27:31.720 --> 0:27:36.000
<v Speaker 2>profound as AI will have a profound impact in jobs.

0:27:35.720 --> 0:27:41.320
<v Speaker 1>Which is happening of their customer support roles are going

0:27:41.359 --> 0:27:43.000
<v Speaker 1>to go because of AI.

0:27:43.000 --> 0:27:48.240
<v Speaker 2>Right and software engineering, contact centers, analyst jobs, So it will,

0:27:48.440 --> 0:27:50.360
<v Speaker 2>there's no question about it.

0:27:50.359 --> 0:27:52.800
<v Speaker 1>It's not going to create as many jobs in its place,

0:27:52.880 --> 0:27:55.159
<v Speaker 1>is it. I wonder if that worries you.

0:27:55.440 --> 0:27:59.959
<v Speaker 2>The juris store out there. Every time humanity has created

0:27:59.920 --> 0:28:07.520
<v Speaker 2>a more advanced technology, for example steam engine, electricity, PC cars,

0:28:07.800 --> 0:28:11.879
<v Speaker 2>we have gone through difficult times, but we also have

0:28:12.040 --> 0:28:17.560
<v Speaker 2>gone through real landscaping of jobs. So only in talking

0:28:17.600 --> 0:28:22.280
<v Speaker 2>about the number bigger or smaller doesn't do justice. We

0:28:22.320 --> 0:28:24.480
<v Speaker 2>need to look at this in a much more nuanced

0:28:24.560 --> 0:28:27.920
<v Speaker 2>way and really think about how to respond to this change.

0:28:28.119 --> 0:28:32.119
<v Speaker 2>There's the individual level responsibility you've got to learn, you

0:28:32.280 --> 0:28:37.040
<v Speaker 2>gotta obscure yourself, but there is also company level responsibility.

0:28:37.119 --> 0:28:41.120
<v Speaker 2>There's also societal level responsibility. So this is a big question.

0:28:41.480 --> 0:28:44.000
<v Speaker 1>Number two is an even bigger question. If you take

0:28:44.040 --> 0:28:47.000
<v Speaker 1>those headphones, I want to play you the voice and

0:28:47.240 --> 0:28:51.400
<v Speaker 1>a view on the existential question about whether human beings

0:28:51.440 --> 0:28:53.720
<v Speaker 1>are going to be replaced by AI. And this is

0:28:53.760 --> 0:28:57.200
<v Speaker 1>a voice you will know Professor Jeffrey Hinton, whose work

0:28:57.280 --> 0:29:00.479
<v Speaker 1>has overlapped with yours and who is a no laureate

0:29:00.520 --> 0:29:00.920
<v Speaker 1>as well.

0:29:01.360 --> 0:29:04.640
<v Speaker 3>When AI gets super intelligent, it might just replace us.

0:29:05.360 --> 0:29:07.680
<v Speaker 3>How do we prevent it taking over? Even if all

0:29:07.680 --> 0:29:10.800
<v Speaker 3>the countries collaborate, what do you do? And I think

0:29:10.800 --> 0:29:13.840
<v Speaker 3>at present, all the big companies and governments have the

0:29:13.880 --> 0:29:19.280
<v Speaker 3>wrong model. Their basic model is I'm the CEO, and

0:29:19.320 --> 0:29:23.200
<v Speaker 3>this super intelligent AI is the extremely smart executive assistant.

0:29:23.960 --> 0:29:26.480
<v Speaker 3>I'm the boss. It's not going to be like that

0:29:26.520 --> 0:29:28.640
<v Speaker 3>when it's smarter than us and more powerful than us.

0:29:30.200 --> 0:29:32.400
<v Speaker 1>What do you think of that? Because Professor Hinton thinks

0:29:32.440 --> 0:29:35.880
<v Speaker 1>it's a ten to twenty percent chance that AI leads

0:29:35.920 --> 0:29:37.040
<v Speaker 1>to human extinction.

0:29:38.040 --> 0:29:41.440
<v Speaker 2>So, first of all, Professor Hinton or I call him Jeff,

0:29:41.480 --> 0:29:44.320
<v Speaker 2>because I have known him for twenty five years since

0:29:44.360 --> 0:29:47.280
<v Speaker 2>I was first a graduate student. He's someone I admire

0:29:47.360 --> 0:29:52.600
<v Speaker 2>and studied his technical papers. But this thing about replacing

0:29:52.720 --> 0:29:58.560
<v Speaker 2>human race, I actually do respectfully disagree, not in the

0:29:58.680 --> 0:30:01.640
<v Speaker 2>sense that it will never happen. It is in a

0:30:01.760 --> 0:30:09.160
<v Speaker 2>sense that if human race became really being trouble, in

0:30:09.240 --> 0:30:14.400
<v Speaker 2>my opinion, that's a result of humans doing wrong things,

0:30:14.600 --> 0:30:16.520
<v Speaker 2>not machines doing wrong things.

0:30:16.800 --> 0:30:19.600
<v Speaker 1>But the very practical point that he put in that

0:30:19.760 --> 0:30:23.760
<v Speaker 1>clip is where he says, how do we prevent the

0:30:23.800 --> 0:30:27.280
<v Speaker 1>super intelligent creation taking over at the point that it

0:30:27.400 --> 0:30:31.400
<v Speaker 1>becomes more intelligent than us? We have no model for that. Now,

0:30:31.720 --> 0:30:34.400
<v Speaker 1>if that creation that is more intelligent than us, says,

0:30:34.760 --> 0:30:39.240
<v Speaker 1>turn off human beings life support or do something else.

0:30:39.320 --> 0:30:42.680
<v Speaker 1>That is existential? How would we stop it?

0:30:43.280 --> 0:30:47.120
<v Speaker 2>So? I think this question has made an assumption, which

0:30:47.160 --> 0:30:50.240
<v Speaker 2>is from today, which we don't have such a machine,

0:30:50.240 --> 0:30:54.400
<v Speaker 2>super intelligent machine, Yet we still have a distance, We

0:30:54.520 --> 0:30:57.280
<v Speaker 2>still have a journey to take from today to that day.

0:30:57.760 --> 0:31:01.160
<v Speaker 2>And my question is why would humanity as a whole

0:31:01.280 --> 0:31:06.200
<v Speaker 2>allow this to happen? Where is our collective responsibility? Where's

0:31:06.240 --> 0:31:08.160
<v Speaker 2>our governance or regulation?

0:31:08.600 --> 0:31:09.960
<v Speaker 1>Which is why I wonder, then, do you think there

0:31:10.000 --> 0:31:11.640
<v Speaker 1>is a way to make sure that there is an

0:31:11.720 --> 0:31:13.720
<v Speaker 1>upper limit to superintelligence?

0:31:14.120 --> 0:31:16.800
<v Speaker 2>I think there is a way to make sure there

0:31:17.040 --> 0:31:21.840
<v Speaker 2>is a responsible development and usage of.

0:31:21.880 --> 0:31:25.880
<v Speaker 1>Technology internationally agreed, like at the government level. Is it

0:31:25.920 --> 0:31:29.239
<v Speaker 1>a treaty? Is it just companies agreeing to right?

0:31:30.280 --> 0:31:34.040
<v Speaker 2>You're right? The field is so nascent that we don't

0:31:34.120 --> 0:31:38.560
<v Speaker 2>yet have the level of international treaties. We don't yet

0:31:38.640 --> 0:31:41.760
<v Speaker 2>have the level of global consent. I think we have

0:31:41.800 --> 0:31:46.040
<v Speaker 2>global awareness, and I do want to say that we

0:31:46.080 --> 0:31:51.840
<v Speaker 2>shouldn't over click out the only one possible consequences or

0:31:51.880 --> 0:31:56.520
<v Speaker 2>negative consequences of AI. This technology is powerful. It might

0:31:56.640 --> 0:32:00.920
<v Speaker 2>have other negative consequences. It also has a tone of

0:32:01.280 --> 0:32:06.880
<v Speaker 2>benevolent applications for humanity. We need to look at this holistically.

0:32:07.000 --> 0:32:08.960
<v Speaker 1>Do you get frustrated by some of the questions? And

0:32:09.040 --> 0:32:12.440
<v Speaker 1>because I know you talked to politicians, people with political

0:32:12.480 --> 0:32:14.560
<v Speaker 1>power a lot. You've done that in the US, You've

0:32:14.560 --> 0:32:16.720
<v Speaker 1>done that in the UK and in France and elsewhere.

0:32:17.600 --> 0:32:19.840
<v Speaker 1>What's the most common question they ask you that you

0:32:19.960 --> 0:32:20.880
<v Speaker 1>find frustrating.

0:32:21.520 --> 0:32:25.160
<v Speaker 2>I wouldn't use the word frustrating. I would use the

0:32:25.160 --> 0:32:29.080
<v Speaker 2>word concerned, because I think our public discourse of AI

0:32:29.480 --> 0:32:32.920
<v Speaker 2>need to move from the very simple question of what

0:32:32.960 --> 0:32:36.160
<v Speaker 2>do we do when machine overlord is here? So I

0:32:37.080 --> 0:32:40.400
<v Speaker 2>don't get frustrated, I get concerned if the only way

0:32:40.480 --> 0:32:45.360
<v Speaker 2>to ask this question is is very simple binary do

0:32:45.440 --> 0:32:48.400
<v Speaker 2>you want it or not want it? Another question I

0:32:48.400 --> 0:32:52.160
<v Speaker 2>get asked a lot, possibly more than this question, is

0:32:52.320 --> 0:32:58.200
<v Speaker 2>question from parents worldwide. Parents ask me AI is coming.

0:32:59.280 --> 0:33:03.040
<v Speaker 2>How do I advise my kids? What's the future of

0:33:03.120 --> 0:33:06.480
<v Speaker 2>my kids? What should they do? Should they study computer science?

0:33:06.880 --> 0:33:08.480
<v Speaker 2>Are they going to have jobs? And so?

0:33:08.680 --> 0:33:11.720
<v Speaker 1>Answer it? Then people listening to this are probably thinking

0:33:11.800 --> 0:33:13.240
<v Speaker 1>exactly the same. What do you say?

0:33:13.960 --> 0:33:19.040
<v Speaker 2>I say that AI is a very powerful technology, and

0:33:19.120 --> 0:33:24.000
<v Speaker 2>I'm my mother. The most important thing we should empower

0:33:24.120 --> 0:33:31.800
<v Speaker 2>our kids is to empower them as humans with agency,

0:33:32.200 --> 0:33:37.120
<v Speaker 2>with dignity with the desire to learn, and there are

0:33:37.280 --> 0:33:41.320
<v Speaker 2>timeless values of humanity. Be an honest person, be a

0:33:41.360 --> 0:33:45.840
<v Speaker 2>hard working person, be creative. Worry about what they'll study.

0:33:46.360 --> 0:33:51.000
<v Speaker 2>Worry is not the right word. Be totally informed and

0:33:51.240 --> 0:33:56.160
<v Speaker 2>understand that your children's future is going to be living

0:33:56.120 --> 0:34:00.960
<v Speaker 2>in the world of AI technology and depending on their interests,

0:34:01.120 --> 0:34:08.040
<v Speaker 2>their passion, their personality, their circumstance, prepare them in that future.

0:34:08.120 --> 0:34:10.040
<v Speaker 2>Worry doesn't solve the problem.

0:34:10.239 --> 0:34:13.680
<v Speaker 1>I've got another industry question, which is about the huge

0:34:13.760 --> 0:34:17.560
<v Speaker 1>sums of money that are flowing into again, not that

0:34:17.680 --> 0:34:24.200
<v Speaker 1>many companies like yours, and whether this might be a bubble,

0:34:24.239 --> 0:34:26.879
<v Speaker 1>whether this might be like a dot com bubble where

0:34:26.920 --> 0:34:29.279
<v Speaker 1>it turns out that some of these companies are overvalued.

0:34:29.880 --> 0:34:33.480
<v Speaker 2>First of all, my company is still a startup. When

0:34:33.520 --> 0:34:36.520
<v Speaker 2>we're talking about huge amount of money, we really look

0:34:36.520 --> 0:34:40.800
<v Speaker 2>at the big techs. AI is still a nascent technology,

0:34:41.280 --> 0:34:44.160
<v Speaker 2>and from a development point of view, there's still a

0:34:44.360 --> 0:34:48.000
<v Speaker 2>lot to be developed. The science is very hard. It

0:34:48.120 --> 0:34:52.680
<v Speaker 2>takes a lot to make scientific and technological breakthroughs. This

0:34:52.840 --> 0:34:57.879
<v Speaker 2>is why resourcing these efforts are still important. The other

0:34:58.000 --> 0:35:00.719
<v Speaker 2>side of this is the market. Are we going to

0:35:00.760 --> 0:35:04.759
<v Speaker 2>see the payoff from the market. By and large, I

0:35:04.880 --> 0:35:09.240
<v Speaker 2>do believe that the applications of AI is so massive,

0:35:09.440 --> 0:35:14.320
<v Speaker 2>whether we're talking about software engineering to creativity, to healthcare,

0:35:14.520 --> 0:35:19.080
<v Speaker 2>to education, to financial services, that I think we're going

0:35:19.080 --> 0:35:24.280
<v Speaker 2>to continue to see an expansion of the market of AI.

0:35:25.120 --> 0:35:28.200
<v Speaker 2>I look at it as there are so many human

0:35:29.000 --> 0:35:32.440
<v Speaker 2>needs both in terms of wellbeing as well as in

0:35:32.520 --> 0:35:37.279
<v Speaker 2>terms of productivity that can be helped by AI as

0:35:37.280 --> 0:35:42.279
<v Speaker 2>an assistant, as a collaborator, and that part. I do

0:35:42.400 --> 0:35:46.040
<v Speaker 2>believe strongly that this is an expanding market.

0:35:46.400 --> 0:35:49.400
<v Speaker 1>But what does it cost in terms of power and

0:35:49.480 --> 0:35:54.760
<v Speaker 1>therefore energy and therefore climate. There's a prominent AI entrepreneur

0:35:54.800 --> 0:35:57.120
<v Speaker 1>you probably know, Jerry Kaplan, who said that we could

0:35:57.120 --> 0:36:01.200
<v Speaker 1>be heading for a new ecological disaster because of the

0:36:01.280 --> 0:36:05.480
<v Speaker 1>amount of energy consumed by the vast data centers that

0:36:05.520 --> 0:36:07.240
<v Speaker 1>we're going to need in growing numbers.

0:36:08.080 --> 0:36:11.080
<v Speaker 2>This is an interesting question. I do think that in

0:36:11.200 --> 0:36:14.799
<v Speaker 2>order to train large models, we're seeing more and more

0:36:14.920 --> 0:36:20.480
<v Speaker 2>need for power or energy. But nobody says these data

0:36:20.520 --> 0:36:25.799
<v Speaker 2>centers must be powered by fossil fuel. For example, Our

0:36:25.960 --> 0:36:29.600
<v Speaker 2>innovation and the energy side will be part of this

0:36:29.800 --> 0:36:31.239
<v Speaker 2>innovation cycle, right.

0:36:31.239 --> 0:36:33.920
<v Speaker 1>I think it's just because the amount of power they

0:36:33.960 --> 0:36:36.279
<v Speaker 1>need is so enormous it's hard to see it coming

0:36:36.280 --> 0:36:37.920
<v Speaker 1>from renewable energy alone.

0:36:38.080 --> 0:36:40.760
<v Speaker 2>I think right now this is true, but I also

0:36:40.920 --> 0:36:45.759
<v Speaker 2>know that as for example, I visit Middle East, there's

0:36:45.840 --> 0:36:50.000
<v Speaker 2>a lot of effort in building renewable energy for big

0:36:50.080 --> 0:36:54.040
<v Speaker 2>data centers. I do think that countries that need to

0:36:54.080 --> 0:36:57.399
<v Speaker 2>build these big data centers need to also examine its

0:36:58.239 --> 0:37:02.960
<v Speaker 2>energy policy and industry. This is an opportunity for us

0:37:03.080 --> 0:37:07.000
<v Speaker 2>to invest and develop more renewable energy.

0:37:07.320 --> 0:37:10.040
<v Speaker 1>What worries you about your industry because you're painting a

0:37:10.120 --> 0:37:12.200
<v Speaker 1>very positive picture, and you've been at the forefront of

0:37:12.239 --> 0:37:15.120
<v Speaker 1>this and you see much more potential. So I understand

0:37:15.160 --> 0:37:16.200
<v Speaker 1>where you're coming from.

0:37:16.920 --> 0:37:20.239
<v Speaker 2>But in quite a moment, I'm not a tech utopian,

0:37:20.520 --> 0:37:23.440
<v Speaker 2>nor am I a dystopian. I think I'm actually the

0:37:23.480 --> 0:37:27.360
<v Speaker 2>boring middle. The boring middle wants to apply a much

0:37:27.400 --> 0:37:33.400
<v Speaker 2>more pragmatic and scientific lens to this. So what worries me?

0:37:34.000 --> 0:37:37.440
<v Speaker 2>Of course, aning tool in the hands of the wrong

0:37:38.160 --> 0:37:42.440
<v Speaker 2>mindset or wrong intention would worry me since the dawn

0:37:42.480 --> 0:37:46.200
<v Speaker 2>of human civilization, if fire was such a critical invention

0:37:46.360 --> 0:37:50.560
<v Speaker 2>for our species, yet using fire to harm people is

0:37:51.040 --> 0:37:55.120
<v Speaker 2>massively bad. So adding wrong use of AI, worries speed,

0:37:55.880 --> 0:37:59.959
<v Speaker 2>the wrong way to communicate with the public, Worry speed

0:38:00.680 --> 0:38:04.239
<v Speaker 2>because I do feel there's a lot of anxiety in

0:38:04.360 --> 0:38:07.920
<v Speaker 2>different parts of the world. The one worry I have

0:38:08.160 --> 0:38:12.320
<v Speaker 2>is our teachers. These people and my own personal experience

0:38:12.360 --> 0:38:15.640
<v Speaker 2>tells me they're the backbone of our society. They are

0:38:15.719 --> 0:38:20.960
<v Speaker 2>so critical for educating our future generation. Are we having

0:38:21.000 --> 0:38:24.440
<v Speaker 2>the right communication with them? Are we bringing them along?

0:38:24.600 --> 0:38:31.000
<v Speaker 2>Are our teachers using AI tools to superpower their profession

0:38:31.680 --> 0:38:37.040
<v Speaker 2>or helping our children to use AI instead of letting

0:38:37.040 --> 0:38:41.480
<v Speaker 2>them feel left out, frustrated and just we should be

0:38:41.560 --> 0:38:45.400
<v Speaker 2>embracing them. We should be helping them. And this would

0:38:45.400 --> 0:38:47.480
<v Speaker 2>be a concern if we don't do the right thing

0:38:47.560 --> 0:38:50.680
<v Speaker 2>for our teachers, for our nurses, for our doctors, for

0:38:51.160 --> 0:38:53.040
<v Speaker 2>many parts of our society.

0:38:53.800 --> 0:38:56.480
<v Speaker 1>This is the Bloomberg Weekend Interview. And so we're always

0:38:56.480 --> 0:38:59.680
<v Speaker 1>interested in people's lives as well as their work. And

0:39:00.239 --> 0:39:03.640
<v Speaker 1>I realize that your life, the life that you're living today,

0:39:03.680 --> 0:39:05.480
<v Speaker 1>is so different from the way that you grew up,

0:39:05.800 --> 0:39:09.640
<v Speaker 1>working in your parents dry cleaner, keeping it a lot.

0:39:09.520 --> 0:39:10.520
<v Speaker 2>Of laundry at home.

0:39:10.680 --> 0:39:15.680
<v Speaker 1>So are you conscious of the power that you have

0:39:16.160 --> 0:39:19.160
<v Speaker 1>as a leader in this industry of our time in

0:39:19.200 --> 0:39:19.720
<v Speaker 1>the future.

0:39:20.239 --> 0:39:25.600
<v Speaker 2>I'm conscious of my responsibility. I understand I'm one of

0:39:25.640 --> 0:39:30.960
<v Speaker 2>the people who brought this technology to our world. I

0:39:31.080 --> 0:39:35.279
<v Speaker 2>understand that I'm so privileged to be working at the

0:39:36.000 --> 0:39:40.440
<v Speaker 2>one of the best universities in the world, educating tomorrow's

0:39:40.520 --> 0:39:44.680
<v Speaker 2>leaders and doing the cutting edge research. I'm conscious of

0:39:45.600 --> 0:39:49.000
<v Speaker 2>myself being an entrepreneur and a CEO of one of

0:39:49.040 --> 0:39:52.960
<v Speaker 2>the most exciting startups in the gen AI world. So

0:39:53.080 --> 0:39:58.719
<v Speaker 2>everything I do has a consequence, and that's a responsibility

0:39:58.840 --> 0:40:04.600
<v Speaker 2>I shoulder, and I take that very seriously because this

0:40:04.760 --> 0:40:08.800
<v Speaker 2>is what I keep telling people in the age of AI.

0:40:09.320 --> 0:40:14.320
<v Speaker 2>The agency should be within humans. The agency is not

0:40:14.440 --> 0:40:19.319
<v Speaker 2>the machines. It's ours. My agency is to create exciting

0:40:20.200 --> 0:40:23.040
<v Speaker 2>technology and to use it responsibily.

0:40:23.640 --> 0:40:26.600
<v Speaker 1>And the humans in your life, your children, what do

0:40:26.640 --> 0:40:31.319
<v Speaker 1>you not let them do with AI or with their

0:40:31.320 --> 0:40:33.360
<v Speaker 1>devices or on the internet.

0:40:33.520 --> 0:40:37.840
<v Speaker 2>It's the timeless advice of don't do stupid things with

0:40:39.200 --> 0:40:43.880
<v Speaker 2>your tools. You have to think about why you're using

0:40:43.960 --> 0:40:47.319
<v Speaker 2>the tool and how you're using it. It could be

0:40:47.560 --> 0:40:52.080
<v Speaker 2>as simple as don't be lazy just because there is AI. Right,

0:40:52.160 --> 0:40:56.800
<v Speaker 2>if you want to understand math, maybe large language models

0:40:56.840 --> 0:40:58.880
<v Speaker 2>can give you the answer, but that is not the

0:40:58.920 --> 0:41:02.120
<v Speaker 2>way to learn. As the questions. You could use the

0:41:02.160 --> 0:41:05.480
<v Speaker 2>AI tool to prompt a good question so that you

0:41:05.640 --> 0:41:08.719
<v Speaker 2>learn because of it. So don't be lazy is one

0:41:08.760 --> 0:41:11.600
<v Speaker 2>of them. The other side is don't use it to

0:41:11.680 --> 0:41:16.680
<v Speaker 2>do bad things. For example, integrity of information right, fake images,

0:41:16.760 --> 0:41:21.640
<v Speaker 2>fake voices, fake texts. These are issues of AI as

0:41:21.680 --> 0:41:26.400
<v Speaker 2>well as our social media driven communication in our society.

0:41:26.480 --> 0:41:28.920
<v Speaker 1>I think you're sort of calling for old fashioned values

0:41:29.000 --> 0:41:33.040
<v Speaker 1>amid this amid this world full of new developments and

0:41:33.480 --> 0:41:36.799
<v Speaker 1>challenges we couldn't have imagined even three years ago. Yeah,

0:41:36.840 --> 0:41:39.160
<v Speaker 1>you can call it old fashion, you can call it timeless.

0:41:39.480 --> 0:41:42.279
<v Speaker 1>As an educator, as a mom, I think there are

0:41:42.440 --> 0:41:45.400
<v Speaker 1>some human values that are timeless, and we need to

0:41:45.719 --> 0:41:50.239
<v Speaker 1>recognize that. Finally, you're reading from the child who is

0:41:50.280 --> 0:41:55.080
<v Speaker 1>already reading the classics at an early age. Nowadays, What

0:41:55.160 --> 0:41:57.720
<v Speaker 1>do you read when you're kicking back at the weekend,

0:41:57.760 --> 0:41:58.680
<v Speaker 1>if that ever happens.

0:41:58.840 --> 0:42:03.600
<v Speaker 2>Yeah, honestly, I read a lot of technical papers these days.

0:42:03.719 --> 0:42:07.839
<v Speaker 2>It's it's kind of sad. I also read to my kids.

0:42:08.680 --> 0:42:12.200
<v Speaker 2>I have to say that my favorite book these days

0:42:12.280 --> 0:42:15.880
<v Speaker 2>is Harry Potter because it's such a great book and

0:42:16.000 --> 0:42:19.040
<v Speaker 2>I read a bed time reading to my kids.

0:42:19.040 --> 0:42:20.520
<v Speaker 1>Well, that series is going to keep you going for

0:42:20.600 --> 0:42:23.160
<v Speaker 1>quite a long time, Yes, depending on I'm most cure at.

0:42:23.320 --> 0:42:23.520
<v Speaker 2>Yeah.

0:42:23.640 --> 0:42:26.280
<v Speaker 1>So you're not taking away the children's books in their lives, then.

0:42:26.880 --> 0:42:30.480
<v Speaker 2>No, because they live in a totally different time. They

0:42:30.640 --> 0:42:34.959
<v Speaker 2>actually are exposed to information in a completely different way

0:42:35.040 --> 0:42:39.400
<v Speaker 2>from my time. There's like barely TV in the nineteen

0:42:39.520 --> 0:42:44.120
<v Speaker 2>nineties or eighties in China. Now they have the entire Internet.

0:42:44.480 --> 0:42:47.000
<v Speaker 2>I give them a kindle and Chinese. Do you speak

0:42:47.000 --> 0:42:50.360
<v Speaker 2>to them in Chinese? I speak to them in not

0:42:50.480 --> 0:42:53.000
<v Speaker 2>great but I do speak to them in Chinese, and

0:42:53.080 --> 0:42:55.160
<v Speaker 2>their father speak to them in Italian.

0:42:56.040 --> 0:42:59.880
<v Speaker 1>Okay, citizens of the world, Yes, as you've become as well,

0:43:00.320 --> 0:43:05.120
<v Speaker 1>Doctor Fayfaeley. Thank you so much. Thank you, Michelle, And

0:43:05.200 --> 0:43:08.799
<v Speaker 1>that's the Michelle Hussein Show for this week. Do subscribe

0:43:08.800 --> 0:43:11.959
<v Speaker 1>if you haven't already. That way you'll know as soon

0:43:12.000 --> 0:43:14.600
<v Speaker 1>as we have a new episode. Thank you for the

0:43:14.600 --> 0:43:18.160
<v Speaker 1>comments and the ratings, and don't forget you can email

0:43:18.280 --> 0:43:23.759
<v Speaker 1>us the addresses Michelle Show at Bloomberg dot net. If

0:43:23.760 --> 0:43:26.360
<v Speaker 1>you want to see the written versions of these conversations

0:43:26.680 --> 0:43:30.440
<v Speaker 1>with my notes, they're at Bloomberg dot Com Slash Weekend,

0:43:31.040 --> 0:43:34.560
<v Speaker 1>and you can watch them on YouTube and Bloomberg TV.

0:43:35.280 --> 0:43:39.280
<v Speaker 1>The show's producers are Jessica Beck and Chris Martleu. Guest

0:43:39.280 --> 0:43:43.400
<v Speaker 1>booking is by Dave Warren, Social media by Alex Morgan.

0:43:44.080 --> 0:43:47.600
<v Speaker 1>Our sound engineer this week was Richard Ward. Video editing

0:43:47.920 --> 0:43:52.000
<v Speaker 1>was by Laura Francis. Our executive producer is Louisa Lewis

0:43:52.640 --> 0:43:56.960
<v Speaker 1>at Bloomberg Weekend, Brendan Francis Nunam is Editorial director of

0:43:57.040 --> 0:44:01.680
<v Speaker 1>Audio and Special Projects, and our executive editor is Catherine bell.

0:44:02.440 --> 0:44:05.360
<v Speaker 1>Our music is by Bart Walshaw. And we'd also like

0:44:05.400 --> 0:44:10.759
<v Speaker 1>to thank Alana Susnow Samasadi and Sage Bauman and thank

0:44:10.800 --> 0:44:16.880
<v Speaker 1>you for listening until next weekend. Goodbye,