WEBVTT - Jon McAuliffe on Innovation and Statistical Methods

0:00:02.040 --> 0:00:07.360
<v Speaker 1>This is Master's in Business with Barry Ridholds on Bloomberg Radio.

0:00:08.400 --> 0:00:11.800
<v Speaker 2>This week on the podcast Strap Yourself In, I have

0:00:12.039 --> 0:00:16.159
<v Speaker 2>another extra special guest. John mccauliffe is co founder and

0:00:16.239 --> 0:00:19.720
<v Speaker 2>chief investment officer at the Volleyon Group. They're a five

0:00:19.760 --> 0:00:23.360
<v Speaker 2>billion dollar hedge funds and one of the earliest shops

0:00:24.079 --> 0:00:27.880
<v Speaker 2>to ever use machine learning as it applies to training

0:00:27.920 --> 0:00:33.239
<v Speaker 2>and investment management decisions. It is a full systematic approach

0:00:33.360 --> 0:00:39.599
<v Speaker 2>to using computer horsepower and database and machine learning and

0:00:39.880 --> 0:00:43.960
<v Speaker 2>their own predictive engine to make investments and trades, and

0:00:44.000 --> 0:00:48.400
<v Speaker 2>it's managed to put together quite a track record. Previously,

0:00:48.479 --> 0:00:52.680
<v Speaker 2>John was at d SHAW where he ran statistical arbitrage.

0:00:53.080 --> 0:00:56.040
<v Speaker 2>He is one of the people who worked on the

0:00:56.080 --> 0:01:01.520
<v Speaker 2>Amazon recommendation engine, and he is currently a professor of

0:01:01.560 --> 0:01:06.040
<v Speaker 2>statistics at Berkeley. I don't even know where to begin,

0:01:06.200 --> 0:01:10.160
<v Speaker 2>other than say, if you're interested in AI or machine

0:01:10.240 --> 0:01:16.080
<v Speaker 2>learning or quantitative strategies, this is just a masterclass in

0:01:16.120 --> 0:01:18.280
<v Speaker 2>how it's done by one of the first people in

0:01:18.319 --> 0:01:22.880
<v Speaker 2>the space to not only do this sort of machine

0:01:22.920 --> 0:01:25.800
<v Speaker 2>learning and apply it to investing, but one of the best.

0:01:26.120 --> 0:01:29.400
<v Speaker 2>I think this is a fascinating conversation and I believe

0:01:29.440 --> 0:01:31.840
<v Speaker 2>you will find it to be so. Also, with no

0:01:32.000 --> 0:01:37.800
<v Speaker 2>further ado, my discussion with volleyon groups. John mccauliffe. John mccauliff,

0:01:37.920 --> 0:01:39.479
<v Speaker 2>Welcome to Bloomberg.

0:01:40.120 --> 0:01:41.560
<v Speaker 1>Thanks, Barry. I'm really happy to be here.

0:01:41.640 --> 0:01:45.160
<v Speaker 2>So let's talk a little bit about your academic background. First,

0:01:45.840 --> 0:01:49.640
<v Speaker 2>you start out undergrad computer science and applied mathematics at

0:01:49.720 --> 0:01:54.040
<v Speaker 2>Harvard before you go on to get a PhD from

0:01:54.080 --> 0:01:58.240
<v Speaker 2>California Berkeley. What led to a career in data analysis?

0:01:58.280 --> 0:02:00.880
<v Speaker 2>How early did you know that's what you wanted to do?

0:02:01.880 --> 0:02:06.200
<v Speaker 1>Well, it was a winding path. Actually, I was very

0:02:06.240 --> 0:02:11.320
<v Speaker 1>interested in international relations and foreign languages when I was

0:02:11.400 --> 0:02:13.880
<v Speaker 1>finishing high school. In fact, I spent the last year

0:02:13.919 --> 0:02:17.040
<v Speaker 1>of high school as an exchange student in Germany. And

0:02:17.120 --> 0:02:20.400
<v Speaker 1>so when I got to college, I was expecting to

0:02:20.919 --> 0:02:23.880
<v Speaker 1>major in government and go on to maybe work in

0:02:23.919 --> 0:02:25.600
<v Speaker 1>the foreign service something like that.

0:02:25.800 --> 0:02:30.200
<v Speaker 2>Really, so this is a big shift from your original expectations.

0:02:30.280 --> 0:02:33.559
<v Speaker 1>Yeah, it took about one semester for me to realize

0:02:33.840 --> 0:02:36.600
<v Speaker 1>that none of the questions that were being asked in

0:02:36.639 --> 0:02:39.680
<v Speaker 1>my classes had definitive and correct answers.

0:02:40.680 --> 0:02:41.720
<v Speaker 2>Did that frustrate you?

0:02:41.880 --> 0:02:45.760
<v Speaker 1>It did frustrate me. Yeah, And so I stayed home

0:02:45.880 --> 0:02:47.959
<v Speaker 1>over winter I stayed Excuse me, I didn't go home.

0:02:48.000 --> 0:02:51.120
<v Speaker 1>I stayed at college over winter break to try to

0:02:51.160 --> 0:02:52.400
<v Speaker 1>sort out what the heck I was going to do,

0:02:52.440 --> 0:02:54.440
<v Speaker 1>because I could see that it wasn't My plan was

0:02:54.440 --> 0:02:58.000
<v Speaker 1>in disarray. And I'd always been interested in computers, had

0:02:58.000 --> 0:03:01.840
<v Speaker 1>played around with computers, never done anything very serious, but

0:03:02.919 --> 0:03:05.560
<v Speaker 1>I thought I might as well give it a shot,

0:03:05.639 --> 0:03:07.799
<v Speaker 1>and so in the spring semester I took my first

0:03:07.840 --> 0:03:13.560
<v Speaker 1>computer science course. And when you write software, everything has

0:03:13.720 --> 0:03:15.800
<v Speaker 1>a right answer. It either does what you wanted to

0:03:15.800 --> 0:03:16.200
<v Speaker 1>do or.

0:03:16.160 --> 0:03:20.360
<v Speaker 2>It doesn't, does not compile exactly. So that's really quite

0:03:20.480 --> 0:03:26.919
<v Speaker 2>quite fascinating. So what led you from Berkeley to d

0:03:27.280 --> 0:03:30.040
<v Speaker 2>sure that they're one of the first quand shops. How

0:03:30.080 --> 0:03:31.839
<v Speaker 2>did you get there? What sort of research did Yeah?

0:03:31.840 --> 0:03:34.920
<v Speaker 1>I actually I spent time at d Shot in between

0:03:34.960 --> 0:03:38.920
<v Speaker 1>my undergrad and my PhD program, So it was after

0:03:39.000 --> 0:03:40.720
<v Speaker 1>Harvard that I went to that show.

0:03:40.840 --> 0:03:45.280
<v Speaker 2>Did that light an interest in using machine learning and

0:03:45.320 --> 0:03:48.840
<v Speaker 2>computers applied to finance or what was that experience like?

0:03:49.120 --> 0:03:54.040
<v Speaker 1>Yeah, it made me really interested in and excited about

0:03:54.520 --> 0:03:59.720
<v Speaker 1>using statistical thinking and data analysis to sort of understand

0:03:59.720 --> 0:04:02.840
<v Speaker 1>then amics of securities prices. Machine learning did not play

0:04:02.880 --> 0:04:05.840
<v Speaker 1>really a role at that time, I think, not at

0:04:05.920 --> 0:04:08.680
<v Speaker 1>d SHAW, but you know, probably nowhere it was too

0:04:08.720 --> 0:04:13.960
<v Speaker 1>immature a feel in the nineties. But I had already

0:04:14.000 --> 0:04:20.039
<v Speaker 1>been curious and interested in using these kinds of statistical

0:04:20.080 --> 0:04:23.480
<v Speaker 1>tools in trading and in investing when I was finishing

0:04:23.520 --> 0:04:25.880
<v Speaker 1>college and then at d SHAW. You know, I had

0:04:25.920 --> 0:04:28.800
<v Speaker 1>brilliant colleagues and we were working on hard problems. So

0:04:28.839 --> 0:04:30.200
<v Speaker 1>I really, I really got a lot.

0:04:30.040 --> 0:04:32.559
<v Speaker 2>Of us still one of the top performing hedge funds,

0:04:32.600 --> 0:04:34.520
<v Speaker 2>one of the earliest quant hedge funds, A great a

0:04:34.560 --> 0:04:37.440
<v Speaker 2>great place to absolutely cut your teeth at. So was

0:04:37.480 --> 0:04:40.080
<v Speaker 2>it Harvard d SHAW and then Berkeley?

0:04:40.160 --> 0:04:40.800
<v Speaker 1>Yeah, that's right?

0:04:40.839 --> 0:04:43.800
<v Speaker 2>And then from Berkeley? How did you end up at Amazon?

0:04:44.800 --> 0:04:47.400
<v Speaker 1>I guess I should correct myself. There was a year

0:04:47.440 --> 0:04:50.320
<v Speaker 1>at Amazon after d Eshaw, but before Berkeley.

0:04:50.240 --> 0:04:55.520
<v Speaker 2>And am I reading this correctly? The recommendation engine that

0:04:55.640 --> 0:04:58.480
<v Speaker 2>Amazon uses you helped develop?

0:04:58.880 --> 0:05:01.200
<v Speaker 1>I would say I worked on it. I would you

0:05:01.200 --> 0:05:03.560
<v Speaker 1>know it existed, It was in place when I got there,

0:05:03.920 --> 0:05:07.200
<v Speaker 1>and sort of the things that are familiar about the

0:05:07.200 --> 0:05:11.480
<v Speaker 1>recommendation engine had already been built by my manager and

0:05:12.000 --> 0:05:17.520
<v Speaker 1>his colleagues. But I worked I did research on improvements

0:05:17.560 --> 0:05:21.880
<v Speaker 1>and different ways of forming recommendations. It was funny because

0:05:21.920 --> 0:05:28.840
<v Speaker 1>at the time, the entire database of purchase history for

0:05:29.240 --> 0:05:32.719
<v Speaker 1>all of Amazon fit in one twenty gigabyte file on

0:05:32.800 --> 0:05:34.160
<v Speaker 1>a disc, so I could just load it on my

0:05:34.200 --> 0:05:35.280
<v Speaker 1>computer and run.

0:05:35.360 --> 0:05:36.960
<v Speaker 2>Now I don't think we could do that anymore. We

0:05:37.160 --> 0:05:39.920
<v Speaker 2>could not, so, thank goodness is Amica Zone cloud services,

0:05:39.960 --> 0:05:43.279
<v Speaker 2>so you could put what is it, twenty five years

0:05:43.520 --> 0:05:50.200
<v Speaker 2>and hundreds of billions of dollars of transactions. So my

0:05:50.320 --> 0:05:53.839
<v Speaker 2>assumption is products like that are highly iterative. The first

0:05:53.920 --> 0:05:56.159
<v Speaker 2>version is all right, it does a half decent job,

0:05:56.440 --> 0:05:58.240
<v Speaker 2>and then it gets better, and then it starts to

0:05:58.279 --> 0:06:01.919
<v Speaker 2>get almost spookily good. It's like, oh, how much of

0:06:01.960 --> 0:06:04.560
<v Speaker 2>that is just the size of the database, and how

0:06:04.600 --> 0:06:07.360
<v Speaker 2>much of that is just a clever algorithm.

0:06:08.040 --> 0:06:12.960
<v Speaker 1>Well, that's a great question, because the two are inextricably linked.

0:06:14.160 --> 0:06:18.080
<v Speaker 1>The way that you make algorithms great is by making

0:06:18.120 --> 0:06:21.360
<v Speaker 1>them more powerful, more expressive, able to describe lots of

0:06:21.360 --> 0:06:25.000
<v Speaker 1>different kinds of patterns and relationships. But those kinds of

0:06:25.040 --> 0:06:29.480
<v Speaker 1>approaches need huge amounts of data in order to correctly

0:06:29.520 --> 0:06:32.280
<v Speaker 1>sort out what's signal and what's noise. The more expressive.

0:06:32.800 --> 0:06:35.680
<v Speaker 1>A tool like that is like a recommender system, the

0:06:35.680 --> 0:06:40.359
<v Speaker 1>more prone it is to mistake one time noise for

0:06:40.560 --> 0:06:44.200
<v Speaker 1>persistent signal, and that is a recurring theme in statistical prediction.

0:06:44.279 --> 0:06:47.599
<v Speaker 1>It is really the central problem in statistical predictions. So

0:06:47.839 --> 0:06:50.520
<v Speaker 1>you have it in recommender systems, you have it in

0:06:50.720 --> 0:06:54.960
<v Speaker 1>predicting price action, in the problems that we solve, and elsewhere.

0:06:55.400 --> 0:06:57.560
<v Speaker 2>There was a pretty infamous New York Times article a

0:06:57.560 --> 0:07:02.919
<v Speaker 2>couple of years ago about targets using their own recommender

0:07:03.000 --> 0:07:09.279
<v Speaker 2>system and sending out maternity things to people. A dad

0:07:09.320 --> 0:07:13.160
<v Speaker 2>gets his young teenage daughters what is this and goes

0:07:13.200 --> 0:07:15.200
<v Speaker 2>in to yell at them, and turns out she was

0:07:15.200 --> 0:07:20.080
<v Speaker 2>pregnant and they had pieced it together. How far of

0:07:20.120 --> 0:07:23.400
<v Speaker 2>a leap is it from these systems to much more

0:07:23.880 --> 0:07:29.160
<v Speaker 2>sophisticated machine learning and even large language models.

0:07:30.080 --> 0:07:32.840
<v Speaker 1>It's the answer, it turns out, is that it's a

0:07:32.920 --> 0:07:37.520
<v Speaker 1>question of scale. That wasn't at all obvious before GPT

0:07:38.120 --> 0:07:42.640
<v Speaker 1>three and chat GPT, But it just turned out that

0:07:42.920 --> 0:07:46.360
<v Speaker 1>when you have, for example, GPT is built from a

0:07:46.440 --> 0:07:50.800
<v Speaker 1>database of sentences in English, it's got a trillion words

0:07:50.800 --> 0:07:53.880
<v Speaker 1>in it that database, and when you take a trillion

0:07:53.920 --> 0:07:55.520
<v Speaker 1>words and you use it to fit a model that

0:07:55.560 --> 0:07:59.080
<v Speaker 1>has one hundred and seventy five billion parameters. There is

0:07:59.120 --> 0:08:02.480
<v Speaker 1>apparently a kind of transition where things become, you know,

0:08:02.560 --> 0:08:05.440
<v Speaker 1>frankly astounding. I don't I think, I don't think that

0:08:05.480 --> 0:08:07.600
<v Speaker 1>anybody who isn't astounded is telling the truth.

0:08:07.720 --> 0:08:13.200
<v Speaker 2>Right. It's eerie is in terms of how sophisticated it is,

0:08:13.520 --> 0:08:17.120
<v Speaker 2>but it's also kind of surprising in terms of I

0:08:17.120 --> 0:08:20.520
<v Speaker 2>guess what the programs like to call hallucinations. I guess

0:08:20.560 --> 0:08:24.160
<v Speaker 2>if you're using the Internet as your base model, Hey,

0:08:24.320 --> 0:08:26.760
<v Speaker 2>there's one or two things on the Internet that are wrong,

0:08:27.240 --> 0:08:30.080
<v Speaker 2>so of course that's going to show up in something

0:08:30.080 --> 0:08:30.960
<v Speaker 2>like chap GPT.

0:08:31.640 --> 0:08:36.559
<v Speaker 1>Yeah, you know. Underlyingly, there's this tool GPT three that's

0:08:36.600 --> 0:08:39.560
<v Speaker 1>really the engine that powers jed GPT, and that tool

0:08:40.760 --> 0:08:43.560
<v Speaker 1>it has one goal. It's a simple goal. You show

0:08:43.600 --> 0:08:46.240
<v Speaker 1>at the beginning of a sentence, and it predicts the

0:08:46.280 --> 0:08:48.600
<v Speaker 1>next word in the sentence, and that's all it is

0:08:48.600 --> 0:08:50.800
<v Speaker 1>trained to do. I mean, it really is actually that simple.

0:08:51.280 --> 0:08:53.559
<v Speaker 2>It's a dumb program that looks smart.

0:08:53.679 --> 0:08:58.079
<v Speaker 1>If you like. But the thing about predicting the next

0:08:58.120 --> 0:09:01.600
<v Speaker 1>word in a sentence is whether you know the sequence

0:09:01.600 --> 0:09:04.400
<v Speaker 1>of words that's being output, is leading to something that

0:09:04.440 --> 0:09:07.400
<v Speaker 1>is true or false, is irrelevant. The only thing that

0:09:07.520 --> 0:09:10.640
<v Speaker 1>it is trained to do is make highly accurate predictions

0:09:10.679 --> 0:09:11.320
<v Speaker 1>of next words.

0:09:11.640 --> 0:09:16.080
<v Speaker 2>So when I said it's really very sophisticated, it just

0:09:16.640 --> 0:09:20.080
<v Speaker 2>for what we tend to call this artificial intelligence. But

0:09:20.160 --> 0:09:22.559
<v Speaker 2>I've read a number of people said, hey, this really

0:09:22.600 --> 0:09:26.320
<v Speaker 2>is an AI. This is something a little more rudimentary.

0:09:26.480 --> 0:09:30.719
<v Speaker 1>Yeah, I think, you know, a critic would say that

0:09:31.280 --> 0:09:35.559
<v Speaker 1>artificial intelligence is a complete misnomer. There's sort of nothing

0:09:36.160 --> 0:09:40.560
<v Speaker 1>remotely intelligent in the colloquial sense about these systems. And

0:09:40.600 --> 0:09:44.080
<v Speaker 1>then a common defense in AI research is that artificial

0:09:44.120 --> 0:09:46.240
<v Speaker 1>intelligence is a moving target. As soon as you build

0:09:46.240 --> 0:09:50.120
<v Speaker 1>a system that does something quasi magical that was the

0:09:50.160 --> 0:09:53.720
<v Speaker 1>old yardstick of intelligence, then the goalposts get moved by

0:09:53.720 --> 0:09:57.400
<v Speaker 1>the people who are supplying the evaluations. And I guess

0:09:57.440 --> 0:09:59.320
<v Speaker 1>I would sit somewhere in between. I think the language

0:09:59.320 --> 0:10:03.480
<v Speaker 1>is unfortunate because it's so easily misconstrued. I wouldn't call

0:10:03.520 --> 0:10:05.760
<v Speaker 1>the system dumb, and I wouldn't call it smart. It's

0:10:06.000 --> 0:10:08.319
<v Speaker 1>you know, those are those are not characteristics of these systems.

0:10:08.440 --> 0:10:10.320
<v Speaker 2>But it's complex and sophisticated.

0:10:10.440 --> 0:10:12.280
<v Speaker 1>It certainly is it has one hundred and seventy five

0:10:12.280 --> 0:10:15.559
<v Speaker 1>billion parameters. That doesn't fit your definition of complex you

0:10:15.679 --> 0:10:16.120
<v Speaker 1>know what would?

0:10:16.920 --> 0:10:21.360
<v Speaker 2>Yeah, that works for me. So your in your career line,

0:10:21.400 --> 0:10:26.720
<v Speaker 2>where is aphametrics and what was that recommendation engine? Like?

0:10:26.920 --> 0:10:29.760
<v Speaker 1>Yeah, So that was work I did as a summer

0:10:30.280 --> 0:10:34.120
<v Speaker 1>research intern during my PhD. And that work was about

0:10:35.800 --> 0:10:39.880
<v Speaker 1>what's called the problem is called genotype calling. So genotype calling,

0:10:40.360 --> 0:10:43.200
<v Speaker 1>I'll explain, Barry, do you have an identical twin? I

0:10:43.280 --> 0:10:46.720
<v Speaker 1>do not, Okay, So I can safely say your genome

0:10:46.760 --> 0:10:49.040
<v Speaker 1>is unique in the world. There's no one else who

0:10:49.040 --> 0:10:52.560
<v Speaker 1>has exactly your genome. On the other hand, if you

0:10:52.559 --> 0:10:55.280
<v Speaker 1>were to lay your genome in mind alongside each other

0:10:55.400 --> 0:10:58.840
<v Speaker 1>lined up, they would be ninety nine point nine percent identical.

0:10:58.960 --> 0:11:03.240
<v Speaker 1>About one position in a thousand is different. But those

0:11:03.280 --> 0:11:05.719
<v Speaker 1>differences are what caused you to be you and me

0:11:05.800 --> 0:11:08.200
<v Speaker 1>to be me. So they're obviously of intense kind of

0:11:08.200 --> 0:11:11.680
<v Speaker 1>scientific and applied interest. And so it's very important to

0:11:11.800 --> 0:11:15.320
<v Speaker 1>be able to take a sort of a sample of

0:11:15.360 --> 0:11:20.360
<v Speaker 1>your DNA and quickly produce a profile of all the

0:11:20.400 --> 0:11:23.960
<v Speaker 1>places that have variability what your particular values are, Okay,

0:11:24.120 --> 0:11:27.079
<v Speaker 1>And that problem is the genotyping problem.

0:11:27.400 --> 0:11:31.000
<v Speaker 2>And this used to be a very expensive, very complex

0:11:31.760 --> 0:11:34.840
<v Speaker 2>problem to solve that. We've spent billions of dollars figuring

0:11:34.880 --> 0:11:38.040
<v Speaker 2>out now a lot faster, a lot cheaper.

0:11:37.800 --> 0:11:40.560
<v Speaker 1>A lot faster. In fact, even the technology I worked

0:11:40.600 --> 0:11:43.280
<v Speaker 1>on in two thousand and five two thousand and four

0:11:43.840 --> 0:11:47.679
<v Speaker 1>is multiple generations old and not really what's used anymore.

0:11:47.880 --> 0:11:51.000
<v Speaker 2>So let's talk about what you did at the efficient frontier.

0:11:51.559 --> 0:11:56.680
<v Speaker 2>Explain what real time click prediction rules are and how

0:11:56.720 --> 0:11:59.120
<v Speaker 2>it works for a keyword search.

0:11:59.280 --> 0:12:06.000
<v Speaker 1>Sure, the revenue engine that drives Google is search keyword ads, right,

0:12:06.040 --> 0:12:07.480
<v Speaker 1>So every time you do a search, at the top

0:12:07.520 --> 0:12:10.360
<v Speaker 1>you see ad ad AD, And so how do those

0:12:10.400 --> 0:12:14.520
<v Speaker 1>ads get there? Well, actually it's surprising maybe if you

0:12:14.520 --> 0:12:16.280
<v Speaker 1>don't know about it, but every single time you type

0:12:16.280 --> 0:12:19.240
<v Speaker 1>in a search term on Google and hit return, a

0:12:19.559 --> 0:12:23.280
<v Speaker 1>very fast auction takes place, and a whole bunch of

0:12:23.360 --> 0:12:28.840
<v Speaker 1>companies running software bid electronically to place their ads at

0:12:28.840 --> 0:12:33.360
<v Speaker 1>the top of your search results. And the more or

0:12:33.480 --> 0:12:36.600
<v Speaker 1>less the results that are shown on the page are

0:12:36.600 --> 0:12:38.760
<v Speaker 1>in order of how much they bid. It's not quite true,

0:12:38.760 --> 0:12:40.000
<v Speaker 1>but you could think of it. It's true.

0:12:40.360 --> 0:12:44.160
<v Speaker 2>A rough outline. So the first three sponsored results on

0:12:44.280 --> 0:12:48.320
<v Speaker 2>a Google page, go through that auction process, and I

0:12:48.360 --> 0:12:50.760
<v Speaker 2>think at this point everybody knows what page rank is

0:12:50.800 --> 0:12:53.800
<v Speaker 2>for for the rest of that that's right, And that

0:12:53.840 --> 0:12:56.080
<v Speaker 2>seemed to be Google secret sauce early on.

0:12:56.280 --> 0:13:01.360
<v Speaker 1>Right, Well, you know, to talk about the the ad placement.

0:13:01.520 --> 0:13:04.120
<v Speaker 1>So the people who are supplying the ad, who are

0:13:04.120 --> 0:13:06.200
<v Speaker 1>participating in the auctions, they have a problem, which is

0:13:06.200 --> 0:13:08.840
<v Speaker 1>how much to bid, right, And so how would you

0:13:08.880 --> 0:13:11.959
<v Speaker 1>decide how much to bid? Well, you want to know

0:13:13.120 --> 0:13:15.520
<v Speaker 1>basically the probability that somebody is going to click on

0:13:15.559 --> 0:13:19.040
<v Speaker 1>your ad, and then you would multiply that by how

0:13:19.120 --> 0:13:21.959
<v Speaker 1>much money you make eventually if they click. And that's

0:13:22.040 --> 0:13:24.840
<v Speaker 1>kind of an expectation of how much money you'll make.

0:13:25.200 --> 0:13:29.959
<v Speaker 1>And so then you gear your bid price to make

0:13:30.000 --> 0:13:32.319
<v Speaker 1>sure that it's going to be profitable for you. And

0:13:32.360 --> 0:13:36.000
<v Speaker 1>then so really you have to make a decision about

0:13:36.200 --> 0:13:38.040
<v Speaker 1>what this click through rate is going to be. You

0:13:38.040 --> 0:13:39.800
<v Speaker 1>have to predict the click through probability.

0:13:40.360 --> 0:13:42.480
<v Speaker 2>So I was going to say, this sounds like it's

0:13:42.520 --> 0:13:48.000
<v Speaker 2>a very sophisticated application of computer science probability and statistics.

0:13:48.520 --> 0:13:50.880
<v Speaker 2>And if you do it right, you make money, and

0:13:50.960 --> 0:13:54.360
<v Speaker 2>if you do it wrong, your ad budget is a

0:13:54.360 --> 0:13:55.400
<v Speaker 2>money loser.

0:13:55.160 --> 0:13:55.559
<v Speaker 1>That's right.

0:13:55.840 --> 0:13:58.520
<v Speaker 2>Huh. So tell us a little bit about your doctorate,

0:13:58.600 --> 0:14:01.920
<v Speaker 2>what you wrote about for your PhD at Berkeley.

0:14:02.240 --> 0:14:06.640
<v Speaker 1>Yeah, so we're back to genomes. Actually, this was around

0:14:06.679 --> 0:14:08.679
<v Speaker 1>the time when I was in my first year of

0:14:08.679 --> 0:14:11.560
<v Speaker 1>my PhD program, is when the human genome was published

0:14:11.960 --> 0:14:16.120
<v Speaker 1>in Nature. So it was kind of really the beginning

0:14:16.200 --> 0:14:20.720
<v Speaker 1>of the explosion of work on kind of high throughput,

0:14:21.240 --> 0:14:26.400
<v Speaker 1>large scale genetics research. And one really important question after

0:14:26.440 --> 0:14:28.600
<v Speaker 1>you've sequenced a genome is well, what are all the

0:14:28.640 --> 0:14:30.360
<v Speaker 1>bits of it doing. You can look at a string

0:14:30.400 --> 0:14:33.520
<v Speaker 1>of DNA. It's just made up of these kind of

0:14:33.520 --> 0:14:37.400
<v Speaker 1>four letters, but you don't want to just know the

0:14:37.440 --> 0:14:39.960
<v Speaker 1>four letters. They're kind of a code. And some parts

0:14:39.960 --> 0:14:43.920
<v Speaker 1>of the DNA represent useful stuff that is being turned

0:14:44.080 --> 0:14:47.720
<v Speaker 1>by your cell into proteins and et cetera, and other

0:14:47.840 --> 0:14:49.880
<v Speaker 1>parts of the DNA don't appear to have any function

0:14:49.920 --> 0:14:51.760
<v Speaker 1>at all, and it's really important to know which is

0:14:51.800 --> 0:14:56.000
<v Speaker 1>which as a biology researcher. And so it's you know,

0:14:56.040 --> 0:15:01.240
<v Speaker 1>for a long time before high throughput sequencing, biologists would

0:15:01.240 --> 0:15:03.320
<v Speaker 1>be in the lab and they would very laboriously look

0:15:03.360 --> 0:15:05.920
<v Speaker 1>at very tiny segments of DNA and establish what their

0:15:05.920 --> 0:15:08.960
<v Speaker 1>function was. But now we have the whole human genome

0:15:09.040 --> 0:15:10.880
<v Speaker 1>sitting on disk, and we would like to be able

0:15:10.920 --> 0:15:13.200
<v Speaker 1>to just run an analysis on it and have the

0:15:13.240 --> 0:15:16.760
<v Speaker 1>computer spit out everything that is functional and not functional.

0:15:17.760 --> 0:15:21.480
<v Speaker 1>And so that's the problem I worked on. And a

0:15:21.520 --> 0:15:24.400
<v Speaker 1>really important insight is that you can take advantage of

0:15:24.440 --> 0:15:28.600
<v Speaker 1>the idea of natural selection and the idea of evolution

0:15:29.120 --> 0:15:31.640
<v Speaker 1>to help you. And the way you do that is

0:15:32.160 --> 0:15:34.840
<v Speaker 1>you have the human genome, you sequence a bunch of

0:15:35.160 --> 0:15:38.800
<v Speaker 1>primate genomes nearby relatives of the union, and you lay

0:15:38.840 --> 0:15:41.760
<v Speaker 1>all those genomes on top of each other, and then

0:15:41.960 --> 0:15:45.800
<v Speaker 1>you look for places where all of the genomes agree. Right,

0:15:45.920 --> 0:15:50.080
<v Speaker 1>there hasn't been variation that's happening through mutations. And why

0:15:50.160 --> 0:15:53.080
<v Speaker 1>hasn't there been, Well, the biggest force that throws out

0:15:53.160 --> 0:15:56.440
<v Speaker 1>variation is natural selection. If you get a mutation in

0:15:56.480 --> 0:15:59.400
<v Speaker 1>a part of your genome that really matters, then you're

0:15:59.480 --> 0:16:02.640
<v Speaker 1>kind of on it and you won't have progeny and

0:16:02.680 --> 0:16:06.120
<v Speaker 1>that'll get stamped out. So natural selection is this very

0:16:06.120 --> 0:16:10.160
<v Speaker 1>strong force that's causing DNA not to change. And so

0:16:10.200 --> 0:16:13.160
<v Speaker 1>when you when you make these primate alignments, you can

0:16:13.360 --> 0:16:18.320
<v Speaker 1>really leverage that fact and look for conservation and use

0:16:18.360 --> 0:16:20.000
<v Speaker 1>that as a big signal that something is functional.

0:16:20.280 --> 0:16:25.160
<v Speaker 2>Huh, really really interesting. You mentioned our DNA is ninety

0:16:25.240 --> 0:16:28.640
<v Speaker 2>nine point ninety nine. I don't know how many places

0:16:28.640 --> 0:16:30.200
<v Speaker 2>to the right of the decimal point you would want

0:16:30.200 --> 0:16:34.560
<v Speaker 2>to go, but very similar. How how similar or different

0:16:34.720 --> 0:16:38.360
<v Speaker 2>are we from let's say, a chimpanzee. I've always questioned,

0:16:38.400 --> 0:16:41.680
<v Speaker 2>there's an urban legend that they're practically the same. It

0:16:41.680 --> 0:16:46.680
<v Speaker 2>always seems like it's overstated two percent. So you and

0:16:46.720 --> 0:16:49.680
<v Speaker 2>I have a point one percent different me and the

0:16:49.720 --> 0:16:52.280
<v Speaker 2>average chimp. It's two point zero percent.

0:16:52.440 --> 0:16:55.800
<v Speaker 1>That's exactly right. Yeah, so chimps are essentially our closest

0:16:56.360 --> 0:16:57.720
<v Speaker 1>non human primate relatives.

0:16:58.320 --> 0:17:02.280
<v Speaker 2>Really really quite fascinating. So let's talk a little bit

0:17:02.320 --> 0:17:05.160
<v Speaker 2>about the firm. You guys were one of the earliest

0:17:05.160 --> 0:17:09.040
<v Speaker 2>pioneers of machine learning research. Explain a little bit what

0:17:09.119 --> 0:17:09.840
<v Speaker 2>the firm does.

0:17:10.880 --> 0:17:16.439
<v Speaker 1>Sure, so, we run trading strategies investment strategies that are

0:17:16.640 --> 0:17:20.320
<v Speaker 1>fully automated, so we call them fully systematic, and that

0:17:20.400 --> 0:17:24.760
<v Speaker 1>means that we have software systems that run every day

0:17:25.440 --> 0:17:29.800
<v Speaker 1>during market hours, and they take in information about the

0:17:29.880 --> 0:17:34.400
<v Speaker 1>characteristics of the securities we're trading. Think of stocks and

0:17:34.440 --> 0:17:39.800
<v Speaker 1>then they make predictions of how the prices of each

0:17:39.960 --> 0:17:43.400
<v Speaker 1>security is going to change over time, and then they

0:17:44.480 --> 0:17:47.600
<v Speaker 1>decide on changes in our inventory, changes in held positions

0:17:48.280 --> 0:17:53.000
<v Speaker 1>based on those predictions, and then those desired changes are

0:17:53.040 --> 0:17:56.360
<v Speaker 1>sent into an execution system which automatically carries them out.

0:17:56.960 --> 0:18:02.680
<v Speaker 2>So fully automated. Is there supervision or it's kind of

0:18:02.760 --> 0:18:05.040
<v Speaker 2>running on its own with a couple of checks.

0:18:05.119 --> 0:18:09.119
<v Speaker 1>There's lots of human diagnostic supervision, right, So there are

0:18:09.160 --> 0:18:14.760
<v Speaker 1>people who are watching screens full of instrumentation and telemetry

0:18:14.840 --> 0:18:17.720
<v Speaker 1>about what the systems are doing. But those people are

0:18:17.760 --> 0:18:21.240
<v Speaker 1>not taking any actions, right unless there's a problem, right,

0:18:21.320 --> 0:18:23.000
<v Speaker 1>and then they do.

0:18:23.480 --> 0:18:26.120
<v Speaker 2>So let's talk a little bit about how machines learn

0:18:26.200 --> 0:18:30.280
<v Speaker 2>to identify signals. I'm assuming you start with the giant

0:18:30.359 --> 0:18:35.320
<v Speaker 2>database that is the history of stock prices, volume movement, etc.

0:18:36.200 --> 0:18:38.840
<v Speaker 2>And then bring in a lot of additional things to bear.

0:18:39.520 --> 0:18:44.439
<v Speaker 2>What's the process like developing a particular trading strategy.

0:18:44.960 --> 0:18:48.760
<v Speaker 1>Yeah, so, as you're saying, we begin with a very

0:18:48.840 --> 0:18:54.160
<v Speaker 1>large historical data set of prices and volumes, market data

0:18:54.200 --> 0:18:59.440
<v Speaker 1>that kind, but importantly all kinds of other information about securities,

0:19:00.000 --> 0:19:04.760
<v Speaker 1>financial statement data, textual data, analyst data.

0:19:05.080 --> 0:19:11.000
<v Speaker 2>So it's everything from prices fundamental everything from learnings to

0:19:11.080 --> 0:19:14.080
<v Speaker 2>revenue to sales, etc. I'm assuming the change and the

0:19:14.680 --> 0:19:17.159
<v Speaker 2>delta of the change is going to be very significant

0:19:17.200 --> 0:19:22.000
<v Speaker 2>in that. What about macroeconomic what some people call noise,

0:19:22.119 --> 0:19:27.159
<v Speaker 2>but one would imagine some signal in everything from inflation

0:19:27.359 --> 0:19:31.840
<v Speaker 2>to interest rates to GDPs firm spending. Are those inputs

0:19:32.280 --> 0:19:34.200
<v Speaker 2>worthwhile or how do you think about those?

0:19:34.560 --> 0:19:38.640
<v Speaker 1>So we don't hold portfolios that are exposed to those things.

0:19:38.760 --> 0:19:42.320
<v Speaker 1>So it's really a business decision on our part. We

0:19:42.440 --> 0:19:47.200
<v Speaker 1>are working with institutional investors who already have as much

0:19:47.240 --> 0:19:50.200
<v Speaker 1>exposure as they want to things like the market or

0:19:50.520 --> 0:19:56.040
<v Speaker 1>to well recognized econometric risk factors like value, and so

0:19:56.080 --> 0:19:58.680
<v Speaker 1>they don't need our help to be exposed to those things.

0:19:58.680 --> 0:20:01.439
<v Speaker 1>They are very well equipped to handle that part of

0:20:01.480 --> 0:20:05.440
<v Speaker 1>their investment process. What we're trying to provide is the

0:20:05.480 --> 0:20:09.000
<v Speaker 1>most diversification possible. So we want to give them a

0:20:09.040 --> 0:20:14.000
<v Speaker 1>new return stream which has good and stable returns, but

0:20:14.160 --> 0:20:17.199
<v Speaker 1>on top of that, importantly, is also not correlated with

0:20:17.240 --> 0:20:19.639
<v Speaker 1>any of the other return streams that they already that

0:20:19.680 --> 0:20:20.280
<v Speaker 1>they already have.

0:20:20.480 --> 0:20:25.040
<v Speaker 2>That's interesting. So can I assume that you're applying your

0:20:25.359 --> 0:20:29.359
<v Speaker 2>machine learning methodology across different asset classes or is it

0:20:29.400 --> 0:20:30.560
<v Speaker 2>strictly equities? Oh?

0:20:30.600 --> 0:20:34.639
<v Speaker 1>No, We apply it to UH to equities, to credit,

0:20:34.720 --> 0:20:39.840
<v Speaker 1>to corporate bonds, and we trade futures contracts, and in

0:20:39.880 --> 0:20:41.520
<v Speaker 1>the fullness of time, we hope that we will be

0:20:41.560 --> 0:20:44.320
<v Speaker 1>trading kind of every security in the world.

0:20:44.359 --> 0:20:47.159
<v Speaker 2>So, so currently stocks, bonds, When you say futures, I

0:20:47.200 --> 0:20:48.440
<v Speaker 2>assume commodities, all.

0:20:48.400 --> 0:20:49.320
<v Speaker 1>Kinds of futures contract.

0:20:49.359 --> 0:20:52.399
<v Speaker 2>It's really really interesting. So it could be anything from

0:20:52.640 --> 0:20:56.560
<v Speaker 2>interest rate swaps to commodities to the full gamut. So,

0:20:56.880 --> 0:21:01.480
<v Speaker 2>so how different is this approach from what other quant

0:21:01.600 --> 0:21:05.480
<v Speaker 2>shops do that really focus on equities.

0:21:06.800 --> 0:21:11.280
<v Speaker 1>I think it's kind of the same question as asking, well,

0:21:11.400 --> 0:21:13.119
<v Speaker 1>what do we mean when we say we use machine

0:21:13.160 --> 0:21:16.480
<v Speaker 1>learning or that you know we are our principles are

0:21:16.520 --> 0:21:20.520
<v Speaker 1>our machine learning principles, and so how does that make

0:21:20.600 --> 0:21:24.159
<v Speaker 1>us different than the kind of standard approach in quantitative trading?

0:21:24.840 --> 0:21:28.000
<v Speaker 1>And the answer to the question really comes back to

0:21:28.000 --> 0:21:31.720
<v Speaker 1>this idea we mentioned a little while ago of how

0:21:31.760 --> 0:21:36.200
<v Speaker 1>powerful the tools are that you're using to form predictions. Right,

0:21:36.480 --> 0:21:40.199
<v Speaker 1>So in our business, the thing that we build is

0:21:40.200 --> 0:21:44.040
<v Speaker 1>called a prediction rule. Okay, that's that's our widget and

0:21:44.320 --> 0:21:46.760
<v Speaker 1>What a prediction rule does is it takes in a

0:21:46.760 --> 0:21:49.560
<v Speaker 1>bunch of input, a bunch of information about a stock

0:21:49.880 --> 0:21:53.600
<v Speaker 1>at a moment in time, and it hands you a

0:21:53.760 --> 0:21:56.080
<v Speaker 1>guess about how that stock's price is going to change

0:21:56.200 --> 0:22:00.840
<v Speaker 1>over some future period of time. Okay, and so there

0:22:00.920 --> 0:22:05.320
<v Speaker 1>is one most important question about prediction rules, which is

0:22:05.480 --> 0:22:07.840
<v Speaker 1>how complex are they? How much complexity do they have?

0:22:08.400 --> 0:22:13.000
<v Speaker 1>Complexity is a colloquial term. It's unfortunately another example of

0:22:13.600 --> 0:22:16.879
<v Speaker 1>a place where things can be vague or ambiguous because

0:22:18.080 --> 0:22:21.359
<v Speaker 1>a general purpose word has been borrowed in a technical setting.

0:22:21.520 --> 0:22:24.399
<v Speaker 1>But when you use the word complexity in statistical prediction,

0:22:24.720 --> 0:22:28.800
<v Speaker 1>there's a very specific meaning. It means how much expressive

0:22:28.840 --> 0:22:32.280
<v Speaker 1>power does this prediction rule have? How good a job

0:22:32.440 --> 0:22:35.280
<v Speaker 1>can it do of approximating what's going on in the

0:22:35.359 --> 0:22:38.200
<v Speaker 1>data you show it. Remember, we have these giant historical

0:22:38.280 --> 0:22:41.760
<v Speaker 1>data sets, and every entry in the data set looks

0:22:41.800 --> 0:22:44.520
<v Speaker 1>like this. What was going on with the stock at

0:22:44.520 --> 0:22:47.440
<v Speaker 1>a moment in a certain moment in time, it's price action,

0:22:47.680 --> 0:22:52.080
<v Speaker 1>it's financials analyst information. And then what did its price

0:22:52.160 --> 0:22:55.040
<v Speaker 1>do in the subsequent twenty four hours or the subsequent

0:22:55.160 --> 0:23:01.000
<v Speaker 1>fifteen minutes or whatever. Okay, and so when you talk

0:23:01.040 --> 0:23:04.240
<v Speaker 1>about the amount of complexity that a prediction rule has,

0:23:04.720 --> 0:23:08.280
<v Speaker 1>that means how well is it able to capture the

0:23:08.320 --> 0:23:11.000
<v Speaker 1>relationship between the things that you can show it when

0:23:11.040 --> 0:23:13.840
<v Speaker 1>you ask it for a prediction, and what actually happens

0:23:14.040 --> 0:23:18.159
<v Speaker 1>to the price. And naturally you kind of want to

0:23:18.720 --> 0:23:20.840
<v Speaker 1>use high complexity rules because they have a lot of

0:23:20.880 --> 0:23:23.440
<v Speaker 1>approximating power. They do a good job of describing anything

0:23:23.480 --> 0:23:26.920
<v Speaker 1>that's going on. But there are two There are two

0:23:26.960 --> 0:23:30.639
<v Speaker 1>disadvantages to high complexity. One is it needs a lot

0:23:30.680 --> 0:23:34.639
<v Speaker 1>of data, otherwise it gets fooled into thinking that randomness

0:23:34.680 --> 0:23:39.000
<v Speaker 1>is actually signal. And the other is that it's hard

0:23:39.000 --> 0:23:41.880
<v Speaker 1>to reason about what's going on under the hood. Right,

0:23:41.960 --> 0:23:45.080
<v Speaker 1>when you have very simple prediction rules, you can sort

0:23:45.119 --> 0:23:48.040
<v Speaker 1>of summarize everything that's good that they're doing in a sentence. Right,

0:23:48.119 --> 0:23:50.920
<v Speaker 1>you can look you can look inside them and get

0:23:50.920 --> 0:23:53.840
<v Speaker 1>a complete understanding of how they behave, and that's not

0:23:53.880 --> 0:23:55.960
<v Speaker 1>possible with high complexity prediction rules.

0:23:56.040 --> 0:23:59.440
<v Speaker 2>So I'm glad you brought up the concept of how

0:23:59.520 --> 0:24:04.159
<v Speaker 2>easy it is or how frequently you can fool an

0:24:04.200 --> 0:24:08.000
<v Speaker 2>algorithm or a complex rule, because sometimes the results are

0:24:08.040 --> 0:24:11.119
<v Speaker 2>just random. And it reminds me of the issue of

0:24:11.880 --> 0:24:14.960
<v Speaker 2>back testing. No one ever shows you a bad back test.

0:24:15.480 --> 0:24:19.480
<v Speaker 2>How do you deal with the issue of overfitting and

0:24:19.720 --> 0:24:23.479
<v Speaker 2>back testing that just is geared towards what already happened

0:24:23.520 --> 0:24:25.440
<v Speaker 2>and not what might happen in the future.

0:24:25.680 --> 0:24:28.960
<v Speaker 1>Yeah, that is you know, if you like the million

0:24:28.960 --> 0:24:34.639
<v Speaker 1>dollar question in statistical prediction, Okay, and it might you

0:24:34.720 --> 0:24:38.840
<v Speaker 1>might find it surprising that relatively straightforward ideas go a

0:24:38.840 --> 0:24:43.280
<v Speaker 1>long way here. And so let me let me just

0:24:43.359 --> 0:24:45.399
<v Speaker 1>describe a little scenario of how you deal you can

0:24:45.440 --> 0:24:47.560
<v Speaker 1>deal with this. All right, we agree, we have this

0:24:47.600 --> 0:24:50.840
<v Speaker 1>big historical data set, right, One thing you could do

0:24:50.920 --> 0:24:53.280
<v Speaker 1>is just start analyzing the heck out of that data

0:24:53.320 --> 0:24:56.920
<v Speaker 1>set and find a complicated prediction rule. But you're you've

0:24:56.920 --> 0:24:59.640
<v Speaker 1>already started doing it wrong. The first thing you do

0:24:59.760 --> 0:25:02.239
<v Speaker 1>before or you even look at the data is you

0:25:02.560 --> 0:25:04.520
<v Speaker 1>randomly pick out half of the data and you lock

0:25:04.560 --> 0:25:07.280
<v Speaker 1>it in a drawer. Okay, and that leads you with

0:25:07.359 --> 0:25:09.280
<v Speaker 1>the other half of the data that you haven't locked away.

0:25:09.800 --> 0:25:12.119
<v Speaker 1>On this half, you get to go hogwild. You build

0:25:12.200 --> 0:25:16.399
<v Speaker 1>every kind of prediction rule, simple rules, enormously complicated rules,

0:25:16.480 --> 0:25:20.640
<v Speaker 1>everything in between. Right, and now you can check how

0:25:20.960 --> 0:25:23.520
<v Speaker 1>accurate all of these prediction rules that you've built are

0:25:24.400 --> 0:25:26.800
<v Speaker 1>on the data that they have been looking at, and

0:25:26.840 --> 0:25:29.200
<v Speaker 1>the answer will always be the same. The most complex

0:25:29.280 --> 0:25:32.000
<v Speaker 1>rules will look the best. Of course, they have the

0:25:32.040 --> 0:25:35.360
<v Speaker 1>most expressive power, so naturally they do the best job

0:25:35.359 --> 0:25:38.600
<v Speaker 1>of describing what you showed them. The big problem is

0:25:38.960 --> 0:25:41.040
<v Speaker 1>that what you showed them is a mix of signal

0:25:41.080 --> 0:25:43.960
<v Speaker 1>and noise, and there's no way you can tell to

0:25:44.080 --> 0:25:47.080
<v Speaker 1>what extent a complex rule has found the signal versus

0:25:47.119 --> 0:25:49.400
<v Speaker 1>the noise. All you know is that it's perfectly described

0:25:49.440 --> 0:25:51.960
<v Speaker 1>to the data you showed it. You certainly suspect it

0:25:52.000 --> 0:25:55.520
<v Speaker 1>must be overfitting if it's doing that. Well, okay, so

0:25:55.640 --> 0:25:59.200
<v Speaker 1>now you freeze all those prediction rules. You're not allowed

0:25:59.200 --> 0:26:01.560
<v Speaker 1>to change them in any way anymore. And now you

0:26:01.640 --> 0:26:04.120
<v Speaker 1>unlock the drawer and you pull out all that data

0:26:04.160 --> 0:26:06.520
<v Speaker 1>that you've never looked at. You can't overfit data that

0:26:06.560 --> 0:26:09.840
<v Speaker 1>you never fit, and so you take that data and

0:26:09.880 --> 0:26:12.359
<v Speaker 1>you run it through each of these prediction rules that's frozen,

0:26:12.359 --> 0:26:14.880
<v Speaker 1>that you built. And now it is not the case

0:26:14.920 --> 0:26:18.760
<v Speaker 1>at all that the most complex rules look the best. Instead,

0:26:19.119 --> 0:26:23.080
<v Speaker 1>you'll see a kind of U shaped behavior where the

0:26:23.200 --> 0:26:25.760
<v Speaker 1>very simple rules are too simple. They've missed signal, they

0:26:25.840 --> 0:26:29.120
<v Speaker 1>left signal on the table. The two complex rules are

0:26:29.280 --> 0:26:32.320
<v Speaker 1>also doing badly because they've captured all the signal but

0:26:32.320 --> 0:26:34.440
<v Speaker 1>also lots of noise. And then somewhere in the middle

0:26:34.520 --> 0:26:37.840
<v Speaker 1>is a sweet spot where you've struck the right trade

0:26:37.840 --> 0:26:42.720
<v Speaker 1>off between how much expressive power the prediction rule has

0:26:43.000 --> 0:26:45.439
<v Speaker 1>and how good a job it is doing of avoiding

0:26:46.440 --> 0:26:49.080
<v Speaker 1>the mistaking of noise for signal.

0:26:49.280 --> 0:26:53.000
<v Speaker 2>Really really intriguing. So you guys, have you've built one

0:26:53.000 --> 0:26:57.399
<v Speaker 2>of the largest specialized machine learning research and development teams

0:26:57.480 --> 0:27:01.080
<v Speaker 2>in finance. How do you assndle a team like that

0:27:02.359 --> 0:27:05.720
<v Speaker 2>and how do you get the brain trust to do

0:27:05.800 --> 0:27:09.680
<v Speaker 2>the sort of work that's applicable to managing assets.

0:27:10.680 --> 0:27:13.879
<v Speaker 1>Well, the short answer is, we spend a huge amount

0:27:14.040 --> 0:27:20.160
<v Speaker 1>of energy on recruiting and uh, you know, identifying the

0:27:20.200 --> 0:27:23.280
<v Speaker 1>sort of premier people in the field of machine learning,

0:27:23.440 --> 0:27:28.480
<v Speaker 1>kind of both academic and practitioners, and we exhibit a

0:27:28.480 --> 0:27:30.760
<v Speaker 1>lot of patients. We we wait a really long time

0:27:31.280 --> 0:27:33.400
<v Speaker 1>to be able to find the people who are kind

0:27:33.400 --> 0:27:36.919
<v Speaker 1>of really the best, and that that that matters enormously

0:27:37.080 --> 0:27:40.399
<v Speaker 1>to us, both from the standpoint of the success of

0:27:40.400 --> 0:27:43.840
<v Speaker 1>the firm and also because it's something that you know,

0:27:43.880 --> 0:27:47.240
<v Speaker 1>we value extremely highly just having great colleagues, brilliant colleagues

0:27:47.240 --> 0:27:49.120
<v Speaker 1>that you know, I want to work in a place

0:27:49.119 --> 0:27:51.720
<v Speaker 1>where I can learn from all the people around me.

0:27:51.920 --> 0:27:55.520
<v Speaker 1>And you know, when when my co founder Michael Caratanev

0:27:55.520 --> 0:28:01.399
<v Speaker 1>and I we're talking about starting Bollion, one of the

0:28:01.440 --> 0:28:04.359
<v Speaker 1>reasons that was on our minds is we wanted to

0:28:04.520 --> 0:28:07.639
<v Speaker 1>be in control of who we worked with. You know,

0:28:07.680 --> 0:28:10.640
<v Speaker 1>we really wanted to be able to assemble a group

0:28:10.680 --> 0:28:14.359
<v Speaker 1>of people who were, you know, as brilliant as we

0:28:14.400 --> 0:28:17.359
<v Speaker 1>could find, but also you know, good people, people that

0:28:17.520 --> 0:28:19.640
<v Speaker 1>we liked, people that we were excited to collaborate with.

0:28:20.000 --> 0:28:22.960
<v Speaker 2>So let's talk about some of the fundamental principles Volnon

0:28:23.160 --> 0:28:29.440
<v Speaker 2>is built on. You reference a prediction based approach from

0:28:29.480 --> 0:28:33.840
<v Speaker 2>a paper Leo Briman wrote called two Cultures. Yeah, tell

0:28:33.920 --> 0:28:37.000
<v Speaker 2>us a little bit about what two cultures actually is.

0:28:37.200 --> 0:28:40.600
<v Speaker 1>Yeah. So this this paper was written about twenty years ago.

0:28:41.080 --> 0:28:45.400
<v Speaker 1>Leo Briman was one of the great probabilists and statisticians

0:28:46.360 --> 0:28:53.760
<v Speaker 1>of his generation. Berkeley professor need I say, and you know,

0:28:53.880 --> 0:28:59.280
<v Speaker 1>Leo had been a practitioner in statistical consulting actually for

0:28:59.400 --> 0:29:02.840
<v Speaker 1>quite some time. I'm in between a U. C. L.

0:29:02.880 --> 0:29:06.880
<v Speaker 1>A tenured job and returning to academia at Berkeley, and

0:29:07.080 --> 0:29:10.600
<v Speaker 1>he learned a lot in that time about actually solving

0:29:10.640 --> 0:29:15.880
<v Speaker 1>prediction problems and instead of hypothetically solving them in sort

0:29:15.880 --> 0:29:20.800
<v Speaker 1>of the academic context. And so all of his insights

0:29:20.840 --> 0:29:25.160
<v Speaker 1>about the difference really culminated in this paper from two

0:29:25.200 --> 0:29:26.200
<v Speaker 1>thousand that he wrote.

0:29:26.160 --> 0:29:30.000
<v Speaker 2>The difference between practical use versus academic theory if you like.

0:29:30.160 --> 0:29:35.000
<v Speaker 1>Yeah, And so he identified two schools of thought about

0:29:35.000 --> 0:29:40.400
<v Speaker 1>solving prediction problems, right, and one school is sort of

0:29:40.920 --> 0:29:44.120
<v Speaker 1>model based. The idea is there's some stuff you're going

0:29:44.200 --> 0:29:49.080
<v Speaker 1>to get to observe stock characteristics. Let's say there's a

0:29:49.080 --> 0:29:51.840
<v Speaker 1>thing you wish you knew future price change, let's say,

0:29:51.880 --> 0:29:55.280
<v Speaker 1>and there's a box in nature that turns those inputs

0:29:55.280 --> 0:29:59.840
<v Speaker 1>into the output, right. And in the model based school

0:29:59.880 --> 0:30:03.400
<v Speaker 1>of thought, you try to open that box, reason about

0:30:03.400 --> 0:30:06.440
<v Speaker 1>how it must work, make theories. In our case, these

0:30:06.440 --> 0:30:11.440
<v Speaker 1>would be sort of econometric theories, financial economics theories. And

0:30:11.480 --> 0:30:14.640
<v Speaker 1>then those theories have knobs, not many, and you use

0:30:14.760 --> 0:30:17.560
<v Speaker 1>data to set the knobs, but otherwise you believe the model.

0:30:19.200 --> 0:30:22.520
<v Speaker 1>And he contrasts that with the machine learning school of thought,

0:30:22.640 --> 0:30:27.840
<v Speaker 1>which is also has the idea of Nature's box. The

0:30:27.920 --> 0:30:30.360
<v Speaker 1>inputs go in, the thing you wish you knew comes out.

0:30:30.840 --> 0:30:32.920
<v Speaker 1>But in machine learning, you don't try to open the box.

0:30:33.280 --> 0:30:35.600
<v Speaker 1>You just try to approximate what the box is doing.

0:30:36.120 --> 0:30:39.800
<v Speaker 1>And your measure of success is predictive accuracy, and is

0:30:39.880 --> 0:30:43.240
<v Speaker 1>only predictive accuracy. If you build a gadget and that

0:30:43.320 --> 0:30:47.400
<v Speaker 1>gadget produces predictions that are really accurate they turn out

0:30:47.440 --> 0:30:50.680
<v Speaker 1>to look like the thing that nature produces, then that

0:30:50.800 --> 0:30:55.080
<v Speaker 1>is success. And at the time he wrote the paper,

0:30:55.200 --> 0:30:58.960
<v Speaker 1>his assessment was ninety eight percent of statistics was taking

0:30:58.960 --> 0:31:01.560
<v Speaker 1>the model based approach, two percent was taking the machine

0:31:01.600 --> 0:31:02.200
<v Speaker 1>learning approach.

0:31:02.600 --> 0:31:05.520
<v Speaker 2>And are those statistics still valid today or have we

0:31:05.600 --> 0:31:06.440
<v Speaker 2>shifted quite a bit?

0:31:06.520 --> 0:31:10.520
<v Speaker 1>We shifted quite a bit, And different different arenas of

0:31:11.680 --> 0:31:16.160
<v Speaker 1>prediction problems have different mixes these days. But even in finance,

0:31:16.200 --> 0:31:19.120
<v Speaker 1>I would say it's it's probably more like fifty to.

0:31:19.120 --> 0:31:20.960
<v Speaker 2>Fifty really that much?

0:31:21.240 --> 0:31:26.200
<v Speaker 1>Yeah, I think you know, And if you the logical

0:31:26.240 --> 0:31:31.760
<v Speaker 1>extreme is natural language modeling, which was done for decades

0:31:31.760 --> 0:31:35.080
<v Speaker 1>and decades in the model based approach, where you kind

0:31:35.080 --> 0:31:39.120
<v Speaker 1>of reasoned about linguistic characteristics of how people kind of

0:31:39.240 --> 0:31:42.800
<v Speaker 1>do dialogue and those models had some parameters and you

0:31:42.840 --> 0:31:47.080
<v Speaker 1>fit them with data, and then instead you have, as

0:31:47.080 --> 0:31:50.680
<v Speaker 1>we said, a database of a trillion words and a

0:31:50.720 --> 0:31:53.160
<v Speaker 1>tool with one hundred and seventy five billion parameters, and

0:31:53.200 --> 0:31:56.600
<v Speaker 1>you run that and there is no hope of completely

0:31:56.680 --> 0:31:59.640
<v Speaker 1>understanding what is going on inside of GPD three. But

0:31:59.680 --> 0:32:04.080
<v Speaker 1>nobody complains about that because the results are astounding. The

0:32:04.120 --> 0:32:07.479
<v Speaker 1>thing that you get is incredible. And so that is

0:32:08.320 --> 0:32:12.320
<v Speaker 1>by analogy, the way that we reason about running systematic

0:32:12.360 --> 0:32:16.640
<v Speaker 1>investment strategies. At the end of the day, predictive accuracy

0:32:16.840 --> 0:32:20.840
<v Speaker 1>is what creates returns for investors. Being able to give

0:32:21.760 --> 0:32:25.239
<v Speaker 1>complete descriptions of exactly how the predictions arise does not

0:32:25.400 --> 0:32:29.040
<v Speaker 1>in itself create returns for investors. Now, I'm not against

0:32:29.080 --> 0:32:32.800
<v Speaker 1>interpretability and simplicity all equal. I love interpretability and simplicity,

0:32:32.960 --> 0:32:35.719
<v Speaker 1>but all else is not equal. If you want the

0:32:35.760 --> 0:32:39.520
<v Speaker 1>most accurate predictions, you are going to have to sacrifice

0:32:39.560 --> 0:32:43.160
<v Speaker 1>some amount of simplicity. In fact, this truth is so

0:32:43.280 --> 0:32:45.400
<v Speaker 1>widespread that Leo gave it a name in his paper.

0:32:45.440 --> 0:32:49.080
<v Speaker 1>He called it Accam's dilemma. So Accam's razor is the

0:32:49.080 --> 0:32:52.240
<v Speaker 1>philosophical idea that you should choose the simplest explanation that

0:32:52.320 --> 0:32:58.360
<v Speaker 1>fits the facts. Akam's dilemma is the point that in

0:32:58.400 --> 0:33:02.560
<v Speaker 1>statistical prediction, simplest approach, even though you wish you could

0:33:02.600 --> 0:33:04.840
<v Speaker 1>choose it, is not the most accurate approach if you

0:33:04.880 --> 0:33:08.600
<v Speaker 1>care about predictive accuracy. If you're putting predictive accuracy first,

0:33:09.120 --> 0:33:12.160
<v Speaker 1>then you have to embrace a certain amount of complexity

0:33:12.400 --> 0:33:13.640
<v Speaker 1>and lack of interpretability.

0:33:13.960 --> 0:33:17.960
<v Speaker 2>Huh, that's really quite fascinating. So let's talk a little

0:33:17.960 --> 0:33:24.600
<v Speaker 2>bit about artificial intelligence and large language models. You follow

0:33:24.680 --> 0:33:29.280
<v Speaker 2>d Shaw playing in e commerce and biotech. It seems

0:33:29.400 --> 0:33:34.920
<v Speaker 2>like this approach to using statistics, probability, and computer science

0:33:35.960 --> 0:33:38.240
<v Speaker 2>is applicable to so many different fields.

0:33:38.640 --> 0:33:42.760
<v Speaker 1>It is. Yeah, I think you're talking about prediction problems ultimately,

0:33:43.000 --> 0:33:48.920
<v Speaker 1>So in recommender systems, you can think of the question

0:33:49.000 --> 0:33:53.160
<v Speaker 1>as being well, if I had to predict what thing

0:33:53.200 --> 0:33:55.920
<v Speaker 1>I could show a person that would you be most

0:33:56.080 --> 0:33:59.600
<v Speaker 1>likely to change their behavior and cause them to buy it.

0:34:00.000 --> 0:34:06.120
<v Speaker 1>It's a kind of prediction problem that motivates recommendations. In biotechnology.

0:34:07.440 --> 0:34:12.120
<v Speaker 1>Very often we are trying to make predictions about whether someone,

0:34:12.440 --> 0:34:15.600
<v Speaker 1>let's say, does or doesn't have a condition a disease

0:34:15.840 --> 0:34:19.799
<v Speaker 1>based on lots of information we can gather from high

0:34:19.880 --> 0:34:25.879
<v Speaker 1>throughput diagnostic techniques. These days, the keyword in biology and

0:34:25.960 --> 0:34:30.319
<v Speaker 1>in medicine and biotechnology is high throughput. You're running analyses

0:34:30.680 --> 0:34:34.520
<v Speaker 1>on an individual that are producing hundreds of thousands of numbers,

0:34:35.360 --> 0:34:37.680
<v Speaker 1>and you want to be able to take all of

0:34:37.719 --> 0:34:40.520
<v Speaker 1>that kind of wealth of data and turn it into

0:34:40.560 --> 0:34:42.400
<v Speaker 1>diagnostic information about.

0:34:42.160 --> 0:34:47.759
<v Speaker 2>And we've seen AI get applied to pharmaceutical development in

0:34:47.840 --> 0:34:52.120
<v Speaker 2>ways that people just never really could have imagined just

0:34:52.200 --> 0:34:54.879
<v Speaker 2>a few short years ago. Is there a field that

0:34:55.040 --> 0:34:57.879
<v Speaker 2>AI and large language models are not going to touch

0:34:58.280 --> 0:34:59.920
<v Speaker 2>or is this just the future of everything.

0:35:01.800 --> 0:35:04.160
<v Speaker 1>The kinds of fields where you would expect uptake to

0:35:04.200 --> 0:35:10.040
<v Speaker 1>be slow are where it is hard to assemble large

0:35:10.120 --> 0:35:15.760
<v Speaker 1>data sets of systematically gathered data. And so any field

0:35:15.800 --> 0:35:20.959
<v Speaker 1>where it's relatively easy to at large scale, let's say,

0:35:20.960 --> 0:35:23.279
<v Speaker 1>produce the kinds of the same kinds of informations that

0:35:23.920 --> 0:35:26.520
<v Speaker 1>experts are using to make their decisions, you should expect

0:35:26.520 --> 0:35:29.160
<v Speaker 1>that field to be impacted by these tools if it

0:35:29.160 --> 0:35:29.920
<v Speaker 1>hasn't been already.

0:35:30.000 --> 0:35:33.200
<v Speaker 2>So you're kind of answering my next question, which is

0:35:33.680 --> 0:35:36.719
<v Speaker 2>what led you back to investment management. But it seems

0:35:37.040 --> 0:35:40.560
<v Speaker 2>if there's any field that just generates endless amounts of data.

0:35:40.840 --> 0:35:44.200
<v Speaker 1>It's the markets, that's true. And I had already been

0:35:44.960 --> 0:35:48.879
<v Speaker 1>really interested in the problems of systematic investment strategies from

0:35:48.920 --> 0:35:52.160
<v Speaker 1>my time working in d SHAW, and so my co

0:35:52.280 --> 0:35:56.200
<v Speaker 1>founder Michael Kratanav and I, you know, we were both

0:35:56.200 --> 0:35:59.399
<v Speaker 1>in the Bay Area in the two thousand and four.

0:36:02.120 --> 0:36:04.280
<v Speaker 1>He was there because of a firm that he had founded,

0:36:04.320 --> 0:36:06.960
<v Speaker 1>and I was there finishing my PhD. And we started

0:36:06.960 --> 0:36:10.360
<v Speaker 1>to talk about the idea of using contemporary machine learning

0:36:10.360 --> 0:36:14.480
<v Speaker 1>methods to build strategies that would be, you know, really

0:36:14.480 --> 0:36:19.600
<v Speaker 1>different from strategies that result from classical techniques. And we

0:36:19.640 --> 0:36:21.400
<v Speaker 1>had met at d SHAW in the nineties and been

0:36:21.480 --> 0:36:26.640
<v Speaker 1>less excited about this idea because the methods were pretty immature.

0:36:27.000 --> 0:36:29.719
<v Speaker 1>There wasn't actually a giant diversity of data back in

0:36:29.760 --> 0:36:33.200
<v Speaker 1>the nineties in financial markets, not like there was in

0:36:33.239 --> 0:36:36.719
<v Speaker 1>two thousand and five. And compute was really still quite

0:36:36.719 --> 0:36:39.359
<v Speaker 1>expensive in the nineties, whereas in two thousand and five,

0:36:40.000 --> 0:36:42.440
<v Speaker 1>you know, it had been dropping in the usual More's

0:36:42.520 --> 0:36:45.560
<v Speaker 1>Law way. And this was even before GPUs, and so

0:36:45.840 --> 0:36:47.879
<v Speaker 1>when we looked at the problem in two thousand and five,

0:36:48.520 --> 0:36:52.960
<v Speaker 1>it felt like there was a very live opportunity to

0:36:53.040 --> 0:36:56.120
<v Speaker 1>do something with a lot of promise that would be

0:36:56.200 --> 0:36:59.680
<v Speaker 1>really different. And we had the sense that not a

0:36:59.680 --> 0:37:02.480
<v Speaker 1>lot of people were of the same opinion, and so

0:37:02.520 --> 0:37:04.680
<v Speaker 1>it seemed like something that we should try.

0:37:04.880 --> 0:37:08.240
<v Speaker 2>That there was a void. Nothing nothing in the market

0:37:08.239 --> 0:37:12.160
<v Speaker 2>hates more than a vacuum and intellectual approach. So so

0:37:12.560 --> 0:37:17.879
<v Speaker 2>you mentioned the diversity of various data sources. What what

0:37:18.000 --> 0:37:21.520
<v Speaker 2>don't you consider, like how how far off of price

0:37:21.560 --> 0:37:25.280
<v Speaker 2>and volume do you go in the net you're casting

0:37:25.440 --> 0:37:28.040
<v Speaker 2>for inputs into into your systems.

0:37:29.000 --> 0:37:33.440
<v Speaker 1>Well, I think we're prepared as a you know, as

0:37:33.480 --> 0:37:36.919
<v Speaker 1>a as a research principle, we're prepared to consider any

0:37:37.120 --> 0:37:41.160
<v Speaker 1>data that has some bearing on price formation, like some

0:37:41.160 --> 0:37:44.560
<v Speaker 1>some plausible bearing on how prices are formed. Now, of

0:37:44.600 --> 0:37:47.880
<v Speaker 1>course we're you know, we're a relatively small group of

0:37:47.880 --> 0:37:50.759
<v Speaker 1>people with a lot of ideas and uh, and so

0:37:51.120 --> 0:37:55.240
<v Speaker 1>we have to prioritize so you know, in the event

0:37:55.360 --> 0:37:58.200
<v Speaker 1>we end up pursuing data that you know makes a

0:37:58.239 --> 0:38:00.520
<v Speaker 1>lot of sense, you know, we don't we don't try.

0:38:00.920 --> 0:38:03.080
<v Speaker 2>I mean, can you go as far as politics or

0:38:03.120 --> 0:38:06.360
<v Speaker 2>the weather, like how far off of prices can you

0:38:06.560 --> 0:38:07.160
<v Speaker 2>can you look?

0:38:07.239 --> 0:38:10.120
<v Speaker 1>So, you know, an example would be the weather. You're

0:38:09.920 --> 0:38:12.800
<v Speaker 1>for most securities, you're not going to be very interested

0:38:12.880 --> 0:38:15.200
<v Speaker 1>in the weather, but for commodities future as you might be,

0:38:15.320 --> 0:38:17.080
<v Speaker 1>so that you know, that's the kind of reasoning you

0:38:17.080 --> 0:38:17.560
<v Speaker 1>would apply.

0:38:18.120 --> 0:38:22.799
<v Speaker 2>Right, really really interesting. So let's talk about some of

0:38:22.840 --> 0:38:26.960
<v Speaker 2>the strategies. You guys are running short and mid horizon

0:38:27.120 --> 0:38:32.600
<v Speaker 2>US equities, European equities, Asian equities, mid horizon US credit,

0:38:33.040 --> 0:38:36.680
<v Speaker 2>and then cross assets. So I might to assume all

0:38:36.719 --> 0:38:40.200
<v Speaker 2>of these are machine learning based, and how similar different

0:38:40.960 --> 0:38:43.480
<v Speaker 2>is each approach to each of those asset classes.

0:38:43.920 --> 0:38:50.040
<v Speaker 1>Yeah, they're all machine learning based. The kind of principles

0:38:50.080 --> 0:38:53.360
<v Speaker 1>that I've described of using as much complexity as you

0:38:53.440 --> 0:38:57.800
<v Speaker 1>need to maximize predictive accuracy, et cetera. Those principles underlie

0:38:57.880 --> 0:39:00.840
<v Speaker 1>all the systems. But of course it's trading. Trading corporate

0:39:00.840 --> 0:39:03.840
<v Speaker 1>bonds is very different from trading equities, and so the

0:39:04.200 --> 0:39:06.040
<v Speaker 1>implementations reflect that reality.

0:39:06.760 --> 0:39:09.879
<v Speaker 2>Huh. So let's talk a little bit about the four

0:39:09.960 --> 0:39:15.000
<v Speaker 2>step process that you bring to the systematic approach, and

0:39:15.040 --> 0:39:19.240
<v Speaker 2>this is off of your site, so it's it's data prediction, engine,

0:39:19.640 --> 0:39:26.400
<v Speaker 2>portfolio construction, and execution. Yeah, I'm assuming that is heavily

0:39:26.560 --> 0:39:30.359
<v Speaker 2>computer and machine learning based. At each step along the way.

0:39:30.440 --> 0:39:31.359
<v Speaker 2>Is that is that fair?

0:39:32.480 --> 0:39:34.880
<v Speaker 1>I think that's fair. I mean to different degrees. The

0:39:35.360 --> 0:39:41.719
<v Speaker 1>data gathering that's you know, that's a that's largely a

0:39:41.800 --> 0:39:46.080
<v Speaker 1>software and kind of operations and infrastructure job.

0:39:46.280 --> 0:39:48.280
<v Speaker 2>Do you guys have to spend a lot of time

0:39:48.400 --> 0:39:51.759
<v Speaker 2>cleaning up that data and making sure that because you

0:39:52.120 --> 0:39:56.320
<v Speaker 2>hear between CRISP and s and P and Bloomberg, sometimes

0:39:56.400 --> 0:39:58.719
<v Speaker 2>you'll pull something up and they're just all off a

0:39:58.719 --> 0:40:00.759
<v Speaker 2>little bit from each other because they all bring a

0:40:00.840 --> 0:40:04.040
<v Speaker 2>very different approach to data assembly. How do you make

0:40:04.080 --> 0:40:07.960
<v Speaker 2>sure everything is consistent and there's no errors or errants

0:40:09.040 --> 0:40:09.960
<v Speaker 2>inputs throughout.

0:40:10.239 --> 0:40:14.000
<v Speaker 1>Yeah, through a lot of effort. Essentially, there there we have.

0:40:15.040 --> 0:40:17.799
<v Speaker 1>You know, we have an entire group of people who

0:40:17.840 --> 0:40:23.800
<v Speaker 1>focus on data operations, both for gathering a historical data

0:40:23.880 --> 0:40:27.080
<v Speaker 1>and for the management of the ongoing live data feeds.

0:40:27.360 --> 0:40:29.319
<v Speaker 1>There's no way around that. I mean, that's just work

0:40:29.360 --> 0:40:31.080
<v Speaker 1>that you have to that you have to do.

0:40:31.200 --> 0:40:33.080
<v Speaker 2>You just have to brute force your way through that.

0:40:33.520 --> 0:40:36.640
<v Speaker 2>And then the prediction engine. Sounds like that's the single

0:40:36.719 --> 0:40:41.600
<v Speaker 2>most important part of the machine learning process if I'm

0:40:42.040 --> 0:40:45.880
<v Speaker 2>understanding you correctly, that that's where all the meat of

0:40:45.960 --> 0:40:47.080
<v Speaker 2>the technology is.

0:40:47.280 --> 0:40:50.680
<v Speaker 1>Yeah, I understand the sentiment. I mean, it's worth emphasizing

0:40:50.719 --> 0:40:54.200
<v Speaker 1>that you do not get to a successful systematic strategy

0:40:54.200 --> 0:40:57.200
<v Speaker 1>without all the ingredients. You have to have clean data

0:40:57.680 --> 0:41:01.920
<v Speaker 1>because of the garbage in garbage out. You have to

0:41:01.920 --> 0:41:06.880
<v Speaker 1>have accurate predictions. But you know, predictions don't automatically translate

0:41:06.920 --> 0:41:10.400
<v Speaker 1>into returns for investors. Those predictions are kind of the

0:41:10.480 --> 0:41:15.440
<v Speaker 1>power that drives the portfolio holding part of the system.

0:41:15.480 --> 0:41:18.520
<v Speaker 2>So let's talk about that portfolio construction. Given that you

0:41:18.640 --> 0:41:22.920
<v Speaker 2>have a prediction engine that and good data going into it,

0:41:23.200 --> 0:41:26.560
<v Speaker 2>so you're fairly confident as to the output. How do

0:41:26.640 --> 0:41:28.920
<v Speaker 2>you then take that output and say, here's how I'm

0:41:28.920 --> 0:41:32.400
<v Speaker 2>going to build a portfolio based on what this generates.

0:41:32.520 --> 0:41:38.920
<v Speaker 1>Yeah, so there are three big ingredients in the portfolio construction.

0:41:39.160 --> 0:41:43.440
<v Speaker 1>The predictions what is usually called a risk model in

0:41:43.760 --> 0:41:50.560
<v Speaker 1>this business, which means some understanding of how volatile prices

0:41:50.600 --> 0:41:54.000
<v Speaker 1>are across all the securities you're trading, how correlated they are,

0:41:54.840 --> 0:41:57.200
<v Speaker 1>how you know if they have a if they have

0:41:57.239 --> 0:42:00.719
<v Speaker 1>a big movement, how big that movement will be. That's

0:42:00.760 --> 0:42:03.880
<v Speaker 1>all the risk model. And then the final ingredient is

0:42:04.840 --> 0:42:08.240
<v Speaker 1>what's usually called a market impact model, and that means

0:42:09.640 --> 0:42:13.279
<v Speaker 1>an understanding of how much you are going to push

0:42:13.320 --> 0:42:15.759
<v Speaker 1>prices away from you when you try to trade. This

0:42:15.800 --> 0:42:18.799
<v Speaker 1>is a reality of all trading. You buy a lot

0:42:18.800 --> 0:42:21.239
<v Speaker 1>of a security, you push the price up, you push

0:42:21.280 --> 0:42:24.239
<v Speaker 1>it away from you in the unfavorable direction. And in

0:42:24.280 --> 0:42:27.959
<v Speaker 1>the systems that we run, the predictions that we're trying

0:42:27.960 --> 0:42:31.880
<v Speaker 1>to capture are about the same size as the effect

0:42:31.920 --> 0:42:34.279
<v Speaker 1>that we have on the markets when we trade, and

0:42:34.360 --> 0:42:38.439
<v Speaker 1>so you cannot neglect that impact effect when you're thinking

0:42:38.480 --> 0:42:41.000
<v Speaker 1>about what portfolios to hold.

0:42:41.040 --> 0:42:44.760
<v Speaker 2>So execution becomes really important. If you're not executing well,

0:42:44.920 --> 0:42:48.560
<v Speaker 2>you are moving prices away from your profit.

0:42:48.880 --> 0:42:53.400
<v Speaker 1>That's right, and it is you know, probably the single

0:42:53.520 --> 0:42:59.319
<v Speaker 1>thing that undoes quantitative hedge funds most often is that

0:43:00.040 --> 0:43:04.560
<v Speaker 1>they misunderstand how much they're moving prices. They get too big,

0:43:04.600 --> 0:43:08.040
<v Speaker 1>they start trading too much, and they sort of blowed

0:43:08.080 --> 0:43:08.600
<v Speaker 1>themselves up.

0:43:08.800 --> 0:43:11.120
<v Speaker 2>It's funny that you say that, because as you were

0:43:11.160 --> 0:43:14.239
<v Speaker 2>describing that, the first name that popped into my head

0:43:14.680 --> 0:43:19.400
<v Speaker 2>was long term capital managements trading these really thinly traded

0:43:20.239 --> 0:43:27.080
<v Speaker 2>obscure fixed income products, and everything they bought they sent

0:43:27.239 --> 0:43:30.359
<v Speaker 2>higher because there just wasn't any volume in it. And

0:43:30.400 --> 0:43:33.359
<v Speaker 2>when they needed liquiditly there was none to be had.

0:43:33.440 --> 0:43:36.800
<v Speaker 2>And you know that plus no risk management one hundred

0:43:36.960 --> 0:43:39.440
<v Speaker 2>x leverage equals a kaboom.

0:43:39.840 --> 0:43:43.120
<v Speaker 1>They made a number of mistakes the book. The book

0:43:43.160 --> 0:43:45.279
<v Speaker 1>is good. So when genius fail in, oh absolutely love

0:43:45.280 --> 0:43:46.880
<v Speaker 1>that fantastically fascinating.

0:43:46.960 --> 0:43:51.239
<v Speaker 2>So when you're reading a book like that, somewhere in

0:43:51.280 --> 0:43:53.279
<v Speaker 2>the back of your head are you thinking, hey, this

0:43:53.440 --> 0:43:56.280
<v Speaker 2>is like a what not to do when you're setting

0:43:56.360 --> 0:44:00.000
<v Speaker 2>up a machine learning fund. How influential is something like.

0:44:00.040 --> 0:44:03.480
<v Speaker 1>Well one hundred percent? I mean, look, I think the

0:44:03.520 --> 0:44:05.959
<v Speaker 1>most important adage I've ever heard in my professional life

0:44:06.000 --> 0:44:10.160
<v Speaker 1>is good judgment comes from experience. Experience comes from bad judgment.

0:44:10.600 --> 0:44:13.800
<v Speaker 1>So the extent to which you can get good judgment

0:44:14.120 --> 0:44:17.440
<v Speaker 1>from other people's experience, that is that that is like

0:44:17.480 --> 0:44:22.400
<v Speaker 1>a free tuition. And so we talk a lot about

0:44:22.560 --> 0:44:25.640
<v Speaker 1>all the all the mistakes that that that other people

0:44:25.680 --> 0:44:29.960
<v Speaker 1>have made. And you know, we do not congratulate ourselves

0:44:29.960 --> 0:44:33.600
<v Speaker 1>on having avoided mistakes. We think those people were smart.

0:44:33.680 --> 0:44:36.040
<v Speaker 1>I mean look that you know, you read about these

0:44:36.040 --> 0:44:38.040
<v Speaker 1>events and these people. None of these people were dummies.

0:44:38.080 --> 0:44:40.000
<v Speaker 1>They were sophisticated Nobel laureates.

0:44:40.080 --> 0:44:43.040
<v Speaker 2>Yeah, right, it's they just didn't have a guide book

0:44:43.080 --> 0:44:45.560
<v Speaker 2>on what not to do, which which you guys.

0:44:45.280 --> 0:44:47.880
<v Speaker 1>Do We don't. No, I don't think we do. I

0:44:47.920 --> 0:44:50.200
<v Speaker 1>mean apart from that, apart from reading about right, But

0:44:50.480 --> 0:44:53.160
<v Speaker 1>everybody is undone by a failure that they they didn't

0:44:53.280 --> 0:44:55.240
<v Speaker 1>they did, they didn't think of ever didn't know about yet.

0:44:55.280 --> 0:44:57.200
<v Speaker 1>And we're extremely cognizant of that.

0:44:57.560 --> 0:45:00.680
<v Speaker 2>Huh. That has to be somewhat humbling to come being

0:45:01.000 --> 0:45:06.720
<v Speaker 2>on the lookout for that blind spot that could disrupt everything.

0:45:06.920 --> 0:45:11.480
<v Speaker 1>Yes, yeah, humility is the key ingredient in running in

0:45:11.560 --> 0:45:13.040
<v Speaker 1>running these systems.

0:45:13.400 --> 0:45:18.000
<v Speaker 2>Really quite amazing. So let's talk a little bit about

0:45:18.360 --> 0:45:22.960
<v Speaker 2>how academically focused volling On is. You guys have a

0:45:23.040 --> 0:45:28.000
<v Speaker 2>pretty deep R and D team internally, you teach at Berkeley.

0:45:28.200 --> 0:45:30.560
<v Speaker 2>What does it mean for a Hedge fund to be

0:45:30.680 --> 0:45:32.000
<v Speaker 2>academically focused?

0:45:32.480 --> 0:45:36.120
<v Speaker 1>What I would say probably is kind of evidence based

0:45:36.280 --> 0:45:40.640
<v Speaker 1>rather than academically focused. Saying academically focused gives the impression

0:45:40.760 --> 0:45:43.600
<v Speaker 1>that kind of papers would be the goal or the

0:45:43.960 --> 0:45:46.080
<v Speaker 1>desired output, and that's not the case at all. We have,

0:45:46.520 --> 0:45:49.319
<v Speaker 1>you know, a very specific applied problem that we are

0:45:49.440 --> 0:45:50.200
<v Speaker 1>trying to solve.

0:45:50.480 --> 0:45:51.799
<v Speaker 2>Papers are a mean to an end.

0:45:52.000 --> 0:45:56.320
<v Speaker 1>Papers are you know, we don't write papers for external consumption.

0:45:56.360 --> 0:45:59.600
<v Speaker 1>We do lots of writing internally, and that's to make

0:45:59.640 --> 0:46:02.359
<v Speaker 1>sure that that you know, we're keeping track of our

0:46:02.400 --> 0:46:04.840
<v Speaker 1>own kind of scientific process.

0:46:05.000 --> 0:46:08.800
<v Speaker 2>But you're fairly widely published in statistics and machine learning. Yes,

0:46:08.840 --> 0:46:13.280
<v Speaker 2>what purpose does that serve other than a calling card

0:46:13.440 --> 0:46:16.440
<v Speaker 2>for the fund as well as Hey, I have this

0:46:16.560 --> 0:46:18.360
<v Speaker 2>idea and I want to see what the rest of

0:46:18.600 --> 0:46:21.640
<v Speaker 2>my peers think of it. When when you put stuff

0:46:21.680 --> 0:46:24.839
<v Speaker 2>out into the world, what sort of feedback or pushback

0:46:25.239 --> 0:46:26.120
<v Speaker 2>do you get?

0:46:27.480 --> 0:46:29.319
<v Speaker 1>I guess I would have to say, I really I

0:46:29.400 --> 0:46:32.279
<v Speaker 1>do that as kind of a double life of non

0:46:32.320 --> 0:46:37.880
<v Speaker 1>financial research. So it's just something that I really enjoy. Principally,

0:46:37.880 --> 0:46:39.359
<v Speaker 1>what it means is that I get to work with

0:46:39.719 --> 0:46:44.760
<v Speaker 1>PhD students, and you know, we have really outstanding PhD

0:46:44.800 --> 0:46:50.600
<v Speaker 1>students at Berkeley in statistics, and so it's an opportunity

0:46:50.640 --> 0:46:58.640
<v Speaker 1>for me to do a kind of intellectual work that namely,

0:46:58.880 --> 0:47:01.080
<v Speaker 1>you know, writing a paper laying out an argument for

0:47:01.120 --> 0:47:05.040
<v Speaker 1>public consumption, et cetera that is kind of closed off

0:47:05.120 --> 0:47:05.960
<v Speaker 1>as far as so.

0:47:05.960 --> 0:47:10.080
<v Speaker 2>Not adjacent to what you guys are doing at Volleyon generally. No, No,

0:47:10.560 --> 0:47:14.440
<v Speaker 2>that's really interesting. So then I always assume that that

0:47:14.600 --> 0:47:17.799
<v Speaker 2>was part of your process for developing new models to

0:47:17.840 --> 0:47:22.120
<v Speaker 2>apply machine learning to new assets. Take us through the process.

0:47:22.160 --> 0:47:25.400
<v Speaker 2>How do you go about saying, Hey, this is an

0:47:25.440 --> 0:47:28.080
<v Speaker 2>asset class we don't have exposure to. Let's see how

0:47:28.080 --> 0:47:32.280
<v Speaker 2>to apply what we already know to that specific area.

0:47:32.560 --> 0:47:35.399
<v Speaker 1>Yeah, we have it's a great question. So we're trying

0:47:35.400 --> 0:47:39.680
<v Speaker 1>as much as possible to get the problem for a

0:47:39.719 --> 0:47:43.160
<v Speaker 1>new asset class into a familiar setup, into you know,

0:47:43.719 --> 0:47:47.839
<v Speaker 1>as standard a setup as we can, and so we

0:47:47.920 --> 0:47:52.200
<v Speaker 1>know what these systems look like in the world of equity.

0:47:52.320 --> 0:47:55.359
<v Speaker 1>And so if you're trying to do the same kind,

0:47:55.400 --> 0:47:57.440
<v Speaker 1>if you're trying to build the same kind of system

0:47:57.520 --> 0:47:59.920
<v Speaker 1>for corporate bonds, and you start off by saying, well, okay,

0:48:00.239 --> 0:48:02.400
<v Speaker 1>i'd like I need to know, you know, closing prices

0:48:02.480 --> 0:48:05.279
<v Speaker 1>or inter day prices for all the bonds. Already, you

0:48:05.320 --> 0:48:09.239
<v Speaker 1>have a very big problem in corporate bonds because there

0:48:09.320 --> 0:48:15.440
<v Speaker 1>is no there is no live price feeds that's showing

0:48:15.480 --> 0:48:18.560
<v Speaker 1>you a bit offer quote in the way that there

0:48:18.640 --> 0:48:21.920
<v Speaker 1>is inequity. And so before you can even get started

0:48:21.960 --> 0:48:24.680
<v Speaker 1>thinking about predicting how a price is going to change,

0:48:24.680 --> 0:48:26.080
<v Speaker 1>it would be nice if you know what the price

0:48:26.160 --> 0:48:28.759
<v Speaker 1>currently was, and that is already a problem you have

0:48:28.800 --> 0:48:30.719
<v Speaker 1>to solve in corporate bonds as opposed to being just

0:48:30.760 --> 0:48:32.000
<v Speaker 1>an input that you have access to.

0:48:32.320 --> 0:48:35.279
<v Speaker 2>The old joke was trading by appointment only. Yeah, and

0:48:35.320 --> 0:48:37.520
<v Speaker 2>that seems to be a bit of an issue. And

0:48:37.560 --> 0:48:42.080
<v Speaker 2>there are so many more bond issues than there are equities. Absolutely,

0:48:42.239 --> 0:48:45.480
<v Speaker 2>is this just a database challenge or how do you work?

0:48:45.520 --> 0:48:49.280
<v Speaker 1>No, it's a statistics problem, but it's it's a different

0:48:49.360 --> 0:48:52.319
<v Speaker 1>kind of statistics problem. We're not in this case. We're

0:48:52.360 --> 0:48:54.920
<v Speaker 1>not trying to yet. We're not yet trying to predict

0:48:55.080 --> 0:48:58.520
<v Speaker 1>the future of any quantity. We're trying to say, I

0:48:58.560 --> 0:49:00.839
<v Speaker 1>wish I knew what the fair value of this of

0:49:00.880 --> 0:49:05.319
<v Speaker 1>this CSIP was. I can't see that exactly because there's

0:49:05.320 --> 0:49:07.239
<v Speaker 1>no live order book that with a bid and an

0:49:07.280 --> 0:49:09.680
<v Speaker 1>offer that's got lots of liquidity that lets me figure

0:49:09.680 --> 0:49:11.320
<v Speaker 1>out the fair value. But I do know what.

0:49:11.520 --> 0:49:14.120
<v Speaker 2>At best, you have a recent price, maybe not even

0:49:14.160 --> 0:49:14.680
<v Speaker 2>so recent.

0:49:14.840 --> 0:49:17.839
<v Speaker 1>I have lots of related information. I know you know

0:49:18.480 --> 0:49:21.279
<v Speaker 1>this bond. Maybe this bond didn't trade today, but it

0:49:21.320 --> 0:49:23.279
<v Speaker 1>traded a few times yesterday. I get to say I

0:49:23.400 --> 0:49:26.440
<v Speaker 1>know where it traded. I'm in touch with bond dealers,

0:49:26.440 --> 0:49:28.959
<v Speaker 1>so I know where they've quoted this bond, maybe only

0:49:28.960 --> 0:49:31.440
<v Speaker 1>on one side over the last few days. I have

0:49:31.560 --> 0:49:34.800
<v Speaker 1>some information about the company that issued this bond, et cetera.

0:49:35.280 --> 0:49:37.719
<v Speaker 1>So I have lots of stuff that's related to the

0:49:37.800 --> 0:49:39.320
<v Speaker 1>number I know that I want to know. I just

0:49:39.360 --> 0:49:42.319
<v Speaker 1>don't know that number, right, And so what I want

0:49:42.360 --> 0:49:44.440
<v Speaker 1>to try to do is kind of fill in and

0:49:44.600 --> 0:49:46.919
<v Speaker 1>do what's what in statistics or in control we would

0:49:46.920 --> 0:49:50.440
<v Speaker 1>call a now casting problem, huh, And it's an analogy

0:49:50.480 --> 0:49:55.600
<v Speaker 1>actually is too automatically controlling an airplane. So surprisingly, Oh,

0:49:55.840 --> 0:49:57.719
<v Speaker 1>the main there there are there are when you're if

0:49:57.719 --> 0:49:59.800
<v Speaker 1>you're trying if a software is trying to fly in

0:49:59.800 --> 0:50:03.080
<v Speaker 1>air plane, there are six things that it absolutely has

0:50:03.120 --> 0:50:05.160
<v Speaker 1>to know. Has to know the x y z of

0:50:05.160 --> 0:50:07.400
<v Speaker 1>where the plane is and the x y z of

0:50:07.440 --> 0:50:10.200
<v Speaker 1>its velocity where it's headed. Right, those are the six

0:50:10.280 --> 0:50:14.120
<v Speaker 1>most important numbers. Now, nature does not just supply those

0:50:14.200 --> 0:50:17.440
<v Speaker 1>numbers to you. You cannot know those numbers with perfect exactitude.

0:50:17.600 --> 0:50:20.200
<v Speaker 1>But there's lots of instruments on the plane, and there's

0:50:20.239 --> 0:50:23.800
<v Speaker 1>GPS and all sorts of information that is very closely

0:50:23.840 --> 0:50:26.040
<v Speaker 1>related to the numbers You wish you knew, and you

0:50:26.080 --> 0:50:28.919
<v Speaker 1>can use statistics to go from all that stuff that's

0:50:29.040 --> 0:50:32.719
<v Speaker 1>adjacent to a guess and infill of the thing you

0:50:32.800 --> 0:50:35.120
<v Speaker 1>wish you knew, And the same goes with the current

0:50:35.160 --> 0:50:36.560
<v Speaker 1>price of a corporate bond.

0:50:37.160 --> 0:50:41.520
<v Speaker 2>Huh. That's really kind of interesting. So I'm curious as

0:50:41.600 --> 0:50:45.399
<v Speaker 2>to how often you start working your way into one

0:50:45.480 --> 0:50:50.640
<v Speaker 2>particular asset or a particular strategy for that asset and

0:50:50.760 --> 0:50:54.040
<v Speaker 2>just suddenly realize, oh, this is wildly different than we

0:50:54.160 --> 0:50:57.960
<v Speaker 2>previously expected, and suddenly you're down a rabbit hole to

0:50:58.160 --> 0:51:02.200
<v Speaker 2>just wildly unexpected areas. It sounds like that isn't all

0:51:02.239 --> 0:51:02.920
<v Speaker 2>then uncommon.

0:51:02.960 --> 0:51:03.960
<v Speaker 1>It is not uncommon at all.

0:51:04.160 --> 0:51:04.399
<v Speaker 2>Huh.

0:51:04.400 --> 0:51:07.480
<v Speaker 1>No, it's a nice you know, there's this kind of

0:51:07.560 --> 0:51:09.440
<v Speaker 1>wishful thinking that all we have. You know, we figured

0:51:09.440 --> 0:51:11.840
<v Speaker 1>it out in one asset class in the sense that

0:51:11.840 --> 0:51:14.200
<v Speaker 1>we have a system that's kind of stable and performing

0:51:14.200 --> 0:51:16.600
<v Speaker 1>reasonably well that we that we have a feel for,

0:51:17.160 --> 0:51:20.280
<v Speaker 1>and now we want to take that system and somehow

0:51:20.320 --> 0:51:23.920
<v Speaker 1>replicate it in a different situation. And while we're going

0:51:23.960 --> 0:51:26.799
<v Speaker 1>to standardize the new situation to make it look like

0:51:26.800 --> 0:51:29.080
<v Speaker 1>the old situation. That's the principle. That principle kind of

0:51:29.120 --> 0:51:31.480
<v Speaker 1>quickly goes out the window when you when you start

0:51:31.520 --> 0:51:33.400
<v Speaker 1>to make contact with the reality of how the new

0:51:33.440 --> 0:51:34.800
<v Speaker 1>asset class actually behaves.

0:51:34.880 --> 0:51:37.360
<v Speaker 2>So stocks are different than credit, are different than bonds,

0:51:37.440 --> 0:51:40.360
<v Speaker 2>or different than commodities. They're all like starting fresh. Yeah,

0:51:40.400 --> 0:51:42.919
<v Speaker 2>over what some of the more surprising things you've learned

0:51:42.960 --> 0:51:46.880
<v Speaker 2>as you've applied machine learning to totally different asset classes.

0:51:47.040 --> 0:51:49.719
<v Speaker 1>Well, I think, you know, corporate bonds provide a lot

0:51:49.719 --> 0:51:52.480
<v Speaker 1>of examples of this. I mean, the fact that you

0:51:52.520 --> 0:51:57.279
<v Speaker 1>don't actually really know a good live price or a

0:51:57.320 --> 0:52:00.480
<v Speaker 1>good live bid offers it seems seems you know, it's surprising.

0:52:00.520 --> 0:52:03.520
<v Speaker 1>I mean, this is this fact has started to change.

0:52:03.560 --> 0:52:07.279
<v Speaker 1>Like over the years, there's been an accelerating electronification of

0:52:07.320 --> 0:52:09.880
<v Speaker 1>corporate bond treading, and that's you know, that's that's been

0:52:09.880 --> 0:52:11.839
<v Speaker 1>a big advantage for us actually because we were kind

0:52:11.880 --> 0:52:14.000
<v Speaker 1>of first movers and so we've really benefited from that.

0:52:14.440 --> 0:52:17.360
<v Speaker 1>So the problem is diminished relative to how it was,

0:52:17.719 --> 0:52:20.160
<v Speaker 1>you know, six seven years ago when we started, but

0:52:20.200 --> 0:52:23.440
<v Speaker 1>it's still relative equities, it's absolutely there.

0:52:23.520 --> 0:52:25.839
<v Speaker 2>Yeah, So you get so when in other words, if

0:52:25.840 --> 0:52:28.279
<v Speaker 2>I'm looking at a bond mutual fund or even a

0:52:28.280 --> 0:52:33.400
<v Speaker 2>bondytf that's trading during the day. That price is somebody's

0:52:33.440 --> 0:52:37.560
<v Speaker 2>best approximation of the value of all the bonds inside.

0:52:37.880 --> 0:52:41.439
<v Speaker 2>But really you don't know the nav, do you.

0:52:41.440 --> 0:52:43.600
<v Speaker 1>You're just kind of guessing, Barry, don't even get me

0:52:43.640 --> 0:52:46.160
<v Speaker 1>started on bonditfs real because.

0:52:45.960 --> 0:52:48.160
<v Speaker 2>That it seems like that would be the first place

0:52:48.200 --> 0:52:52.120
<v Speaker 2>that would show up. Hey, bondytf's sound like throughout the

0:52:52.200 --> 0:52:56.759
<v Speaker 2>day they're gonna be mispriced a little bit or wildly mispriced.

0:52:57.080 --> 0:53:00.319
<v Speaker 1>Well, the bond ETF there's a sense if you're a

0:53:00.520 --> 0:53:02.360
<v Speaker 1>if you're a market purist, in which they can't be

0:53:02.440 --> 0:53:05.280
<v Speaker 1>mispriced because there's their price is set by supplying demand

0:53:05.560 --> 0:53:08.560
<v Speaker 1>in the ETF market, and that's a super liquid market,

0:53:08.840 --> 0:53:11.720
<v Speaker 1>and so there may be a difference between the market

0:53:11.719 --> 0:53:14.120
<v Speaker 1>price of the ETF and the under the nave of

0:53:14.120 --> 0:53:18.520
<v Speaker 1>the underlying portfolio, except in many cases with bond ETF

0:53:18.560 --> 0:53:23.120
<v Speaker 1>there's not even a crisply defined underlying portfolio. It turns

0:53:23.120 --> 0:53:26.520
<v Speaker 1>out that the authorized participants in those ETF markets can

0:53:27.120 --> 0:53:32.279
<v Speaker 1>negotiate with the fund manager about exactly what the constituents

0:53:32.320 --> 0:53:35.160
<v Speaker 1>are of the create redeem baskets, and so it's not

0:53:35.200 --> 0:53:37.920
<v Speaker 1>even at all clear what you mean when you say

0:53:38.080 --> 0:53:40.239
<v Speaker 1>that the nav is this or that relative to the

0:53:40.320 --> 0:53:41.200
<v Speaker 1>price of the ETF.

0:53:41.520 --> 0:53:44.040
<v Speaker 2>So when I asked about what's surprising when you work

0:53:44.080 --> 0:53:46.000
<v Speaker 2>you in on a rabbit hole. Hey, we don't know

0:53:46.000 --> 0:53:48.120
<v Speaker 2>what the hell's in this bond ETF. Trust us, it's

0:53:48.160 --> 0:53:51.640
<v Speaker 2>all good. That's a pretty surprise. And I'm only exaggerating

0:53:51.640 --> 0:53:54.520
<v Speaker 2>a little bit, But that seems like that's kind of shocking.

0:53:55.160 --> 0:53:57.920
<v Speaker 1>It's it is surprising when you find out about it,

0:53:57.960 --> 0:54:00.919
<v Speaker 1>but you quickly come to understand. If you trade single

0:54:00.960 --> 0:54:03.160
<v Speaker 1>name bonds, as we do, you quickly come to understand

0:54:03.520 --> 0:54:05.719
<v Speaker 1>why bond ETFs work that way.

0:54:06.560 --> 0:54:08.680
<v Speaker 2>I recall a couple of years ago there was a

0:54:08.719 --> 0:54:12.480
<v Speaker 2>big Wall Street Journal article on the g l d

0:54:13.280 --> 0:54:17.279
<v Speaker 2>E t F, and from that article I learned that

0:54:18.040 --> 0:54:22.280
<v Speaker 2>GLD was formed because gold dealers had just excess gold

0:54:22.400 --> 0:54:25.120
<v Speaker 2>piling up in their warehouses and they needed a way

0:54:25.560 --> 0:54:27.920
<v Speaker 2>to move it. So that was kind of shocking about

0:54:27.960 --> 0:54:32.279
<v Speaker 2>that ETF any other space that that led to a

0:54:33.360 --> 0:54:35.719
<v Speaker 2>sort of big surprise as you worked your way into it.

0:54:37.160 --> 0:54:41.200
<v Speaker 1>Well, I think ETFs are a kind of a good

0:54:41.239 --> 0:54:45.239
<v Speaker 1>source of these examples. So the volatility ETFs, the you know,

0:54:45.280 --> 0:54:47.560
<v Speaker 1>the ETFs that are that are based on the VIX

0:54:47.640 --> 0:54:50.360
<v Speaker 1>or that are short the vics. You may remember several years.

0:54:50.160 --> 0:54:52.120
<v Speaker 2>Ago I was gonna say the ones that haven't blown up.

0:54:52.200 --> 0:54:55.600
<v Speaker 1>Yeah right, there was this event called Valmageddon where.

0:54:56.239 --> 0:54:58.640
<v Speaker 2>That was ETF notes, wasn't it the Yeah.

0:54:59.680 --> 0:55:03.160
<v Speaker 1>Right, there are these essentially these investment products that were

0:55:03.280 --> 0:55:07.040
<v Speaker 1>short VIX, and VIX went through a spike that caused

0:55:07.040 --> 0:55:09.160
<v Speaker 1>them to have to liquidate, which was part I mean,

0:55:09.239 --> 0:55:13.200
<v Speaker 1>the people who designed the sixteene traded note. They understood

0:55:13.239 --> 0:55:15.040
<v Speaker 1>that this was a possibility, so they had a sort

0:55:15.040 --> 0:55:18.600
<v Speaker 1>of uh descriptions in their in their contract for what

0:55:18.880 --> 0:55:23.680
<v Speaker 1>it would mean. But yeah, always surprising to watch something

0:55:24.040 --> 0:55:25.360
<v Speaker 1>suddenly go out of business.

0:55:25.600 --> 0:55:28.120
<v Speaker 2>We seem to get a thousand year flood every couple

0:55:28.120 --> 0:55:30.760
<v Speaker 2>of years. Maybe we shouldn't be calling these things thousand

0:55:30.800 --> 0:55:33.880
<v Speaker 2>year flood. That's right, that's a that's a big misnomer.

0:55:34.360 --> 0:55:36.879
<v Speaker 1>As statisticians, we tell people, you know, if you if

0:55:36.920 --> 0:55:39.960
<v Speaker 1>you think that you've experienced a six sigma event, the

0:55:40.000 --> 0:55:43.120
<v Speaker 1>problem is that you have underestimated sigma.

0:55:43.239 --> 0:55:46.759
<v Speaker 2>That that's really interesting. So so, given the gap in

0:55:46.840 --> 0:55:53.000
<v Speaker 2>the world between computer science and an investment management, how

0:55:53.080 --> 0:55:56.279
<v Speaker 2>long is it going to be before that narrows and

0:55:56.320 --> 0:55:58.560
<v Speaker 2>we start seeing a whole lot more of the sort

0:55:58.560 --> 0:56:02.239
<v Speaker 2>of work you're doing applied across the board to to

0:56:02.320 --> 0:56:03.440
<v Speaker 2>the world of investment.

0:56:04.520 --> 0:56:08.160
<v Speaker 1>Well, I think it's happening. It's been happening for for

0:56:08.239 --> 0:56:11.000
<v Speaker 1>quite a long time. I mean, for example, all of

0:56:11.440 --> 0:56:15.279
<v Speaker 1>modern portfolio theory. Really, it kind of began in the

0:56:15.320 --> 0:56:18.520
<v Speaker 1>fifties with you know, first of all, Markowitz and other

0:56:18.560 --> 0:56:21.600
<v Speaker 1>people thinking about, you know, what it means to benefit

0:56:21.640 --> 0:56:24.960
<v Speaker 1>from diversification, and the idea that you know, diversification is

0:56:24.960 --> 0:56:28.040
<v Speaker 1>the only free lunch in finance. So I would I

0:56:28.040 --> 0:56:32.880
<v Speaker 1>would say that, you know, the idea of thinking in

0:56:32.920 --> 0:56:38.120
<v Speaker 1>a in a systematic and scientific way about how to

0:56:38.120 --> 0:56:41.279
<v Speaker 1>to manage and grow wealth, not you know, not even

0:56:41.440 --> 0:56:45.359
<v Speaker 1>just for institutions, but also for individuals. Has is an

0:56:45.360 --> 0:56:48.920
<v Speaker 1>example of a way that these ideas have kind of

0:56:49.840 --> 0:56:51.880
<v Speaker 1>had profound effects.

0:56:52.120 --> 0:56:55.200
<v Speaker 2>I know, I only have you for a little while longer,

0:56:55.520 --> 0:56:58.319
<v Speaker 2>So let's jump to our favorite questions that we ask

0:56:59.040 --> 0:57:01.640
<v Speaker 2>all of our guests, starting with tell us what you're

0:57:01.640 --> 0:57:04.400
<v Speaker 2>streaming these days? What are you either listening to or

0:57:04.480 --> 0:57:06.680
<v Speaker 2>watching to keep yourself entertained.

0:57:08.160 --> 0:57:11.480
<v Speaker 1>I A few things I've been watching recently. The Bear,

0:57:11.520 --> 0:57:13.960
<v Speaker 1>I don't know if you've heard So Great, So great, right,

0:57:14.239 --> 0:57:16.200
<v Speaker 1>and I'm in Chicago, as I know, we were just

0:57:16.360 --> 0:57:19.680
<v Speaker 1>from Yeah, so.

0:57:18.920 --> 0:57:21.200
<v Speaker 2>So and and there are parts of that show that

0:57:21.280 --> 0:57:23.760
<v Speaker 2>are kind of a love letter to absolutely as you

0:57:23.800 --> 0:57:26.640
<v Speaker 2>get deeper into the series, because it starts out kind

0:57:26.640 --> 0:57:29.600
<v Speaker 2>of gritty and you're seeing the underside, and then as

0:57:29.640 --> 0:57:33.600
<v Speaker 2>we progress, it really becomes like a lovely postcard. Such

0:57:33.640 --> 0:57:34.400
<v Speaker 2>an amazing show.

0:57:34.480 --> 0:57:37.760
<v Speaker 1>So really really love that show. Was I was late

0:57:37.800 --> 0:57:41.040
<v Speaker 1>to better call Saul that I'm finishing up. I think

0:57:41.080 --> 0:57:45.600
<v Speaker 1>as good as as Breaking Bad, So I maybe when

0:57:45.640 --> 0:57:48.240
<v Speaker 1>you haven't heard of there's a show called Mister in Between.

0:57:48.160 --> 0:57:50.320
<v Speaker 2>Which is mister Yeah.

0:57:50.320 --> 0:57:53.040
<v Speaker 1>It's not Hulu, it's from it's from Australia. It's about

0:57:53.040 --> 0:57:58.360
<v Speaker 1>a guy who's, you know, a doting father living his life.

0:57:58.360 --> 0:58:03.120
<v Speaker 1>He's also essentially a muscle man and hitman for for

0:58:04.040 --> 0:58:07.479
<v Speaker 1>local criminals in his part of Australia. But it's half

0:58:07.480 --> 0:58:09.200
<v Speaker 1>hour dark comedy.

0:58:09.160 --> 0:58:12.440
<v Speaker 2>Right, so not quite Barry and not quite Sopranos somewhere.

0:58:12.960 --> 0:58:14.080
<v Speaker 1>Yeah, that's exactly.

0:58:14.240 --> 0:58:19.360
<v Speaker 2>Yeah, sounds really interesting. Tell us about your early mentors

0:58:19.360 --> 0:58:21.160
<v Speaker 2>who helped shape your career.

0:58:21.880 --> 0:58:24.440
<v Speaker 1>Well, Berry, I'd been lucky to have a lot of

0:58:24.840 --> 0:58:28.880
<v Speaker 1>people who were you know, both really smart and talented

0:58:28.920 --> 0:58:32.000
<v Speaker 1>and willing to you know, take the time to help

0:58:32.040 --> 0:58:35.840
<v Speaker 1>me learn and understand things. So actually, my co founder,

0:58:36.000 --> 0:58:40.240
<v Speaker 1>Michael Caratanov, he was kind of my first mentor in finance.

0:58:40.320 --> 0:58:42.680
<v Speaker 1>He he had been a d SHAW for several years

0:58:43.400 --> 0:58:46.120
<v Speaker 1>when I got there, and he he really taught me

0:58:46.560 --> 0:58:49.280
<v Speaker 1>kind of the ins and outs of of market micro structure.

0:58:50.440 --> 0:58:53.120
<v Speaker 1>I worked with a couple of people who managed me

0:58:53.280 --> 0:58:56.480
<v Speaker 1>at d SHAW yo see Friedman and Kapeel Mature, who

0:58:56.480 --> 0:59:00.480
<v Speaker 1>have gone on to hugely successful careers in quantitative finance,

0:59:00.560 --> 0:59:03.360
<v Speaker 1>and they taught me a lot to when I did

0:59:03.360 --> 0:59:06.800
<v Speaker 1>my PhD. My advisor Mike Jordan, who's a kind of

0:59:06.840 --> 0:59:12.320
<v Speaker 1>world famous machine learning researcher. You know, I learned enormously

0:59:12.320 --> 0:59:18.000
<v Speaker 1>from him. And there's another professor of statistics who sadly

0:59:18.000 --> 0:59:21.920
<v Speaker 1>passed away about fifteen years ago named David Friedman. He

0:59:22.040 --> 0:59:26.120
<v Speaker 1>was really just an intellectual giant of the twentieth century

0:59:26.120 --> 0:59:30.040
<v Speaker 1>and probability and statistics. He was both, you know, one

0:59:30.080 --> 0:59:35.120
<v Speaker 1>of the most brilliant probabilists and also an applied statistician.

0:59:35.160 --> 0:59:38.200
<v Speaker 1>And this is this is like a pink diamond kind

0:59:38.240 --> 0:59:42.240
<v Speaker 1>of combination. It's that rare to find someone who has

0:59:42.320 --> 0:59:46.320
<v Speaker 1>that kind of technical capability but also understands the pragmatics

0:59:46.320 --> 0:59:48.560
<v Speaker 1>of actually doing data analysis. He spent a lot of

0:59:48.600 --> 0:59:53.360
<v Speaker 1>time as an expert witness. He was the lead statistical

0:59:53.360 --> 0:59:56.440
<v Speaker 1>consultant for the case on census adjustment that went to

0:59:56.520 --> 1:00:02.000
<v Speaker 1>the Supreme Court. In fact, he told me, uh, what

1:00:02.360 --> 1:00:05.280
<v Speaker 1>went that in the end? Uh, you know, the the

1:00:05.320 --> 1:00:08.760
<v Speaker 1>people against adjustment they won in a unanimous Supreme Court decision.

1:00:08.800 --> 1:00:11.120
<v Speaker 1>And David Freeman told me, he said, you know, all

1:00:11.160 --> 1:00:13.240
<v Speaker 1>that work and we only convinced nine people.

1:00:15.440 --> 1:00:17.840
<v Speaker 2>But not nine people that kind of matter, Yeah, exactly.

1:00:18.160 --> 1:00:21.280
<v Speaker 1>So it was just it was a real it was

1:00:21.360 --> 1:00:24.480
<v Speaker 1>kind of a once in a lifetime privilege to get

1:00:24.520 --> 1:00:28.520
<v Speaker 1>to spend time with someone of that intellectual caliber. And

1:00:28.600 --> 1:00:30.520
<v Speaker 1>there were others too. I mean, I've been I've been

1:00:30.600 --> 1:00:31.520
<v Speaker 1>very fortunate that.

1:00:31.600 --> 1:00:35.360
<v Speaker 2>That's quite a list to begin with. Let's talk about books.

1:00:35.360 --> 1:00:36.920
<v Speaker 2>What are some of your favorites and what are you

1:00:36.960 --> 1:00:37.760
<v Speaker 2>reading right now?

1:00:38.880 --> 1:00:40.880
<v Speaker 1>Uh? Well, I'm a I'm a big book reader, so

1:00:41.080 --> 1:00:42.240
<v Speaker 1>I had a long list.

1:00:42.480 --> 1:00:45.800
<v Speaker 2>But probably by the way, this is everybody's favorite section

1:00:46.400 --> 1:00:50.200
<v Speaker 2>of the podcast. People are always looking for good book recommendations,

1:00:50.320 --> 1:00:54.520
<v Speaker 2>and if they like what you said earlier, they're gonna

1:00:54.520 --> 1:00:56.720
<v Speaker 2>love love your book recommendations, so fire away.

1:00:57.120 --> 1:01:02.800
<v Speaker 1>So I'm a big fan of kind of modernist dystopian fiction.

1:01:03.280 --> 1:01:05.800
<v Speaker 1>So a couple of examples of that would be the

1:01:05.800 --> 1:01:10.600
<v Speaker 1>book Infinite Jest by David Foster Wallace, wind Up Bird

1:01:10.680 --> 1:01:14.000
<v Speaker 1>Chronicle by Hirouki Murakami. Those are two of my all

1:01:14.040 --> 1:01:17.919
<v Speaker 1>time favorite books. There's a I think much less well

1:01:17.960 --> 1:01:22.840
<v Speaker 1>known but beautiful novel. It's a kind of academic coming

1:01:22.880 --> 1:01:28.440
<v Speaker 1>of age novel called Stoner by John Williams. A really moving,

1:01:28.560 --> 1:01:33.000
<v Speaker 1>just a tremendous book. Sort of more dystopia would be

1:01:33.360 --> 1:01:38.240
<v Speaker 1>White Noise to Lilo and kind of the classics that

1:01:38.240 --> 1:01:41.040
<v Speaker 1>everybody knows nineteen eighty four and Brave New World. Those

1:01:41.040 --> 1:01:42.920
<v Speaker 1>are two more of my favorite.

1:01:42.600 --> 1:01:46.800
<v Speaker 2>Huh, it's funny when you mentioned The Bear. I'm in

1:01:46.840 --> 1:01:49.920
<v Speaker 2>the middle of reading a book that I would swear

1:01:50.000 --> 1:01:54.880
<v Speaker 2>the writers of The Bear leaned on called Unreasonable Hospitality

1:01:55.520 --> 1:01:59.640
<v Speaker 2>by somebody who worked for the Danny Meyer's Hospitality Group.

1:02:00.080 --> 1:02:03.520
<v Speaker 2>Eleven Madison Park in Ramsey Tavern and all these famous

1:02:03.840 --> 1:02:08.040
<v Speaker 2>New York haunts, and the scene in The Bear where

1:02:08.440 --> 1:02:12.240
<v Speaker 2>they overhear a couple say, oh, we visited Chicago when

1:02:12.240 --> 1:02:14.640
<v Speaker 2>you never had deep dish, So they send the guy

1:02:14.720 --> 1:02:18.160
<v Speaker 2>out to get deep dish. There's part of the book

1:02:18.720 --> 1:02:23.720
<v Speaker 2>where at eleven Medicine Park this people actually showed up

1:02:23.760 --> 1:02:25.760
<v Speaker 2>with suitcases. It was the last thing they would eat

1:02:25.800 --> 1:02:28.360
<v Speaker 2>doing before they heading to the airport. And they said, oh,

1:02:28.360 --> 1:02:30.360
<v Speaker 2>we ate all these great places in New York, but

1:02:30.400 --> 1:02:32.640
<v Speaker 2>we never had a New York hot dog. And what

1:02:32.640 --> 1:02:34.320
<v Speaker 2>do they do. They send them out to get someone

1:02:34.360 --> 1:02:36.600
<v Speaker 2>out to get a hot dog. They played it and

1:02:37.280 --> 1:02:39.920
<v Speaker 2>use all the condiments to make it very special, and

1:02:39.960 --> 1:02:42.840
<v Speaker 2>it looks like it was ripped right out of the Bear,

1:02:43.000 --> 1:02:46.960
<v Speaker 2>or vice versa. But if you're interested in just hey,

1:02:47.000 --> 1:02:51.840
<v Speaker 2>how can we disrupt the restaurant business and make it

1:02:51.920 --> 1:02:54.000
<v Speaker 2>not just about the celebrity chef in the kitchen but

1:02:54.400 --> 1:02:58.160
<v Speaker 2>the whole experience. Fascinating kind of nonfiction book?

1:02:58.240 --> 1:02:59.240
<v Speaker 1>That does sound really interesting.

1:02:59.400 --> 1:03:02.080
<v Speaker 2>Yeah, really, you mentioned the Bear and it just popped

1:03:02.120 --> 1:03:04.160
<v Speaker 2>into my head. Any of the books you want to

1:03:04.160 --> 1:03:06.080
<v Speaker 2>mention that's that's a good list to start with.

1:03:06.440 --> 1:03:10.240
<v Speaker 1>Yeah. My other kind of big interest is science fiction,

1:03:10.480 --> 1:03:16.920
<v Speaker 1>speculative fiction. Unsurprisingly right, Sorry, sorry, but so there are

1:03:16.960 --> 1:03:19.640
<v Speaker 1>some classics that I think everybody should read. Ursula LeGuin

1:03:19.960 --> 1:03:24.040
<v Speaker 1>loves just amazing. So The Dispossessed and The Left Hand

1:03:24.040 --> 1:03:25.680
<v Speaker 1>of Darkness, those are just two of the best books

1:03:25.680 --> 1:03:26.520
<v Speaker 1>I've ever read period.

1:03:26.560 --> 1:03:30.040
<v Speaker 2>Forget Left Handed Darkness stays with you for a long time.

1:03:30.120 --> 1:03:35.200
<v Speaker 1>Yeah right, yeah, really really amazing books. I'm rereading right now,

1:03:35.360 --> 1:03:41.960
<v Speaker 1>Cryptonomicon Neil Stevenson. And one other thing I try to

1:03:42.000 --> 1:03:45.520
<v Speaker 1>do is I have very big gaps in my reading.

1:03:45.560 --> 1:03:48.280
<v Speaker 1>For example, I've never read Updyke, so I started reading

1:03:48.320 --> 1:03:49.000
<v Speaker 1>The Rabbit.

1:03:49.000 --> 1:03:52.280
<v Speaker 2>Serious World of Corn. It's a garb and they're they're

1:03:52.440 --> 1:03:53.800
<v Speaker 2>very much of an era.

1:03:54.000 --> 1:03:54.880
<v Speaker 1>Yeah, that's right.

1:03:55.880 --> 1:03:57.720
<v Speaker 2>What else give us more? Uh?

1:03:58.080 --> 1:03:59.880
<v Speaker 1>Wow? Okay, let's see George so.

1:04:01.600 --> 1:04:01.840
<v Speaker 2>He.

1:04:02.320 --> 1:04:05.040
<v Speaker 1>Oh wow, I think I think you'd love him. So

1:04:05.400 --> 1:04:09.280
<v Speaker 1>He's his real strength is short fiction. He had He's

1:04:09.280 --> 1:04:12.960
<v Speaker 1>written great novels too, but tenth of December this is

1:04:13.000 --> 1:04:16.320
<v Speaker 1>his best collection of of fiction. And that this is

1:04:16.360 --> 1:04:23.280
<v Speaker 1>more kind of modern dystopian, kind of comic dystopian stuff.

1:04:23.600 --> 1:04:27.560
<v Speaker 2>You keep coming back to dystopia, yeasinating.

1:04:26.760 --> 1:04:30.680
<v Speaker 1>I find, you know, it's uh, it's very different from

1:04:30.720 --> 1:04:32.919
<v Speaker 1>my my day to day reality. So I think it's

1:04:32.960 --> 1:04:36.120
<v Speaker 1>a you know, it's a great change of pace for

1:04:36.200 --> 1:04:41.320
<v Speaker 1>me to be able to read this stuff. So, uh,

1:04:41.960 --> 1:04:45.360
<v Speaker 1>some some science writing, I can tell you. Probably the

1:04:45.400 --> 1:04:48.360
<v Speaker 1>best science book I ever read is The Selfish Gene

1:04:48.800 --> 1:04:54.280
<v Speaker 1>by Richard Dawkins, which kind of really you know, you

1:04:54.360 --> 1:04:57.480
<v Speaker 1>have a kind of intuitive understanding of genetics and natural

1:04:57.480 --> 1:05:01.560
<v Speaker 1>selection in Darwin, but the language that Dawkins uses really

1:05:01.560 --> 1:05:06.160
<v Speaker 1>makes you appreciate just how how much the genes are

1:05:06.160 --> 1:05:08.840
<v Speaker 1>in charge and how little we as the as the

1:05:09.520 --> 1:05:13.320
<v Speaker 1>you know he calls he calls organisms survival machines that

1:05:13.400 --> 1:05:16.720
<v Speaker 1>the genes have kind of built and and exist inside

1:05:16.760 --> 1:05:19.200
<v Speaker 1>in order to ensure their propagation. And his whole the

1:05:19.200 --> 1:05:21.560
<v Speaker 1>whole point of view in that book just gives you, Uh,

1:05:22.360 --> 1:05:25.280
<v Speaker 1>it's really eye opening, makes you think about natural selection

1:05:25.360 --> 1:05:28.040
<v Speaker 1>and evolution and genetics in a completely different way, even

1:05:28.040 --> 1:05:30.600
<v Speaker 1>though it's all based on the same kind of facts that.

1:05:30.600 --> 1:05:32.400
<v Speaker 2>You know, it's just framing.

1:05:32.480 --> 1:05:34.760
<v Speaker 1>It's the framing and the perspective that are really that

1:05:34.840 --> 1:05:37.120
<v Speaker 1>really kind of blow your mind. So it's a great

1:05:37.200 --> 1:05:38.200
<v Speaker 1>it's a great book to read.

1:05:39.440 --> 1:05:41.439
<v Speaker 2>Huh, that's a hell of a list. You've given people

1:05:41.480 --> 1:05:44.080
<v Speaker 2>a lot of things to start with, and now down

1:05:44.080 --> 1:05:47.160
<v Speaker 2>to our last two questions, HM, what advice would you

1:05:47.200 --> 1:05:50.560
<v Speaker 2>give to a recent college grad who is interested in

1:05:50.600 --> 1:05:54.880
<v Speaker 2>a career in either investment management or machine learning.

1:05:56.600 --> 1:05:59.560
<v Speaker 1>Yeah? So, I mean I work in a very specialized

1:05:59.720 --> 1:06:01.600
<v Speaker 1>sub domain of finance, So there are a lot of

1:06:01.600 --> 1:06:03.200
<v Speaker 1>people who are going to be interested in investment in

1:06:03.240 --> 1:06:06.600
<v Speaker 1>finance that I that I couldn't give any specific advice to.

1:06:06.880 --> 1:06:11.240
<v Speaker 1>I have kind of general advice that I think is

1:06:11.520 --> 1:06:15.600
<v Speaker 1>useful both for finance and even more broadly. This advice

1:06:15.680 --> 1:06:19.400
<v Speaker 1>is really kind of top of Maslow's pyramid advice. If

1:06:19.520 --> 1:06:22.240
<v Speaker 1>you know, if you're trying to kind of write your

1:06:22.280 --> 1:06:25.040
<v Speaker 1>novel and pay the rent while you get it done,

1:06:25.040 --> 1:06:27.600
<v Speaker 1>this is I can't really help you with that. But

1:06:29.120 --> 1:06:32.360
<v Speaker 1>you know, if what you care about is building this career,

1:06:32.520 --> 1:06:34.520
<v Speaker 1>then I would say number one piece of advice is

1:06:34.520 --> 1:06:37.080
<v Speaker 1>work with incredible people. Like far and away, much more

1:06:37.080 --> 1:06:40.360
<v Speaker 1>important than what the particular field is the details of

1:06:40.400 --> 1:06:42.880
<v Speaker 1>what you're working on, is the caliber of the people

1:06:42.880 --> 1:06:45.640
<v Speaker 1>that you do it with, both in terms of your

1:06:45.680 --> 1:06:51.120
<v Speaker 1>own satisfaction and how much you learn and and and

1:06:51.600 --> 1:06:55.040
<v Speaker 1>all of that. I think you know you'll learn, you'll

1:06:55.040 --> 1:06:59.360
<v Speaker 1>benefit hugely on a personal level from working with incredible

1:06:59.400 --> 1:07:03.680
<v Speaker 1>people and if you don't work with people that are

1:07:04.000 --> 1:07:06.760
<v Speaker 1>like that, then you're probably going to have a lot

1:07:06.760 --> 1:07:09.080
<v Speaker 1>of professional unhappiness. So it's kind of either or.

1:07:09.600 --> 1:07:14.760
<v Speaker 2>That's a really intriguing answer. So final question, what do

1:07:14.800 --> 1:07:18.360
<v Speaker 2>you know about the world of investing, machine learning, large

1:07:18.440 --> 1:07:22.400
<v Speaker 2>language models, just the application of technology to the field

1:07:22.440 --> 1:07:25.439
<v Speaker 2>of investing that you wish you knew twenty five years

1:07:25.520 --> 1:07:28.520
<v Speaker 2>or so ago when you were really first ramping up.

1:07:29.840 --> 1:07:33.720
<v Speaker 1>I think one of the most important lessons that I learned,

1:07:33.760 --> 1:07:35.640
<v Speaker 1>had to learn the hard way kind of going through

1:07:35.840 --> 1:07:39.960
<v Speaker 1>and running these systems, was that it's kind of comes

1:07:40.000 --> 1:07:43.440
<v Speaker 1>back to the point you made earlier about the primacy

1:07:43.480 --> 1:07:47.680
<v Speaker 1>of prediction rules. And it may be true that the

1:07:47.720 --> 1:07:51.360
<v Speaker 1>most important thing is the prediction quality, but there are

1:07:51.480 --> 1:07:55.280
<v Speaker 1>lots of other very necessary, mandatory ingredients, and I would

1:07:55.280 --> 1:07:57.680
<v Speaker 1>put kind of risk management at the top of that list.

1:07:57.720 --> 1:08:02.560
<v Speaker 1>So I think it's easy to to maybe neglect risk

1:08:02.600 --> 1:08:06.480
<v Speaker 1>management to a certain extent and focus all of your

1:08:06.520 --> 1:08:10.520
<v Speaker 1>attention on predictive accuracy. But I think it really does

1:08:10.560 --> 1:08:13.720
<v Speaker 1>turn out that if you don't have high quality risk

1:08:13.760 --> 1:08:16.479
<v Speaker 1>management to go along with that predictive accuracy, you won't succeed.

1:08:17.439 --> 1:08:20.640
<v Speaker 1>And I guess I wish I had appreciated that in

1:08:21.080 --> 1:08:23.080
<v Speaker 1>a really deep way twenty five years ago.

1:08:23.320 --> 1:08:27.400
<v Speaker 2>John, This has been really, absolutely fascinating. I don't even

1:08:27.479 --> 1:08:30.000
<v Speaker 2>know where to begin other than saying thank you for

1:08:30.040 --> 1:08:34.080
<v Speaker 2>being so generous with your time and your expertise. We

1:08:34.280 --> 1:08:36.920
<v Speaker 2>have been speaking with John mccauloff. He is the co

1:08:37.040 --> 1:08:41.360
<v Speaker 2>founder and chief investment officer at the five billion dollar

1:08:41.479 --> 1:08:46.040
<v Speaker 2>hedge fund Volleyon Group. If you enjoy this conversation, well,

1:08:46.320 --> 1:08:48.599
<v Speaker 2>be sure and check out any of the previous five

1:08:48.720 --> 1:08:53.000
<v Speaker 2>hundred we've done over the past nine years. You can

1:08:53.040 --> 1:08:57.200
<v Speaker 2>find those at iTunes, Spotify, YouTube, or wherever you find

1:08:57.280 --> 1:09:01.040
<v Speaker 2>your favorite podcast. Sign up for my daily reading list

1:09:01.560 --> 1:09:06.200
<v Speaker 2>at rid Halts. Follow me on Twitter at Barry Underscore

1:09:06.280 --> 1:09:10.200
<v Speaker 2>Rit Halts until I get my hacked account at rid

1:09:10.200 --> 1:09:16.240
<v Speaker 2>Holt's back. I say that. I say that because the

1:09:16.320 --> 1:09:20.759
<v Speaker 2>process of dealing with the seventeen people left at once

1:09:20.840 --> 1:09:27.000
<v Speaker 2>Twitter now x is unbelievably frustrating and annoying. Follow all

1:09:27.080 --> 1:09:31.160
<v Speaker 2>of the fine family of podcasts on Twitter at podcast

1:09:31.720 --> 1:09:33.800
<v Speaker 2>I would be remiss if I did not thank the

1:09:33.800 --> 1:09:37.639
<v Speaker 2>crack team that helps put these conversations together each week.

1:09:38.320 --> 1:09:42.080
<v Speaker 2>Paris Woald is my producer. Attiko val Bron is my

1:09:42.280 --> 1:09:47.519
<v Speaker 2>project manager. Sean Russo is my director of Research. I'm

1:09:47.600 --> 1:09:51.000
<v Speaker 2>Barry rid Halts. You've been listening to Masters in Business

1:09:51.560 --> 1:10:03.160
<v Speaker 2>on Bloomberg Radio at