WEBVTT - Two Researchers Explain How Quants Are Going To Revolutionize Long-Term Investing

0:00:08.680 --> 0:00:12.799
<v Speaker 1>Hello, and welcome to another episode of the Odd Lots Podcast.

0:00:12.840 --> 0:00:16.760
<v Speaker 1>I'm Joe Wisenthal and I'm Tracy Alloway. Tracy, I really

0:00:16.800 --> 0:00:21.000
<v Speaker 1>liked last week's episode with Andrew Lowe talking about quant

0:00:21.120 --> 0:00:25.320
<v Speaker 1>stuff and his sort of evolution of the efficient market

0:00:25.400 --> 0:00:28.920
<v Speaker 1>hypothesis and where that might go. Yeah, I did too.

0:00:29.000 --> 0:00:32.839
<v Speaker 1>I really like the ecosystem analogy, the idea that you

0:00:32.840 --> 0:00:35.640
<v Speaker 1>have all these different players with different motivations and they're

0:00:35.680 --> 0:00:40.200
<v Speaker 1>constantly evolving and adapting to the market. That point about

0:00:40.200 --> 0:00:43.000
<v Speaker 1>it adapting, and of course that's the name of his book,

0:00:43.080 --> 0:00:47.360
<v Speaker 1>Adaptive Market is the name of his hypothesis. Is really

0:00:47.520 --> 0:00:50.600
<v Speaker 1>key because one of the main points that he made

0:00:50.920 --> 0:00:54.000
<v Speaker 1>that I loved was this idea of hedge funds as

0:00:54.080 --> 0:00:57.920
<v Speaker 1>sort of the R and D laboratory for all of

0:00:57.960 --> 0:01:02.160
<v Speaker 1>the financial industry. Right, the funds are where innovative new

0:01:02.240 --> 0:01:05.720
<v Speaker 1>techniques get to be sort of hashed out without doing

0:01:06.040 --> 0:01:09.280
<v Speaker 1>usually too much damage, I guess, to the wider ecosyst

0:01:09.280 --> 0:01:13.680
<v Speaker 1>hopefully right, not always. There's certainly examples of hedge funds

0:01:13.680 --> 0:01:17.919
<v Speaker 1>actually having done major damage from time to time. But ideally,

0:01:18.160 --> 0:01:22.160
<v Speaker 1>you know what the evolution seems to be that some

0:01:22.200 --> 0:01:24.759
<v Speaker 1>new idea sort of starts in the hedge fund world

0:01:25.040 --> 0:01:27.600
<v Speaker 1>and eventually makes its way to the broader world. I

0:01:27.600 --> 0:01:29.759
<v Speaker 1>think the most obvious example of that that we could

0:01:29.760 --> 0:01:32.160
<v Speaker 1>say these days is a lot of this sort of

0:01:32.520 --> 0:01:36.360
<v Speaker 1>popular smart beta strategies e t f s that are

0:01:36.360 --> 0:01:41.119
<v Speaker 1>built on things like momentum or value or other factors,

0:01:41.440 --> 0:01:44.640
<v Speaker 1>sort of quantitative ideas that for many years were only

0:01:44.680 --> 0:01:48.920
<v Speaker 1>available to uh, you know, researchers at hedge funds. Right. So,

0:01:48.960 --> 0:01:53.160
<v Speaker 1>these sorts of quantitative investment or trading methods were usually

0:01:53.240 --> 0:01:56.040
<v Speaker 1>the purview of sophisticated hedge funds who had the time

0:01:56.080 --> 0:01:58.880
<v Speaker 1>and resources to develop them. And then you had a

0:01:58.920 --> 0:02:01.120
<v Speaker 1>bunch of e t f s who kind of caught

0:02:01.160 --> 0:02:03.800
<v Speaker 1>on and managed to replicate them. And now we can

0:02:03.840 --> 0:02:08.679
<v Speaker 1>all trade lighthedge funds for zero percent fees, right, exactly right,

0:02:09.040 --> 0:02:12.280
<v Speaker 1>And of course, once everyone can do it for very

0:02:12.320 --> 0:02:15.560
<v Speaker 1>few fees, I think it's safe to say those strategies

0:02:15.560 --> 0:02:19.840
<v Speaker 1>aren't going to produce the same returns, and hence the

0:02:19.919 --> 0:02:23.560
<v Speaker 1>market is forced to adapt again. Right. Presumably the hedge

0:02:23.560 --> 0:02:26.400
<v Speaker 1>funds are always trying to stay one step ahead as well, right,

0:02:27.120 --> 0:02:30.360
<v Speaker 1>exactly so, which raises the idea of like what will

0:02:30.400 --> 0:02:34.960
<v Speaker 1>be the next thing. If anyone can sort of invest

0:02:35.000 --> 0:02:38.720
<v Speaker 1>in a crude momentum strategy for virtually no fees, then

0:02:38.760 --> 0:02:41.440
<v Speaker 1>that requires the people on the cutting edge, the people

0:02:41.520 --> 0:02:44.160
<v Speaker 1>doing the R and D of this industry, to uh,

0:02:44.240 --> 0:02:46.000
<v Speaker 1>you know, figure out what the next big thing it's

0:02:46.000 --> 0:02:48.359
<v Speaker 1>gonna be. Do you know what the next big thing

0:02:48.400 --> 0:02:50.239
<v Speaker 1>is going to be? Joe? Can you share it with

0:02:50.680 --> 0:02:55.360
<v Speaker 1>your fellow partner at Odd Thoughts LLC? Sadly and unfortunately

0:02:56.000 --> 0:02:59.760
<v Speaker 1>to all of the odd Lots fan, I myself do

0:02:59.800 --> 0:03:03.480
<v Speaker 1>not know what the next big thing in quantitative strategy

0:03:03.639 --> 0:03:06.480
<v Speaker 1>or sort of advanced investing is going to be. But

0:03:06.560 --> 0:03:09.960
<v Speaker 1>I'm hoping that our guests on today's episode might be

0:03:10.040 --> 0:03:13.960
<v Speaker 1>able to shed some light. Who are there? Okay, so

0:03:14.040 --> 0:03:17.040
<v Speaker 1>today we're going to be talking to John Elberg. He

0:03:17.160 --> 0:03:21.240
<v Speaker 1>is the founder of Euclidean Technologies, a quant firm, as

0:03:21.240 --> 0:03:24.919
<v Speaker 1>well as Zach Lipton, a professor at Carnegie Mellon University

0:03:24.960 --> 0:03:28.519
<v Speaker 1>in the Business School and expert on machine learning. They

0:03:28.639 --> 0:03:33.480
<v Speaker 1>recently published a paper titled Improving factor based Quantitative Investing

0:03:34.000 --> 0:03:39.520
<v Speaker 1>by Forecasting company fundamentals. So what I think that means

0:03:39.560 --> 0:03:41.000
<v Speaker 1>and we'll talk to them, is, you know, we talk

0:03:41.080 --> 0:03:44.800
<v Speaker 1>all this stuff about price and computers and algorithms figuring

0:03:44.840 --> 0:03:47.880
<v Speaker 1>out what signal we can get from price, But maybe

0:03:48.480 --> 0:03:51.200
<v Speaker 1>the next generation can actually tell us something about the

0:03:51.240 --> 0:03:54.640
<v Speaker 1>fundamental workings of the company itself, and maybe this could

0:03:54.680 --> 0:03:57.880
<v Speaker 1>be sort of the next wave of where quant investing goes.

0:03:58.440 --> 0:04:02.080
<v Speaker 1>And this sounds absolutely senating, Joe, let's bring them on.

0:04:12.280 --> 0:04:15.240
<v Speaker 1>John and Zach, thank you very much for joining us.

0:04:15.920 --> 0:04:19.800
<v Speaker 1>Was that a reasonable characterization of sort of where your

0:04:20.160 --> 0:04:24.080
<v Speaker 1>paper and where your research is taking things? Yeah, I

0:04:24.120 --> 0:04:27.280
<v Speaker 1>think it is so. So. First of all, machine learning

0:04:27.320 --> 0:04:30.160
<v Speaker 1>has been kind of on a rocket ship of innovation

0:04:30.240 --> 0:04:33.159
<v Speaker 1>for the last ten years or so, and with the

0:04:33.240 --> 0:04:36.800
<v Speaker 1>advent of deep learning, you know, computers and machine learning

0:04:36.800 --> 0:04:39.120
<v Speaker 1>have been able to do things that you know, historically

0:04:39.120 --> 0:04:44.400
<v Speaker 1>have been very challenging, like image captioning and language translation.

0:04:45.080 --> 0:04:47.840
<v Speaker 1>So we Zach and I, you know, a couple of

0:04:47.880 --> 0:04:52.080
<v Speaker 1>years back, thought of the idea of collaborating to apply

0:04:52.560 --> 0:04:56.480
<v Speaker 1>deep learning to the problem of long term investing. So

0:04:56.520 --> 0:04:59.240
<v Speaker 1>how did you actually go about doing that and what

0:04:59.400 --> 0:05:03.680
<v Speaker 1>exactly do you mean by deep learning? That's exactly what

0:05:03.720 --> 0:05:07.159
<v Speaker 1>I wanted to know. To deep learning sort of the

0:05:07.200 --> 0:05:10.560
<v Speaker 1>rebranding of neural networks research to say I say I

0:05:10.600 --> 0:05:13.279
<v Speaker 1>had some data about a company, right like I had

0:05:13.680 --> 0:05:16.000
<v Speaker 1>machine learning. We call a vector of features. But what

0:05:16.240 --> 0:05:18.080
<v Speaker 1>we mean it's just like a list of attributes, each

0:05:18.120 --> 0:05:20.880
<v Speaker 1>of which is somehow like be made into a numerical quantity,

0:05:21.000 --> 0:05:24.240
<v Speaker 1>whether it's like their income, they're the number of assets whatever.

0:05:24.640 --> 0:05:28.039
<v Speaker 1>One way of deciding how to predict what the say,

0:05:28.120 --> 0:05:30.360
<v Speaker 1>what the price will be or something, as we say, well,

0:05:30.600 --> 0:05:33.400
<v Speaker 1>we're going to have this long vector of features, and

0:05:33.440 --> 0:05:36.200
<v Speaker 1>then we're going for every single company, uh, you know,

0:05:36.279 --> 0:05:38.640
<v Speaker 1>at every single time while this vector of features corresponding

0:05:38.640 --> 0:05:40.360
<v Speaker 1>to the state of the company at some period of time,

0:05:40.880 --> 0:05:44.640
<v Speaker 1>and then we'll have some target that we want to predict.

0:05:44.680 --> 0:05:47.600
<v Speaker 1>This could be a binary quantity like will the stock

0:05:47.640 --> 0:05:50.280
<v Speaker 1>go up or down in the next you know, time

0:05:50.400 --> 0:05:53.320
<v Speaker 1>unit of your choice, whether it's the next day or

0:05:53.320 --> 0:05:55.400
<v Speaker 1>in the next month or in the next year. Or

0:05:55.440 --> 0:05:58.599
<v Speaker 1>you could try to directly predict say the relative price,

0:05:58.680 --> 0:06:02.520
<v Speaker 1>so like you know, the percent improvement or decrease based

0:06:02.560 --> 0:06:05.440
<v Speaker 1>on sort of the available features. So one of the

0:06:05.480 --> 0:06:07.680
<v Speaker 1>simplest ways you can make a model is you say, hey,

0:06:07.720 --> 0:06:09.120
<v Speaker 1>I've got a bunch of features. I'm gonna do is

0:06:09.120 --> 0:06:11.479
<v Speaker 1>I'm gonna take a weighted some of these features the

0:06:11.520 --> 0:06:14.000
<v Speaker 1>way like you'd calculate a score to see, like what's

0:06:14.000 --> 0:06:16.240
<v Speaker 1>your risk of a heart disease. Maybe you take you know, well,

0:06:16.240 --> 0:06:20.120
<v Speaker 1>four times your cholesterol plus two times your age minus

0:06:20.120 --> 0:06:23.440
<v Speaker 1>one times you know, your amount of good cholesterol is

0:06:23.480 --> 0:06:25.800
<v Speaker 1>something like this if you come up with some formula

0:06:25.920 --> 0:06:28.800
<v Speaker 1>that's expressed simply as a weighted sum, so that would

0:06:28.800 --> 0:06:31.719
<v Speaker 1>be a linear model. Where deep learning make things different

0:06:31.839 --> 0:06:35.479
<v Speaker 1>is that you have many different layers of computation that

0:06:35.800 --> 0:06:38.720
<v Speaker 1>you basically are learning very complex patterns that maybe couldn't

0:06:38.760 --> 0:06:41.120
<v Speaker 1>be expressed as a as a weighted sum. So maybe

0:06:41.120 --> 0:06:44.960
<v Speaker 1>you're uncovering interactions between all of your features. Um. So,

0:06:45.000 --> 0:06:48.159
<v Speaker 1>for example, if you want to learn to recognize a

0:06:48.240 --> 0:06:50.839
<v Speaker 1>dog versus a cat in an image, there's no weighted

0:06:50.880 --> 0:06:53.000
<v Speaker 1>sum of pixel values it's actually going to tell you

0:06:53.000 --> 0:06:55.359
<v Speaker 1>this because it's just the patterns too complicated. So in

0:06:55.360 --> 0:06:58.239
<v Speaker 1>that case you need some some more like heavy duty machinery.

0:06:58.360 --> 0:07:00.800
<v Speaker 1>So what you do in deep learning essentially is that

0:07:00.839 --> 0:07:05.320
<v Speaker 1>you learn multiple successive transformations of your data such that

0:07:05.480 --> 0:07:09.640
<v Speaker 1>after applying many such transformations, you know, could be two,

0:07:09.760 --> 0:07:12.480
<v Speaker 1>four or five, ten, whatever, you come out at the

0:07:12.600 --> 0:07:15.240
<v Speaker 1>end of a representation of your data where you actually

0:07:15.280 --> 0:07:17.680
<v Speaker 1>can learn a very simple model on top of that.

0:07:17.920 --> 0:07:21.280
<v Speaker 1>So we sometimes call deep learning representation learning because it's

0:07:21.320 --> 0:07:24.200
<v Speaker 1>what we're doing is we're both learning how to feature

0:07:24.200 --> 0:07:26.600
<v Speaker 1>eye our data essentially, how to transform it and how

0:07:26.600 --> 0:07:29.360
<v Speaker 1>to classify it at the same time. So one of

0:07:29.400 --> 0:07:32.720
<v Speaker 1>the things in sort of traditional quantity, a lot of

0:07:32.800 --> 0:07:37.600
<v Speaker 1>quantitative investing focuses a lot on price and sort of

0:07:37.960 --> 0:07:43.240
<v Speaker 1>listening to your characterization. It seems like price and this

0:07:43.320 --> 0:07:46.200
<v Speaker 1>is relatively speaking, of course, price is a fairly you know,

0:07:46.240 --> 0:07:49.240
<v Speaker 1>it's sort of easy idea to capture. So you can

0:07:49.280 --> 0:07:53.720
<v Speaker 1>come up with some definition of what momentum is and

0:07:53.760 --> 0:07:56.640
<v Speaker 1>then sort of say, okay, these stocks are experiencing momentum

0:07:56.760 --> 0:08:00.600
<v Speaker 1>right now, or these stocks aren't, and then his history

0:08:00.720 --> 0:08:02.560
<v Speaker 1>tell us the stocks are going to do next if

0:08:02.560 --> 0:08:07.120
<v Speaker 1>they sort of meet these characterizations. Your paper really looks

0:08:07.200 --> 0:08:10.440
<v Speaker 1>at what can you do with this technology for sort

0:08:10.440 --> 0:08:14.080
<v Speaker 1>of looking at future fundamental so looking at the sort

0:08:14.120 --> 0:08:17.080
<v Speaker 1>of characteristics of the company and not just trying to

0:08:17.560 --> 0:08:21.440
<v Speaker 1>see where prices going, but where those characteristics are going,

0:08:21.480 --> 0:08:25.960
<v Speaker 1>so explain sort of what your research specifically attempts to uncover.

0:08:26.680 --> 0:08:30.160
<v Speaker 1>So one thing that deep learning allows a researcher to

0:08:30.240 --> 0:08:35.280
<v Speaker 1>do is look at kind of more raw features um.

0:08:35.320 --> 0:08:38.520
<v Speaker 1>Like Zach explained in the image case, you're looking at

0:08:38.640 --> 0:08:42.840
<v Speaker 1>raw pixels. Now, if you think about most quant funds

0:08:43.000 --> 0:08:46.240
<v Speaker 1>and most quant models, they the features that go into

0:08:46.280 --> 0:08:49.160
<v Speaker 1>the model are highly engineered, and they include things like

0:08:49.240 --> 0:08:53.800
<v Speaker 1>price and maybe book value, price divided by book value,

0:08:53.840 --> 0:08:57.520
<v Speaker 1>price divided by earnings, and then maybe some momentum features.

0:08:58.000 --> 0:09:01.000
<v Speaker 1>The interesting thing about deep learning is it allows you

0:09:01.080 --> 0:09:05.720
<v Speaker 1>to potentially let it uncover what the best features are.

0:09:05.760 --> 0:09:09.240
<v Speaker 1>If you over engineer features, you may not find the

0:09:09.240 --> 0:09:12.040
<v Speaker 1>ones that are best to predict what you're interested in predicting.

0:09:12.600 --> 0:09:16.360
<v Speaker 1>So that, you know, allows you to potentially find features

0:09:16.360 --> 0:09:20.199
<v Speaker 1>in the data that you wouldn't find through which traditional

0:09:20.240 --> 0:09:24.640
<v Speaker 1>feature engineering process. Yeah, and you know, to directly address

0:09:24.679 --> 0:09:27.080
<v Speaker 1>your question, your point is that the very most obvious

0:09:27.080 --> 0:09:28.920
<v Speaker 1>thing you could say, now, if I have this, I

0:09:28.960 --> 0:09:31.199
<v Speaker 1>have this learning machine, I have a bunch of features,

0:09:31.200 --> 0:09:32.720
<v Speaker 1>and I have to choose what am I going to predict?

0:09:33.080 --> 0:09:35.000
<v Speaker 1>The very most obvious thing to try to predict is

0:09:35.000 --> 0:09:38.040
<v Speaker 1>the price, because if you can actually do that perfectly,

0:09:38.360 --> 0:09:41.240
<v Speaker 1>then you're done, right. If if you actually know which

0:09:41.240 --> 0:09:43.280
<v Speaker 1>way the price is going to move in the next year,

0:09:43.440 --> 0:09:46.679
<v Speaker 1>then you can make the perfect choice. So the problem

0:09:46.760 --> 0:09:49.440
<v Speaker 1>is that that's that's not so easy because the markets

0:09:49.440 --> 0:09:53.120
<v Speaker 1>are quite capricious, right, Um, So one problem that we

0:09:53.200 --> 0:09:55.280
<v Speaker 1>found is we actually did these models where we were

0:09:55.280 --> 0:09:58.280
<v Speaker 1>trying to predict price directly. But among the other things

0:09:58.280 --> 0:10:00.520
<v Speaker 1>that you have is that one, it's hard to learn

0:10:00.640 --> 0:10:03.280
<v Speaker 1>models that do a good job of this that are

0:10:03.360 --> 0:10:06.480
<v Speaker 1>sort of robust across different time periods. So you might

0:10:06.559 --> 0:10:08.600
<v Speaker 1>have like, hey, I'm going to train on these like

0:10:08.720 --> 0:10:11.080
<v Speaker 1>decades of data and I'm going to try to directly

0:10:11.160 --> 0:10:13.800
<v Speaker 1>predict the price. But then I come into periods of

0:10:13.840 --> 0:10:16.760
<v Speaker 1>time where the markets behaving a little bit differently, and

0:10:16.800 --> 0:10:20.080
<v Speaker 1>we call this nonstationarity. Basically, like you're modeled, there's a

0:10:20.080 --> 0:10:23.280
<v Speaker 1>great job of uncovering the pattern that's present in the

0:10:23.440 --> 0:10:25.760
<v Speaker 1>data that you gave to the model, but that data

0:10:25.880 --> 0:10:28.079
<v Speaker 1>is anchored to some period of time, and the future

0:10:28.160 --> 0:10:30.840
<v Speaker 1>data that comes in, you know, the patterns changed a

0:10:30.840 --> 0:10:33.680
<v Speaker 1>little bit, and so the kind of like function that

0:10:33.679 --> 0:10:35.840
<v Speaker 1>you've learned no longer does a great job. So so

0:10:35.880 --> 0:10:38.120
<v Speaker 1>what we do instead of directly trying to predict the price,

0:10:38.480 --> 0:10:40.719
<v Speaker 1>the idea that we had was to think, well, this

0:10:40.880 --> 0:10:43.880
<v Speaker 1>core idea behind a factor model, generally right, is to

0:10:43.960 --> 0:10:45.840
<v Speaker 1>just say, hey, I'm going to sort all the stocks

0:10:46.320 --> 0:10:49.360
<v Speaker 1>according to some reason idea, Hey, the price of the

0:10:49.360 --> 0:10:52.440
<v Speaker 1>company should be tied to its income, any company, and

0:10:52.520 --> 0:10:55.040
<v Speaker 1>somehow it is justified by like it's the long term

0:10:55.120 --> 0:10:58.760
<v Speaker 1>discounted cash as well. Let's just say a factor strategy

0:10:58.800 --> 0:11:00.600
<v Speaker 1>just something very simple. It says, well, let's just look

0:11:00.600 --> 0:11:03.760
<v Speaker 1>at the current income divided by say the current price

0:11:03.880 --> 0:11:06.640
<v Speaker 1>or current income divided by the current you know, market

0:11:06.640 --> 0:11:10.880
<v Speaker 1>cap or enterprise value, some some notion of income and

0:11:10.920 --> 0:11:13.880
<v Speaker 1>some notion notion of financial performance, and divided by some

0:11:13.920 --> 0:11:16.640
<v Speaker 1>notion of company size and this, and then I'm going

0:11:16.679 --> 0:11:18.640
<v Speaker 1>to sort the stocks according to this. The ones that

0:11:18.720 --> 0:11:22.200
<v Speaker 1>come out highest are like most cheaply priced, so let's

0:11:22.200 --> 0:11:25.200
<v Speaker 1>buy those. So the ideas to say, hey, well, what

0:11:25.240 --> 0:11:27.800
<v Speaker 1>if I told you so. We actually know that this

0:11:27.840 --> 0:11:29.720
<v Speaker 1>does pretty well in back testing whether or not the

0:11:29.720 --> 0:11:32.600
<v Speaker 1>patterns will hold in the future. But you know, many

0:11:32.600 --> 0:11:34.199
<v Speaker 1>people have made a lot of money for many years,

0:11:34.360 --> 0:11:36.680
<v Speaker 1>so there's an idea of if you knew the income,

0:11:36.720 --> 0:11:39.520
<v Speaker 1>this is a good thing, a reasonable thing to try

0:11:39.559 --> 0:11:43.200
<v Speaker 1>to do. Our question that we asked, Unfortunately, um, John,

0:11:43.440 --> 0:11:46.840
<v Speaker 1>because he's actually in finance and I'm not, has this

0:11:47.240 --> 0:11:50.360
<v Speaker 1>really great set of like industry grade tools that unlike

0:11:50.360 --> 0:11:52.600
<v Speaker 1>most academic papers that look at like one stock over

0:11:52.640 --> 0:11:54.280
<v Speaker 1>a short period of time or something, we actually had,

0:11:54.360 --> 0:11:57.439
<v Speaker 1>you know, forty plus years of financial data and can

0:11:57.480 --> 0:12:00.920
<v Speaker 1>actually simulate like an applausible that guess what's going on.

0:12:01.320 --> 0:12:04.720
<v Speaker 1>We said, well, what if you did a factor model,

0:12:05.000 --> 0:12:08.120
<v Speaker 1>but someone gave you a crystal ball. So basically, instead

0:12:08.120 --> 0:12:11.240
<v Speaker 1>of dividing the current income divided by the current enterprise value,

0:12:11.600 --> 0:12:15.160
<v Speaker 1>someone gave you next year's income, and so you sorted

0:12:15.160 --> 0:12:18.080
<v Speaker 1>the stocks according to next year's income divided by the

0:12:18.120 --> 0:12:22.080
<v Speaker 1>current enterprise value something like this. So you're you're able

0:12:22.120 --> 0:12:24.560
<v Speaker 1>to peek into the future. You know how the company

0:12:24.600 --> 0:12:28.160
<v Speaker 1>will be performing next year, and you're saying, is how

0:12:28.240 --> 0:12:31.360
<v Speaker 1>is its next year's performance? Is that based on next

0:12:31.440 --> 0:12:34.280
<v Speaker 1>year's performance? Is its current price? Is it currently priced

0:12:34.320 --> 0:12:36.640
<v Speaker 1>cheaply or not? So it's what we call like a

0:12:36.720 --> 0:12:39.959
<v Speaker 1>clairvoyant factor model. Like you don't actually have such a

0:12:39.960 --> 0:12:41.880
<v Speaker 1>crystal ball, but if you, you know, give us some

0:12:41.960 --> 0:12:44.360
<v Speaker 1>license and you imagine that you did what would have

0:12:44.360 --> 0:12:46.320
<v Speaker 1>happened if you went back in history and you had

0:12:46.320 --> 0:12:49.080
<v Speaker 1>this crystal ball and you traded based on a clairvoyant

0:12:49.120 --> 0:12:51.800
<v Speaker 1>factor model, and it turns out that the clairvoyant factor

0:12:51.880 --> 0:12:55.000
<v Speaker 1>model just crushes it. So it does really, really well

0:12:55.040 --> 0:12:58.640
<v Speaker 1>and and not surprisingly, the more clairvoyant the model is.

0:12:58.679 --> 0:13:01.480
<v Speaker 1>So if it if it knows the performance of the

0:13:01.480 --> 0:13:05.440
<v Speaker 1>company six months out versus now, or twelve months out

0:13:05.520 --> 0:13:08.040
<v Speaker 1>versus six months out, it keeps getting better and better

0:13:08.080 --> 0:13:12.280
<v Speaker 1>and better. So what we decided was, well, maybe trying

0:13:12.280 --> 0:13:15.160
<v Speaker 1>to predict price directly as a bit you know, subject

0:13:15.520 --> 0:13:18.720
<v Speaker 1>to you know, a kind of fickle market, but the

0:13:18.760 --> 0:13:23.280
<v Speaker 1>patterns present in the fundamental reporting data itself is more stable.

0:13:23.840 --> 0:13:25.880
<v Speaker 1>So in our method what we do is instead of

0:13:25.920 --> 0:13:28.480
<v Speaker 1>just trying to predict a return, we try to predict

0:13:28.640 --> 0:13:32.680
<v Speaker 1>actually the fundamental reporting data itself, just so we're given

0:13:33.360 --> 0:13:36.920
<v Speaker 1>um these these features for like a trailing window of

0:13:36.920 --> 0:13:42.040
<v Speaker 1>of time corresponding to the company's like financial reporting, and

0:13:42.080 --> 0:13:44.199
<v Speaker 1>then we're trying to predict what they're going to report

0:13:44.240 --> 0:13:46.600
<v Speaker 1>next year. And then based on what they're going we

0:13:46.640 --> 0:13:49.040
<v Speaker 1>think they're going to report next year, we sort the

0:13:49.040 --> 0:13:52.920
<v Speaker 1>companies according to a value factor. So in essence, you

0:13:52.960 --> 0:13:56.760
<v Speaker 1>can pick out of that future prediction the components of

0:13:56.800 --> 0:13:59.960
<v Speaker 1>the factor model. Let let it whether it's a few

0:14:00.000 --> 0:14:04.480
<v Speaker 1>future predicted earnings, and you can take that out of

0:14:04.520 --> 0:14:09.079
<v Speaker 1>the future predicted fundamentals, divide that by current enterprise value

0:14:09.679 --> 0:14:12.160
<v Speaker 1>and and sort and then you have basically a factor

0:14:12.240 --> 0:14:17.719
<v Speaker 1>model which you are using. Instead of trailing twelve months earnings,

0:14:18.120 --> 0:14:21.560
<v Speaker 1>you're using the future predicted earnings by the deep learning

0:14:21.760 --> 0:14:24.680
<v Speaker 1>the deep neural network. So, as I understand it, the

0:14:25.400 --> 0:14:28.960
<v Speaker 1>deep learning or the neural networks are used primarily to

0:14:29.000 --> 0:14:34.960
<v Speaker 1>forecast the future fundamentals based on historic performance. Is that right,

0:14:35.800 --> 0:14:40.680
<v Speaker 1>historic fundamentals? Yeah, okay, So walk us through how you

0:14:40.720 --> 0:14:45.040
<v Speaker 1>actually develop an application that's able to do that, Like

0:14:45.320 --> 0:14:48.400
<v Speaker 1>what are those neural networks looking at and what sort

0:14:48.440 --> 0:14:51.360
<v Speaker 1>of information are they drawing in other than you know,

0:14:51.440 --> 0:14:56.880
<v Speaker 1>past predictive data to make those forecasts. There's two parts

0:14:56.880 --> 0:14:59.120
<v Speaker 1>of that. One is the data that we use, and

0:14:59.120 --> 0:15:02.360
<v Speaker 1>then two is the technology we use to build the

0:15:02.400 --> 0:15:06.720
<v Speaker 1>deep you know, neural network models. So on the data side,

0:15:06.960 --> 0:15:11.680
<v Speaker 1>what you use is historical fundamentals on all companies you

0:15:11.720 --> 0:15:13.880
<v Speaker 1>know that have ever you know, been listed in the

0:15:14.000 --> 0:15:17.720
<v Speaker 1>US for the past fifty years, and so what a

0:15:17.800 --> 0:15:21.880
<v Speaker 1>historical fundamentals mean? What it means earnings, book value, anything

0:15:21.880 --> 0:15:24.080
<v Speaker 1>you can find on an income statement and balance sheet

0:15:24.120 --> 0:15:27.880
<v Speaker 1>going back in time. In addition to fundamentals, we also

0:15:28.600 --> 0:15:32.080
<v Speaker 1>use as inputs to the to the model, you know,

0:15:32.240 --> 0:15:37.080
<v Speaker 1>momentum over you know, one month, six months, twelve months.

0:15:37.120 --> 0:15:38.920
<v Speaker 1>So then you know, if you think of it as

0:15:39.000 --> 0:15:43.040
<v Speaker 1>like a big you know, spreadsheet table where each row

0:15:43.600 --> 0:15:47.760
<v Speaker 1>is a point in time for a specific company, and

0:15:47.800 --> 0:15:52.120
<v Speaker 1>then you can think of sequences going back through time.

0:15:52.400 --> 0:15:57.400
<v Speaker 1>You know IBM in March of and then all of

0:15:57.440 --> 0:16:00.720
<v Speaker 1>its fundamentals in one row, plus it's moment at them,

0:16:00.760 --> 0:16:03.920
<v Speaker 1>and then that going back five years and time. So

0:16:03.920 --> 0:16:07.880
<v Speaker 1>those sequences, both the fundamentals and the momentum are fed

0:16:08.080 --> 0:16:12.680
<v Speaker 1>into a neural network and uh and and all of

0:16:12.720 --> 0:16:15.840
<v Speaker 1>those sequences for all companies and all time are fed

0:16:15.880 --> 0:16:19.840
<v Speaker 1>into a neural network and are trained to predict what

0:16:20.000 --> 0:16:22.800
<v Speaker 1>the fundamentals will be, you know, one time step out

0:16:22.840 --> 0:16:36.440
<v Speaker 1>in the future. So just to sort of summarize it

0:16:36.480 --> 0:16:39.360
<v Speaker 1>all up, you know, it's like, if you have all

0:16:39.400 --> 0:16:43.440
<v Speaker 1>these strategies, if you have all these funds chasing things

0:16:43.480 --> 0:16:49.040
<v Speaker 1>like earnings, quality, earnings, growth, momentum, all kinds of stuff

0:16:49.080 --> 0:16:53.600
<v Speaker 1>like that, your goal is to anticipate today when those

0:16:53.640 --> 0:16:56.160
<v Speaker 1>funds are going to be buying tomorrow. Is that a

0:16:56.160 --> 0:16:58.320
<v Speaker 1>fair way to characterize it. I think that's a fair

0:16:58.440 --> 0:17:01.280
<v Speaker 1>way to characterize is it. I think what we're really

0:17:01.320 --> 0:17:03.720
<v Speaker 1>just doing is trying to build a better a better

0:17:03.760 --> 0:17:06.840
<v Speaker 1>factor model, A better factor model in the sense that

0:17:06.880 --> 0:17:09.560
<v Speaker 1>you know, as Zach explained, if you had a clairvoyant

0:17:09.560 --> 0:17:13.679
<v Speaker 1>model where you actually knew what future fundamentals were and

0:17:13.760 --> 0:17:16.680
<v Speaker 1>could plug that into a factor model, you do substantially

0:17:16.760 --> 0:17:21.399
<v Speaker 1>better than what you could achieve with a value factor model. Today,

0:17:21.840 --> 0:17:25.040
<v Speaker 1>we're not like directly considering the psychology of the other

0:17:25.040 --> 0:17:28.440
<v Speaker 1>players in the market in this particular approach, right, No, sure,

0:17:28.920 --> 0:17:32.920
<v Speaker 1>but it's essentially saying, like, maybe the way to characterize

0:17:32.920 --> 0:17:36.600
<v Speaker 1>it is, if you want to invest on some fundamental

0:17:36.680 --> 0:17:40.680
<v Speaker 1>factor like earnings quality or earnings growth, bottom line, is

0:17:41.320 --> 0:17:44.679
<v Speaker 1>better to look at future twelve month results rather than

0:17:44.720 --> 0:17:48.840
<v Speaker 1>trailing twelve months. You look at the trailing, but you're

0:17:48.960 --> 0:17:51.959
<v Speaker 1>trying to predict the future. Like, so those two components, right,

0:17:52.000 --> 0:17:54.720
<v Speaker 1>you could say, like one is we have the component

0:17:54.800 --> 0:17:58.080
<v Speaker 1>that is trying to predict the future fundamentals. You know,

0:17:58.119 --> 0:18:01.520
<v Speaker 1>imagine that I came for the future, and I got

0:18:01.520 --> 0:18:03.919
<v Speaker 1>out of my time machine, and I gave you the

0:18:03.960 --> 0:18:07.320
<v Speaker 1>earnings reports from the future. Right, So so the first

0:18:07.320 --> 0:18:09.520
<v Speaker 1>thing you need is how do I get an approximate

0:18:09.600 --> 0:18:12.200
<v Speaker 1>time machine, right, which in our case is a predictive

0:18:12.200 --> 0:18:14.440
<v Speaker 1>model that has a good guess about what the future

0:18:14.440 --> 0:18:17.040
<v Speaker 1>will look like. The second thing is you still need

0:18:17.080 --> 0:18:20.080
<v Speaker 1>a way of executing on the strategy ones I. You

0:18:20.119 --> 0:18:23.560
<v Speaker 1>still need a way to decide which stocks to buy, right, So,

0:18:24.119 --> 0:18:27.520
<v Speaker 1>based on based on this future information, Like, it's possible

0:18:27.560 --> 0:18:29.359
<v Speaker 1>that if I if I come from the future and

0:18:29.400 --> 0:18:32.160
<v Speaker 1>I give you the earnings report, and I tell you

0:18:32.200 --> 0:18:35.199
<v Speaker 1>what the future income will be, what, it's possible that

0:18:35.200 --> 0:18:36.600
<v Speaker 1>the income is going to go up with the stock

0:18:36.640 --> 0:18:38.960
<v Speaker 1>price is going to go down, you know, like say

0:18:39.119 --> 0:18:41.720
<v Speaker 1>it's an Apple and like they made a lot more money,

0:18:41.760 --> 0:18:44.600
<v Speaker 1>but it was also like announced that they had a

0:18:44.640 --> 0:18:48.920
<v Speaker 1>major plant failure in the iPhone fourteen or whatever they're

0:18:49.000 --> 0:18:52.040
<v Speaker 1>up to is going to be delayed. So these two

0:18:52.040 --> 0:18:54.640
<v Speaker 1>components are are a little bit modular, Like we could

0:18:54.640 --> 0:18:57.320
<v Speaker 1>come up with m. John I think is more the

0:18:57.359 --> 0:18:59.680
<v Speaker 1>domain expert, so I'm I'm more the machine learning guy.

0:18:59.760 --> 0:19:02.119
<v Speaker 1>Like I'm sure John could come up with you know,

0:19:02.400 --> 0:19:04.920
<v Speaker 1>a million other ways that you might imagine that someone

0:19:04.960 --> 0:19:08.000
<v Speaker 1>would try to execute on this information. In our case,

0:19:08.040 --> 0:19:10.320
<v Speaker 1>what we're doing is we've adapted a factor model to

0:19:10.560 --> 0:19:14.840
<v Speaker 1>work with this kind of future guess. So one other example,

0:19:15.400 --> 0:19:17.439
<v Speaker 1>so so again, in our case, what we're doing is

0:19:17.480 --> 0:19:20.879
<v Speaker 1>taking the predicted future fundamentals and feeding that into a

0:19:20.960 --> 0:19:25.440
<v Speaker 1>value factor model. But you could imagine using let's say

0:19:25.600 --> 0:19:29.480
<v Speaker 1>the deep neural networks said, you know, a company is

0:19:29.480 --> 0:19:33.080
<v Speaker 1>going to do a hundred million, but consensus estimates in

0:19:33.080 --> 0:19:36.560
<v Speaker 1>in in earnings. Let's say, but consensus estimates said it's

0:19:36.560 --> 0:19:41.359
<v Speaker 1>gonna do seventy five million in in earnings. Well, you

0:19:41.400 --> 0:19:43.640
<v Speaker 1>know that might be you could you can imagine devising

0:19:43.640 --> 0:19:46.280
<v Speaker 1>a strategy around that where you'd want to go, you know,

0:19:46.359 --> 0:19:49.919
<v Speaker 1>bet on those guys and ones where consensus estimates are

0:19:49.960 --> 0:19:53.120
<v Speaker 1>above what the deep neural network is predicting, you'd want

0:19:53.119 --> 0:19:56.679
<v Speaker 1>to bet against. Right, assuming the current price is pricing

0:19:56.760 --> 0:19:59.520
<v Speaker 1>and that that's a really you know, John, you shouldnt

0:19:59.520 --> 0:20:03.040
<v Speaker 1>give away so goods, that that's a really good idea.

0:20:03.920 --> 0:20:08.800
<v Speaker 1>So are these kinds of machine learning driven predictive models

0:20:09.080 --> 0:20:11.320
<v Speaker 1>the future of investing? You think is that the way

0:20:11.320 --> 0:20:14.199
<v Speaker 1>that we're heading. I think what this paper showed is

0:20:14.240 --> 0:20:17.560
<v Speaker 1>that there's a lot of potential in using deep learning

0:20:17.680 --> 0:20:21.400
<v Speaker 1>to long term investing. I think that there's been some

0:20:21.480 --> 0:20:25.520
<v Speaker 1>debate about whether, you know, deep learning, which requires a

0:20:25.560 --> 0:20:29.760
<v Speaker 1>lot of data um to to to build successful models,

0:20:30.400 --> 0:20:34.320
<v Speaker 1>um whether in finance there's enough data, or whether you

0:20:34.359 --> 0:20:38.680
<v Speaker 1>even need this these kinds of complex models in finance,

0:20:38.720 --> 0:20:40.960
<v Speaker 1>I mean a lot of quant people feel, you know,

0:20:41.119 --> 0:20:44.679
<v Speaker 1>linear simple factor models are the best route to go,

0:20:45.520 --> 0:20:47.919
<v Speaker 1>And I think what we showed here is that if

0:20:47.920 --> 0:20:51.080
<v Speaker 1>you're trying to predict price changes, that might be true.

0:20:51.760 --> 0:20:55.800
<v Speaker 1>But if you decompose the problem into first trying to

0:20:55.840 --> 0:20:59.040
<v Speaker 1>predict fundamentals and then later you know, through a factor

0:20:59.119 --> 0:21:01.920
<v Speaker 1>model or some other method, trying to use those predicted

0:21:01.920 --> 0:21:06.720
<v Speaker 1>fundamentals to predict price, deep learning has a lot of

0:21:06.720 --> 0:21:12.200
<v Speaker 1>potential and does does substantially better at predicting future fundamentals

0:21:12.200 --> 0:21:14.280
<v Speaker 1>than than what you could do with a linear model.

0:21:14.600 --> 0:21:17.560
<v Speaker 1>There's a sort of a technical reason to recommend the

0:21:17.600 --> 0:21:20.760
<v Speaker 1>way we've cast a problem also without going too far

0:21:20.760 --> 0:21:24.520
<v Speaker 1>into the weeds. Basically, uh, you think really really powerful

0:21:24.600 --> 0:21:27.639
<v Speaker 1>machine learning models, deep neural networks. The thing that you

0:21:27.720 --> 0:21:29.879
<v Speaker 1>worried about is John was talking about how people people

0:21:29.960 --> 0:21:32.240
<v Speaker 1>agonize over what can you bring us to bear on

0:21:32.280 --> 0:21:34.520
<v Speaker 1>long term investing because you don't have as much data

0:21:34.960 --> 0:21:38.119
<v Speaker 1>right as if you were looking at the you know,

0:21:38.400 --> 0:21:42.679
<v Speaker 1>micro second kind of trade frequency, then you'd have, you know,

0:21:42.960 --> 0:21:45.399
<v Speaker 1>trillions of trade examples or something you get on. But

0:21:45.800 --> 0:21:47.760
<v Speaker 1>if you if you're looking at you know, your your

0:21:47.760 --> 0:21:49.879
<v Speaker 1>time tick is I have a data point you know,

0:21:49.960 --> 0:21:53.000
<v Speaker 1>once per month or once per year suddenly, and I

0:21:53.040 --> 0:21:56.439
<v Speaker 1>only have thousands of stocks, not millions of stocks. You

0:21:56.480 --> 0:22:00.199
<v Speaker 1>don't have such a huge amount of data. Um, So

0:22:00.840 --> 0:22:03.080
<v Speaker 1>what you worry about is that a model given given

0:22:03.119 --> 0:22:05.919
<v Speaker 1>a super powerful model, like a super overpowered model, and

0:22:05.960 --> 0:22:08.399
<v Speaker 1>then not too much data, that there's a propensity for

0:22:08.440 --> 0:22:10.840
<v Speaker 1>the models to do what we call overfitting, which is

0:22:10.880 --> 0:22:13.000
<v Speaker 1>the model basically it does a really good job of

0:22:13.040 --> 0:22:16.600
<v Speaker 1>memorizing the training data it's seen, but it learns kind

0:22:16.600 --> 0:22:19.639
<v Speaker 1>of a spurious pattern that doesn't generalize to future data

0:22:19.680 --> 0:22:23.040
<v Speaker 1>that it hasn't seen. So one cool thing about the

0:22:23.080 --> 0:22:25.800
<v Speaker 1>way that we're casting the problem is that we're not

0:22:25.880 --> 0:22:28.600
<v Speaker 1>just trying to predict the factor of interest. We're actually

0:22:28.600 --> 0:22:31.119
<v Speaker 1>trying to predict all the factors in the future. And

0:22:31.160 --> 0:22:35.159
<v Speaker 1>this means that the model has to simultaneously get the

0:22:35.200 --> 0:22:37.679
<v Speaker 1>income right, and get the assets right, and get the

0:22:37.720 --> 0:22:40.600
<v Speaker 1>debt right, and get all these different factors that are available.

0:22:40.600 --> 0:22:43.239
<v Speaker 1>So John was a fifteen target factors that we have

0:22:43.359 --> 0:22:46.960
<v Speaker 1>that we're trying to predict. So so in this case,

0:22:47.200 --> 0:22:48.680
<v Speaker 1>this sort of like this is this is what we

0:22:48.720 --> 0:22:51.960
<v Speaker 1>call multitask learning and the machine learning literature. And one

0:22:52.080 --> 0:22:55.320
<v Speaker 1>nice effect of multitask learning as that has a generalization

0:22:56.119 --> 0:22:58.520
<v Speaker 1>effect in that it's it's harder to fit a spurious

0:22:58.560 --> 0:23:00.960
<v Speaker 1>hypothesis because you have to come up with a representation

0:23:01.080 --> 0:23:03.679
<v Speaker 1>that is good for task one and also good for

0:23:03.720 --> 0:23:06.199
<v Speaker 1>task two, and also good for tast three. And the

0:23:06.240 --> 0:23:10.040
<v Speaker 1>probability that you come up with a pattern that's that's

0:23:10.080 --> 0:23:12.479
<v Speaker 1>good for solving all of these tasks that is not

0:23:12.600 --> 0:23:15.399
<v Speaker 1>the true pattern is much smaller than if you're only

0:23:15.520 --> 0:23:18.960
<v Speaker 1>like trying to solve one task, where it's easier to

0:23:19.000 --> 0:23:21.280
<v Speaker 1>just kind of memorize those data points. So we have

0:23:21.320 --> 0:23:24.960
<v Speaker 1>like essentially sixteen times as much training data and in

0:23:25.040 --> 0:23:29.400
<v Speaker 1>some relevant sense, so I have to ask in the

0:23:29.560 --> 0:23:35.879
<v Speaker 1>abstract of your paper or in the intro, you say that, um,

0:23:36.040 --> 0:23:40.399
<v Speaker 1>with this approach, you can improve your annual returns pretty

0:23:40.400 --> 0:23:44.240
<v Speaker 1>substantially over a standard factor model. In a bad test.

0:23:44.280 --> 0:23:48.280
<v Speaker 1>Seventeen point one percent versus a fourteen point four percent,

0:23:48.880 --> 0:23:52.119
<v Speaker 1>just pretty big beat. But as we know, and as

0:23:52.160 --> 0:23:54.000
<v Speaker 1>there's a lot of people pointed out, there's a lot

0:23:54.040 --> 0:23:57.840
<v Speaker 1>of strategies that seem to work in academic papers and

0:23:57.880 --> 0:24:01.639
<v Speaker 1>then when they're put into pract is, they don't seem

0:24:01.880 --> 0:24:06.080
<v Speaker 1>the results don't seem to arrive as easily. John, in

0:24:06.200 --> 0:24:09.800
<v Speaker 1>your firm, are you seeing the results of your research

0:24:10.280 --> 0:24:12.600
<v Speaker 1>that on paper look very compelling actually play out in

0:24:12.600 --> 0:24:17.280
<v Speaker 1>the market. So so this this paper, uh, we we

0:24:17.359 --> 0:24:21.280
<v Speaker 1>have not put this model to test, so to speak,

0:24:21.760 --> 0:24:25.280
<v Speaker 1>in in a fund yet, but you know, we're very

0:24:25.320 --> 0:24:28.560
<v Speaker 1>interested in in in doing that. I will I will

0:24:28.600 --> 0:24:31.879
<v Speaker 1>add though here that many of the back tests that

0:24:31.920 --> 0:24:36.520
<v Speaker 1>are done in the industry are done where you just run,

0:24:36.800 --> 0:24:39.080
<v Speaker 1>you know, thousands of back tests on a data set

0:24:39.119 --> 0:24:42.000
<v Speaker 1>over some time period ten years, twenty years, thirty years,

0:24:42.920 --> 0:24:46.080
<v Speaker 1>and there's no out of sample testing, meaning that they

0:24:46.080 --> 0:24:48.600
<v Speaker 1>don't then take that and then apply it to a

0:24:48.640 --> 0:24:52.440
<v Speaker 1>new data set. One thing that machine learning. One technique

0:24:52.560 --> 0:24:57.560
<v Speaker 1>that is used in machine learning to prevent overfitting, and

0:24:57.920 --> 0:25:01.080
<v Speaker 1>that we do here is we train them or we

0:25:01.200 --> 0:25:04.760
<v Speaker 1>build the model on one data set and then test

0:25:04.840 --> 0:25:07.640
<v Speaker 1>it at a sample on another on another data set

0:25:07.720 --> 0:25:10.520
<v Speaker 1>during a different time period, and the results we present

0:25:10.600 --> 0:25:14.800
<v Speaker 1>there are at a sample out of sample, always being

0:25:15.000 --> 0:25:17.399
<v Speaker 1>sort of ahead right in the future, you could. So

0:25:17.440 --> 0:25:19.560
<v Speaker 1>the model is the model at every given time. It

0:25:19.640 --> 0:25:23.800
<v Speaker 1>is trained on the path data. So we're simulating like

0:25:23.840 --> 0:25:26.200
<v Speaker 1>what if, you know, if you train the model back

0:25:26.240 --> 0:25:28.199
<v Speaker 1>then based on that it was only available up to

0:25:28.240 --> 0:25:30.960
<v Speaker 1>that point. I think more broadly, there's a good question

0:25:31.000 --> 0:25:35.160
<v Speaker 1>there of um, it's hard to say which which patterns

0:25:35.160 --> 0:25:38.359
<v Speaker 1>are just you know, especially I think with short term investing,

0:25:38.359 --> 0:25:41.439
<v Speaker 1>it's very obvious that any any pattern that exists on

0:25:41.440 --> 0:25:44.120
<v Speaker 1>a scale of seconds is something that could be sort

0:25:44.160 --> 0:25:47.960
<v Speaker 1>of traded away. It's not as clear and I believe,

0:25:48.040 --> 0:25:50.320
<v Speaker 1>I mean, John can speak more to it, right, But

0:25:50.359 --> 0:25:52.560
<v Speaker 1>I believe part of the ethos of long term investing

0:25:52.640 --> 0:25:55.960
<v Speaker 1>is very much that rather than interacting in a place

0:25:56.000 --> 0:25:59.080
<v Speaker 1>where the price most price movements are due to the

0:25:59.119 --> 0:26:03.159
<v Speaker 1>behavior of the high frequency traders, when you're in the

0:26:03.200 --> 0:26:06.199
<v Speaker 1>long term space, the price movement is more tied to

0:26:06.400 --> 0:26:09.680
<v Speaker 1>the actual fiscal performance of the company, and that's maybe

0:26:09.680 --> 0:26:13.480
<v Speaker 1>a more durable pattern. So Joe and I were talking

0:26:13.640 --> 0:26:18.480
<v Speaker 1>about financial players and how quickly they adapt to new

0:26:18.560 --> 0:26:21.800
<v Speaker 1>markets and new situations. At the beginning of this episode,

0:26:22.320 --> 0:26:26.320
<v Speaker 1>from your respective viewpoints, how fast are these sorts of

0:26:26.359 --> 0:26:30.680
<v Speaker 1>technologies and models and applications being developed, And for how

0:26:30.760 --> 0:26:35.720
<v Speaker 1>long would something like, you know, a clairvoyant factor predicting

0:26:35.800 --> 0:26:39.080
<v Speaker 1>model actually give you an edge four until someone else,

0:26:39.520 --> 0:26:41.560
<v Speaker 1>maybe an E t F came along and copied it.

0:26:42.480 --> 0:26:46.280
<v Speaker 1>I think that's a hard question to answer, because again

0:26:46.320 --> 0:26:49.280
<v Speaker 1>it gets back to how you would use this model. Right,

0:26:49.400 --> 0:26:53.080
<v Speaker 1>So in the in the paper we give one very

0:26:53.119 --> 0:26:56.560
<v Speaker 1>specific example. We use the deep learning normal network to

0:26:56.600 --> 0:27:00.240
<v Speaker 1>predict fundamentals and then we plug that into one kind

0:27:00.280 --> 0:27:05.000
<v Speaker 1>of factor model, right uh, in particular operating income predicted

0:27:05.080 --> 0:27:08.760
<v Speaker 1>operating income over enterprise value. But as I suggested, you

0:27:08.800 --> 0:27:11.840
<v Speaker 1>could use it to you know, figure you know, figure

0:27:11.880 --> 0:27:15.480
<v Speaker 1>out whether consensus forecasts are good or bad. Um. So,

0:27:16.400 --> 0:27:19.320
<v Speaker 1>you know, I think that just saying in general, deep

0:27:19.400 --> 0:27:23.640
<v Speaker 1>learning applied to you know, investing is going to get

0:27:23.720 --> 0:27:25.919
<v Speaker 1>used and then a year later is going to be

0:27:26.000 --> 0:27:30.080
<v Speaker 1>arbitraged away. Miss is the point that, look, you know,

0:27:30.320 --> 0:27:33.119
<v Speaker 1>you can use deep learning in a myriad of ways

0:27:33.320 --> 0:27:36.960
<v Speaker 1>to attack the problem of long term investing and presumably

0:27:37.160 --> 0:27:42.080
<v Speaker 1>trading as well. To address your question about how quickly

0:27:42.400 --> 0:27:45.919
<v Speaker 1>is this kind of technology getting adopted? UM my sense

0:27:46.520 --> 0:27:49.840
<v Speaker 1>and based a little bit on an outsider's view as

0:27:49.840 --> 0:27:53.879
<v Speaker 1>an academic machine learning person, um talking to collegues who

0:27:53.920 --> 0:27:56.600
<v Speaker 1>have either gone into fintech or who flirted with it

0:27:56.840 --> 0:28:00.359
<v Speaker 1>or tried to recruit me into it. The sense that

0:28:00.440 --> 0:28:03.520
<v Speaker 1>I get is that actually, obviously a lot of people

0:28:03.560 --> 0:28:06.760
<v Speaker 1>aren't talking about what they're doing, right, But my sense

0:28:06.840 --> 0:28:09.120
<v Speaker 1>is that there's a lot of people doing this kind

0:28:09.160 --> 0:28:12.600
<v Speaker 1>of stuff in the high frequency space, not maybe on

0:28:12.600 --> 0:28:14.960
<v Speaker 1>the scale of you know, fractions of seconds, but but

0:28:15.080 --> 0:28:17.720
<v Speaker 1>on on a pretty short time scale. And the reason

0:28:17.760 --> 0:28:23.680
<v Speaker 1>why is because, um, right, it's it's easy to collect

0:28:23.720 --> 0:28:27.159
<v Speaker 1>a lot of data. If the patterns are very different, um,

0:28:27.320 --> 0:28:30.480
<v Speaker 1>a year from now, Well, you just you have enough data.

0:28:30.680 --> 0:28:34.359
<v Speaker 1>Like if you're trading like at the scale of months

0:28:34.440 --> 0:28:37.160
<v Speaker 1>or years, then you have to look back twenty years, right,

0:28:37.400 --> 0:28:39.360
<v Speaker 1>you have to look back thirty If you're trading at

0:28:39.360 --> 0:28:42.480
<v Speaker 1>the scale seconds, then your whole universe could be formed

0:28:42.480 --> 0:28:45.400
<v Speaker 1>by the previous four days. There's a very fast cycle

0:28:45.440 --> 0:28:47.520
<v Speaker 1>of development. So if you're in it and you just

0:28:47.600 --> 0:28:50.080
<v Speaker 1>wanna you don't you don't have any kind of strong

0:28:50.120 --> 0:28:54.000
<v Speaker 1>beliefs about finance. You're just a machine learning person throwing

0:28:54.000 --> 0:28:57.600
<v Speaker 1>your hammer at financing. Then going in the high frequency

0:28:57.600 --> 0:29:00.800
<v Speaker 1>space gives you or are the comparatively high frequency space

0:29:01.200 --> 0:29:04.200
<v Speaker 1>gives you like the sandbacked box to just really quickly

0:29:04.360 --> 0:29:07.440
<v Speaker 1>test stuff validated, see if it works. My feeling and

0:29:07.800 --> 0:29:09.640
<v Speaker 1>when I've talked to friends who are doing this kind

0:29:09.640 --> 0:29:12.040
<v Speaker 1>of stuff, but what we're doing is that I think

0:29:12.280 --> 0:29:14.680
<v Speaker 1>almost no one that I've talked to out of a

0:29:14.680 --> 0:29:16.400
<v Speaker 1>lot of people doing this stuff with finance, is looking

0:29:16.400 --> 0:29:19.640
<v Speaker 1>at the same kinds of time scales. And John might

0:29:19.680 --> 0:29:21.920
<v Speaker 1>be able to to speak to that because he might

0:29:22.280 --> 0:29:25.640
<v Speaker 1>actually be deeper and the I mean, he's definitely differ

0:29:25.680 --> 0:29:28.280
<v Speaker 1>in the finance community than I am. But my sense

0:29:28.320 --> 0:29:30.720
<v Speaker 1>is people doing deep learning for finance, and there are many,

0:29:31.200 --> 0:29:33.600
<v Speaker 1>Um it's on the rise, but they're not necessarily looking

0:29:33.640 --> 0:29:35.800
<v Speaker 1>at it in the same way, and certainly very few

0:29:35.960 --> 0:29:38.480
<v Speaker 1>on as long a time scale. Yeah, I mean, I

0:29:38.520 --> 0:29:40.800
<v Speaker 1>think if you look, you know, the a q r

0:29:40.920 --> 0:29:43.120
<v Speaker 1>s and the d f as of the world, which

0:29:43.120 --> 0:29:46.840
<v Speaker 1>are you know, these huge you know, quantitative shops. They do.

0:29:46.960 --> 0:29:51.000
<v Speaker 1>They certainly do long term investing, but um, there's not

0:29:51.040 --> 0:29:53.560
<v Speaker 1>a lot of evidence that there's a ton of machine

0:29:53.640 --> 0:29:56.440
<v Speaker 1>learning deep learning going on there. But you know, I

0:29:56.480 --> 0:29:59.040
<v Speaker 1>think if if if stuff is a successful, you know

0:29:59.080 --> 0:30:02.320
<v Speaker 1>it's likely to be a opted, so probably won't be

0:30:02.360 --> 0:30:06.920
<v Speaker 1>true forever. John Elberg and Zachary Lipton, that was a

0:30:07.000 --> 0:30:11.440
<v Speaker 1>fascinating conversation, so much to think about and wrap our

0:30:11.440 --> 0:30:15.320
<v Speaker 1>heads around. Really appreciate you both coming on. Thanks for

0:30:15.360 --> 0:30:29.280
<v Speaker 1>having us. Thank you guys. Well, Tracy, we didn't really

0:30:29.280 --> 0:30:30.920
<v Speaker 1>plan it that way, but I really do think that

0:30:31.000 --> 0:30:35.160
<v Speaker 1>was sort of the perfect follow up to Andrew Lot left. No, Joe,

0:30:35.200 --> 0:30:37.400
<v Speaker 1>you're supposed to pretend we did plan it that way

0:30:37.440 --> 0:30:40.240
<v Speaker 1>so everyone will think we're really working so good. I mean,

0:30:40.320 --> 0:30:45.280
<v Speaker 1>like we should continue this series on quantitative strategies and

0:30:45.440 --> 0:30:48.280
<v Speaker 1>new ways to evolve to beat the market. Let's continue

0:30:48.320 --> 0:30:52.160
<v Speaker 1>this continue. Yes, absolutely, Okay, in all seriousness, Yes, it

0:30:52.240 --> 0:30:55.240
<v Speaker 1>was fascinating. I really like the idea of well, who

0:30:55.280 --> 0:30:58.520
<v Speaker 1>doesn't like the idea of a clairvoyant robot who can

0:30:58.560 --> 0:31:00.760
<v Speaker 1>predict how well a company is going to do in

0:31:00.800 --> 0:31:04.480
<v Speaker 1>the future and then apply that to a factor based

0:31:04.720 --> 0:31:08.720
<v Speaker 1>investment model. If someone comes back in time and they're

0:31:08.760 --> 0:31:10.800
<v Speaker 1>like giving me hints on what the stock market is

0:31:10.800 --> 0:31:12.800
<v Speaker 1>gonna do, it's like just give me the winning stocks.

0:31:12.840 --> 0:31:15.200
<v Speaker 1>You know what I'm saying, Like, if you're time traveling,

0:31:15.320 --> 0:31:17.880
<v Speaker 1>don't like be a tease, just give me the winning stocks. No,

0:31:18.160 --> 0:31:21.960
<v Speaker 1>But in all seriousness, hey, I felt like several times

0:31:21.960 --> 0:31:24.840
<v Speaker 1>in that conversation, it's just like the level that they're

0:31:24.880 --> 0:31:28.160
<v Speaker 1>operating and thinking about the market on is so like

0:31:28.520 --> 0:31:31.280
<v Speaker 1>high above anything that you and I like typically talk

0:31:31.360 --> 0:31:33.640
<v Speaker 1>about it on a day. Like several times I felt

0:31:33.640 --> 0:31:37.200
<v Speaker 1>like I had to catch my breath speak for yourself, Joe,

0:31:38.840 --> 0:31:41.600
<v Speaker 1>because it's just like absorbing all of that, and you know,

0:31:41.600 --> 0:31:44.280
<v Speaker 1>obviously there's probably lots that I didn't get. But then

0:31:44.320 --> 0:31:47.680
<v Speaker 1>the other thing I really thought that last point was

0:31:47.760 --> 0:31:51.280
<v Speaker 1>very interesting about time. So obviously going back to the

0:31:51.320 --> 0:31:54.400
<v Speaker 1>adaptive framework for thinking about markets. You know, if there

0:31:54.520 --> 0:31:58.480
<v Speaker 1>is a strategy that works over a day and you

0:31:58.520 --> 0:32:00.440
<v Speaker 1>can get it, and you can you just have to

0:32:00.480 --> 0:32:03.560
<v Speaker 1>back test four days or whatever, it's very easy to see, Okay,

0:32:03.560 --> 0:32:05.760
<v Speaker 1>this works, this doesn't. Let's go with the thing that

0:32:05.800 --> 0:32:07.520
<v Speaker 1>works and then everyone can sort of figure out the

0:32:07.520 --> 0:32:09.880
<v Speaker 1>things that work and then it doesn't work anymore. But

0:32:09.960 --> 0:32:13.320
<v Speaker 1>this idea that maybe a quantitative approach to long term

0:32:13.360 --> 0:32:17.120
<v Speaker 1>fundamental investing, you don't get that sort of instant feedback

0:32:17.120 --> 0:32:19.719
<v Speaker 1>on whether it's working as fast, and so maybe winning

0:32:19.760 --> 0:32:22.120
<v Speaker 1>strategies might prove to be a bit more do Yeah,

0:32:22.120 --> 0:32:25.400
<v Speaker 1>and presumably it's much more difficult to actually develop them

0:32:25.440 --> 0:32:28.239
<v Speaker 1>and see them evolved. It's like, um, I guess it's

0:32:28.320 --> 0:32:33.240
<v Speaker 1>like if you bred successive generations of like rabbits, right,

0:32:33.400 --> 0:32:36.440
<v Speaker 1>Like it takes like a year two, well less than

0:32:36.480 --> 0:32:40.400
<v Speaker 1>a year, you could breathe like a hundred generations in

0:32:40.440 --> 0:32:43.200
<v Speaker 1>a year. Yeah, right, Or I guess, like, you know,

0:32:43.320 --> 0:32:46.320
<v Speaker 1>it's like laboratories use mice and because they do, like

0:32:46.360 --> 0:32:49.000
<v Speaker 1>they can get so many so fast. But if you

0:32:49.080 --> 0:32:52.960
<v Speaker 1>had to sort of, you know, breed hippopotamus is you

0:32:52.960 --> 0:32:56.760
<v Speaker 1>wouldn't know for a much longer period of time whether

0:32:56.840 --> 0:32:59.080
<v Speaker 1>you're down the road you had sort of created the

0:32:59.120 --> 0:33:04.480
<v Speaker 1>master hip hop. Does that make sense? Wow? This past? Yeah, Okay,

0:33:04.560 --> 0:33:07.440
<v Speaker 1>let's leave it at master hippo. Okay, alright. This has

0:33:07.480 --> 0:33:11.320
<v Speaker 1>been another episode of the Odd Thoughts Podcast. I'm Tracy Alloway.

0:33:11.440 --> 0:33:13.960
<v Speaker 1>You can follow me on Twitter at Tracy Alloway and

0:33:14.000 --> 0:33:16.800
<v Speaker 1>I'm Joe Why Isn't All? You can follow me on

0:33:16.840 --> 0:33:19.880
<v Speaker 1>Twitter at the Stalwart, and you can follow our guests

0:33:19.960 --> 0:33:24.280
<v Speaker 1>on Twitter John Elberg is at at John Elbert. There's

0:33:24.320 --> 0:33:27.600
<v Speaker 1>just one L in that. Zachary Lipton is on Twitter

0:33:27.800 --> 0:33:31.800
<v Speaker 1>at Zachary Lipton, and you can follow our producer Sarah

0:33:31.800 --> 0:33:35.640
<v Speaker 1>Patterson on Twitter at Sarah pat with two teams. Thanks

0:33:35.640 --> 0:33:36.160
<v Speaker 1>for listening,