1 00:00:08,680 --> 00:00:12,799 Speaker 1: Hello, and welcome to another episode of the Odd Lots Podcast. 2 00:00:12,840 --> 00:00:16,760 Speaker 1: I'm Joe Wisenthal and I'm Tracy Alloway. Tracy, I really 3 00:00:16,800 --> 00:00:21,000 Speaker 1: liked last week's episode with Andrew Lowe talking about quant 4 00:00:21,120 --> 00:00:25,320 Speaker 1: stuff and his sort of evolution of the efficient market 5 00:00:25,400 --> 00:00:28,920 Speaker 1: hypothesis and where that might go. Yeah, I did too. 6 00:00:29,000 --> 00:00:32,839 Speaker 1: I really like the ecosystem analogy, the idea that you 7 00:00:32,840 --> 00:00:35,640 Speaker 1: have all these different players with different motivations and they're 8 00:00:35,680 --> 00:00:40,200 Speaker 1: constantly evolving and adapting to the market. That point about 9 00:00:40,200 --> 00:00:43,000 Speaker 1: it adapting, and of course that's the name of his book, 10 00:00:43,080 --> 00:00:47,360 Speaker 1: Adaptive Market is the name of his hypothesis. Is really 11 00:00:47,520 --> 00:00:50,600 Speaker 1: key because one of the main points that he made 12 00:00:50,920 --> 00:00:54,000 Speaker 1: that I loved was this idea of hedge funds as 13 00:00:54,080 --> 00:00:57,920 Speaker 1: sort of the R and D laboratory for all of 14 00:00:57,960 --> 00:01:02,160 Speaker 1: the financial industry. Right, the funds are where innovative new 15 00:01:02,240 --> 00:01:05,720 Speaker 1: techniques get to be sort of hashed out without doing 16 00:01:06,040 --> 00:01:09,280 Speaker 1: usually too much damage, I guess, to the wider ecosyst 17 00:01:09,280 --> 00:01:13,680 Speaker 1: hopefully right, not always. There's certainly examples of hedge funds 18 00:01:13,680 --> 00:01:17,919 Speaker 1: actually having done major damage from time to time. But ideally, 19 00:01:18,160 --> 00:01:22,160 Speaker 1: you know what the evolution seems to be that some 20 00:01:22,200 --> 00:01:24,759 Speaker 1: new idea sort of starts in the hedge fund world 21 00:01:25,040 --> 00:01:27,600 Speaker 1: and eventually makes its way to the broader world. I 22 00:01:27,600 --> 00:01:29,759 Speaker 1: think the most obvious example of that that we could 23 00:01:29,760 --> 00:01:32,160 Speaker 1: say these days is a lot of this sort of 24 00:01:32,520 --> 00:01:36,360 Speaker 1: popular smart beta strategies e t f s that are 25 00:01:36,360 --> 00:01:41,119 Speaker 1: built on things like momentum or value or other factors, 26 00:01:41,440 --> 00:01:44,640 Speaker 1: sort of quantitative ideas that for many years were only 27 00:01:44,680 --> 00:01:48,920 Speaker 1: available to uh, you know, researchers at hedge funds. Right. So, 28 00:01:48,960 --> 00:01:53,160 Speaker 1: these sorts of quantitative investment or trading methods were usually 29 00:01:53,240 --> 00:01:56,040 Speaker 1: the purview of sophisticated hedge funds who had the time 30 00:01:56,080 --> 00:01:58,880 Speaker 1: and resources to develop them. And then you had a 31 00:01:58,920 --> 00:02:01,120 Speaker 1: bunch of e t f s who kind of caught 32 00:02:01,160 --> 00:02:03,800 Speaker 1: on and managed to replicate them. And now we can 33 00:02:03,840 --> 00:02:08,679 Speaker 1: all trade lighthedge funds for zero percent fees, right, exactly right, 34 00:02:09,040 --> 00:02:12,280 Speaker 1: And of course, once everyone can do it for very 35 00:02:12,320 --> 00:02:15,560 Speaker 1: few fees, I think it's safe to say those strategies 36 00:02:15,560 --> 00:02:19,840 Speaker 1: aren't going to produce the same returns, and hence the 37 00:02:19,919 --> 00:02:23,560 Speaker 1: market is forced to adapt again. Right. Presumably the hedge 38 00:02:23,560 --> 00:02:26,400 Speaker 1: funds are always trying to stay one step ahead as well, right, 39 00:02:27,120 --> 00:02:30,360 Speaker 1: exactly so, which raises the idea of like what will 40 00:02:30,400 --> 00:02:34,960 Speaker 1: be the next thing. If anyone can sort of invest 41 00:02:35,000 --> 00:02:38,720 Speaker 1: in a crude momentum strategy for virtually no fees, then 42 00:02:38,760 --> 00:02:41,440 Speaker 1: that requires the people on the cutting edge, the people 43 00:02:41,520 --> 00:02:44,160 Speaker 1: doing the R and D of this industry, to uh, 44 00:02:44,240 --> 00:02:46,000 Speaker 1: you know, figure out what the next big thing it's 45 00:02:46,000 --> 00:02:48,359 Speaker 1: gonna be. Do you know what the next big thing 46 00:02:48,400 --> 00:02:50,239 Speaker 1: is going to be? Joe? Can you share it with 47 00:02:50,680 --> 00:02:55,360 Speaker 1: your fellow partner at Odd Thoughts LLC? Sadly and unfortunately 48 00:02:56,000 --> 00:02:59,760 Speaker 1: to all of the odd Lots fan, I myself do 49 00:02:59,800 --> 00:03:03,480 Speaker 1: not know what the next big thing in quantitative strategy 50 00:03:03,639 --> 00:03:06,480 Speaker 1: or sort of advanced investing is going to be. But 51 00:03:06,560 --> 00:03:09,960 Speaker 1: I'm hoping that our guests on today's episode might be 52 00:03:10,040 --> 00:03:13,960 Speaker 1: able to shed some light. Who are there? Okay, so 53 00:03:14,040 --> 00:03:17,040 Speaker 1: today we're going to be talking to John Elberg. He 54 00:03:17,160 --> 00:03:21,240 Speaker 1: is the founder of Euclidean Technologies, a quant firm, as 55 00:03:21,240 --> 00:03:24,919 Speaker 1: well as Zach Lipton, a professor at Carnegie Mellon University 56 00:03:24,960 --> 00:03:28,519 Speaker 1: in the Business School and expert on machine learning. They 57 00:03:28,639 --> 00:03:33,480 Speaker 1: recently published a paper titled Improving factor based Quantitative Investing 58 00:03:34,000 --> 00:03:39,520 Speaker 1: by Forecasting company fundamentals. So what I think that means 59 00:03:39,560 --> 00:03:41,000 Speaker 1: and we'll talk to them, is, you know, we talk 60 00:03:41,080 --> 00:03:44,800 Speaker 1: all this stuff about price and computers and algorithms figuring 61 00:03:44,840 --> 00:03:47,880 Speaker 1: out what signal we can get from price, But maybe 62 00:03:48,480 --> 00:03:51,200 Speaker 1: the next generation can actually tell us something about the 63 00:03:51,240 --> 00:03:54,640 Speaker 1: fundamental workings of the company itself, and maybe this could 64 00:03:54,680 --> 00:03:57,880 Speaker 1: be sort of the next wave of where quant investing goes. 65 00:03:58,440 --> 00:04:02,080 Speaker 1: And this sounds absolutely senating, Joe, let's bring them on. 66 00:04:12,280 --> 00:04:15,240 Speaker 1: John and Zach, thank you very much for joining us. 67 00:04:15,920 --> 00:04:19,800 Speaker 1: Was that a reasonable characterization of sort of where your 68 00:04:20,160 --> 00:04:24,080 Speaker 1: paper and where your research is taking things? Yeah, I 69 00:04:24,120 --> 00:04:27,280 Speaker 1: think it is so. So. First of all, machine learning 70 00:04:27,320 --> 00:04:30,160 Speaker 1: has been kind of on a rocket ship of innovation 71 00:04:30,240 --> 00:04:33,159 Speaker 1: for the last ten years or so, and with the 72 00:04:33,240 --> 00:04:36,800 Speaker 1: advent of deep learning, you know, computers and machine learning 73 00:04:36,800 --> 00:04:39,120 Speaker 1: have been able to do things that you know, historically 74 00:04:39,120 --> 00:04:44,400 Speaker 1: have been very challenging, like image captioning and language translation. 75 00:04:45,080 --> 00:04:47,840 Speaker 1: So we Zach and I, you know, a couple of 76 00:04:47,880 --> 00:04:52,080 Speaker 1: years back, thought of the idea of collaborating to apply 77 00:04:52,560 --> 00:04:56,480 Speaker 1: deep learning to the problem of long term investing. So 78 00:04:56,520 --> 00:04:59,240 Speaker 1: how did you actually go about doing that and what 79 00:04:59,400 --> 00:05:03,680 Speaker 1: exactly do you mean by deep learning? That's exactly what 80 00:05:03,720 --> 00:05:07,159 Speaker 1: I wanted to know. To deep learning sort of the 81 00:05:07,200 --> 00:05:10,560 Speaker 1: rebranding of neural networks research to say I say I 82 00:05:10,600 --> 00:05:13,279 Speaker 1: had some data about a company, right like I had 83 00:05:13,680 --> 00:05:16,000 Speaker 1: machine learning. We call a vector of features. But what 84 00:05:16,240 --> 00:05:18,080 Speaker 1: we mean it's just like a list of attributes, each 85 00:05:18,120 --> 00:05:20,880 Speaker 1: of which is somehow like be made into a numerical quantity, 86 00:05:21,000 --> 00:05:24,240 Speaker 1: whether it's like their income, they're the number of assets whatever. 87 00:05:24,640 --> 00:05:28,039 Speaker 1: One way of deciding how to predict what the say, 88 00:05:28,120 --> 00:05:30,360 Speaker 1: what the price will be or something, as we say, well, 89 00:05:30,600 --> 00:05:33,400 Speaker 1: we're going to have this long vector of features, and 90 00:05:33,440 --> 00:05:36,200 Speaker 1: then we're going for every single company, uh, you know, 91 00:05:36,279 --> 00:05:38,640 Speaker 1: at every single time while this vector of features corresponding 92 00:05:38,640 --> 00:05:40,360 Speaker 1: to the state of the company at some period of time, 93 00:05:40,880 --> 00:05:44,640 Speaker 1: and then we'll have some target that we want to predict. 94 00:05:44,680 --> 00:05:47,600 Speaker 1: This could be a binary quantity like will the stock 95 00:05:47,640 --> 00:05:50,280 Speaker 1: go up or down in the next you know, time 96 00:05:50,400 --> 00:05:53,320 Speaker 1: unit of your choice, whether it's the next day or 97 00:05:53,320 --> 00:05:55,400 Speaker 1: in the next month or in the next year. Or 98 00:05:55,440 --> 00:05:58,599 Speaker 1: you could try to directly predict say the relative price, 99 00:05:58,680 --> 00:06:02,520 Speaker 1: so like you know, the percent improvement or decrease based 100 00:06:02,560 --> 00:06:05,440 Speaker 1: on sort of the available features. So one of the 101 00:06:05,480 --> 00:06:07,680 Speaker 1: simplest ways you can make a model is you say, hey, 102 00:06:07,720 --> 00:06:09,120 Speaker 1: I've got a bunch of features. I'm gonna do is 103 00:06:09,120 --> 00:06:11,479 Speaker 1: I'm gonna take a weighted some of these features the 104 00:06:11,520 --> 00:06:14,000 Speaker 1: way like you'd calculate a score to see, like what's 105 00:06:14,000 --> 00:06:16,240 Speaker 1: your risk of a heart disease. Maybe you take you know, well, 106 00:06:16,240 --> 00:06:20,120 Speaker 1: four times your cholesterol plus two times your age minus 107 00:06:20,120 --> 00:06:23,440 Speaker 1: one times you know, your amount of good cholesterol is 108 00:06:23,480 --> 00:06:25,800 Speaker 1: something like this if you come up with some formula 109 00:06:25,920 --> 00:06:28,800 Speaker 1: that's expressed simply as a weighted sum, so that would 110 00:06:28,800 --> 00:06:31,719 Speaker 1: be a linear model. Where deep learning make things different 111 00:06:31,839 --> 00:06:35,479 Speaker 1: is that you have many different layers of computation that 112 00:06:35,800 --> 00:06:38,720 Speaker 1: you basically are learning very complex patterns that maybe couldn't 113 00:06:38,760 --> 00:06:41,120 Speaker 1: be expressed as a as a weighted sum. So maybe 114 00:06:41,120 --> 00:06:44,960 Speaker 1: you're uncovering interactions between all of your features. Um. So, 115 00:06:45,000 --> 00:06:48,159 Speaker 1: for example, if you want to learn to recognize a 116 00:06:48,240 --> 00:06:50,839 Speaker 1: dog versus a cat in an image, there's no weighted 117 00:06:50,880 --> 00:06:53,000 Speaker 1: sum of pixel values it's actually going to tell you 118 00:06:53,000 --> 00:06:55,359 Speaker 1: this because it's just the patterns too complicated. So in 119 00:06:55,360 --> 00:06:58,239 Speaker 1: that case you need some some more like heavy duty machinery. 120 00:06:58,360 --> 00:07:00,800 Speaker 1: So what you do in deep learning essentially is that 121 00:07:00,839 --> 00:07:05,320 Speaker 1: you learn multiple successive transformations of your data such that 122 00:07:05,480 --> 00:07:09,640 Speaker 1: after applying many such transformations, you know, could be two, 123 00:07:09,760 --> 00:07:12,480 Speaker 1: four or five, ten, whatever, you come out at the 124 00:07:12,600 --> 00:07:15,240 Speaker 1: end of a representation of your data where you actually 125 00:07:15,280 --> 00:07:17,680 Speaker 1: can learn a very simple model on top of that. 126 00:07:17,920 --> 00:07:21,280 Speaker 1: So we sometimes call deep learning representation learning because it's 127 00:07:21,320 --> 00:07:24,200 Speaker 1: what we're doing is we're both learning how to feature 128 00:07:24,200 --> 00:07:26,600 Speaker 1: eye our data essentially, how to transform it and how 129 00:07:26,600 --> 00:07:29,360 Speaker 1: to classify it at the same time. So one of 130 00:07:29,400 --> 00:07:32,720 Speaker 1: the things in sort of traditional quantity, a lot of 131 00:07:32,800 --> 00:07:37,600 Speaker 1: quantitative investing focuses a lot on price and sort of 132 00:07:37,960 --> 00:07:43,240 Speaker 1: listening to your characterization. It seems like price and this 133 00:07:43,320 --> 00:07:46,200 Speaker 1: is relatively speaking, of course, price is a fairly you know, 134 00:07:46,240 --> 00:07:49,240 Speaker 1: it's sort of easy idea to capture. So you can 135 00:07:49,280 --> 00:07:53,720 Speaker 1: come up with some definition of what momentum is and 136 00:07:53,760 --> 00:07:56,640 Speaker 1: then sort of say, okay, these stocks are experiencing momentum 137 00:07:56,760 --> 00:08:00,600 Speaker 1: right now, or these stocks aren't, and then his history 138 00:08:00,720 --> 00:08:02,560 Speaker 1: tell us the stocks are going to do next if 139 00:08:02,560 --> 00:08:07,120 Speaker 1: they sort of meet these characterizations. Your paper really looks 140 00:08:07,200 --> 00:08:10,440 Speaker 1: at what can you do with this technology for sort 141 00:08:10,440 --> 00:08:14,080 Speaker 1: of looking at future fundamental so looking at the sort 142 00:08:14,120 --> 00:08:17,080 Speaker 1: of characteristics of the company and not just trying to 143 00:08:17,560 --> 00:08:21,440 Speaker 1: see where prices going, but where those characteristics are going, 144 00:08:21,480 --> 00:08:25,960 Speaker 1: so explain sort of what your research specifically attempts to uncover. 145 00:08:26,680 --> 00:08:30,160 Speaker 1: So one thing that deep learning allows a researcher to 146 00:08:30,240 --> 00:08:35,280 Speaker 1: do is look at kind of more raw features um. 147 00:08:35,320 --> 00:08:38,520 Speaker 1: Like Zach explained in the image case, you're looking at 148 00:08:38,640 --> 00:08:42,840 Speaker 1: raw pixels. Now, if you think about most quant funds 149 00:08:43,000 --> 00:08:46,240 Speaker 1: and most quant models, they the features that go into 150 00:08:46,280 --> 00:08:49,160 Speaker 1: the model are highly engineered, and they include things like 151 00:08:49,240 --> 00:08:53,800 Speaker 1: price and maybe book value, price divided by book value, 152 00:08:53,840 --> 00:08:57,520 Speaker 1: price divided by earnings, and then maybe some momentum features. 153 00:08:58,000 --> 00:09:01,000 Speaker 1: The interesting thing about deep learning is it allows you 154 00:09:01,080 --> 00:09:05,720 Speaker 1: to potentially let it uncover what the best features are. 155 00:09:05,760 --> 00:09:09,240 Speaker 1: If you over engineer features, you may not find the 156 00:09:09,240 --> 00:09:12,040 Speaker 1: ones that are best to predict what you're interested in predicting. 157 00:09:12,600 --> 00:09:16,360 Speaker 1: So that, you know, allows you to potentially find features 158 00:09:16,360 --> 00:09:20,199 Speaker 1: in the data that you wouldn't find through which traditional 159 00:09:20,240 --> 00:09:24,640 Speaker 1: feature engineering process. Yeah, and you know, to directly address 160 00:09:24,679 --> 00:09:27,080 Speaker 1: your question, your point is that the very most obvious 161 00:09:27,080 --> 00:09:28,920 Speaker 1: thing you could say, now, if I have this, I 162 00:09:28,960 --> 00:09:31,199 Speaker 1: have this learning machine, I have a bunch of features, 163 00:09:31,200 --> 00:09:32,720 Speaker 1: and I have to choose what am I going to predict? 164 00:09:33,080 --> 00:09:35,000 Speaker 1: The very most obvious thing to try to predict is 165 00:09:35,000 --> 00:09:38,040 Speaker 1: the price, because if you can actually do that perfectly, 166 00:09:38,360 --> 00:09:41,240 Speaker 1: then you're done, right. If if you actually know which 167 00:09:41,240 --> 00:09:43,280 Speaker 1: way the price is going to move in the next year, 168 00:09:43,440 --> 00:09:46,679 Speaker 1: then you can make the perfect choice. So the problem 169 00:09:46,760 --> 00:09:49,440 Speaker 1: is that that's that's not so easy because the markets 170 00:09:49,440 --> 00:09:53,120 Speaker 1: are quite capricious, right, Um, So one problem that we 171 00:09:53,200 --> 00:09:55,280 Speaker 1: found is we actually did these models where we were 172 00:09:55,280 --> 00:09:58,280 Speaker 1: trying to predict price directly. But among the other things 173 00:09:58,280 --> 00:10:00,520 Speaker 1: that you have is that one, it's hard to learn 174 00:10:00,640 --> 00:10:03,280 Speaker 1: models that do a good job of this that are 175 00:10:03,360 --> 00:10:06,480 Speaker 1: sort of robust across different time periods. So you might 176 00:10:06,559 --> 00:10:08,600 Speaker 1: have like, hey, I'm going to train on these like 177 00:10:08,720 --> 00:10:11,080 Speaker 1: decades of data and I'm going to try to directly 178 00:10:11,160 --> 00:10:13,800 Speaker 1: predict the price. But then I come into periods of 179 00:10:13,840 --> 00:10:16,760 Speaker 1: time where the markets behaving a little bit differently, and 180 00:10:16,800 --> 00:10:20,080 Speaker 1: we call this nonstationarity. Basically, like you're modeled, there's a 181 00:10:20,080 --> 00:10:23,280 Speaker 1: great job of uncovering the pattern that's present in the 182 00:10:23,440 --> 00:10:25,760 Speaker 1: data that you gave to the model, but that data 183 00:10:25,880 --> 00:10:28,079 Speaker 1: is anchored to some period of time, and the future 184 00:10:28,160 --> 00:10:30,840 Speaker 1: data that comes in, you know, the patterns changed a 185 00:10:30,840 --> 00:10:33,680 Speaker 1: little bit, and so the kind of like function that 186 00:10:33,679 --> 00:10:35,840 Speaker 1: you've learned no longer does a great job. So so 187 00:10:35,880 --> 00:10:38,120 Speaker 1: what we do instead of directly trying to predict the price, 188 00:10:38,480 --> 00:10:40,719 Speaker 1: the idea that we had was to think, well, this 189 00:10:40,880 --> 00:10:43,880 Speaker 1: core idea behind a factor model, generally right, is to 190 00:10:43,960 --> 00:10:45,840 Speaker 1: just say, hey, I'm going to sort all the stocks 191 00:10:46,320 --> 00:10:49,360 Speaker 1: according to some reason idea, Hey, the price of the 192 00:10:49,360 --> 00:10:52,440 Speaker 1: company should be tied to its income, any company, and 193 00:10:52,520 --> 00:10:55,040 Speaker 1: somehow it is justified by like it's the long term 194 00:10:55,120 --> 00:10:58,760 Speaker 1: discounted cash as well. Let's just say a factor strategy 195 00:10:58,800 --> 00:11:00,600 Speaker 1: just something very simple. It says, well, let's just look 196 00:11:00,600 --> 00:11:03,760 Speaker 1: at the current income divided by say the current price 197 00:11:03,880 --> 00:11:06,640 Speaker 1: or current income divided by the current you know, market 198 00:11:06,640 --> 00:11:10,880 Speaker 1: cap or enterprise value, some some notion of income and 199 00:11:10,920 --> 00:11:13,880 Speaker 1: some notion notion of financial performance, and divided by some 200 00:11:13,920 --> 00:11:16,640 Speaker 1: notion of company size and this, and then I'm going 201 00:11:16,679 --> 00:11:18,640 Speaker 1: to sort the stocks according to this. The ones that 202 00:11:18,720 --> 00:11:22,200 Speaker 1: come out highest are like most cheaply priced, so let's 203 00:11:22,200 --> 00:11:25,200 Speaker 1: buy those. So the ideas to say, hey, well, what 204 00:11:25,240 --> 00:11:27,800 Speaker 1: if I told you so. We actually know that this 205 00:11:27,840 --> 00:11:29,720 Speaker 1: does pretty well in back testing whether or not the 206 00:11:29,720 --> 00:11:32,600 Speaker 1: patterns will hold in the future. But you know, many 207 00:11:32,600 --> 00:11:34,199 Speaker 1: people have made a lot of money for many years, 208 00:11:34,360 --> 00:11:36,680 Speaker 1: so there's an idea of if you knew the income, 209 00:11:36,720 --> 00:11:39,520 Speaker 1: this is a good thing, a reasonable thing to try 210 00:11:39,559 --> 00:11:43,200 Speaker 1: to do. Our question that we asked, Unfortunately, um, John, 211 00:11:43,440 --> 00:11:46,840 Speaker 1: because he's actually in finance and I'm not, has this 212 00:11:47,240 --> 00:11:50,360 Speaker 1: really great set of like industry grade tools that unlike 213 00:11:50,360 --> 00:11:52,600 Speaker 1: most academic papers that look at like one stock over 214 00:11:52,640 --> 00:11:54,280 Speaker 1: a short period of time or something, we actually had, 215 00:11:54,360 --> 00:11:57,439 Speaker 1: you know, forty plus years of financial data and can 216 00:11:57,480 --> 00:12:00,920 Speaker 1: actually simulate like an applausible that guess what's going on. 217 00:12:01,320 --> 00:12:04,720 Speaker 1: We said, well, what if you did a factor model, 218 00:12:05,000 --> 00:12:08,120 Speaker 1: but someone gave you a crystal ball. So basically, instead 219 00:12:08,120 --> 00:12:11,240 Speaker 1: of dividing the current income divided by the current enterprise value, 220 00:12:11,600 --> 00:12:15,160 Speaker 1: someone gave you next year's income, and so you sorted 221 00:12:15,160 --> 00:12:18,080 Speaker 1: the stocks according to next year's income divided by the 222 00:12:18,120 --> 00:12:22,080 Speaker 1: current enterprise value something like this. So you're you're able 223 00:12:22,120 --> 00:12:24,560 Speaker 1: to peek into the future. You know how the company 224 00:12:24,600 --> 00:12:28,160 Speaker 1: will be performing next year, and you're saying, is how 225 00:12:28,240 --> 00:12:31,360 Speaker 1: is its next year's performance? Is that based on next 226 00:12:31,440 --> 00:12:34,280 Speaker 1: year's performance? Is its current price? Is it currently priced 227 00:12:34,320 --> 00:12:36,640 Speaker 1: cheaply or not? So it's what we call like a 228 00:12:36,720 --> 00:12:39,959 Speaker 1: clairvoyant factor model. Like you don't actually have such a 229 00:12:39,960 --> 00:12:41,880 Speaker 1: crystal ball, but if you, you know, give us some 230 00:12:41,960 --> 00:12:44,360 Speaker 1: license and you imagine that you did what would have 231 00:12:44,360 --> 00:12:46,320 Speaker 1: happened if you went back in history and you had 232 00:12:46,320 --> 00:12:49,080 Speaker 1: this crystal ball and you traded based on a clairvoyant 233 00:12:49,120 --> 00:12:51,800 Speaker 1: factor model, and it turns out that the clairvoyant factor 234 00:12:51,880 --> 00:12:55,000 Speaker 1: model just crushes it. So it does really, really well 235 00:12:55,040 --> 00:12:58,640 Speaker 1: and and not surprisingly, the more clairvoyant the model is. 236 00:12:58,679 --> 00:13:01,480 Speaker 1: So if it if it knows the performance of the 237 00:13:01,480 --> 00:13:05,440 Speaker 1: company six months out versus now, or twelve months out 238 00:13:05,520 --> 00:13:08,040 Speaker 1: versus six months out, it keeps getting better and better 239 00:13:08,080 --> 00:13:12,280 Speaker 1: and better. So what we decided was, well, maybe trying 240 00:13:12,280 --> 00:13:15,160 Speaker 1: to predict price directly as a bit you know, subject 241 00:13:15,520 --> 00:13:18,720 Speaker 1: to you know, a kind of fickle market, but the 242 00:13:18,760 --> 00:13:23,280 Speaker 1: patterns present in the fundamental reporting data itself is more stable. 243 00:13:23,840 --> 00:13:25,880 Speaker 1: So in our method what we do is instead of 244 00:13:25,920 --> 00:13:28,480 Speaker 1: just trying to predict a return, we try to predict 245 00:13:28,640 --> 00:13:32,680 Speaker 1: actually the fundamental reporting data itself, just so we're given 246 00:13:33,360 --> 00:13:36,920 Speaker 1: um these these features for like a trailing window of 247 00:13:36,920 --> 00:13:42,040 Speaker 1: of time corresponding to the company's like financial reporting, and 248 00:13:42,080 --> 00:13:44,199 Speaker 1: then we're trying to predict what they're going to report 249 00:13:44,240 --> 00:13:46,600 Speaker 1: next year. And then based on what they're going we 250 00:13:46,640 --> 00:13:49,040 Speaker 1: think they're going to report next year, we sort the 251 00:13:49,040 --> 00:13:52,920 Speaker 1: companies according to a value factor. So in essence, you 252 00:13:52,960 --> 00:13:56,760 Speaker 1: can pick out of that future prediction the components of 253 00:13:56,800 --> 00:13:59,960 Speaker 1: the factor model. Let let it whether it's a few 254 00:14:00,000 --> 00:14:04,480 Speaker 1: future predicted earnings, and you can take that out of 255 00:14:04,520 --> 00:14:09,079 Speaker 1: the future predicted fundamentals, divide that by current enterprise value 256 00:14:09,679 --> 00:14:12,160 Speaker 1: and and sort and then you have basically a factor 257 00:14:12,240 --> 00:14:17,719 Speaker 1: model which you are using. Instead of trailing twelve months earnings, 258 00:14:18,120 --> 00:14:21,560 Speaker 1: you're using the future predicted earnings by the deep learning 259 00:14:21,760 --> 00:14:24,680 Speaker 1: the deep neural network. So, as I understand it, the 260 00:14:25,400 --> 00:14:28,960 Speaker 1: deep learning or the neural networks are used primarily to 261 00:14:29,000 --> 00:14:34,960 Speaker 1: forecast the future fundamentals based on historic performance. Is that right, 262 00:14:35,800 --> 00:14:40,680 Speaker 1: historic fundamentals? Yeah, okay, So walk us through how you 263 00:14:40,720 --> 00:14:45,040 Speaker 1: actually develop an application that's able to do that, Like 264 00:14:45,320 --> 00:14:48,400 Speaker 1: what are those neural networks looking at and what sort 265 00:14:48,440 --> 00:14:51,360 Speaker 1: of information are they drawing in other than you know, 266 00:14:51,440 --> 00:14:56,880 Speaker 1: past predictive data to make those forecasts. There's two parts 267 00:14:56,880 --> 00:14:59,120 Speaker 1: of that. One is the data that we use, and 268 00:14:59,120 --> 00:15:02,360 Speaker 1: then two is the technology we use to build the 269 00:15:02,400 --> 00:15:06,720 Speaker 1: deep you know, neural network models. So on the data side, 270 00:15:06,960 --> 00:15:11,680 Speaker 1: what you use is historical fundamentals on all companies you 271 00:15:11,720 --> 00:15:13,880 Speaker 1: know that have ever you know, been listed in the 272 00:15:14,000 --> 00:15:17,720 Speaker 1: US for the past fifty years, and so what a 273 00:15:17,800 --> 00:15:21,880 Speaker 1: historical fundamentals mean? What it means earnings, book value, anything 274 00:15:21,880 --> 00:15:24,080 Speaker 1: you can find on an income statement and balance sheet 275 00:15:24,120 --> 00:15:27,880 Speaker 1: going back in time. In addition to fundamentals, we also 276 00:15:28,600 --> 00:15:32,080 Speaker 1: use as inputs to the to the model, you know, 277 00:15:32,240 --> 00:15:37,080 Speaker 1: momentum over you know, one month, six months, twelve months. 278 00:15:37,120 --> 00:15:38,920 Speaker 1: So then you know, if you think of it as 279 00:15:39,000 --> 00:15:43,040 Speaker 1: like a big you know, spreadsheet table where each row 280 00:15:43,600 --> 00:15:47,760 Speaker 1: is a point in time for a specific company, and 281 00:15:47,800 --> 00:15:52,120 Speaker 1: then you can think of sequences going back through time. 282 00:15:52,400 --> 00:15:57,400 Speaker 1: You know IBM in March of and then all of 283 00:15:57,440 --> 00:16:00,720 Speaker 1: its fundamentals in one row, plus it's moment at them, 284 00:16:00,760 --> 00:16:03,920 Speaker 1: and then that going back five years and time. So 285 00:16:03,920 --> 00:16:07,880 Speaker 1: those sequences, both the fundamentals and the momentum are fed 286 00:16:08,080 --> 00:16:12,680 Speaker 1: into a neural network and uh and and all of 287 00:16:12,720 --> 00:16:15,840 Speaker 1: those sequences for all companies and all time are fed 288 00:16:15,880 --> 00:16:19,840 Speaker 1: into a neural network and are trained to predict what 289 00:16:20,000 --> 00:16:22,800 Speaker 1: the fundamentals will be, you know, one time step out 290 00:16:22,840 --> 00:16:36,440 Speaker 1: in the future. So just to sort of summarize it 291 00:16:36,480 --> 00:16:39,360 Speaker 1: all up, you know, it's like, if you have all 292 00:16:39,400 --> 00:16:43,440 Speaker 1: these strategies, if you have all these funds chasing things 293 00:16:43,480 --> 00:16:49,040 Speaker 1: like earnings, quality, earnings, growth, momentum, all kinds of stuff 294 00:16:49,080 --> 00:16:53,600 Speaker 1: like that, your goal is to anticipate today when those 295 00:16:53,640 --> 00:16:56,160 Speaker 1: funds are going to be buying tomorrow. Is that a 296 00:16:56,160 --> 00:16:58,320 Speaker 1: fair way to characterize it. I think that's a fair 297 00:16:58,440 --> 00:17:01,280 Speaker 1: way to characterize is it. I think what we're really 298 00:17:01,320 --> 00:17:03,720 Speaker 1: just doing is trying to build a better a better 299 00:17:03,760 --> 00:17:06,840 Speaker 1: factor model, A better factor model in the sense that 300 00:17:06,880 --> 00:17:09,560 Speaker 1: you know, as Zach explained, if you had a clairvoyant 301 00:17:09,560 --> 00:17:13,679 Speaker 1: model where you actually knew what future fundamentals were and 302 00:17:13,760 --> 00:17:16,680 Speaker 1: could plug that into a factor model, you do substantially 303 00:17:16,760 --> 00:17:21,399 Speaker 1: better than what you could achieve with a value factor model. Today, 304 00:17:21,840 --> 00:17:25,040 Speaker 1: we're not like directly considering the psychology of the other 305 00:17:25,040 --> 00:17:28,440 Speaker 1: players in the market in this particular approach, right, No, sure, 306 00:17:28,920 --> 00:17:32,920 Speaker 1: but it's essentially saying, like, maybe the way to characterize 307 00:17:32,920 --> 00:17:36,600 Speaker 1: it is, if you want to invest on some fundamental 308 00:17:36,680 --> 00:17:40,680 Speaker 1: factor like earnings quality or earnings growth, bottom line, is 309 00:17:41,320 --> 00:17:44,679 Speaker 1: better to look at future twelve month results rather than 310 00:17:44,720 --> 00:17:48,840 Speaker 1: trailing twelve months. You look at the trailing, but you're 311 00:17:48,960 --> 00:17:51,959 Speaker 1: trying to predict the future. Like, so those two components, right, 312 00:17:52,000 --> 00:17:54,720 Speaker 1: you could say, like one is we have the component 313 00:17:54,800 --> 00:17:58,080 Speaker 1: that is trying to predict the future fundamentals. You know, 314 00:17:58,119 --> 00:18:01,520 Speaker 1: imagine that I came for the future, and I got 315 00:18:01,520 --> 00:18:03,919 Speaker 1: out of my time machine, and I gave you the 316 00:18:03,960 --> 00:18:07,320 Speaker 1: earnings reports from the future. Right, So so the first 317 00:18:07,320 --> 00:18:09,520 Speaker 1: thing you need is how do I get an approximate 318 00:18:09,600 --> 00:18:12,200 Speaker 1: time machine, right, which in our case is a predictive 319 00:18:12,200 --> 00:18:14,440 Speaker 1: model that has a good guess about what the future 320 00:18:14,440 --> 00:18:17,040 Speaker 1: will look like. The second thing is you still need 321 00:18:17,080 --> 00:18:20,080 Speaker 1: a way of executing on the strategy ones I. You 322 00:18:20,119 --> 00:18:23,560 Speaker 1: still need a way to decide which stocks to buy, right, So, 323 00:18:24,119 --> 00:18:27,520 Speaker 1: based on based on this future information, Like, it's possible 324 00:18:27,560 --> 00:18:29,359 Speaker 1: that if I if I come from the future and 325 00:18:29,400 --> 00:18:32,160 Speaker 1: I give you the earnings report, and I tell you 326 00:18:32,200 --> 00:18:35,199 Speaker 1: what the future income will be, what, it's possible that 327 00:18:35,200 --> 00:18:36,600 Speaker 1: the income is going to go up with the stock 328 00:18:36,640 --> 00:18:38,960 Speaker 1: price is going to go down, you know, like say 329 00:18:39,119 --> 00:18:41,720 Speaker 1: it's an Apple and like they made a lot more money, 330 00:18:41,760 --> 00:18:44,600 Speaker 1: but it was also like announced that they had a 331 00:18:44,640 --> 00:18:48,920 Speaker 1: major plant failure in the iPhone fourteen or whatever they're 332 00:18:49,000 --> 00:18:52,040 Speaker 1: up to is going to be delayed. So these two 333 00:18:52,040 --> 00:18:54,640 Speaker 1: components are are a little bit modular, Like we could 334 00:18:54,640 --> 00:18:57,320 Speaker 1: come up with m. John I think is more the 335 00:18:57,359 --> 00:18:59,680 Speaker 1: domain expert, so I'm I'm more the machine learning guy. 336 00:18:59,760 --> 00:19:02,119 Speaker 1: Like I'm sure John could come up with you know, 337 00:19:02,400 --> 00:19:04,920 Speaker 1: a million other ways that you might imagine that someone 338 00:19:04,960 --> 00:19:08,000 Speaker 1: would try to execute on this information. In our case, 339 00:19:08,040 --> 00:19:10,320 Speaker 1: what we're doing is we've adapted a factor model to 340 00:19:10,560 --> 00:19:14,840 Speaker 1: work with this kind of future guess. So one other example, 341 00:19:15,400 --> 00:19:17,439 Speaker 1: so so again, in our case, what we're doing is 342 00:19:17,480 --> 00:19:20,879 Speaker 1: taking the predicted future fundamentals and feeding that into a 343 00:19:20,960 --> 00:19:25,440 Speaker 1: value factor model. But you could imagine using let's say 344 00:19:25,600 --> 00:19:29,480 Speaker 1: the deep neural networks said, you know, a company is 345 00:19:29,480 --> 00:19:33,080 Speaker 1: going to do a hundred million, but consensus estimates in 346 00:19:33,080 --> 00:19:36,560 Speaker 1: in in earnings. Let's say, but consensus estimates said it's 347 00:19:36,560 --> 00:19:41,359 Speaker 1: gonna do seventy five million in in earnings. Well, you 348 00:19:41,400 --> 00:19:43,640 Speaker 1: know that might be you could you can imagine devising 349 00:19:43,640 --> 00:19:46,280 Speaker 1: a strategy around that where you'd want to go, you know, 350 00:19:46,359 --> 00:19:49,919 Speaker 1: bet on those guys and ones where consensus estimates are 351 00:19:49,960 --> 00:19:53,120 Speaker 1: above what the deep neural network is predicting, you'd want 352 00:19:53,119 --> 00:19:56,679 Speaker 1: to bet against. Right, assuming the current price is pricing 353 00:19:56,760 --> 00:19:59,520 Speaker 1: and that that's a really you know, John, you shouldnt 354 00:19:59,520 --> 00:20:03,040 Speaker 1: give away so goods, that that's a really good idea. 355 00:20:03,920 --> 00:20:08,800 Speaker 1: So are these kinds of machine learning driven predictive models 356 00:20:09,080 --> 00:20:11,320 Speaker 1: the future of investing? You think is that the way 357 00:20:11,320 --> 00:20:14,199 Speaker 1: that we're heading. I think what this paper showed is 358 00:20:14,240 --> 00:20:17,560 Speaker 1: that there's a lot of potential in using deep learning 359 00:20:17,680 --> 00:20:21,400 Speaker 1: to long term investing. I think that there's been some 360 00:20:21,480 --> 00:20:25,520 Speaker 1: debate about whether, you know, deep learning, which requires a 361 00:20:25,560 --> 00:20:29,760 Speaker 1: lot of data um to to to build successful models, 362 00:20:30,400 --> 00:20:34,320 Speaker 1: um whether in finance there's enough data, or whether you 363 00:20:34,359 --> 00:20:38,680 Speaker 1: even need this these kinds of complex models in finance, 364 00:20:38,720 --> 00:20:40,960 Speaker 1: I mean a lot of quant people feel, you know, 365 00:20:41,119 --> 00:20:44,679 Speaker 1: linear simple factor models are the best route to go, 366 00:20:45,520 --> 00:20:47,919 Speaker 1: And I think what we showed here is that if 367 00:20:47,920 --> 00:20:51,080 Speaker 1: you're trying to predict price changes, that might be true. 368 00:20:51,760 --> 00:20:55,800 Speaker 1: But if you decompose the problem into first trying to 369 00:20:55,840 --> 00:20:59,040 Speaker 1: predict fundamentals and then later you know, through a factor 370 00:20:59,119 --> 00:21:01,920 Speaker 1: model or some other method, trying to use those predicted 371 00:21:01,920 --> 00:21:06,720 Speaker 1: fundamentals to predict price, deep learning has a lot of 372 00:21:06,720 --> 00:21:12,200 Speaker 1: potential and does does substantially better at predicting future fundamentals 373 00:21:12,200 --> 00:21:14,280 Speaker 1: than than what you could do with a linear model. 374 00:21:14,600 --> 00:21:17,560 Speaker 1: There's a sort of a technical reason to recommend the 375 00:21:17,600 --> 00:21:20,760 Speaker 1: way we've cast a problem also without going too far 376 00:21:20,760 --> 00:21:24,520 Speaker 1: into the weeds. Basically, uh, you think really really powerful 377 00:21:24,600 --> 00:21:27,639 Speaker 1: machine learning models, deep neural networks. The thing that you 378 00:21:27,720 --> 00:21:29,879 Speaker 1: worried about is John was talking about how people people 379 00:21:29,960 --> 00:21:32,240 Speaker 1: agonize over what can you bring us to bear on 380 00:21:32,280 --> 00:21:34,520 Speaker 1: long term investing because you don't have as much data 381 00:21:34,960 --> 00:21:38,119 Speaker 1: right as if you were looking at the you know, 382 00:21:38,400 --> 00:21:42,679 Speaker 1: micro second kind of trade frequency, then you'd have, you know, 383 00:21:42,960 --> 00:21:45,399 Speaker 1: trillions of trade examples or something you get on. But 384 00:21:45,800 --> 00:21:47,760 Speaker 1: if you if you're looking at you know, your your 385 00:21:47,760 --> 00:21:49,879 Speaker 1: time tick is I have a data point you know, 386 00:21:49,960 --> 00:21:53,000 Speaker 1: once per month or once per year suddenly, and I 387 00:21:53,040 --> 00:21:56,439 Speaker 1: only have thousands of stocks, not millions of stocks. You 388 00:21:56,480 --> 00:22:00,199 Speaker 1: don't have such a huge amount of data. Um, So 389 00:22:00,840 --> 00:22:03,080 Speaker 1: what you worry about is that a model given given 390 00:22:03,119 --> 00:22:05,919 Speaker 1: a super powerful model, like a super overpowered model, and 391 00:22:05,960 --> 00:22:08,399 Speaker 1: then not too much data, that there's a propensity for 392 00:22:08,440 --> 00:22:10,840 Speaker 1: the models to do what we call overfitting, which is 393 00:22:10,880 --> 00:22:13,000 Speaker 1: the model basically it does a really good job of 394 00:22:13,040 --> 00:22:16,600 Speaker 1: memorizing the training data it's seen, but it learns kind 395 00:22:16,600 --> 00:22:19,639 Speaker 1: of a spurious pattern that doesn't generalize to future data 396 00:22:19,680 --> 00:22:23,040 Speaker 1: that it hasn't seen. So one cool thing about the 397 00:22:23,080 --> 00:22:25,800 Speaker 1: way that we're casting the problem is that we're not 398 00:22:25,880 --> 00:22:28,600 Speaker 1: just trying to predict the factor of interest. We're actually 399 00:22:28,600 --> 00:22:31,119 Speaker 1: trying to predict all the factors in the future. And 400 00:22:31,160 --> 00:22:35,159 Speaker 1: this means that the model has to simultaneously get the 401 00:22:35,200 --> 00:22:37,679 Speaker 1: income right, and get the assets right, and get the 402 00:22:37,720 --> 00:22:40,600 Speaker 1: debt right, and get all these different factors that are available. 403 00:22:40,600 --> 00:22:43,239 Speaker 1: So John was a fifteen target factors that we have 404 00:22:43,359 --> 00:22:46,960 Speaker 1: that we're trying to predict. So so in this case, 405 00:22:47,200 --> 00:22:48,680 Speaker 1: this sort of like this is this is what we 406 00:22:48,720 --> 00:22:51,960 Speaker 1: call multitask learning and the machine learning literature. And one 407 00:22:52,080 --> 00:22:55,320 Speaker 1: nice effect of multitask learning as that has a generalization 408 00:22:56,119 --> 00:22:58,520 Speaker 1: effect in that it's it's harder to fit a spurious 409 00:22:58,560 --> 00:23:00,960 Speaker 1: hypothesis because you have to come up with a representation 410 00:23:01,080 --> 00:23:03,679 Speaker 1: that is good for task one and also good for 411 00:23:03,720 --> 00:23:06,199 Speaker 1: task two, and also good for tast three. And the 412 00:23:06,240 --> 00:23:10,040 Speaker 1: probability that you come up with a pattern that's that's 413 00:23:10,080 --> 00:23:12,479 Speaker 1: good for solving all of these tasks that is not 414 00:23:12,600 --> 00:23:15,399 Speaker 1: the true pattern is much smaller than if you're only 415 00:23:15,520 --> 00:23:18,960 Speaker 1: like trying to solve one task, where it's easier to 416 00:23:19,000 --> 00:23:21,280 Speaker 1: just kind of memorize those data points. So we have 417 00:23:21,320 --> 00:23:24,960 Speaker 1: like essentially sixteen times as much training data and in 418 00:23:25,040 --> 00:23:29,400 Speaker 1: some relevant sense, so I have to ask in the 419 00:23:29,560 --> 00:23:35,879 Speaker 1: abstract of your paper or in the intro, you say that, um, 420 00:23:36,040 --> 00:23:40,399 Speaker 1: with this approach, you can improve your annual returns pretty 421 00:23:40,400 --> 00:23:44,240 Speaker 1: substantially over a standard factor model. In a bad test. 422 00:23:44,280 --> 00:23:48,280 Speaker 1: Seventeen point one percent versus a fourteen point four percent, 423 00:23:48,880 --> 00:23:52,119 Speaker 1: just pretty big beat. But as we know, and as 424 00:23:52,160 --> 00:23:54,000 Speaker 1: there's a lot of people pointed out, there's a lot 425 00:23:54,040 --> 00:23:57,840 Speaker 1: of strategies that seem to work in academic papers and 426 00:23:57,880 --> 00:24:01,639 Speaker 1: then when they're put into pract is, they don't seem 427 00:24:01,880 --> 00:24:06,080 Speaker 1: the results don't seem to arrive as easily. John, in 428 00:24:06,200 --> 00:24:09,800 Speaker 1: your firm, are you seeing the results of your research 429 00:24:10,280 --> 00:24:12,600 Speaker 1: that on paper look very compelling actually play out in 430 00:24:12,600 --> 00:24:17,280 Speaker 1: the market. So so this this paper, uh, we we 431 00:24:17,359 --> 00:24:21,280 Speaker 1: have not put this model to test, so to speak, 432 00:24:21,760 --> 00:24:25,280 Speaker 1: in in a fund yet, but you know, we're very 433 00:24:25,320 --> 00:24:28,560 Speaker 1: interested in in in doing that. I will I will 434 00:24:28,600 --> 00:24:31,879 Speaker 1: add though here that many of the back tests that 435 00:24:31,920 --> 00:24:36,520 Speaker 1: are done in the industry are done where you just run, 436 00:24:36,800 --> 00:24:39,080 Speaker 1: you know, thousands of back tests on a data set 437 00:24:39,119 --> 00:24:42,000 Speaker 1: over some time period ten years, twenty years, thirty years, 438 00:24:42,920 --> 00:24:46,080 Speaker 1: and there's no out of sample testing, meaning that they 439 00:24:46,080 --> 00:24:48,600 Speaker 1: don't then take that and then apply it to a 440 00:24:48,640 --> 00:24:52,440 Speaker 1: new data set. One thing that machine learning. One technique 441 00:24:52,560 --> 00:24:57,560 Speaker 1: that is used in machine learning to prevent overfitting, and 442 00:24:57,920 --> 00:25:01,080 Speaker 1: that we do here is we train them or we 443 00:25:01,200 --> 00:25:04,760 Speaker 1: build the model on one data set and then test 444 00:25:04,840 --> 00:25:07,640 Speaker 1: it at a sample on another on another data set 445 00:25:07,720 --> 00:25:10,520 Speaker 1: during a different time period, and the results we present 446 00:25:10,600 --> 00:25:14,800 Speaker 1: there are at a sample out of sample, always being 447 00:25:15,000 --> 00:25:17,399 Speaker 1: sort of ahead right in the future, you could. So 448 00:25:17,440 --> 00:25:19,560 Speaker 1: the model is the model at every given time. It 449 00:25:19,640 --> 00:25:23,800 Speaker 1: is trained on the path data. So we're simulating like 450 00:25:23,840 --> 00:25:26,200 Speaker 1: what if, you know, if you train the model back 451 00:25:26,240 --> 00:25:28,199 Speaker 1: then based on that it was only available up to 452 00:25:28,240 --> 00:25:30,960 Speaker 1: that point. I think more broadly, there's a good question 453 00:25:31,000 --> 00:25:35,160 Speaker 1: there of um, it's hard to say which which patterns 454 00:25:35,160 --> 00:25:38,359 Speaker 1: are just you know, especially I think with short term investing, 455 00:25:38,359 --> 00:25:41,439 Speaker 1: it's very obvious that any any pattern that exists on 456 00:25:41,440 --> 00:25:44,120 Speaker 1: a scale of seconds is something that could be sort 457 00:25:44,160 --> 00:25:47,960 Speaker 1: of traded away. It's not as clear and I believe, 458 00:25:48,040 --> 00:25:50,320 Speaker 1: I mean, John can speak more to it, right, But 459 00:25:50,359 --> 00:25:52,560 Speaker 1: I believe part of the ethos of long term investing 460 00:25:52,640 --> 00:25:55,960 Speaker 1: is very much that rather than interacting in a place 461 00:25:56,000 --> 00:25:59,080 Speaker 1: where the price most price movements are due to the 462 00:25:59,119 --> 00:26:03,159 Speaker 1: behavior of the high frequency traders, when you're in the 463 00:26:03,200 --> 00:26:06,199 Speaker 1: long term space, the price movement is more tied to 464 00:26:06,400 --> 00:26:09,680 Speaker 1: the actual fiscal performance of the company, and that's maybe 465 00:26:09,680 --> 00:26:13,480 Speaker 1: a more durable pattern. So Joe and I were talking 466 00:26:13,640 --> 00:26:18,480 Speaker 1: about financial players and how quickly they adapt to new 467 00:26:18,560 --> 00:26:21,800 Speaker 1: markets and new situations. At the beginning of this episode, 468 00:26:22,320 --> 00:26:26,320 Speaker 1: from your respective viewpoints, how fast are these sorts of 469 00:26:26,359 --> 00:26:30,680 Speaker 1: technologies and models and applications being developed, And for how 470 00:26:30,760 --> 00:26:35,720 Speaker 1: long would something like, you know, a clairvoyant factor predicting 471 00:26:35,800 --> 00:26:39,080 Speaker 1: model actually give you an edge four until someone else, 472 00:26:39,520 --> 00:26:41,560 Speaker 1: maybe an E t F came along and copied it. 473 00:26:42,480 --> 00:26:46,280 Speaker 1: I think that's a hard question to answer, because again 474 00:26:46,320 --> 00:26:49,280 Speaker 1: it gets back to how you would use this model. Right, 475 00:26:49,400 --> 00:26:53,080 Speaker 1: So in the in the paper we give one very 476 00:26:53,119 --> 00:26:56,560 Speaker 1: specific example. We use the deep learning normal network to 477 00:26:56,600 --> 00:27:00,240 Speaker 1: predict fundamentals and then we plug that into one kind 478 00:27:00,280 --> 00:27:05,000 Speaker 1: of factor model, right uh, in particular operating income predicted 479 00:27:05,080 --> 00:27:08,760 Speaker 1: operating income over enterprise value. But as I suggested, you 480 00:27:08,800 --> 00:27:11,840 Speaker 1: could use it to you know, figure you know, figure 481 00:27:11,880 --> 00:27:15,480 Speaker 1: out whether consensus forecasts are good or bad. Um. So, 482 00:27:16,400 --> 00:27:19,320 Speaker 1: you know, I think that just saying in general, deep 483 00:27:19,400 --> 00:27:23,640 Speaker 1: learning applied to you know, investing is going to get 484 00:27:23,720 --> 00:27:25,919 Speaker 1: used and then a year later is going to be 485 00:27:26,000 --> 00:27:30,080 Speaker 1: arbitraged away. Miss is the point that, look, you know, 486 00:27:30,320 --> 00:27:33,119 Speaker 1: you can use deep learning in a myriad of ways 487 00:27:33,320 --> 00:27:36,960 Speaker 1: to attack the problem of long term investing and presumably 488 00:27:37,160 --> 00:27:42,080 Speaker 1: trading as well. To address your question about how quickly 489 00:27:42,400 --> 00:27:45,919 Speaker 1: is this kind of technology getting adopted? UM my sense 490 00:27:46,520 --> 00:27:49,840 Speaker 1: and based a little bit on an outsider's view as 491 00:27:49,840 --> 00:27:53,879 Speaker 1: an academic machine learning person, um talking to collegues who 492 00:27:53,920 --> 00:27:56,600 Speaker 1: have either gone into fintech or who flirted with it 493 00:27:56,840 --> 00:28:00,359 Speaker 1: or tried to recruit me into it. The sense that 494 00:28:00,440 --> 00:28:03,520 Speaker 1: I get is that actually, obviously a lot of people 495 00:28:03,560 --> 00:28:06,760 Speaker 1: aren't talking about what they're doing, right, But my sense 496 00:28:06,840 --> 00:28:09,120 Speaker 1: is that there's a lot of people doing this kind 497 00:28:09,160 --> 00:28:12,600 Speaker 1: of stuff in the high frequency space, not maybe on 498 00:28:12,600 --> 00:28:14,960 Speaker 1: the scale of you know, fractions of seconds, but but 499 00:28:15,080 --> 00:28:17,720 Speaker 1: on on a pretty short time scale. And the reason 500 00:28:17,760 --> 00:28:23,680 Speaker 1: why is because, um, right, it's it's easy to collect 501 00:28:23,720 --> 00:28:27,159 Speaker 1: a lot of data. If the patterns are very different, um, 502 00:28:27,320 --> 00:28:30,480 Speaker 1: a year from now, Well, you just you have enough data. 503 00:28:30,680 --> 00:28:34,359 Speaker 1: Like if you're trading like at the scale of months 504 00:28:34,440 --> 00:28:37,160 Speaker 1: or years, then you have to look back twenty years, right, 505 00:28:37,400 --> 00:28:39,360 Speaker 1: you have to look back thirty If you're trading at 506 00:28:39,360 --> 00:28:42,480 Speaker 1: the scale seconds, then your whole universe could be formed 507 00:28:42,480 --> 00:28:45,400 Speaker 1: by the previous four days. There's a very fast cycle 508 00:28:45,440 --> 00:28:47,520 Speaker 1: of development. So if you're in it and you just 509 00:28:47,600 --> 00:28:50,080 Speaker 1: wanna you don't you don't have any kind of strong 510 00:28:50,120 --> 00:28:54,000 Speaker 1: beliefs about finance. You're just a machine learning person throwing 511 00:28:54,000 --> 00:28:57,600 Speaker 1: your hammer at financing. Then going in the high frequency 512 00:28:57,600 --> 00:29:00,800 Speaker 1: space gives you or are the comparatively high frequency space 513 00:29:01,200 --> 00:29:04,200 Speaker 1: gives you like the sandbacked box to just really quickly 514 00:29:04,360 --> 00:29:07,440 Speaker 1: test stuff validated, see if it works. My feeling and 515 00:29:07,800 --> 00:29:09,640 Speaker 1: when I've talked to friends who are doing this kind 516 00:29:09,640 --> 00:29:12,040 Speaker 1: of stuff, but what we're doing is that I think 517 00:29:12,280 --> 00:29:14,680 Speaker 1: almost no one that I've talked to out of a 518 00:29:14,680 --> 00:29:16,400 Speaker 1: lot of people doing this stuff with finance, is looking 519 00:29:16,400 --> 00:29:19,640 Speaker 1: at the same kinds of time scales. And John might 520 00:29:19,680 --> 00:29:21,920 Speaker 1: be able to to speak to that because he might 521 00:29:22,280 --> 00:29:25,640 Speaker 1: actually be deeper and the I mean, he's definitely differ 522 00:29:25,680 --> 00:29:28,280 Speaker 1: in the finance community than I am. But my sense 523 00:29:28,320 --> 00:29:30,720 Speaker 1: is people doing deep learning for finance, and there are many, 524 00:29:31,200 --> 00:29:33,600 Speaker 1: Um it's on the rise, but they're not necessarily looking 525 00:29:33,640 --> 00:29:35,800 Speaker 1: at it in the same way, and certainly very few 526 00:29:35,960 --> 00:29:38,480 Speaker 1: on as long a time scale. Yeah, I mean, I 527 00:29:38,520 --> 00:29:40,800 Speaker 1: think if you look, you know, the a q r 528 00:29:40,920 --> 00:29:43,120 Speaker 1: s and the d f as of the world, which 529 00:29:43,120 --> 00:29:46,840 Speaker 1: are you know, these huge you know, quantitative shops. They do. 530 00:29:46,960 --> 00:29:51,000 Speaker 1: They certainly do long term investing, but um, there's not 531 00:29:51,040 --> 00:29:53,560 Speaker 1: a lot of evidence that there's a ton of machine 532 00:29:53,640 --> 00:29:56,440 Speaker 1: learning deep learning going on there. But you know, I 533 00:29:56,480 --> 00:29:59,040 Speaker 1: think if if if stuff is a successful, you know 534 00:29:59,080 --> 00:30:02,320 Speaker 1: it's likely to be a opted, so probably won't be 535 00:30:02,360 --> 00:30:06,920 Speaker 1: true forever. John Elberg and Zachary Lipton, that was a 536 00:30:07,000 --> 00:30:11,440 Speaker 1: fascinating conversation, so much to think about and wrap our 537 00:30:11,440 --> 00:30:15,320 Speaker 1: heads around. Really appreciate you both coming on. Thanks for 538 00:30:15,360 --> 00:30:29,280 Speaker 1: having us. Thank you guys. Well, Tracy, we didn't really 539 00:30:29,280 --> 00:30:30,920 Speaker 1: plan it that way, but I really do think that 540 00:30:31,000 --> 00:30:35,160 Speaker 1: was sort of the perfect follow up to Andrew Lot left. No, Joe, 541 00:30:35,200 --> 00:30:37,400 Speaker 1: you're supposed to pretend we did plan it that way 542 00:30:37,440 --> 00:30:40,240 Speaker 1: so everyone will think we're really working so good. I mean, 543 00:30:40,320 --> 00:30:45,280 Speaker 1: like we should continue this series on quantitative strategies and 544 00:30:45,440 --> 00:30:48,280 Speaker 1: new ways to evolve to beat the market. Let's continue 545 00:30:48,320 --> 00:30:52,160 Speaker 1: this continue. Yes, absolutely, Okay, in all seriousness, Yes, it 546 00:30:52,240 --> 00:30:55,240 Speaker 1: was fascinating. I really like the idea of well, who 547 00:30:55,280 --> 00:30:58,520 Speaker 1: doesn't like the idea of a clairvoyant robot who can 548 00:30:58,560 --> 00:31:00,760 Speaker 1: predict how well a company is going to do in 549 00:31:00,800 --> 00:31:04,480 Speaker 1: the future and then apply that to a factor based 550 00:31:04,720 --> 00:31:08,720 Speaker 1: investment model. If someone comes back in time and they're 551 00:31:08,760 --> 00:31:10,800 Speaker 1: like giving me hints on what the stock market is 552 00:31:10,800 --> 00:31:12,800 Speaker 1: gonna do, it's like just give me the winning stocks. 553 00:31:12,840 --> 00:31:15,200 Speaker 1: You know what I'm saying, Like, if you're time traveling, 554 00:31:15,320 --> 00:31:17,880 Speaker 1: don't like be a tease, just give me the winning stocks. No, 555 00:31:18,160 --> 00:31:21,960 Speaker 1: But in all seriousness, hey, I felt like several times 556 00:31:21,960 --> 00:31:24,840 Speaker 1: in that conversation, it's just like the level that they're 557 00:31:24,880 --> 00:31:28,160 Speaker 1: operating and thinking about the market on is so like 558 00:31:28,520 --> 00:31:31,280 Speaker 1: high above anything that you and I like typically talk 559 00:31:31,360 --> 00:31:33,640 Speaker 1: about it on a day. Like several times I felt 560 00:31:33,640 --> 00:31:37,200 Speaker 1: like I had to catch my breath speak for yourself, Joe, 561 00:31:38,840 --> 00:31:41,600 Speaker 1: because it's just like absorbing all of that, and you know, 562 00:31:41,600 --> 00:31:44,280 Speaker 1: obviously there's probably lots that I didn't get. But then 563 00:31:44,320 --> 00:31:47,680 Speaker 1: the other thing I really thought that last point was 564 00:31:47,760 --> 00:31:51,280 Speaker 1: very interesting about time. So obviously going back to the 565 00:31:51,320 --> 00:31:54,400 Speaker 1: adaptive framework for thinking about markets. You know, if there 566 00:31:54,520 --> 00:31:58,480 Speaker 1: is a strategy that works over a day and you 567 00:31:58,520 --> 00:32:00,440 Speaker 1: can get it, and you can you just have to 568 00:32:00,480 --> 00:32:03,560 Speaker 1: back test four days or whatever, it's very easy to see, Okay, 569 00:32:03,560 --> 00:32:05,760 Speaker 1: this works, this doesn't. Let's go with the thing that 570 00:32:05,800 --> 00:32:07,520 Speaker 1: works and then everyone can sort of figure out the 571 00:32:07,520 --> 00:32:09,880 Speaker 1: things that work and then it doesn't work anymore. But 572 00:32:09,960 --> 00:32:13,320 Speaker 1: this idea that maybe a quantitative approach to long term 573 00:32:13,360 --> 00:32:17,120 Speaker 1: fundamental investing, you don't get that sort of instant feedback 574 00:32:17,120 --> 00:32:19,719 Speaker 1: on whether it's working as fast, and so maybe winning 575 00:32:19,760 --> 00:32:22,120 Speaker 1: strategies might prove to be a bit more do Yeah, 576 00:32:22,120 --> 00:32:25,400 Speaker 1: and presumably it's much more difficult to actually develop them 577 00:32:25,440 --> 00:32:28,239 Speaker 1: and see them evolved. It's like, um, I guess it's 578 00:32:28,320 --> 00:32:33,240 Speaker 1: like if you bred successive generations of like rabbits, right, 579 00:32:33,400 --> 00:32:36,440 Speaker 1: Like it takes like a year two, well less than 580 00:32:36,480 --> 00:32:40,400 Speaker 1: a year, you could breathe like a hundred generations in 581 00:32:40,440 --> 00:32:43,200 Speaker 1: a year. Yeah, right, Or I guess, like, you know, 582 00:32:43,320 --> 00:32:46,320 Speaker 1: it's like laboratories use mice and because they do, like 583 00:32:46,360 --> 00:32:49,000 Speaker 1: they can get so many so fast. But if you 584 00:32:49,080 --> 00:32:52,960 Speaker 1: had to sort of, you know, breed hippopotamus is you 585 00:32:52,960 --> 00:32:56,760 Speaker 1: wouldn't know for a much longer period of time whether 586 00:32:56,840 --> 00:32:59,080 Speaker 1: you're down the road you had sort of created the 587 00:32:59,120 --> 00:33:04,480 Speaker 1: master hip hop. Does that make sense? Wow? This past? Yeah, Okay, 588 00:33:04,560 --> 00:33:07,440 Speaker 1: let's leave it at master hippo. Okay, alright. This has 589 00:33:07,480 --> 00:33:11,320 Speaker 1: been another episode of the Odd Thoughts Podcast. I'm Tracy Alloway. 590 00:33:11,440 --> 00:33:13,960 Speaker 1: You can follow me on Twitter at Tracy Alloway and 591 00:33:14,000 --> 00:33:16,800 Speaker 1: I'm Joe Why Isn't All? You can follow me on 592 00:33:16,840 --> 00:33:19,880 Speaker 1: Twitter at the Stalwart, and you can follow our guests 593 00:33:19,960 --> 00:33:24,280 Speaker 1: on Twitter John Elberg is at at John Elbert. There's 594 00:33:24,320 --> 00:33:27,600 Speaker 1: just one L in that. Zachary Lipton is on Twitter 595 00:33:27,800 --> 00:33:31,800 Speaker 1: at Zachary Lipton, and you can follow our producer Sarah 596 00:33:31,800 --> 00:33:35,640 Speaker 1: Patterson on Twitter at Sarah pat with two teams. Thanks 597 00:33:35,640 --> 00:33:36,160 Speaker 1: for listening,