WEBVTT - Goldman Sachs CIO on How the Bank Is Actually Using AI 0:00:02.680 --> 0:00:19.919 Bloomberg Audio Studios, Podcasts, radio News. Hello and welcome to 0:00:19.960 --> 0:00:23.600 another episode of the Odd Blots podcast. I'm Tracy Alloway. 0:00:23.320 --> 0:00:24.520 And I'm Joe Wisenthal. 0:00:24.800 --> 0:00:28.920 Joe, what's been your favorite chat GPT or claude prompt 0:00:29.040 --> 0:00:29.400 so far? 0:00:31.320 --> 0:00:33.040 You know, it's funny because I have a lot of 0:00:33.040 --> 0:00:37.040 fun with them, and also I use them for serious things. 0:00:37.040 --> 0:00:40.560 So I'll like upload conference call transcripts and say, tell 0:00:40.600 --> 0:00:44.440 me what this company said about labor market indicators or 0:00:44.520 --> 0:00:46.880 something like that, and that'll be extremely useful for that. 0:00:47.159 --> 0:00:49.519 Wait, do you actually find that more efficient than just 0:00:49.520 --> 0:00:52.720 doing a word search for like labor or working? I don't. 0:00:52.760 --> 0:00:54.760 I hate uploading stuff because you can only do it 0:00:54.760 --> 0:00:55.840 in like fragments. 0:00:56.120 --> 0:01:01.960 No, what, Tracy, Oh, let me, I'll show you how prompt? Okay, No, 0:01:02.160 --> 0:01:05.040 I get a lot of professional use out of the 0:01:05.160 --> 0:01:08.080 various AI tools, but I also, you know, have a 0:01:08.120 --> 0:01:11.039 lot of fun with them. And there's even a song 0:01:11.160 --> 0:01:13.480 and I'm not going to say which one that I wrote. 0:01:13.880 --> 0:01:16.840 I didn't use the lyrics. No, I did not like 0:01:16.880 --> 0:01:18.800 because it's very good. Wait what did you use? 0:01:18.920 --> 0:01:21.080 Did he give you an actual melody? What happened? 0:01:21.160 --> 0:01:21.280 No? 0:01:21.560 --> 0:01:24.720 So there was a song that I liked, okay, and 0:01:24.840 --> 0:01:28.600 the song title sort of rested upon a pun okay, 0:01:28.959 --> 0:01:31.800 and so I asked chat GPT to come up with 0:01:31.920 --> 0:01:38.040 another song that sort of like had a similar twist 0:01:38.240 --> 0:01:40.880 based on the headline of that song. I needed basically 0:01:40.920 --> 0:01:43.000 a song prompt idea. 0:01:43.240 --> 0:01:45.800 This opens up a whole can of worms. No, this 0:01:45.920 --> 0:01:48.600 is actually the perfect segue into what we're going to 0:01:48.680 --> 0:01:52.400 talk about today, because for you and I, using something 0:01:53.040 --> 0:01:56.400 like a chat GPT, we don't really have the same 0:01:56.520 --> 0:02:01.800 concerns that a proper company or large which corporation would have, Like, 0:02:01.840 --> 0:02:04.640 it doesn't really matter to us if the answer is wrong. 0:02:04.720 --> 0:02:06.680 I mean, ideally you would like it to be correct, 0:02:06.680 --> 0:02:10.000 but if I'm just asking some silly question, it doesn't 0:02:10.000 --> 0:02:12.560 really matter what chat gpt spits out at me. And 0:02:12.680 --> 0:02:16.119 also copyright kind of doesn't matter, so we don't care 0:02:16.320 --> 0:02:18.400 what it spits out in terms of who owns it, 0:02:18.440 --> 0:02:21.040 and also we don't care what we're putting in in 0:02:21.160 --> 0:02:23.920 terms of who owns that. That's right, But if you 0:02:23.960 --> 0:02:27.919 are a company you are thinking about generative AI very differently. 0:02:28.280 --> 0:02:30.040 I just want to say one thing, which is. 0:02:29.960 --> 0:02:32.200 That your defense Okay, defend yourself. 0:02:32.240 --> 0:02:35.320 No, No, I'm not even trying to defend myself. If I upload, say, 0:02:35.760 --> 0:02:38.000 you know, the McDonald's earning transcript, and I say, what 0:02:38.040 --> 0:02:40.720 does McDonald say about the labor market, then there's some quote. 0:02:41.040 --> 0:02:43.440 I always go back and check that that quote is 0:02:43.520 --> 0:02:45.799 actually in there. So I do very good, you know, 0:02:45.840 --> 0:02:49.040 I'm not just blindly relying on it. I do also 0:02:49.320 --> 0:02:52.120 do my own work and everything. But yeah, it's very true. 0:02:52.160 --> 0:02:54.480 Like so I can say I get a tremendous amount 0:02:54.480 --> 0:02:56.840 of use from chat, GPT or Claude or whatever, and 0:02:56.880 --> 0:03:00.600 it is very useful to me. But it makes mistakes sometimes, 0:03:00.880 --> 0:03:03.920 and if you think about deploying AI in the sort 0:03:03.960 --> 0:03:08.720 of enterprise world, then maybe like a one percent mistake 0:03:08.840 --> 0:03:11.799 raid or a one percent hallucination or you ever want 0:03:11.800 --> 0:03:14.960 to call them, is just completely unacceptable and a level 0:03:15.000 --> 0:03:19.880 of risk that makes it almost unusual for professional purposes. 0:03:19.919 --> 0:03:22.480 Absolutely. And of course the other thing with AI is 0:03:22.639 --> 0:03:25.880 there is still this ongoing, very heated debate about how 0:03:25.919 --> 0:03:29.200 transformational it's actually going to be. So you and I 0:03:29.400 --> 0:03:32.160 are using it as you know, a productivity hack in 0:03:32.240 --> 0:03:35.880 some cases, or maybe to generate song lyrics or even 0:03:36.280 --> 0:03:41.720 songs in some cases, but what is the true use 0:03:41.840 --> 0:03:44.480 case for this particular technology. There's still a lot of 0:03:44.480 --> 0:03:47.800 debate about that, and so I'm very pleased to say 0:03:47.840 --> 0:03:50.400 we do, in fact have the perfect guest. We're going 0:03:50.440 --> 0:03:54.280 to be speaking to someone who is implementing AI at 0:03:54.280 --> 0:03:57.080 a very, very large financial institution. We're going to be 0:03:57.080 --> 0:04:02.080 speaking with Marco Urgenti, the chief information officer at Goldman Sachs. Marco, 0:04:02.160 --> 0:04:03.760 thank you so much for coming on of thoughts. 0:04:04.200 --> 0:04:05.200 Thank you for having me. 0:04:05.600 --> 0:04:08.680 Marco tell us what a chief information officer does at 0:04:08.680 --> 0:04:11.920 Goldman Sachs. Whenever I see CIO, I always think chief 0:04:11.920 --> 0:04:15.240 investment officer, as it's very confusing. Yeah, so what does 0:04:15.280 --> 0:04:16.640 the other CIO do? 0:04:17.480 --> 0:04:19.880 So last week I was in Italy visiting my mother. 0:04:20.200 --> 0:04:23.320 She's eighty three, and she obviously doesn't know much about 0:04:23.360 --> 0:04:26.800 technology or banking, and so she said, what do you 0:04:26.880 --> 0:04:29.080 do with Coleman? And I said, you know, I just 0:04:29.120 --> 0:04:31.320 tried to simplify. I say, make sure that the printers 0:04:31.320 --> 0:04:36.880 don't run out of And interestingly, the CIO job has 0:04:36.920 --> 0:04:40.359 been traditionally associated with the word it. 0:04:41.080 --> 0:04:42.360 Okay and it. 0:04:42.640 --> 0:04:45.200 I tell you, talk to any technologist, they don't want 0:04:45.200 --> 0:04:46.400 to be classified as IT. 0:04:47.320 --> 0:04:49.880 Right, because those are you associated with those are the 0:04:49.920 --> 0:04:51.960 people who like, see if the ethernet cable with. 0:04:51.960 --> 0:04:54.159 Those are the ones who tell you that those that 0:04:54.600 --> 0:04:56.480 you know, I mean, I have a lot of respect 0:04:56.520 --> 0:04:59.000 for it, but generally you go to the IT department 0:04:59.000 --> 0:05:02.440 when something doesn't work, okay, And so it's very back 0:05:02.520 --> 0:05:06.400 office and something that attracted me to this job. I've 0:05:06.440 --> 0:05:08.080 been here for five years and this is the first 0:05:08.120 --> 0:05:10.240 time that I do like a CIO job. Before I 0:05:10.279 --> 0:05:12.880 was doing more like, you know, creating technology, et cetera, 0:05:12.920 --> 0:05:15.200 and service. I can talk about that, but is the 0:05:15.240 --> 0:05:17.080 fact that the role of a CEO has actually changed 0:05:17.160 --> 0:05:21.599 quite a bit, and now it's about really asking the question, 0:05:21.800 --> 0:05:26.760 you know, how do we implement technology in order to 0:05:26.960 --> 0:05:30.480 achieve our strategic objectives and actually to be differentiated, And 0:05:30.520 --> 0:05:33.520 it's really sitting at the strategic table of the firm. 0:05:33.560 --> 0:05:33.880 Okay. 0:05:34.760 --> 0:05:37.440 So today we live in a world where obviously a 0:05:37.480 --> 0:05:39.279 lot of the things that we want to do, or 0:05:39.360 --> 0:05:42.400 every company wants to do, are really kind of determined 0:05:42.400 --> 0:05:45.599 by how good you are at technology. And so I 0:05:45.640 --> 0:05:48.080 think the role of the CIO has changed quite a bit. 0:05:48.200 --> 0:05:50.680 And now, you know, I would define it as in general, 0:05:51.279 --> 0:05:54.159 defining the technology strategy of a firm and also making 0:05:54.160 --> 0:05:56.520 sure that you have the right culture in the engineering 0:05:56.560 --> 0:05:58.240 team in order to execute on that. 0:05:58.600 --> 0:06:00.880 What's the day to day look like? Like, what's the 0:06:00.920 --> 0:06:03.360 typical day you get into the office and then what. 0:06:03.320 --> 0:06:03.599 Do you do? 0:06:04.120 --> 0:06:06.880 Well? I mean I get into the office, and I generally, 0:06:07.160 --> 0:06:09.440 like everybody else, you know, I talk to people every 0:06:09.520 --> 0:06:11.599 day all day, and so I talk to people. You know, 0:06:11.640 --> 0:06:13.640 we have a bunch of meetings one after the other. End. 0:06:13.640 --> 0:06:16.600 I have teams coming to me with either regularly scheduled 0:06:16.600 --> 0:06:19.800 meetings or meetings that have been requested to discuss a 0:06:19.839 --> 0:06:23.440 certain topic. And you know, we just go through is 0:06:23.480 --> 0:06:27.159 there a whiteboard? Well right now in the age of Zoom, 0:06:27.360 --> 0:06:29.880 I guess still. You know, we have a globally distributed 0:06:29.880 --> 0:06:31.479 team and so a lot of our people are not 0:06:31.600 --> 0:06:33.880 in the same office, and so we use virtual whiteboards 0:06:33.920 --> 0:06:36.640 like everybody else. But I would say, you know, one 0:06:36.680 --> 0:06:39.280 of the things that I tried to do while joining Golma, 0:06:39.320 --> 0:06:41.960 which was part of sort of the cultural agen that 0:06:42.160 --> 0:06:48.760 was emphasizing the importance of narratives and written world versus 0:06:48.800 --> 0:06:51.920 you know, PowerPoint and talking. Okay, so, which is kind 0:06:51.920 --> 0:06:54.000 of what I learned that Amazon over the years. Okay, 0:06:54.080 --> 0:06:56.880 all right, w I was a REDWS and one of 0:06:56.880 --> 0:06:59.200 the things you learned there as soon as you join Amazon, 0:06:59.279 --> 0:07:03.680 in any part of Amazon, like the first few meetings 0:07:03.680 --> 0:07:07.160 are kind of shocking because nobody talks. Everybody starts reading. 0:07:07.560 --> 0:07:10.559 You start reading for like sometimes thirty minutes or forty 0:07:10.600 --> 0:07:14.480 five minutes, and if you're the author of the document, 0:07:15.000 --> 0:07:18.000 you're just sitting there basically, and you just try to 0:07:18.080 --> 0:07:20.760 look at people's faces and understand what they think about 0:07:20.800 --> 0:07:22.840 your document. And sometimes, you know, if you're with Jeff 0:07:22.880 --> 0:07:25.720 Bezos or others, you know, at that time it can 0:07:25.760 --> 0:07:29.480 be pretty pretty terrifying. And so this kind of shift 0:07:29.760 --> 0:07:34.240 from a culture of people talk, people comment on a PowerPoint, 0:07:34.280 --> 0:07:37.720 and the discussion sometimes get you know, driven by who 0:07:37.720 --> 0:07:40.520 has the stronger personality versus, you know, who has the 0:07:40.520 --> 0:07:43.600 greatest ideas. One of the things that I try to 0:07:43.720 --> 0:07:45.280 change is that a lot of the meetings that we 0:07:45.360 --> 0:07:49.120 do today actually start the same way by reading a document. 0:07:49.880 --> 0:07:51.920 So I now read a lot of documents like I 0:07:52.000 --> 0:07:54.240 used to in Amazon. You know, I would say maybe 0:07:54.400 --> 0:07:56.800 thirty forty percent of the meeting are starting that way, 0:07:57.520 --> 0:08:00.280 and I think people love it because it breaks the 0:08:00.320 --> 0:08:02.560 barrier of language for someone like me, that English is 0:08:02.600 --> 0:08:05.720 obviously not my first language, breaks the Sometimes some of 0:08:05.760 --> 0:08:08.040 the people are more shy than others, et cetera. So 0:08:08.040 --> 0:08:11.120 people see that as a mechanism for inclusion. So back 0:08:11.160 --> 0:08:14.400 to your question, let's say thirty forty percent of my 0:08:14.520 --> 0:08:17.840 meetings actually now start by us reading a document together 0:08:17.920 --> 0:08:19.920 and then commenting on that and making decisions. 0:08:20.000 --> 0:08:22.800 Can I just say, Tracy, I've always thought more meetings 0:08:23.000 --> 0:08:24.920 you should start with just reading. Because you go to 0:08:25.000 --> 0:08:27.920 you hear like a quarterly call or a FED event, 0:08:28.280 --> 0:08:30.680 and someone just reads out of prepared text. It's like, 0:08:30.880 --> 0:08:33.080 just let everyone read it and just jump straight into like, 0:08:33.160 --> 0:08:34.400 let everyone do the reading first. 0:08:34.440 --> 0:08:35.320 You don't need someone. 0:08:35.160 --> 0:08:38.440 Standing up there talking about what's on a written piece 0:08:38.480 --> 0:08:39.240 of paper somewhere. 0:08:39.240 --> 0:08:43.880 Anyway, I agree that we could reduce the time of meetings. Yes, okay, 0:08:43.880 --> 0:08:47.400 So speaking of meetings and the decision making process, then 0:08:47.760 --> 0:08:52.160 talk to us about how Goldman Sachs decided to approach 0:08:52.520 --> 0:08:56.320 generative AI. What was the decision making process? Like there 0:08:56.440 --> 0:08:59.680 the development process, and you know, we'll get to what 0:08:59.720 --> 0:09:02.720 you're developing, but like, how did you initially approach it? 0:09:03.160 --> 0:09:07.679 So I think our initial approach was really to realize 0:09:07.880 --> 0:09:10.480 that there were so many more things that we didn't 0:09:10.520 --> 0:09:13.080 know compared to the things that we knew, because it's 0:09:13.120 --> 0:09:15.760 a really new thing, and even for companies like us 0:09:15.760 --> 0:09:19.240 that have been working on machine learning and traditionally I 0:09:19.400 --> 0:09:23.920 for literally decades, this felt like a very different thing. 0:09:24.400 --> 0:09:26.920 What sort of timeframe are we talking about? Like, was 0:09:26.960 --> 0:09:29.760 there a sort of like big realization that this is 0:09:29.800 --> 0:09:31.199 something that we need to focus on. 0:09:31.679 --> 0:09:35.000 Yes, because I was lucky enough that I got into 0:09:35.280 --> 0:09:40.480 the very very early version of GPT, even before it 0:09:40.520 --> 0:09:44.079 was called chat GIBT. So the very first version was 0:09:44.200 --> 0:09:49.440 essentially completing a sentence. It wasn't even allowing you to 0:09:49.440 --> 0:09:52.440 do interactive chat. You would just paste a text and 0:09:52.520 --> 0:09:55.240 that will just complete that text. And so I started 0:09:55.240 --> 0:09:56.920 to do that with a bunch of stuff, and then 0:09:56.960 --> 0:09:59.439 I was saying that the quality which this will continue 0:10:00.400 --> 0:10:03.200 was pretty much indistinguishable with the part that you actually 0:10:03.200 --> 0:10:05.800 put in that. And so we started to obviously talk 0:10:05.880 --> 0:10:08.559 between ourselves but also among other people in the industry, 0:10:08.600 --> 0:10:12.560 and we all realized very soon that this would be 0:10:12.880 --> 0:10:15.880 something very different, but be also something that could have 0:10:15.920 --> 0:10:18.120 a pretty profound impact in what we do. Because at 0:10:18.160 --> 0:10:21.599 the end of the day, we are a purely digital business. 0:10:21.760 --> 0:10:24.200 We don't bend metal, we don't you know, like use 0:10:24.280 --> 0:10:26.960 high temperatures. We don't really have physics. So it's all 0:10:27.000 --> 0:10:29.679 about how we service our clients. It's all about how 0:10:29.720 --> 0:10:33.000 smart we are. It's all about how we can process 0:10:33.160 --> 0:10:36.839 incredible amount of information. It's all about, you know, how 0:10:36.880 --> 0:10:40.200 we analyze data in a very sometimes opinionated way. We 0:10:40.280 --> 0:10:43.160 form our own views on the market, we form our 0:10:43.280 --> 0:10:47.000 views of investments, et cetera. And so given that this 0:10:47.240 --> 0:10:52.880 AI showed very early sign of being able to synthesize 0:10:53.000 --> 0:10:57.719 and summarize very complex set of information but also identify patterns, 0:10:58.320 --> 0:11:00.679 we thought that could be something that we definitely need 0:11:00.720 --> 0:11:03.920 to pay attention to. So given that, one of the 0:11:03.960 --> 0:11:06.640 things that we decided to do very early on was 0:11:06.840 --> 0:11:09.920 to put a structure and I can say that more 0:11:09.960 --> 0:11:13.120 about that, put a structure around this so that we 0:11:13.160 --> 0:11:17.640 could experiment but in a sort of safe and controlled way. 0:11:18.080 --> 0:11:21.760 Right, So you decided to develop your own Goldman Sachs 0:11:21.880 --> 0:11:26.199 AI model versus you know, use a chat, GPT or 0:11:26.320 --> 0:11:27.640 clod or getting something off the show. 0:11:27.679 --> 0:11:30.679 Actually, initially we kind of thought about that, but then 0:11:30.800 --> 0:11:34.120 very quickly. We decided that our time was spent much 0:11:34.160 --> 0:11:37.960 better with using existing models, which by the way, we're 0:11:37.960 --> 0:11:41.920 iterating really really quickly, but then put them in a 0:11:41.960 --> 0:11:44.960 condition so that they would be safe to use and 0:11:45.000 --> 0:11:48.240 also they would actually give us the most reliable information, 0:11:48.360 --> 0:11:51.960 because taken as they are, you can't just drop a 0:11:52.040 --> 0:11:55.560 model in an environment like Goldman and then, like you know, 0:11:55.640 --> 0:11:57.960 to your earlier point of a one percent in accuracy, 0:11:58.200 --> 0:12:02.800 zero point one percent in accuracy completely an acceptable class. 0:12:03.520 --> 0:12:06.520 There are a lot of potential issues related to you know, 0:12:06.559 --> 0:12:09.240 what data has it been used to train? And you know, 0:12:09.280 --> 0:12:12.240 there is a lot of uncertainty with regards to you know, 0:12:12.320 --> 0:12:15.079 like what are the boundaries between what you can safely 0:12:15.160 --> 0:12:17.560 use and what you can And so what we decided 0:12:17.600 --> 0:12:23.080 to do was instead to build a platform around the model. 0:12:23.160 --> 0:12:25.120 So think of that almost as if you had a 0:12:25.200 --> 0:12:28.840 nuclear reactor. You know that now you have invented fission 0:12:28.920 --> 0:12:30.880 or fusion, and there is a lot of power that 0:12:30.920 --> 0:12:33.280 can be generated from that, but then you need to 0:12:33.320 --> 0:12:36.080 contain it and direct it in a certain way. And 0:12:36.080 --> 0:12:40.280 so we build this GSAI platform, which essentially takes a 0:12:40.360 --> 0:12:43.200 variety of models that we select, puts them in the 0:12:43.240 --> 0:12:47.840 condition of being completely segregated and completely secluded and completely 0:12:47.880 --> 0:12:51.800 safe from an information a security standpoint. Abstract some of 0:12:51.840 --> 0:12:54.559 the ways to use the model, so that our developers 0:12:54.559 --> 0:12:58.360 can use the models interchangeably, and then creates a set 0:12:58.400 --> 0:13:03.480 of standardized way, for example, improve the accuracy using retrieval, 0:13:03.520 --> 0:13:09.199 a granted generation, access external or internal data sources, applying 0:13:09.960 --> 0:13:13.160 entitlement so that someone is on the private side, you know, 0:13:13.160 --> 0:13:15.160 I've got to see different information that someone is on 0:13:15.200 --> 0:13:18.240 the public side. And then on top of that, build 0:13:18.320 --> 0:13:21.920 a developer environment so that people will very easily be 0:13:22.000 --> 0:13:25.760 able to embed that AI in their own applications. So 0:13:25.880 --> 0:13:28.880 imagine this, we got a great engine and we decided 0:13:28.920 --> 0:13:30.320 to build a great car around that. 0:13:45.960 --> 0:13:47.440 What are you putting in the model? 0:13:47.440 --> 0:13:49.840 Because I have to imagine at a bank like Goldman, 0:13:50.080 --> 0:13:51.600 you know, you have a lot of data, but you 0:13:51.679 --> 0:13:54.720 must have just an extraordinary amount of unstructured data. There's 0:13:54.800 --> 0:13:59.880 conversations that bankers have with clients. There's other sort of meeting, 0:14:00.000 --> 0:14:02.320 the meetings you have, and there's words that are said 0:14:02.400 --> 0:14:05.840 during that meeting that could be synthesized in some way. 0:14:06.280 --> 0:14:11.200 In these early iterations, you know, I upload a conference 0:14:11.200 --> 0:14:12.960 called transcript and I ask a question, what do you 0:14:13.040 --> 0:14:16.040 upload it? What is the unstructured data that you have 0:14:16.800 --> 0:14:19.240 or the questions or these yeah, what are you what 0:14:19.280 --> 0:14:22.240 are you putting into it from your reams of knowledge 0:14:22.240 --> 0:14:23.280 that you must have internally. 0:14:23.720 --> 0:14:26.200 So one of the first things that we did was 0:14:26.680 --> 0:14:30.320 use the platform and the models to extract information from 0:14:30.520 --> 0:14:34.280 publicly available documents. That's kind of the safest way public 0:14:34.320 --> 0:14:36.720 filing all the case or the queues and you know, 0:14:36.760 --> 0:14:40.600 and obviously earnings, and put our bankers in a condition 0:14:40.720 --> 0:14:45.480 to be able to ask very very sophisticated multi dimensional 0:14:45.600 --> 0:14:50.960 questions around what was reported, cross refit with previous reports, 0:14:51.320 --> 0:14:55.560 cross refit with any announcement, any earnings, called transcripts, all 0:14:55.640 --> 0:14:57.880 things that are out there but just are difficult to 0:14:57.880 --> 0:15:00.920 bring together. And so that as a involved into a 0:15:01.000 --> 0:15:04.600 tool that physically we use and we're rolling it out 0:15:04.680 --> 0:15:08.520 right now as an assistant to our bankers so that 0:15:08.680 --> 0:15:11.360 they can you know, service their client or answer client 0:15:11.480 --> 0:15:14.720 questions or even their wrong questions. In a time there 0:15:14.760 --> 0:15:17.280 is a fraction of what you used to take even 0:15:17.400 --> 0:15:21.720 generate documents that then can be you know, shared the 0:15:21.760 --> 0:15:23.720 clients and so on and so forth. And obviously we 0:15:23.800 --> 0:15:27.360 always have as a rule, like when you drive a 0:15:27.400 --> 0:15:29.920 car that has some autonomous capability, that you always keep 0:15:30.000 --> 0:15:31.840 the hands on the wheel. Our rule is that there 0:15:31.840 --> 0:15:33.720 always needs to be a human in the loop. Okay, 0:15:34.200 --> 0:15:37.240 And so the way that works is actually interesting because 0:15:37.320 --> 0:15:40.720 we found out that you can't just shove something into 0:15:40.720 --> 0:15:42.960 a model and then pretend that the model is going 0:15:43.000 --> 0:15:47.560 to give you the answer right away. Why well, because models, 0:15:47.600 --> 0:15:51.800 by themselves, you know, they essentially apply a stochastic or 0:15:51.840 --> 0:15:54.560 a statistical way to understand what is the next world 0:15:54.560 --> 0:15:57.240 that they need to say. So, no matter how good 0:15:57.560 --> 0:16:00.440 is the material that you put in, there's always going 0:16:00.480 --> 0:16:03.160 to be some level of variability. There is almost like 0:16:03.240 --> 0:16:06.000 the intersection between the documents that you insert and what 0:16:06.200 --> 0:16:09.160 is I call it like the shadow of all the 0:16:09.200 --> 0:16:11.720 knowledge of all the things that the model has seen before. 0:16:12.520 --> 0:16:15.280 And so we really perfected this. You know, there are 0:16:15.280 --> 0:16:19.680 two techniques that are widely used to improve the accuracy 0:16:19.680 --> 0:16:23.920 of the answers. One is working on the way those 0:16:24.000 --> 0:16:28.960 models represent knowledge, which is called embeddings technically, and the 0:16:29.000 --> 0:16:31.760 concept of embeddings by the way, everybody talks about embeddings, 0:16:31.760 --> 0:16:34.720 but then for very few people actually it took me 0:16:34.760 --> 0:16:38.320 a while to understand that well. And embedding is simply 0:16:38.520 --> 0:16:41.920 a way for the model to parameterize and create a 0:16:42.040 --> 0:16:45.040 description of what they're seeing. So if I see a phone, 0:16:45.080 --> 0:16:47.280 for example, in front of me, the embeddings of a 0:16:47.320 --> 0:16:50.920 phone could be it's a piece of electronic Yes, one, 0:16:51.000 --> 0:16:54.840 it's definitely a piece of electronics. It's edible. Zero. You 0:16:54.880 --> 0:16:56.840 can't really eat it, you know, And then you have 0:16:56.880 --> 0:17:00.760 all these parameters. Is almost like twenty questions. I give 0:17:00.800 --> 0:17:02.720 you all these questions and then you finally understand that 0:17:02.800 --> 0:17:04.960 it's a phone, and that's what the embeddings is almost 0:17:04.960 --> 0:17:08.000 like the twenty questions of the reality instead of twenty 0:17:08.080 --> 0:17:11.479 is like twenty twenty thousands. And then you have DRAG, 0:17:11.560 --> 0:17:14.560 which is the retrieval augmented generation, which is actually interesting 0:17:14.600 --> 0:17:18.720 because you tell the model that instead of using its 0:17:18.760 --> 0:17:20.919 on internal knowledge in order to give you an answer, 0:17:20.960 --> 0:17:23.640 which sometimes, as I said, is like a representation of reality, 0:17:23.680 --> 0:17:26.840 but it's often not accurate, you point them to the 0:17:26.920 --> 0:17:30.520 right sections of the document that actually is more likely 0:17:30.560 --> 0:17:33.280 to answer your question. Okay, and that's the key. It 0:17:33.320 --> 0:17:35.480 needs to point to the right sections and then you 0:17:35.520 --> 0:17:38.560 get the citations back. So that took a lot of effort. 0:17:39.040 --> 0:17:41.880 But we're using that in many many cases because then 0:17:41.920 --> 0:17:45.399 we expanded the use case from purely like banker assistant 0:17:45.440 --> 0:17:48.840 in a way to more like okay, document management. You know, 0:17:48.880 --> 0:17:53.280 we process millions of documents. Think of that credit confirmation 0:17:53.600 --> 0:17:59.439 implements confirmation. Every document has a task called entity strauction. 0:17:59.560 --> 0:18:02.639 So you need to extract stuff from the document and 0:18:02.680 --> 0:18:05.320 then digitize it and then model it in a certain way. 0:18:05.880 --> 0:18:09.600 And so the use of general TVii there does a 0:18:09.680 --> 0:18:14.600 great job at extracting information. And this is an interesting 0:18:14.640 --> 0:18:19.760 concept because you don't have to actually tell a fixed pattern. 0:18:20.000 --> 0:18:22.520 You can just say, give a lot of examples, and 0:18:22.560 --> 0:18:24.840 then the AI will figure out from that pattern. One 0:18:24.840 --> 0:18:27.520 of my favorite example is the following. Let's say that 0:18:27.600 --> 0:18:31.480 my phone number is five five three two one three 0:18:31.640 --> 0:18:35.439 h five oh, and someone writes in the document instead 0:18:35.440 --> 0:18:39.600 of with zero rights an oh. Okay. You can test 0:18:39.640 --> 0:18:42.840 yourself even with GPT, if you give a number with 0:18:42.920 --> 0:18:45.480 an O instead of zero, and you ask GPT, what's 0:18:45.800 --> 0:18:50.320 likely wrong with this entity? GPT is gonna tell you, well, 0:18:50.960 --> 0:18:53.520 it looks like a phone number that is an all, 0:18:53.600 --> 0:18:56.280 which general is not in phone numbers. Most likely this 0:18:56.320 --> 0:19:00.600 is the correct phone number. Now, nobody has written software 0:19:01.040 --> 0:19:04.080 to do a pattern match in there. And imagine if 0:19:04.119 --> 0:19:06.800 in the tradition, in traditional way of doing antity instruction, 0:19:06.920 --> 0:19:10.280 there were developers that were writing rules. They were saying, okay, numbers, 0:19:10.480 --> 0:19:13.560 it needs to be ten digits and blah blah blah. 0:19:13.760 --> 0:19:16.000 The AI figures. 0:19:15.560 --> 0:19:18.520 Out their own rules. 0:19:17.960 --> 0:19:20.600 That are the most likely. So this is the key thing. 0:19:20.760 --> 0:19:24.240 It has common sense. And that common sense when you're 0:19:24.359 --> 0:19:28.840 dealing with millions of documents that contain all bunch of 0:19:28.960 --> 0:19:32.320 ways that you must might have written those things, and 0:19:32.400 --> 0:19:34.439 imagine the complexity of all the rules that you need 0:19:34.480 --> 0:19:37.560 to write. And every bank has the same problem. This 0:19:37.720 --> 0:19:42.680 simplifies things tremendously because it's able to figure out what's 0:19:42.800 --> 0:19:48.240 most likely by itself. And so that thing evolved into 0:19:48.560 --> 0:19:52.080 a tremendous time saving for everybody in the bank that 0:19:52.119 --> 0:19:54.639 has to do with the workflow documents. And so that 0:19:55.119 --> 0:19:57.359 was a very interesting finding that we did early on. 0:19:57.440 --> 0:20:02.119 And so again to summarize more, those are raw material 0:20:02.240 --> 0:20:05.800 of intelligence. You know you need to somehow direct them, 0:20:05.840 --> 0:20:07.960 you need to guide them, you need to instruct them, 0:20:07.960 --> 0:20:10.040 you need to put them in an environment that actually 0:20:10.080 --> 0:20:12.000 gets the most out of that, and that's what we've 0:20:12.000 --> 0:20:12.800 been focusing on. 0:20:13.119 --> 0:20:16.240 So going back to the analogy that you used previously, 0:20:16.280 --> 0:20:18.800 this idea of a nuclear reactor and sort of building 0:20:18.840 --> 0:20:22.400 the containment casing or the protective casing around it. I 0:20:22.400 --> 0:20:26.359 imagine one of the complications of being Goldman Sachs and 0:20:26.400 --> 0:20:29.960 working with AI is that you're a regulated financial entity. 0:20:30.560 --> 0:20:35.480 How does that added complexity affect your use of AI. 0:20:35.640 --> 0:20:39.680 Are there additional data considerations or additional infosec considerations. 0:20:40.240 --> 0:20:44.520 I think that's a great question, because obviously we live 0:20:44.560 --> 0:20:47.040 in a regulated world, and in fact, I have to 0:20:47.040 --> 0:20:49.880 tell you that in this case, regulation actually helps us 0:20:50.000 --> 0:20:53.520 think through all the possible unknown now, something that, as 0:20:53.520 --> 0:20:56.080 I said, is something that is still largely something that 0:20:56.119 --> 0:20:59.600 nobody really completely understands. And so what we did was 0:20:59.680 --> 0:21:03.240 to put but governance around the usage of the models 0:21:03.280 --> 0:21:05.760 and also governance with regards to the use cases that 0:21:05.800 --> 0:21:09.119 we can implement on the models. Every bank has a 0:21:09.160 --> 0:21:12.639 function called model risk, which, in the traditional sense, a 0:21:12.720 --> 0:21:18.040 model is any decision or any algorithm that is running 0:21:18.080 --> 0:21:21.040 automatically to do for example, pricing or you know, there 0:21:21.119 --> 0:21:24.400 is a lot of that tradition in every bank risk calculation, etc. 0:21:24.760 --> 0:21:27.920 So that's the traditional model risk. We use that very 0:21:27.960 --> 0:21:31.000 well established pattern. That is also you know, that has 0:21:31.040 --> 0:21:35.280 its own second and third line like controls and supervision 0:21:35.840 --> 0:21:38.840 also to validate what we do on the AI side. 0:21:38.880 --> 0:21:41.600 So there is a governance part which we really set 0:21:41.680 --> 0:21:44.359 up very early on. We have an AI committee that 0:21:44.400 --> 0:21:47.479