WEBVTT - Inside View: The AI Personas Needed to Diagnose Disease

0:00:13.960 --> 0:00:17.239
<v Speaker 1>Welcome to tech Stuff. This is the inside View. I'm

0:00:17.280 --> 0:00:19.440
<v Speaker 1>os Vloschen here with Cara Price.

0:00:19.840 --> 0:00:23.319
<v Speaker 2>Hello, so as I'm very curious to know more about

0:00:23.320 --> 0:00:25.000
<v Speaker 2>the story you've brought me this week, since it's a

0:00:25.040 --> 0:00:27.440
<v Speaker 2>topic we discussed a lot on this podcast.

0:00:27.840 --> 0:00:31.320
<v Speaker 1>Yes, so today I've got a story about AI in healthcare,

0:00:31.760 --> 0:00:35.960
<v Speaker 1>specifically AI and diagnosis. I spoke with doctor Matthew Lungren,

0:00:35.960 --> 0:00:39.040
<v Speaker 1>who is the chief Scientific officer for Microsoft Health and

0:00:39.080 --> 0:00:43.320
<v Speaker 1>Life Sciences, about this blog post that Microsoft recently published

0:00:43.600 --> 0:00:47.440
<v Speaker 1>with the title the Path to Medical Superintelligence.

0:00:47.880 --> 0:00:50.840
<v Speaker 2>Do I want to know what medical superintelligence is? It's

0:00:50.880 --> 0:00:54.639
<v Speaker 2>more big than just regular intelligence. But I actually heard

0:00:54.640 --> 0:00:57.760
<v Speaker 2>about this study. It was everywhere, and if I remember correctly,

0:00:57.840 --> 0:01:00.520
<v Speaker 2>it was that the AI were better at diagnosing than doctors.

0:01:00.600 --> 0:01:04.160
<v Speaker 1>Right, Yeah, that's right, In fact, four times better. There

0:01:04.240 --> 0:01:07.240
<v Speaker 1>was a headline in Time magazine which really says it all.

0:01:07.800 --> 0:01:12.240
<v Speaker 1>Microsoft's AI is better than doctors are diagnosing disease. Special

0:01:12.240 --> 0:01:14.679
<v Speaker 1>shout out here to Elliot Fishman, who's our old friend.

0:01:14.880 --> 0:01:18.000
<v Speaker 1>He's a professor of radiology at Johns Hopkins and he

0:01:18.080 --> 0:01:22.039
<v Speaker 1>runs this fascinating email group that discusses new developments in AI.

0:01:22.760 --> 0:01:24.880
<v Speaker 1>Matthew Lunger and I are both members of this group,

0:01:25.240 --> 0:01:28.120
<v Speaker 1>and Matthew is also one of the authors of the study.

0:01:28.360 --> 0:01:29.880
<v Speaker 2>What kind of doctor is Doctor Lungren?

0:01:30.240 --> 0:01:33.440
<v Speaker 1>Like Elliott Fishman our friend, he's a radiologist by training

0:01:33.520 --> 0:01:36.280
<v Speaker 1>and has a public health background. He was hired at

0:01:36.280 --> 0:01:41.400
<v Speaker 1>Stanford where he started using machine learning to analyze large

0:01:41.480 --> 0:01:43.200
<v Speaker 1>data sets. Here's Matthew.

0:01:43.640 --> 0:01:46.520
<v Speaker 3>Eventually my lab grew into a very large AI center

0:01:46.560 --> 0:01:49.880
<v Speaker 3>at Stanford, which bridged the computer science department in the

0:01:49.880 --> 0:01:54.280
<v Speaker 3>medical school and kind of saw translation of newest techniques

0:01:54.320 --> 0:01:58.920
<v Speaker 3>into healthcare applications accelerate. Taking that work further, I went

0:01:59.000 --> 0:02:03.520
<v Speaker 3>to Microsoft on sabbatical at Microsoft Research and realized that

0:02:03.640 --> 0:02:07.400
<v Speaker 3>a very similar opportunity was there in big tech if

0:02:07.440 --> 0:02:10.800
<v Speaker 3>you could start to connect the latest technology to problems

0:02:10.800 --> 0:02:12.799
<v Speaker 3>in healthcare. And so that's how I came to be here,

0:02:12.800 --> 0:02:14.679
<v Speaker 3>and that's kind of what I still do all day.

0:02:15.160 --> 0:02:17.120
<v Speaker 1>And Matthew is also one of the authors of the

0:02:17.120 --> 0:02:18.040
<v Speaker 1>Microsoft study.

0:02:18.480 --> 0:02:22.240
<v Speaker 3>I believe that the human expert plus these expert systems

0:02:22.280 --> 0:02:24.639
<v Speaker 3>together will ultimately deliver better care.

0:02:25.040 --> 0:02:25.680
<v Speaker 4>No matter what.

0:02:25.560 --> 0:02:29.480
<v Speaker 3>Profession you're in, there's always a gray haired person that has,

0:02:29.680 --> 0:02:31.440
<v Speaker 3>you know, in some sense, seen it all and kind

0:02:31.440 --> 0:02:34.080
<v Speaker 3>of compressed that into their brain and their pattern matching

0:02:34.120 --> 0:02:37.200
<v Speaker 3>in a way that is just faster than folks that

0:02:37.240 --> 0:02:39.920
<v Speaker 3>don't have as much experience. And that's true anywhere, but

0:02:40.000 --> 0:02:42.799
<v Speaker 3>certainly in medicine, right. I think that the assistance or

0:02:42.880 --> 0:02:46.360
<v Speaker 3>ability of AI to now sort of connect dots in

0:02:46.440 --> 0:02:51.680
<v Speaker 3>ways that maybe can achieve that wisdom or that experience

0:02:52.160 --> 0:02:53.400
<v Speaker 3>and bring that to the surface.

0:02:53.760 --> 0:02:55.160
<v Speaker 4>It's kind of an unprecedented time.

0:02:55.480 --> 0:02:59.359
<v Speaker 1>The only exceptional performance I four times better than human doctors.

0:03:00.040 --> 0:03:02.160
<v Speaker 1>One of the things I found most interesting about the

0:03:02.200 --> 0:03:06.040
<v Speaker 1>study was that it wasn't just one single AI model

0:03:06.120 --> 0:03:09.240
<v Speaker 1>doing a diagnosis. It was a whole team of AI

0:03:09.320 --> 0:03:11.920
<v Speaker 1>models that were able to talk to each other in

0:03:12.000 --> 0:03:15.280
<v Speaker 1>order to count with hypotheses, order tests, and ultimately count

0:03:15.280 --> 0:03:16.120
<v Speaker 1>with a diagnosis.

0:03:16.600 --> 0:03:20.360
<v Speaker 2>So multiple AI models seems a little bit unfair.

0:03:20.240 --> 0:03:22.800
<v Speaker 1>Yes, and in fact we talked about this. The doctors

0:03:22.800 --> 0:03:25.760
<v Speaker 1>in the study were not allowed to call specialists to

0:03:25.800 --> 0:03:28.720
<v Speaker 1>help them with their diagnosis, but the ais were allowed

0:03:28.760 --> 0:03:31.160
<v Speaker 1>to talk to each other. So doctors are not going

0:03:31.160 --> 0:03:32.600
<v Speaker 1>to be made obsolete anytime soon.

0:03:32.680 --> 0:03:34.520
<v Speaker 2>Well good, because I have a physical coming up and

0:03:34.560 --> 0:03:37.760
<v Speaker 2>I don't need four AI models being like, well, this

0:03:37.800 --> 0:03:39.520
<v Speaker 2>girl got real big this year.

0:03:40.880 --> 0:03:43.400
<v Speaker 1>Now, as you and I already know, people are already

0:03:43.520 --> 0:03:47.560
<v Speaker 1>using AI regularly to diagnose themselves. In fact, I think

0:03:47.600 --> 0:03:51.440
<v Speaker 1>more than ten percent of the overall CHATCHBT traffic is

0:03:51.480 --> 0:03:54.800
<v Speaker 1>around medical stuff. This is not always music to the

0:03:54.840 --> 0:03:57.480
<v Speaker 1>ear of doctors, so it was interesting to look at

0:03:57.520 --> 0:04:00.200
<v Speaker 1>an example where this is actually an AI build built

0:04:00.360 --> 0:04:03.320
<v Speaker 1>for doctors and to work with doctors rather than patient facing.

0:04:03.800 --> 0:04:06.200
<v Speaker 1>And the other interesting thing for me, which we talk

0:04:06.280 --> 0:04:09.160
<v Speaker 1>about with Lunger, which we'll get to, is how this

0:04:09.280 --> 0:04:13.560
<v Speaker 1>idea of multiple ais talking to each other can simulate

0:04:13.640 --> 0:04:17.120
<v Speaker 1>the experience of the best hospital systems in the US

0:04:17.360 --> 0:04:20.800
<v Speaker 1>for people who otherwise might not have access to these

0:04:20.839 --> 0:04:22.000
<v Speaker 1>panels and experts.

0:04:22.279 --> 0:04:24.920
<v Speaker 2>I can't wait to hear what you learned from him.

0:04:25.240 --> 0:04:28.159
<v Speaker 1>Well, here's the rest of my conversation with doctor Matthew Lungren.

0:04:28.920 --> 0:04:31.920
<v Speaker 1>So you're a trained doctor, and I want to start

0:04:31.960 --> 0:04:34.880
<v Speaker 1>with the basics, which is diagnosis. I'm not sure when

0:04:34.920 --> 0:04:38.200
<v Speaker 1>the last time you made a diagnosis on a patient was,

0:04:38.520 --> 0:04:40.880
<v Speaker 1>but I'd love to hear from you as a doctor.

0:04:41.320 --> 0:04:43.040
<v Speaker 1>What is the process of diagnosis?

0:04:43.320 --> 0:04:46.680
<v Speaker 4>Yeah, I mean it depends quite a bit on the specialty.

0:04:46.720 --> 0:04:50.680
<v Speaker 3>But as most people know, the classic image of a physician,

0:04:50.760 --> 0:04:52.960
<v Speaker 3>right is to speak with the.

0:04:52.880 --> 0:04:54.760
<v Speaker 4>Patient, kind of do a Sherlock Holmes kind of thing.

0:04:54.760 --> 0:04:56.840
<v Speaker 3>Everyone's seen the shows like House and Things are kind

0:04:56.839 --> 0:04:58.960
<v Speaker 3>of sensationalized sort of the approach.

0:04:58.960 --> 0:05:01.200
<v Speaker 4>But really there's a lot of unknowns that you have

0:05:01.240 --> 0:05:01.720
<v Speaker 4>to tease out.

0:05:01.800 --> 0:05:01.880
<v Speaker 2>Right.

0:05:01.960 --> 0:05:04.200
<v Speaker 3>You have to interview the page, you have to obviously

0:05:04.320 --> 0:05:07.120
<v Speaker 3>interpret labs and other information, and you have to start

0:05:07.160 --> 0:05:10.400
<v Speaker 3>to narrow things down and order appropriate tests. Try not

0:05:10.440 --> 0:05:13.240
<v Speaker 3>to chase too many what we call the zebras, but

0:05:13.600 --> 0:05:15.520
<v Speaker 3>keep those in mind in case you're dealing with one, and.

0:05:15.680 --> 0:05:18.160
<v Speaker 1>The zebra would be the classic House episode.

0:05:17.839 --> 0:05:20.039
<v Speaker 3>Right, yeah, right, Well every House episode is a zebra,

0:05:20.080 --> 0:05:22.960
<v Speaker 3>which actually has some relationship to the study we're going

0:05:23.040 --> 0:05:26.520
<v Speaker 3>to talk about today. But in general, it's more common

0:05:26.560 --> 0:05:29.920
<v Speaker 3>to have an uncommon presentation of a common disease than

0:05:30.000 --> 0:05:32.839
<v Speaker 3>in a common presentation of an uncommon disease, if that

0:05:32.880 --> 0:05:33.560
<v Speaker 3>makes sense.

0:05:33.920 --> 0:05:38.600
<v Speaker 1>Right, right, right, And this kind of relationship between AI

0:05:38.680 --> 0:05:41.440
<v Speaker 1>and doctors has been going on for a few years.

0:05:41.760 --> 0:05:45.040
<v Speaker 1>I remember reading a great piece in the Niyoka about

0:05:45.200 --> 0:05:48.840
<v Speaker 1>how one of the challenges for AI was that the

0:05:48.839 --> 0:05:52.920
<v Speaker 1>best doctors can't actually tell you in words why they're

0:05:52.920 --> 0:05:54.200
<v Speaker 1>good at making diagnoses.

0:05:54.320 --> 0:05:55.600
<v Speaker 4>That's right. It's interesting.

0:05:55.640 --> 0:05:58.640
<v Speaker 3>I think there are things that humans have, many cotton

0:05:58.680 --> 0:06:01.000
<v Speaker 3>adiases that are well undo and I think you know,

0:06:01.120 --> 0:06:05.760
<v Speaker 3>keeping that in check while also trying to leverage the

0:06:05.800 --> 0:06:08.359
<v Speaker 3>information in front of you not be affected by the

0:06:08.400 --> 0:06:10.640
<v Speaker 3>case you just saw or something you just heard at

0:06:10.640 --> 0:06:14.200
<v Speaker 3>a conference, or an error that you experienced years ago

0:06:14.240 --> 0:06:18.040
<v Speaker 3>that's still impacting the way that you think about diagnoses.

0:06:18.080 --> 0:06:21.760
<v Speaker 3>And I think those biases have been well published and

0:06:21.839 --> 0:06:25.559
<v Speaker 3>discussed at nauseum in healthcare, but we're kind of dealing

0:06:25.560 --> 0:06:28.760
<v Speaker 3>with this new human plus AI dance.

0:06:29.120 --> 0:06:33.560
<v Speaker 1>That's fascinating. Yeah. I mean I actually slipped and fell

0:06:33.640 --> 0:06:37.080
<v Speaker 1>down a few stairs at the weekend and bashed my

0:06:37.080 --> 0:06:39.160
<v Speaker 1>head slightly on one of the stairs, and then didn't

0:06:39.160 --> 0:06:41.080
<v Speaker 1>feel very well, and I was like, I wonder if

0:06:41.080 --> 0:06:43.800
<v Speaker 1>I could be concussed. So I did a selfie and

0:06:43.839 --> 0:06:45.800
<v Speaker 1>sent it to check GPT and it said my eyes

0:06:45.839 --> 0:06:48.800
<v Speaker 1>look fine. So I actually, if I'd been more wired,

0:06:48.800 --> 0:06:50.159
<v Speaker 1>I would have gone to the doctor. But there's a

0:06:50.200 --> 0:06:51.440
<v Speaker 1>kind of a duck side to that as well.

0:06:51.520 --> 0:06:53.400
<v Speaker 3>Yeah, I mean I think it sounds like you did okay,

0:06:53.400 --> 0:06:55.760
<v Speaker 3>But I would say that the old saying in healthcare

0:06:55.920 --> 0:06:57.880
<v Speaker 3>during the particularly the rise of the Internet, right, which

0:06:57.920 --> 0:07:00.279
<v Speaker 3>is kind of the other similar kind of technology logic

0:07:00.320 --> 0:07:03.840
<v Speaker 3>advancement that impacted healthcare. We used to say to our patients,

0:07:04.400 --> 0:07:07.160
<v Speaker 3>you know, your Google search does not replace our medical degree, right,

0:07:07.160 --> 0:07:09.480
<v Speaker 3>And that wasn't meant to be a condescending but it

0:07:09.560 --> 0:07:11.120
<v Speaker 3>was just sort of like we had to sort of

0:07:11.520 --> 0:07:13.960
<v Speaker 3>pull them back from the abyss of going down a

0:07:14.000 --> 0:07:17.240
<v Speaker 3>rabbit hole and every ache and pain was immediately terminal cancer, right,

0:07:17.240 --> 0:07:19.960
<v Speaker 3>that kind of But today it's different. It sort of

0:07:19.960 --> 0:07:24.240
<v Speaker 3>reference the experience you just mentioned that's happening everywhere. In fact,

0:07:24.280 --> 0:07:27.800
<v Speaker 3>the recent open Ai launch of GPD five, they spent

0:07:27.960 --> 0:07:32.720
<v Speaker 3>fifteen minutes talking with a patient who went through a

0:07:32.840 --> 0:07:35.880
<v Speaker 3>very difficult battle with cancer and worked with the model

0:07:35.880 --> 0:07:40.360
<v Speaker 3>herself and was able to have very complex medical jard

0:07:40.360 --> 0:07:44.000
<v Speaker 3>and explain to her in plain English, was able to

0:07:44.040 --> 0:07:47.120
<v Speaker 3>help her with questions to ask the position. And as

0:07:47.120 --> 0:07:50.360
<v Speaker 3>someone who still practices and sees patients today, I have

0:07:50.400 --> 0:07:52.720
<v Speaker 3>to say my patients are better informed than maybe ever

0:07:52.840 --> 0:07:56.520
<v Speaker 3>and it's kind of changing the bar with this classic

0:07:56.600 --> 0:07:59.600
<v Speaker 3>information asymmetry problem where the patient has to kind of

0:07:59.680 --> 0:08:02.160
<v Speaker 3>keep up up with the technical speak and all the

0:08:02.160 --> 0:08:03.960
<v Speaker 3>information that we spend decades learning.

0:08:04.400 --> 0:08:06.600
<v Speaker 4>It feels like there's almost a better playing field.

0:08:06.600 --> 0:08:08.679
<v Speaker 3>So I can have this conversation with my patient almost

0:08:08.680 --> 0:08:10.520
<v Speaker 3>at a peer level, is right, and then we can

0:08:10.560 --> 0:08:14.040
<v Speaker 3>go through the care journey together. I'm extremely excited about

0:08:14.040 --> 0:08:15.560
<v Speaker 3>that prospect.

0:08:15.880 --> 0:08:17.880
<v Speaker 1>Taking a couple of steps back, I mean, you mentioned

0:08:17.960 --> 0:08:21.240
<v Speaker 1>you've been in and around this since twenty twelve, twenty thirteen.

0:08:22.240 --> 0:08:24.520
<v Speaker 1>Why do people want to use AI medicine.

0:08:24.600 --> 0:08:28.560
<v Speaker 3>Well, it's an incredibly challenging discipline and it has only

0:08:28.720 --> 0:08:31.680
<v Speaker 3>become more so maybe in the last ten or fifteen years.

0:08:32.640 --> 0:08:34.320
<v Speaker 3>One of the things that is going on is that

0:08:34.400 --> 0:08:39.439
<v Speaker 3>information is doubling roughly every ninety days medical information. That

0:08:39.760 --> 0:08:41.800
<v Speaker 3>trend has been going on for a really long time.

0:08:41.840 --> 0:08:45.960
<v Speaker 3>And what does publication of papers, publication of papers, new therapies,

0:08:46.120 --> 0:08:49.240
<v Speaker 3>new guidelines, all these things keep stacking up, right, And

0:08:49.280 --> 0:08:52.679
<v Speaker 3>so just because you've been through medical school and training, right,

0:08:52.720 --> 0:08:54.760
<v Speaker 3>we have lots of systems in place to help us

0:08:54.800 --> 0:08:57.880
<v Speaker 3>continue our education. But really the reaction to that has

0:08:57.920 --> 0:09:01.480
<v Speaker 3>been to sub in some cases sub sub specialize. So

0:09:01.520 --> 0:09:04.640
<v Speaker 3>to give you an example, I am a diagnostic radiologist,

0:09:04.640 --> 0:09:08.079
<v Speaker 3>so that's the bigger specialty, and then I specialize in

0:09:08.200 --> 0:09:11.640
<v Speaker 3>interventional radiology, which is an image guid to procedures basically,

0:09:12.080 --> 0:09:14.720
<v Speaker 3>and then I am further specialized in pediatric version of that.

0:09:15.080 --> 0:09:17.440
<v Speaker 3>So that's like a Russian nesting doll of specialties. And

0:09:17.480 --> 0:09:21.000
<v Speaker 3>you see that throughout healthcare. And that is partly due

0:09:21.040 --> 0:09:25.120
<v Speaker 3>to the complexity of care that's required for some patients,

0:09:25.160 --> 0:09:29.120
<v Speaker 3>but also it's due to the information tidle wave and

0:09:29.200 --> 0:09:31.600
<v Speaker 3>being able to hold all that in a human mind

0:09:32.000 --> 0:09:34.679
<v Speaker 3>right with all of our limitations, and so AI, I

0:09:34.760 --> 0:09:38.280
<v Speaker 3>think at least the work that we've been doing here

0:09:38.480 --> 0:09:43.680
<v Speaker 3>is starting to provide a counter narrative to needing to

0:09:43.720 --> 0:09:46.480
<v Speaker 3>be sub subspecialized in order to be able to manage

0:09:46.520 --> 0:09:48.800
<v Speaker 3>information and take really good care of your patients across

0:09:48.840 --> 0:09:52.760
<v Speaker 3>a wide variety of complex diagnoses. And I think that

0:09:52.760 --> 0:09:55.040
<v Speaker 3>that's really where the excitement is. I think right now

0:09:55.120 --> 0:09:59.360
<v Speaker 3>is can I use this system to augment my ability

0:09:59.360 --> 0:10:00.120
<v Speaker 3>to care for PAYP.

0:10:00.920 --> 0:10:05.360
<v Speaker 1>And why isn't AI more ubiquitous in medicine? And what

0:10:05.760 --> 0:10:08.680
<v Speaker 1>has been integration challenge up until now, Well.

0:10:08.480 --> 0:10:11.160
<v Speaker 3>There's a whole podcast just on that odds, I would say,

0:10:11.200 --> 0:10:14.439
<v Speaker 3>but the short version is that we have been an

0:10:14.480 --> 0:10:20.480
<v Speaker 3>incredibly skeptical discipline it's skeptical of new technology and at

0:10:20.480 --> 0:10:24.120
<v Speaker 3>the same time extraordinarily risk averse for good reason, right,

0:10:24.320 --> 0:10:28.800
<v Speaker 3>we require significant evidence, right to change the way we practice.

0:10:29.240 --> 0:10:31.680
<v Speaker 3>We have you know, as you know, clinical trials take

0:10:31.920 --> 0:10:35.199
<v Speaker 3>years and years, and some still fail, actually many fail,

0:10:35.320 --> 0:10:37.720
<v Speaker 3>and we accept that as the system that keeps our

0:10:37.760 --> 0:10:40.400
<v Speaker 3>patients safe and keeps us on the cutting edge. I

0:10:40.440 --> 0:10:44.120
<v Speaker 3>think in terms of just the technical mechanics of adoption,

0:10:44.360 --> 0:10:46.719
<v Speaker 3>we have a very rigid system in the software two

0:10:46.760 --> 0:10:49.880
<v Speaker 3>world that is changing. What's so again, what's so exciting

0:10:49.880 --> 0:10:53.439
<v Speaker 3>about this is that again any physician can pull out

0:10:53.480 --> 0:10:56.440
<v Speaker 3>their cell phone and interact with this cutting edge AI

0:10:56.559 --> 0:10:59.520
<v Speaker 3>without needing to have you know, three four year long

0:10:59.600 --> 0:11:03.120
<v Speaker 3>cycles of integration with software. Right, and it's just the

0:11:03.160 --> 0:11:04.880
<v Speaker 3>early days, but as of the trends that we're saying,

0:11:05.240 --> 0:11:05.520
<v Speaker 3>just to.

0:11:05.480 --> 0:11:09.000
<v Speaker 1>Take a step back, I guess the classic model of

0:11:09.440 --> 0:11:14.160
<v Speaker 1>measuring AI performance versus doctor performance was to present a

0:11:14.160 --> 0:11:18.120
<v Speaker 1>hard problem or a hard diagnostic conundrum and ask for

0:11:18.160 --> 0:11:21.280
<v Speaker 1>an answer and measure answer versus answer. How is that

0:11:21.280 --> 0:11:22.240
<v Speaker 1>different to what you've done?

0:11:22.520 --> 0:11:25.480
<v Speaker 4>Yeah, well it's even less precise than that.

0:11:25.559 --> 0:11:28.400
<v Speaker 3>So that the way up until now, at least for

0:11:28.520 --> 0:11:31.400
<v Speaker 3>large language models, when people talk about they have medical capabilities,

0:11:32.160 --> 0:11:35.320
<v Speaker 3>they were actually using medical examination questions.

0:11:35.320 --> 0:11:37.520
<v Speaker 4>So there's a question stem and then there's a multiple

0:11:37.600 --> 0:11:38.320
<v Speaker 4>choice answer.

0:11:39.080 --> 0:11:41.680
<v Speaker 3>That's not medicine, but it is how we you know,

0:11:41.800 --> 0:11:45.640
<v Speaker 3>qualify our humans, right, human physicians to be granted a

0:11:45.679 --> 0:11:48.080
<v Speaker 3>medical license, so that we think we kind of use

0:11:48.120 --> 0:11:50.600
<v Speaker 3>that for a long time as a as a surrogate

0:11:50.679 --> 0:11:52.360
<v Speaker 3>or a bell weather, But it wasn't.

0:11:52.480 --> 0:11:54.480
<v Speaker 1>Could it pause a test to be a doctor rather

0:11:54.520 --> 0:11:57.360
<v Speaker 1>than could it actually be effective at acting as a doctor.

0:11:57.440 --> 0:12:00.280
<v Speaker 3>That's interesting, right, And we were able to show very

0:12:00.280 --> 0:12:03.840
<v Speaker 3>early on with GPD four that these models outperform positions

0:12:03.840 --> 0:12:06.240
<v Speaker 3>on these multiple choice tests. But there's all kinds of

0:12:06.280 --> 0:12:10.080
<v Speaker 3>caveats there. Is that really medicine? Has it seen some

0:12:10.160 --> 0:12:11.959
<v Speaker 3>of that data and it's training assuredly?

0:12:12.080 --> 0:12:14.319
<v Speaker 4>Yes? Right? And is that useful?

0:12:14.360 --> 0:12:18.120
<v Speaker 3>I think those questions came up now in practice, it's

0:12:18.480 --> 0:12:22.000
<v Speaker 3>estimated that ten to twenty percent of AI interactions with

0:12:22.040 --> 0:12:27.200
<v Speaker 3>these common chatbots like GPT are around a medical use case.

0:12:27.200 --> 0:12:29.360
<v Speaker 3>So we know that there's someone is getting value out

0:12:29.400 --> 0:12:31.520
<v Speaker 3>of that somewhere, right, and we see it with our

0:12:31.520 --> 0:12:33.360
<v Speaker 3>own eyes. So how do we bridge the gap to

0:12:33.400 --> 0:12:37.439
<v Speaker 3>something a slightly more realistic in terms of not giving

0:12:37.440 --> 0:12:39.160
<v Speaker 3>you all the information up front, just like we would

0:12:39.360 --> 0:12:42.240
<v Speaker 3>in real healthcare. One of the principal thoughts around the

0:12:42.240 --> 0:12:45.880
<v Speaker 3>study was is there a way to take advantage of

0:12:45.880 --> 0:12:50.520
<v Speaker 3>the incredible capabilities that these models have in medical diagnosis.

0:12:49.880 --> 0:12:53.760
<v Speaker 4>And knowledge but also push it a bit further.

0:12:53.880 --> 0:12:56.400
<v Speaker 3>And not have it kind of just be a question

0:12:56.440 --> 0:12:59.240
<v Speaker 3>answering machine. And so we thought, can we kind of

0:12:59.280 --> 0:13:01.679
<v Speaker 3>have several versions of the model kind of act as

0:13:01.720 --> 0:13:04.400
<v Speaker 3>different humans or this is that idea of an agent,

0:13:04.760 --> 0:13:07.840
<v Speaker 3>and give them jobs. One job is to look at

0:13:07.840 --> 0:13:10.960
<v Speaker 3>the economics of the tests that you're trying to order.

0:13:11.000 --> 0:13:15.360
<v Speaker 3>One is to question your next decision point. So the

0:13:15.360 --> 0:13:17.559
<v Speaker 3>information isn't just in and out with one model, it's

0:13:17.600 --> 0:13:20.160
<v Speaker 3>actually in and out through a system of models. And

0:13:20.200 --> 0:13:22.120
<v Speaker 3>we showed that no matter what model you use, whether

0:13:22.160 --> 0:13:24.880
<v Speaker 3>it's Google's model, whether it's open the Eyes model, whether

0:13:24.880 --> 0:13:28.520
<v Speaker 3>it's an open source model, it improves that diagnostic capability

0:13:28.600 --> 0:13:32.240
<v Speaker 3>on these extraordinarily challenging diagnostic tests.

0:13:32.640 --> 0:13:35.320
<v Speaker 1>So you had ten co authors on this study, and

0:13:36.000 --> 0:13:38.080
<v Speaker 1>you know, as we talked about when it was released,

0:13:38.240 --> 0:13:40.600
<v Speaker 1>took the world by storm, and so, I mean, how

0:13:40.600 --> 0:13:44.160
<v Speaker 1>did you go about designing the study and what was

0:13:44.200 --> 0:13:46.800
<v Speaker 1>the hypothesis and what have you found?

0:13:47.200 --> 0:13:50.520
<v Speaker 3>So this was a cross Microsoft collaboration, but harsh and Noori,

0:13:50.559 --> 0:13:53.000
<v Speaker 3>who is the lead on this, really wanted to say,

0:13:53.240 --> 0:13:55.360
<v Speaker 3>you know, we have a lot of evidence that these

0:13:55.400 --> 0:13:59.559
<v Speaker 3>models perform well for these standardized tests, and then we

0:13:59.600 --> 0:14:03.320
<v Speaker 3>see the real world situation where that's not how people present.

0:14:03.400 --> 0:14:05.880
<v Speaker 3>They don't show up with hey, these are all my tests,

0:14:05.920 --> 0:14:07.320
<v Speaker 3>these are all my problems, and these are the four

0:14:07.440 --> 0:14:10.000
<v Speaker 3>choices of what I may have right. And then taking

0:14:10.120 --> 0:14:13.079
<v Speaker 3>what are essentially some of the most difficult questions out

0:14:13.120 --> 0:14:15.880
<v Speaker 3>of New England Journal and structuring them in a way

0:14:16.520 --> 0:14:20.120
<v Speaker 3>that requires a model to ask for more information or

0:14:20.200 --> 0:14:21.000
<v Speaker 3>order tests, just.

0:14:20.960 --> 0:14:21.840
<v Speaker 4>Like a physician would.

0:14:22.640 --> 0:14:25.080
<v Speaker 3>The hypothesis was that that would be interesting and of itself,

0:14:25.200 --> 0:14:27.080
<v Speaker 3>but then what if we also put humans through that

0:14:27.120 --> 0:14:32.680
<v Speaker 3>same system. In other words, here's the first step headache, Okay,

0:14:32.880 --> 0:14:33.680
<v Speaker 3>what do you do next?

0:14:33.680 --> 0:14:33.880
<v Speaker 1>Well?

0:14:33.920 --> 0:14:35.520
<v Speaker 4>Do I need to ask more questions? Do I need

0:14:35.520 --> 0:14:36.880
<v Speaker 4>to order a test, et cetera, et cetera.

0:14:37.520 --> 0:14:40.160
<v Speaker 3>One of the really brilliant outcomes here was by having

0:14:40.240 --> 0:14:43.320
<v Speaker 3>that system of agents as opposed to just the single model,

0:14:43.720 --> 0:14:48.240
<v Speaker 3>allowed us to have a more realistic understanding of the capabilities.

0:14:48.240 --> 0:14:50.560
<v Speaker 3>In other words, if I wanted to know the answer,

0:14:50.600 --> 0:14:52.960
<v Speaker 3>and I'm a chatbot, my answer could be, let's order

0:14:53.000 --> 0:14:56.080
<v Speaker 3>every single test that there is, and that would probably

0:14:56.080 --> 0:14:56.920
<v Speaker 3>get you the right answer.

0:14:57.160 --> 0:14:58.040
<v Speaker 4>Is that feasible?

0:14:58.560 --> 0:14:58.760
<v Speaker 2>No?

0:14:59.160 --> 0:14:59.320
<v Speaker 4>Right?

0:14:59.400 --> 0:14:59.480
<v Speaker 2>Ye?

0:15:00.000 --> 0:15:04.000
<v Speaker 3>So forcing it to think about resources cost of the

0:15:04.240 --> 0:15:07.120
<v Speaker 3>care actually found a very interesting what we would call

0:15:07.200 --> 0:15:13.040
<v Speaker 3>the pride or frontier of capability underconstrained resources. So they

0:15:13.040 --> 0:15:16.600
<v Speaker 3>were actually getting to an incredible diagnoses very very accurately,

0:15:17.240 --> 0:15:20.440
<v Speaker 3>but also cost efficiently, and that was really one of

0:15:20.440 --> 0:15:21.960
<v Speaker 3>the biggest takeaways from this work.

0:15:22.960 --> 0:15:25.680
<v Speaker 1>Can you just to make it more concrete for our listeners,

0:15:25.720 --> 0:15:28.560
<v Speaker 1>can you kind of set up one of these cases

0:15:28.800 --> 0:15:32.720
<v Speaker 1>as though an episode of House Dare I say, and

0:15:32.760 --> 0:15:35.920
<v Speaker 1>then what the human doctors did and what the AI

0:15:36.400 --> 0:15:38.840
<v Speaker 1>agents did, and then how you compare that performance.

0:15:39.320 --> 0:15:42.040
<v Speaker 3>Let's just say it was someone that had easy bleeding

0:15:42.120 --> 0:15:44.640
<v Speaker 3>that unexpected. They were brushing their teeth and they started

0:15:44.640 --> 0:15:46.560
<v Speaker 3>bleeding and it was kind of unusual, and they noticed

0:15:46.600 --> 0:15:48.440
<v Speaker 3>that they were getting a lot of bruising, and there's

0:15:48.480 --> 0:15:50.520
<v Speaker 3>just a certain battery of tests. I think that was

0:15:50.560 --> 0:15:53.960
<v Speaker 3>pretty comparable on both sides in terms of what they ordered.

0:15:54.320 --> 0:15:55.840
<v Speaker 3>But taking continued to.

0:15:55.840 --> 0:15:58.000
<v Speaker 1>Be what the AI ordered and what than human doctors.

0:15:57.680 --> 0:15:59.280
<v Speaker 4>Are human and AI pretty much right.

0:15:59.360 --> 0:16:01.480
<v Speaker 3>So the first few steps, I think a lot there

0:16:01.520 --> 0:16:05.080
<v Speaker 3>was a lot of similarity, which is expected. Where we

0:16:05.080 --> 0:16:08.680
<v Speaker 3>started to see early diversions was because of that agent setup.

0:16:09.040 --> 0:16:11.880
<v Speaker 3>Humans did kind of jump to more advanced tests more quickly,

0:16:11.880 --> 0:16:15.320
<v Speaker 3>more expensive tests, and that was interesting because the models

0:16:15.320 --> 0:16:17.040
<v Speaker 3>were able to kind of get to the next step

0:16:17.440 --> 0:16:19.720
<v Speaker 3>with a battery of less expensive tests. So we thought

0:16:19.720 --> 0:16:21.360
<v Speaker 3>that was a kind of an interesting starting to see

0:16:21.360 --> 0:16:24.000
<v Speaker 3>some divergence. And then, to be fair to the humans,

0:16:24.840 --> 0:16:27.680
<v Speaker 3>they're still kind of handcuffed. In other words, they're just

0:16:27.760 --> 0:16:31.600
<v Speaker 3>getting text feedback as they're interacting with the system, whereas

0:16:31.960 --> 0:16:34.680
<v Speaker 3>when I'm with a patient, I'm seeing them, I'm able

0:16:34.720 --> 0:16:37.640
<v Speaker 3>to kind of take some cues, I'm able to examine them.

0:16:37.640 --> 0:16:40.240
<v Speaker 3>So there was some limitations there, but then the less

0:16:40.240 --> 0:16:43.280
<v Speaker 3>once it got to the stage where you had a

0:16:43.280 --> 0:16:47.280
<v Speaker 3>differential diagnosis, so a list of likely things, more often

0:16:47.320 --> 0:16:49.720
<v Speaker 3>than not, the model was ranking them in a much

0:16:49.760 --> 0:16:53.440
<v Speaker 3>more data driven order that ultimately led to the correct

0:16:53.440 --> 0:16:56.560
<v Speaker 3>diagnosis much more quickly. Whereas you know, as us you

0:16:56.600 --> 0:16:58.600
<v Speaker 3>would with humans, with these limitations, you're kind of going

0:16:58.600 --> 0:17:02.040
<v Speaker 3>in some rabbit holes, you're maybe not ordering them in

0:17:02.120 --> 0:17:04.240
<v Speaker 3>the best order, and so you're kind of going down

0:17:04.280 --> 0:17:07.120
<v Speaker 3>other paths that end up increasing the time or expense

0:17:07.200 --> 0:17:08.920
<v Speaker 3>or potentially leading to the rown diagnosis.

0:17:15.720 --> 0:17:19.040
<v Speaker 2>After the break, how the multi agent system the diagnostic

0:17:19.200 --> 0:17:21.880
<v Speaker 2>orchestrator actually works stay with us.

0:17:38.240 --> 0:17:42.560
<v Speaker 1>I put the study through chet GPT describe the diagnostic

0:17:42.680 --> 0:17:45.720
<v Speaker 1>orchestrator as like a virtual team of five doctors, each

0:17:45.760 --> 0:17:49.520
<v Speaker 1>with a different role. One less possible illnesses, one chooses

0:17:49.520 --> 0:17:53.680
<v Speaker 1>the best tests, one plays devil's advocate, one watches the budget,

0:17:53.760 --> 0:17:56.359
<v Speaker 1>and one checks the quality of everything. The team talks

0:17:56.400 --> 0:17:58.440
<v Speaker 1>it out step by set, but decides what to do next.

0:17:58.520 --> 0:18:00.600
<v Speaker 1>Is that is that a fair summary? That's exactly right?

0:18:00.640 --> 0:18:03.000
<v Speaker 1>And you can have infinite numbers of those agents.

0:18:03.040 --> 0:18:05.560
<v Speaker 3>I think these five were just kind of a scratching

0:18:05.560 --> 0:18:08.439
<v Speaker 3>the surface of what's possible. I will say just quickly

0:18:08.480 --> 0:18:11.320
<v Speaker 3>that I was incredibly happy to see that the curmudgeon

0:18:11.359 --> 0:18:13.679
<v Speaker 3>agent we called it, or the Devil's advocate agent was

0:18:13.720 --> 0:18:17.280
<v Speaker 3>helpful because you get into these group things situations, and

0:18:17.320 --> 0:18:20.399
<v Speaker 3>it's kind of fun to watch a model argue with

0:18:20.520 --> 0:18:24.720
<v Speaker 3>other models about some of the decisions being made in

0:18:24.840 --> 0:18:28.280
<v Speaker 3>questioning the steps. So where the models fall short today

0:18:29.320 --> 0:18:32.640
<v Speaker 3>is outside of the text domain. And what I mean

0:18:32.680 --> 0:18:37.320
<v Speaker 3>by that is models are incredibly good at understanding medical

0:18:37.320 --> 0:18:41.320
<v Speaker 3>concepts as their communicated in text form, but when you

0:18:41.359 --> 0:18:44.280
<v Speaker 3>get into the images and genomics and waveforms and all

0:18:44.280 --> 0:18:46.240
<v Speaker 3>the other types of ways that we take care of

0:18:46.320 --> 0:18:51.720
<v Speaker 3>our patients, the models are vastly underperforming humans. And a

0:18:51.760 --> 0:18:53.520
<v Speaker 3>good example of that is if I needed to look

0:18:53.520 --> 0:18:56.400
<v Speaker 3>at a chest sexuray in one of these diagnostic steps

0:18:57.000 --> 0:18:58.840
<v Speaker 3>and the model had to interpret the chess sector, it

0:18:58.840 --> 0:19:01.520
<v Speaker 3>couldn't read the report actually had to look at the image,

0:19:01.880 --> 0:19:04.080
<v Speaker 3>it would fall short and fail nine times out of ten.

0:19:04.560 --> 0:19:07.159
<v Speaker 3>So we know that that's a significant gap. But on

0:19:07.200 --> 0:19:11.080
<v Speaker 3>the other hand, most healthcare right eighty percent of physician

0:19:11.400 --> 0:19:15.320
<v Speaker 3>or patients interaction with their healthcare systems involve some kind

0:19:15.359 --> 0:19:20.840
<v Speaker 3>of other information like a ECG or a biopsy path

0:19:21.040 --> 0:19:25.600
<v Speaker 3>slide right or a MRI for example. So I'm hoping

0:19:25.600 --> 0:19:28.639
<v Speaker 3>to see agents that have those competencies included into the mix,

0:19:29.320 --> 0:19:31.200
<v Speaker 3>or we can start to really get to a place

0:19:31.240 --> 0:19:35.640
<v Speaker 3>where the diagnostic environment meets how we're testing the systems.

0:19:36.160 --> 0:19:39.199
<v Speaker 1>There was a study last year which I was fascinated by.

0:19:39.280 --> 0:19:44.120
<v Speaker 1>Wish is that AI diagnosis in this study was better

0:19:44.200 --> 0:19:47.119
<v Speaker 1>than human plus AI. In other words, I was a study,

0:19:47.119 --> 0:19:49.399
<v Speaker 1>and you would assume, or you would hope, that a

0:19:49.400 --> 0:19:51.760
<v Speaker 1>doctor using AI would be better than just an AI

0:19:51.800 --> 0:19:56.280
<v Speaker 1>diagnosis alone. But in fact, the human plus AI model

0:19:56.400 --> 0:19:59.439
<v Speaker 1>was worse than the pure AI model. And one of

0:19:59.440 --> 0:20:02.239
<v Speaker 1>the conclusions from this was maybe that the doctors what

0:20:02.240 --> 0:20:04.120
<v Speaker 1>didn't want to listen to what AI was telling them.

0:20:04.119 --> 0:20:06.479
<v Speaker 1>But I mean, did you see that study and did

0:20:06.480 --> 0:20:07.240
<v Speaker 1>it give you pause?

0:20:07.680 --> 0:20:10.520
<v Speaker 3>For more than a decade we've been kind of dealing

0:20:10.560 --> 0:20:14.359
<v Speaker 3>with this unexpected result. This goes all again, all the

0:20:14.400 --> 0:20:16.440
<v Speaker 3>way back to the earliest days of applying at least

0:20:16.440 --> 0:20:19.760
<v Speaker 3>some of the powerful deep learning systems in healthcare, we

0:20:19.960 --> 0:20:23.840
<v Speaker 3>have consistently seen that, in other words, in whatever set

0:20:23.920 --> 0:20:26.640
<v Speaker 3>up the AI, if you just leave it alone, typically

0:20:26.640 --> 0:20:28.920
<v Speaker 3>does better than the human plus THEI or.

0:20:28.840 --> 0:20:29.640
<v Speaker 4>The human alone.

0:20:29.880 --> 0:20:34.719
<v Speaker 3>Now is that a indictment on the human ability or

0:20:34.760 --> 0:20:36.840
<v Speaker 3>is that more of a Have we set this up

0:20:36.880 --> 0:20:40.200
<v Speaker 3>in a way that either doesn't favor the real world,

0:20:40.640 --> 0:20:44.119
<v Speaker 3>or have we not figured out the ideal human computer

0:20:44.200 --> 0:20:46.639
<v Speaker 3>interaction or how we should be What task should we

0:20:46.680 --> 0:20:48.840
<v Speaker 3>be offloading to the system versus the task that we

0:20:48.880 --> 0:20:51.280
<v Speaker 3>should be collaborating with the system on I think that's

0:20:51.320 --> 0:20:54.560
<v Speaker 3>really where the exploration is that I'm interested in, because

0:20:54.600 --> 0:20:59.920
<v Speaker 3>I still hold out hope and sort of some sense

0:21:00.240 --> 0:21:03.480
<v Speaker 3>of self preservation, but that there is a future where

0:21:03.520 --> 0:21:06.720
<v Speaker 3>the two are better. Just how to offload what job

0:21:07.440 --> 0:21:11.240
<v Speaker 3>and in what sort of system that ultimately becomes. Maybe

0:21:11.320 --> 0:21:14.880
<v Speaker 3>it's five agents, maybe it's ten, maybe it's a thousand.

0:21:15.080 --> 0:21:17.119
<v Speaker 3>You know, we don't know the answer yet. We're just

0:21:17.119 --> 0:21:20.639
<v Speaker 3>barely scratching the surface. But in three years time, I

0:21:20.680 --> 0:21:23.920
<v Speaker 3>expect this to be fairly common, that clinicians of all

0:21:23.960 --> 0:21:28.080
<v Speaker 3>types will be working alongside and or even consulting with

0:21:28.119 --> 0:21:30.160
<v Speaker 3>some of these systems for their care their patients.

0:21:30.480 --> 0:21:33.040
<v Speaker 1>And what is the adoption rate today? I mean, how

0:21:33.119 --> 0:21:36.240
<v Speaker 1>far what would need to happen for this, you know

0:21:36.440 --> 0:21:38.680
<v Speaker 1>paper that you've written in the system that you developed

0:21:38.680 --> 0:21:42.840
<v Speaker 1>to be widely deployed in US or global healthcare.

0:21:42.680 --> 0:21:45.960
<v Speaker 3>In a very practical sense, there is a lot of

0:21:46.000 --> 0:21:50.399
<v Speaker 3>regulation around this, and regulation requires very rigorous study and

0:21:50.440 --> 0:21:52.600
<v Speaker 3>evidence and real world deployment, all the things that you

0:21:52.600 --> 0:21:55.439
<v Speaker 3>would expect right if you're you know, care team is

0:21:55.600 --> 0:21:57.520
<v Speaker 3>using some of these things to take to take care

0:21:57.520 --> 0:22:00.720
<v Speaker 3>of you and your health problems generating that evidence, working

0:22:00.720 --> 0:22:03.760
<v Speaker 3>with policy makers, trying to figure out exactly what evidence

0:22:04.520 --> 0:22:07.320
<v Speaker 3>would get to the point where we can say definitively

0:22:07.359 --> 0:22:10.879
<v Speaker 3>this is at standard of care or beyond and it

0:22:10.960 --> 0:22:13.720
<v Speaker 3>should be used and here's how you use it. Those

0:22:13.720 --> 0:22:17.040
<v Speaker 3>are very mechanical, but they're very important. It may also

0:22:17.160 --> 0:22:20.479
<v Speaker 3>require a change in how we approach the regulation of

0:22:20.520 --> 0:22:24.400
<v Speaker 3>medical software because these kinds of systems are challenging our

0:22:24.440 --> 0:22:27.240
<v Speaker 3>traditional software that we have used for decades in healthcare.

0:22:27.320 --> 0:22:28.520
<v Speaker 4>Right, they're very different.

0:22:28.560 --> 0:22:32.520
<v Speaker 3>They're non deterministic, they have moments of brilliance and moments

0:22:32.560 --> 0:22:35.360
<v Speaker 3>of you know, stupidity. I should say, right, you've seen

0:22:35.400 --> 0:22:38.159
<v Speaker 3>these kind of things, and so how do we actually

0:22:38.200 --> 0:22:41.760
<v Speaker 3>design a system where it's safe, effective, and actually improving outcomes?

0:22:41.800 --> 0:22:44.199
<v Speaker 4>And that's ultimately the evidence we have to generate.

0:22:44.640 --> 0:22:48.240
<v Speaker 1>Yeah, I mean beyond stupid mistakes. How do you see

0:22:48.280 --> 0:22:51.479
<v Speaker 1>the risks here? I mean we're seeing this research around

0:22:51.560 --> 0:22:54.600
<v Speaker 1>you know the problems of cognitive offloading with AI, some

0:22:54.600 --> 0:22:57.560
<v Speaker 1>suggestions that if you use AI too much you become

0:22:57.640 --> 0:23:01.120
<v Speaker 1>dumba and deskill yourself. I mean, is there a risk

0:23:01.200 --> 0:23:03.960
<v Speaker 1>of de skilling doctors? Like what are some of the

0:23:04.840 --> 0:23:08.800
<v Speaker 1>maybe intangible but nonetheless medium time risks that we should

0:23:08.800 --> 0:23:09.720
<v Speaker 1>be considering here.

0:23:10.160 --> 0:23:13.080
<v Speaker 3>What they refer to a skill atrophy as real, and

0:23:13.400 --> 0:23:16.200
<v Speaker 3>we've seen this in various other disciplines too. I think

0:23:16.840 --> 0:23:22.160
<v Speaker 3>it will also require a shift in how we think

0:23:22.240 --> 0:23:25.720
<v Speaker 3>and perform our knowledge work jobs. And in one way

0:23:25.760 --> 0:23:27.480
<v Speaker 3>this has been sort of looked at is via the

0:23:27.520 --> 0:23:31.119
<v Speaker 3>idea of meta cognition. So rather than you having to

0:23:31.119 --> 0:23:34.760
<v Speaker 3>be the central source of decision making, are there things

0:23:34.760 --> 0:23:37.760
<v Speaker 3>that you can manage? So the imagine you managing a

0:23:37.840 --> 0:23:41.679
<v Speaker 3>team of a these agents. You have a goal, but

0:23:41.760 --> 0:23:45.360
<v Speaker 3>you're offloading some of the cognitive tasks to those agents.

0:23:45.720 --> 0:23:49.199
<v Speaker 3>Those are some of the early discussions around it. But

0:23:49.760 --> 0:23:54.639
<v Speaker 3>I fundamentally believe that everyone that's in a knowledge work

0:23:54.720 --> 0:23:58.960
<v Speaker 3>industry or role will have to rethink how that role

0:23:59.359 --> 0:24:01.359
<v Speaker 3>evolves in the future. And this is kind of that

0:24:01.400 --> 0:24:03.600
<v Speaker 3>first step, at least for us in the healthcare space,

0:24:03.640 --> 0:24:07.200
<v Speaker 3>which is that do you need to memorize all these

0:24:07.200 --> 0:24:09.680
<v Speaker 3>facts or do you just need to be able to

0:24:09.760 --> 0:24:12.520
<v Speaker 3>have the right judgment and know which where the models

0:24:12.800 --> 0:24:14.640
<v Speaker 3>are good and not good, and be able to fill

0:24:14.640 --> 0:24:16.399
<v Speaker 3>those gaps and manage it like you would a team,

0:24:16.440 --> 0:24:20.000
<v Speaker 3>like any manager would know one oh one, this person's

0:24:20.000 --> 0:24:21.879
<v Speaker 3>good at this, they have some weaknesses here, right, I'm

0:24:21.920 --> 0:24:24.119
<v Speaker 3>going to sign this task to them versus this.

0:24:24.280 --> 0:24:24.520
<v Speaker 4>You know.

0:24:25.000 --> 0:24:28.880
<v Speaker 3>That's that metacognition world that where I think we're rapidly

0:24:28.880 --> 0:24:30.960
<v Speaker 3>heading towards, and healthcare is going to have to figure

0:24:31.000 --> 0:24:32.440
<v Speaker 3>out a way to do that as well.

0:24:32.960 --> 0:24:37.200
<v Speaker 1>Yeah, I mean the other question around deployment is conflict

0:24:37.240 --> 0:24:40.480
<v Speaker 1>of interest. Right, So the previous research I've seen is

0:24:40.520 --> 0:24:44.200
<v Speaker 1>all around AI versus human doctors. But this element you've

0:24:44.200 --> 0:24:47.000
<v Speaker 1>added is to cost as well. It's not just outperforming,

0:24:47.040 --> 0:24:50.479
<v Speaker 1>but it's outperforming at less costs in terms of tests.

0:24:50.560 --> 0:24:54.320
<v Speaker 1>Is a really interesting element, but it adds the potential

0:24:54.440 --> 0:24:56.320
<v Speaker 1>for major conflict of interests on both sides.

0:24:56.400 --> 0:24:56.520
<v Speaker 2>Right.

0:24:56.560 --> 0:24:59.920
<v Speaker 1>So for example, I'm British, grew up with the NH

0:25:00.520 --> 0:25:03.600
<v Speaker 1>and one of the consistent themes of the NHS was

0:25:04.119 --> 0:25:09.920
<v Speaker 1>death panels. Are there bureaucrats deciding when people should die?

0:25:09.400 --> 0:25:12.439
<v Speaker 1>What is the appropriate level of care to give to

0:25:12.520 --> 0:25:15.000
<v Speaker 1>people to prevent them from dying, given that it's a

0:25:15.080 --> 0:25:17.720
<v Speaker 1>drain on a public budget. That's in the UK. Here

0:25:17.720 --> 0:25:21.159
<v Speaker 1>in the US we have these for profit healthcare model

0:25:21.240 --> 0:25:24.879
<v Speaker 1>where there is an incentive which if you're insured do

0:25:24.920 --> 0:25:28.159
<v Speaker 1>you sometimes worry about that your position or healthcare system

0:25:28.720 --> 0:25:32.120
<v Speaker 1>is pushing you through numerous medical procedures because ultimately it's

0:25:32.119 --> 0:25:34.560
<v Speaker 1>a profit center and you may not actually need them.

0:25:34.960 --> 0:25:38.640
<v Speaker 1>So how do you begin to grapple with those problems

0:25:38.680 --> 0:25:40.120
<v Speaker 1>when you think about a system like this.

0:25:41.200 --> 0:25:43.840
<v Speaker 3>These are problems that have existed even as you know

0:25:43.960 --> 0:25:49.480
<v Speaker 3>before before AI, and I think that the responsibility for

0:25:50.000 --> 0:25:53.199
<v Speaker 3>those of us kind of generating the evidence and the capabilities,

0:25:53.200 --> 0:25:56.520
<v Speaker 3>and kind of displaying the rationale behind how these things

0:25:56.560 --> 0:26:00.200
<v Speaker 3>work or don't work and where they work is a

0:26:00.200 --> 0:26:04.199
<v Speaker 3>conversation entirely from both the economics and the cultural societal

0:26:04.720 --> 0:26:07.200
<v Speaker 3>aspects of just how we deliber care. I think it

0:26:07.240 --> 0:26:09.639
<v Speaker 3>wouldn't be a controversial statement to say that, at least

0:26:09.640 --> 0:26:13.880
<v Speaker 3>from the US perspective, that our healthcare system is not ideal. Right,

0:26:14.240 --> 0:26:16.320
<v Speaker 3>And that's true whether you're in a capitated system, a

0:26:16.320 --> 0:26:20.080
<v Speaker 3>fee for service system, or a government based system like

0:26:20.119 --> 0:26:22.919
<v Speaker 3>the VA. I have to hope that the better angels

0:26:23.160 --> 0:26:26.040
<v Speaker 3>prevail here. But I agree with you and share your

0:26:26.080 --> 0:26:30.000
<v Speaker 3>concerns that in the wrong hands, or with the various

0:26:30.320 --> 0:26:33.840
<v Speaker 3>misalignments that happen in these systems at all different levels,

0:26:34.600 --> 0:26:38.240
<v Speaker 3>we could end up causing some disruption in a way

0:26:38.280 --> 0:26:39.560
<v Speaker 3>that we aren't hoping to see.

0:26:39.560 --> 0:26:42.040
<v Speaker 1>In the end, it was an interesting blog post that

0:26:42.160 --> 0:26:46.520
<v Speaker 1>you wrote around cancer care, and it al really struck

0:26:46.600 --> 0:26:48.560
<v Speaker 1>me because I mean, I don't want to put the

0:26:48.600 --> 0:26:51.520
<v Speaker 1>words into your mouth, but as I read it, if

0:26:51.560 --> 0:26:53.840
<v Speaker 1>you are one of the lucky few who gets to

0:26:53.880 --> 0:26:56.960
<v Speaker 1>go to one of the great cancer centers like M

0:26:57.040 --> 0:27:00.680
<v Speaker 1>d Anderson when you're sick with cancer and have access

0:27:00.760 --> 0:27:04.520
<v Speaker 1>to these cross field panel of experts who, as you

0:27:04.560 --> 0:27:08.159
<v Speaker 1>mentioned earlier, sub or sub sub or even subsub sub specialized,

0:27:08.920 --> 0:27:12.240
<v Speaker 1>you have a measurably better outcome, whereas in fact, most

0:27:12.240 --> 0:27:15.560
<v Speaker 1>people in the US and certainly almost all people practically

0:27:15.600 --> 0:27:20.040
<v Speaker 1>speaking globally don't have access to these cancer centers. Talk

0:27:20.080 --> 0:27:23.080
<v Speaker 1>about that and about this idea of the multi agentic

0:27:23.560 --> 0:27:26.639
<v Speaker 1>AI and how it sort of reflects or refracts what

0:27:26.680 --> 0:27:28.800
<v Speaker 1>we've been talking about with the diagnosis piece.

0:27:29.040 --> 0:27:31.720
<v Speaker 3>Yeah, this was a very important So, yeah, thanks for

0:27:31.720 --> 0:27:33.399
<v Speaker 3>bringing this up. I think a lot of people don't

0:27:34.000 --> 0:27:35.919
<v Speaker 3>know some of the inner workings of healthcare where some

0:27:35.960 --> 0:27:38.240
<v Speaker 3>of the really big bottlenecks are in terms of getting

0:27:38.280 --> 0:27:40.720
<v Speaker 3>the best possible outcome, and one of them is in

0:27:40.800 --> 0:27:43.960
<v Speaker 3>cancer care, as you're pointing out where some of the

0:27:44.280 --> 0:27:48.240
<v Speaker 3>leading centers, and in particularly larger cities, they have the

0:27:48.320 --> 0:27:53.000
<v Speaker 3>ability to bring specialists from all different disciplines together to

0:27:53.080 --> 0:27:56.359
<v Speaker 3>discuss the patient's care, and that's called the tumor boarder

0:27:56.480 --> 0:28:00.919
<v Speaker 3>multidimary tumor board. The reason that not everyone can do

0:28:01.000 --> 0:28:03.320
<v Speaker 3>that is not just because they don't maybe have that

0:28:03.400 --> 0:28:06.560
<v Speaker 3>specialist in house, but also because of the massive amount

0:28:06.600 --> 0:28:09.520
<v Speaker 3>of prep time it takes to gather all the information.

0:28:10.080 --> 0:28:12.640
<v Speaker 3>It's not just the patient's data that you have to gather.

0:28:12.760 --> 0:28:16.080
<v Speaker 3>You have to gather what clinical trials are new and

0:28:16.119 --> 0:28:18.800
<v Speaker 3>availed in this patient eligible for, what does the latest

0:28:18.840 --> 0:28:21.960
<v Speaker 3>literature say, And someone has to go through all that information,

0:28:22.480 --> 0:28:25.159
<v Speaker 3>prepare that and then present it to a group. And

0:28:25.240 --> 0:28:27.879
<v Speaker 3>what we found from the ASCO, which is the large

0:28:28.600 --> 0:28:32.080
<v Speaker 3>society in cancer care in the US, was that it

0:28:32.200 --> 0:28:33.760
<v Speaker 3>takes between two and a half and three and a

0:28:33.800 --> 0:28:37.920
<v Speaker 3>half hours of preparation time per patient, and some centers

0:28:38.000 --> 0:28:40.880
<v Speaker 3>run thousands of these tumor boards a year, and those

0:28:40.920 --> 0:28:43.240
<v Speaker 3>are the ones that have the most resources and certainly

0:28:43.240 --> 0:28:47.800
<v Speaker 3>the most access. The idea of AI, fundamentally for me,

0:28:47.880 --> 0:28:50.160
<v Speaker 3>and the reason I'm in this field is that I

0:28:50.200 --> 0:28:55.720
<v Speaker 3>want to democratize that experience for everyone, increase the access.

0:28:56.120 --> 0:28:57.840
<v Speaker 3>So no matter where you live or mate matter what

0:28:57.840 --> 0:29:00.280
<v Speaker 3>you do for a living, should that same level of

0:29:00.880 --> 0:29:03.800
<v Speaker 3>precision when it comes to your healthcare. And so this

0:29:04.000 --> 0:29:06.320
<v Speaker 3>was that first step to that. I do think this

0:29:06.360 --> 0:29:09.160
<v Speaker 3>is going to continue to evolve back to our conversation

0:29:09.280 --> 0:29:13.360
<v Speaker 3>around managing a team of experts as your primary physician,

0:29:13.440 --> 0:29:17.160
<v Speaker 3>could I call on a team of expert agents to

0:29:17.280 --> 0:29:19.360
<v Speaker 3>help walk through some of the things that we might

0:29:19.480 --> 0:29:21.800
<v Speaker 3>not be considering in our fifteen minutes we have together

0:29:21.880 --> 0:29:24.840
<v Speaker 3>once every six months or whatever that looks like. I'm

0:29:24.960 --> 0:29:28.480
<v Speaker 3>very hopeful that given the right circumstances in the way

0:29:28.520 --> 0:29:31.880
<v Speaker 3>the technology is progressing, we're going to get to a place,

0:29:31.920 --> 0:29:35.200
<v Speaker 3>I think, in a perfect world at least where the

0:29:35.240 --> 0:29:38.680
<v Speaker 3>access for every patient is equivalent to those who may

0:29:38.680 --> 0:29:40.440
<v Speaker 3>have access to the best resources.

0:29:41.320 --> 0:29:44.120
<v Speaker 1>I mean, you mentioned that twenty percent of all AI

0:29:44.280 --> 0:29:46.480
<v Speaker 1>search is or op to twenty percent of a AI

0:29:46.560 --> 0:29:50.520
<v Speaker 1>searches are around medicine, which is fascinating. I didn't know that.

0:29:51.160 --> 0:29:53.960
<v Speaker 1>But there are of course other people who don't want

0:29:54.000 --> 0:29:56.680
<v Speaker 1>AI in the healthcare settings, or who worried about their

0:29:56.760 --> 0:30:00.440
<v Speaker 1>human docture or their primary care physician being replaced by

0:30:01.160 --> 0:30:03.840
<v Speaker 1>an unfeeling machine. What do you say to them?

0:30:04.040 --> 0:30:06.040
<v Speaker 3>It's interesting, I think going all the way back to

0:30:06.080 --> 0:30:09.440
<v Speaker 3>the earliest days of search, that stat was still about

0:30:09.440 --> 0:30:13.160
<v Speaker 3>the same. Up to twenty percent of Internet searches were

0:30:13.600 --> 0:30:17.479
<v Speaker 3>healthcare related. And we're seeing two interesting trends. One from

0:30:17.520 --> 0:30:19.920
<v Speaker 3>the economists that showed that these searches that are going

0:30:19.960 --> 0:30:22.800
<v Speaker 3>on today in the typical search engines, the one that's

0:30:22.840 --> 0:30:27.800
<v Speaker 3>going down the fastest is healthcare. Isn't that interesting because

0:30:28.200 --> 0:30:29.080
<v Speaker 3>where are people going?

0:30:29.120 --> 0:30:31.960
<v Speaker 4>Then? Well, they're probably going to the models. So I

0:30:32.000 --> 0:30:33.080
<v Speaker 4>actually push back on that.

0:30:33.120 --> 0:30:36.520
<v Speaker 3>I think that most people want to be educated about

0:30:36.560 --> 0:30:38.960
<v Speaker 3>their medical condition, and they want to be they want

0:30:38.960 --> 0:30:41.959
<v Speaker 3>to feel safe and free to ask questions about their

0:30:42.000 --> 0:30:48.280
<v Speaker 3>own healthcare and essentially infinitely patient knowledgeable sort of oracle environment.

0:30:48.400 --> 0:30:50.920
<v Speaker 3>And again we're not there yet, so I don't want

0:30:50.920 --> 0:30:54.720
<v Speaker 3>to make that claim. But I even me, I put

0:30:54.760 --> 0:30:57.360
<v Speaker 3>my data into these models and ask questions about it,

0:30:57.360 --> 0:30:59.680
<v Speaker 3>and I walk away sometimes learning something or at least

0:31:00.160 --> 0:31:03.040
<v Speaker 3>what I should be asking my physicians. So again, would

0:31:03.080 --> 0:31:05.760
<v Speaker 3>I rather do that than any healthcare Not me personally,

0:31:06.080 --> 0:31:08.920
<v Speaker 3>I do want to have that relationship with my physicians,

0:31:08.920 --> 0:31:11.120
<v Speaker 3>but I also want to walk in much more knowledgeable,

0:31:11.560 --> 0:31:13.360
<v Speaker 3>so I feel like we're on a pure level when

0:31:13.360 --> 0:31:15.600
<v Speaker 3>we're speaking about my care decisions.

0:31:16.440 --> 0:31:17.800
<v Speaker 4>Matt, thank you, Thank you so much.

0:31:17.800 --> 0:31:43.600
<v Speaker 2>As that's it for this week for tech Stuff. I'm

0:31:43.680 --> 0:31:44.760
<v Speaker 2>Cara Price and.

0:31:44.640 --> 0:31:47.520
<v Speaker 1>I'm as Valosha And this episode was produced by Eliza Dennis,

0:31:47.560 --> 0:31:50.720
<v Speaker 1>Tyler Hill and Melissa Slaughter. It was executive produced by

0:31:50.760 --> 0:31:54.920
<v Speaker 1>me Cara Price and Kate Osborne for Kaleidoscope and Katria

0:31:55.040 --> 0:31:59.000
<v Speaker 1>Novelle for iHeart Podcast. The Engineer is Behit Fraser and

0:31:59.120 --> 0:32:02.840
<v Speaker 1>Jack Insley mixed this episode. Kyle Murdoch wrote our theme song.

0:32:03.320 --> 0:32:05.920
<v Speaker 1>Please do rate, review and reach out to us at

0:32:05.920 --> 0:32:08.480
<v Speaker 1>tech Stuff Podcast at gmail dot com. We love hearing

0:32:08.520 --> 0:32:08.800
<v Speaker 1>from you.