WEBVTT - Can Your Phone Tell When You're Getting Sick? 0:00:15.356 --> 0:00:22.596 Pushkin. There are a lot of reasons that I'm excited 0:00:22.636 --> 0:00:25.836 about today's show. I'm going to tell you three right now. 0:00:26.796 --> 0:00:30.836 Number one, the show is about this whole dimension of 0:00:30.996 --> 0:00:37.596 medicine that I essentially didn't know existed: acoustic biomarkers, basically 0:00:38.076 --> 0:00:41.996 using a person's voice to assess their health. Second thing 0:00:42.036 --> 0:00:45.076 I'm excited about: the show is about the intersection of 0:00:45.436 --> 0:00:51.196 AI and healthcare, one of my top, say, five intersections. 0:00:51.676 --> 0:00:57.316 Love that intersection. And three, today's guest, Dr. Yael Bensusan, 0:00:57.796 --> 0:01:01.436 gave me what was truly the best excuse that anyone 0:01:01.436 --> 0:01:04.076 has ever given me for canceling an interview at the 0:01:04.156 --> 0:01:04.916 last minute. 0:01:05.156 --> 0:01:07.116 Yeah, so I'm really sorry for having to cancel on 0:01:07.156 --> 0:01:07.916 you yesterday. 0:01:09.196 --> 0:01:12.756 What was the, what was the surgery you had to 0:01:12.756 --> 0:01:13.356 do yesterday? 0:01:13.596 --> 0:01:17.916 So yesterday, we call it airway surgery, where I take 0:01:17.956 --> 0:01:20.076 a patient to the OR and I have to open 0:01:20.196 --> 0:01:22.996 up their windpipe, or their trachea, because there's scar 0:01:23.116 --> 0:01:26.716 tissue that's blocking them from breathing. So I have to 0:01:26.716 --> 0:01:29.356 go with a laser and cut the scar tissue out 0:01:30.276 --> 0:01:32.796 and then take a balloon and open up their windpipe 0:01:32.916 --> 0:01:35.996 so that they can wake up and breathe better, and 0:01:36.076 --> 0:01:38.756 that translates to a different sound when they're breathing. 0:01:38.836 --> 0:01:39.516 So when they're not
0:01:39.596 --> 0:01:41.956 breathing because of the scar tissue, it can sound like, 0:01:43.116 --> 0:01:45.756 you know, very noisy breathing. We call it the Darth 0:01:45.836 --> 0:01:49.236 Vader breathing. And then when they wake up 0:01:49.236 --> 0:01:51.956 from surgery and they're done, they have silent breathing, which 0:01:51.956 --> 0:01:53.916 means that I know that I did a good job. 0:02:00.236 --> 0:02:02.556 I'm Jacob Goldstein, and this is What's Your Problem, the 0:02:02.596 --> 0:02:05.156 show where I talk to people who are trying to make 0:02:05.236 --> 0:02:09.796 technological progress. Dr. Yael Bensusan runs the Health 0:02:09.916 --> 0:02:13.076 Voice Center at the University of South Florida, and she 0:02:13.156 --> 0:02:16.596 is also leading a team of researchers that's building a 0:02:16.716 --> 0:02:21.316 giant database of human voices and breaths and health information. 0:02:22.116 --> 0:02:25.236 Her problem is this: how do you record the voices 0:02:25.276 --> 0:02:29.876 of thousands of people without violating patient privacy laws while 0:02:29.876 --> 0:02:34.276 building a giant public database that could someday allow your 0:02:34.356 --> 0:02:37.996 phone to warn you, based solely on your voice, that 0:02:38.036 --> 0:02:40.996 you may be getting sick? Yael told me that 0:02:41.116 --> 0:02:43.996 she got into this field in part because she used 0:02:43.996 --> 0:02:44.636 to be a singer. 0:02:46.636 --> 0:02:50.316 So growing up, you know, I always was in 0:02:50.356 --> 0:02:53.116 a very musical family. I took singing lessons when I 0:02:53.196 --> 0:02:57.636 was a kid, and then I started singing more professionally 0:02:57.996 --> 0:03:00.956 around eighteen years old, and I had a short but 0:03:01.076 --> 0:03:05.356 exciting singing career. I wrote pop folk music. We had 0:03:05.356 --> 0:03:08.236 a band, and we toured. We had an album out 0:03:08.276 --> 0:03:12.316 in two thousand and twelve. 
Yeah, and I mean it 0:03:12.436 --> 0:03:15.836 was a lot of fun. And actually the reason I 0:03:15.956 --> 0:03:18.876 was able to have that short and exciting career was 0:03:19.276 --> 0:03:21.876 because I met a speech pathologist when I was fifteen. 0:03:21.956 --> 0:03:25.836 So I was taking singing classes and one day my 0:03:25.916 --> 0:03:28.036 teacher looked at me and she said, there's something wrong 0:03:28.076 --> 0:03:28.756 with your voice. 0:03:28.996 --> 0:03:29.876 Go get checked. 0:03:31.716 --> 0:03:35.236 And I met a laryngologist who put his camera down 0:03:35.636 --> 0:03:37.996 and she said, you have nodules on your vocal cords 0:03:38.036 --> 0:03:39.956 and you might not be able to sing again if 0:03:39.996 --> 0:03:42.316 you don't take this seriously. And I went to see 0:03:42.316 --> 0:03:45.236 a speech pathologist. I did rehabilitation with my voice for 0:03:45.316 --> 0:03:47.596 six months, and I was able to sing again. And 0:03:47.636 --> 0:03:49.916 I mean, that's what led me to then become a 0:03:49.916 --> 0:03:54.156 speech pathologist growing up, and then eventually go to med 0:03:54.156 --> 0:03:56.476 school and then decide to become a laryngologist. 0:03:56.756 --> 0:03:58.596 So it was kind of all interconnected. 0:03:59.916 --> 0:04:03.676 So I know that your research now, and most of 0:04:04.356 --> 0:04:08.716 what I'm really interested to talk with you about, 0:04:08.716 --> 0:04:12.836 is around acoustic biomarkers. So just to start, I mean, 0:04:13.796 --> 0:04:15.196 what's an acoustic biomarker? 0:04:16.076 --> 0:04:19.116 Very good question. So what is a biomarker, first? A 0:04:19.116 --> 0:04:24.196 biomarker is something that indicates the presence of a disease, right? 0:04:25.396 --> 0:04:28.836 So if you think about a biomarker for a cancer, 0:04:28.916 --> 0:04:31.716 so different cancers have different types of biomarkers. 
For example, 0:04:31.756 --> 0:04:35.476 for ovarian cancer, we're looking for a specific thing, you know, 0:04:35.676 --> 0:04:38.756 called CA in your blood. For different types of cancers, 0:04:38.756 --> 0:04:41.316 they could take a blood draw and find a specific biomarker. 0:04:41.316 --> 0:04:44.676 It's an indicator of a disease. An acoustic biomarker is 0:04:45.116 --> 0:04:47.756 something that can indicate the presence of a disease, but 0:04:47.836 --> 0:04:50.316 that you can hear. So that's the definition of an 0:04:50.356 --> 0:04:54.036 acoustic biomarker. So I always say, you know, when you 0:04:54.156 --> 0:04:57.676 have people in your family that are not well, you 0:04:57.716 --> 0:05:00.836 will always notice first and you'll say, "You don't sound 0:05:00.876 --> 0:05:04.996 good," right, or "You sound funny." And I have the 0:05:05.116 --> 0:05:08.036 luxury to know that because I'm a voice doctor. So 0:05:08.196 --> 0:05:10.756 then people will bring me their family members, or people 0:05:10.796 --> 0:05:13.356 will come saying, I don't know what's wrong with me, 0:05:13.796 --> 0:05:16.276 but my wife told me to come because my voice 0:05:16.316 --> 0:05:20.756 is not good. And sometimes it's because their vocal cords 0:05:20.916 --> 0:05:23.116 are not working, but a lot of times it's because 0:05:23.156 --> 0:05:25.956 they can have a neurological issue or a cardiac issue 0:05:26.316 --> 0:05:27.956 that is affecting their voice. 0:05:28.196 --> 0:05:34.036 So, more broadly, what's going on with AI and acoustic biomarkers? 0:05:34.756 --> 0:05:37.436 Yeah, so, so many exciting things are going on. I 0:05:37.476 --> 0:05:42.036 think that's the first answer. There are so many startups, 0:05:42.076 --> 0:05:46.876 so many companies, industry researchers, academic researchers that are working 0:05:46.876 --> 0:05:50.956 and looking into voice AI. And the reason is it's 0:05:50.996 --> 0:05:54.916 really cheap to collect. 
Right, think about this: if 0:05:54.916 --> 0:05:56.796 you have a phone, it's really cheap to collect. Compared 0:05:56.836 --> 0:05:56.996 to this. 0:05:57.036 --> 0:05:59.556 You don't have to take a blood sample. You have, 0:05:59.636 --> 0:06:02.276 exactly, you've got the phone. You've got the device 0:06:02.796 --> 0:06:05.236 literally in your hand already. All you have to do 0:06:05.356 --> 0:06:07.636 is talk, and you're talking already. 0:06:07.436 --> 0:06:09.756 And you're talking already, so it's cheap. That's why 0:06:09.836 --> 0:06:12.956 pharmaceutical industries are also very interested, and there's a lot 0:06:12.956 --> 0:06:16.156 of pharmaceutical projects around it. So there are a lot 0:06:16.276 --> 0:06:21.196 of projects that are going on, and the state, or 0:06:21.236 --> 0:06:24.476 the current landscape, is that there's tons of people 0:06:24.556 --> 0:06:29.316 working on very similar things and very interesting and various diseases. 0:06:29.356 --> 0:06:31.876 So I kind of categorize them in three 0:06:31.916 --> 0:06:36.076 categories of diseases that are being studied. One is the 0:06:36.276 --> 0:06:41.276 disease that affects the voice box. Okay, so vocal cord paralysis, absolutely, 0:06:41.316 --> 0:06:43.756 it's intuitive. There's going to be vocal biomarkers in that. 0:06:44.476 --> 0:06:48.356 Voice box cancer, right, that's easy. Then there's voice- 0:06:48.396 --> 0:06:52.316 and speech-affecting disorders, so disorders that don't affect the 0:06:52.396 --> 0:06:55.436 voice box, but that have an impact on the voice 0:06:55.436 --> 0:06:59.316 and the speech. Parkinson's is one of them, right, Alzheimer's 0:06:59.356 --> 0:07:01.996 is one of them. A stroke: somebody having a stroke, 0:07:02.156 --> 0:07:04.076 they don't have a problem with their voice box, but 0:07:04.116 --> 0:07:06.236 their speech is going to be altered. So these are 0:07:06.316 --> 0:07:09.476 voice- and speech-affecting conditions. 
So lots of work is 0:07:09.476 --> 0:07:11.916 being done in that field. And the third one is 0:07:12.116 --> 0:07:15.636 diseases that you don't think would affect speech, but still 0:07:15.676 --> 0:07:17.556 people are doing research on that. So there was a 0:07:17.596 --> 0:07:21.316 really interesting study on diabetes. They're saying that there was 0:07:21.356 --> 0:07:24.356 a group that published that they could diagnose people that 0:07:24.396 --> 0:07:28.516 were diabetic versus non-diabetic based on their speech, and this. 0:07:28.836 --> 0:07:34.356 So this third group is one, presumably, where there's at 0:07:34.436 --> 0:07:39.476 least the potential for AI to detect differences that even 0:07:39.876 --> 0:07:43.196 experts like you cannot detect, right? I mean, is that 0:07:43.276 --> 0:07:45.316 what's going on there? What? 0:07:45.596 --> 0:07:48.316 So AI is not magical, you know. I think 0:07:48.436 --> 0:07:50.156 it does a lot of things. But what AI does 0:07:50.236 --> 0:07:53.316 that the layperson doesn't do is that it can analyze 0:07:53.316 --> 0:07:54.596 a lot more data faster. 0:07:55.516 --> 0:07:56.076 Yeah. 0:07:56.196 --> 0:07:59.676 Right, so AI has the possibility, if you have a 0:07:59.836 --> 0:08:03.916 large data set, to then find small differences in these 0:08:04.036 --> 0:08:06.316 data sets that we don't have. I mean, I would 0:08:06.356 --> 0:08:09.076 have to listen to, you know, thousands and thousands of 0:08:09.196 --> 0:08:10.916 voices and compare them statistically. 0:08:11.116 --> 0:08:13.356 It might, it might, right. It might also be able 0:08:13.396 --> 0:08:16.716 to detect differences that are not even audible. 0:08:17.156 --> 0:08:20.636 It could, exactly. I can give you an example. 
There's 0:08:20.676 --> 0:08:25.156 a company looking at atrial fibrillation, and I cannot validate 0:08:25.196 --> 0:08:27.796 their data because that's one of the limitations that we're 0:08:27.836 --> 0:08:30.076 going to talk about. But obviously their data set is 0:08:30.076 --> 0:08:33.316 not public. But they're saying that they can diagnose atrial 0:08:33.356 --> 0:08:36.316 fibrillation based on the voice. And their explanation is that 0:08:36.756 --> 0:08:39.396 our voice vibrates to the sound of our heartbeats. 0:08:40.796 --> 0:08:42.756 Big if true. Fun if true. 0:08:43.076 --> 0:08:45.916 I mean, you know, again, the limitation here is that 0:08:45.996 --> 0:08:48.356 there's a lot of things you can't validate. But 0:08:48.716 --> 0:08:52.276 they say that they've been validating it with EKGs and 0:08:52.396 --> 0:08:54.476 that they can see it. They can hear a difference 0:08:54.516 --> 0:08:56.436 in the voice between patients with AFib. 0:08:56.476 --> 0:09:00.476 Atrial fibrillation, it puts you at risk for a stroke, right, 0:09:00.516 --> 0:09:04.156 it can go undiagnosed. So like, if this works, 0:09:04.196 --> 0:09:08.636 that would be very helpful to many people, right? Absolutely, absolutely. 0:09:09.116 --> 0:09:13.196 So you're mentioning, like, that's super interesting. It's interesting 0:09:13.236 --> 0:09:17.716 more generally. So you're building a giant database, right, 0:09:18.756 --> 0:09:21.196 and I find that interesting for a lot of reasons. 0:09:21.036 --> 0:09:23.996 It happens, I don't know, have you come across the work 0:09:24.036 --> 0:09:27.636 of Fei-Fei Li? Absolutely, yes. So I talked to 0:09:27.636 --> 0:09:30.796 Fei-Fei Li for this show not long ago. Wow. 
Right, 0:09:30.916 --> 0:09:35.916 she's like nerd famous, right? Yeah. And so, you know, 0:09:36.036 --> 0:09:40.756 as you know, she built this giant database of images 0:09:40.996 --> 0:09:44.516 about ten years ago, a little more now, called ImageNet. 0:09:44.956 --> 0:09:50.156 And that giant database was what allowed these 0:09:50.316 --> 0:09:55.316 early machine learning models, AI models, to, you know, start 0:09:55.476 --> 0:10:02.076 recognizing images, right, and so the database was this necessary tool, 0:10:02.916 --> 0:10:05.996 necessary thing for the AI to really work, right? And 0:10:06.116 --> 0:10:12.796 so are you building the acoustic biomarker version of that? 0:10:13.636 --> 0:10:16.636 So the short answer is yes, but I'd 0:10:16.676 --> 0:10:18.716 like to start by saying that I am not building, 0:10:19.156 --> 0:10:20.116 it's our consortium. 0:10:20.596 --> 0:10:22.916 Yes, yes, you all are. 0:10:23.116 --> 0:10:24.276 Actually, I'll just 0:10:24.316 --> 0:10:27.036 first start by recognizing here that it's 0:10:27.116 --> 0:10:29.196 a huge team. So the Bridge2AI 0:10:29.316 --> 0:10:33.116 Voice Consortium is a team of fifty investigators across the 0:10:33.236 --> 0:10:36.556 US and Canada. We're funded by the NIH through the 0:10:36.596 --> 0:10:40.076 Bridge2AI program, and the goal, absolutely. This 0:10:40.156 --> 0:10:41.916 is the first time I hear the analogy to the 0:10:41.956 --> 0:10:43.356 ImageNet database. 0:10:43.396 --> 0:10:43.756 I like it. 0:10:43.796 --> 0:10:47.076 I usually give the example of the genomic database, the 0:10:47.196 --> 0:10:48.996 Human Genome Project, huge 0:10:49.076 --> 0:10:52.196 project. More famous, more famous. They're 0:10:51.716 --> 0:10:53.716 both very famous. But I like this analogy. 0:10:53.876 --> 0:10:56.196 Well, ImageNet is maybe a little bit closer of 0:10:56.236 --> 0:11:00.156 an analogy, but maybe less, yeah, yeah, sexy, yeah. 
0:10:59.836 --> 0:11:02.316 Well, but I mean, it's interesting because the genome project 0:11:02.356 --> 0:11:06.116 also has very interesting ethical particularities, like voice, right? The 0:11:06.196 --> 0:11:08.996 image has a little bit less of the ethical constraints. 0:11:08.996 --> 0:11:11.036 Whereas when we talk about whole genome 0:11:10.716 --> 0:11:15.636 sequencing or genomics data, people kind of understand that. Voice 0:11:15.636 --> 0:11:18.436 has similar concerns in terms of process. 0:11:18.476 --> 0:11:20.596 We want to get to the concerns, but I want 0:11:20.596 --> 0:11:23.276 to first talk about what you're doing, and then 0:11:23.316 --> 0:11:28.716 we can talk about, you know, not doing anything wrong. Yeah. 0:11:28.836 --> 0:11:32.676 So broadly, if it becomes the thing you hope it 0:11:32.756 --> 0:11:35.076 will be, what is it going to be? What 0:11:35.196 --> 0:11:38.636 is the Bridge2AI voice database going to be? 0:11:39.436 --> 0:11:42.516 So it's going to be this large database of thousands 0:11:42.516 --> 0:11:47.796 of human voices linked to other health information that are 0:11:47.876 --> 0:11:52.716 going to be available to researchers, and potentially people other 0:11:52.796 --> 0:11:56.756 than researchers as well, to be able to make discoveries, right, 0:11:56.916 --> 0:12:00.756 to learn to use voice AI, to train, you know, 0:12:00.796 --> 0:12:02.956 the next generation of people on how to learn to 0:12:03.196 --> 0:12:07.516 build models on voice AI, to help pharmaceutical companies develop 0:12:07.556 --> 0:12:11.676 products, or learn even to develop products, right? And 0:12:11.716 --> 0:12:15.876 the other really important thing is to teach people what 0:12:15.956 --> 0:12:18.716 type of standards we need, right? Right now, across a lot 0:12:18.796 --> 0:12:21.916 of different projects, there's really a lack of standards. People 0:12:21.996 --> 0:12:25.116 collect voice in different ways. 
That's why it's really hard 0:12:25.116 --> 0:12:29.956 to pool data together. So our dream was really to say, like, hey, 0:12:30.196 --> 0:12:33.956 you want to do voice research, here's a manual, my friend, right, 0:12:34.036 --> 0:12:36.236 like, here is how you collect the voice to make 0:12:36.276 --> 0:12:39.356 it accurate. These are the protocols with the tasks that 0:12:39.396 --> 0:12:43.956 we think, based on our studies, give the best biomarkers. Right, 0:12:44.316 --> 0:12:46.556 these are the types of biomarkers you can look for, 0:12:46.836 --> 0:12:50.116 and this is the data you can train, so really 0:12:50.156 --> 0:12:52.716 create a manual of operations also for people to be 0:12:52.756 --> 0:12:55.596 able to make discoveries, and that's the goal, to have 0:12:56.476 --> 0:12:57.916 the most impact on patient care. 0:12:58.116 --> 0:13:00.396 So what are the biomarkers? What are you asking people 0:13:00.396 --> 0:13:01.396 to do? What are you collecting? 0:13:02.356 --> 0:13:07.316 So I separate things between, so there are respiratory biomarkers, 0:13:08.356 --> 0:13:13.196 voice biomarkers, speech biomarkers, and linguistic biomarkers, and they're 0:13:13.236 --> 0:13:16.476 all different. So let's go over why these are different. 0:13:16.876 --> 0:13:20.076 So respiratory is easy, right? So we ask people to breathe, 0:13:20.556 --> 0:13:23.916 to cough, to take big breaths in, and that has 0:13:24.036 --> 0:13:27.236 a lot of information on our pulmonary capacity, on how 0:13:27.276 --> 0:13:31.836 our windpipe is shaped. Okay, that's respiratory. Then voice and 0:13:31.876 --> 0:13:34.236 speech, what's the difference? So voice is really the sound 0:13:34.316 --> 0:13:38.156 that we make when our vocal cords come together. So 0:13:38.436 --> 0:13:42.836 when we say, like, birds can voice, but they can't speak. 0:13:43.636 --> 0:13:46.436 If you have a bird that speaks, then you'll be very 0:13:46.316 --> 0:13:50.236 rich, or you have a parrot. 
0:13:52.676 --> 0:13:55.236 So when we do voice tasks, we ask 0:13:55.356 --> 0:13:57.516 patients to say "E" or 0:13:57.516 --> 0:13:59.436 "ah." I 0:13:59.516 --> 0:14:00.116 get the difference. 0:14:01.716 --> 0:14:06.636 Birds. And voice biomarkers will be impacted when our voice 0:14:06.636 --> 0:14:11.436 box is changed or our respiration is changed. Right, so 0:14:11.476 --> 0:14:14.916 somebody with pneumonia probably cannot hold a note for very long. 0:14:15.036 --> 0:14:18.596 So that's voice biomarkers. When we talk about speech biomarkers, 0:14:18.636 --> 0:14:22.476 then you go into articulation. So some people, for example, 0:14:22.476 --> 0:14:25.796 who have neurological deficits or their mouth is not working correctly, 0:14:25.796 --> 0:14:28.236 they're going to have trouble articulating. They're going to have 0:14:28.236 --> 0:14:31.916 trouble saying some words. So these are biomarkers we can extract. 0:14:32.196 --> 0:14:36.436 And then lastly there's linguistic biomarkers. So what type of 0:14:36.476 --> 0:14:41.156 words are people using, what type of semantics, how fast 0:14:41.476 --> 0:14:44.076 do they speak, for example? These are all different types 0:14:44.076 --> 0:14:48.316 of biomarkers 0:14:46.596 --> 0:14:47.476 that we can extract. 0:14:47.476 --> 0:14:49.636 So to give you a very tangible example, I was 0:14:49.676 --> 0:14:53.316 reading a paper from a group looking at biomarkers of depression, 0:14:54.836 --> 0:14:58.196 and rate of speech was one of the important biomarkers 0:14:58.196 --> 0:15:01.236 they found. So people who are sad or depressed will 0:15:01.276 --> 0:15:06.956 speak at a slower pace, so words per second is smaller. 0:15:06.996 --> 0:15:08.676 So that's simple when you think about it, it's a 0:15:08.676 --> 0:15:12.996 simple biomarker, right? So that's to give a tangible example. 
0:15:13.276 --> 0:15:14.956 So in terms of, I think I didn't answer your 0:15:15.036 --> 0:15:18.516 question fully. So what are we asking patients? So we 0:15:18.636 --> 0:15:21.916 ask people to do all these tasks, so coughing, breathing, 0:15:22.316 --> 0:15:27.516 "ah," "E." Then we make them read those validated passages, 0:15:27.596 --> 0:15:30.956 and we also ask open questions. And then when we 0:15:31.036 --> 0:15:33.836 ask open questions, we have to ask questions that 0:15:34.116 --> 0:15:37.676 make them emotional and some that don't make them emotional, 0:15:37.716 --> 0:15:40.236 because if you trigger emotion, that causes a bias on 0:15:40.276 --> 0:15:41.516 how your voice will sound. 0:15:42.676 --> 0:15:45.636 What question do you ask to make people emotional? 0:15:46.036 --> 0:15:47.116 So it's really interesting. 0:15:47.396 --> 0:15:50.676 So at first we would ask, you know, our first 0:15:50.756 --> 0:15:53.436 question was, you know, can you talk to me about 0:15:53.836 --> 0:15:56.276 something that makes you sad? It could be somebody that 0:15:56.356 --> 0:15:58.796 died in your family or, you know. So that was 0:15:58.836 --> 0:16:03.756 our prompt. And then our question without emotion was, tell 0:16:03.836 --> 0:16:06.356 us about your disease, and 0:16:07.036 --> 0:16:09.676 Only a doctor would think that's the unemotional question. 0:16:09.836 --> 0:16:10.316 Exactly. 0:16:10.396 --> 0:16:12.196 I mean, but it's like, when you think about it, 0:16:12.236 --> 0:16:14.916 like, our consortium is like tons of experts that put 0:16:14.956 --> 0:16:16.516 their minds together to develop. 0:16:16.276 --> 0:16:19.516 Tell me about having Parkinson's. That's the unemotional question we're 0:16:19.516 --> 0:16:19.956 going to ask. 0:16:19.916 --> 0:16:22.196 And then we, I mean, we, like, why are you here? 
0:16:22.236 --> 0:16:23.796 I think it was not that obvious, but it's like, 0:16:24.556 --> 0:16:27.116 tell us about why you're here to see your doctor today. 0:16:27.316 --> 0:16:29.956 And then analyzing the data, because we do pilots, right, 0:16:30.036 --> 0:16:33.316 we audit our data, we realized that people were starting 0:16:33.356 --> 0:16:36.236 to tear up. Like, we had people crying while talking 0:16:36.276 --> 0:16:38.516 about why they were coming to the doctor today, which is 0:16:38.516 --> 0:16:41.436 supposed to be the example of unemotional. 0:16:40.836 --> 0:16:42.636 Sure, correct. So we had to change that. 0:16:45.396 --> 0:16:50.556 Yes, interesting. So okay, this is great. So you're getting 0:16:50.596 --> 0:16:55.836 a lot of auditory information from every patient. What other 0:16:55.876 --> 0:16:58.356 information are you getting from each person? So much. 0:16:58.676 --> 0:17:00.876 So to give you an idea, our full protocol is 0:17:00.876 --> 0:17:01.516 about one 0:17:01.396 --> 0:17:05.396 hour. Okay, so of the patient, with the patient. 0:17:05.076 --> 0:17:08.236 With an iPad. It's an iPad, so everything is based 0:17:08.276 --> 0:17:10.756 on an iPad, and there's a helper right now, a 0:17:10.756 --> 0:17:14.476 research assistant. So we collect data. We collect very extensive 0:17:14.516 --> 0:17:18.396 demographics in terms of, you know, age, race, geographical location. 0:17:19.356 --> 0:17:22.556 We collect language. So what language do you speak? How 0:17:22.596 --> 0:17:25.636 many languages do you speak, what languages do you write? 0:17:26.436 --> 0:17:28.196 You know, what part of the world are you from? 0:17:28.356 --> 0:17:32.156 That's really important. Then we collect about disabilities. Are you 0:17:32.236 --> 0:17:34.836 hearing impaired, are you visually impaired? 
Because that makes a 0:17:34.916 --> 0:17:40.076 change in your voice. Your smoking status, your hydration status, 0:17:40.396 --> 0:17:43.796 your fatigue status, because, so we kind of 0:17:43.836 --> 0:17:47.196 thought about anything that could affect voice, right? Your 0:17:47.236 --> 0:17:50.996 socioeconomic status, because if you think about it, that's going 0:17:51.036 --> 0:17:54.956 to affect, you know, your linguistics as well. And then, 0:17:55.036 --> 0:17:59.916 other than that extensive demographics, then we collect confounders, so 0:17:59.956 --> 0:18:02.196 we think about anything that could change your voice. Do 0:18:02.236 --> 0:18:05.556 you have allergies? Do you have dental issues? 0:18:05.596 --> 0:18:09.636 Do you wear braces? So everybody gets a basic test 0:18:09.756 --> 0:18:13.036 about if they are depressed. So no matter what disease 0:18:13.076 --> 0:18:15.716 you have, you kind of get the basic tests for 0:18:15.796 --> 0:18:18.756 all the other diseases, to measure if it's possible that 0:18:18.836 --> 0:18:21.356 you have concurrent diseases at the same time. 0:18:21.396 --> 0:18:24.716 Presumably because people are in fact complex, and there 0:18:24.756 --> 0:18:27.676 are many people who have depression and Parkinson's, and you 0:18:27.716 --> 0:18:29.716 want to understand what's going on there. 0:18:30.116 --> 0:18:33.836 I mean, most people are complex, right? It's really rare 0:18:33.916 --> 0:18:36.396 to have, and people that go to the doctor are 0:18:36.436 --> 0:18:39.636 not twenty years old and healthy. Right, most of the 0:18:39.676 --> 0:18:42.596 people who will use our technology or will benefit from 0:18:42.636 --> 0:18:45.716 this database will be your typical sixty-year-old chronic 0:18:45.756 --> 0:18:48.196 disease patient that comes into the doctor, and 0:18:48.316 --> 0:18:50.076 they don't have a sterile bill of health. 
0:18:50.916 --> 0:18:53.556 How many people do you want to have in the database? Like, 0:18:53.636 --> 0:18:55.436 is there a final number you're going for? 0:18:55.876 --> 0:18:58.476 So at the beginning, we were aiming for thirty thousand, 0:19:00.436 --> 0:19:03.796 which is extremely, it's extremely ambitious. I think, to be fair, 0:19:03.836 --> 0:19:06.236 I mean, if after four years we get to ten thousand, 0:19:06.276 --> 0:19:10.356 I think it'll be a huge success. Okay. And you 0:19:10.356 --> 0:19:13.636 know, the data collection, I think what we're learning is 0:19:13.676 --> 0:19:17.556 that data collection is very resource intensive. To have good 0:19:17.676 --> 0:19:20.196 data is very resource intensive. 0:19:20.996 --> 0:19:25.436 So what happened that made you realize that thirty thousand 0:19:25.636 --> 0:19:27.876 was maybe harder than you thought? 0:19:28.796 --> 0:19:31.756 So I think we thought that we wanted to collect 0:19:31.796 --> 0:19:34.196 as much data as possible, and our original plan was 0:19:34.236 --> 0:19:38.996 to collect a lot shorter protocols, you know, like shorter clips. 0:19:40.236 --> 0:19:43.516 But as we started working with patients, we realized that 0:19:44.076 --> 0:19:48.076 by getting more data from the same patients, we can 0:19:48.116 --> 0:19:51.636 actually have a lot more information, and it provides a 0:19:51.716 --> 0:19:55.276 lot of interesting, you know, biomarkers. So we're focusing more 0:19:55.316 --> 0:19:58.716 on getting more data from a smaller number of patients, 0:19:59.156 --> 0:20:02.356 and really with the right data, kind of right data 0:20:02.436 --> 0:20:04.636 with a lot of clinical information attached to it. 0:20:08.036 --> 0:20:10.516 After the break, what the world will look like in 0:20:10.556 --> 0:20:22.876 a few years if everything goes well. So this is 0:20:22.916 --> 0:20:25.356 a big project that Yael and her colleagues have 0:20:25.356 --> 0:20:27.556 embarked on. 
It's a four-year project. They're about a 0:20:27.676 --> 0:20:31.356 year in, and there will be interim data releases along 0:20:31.356 --> 0:20:34.156 the way. So I asked her, how long will it 0:20:34.196 --> 0:20:37.036 take for this project to advance the state of the 0:20:37.116 --> 0:20:39.396 science in acoustic biomarkers? 0:20:39.876 --> 0:20:43.036 Yeah, I would say, to say at the end of 0:20:43.076 --> 0:20:46.956 the four years would be probably the best answer. 0:20:47.036 --> 0:20:49.236 I think at the end of the four years. But 0:20:49.316 --> 0:20:51.356 I think that, you know, you can't just say, oh, 0:20:51.356 --> 0:20:53.196 we'll just start training models at the end of the 0:20:53.196 --> 0:20:55.396 four years once we have all the data. Right, it's 0:20:55.396 --> 0:20:57.636 not just about, you know, building one model that'll 0:20:57.636 --> 0:21:02.036 answer your question, it's about continuously training models to understand 0:21:02.196 --> 0:21:05.756 which biomarkers to extract, to then build products that work. 0:21:06.356 --> 0:21:13.756 So, if things go well, what will this world 0:21:13.796 --> 0:21:16.596 look like in, whatever, five years? 0:21:17.276 --> 0:21:21.156 Yes. So, I mean, there's a few things that 0:21:21.196 --> 0:21:24.036 this can help with, in general, voice biomarkers. Let's not 0:21:24.076 --> 0:21:28.596 talk about just our project. Diagnosis is one thing, right, 0:21:28.676 --> 0:21:35.276 early diagnosis, but that's probably the hardest thing. Huh. Screening 0:21:35.516 --> 0:21:38.996 is more important. So when we think about screening, 0:21:39.076 --> 0:21:41.596 it means, let's say you live really far, you 0:21:41.596 --> 0:21:43.956 don't have access to a doctor, but your doctor has 0:21:43.956 --> 0:21:46.356 an iPhone and you can talk into the iPhone and 0:21:46.356 --> 0:21:49.076 it can say, hey, something's wrong. 
You know, you need 0:21:49.236 --> 0:21:53.156 a neurological specialist, for example. So to help screen and triage, 0:21:53.236 --> 0:21:55.956 I think this, probably, we're looking at in the next 0:21:55.996 --> 0:22:00.476 five years, something definitely possible. The other product that I 0:22:00.516 --> 0:22:03.596 think will be very possible within five years is tracking 0:22:03.596 --> 0:22:07.196 of diseases. If you want to monitor the evolution of 0:22:07.236 --> 0:22:11.876 Parkinson's or how people respond to drugs. That's why pharmaceutical 0:22:11.876 --> 0:22:13.196 companies are very interested. 0:22:13.396 --> 0:22:16.716 Right. So the acoustic biomarker is not just a binary 0:22:16.796 --> 0:22:19.676 signal of disease, no disease. It can tell you a 0:22:19.716 --> 0:22:23.796 lot about the status of disease. Is it getting better, 0:22:23.836 --> 0:22:24.556 is it getting worse? 0:22:24.956 --> 0:22:27.996 Evolution, especially if you train it on your own voice. Right, 0:22:28.476 --> 0:22:32.596 it's even easier to detect changes in somebody's voice as 0:22:32.716 --> 0:22:36.676 they progress, like your Siri, for example, or Alexa, that 0:22:36.756 --> 0:22:39.716 learns, listens to your voice. So that's going to be 0:22:39.716 --> 0:22:42.396