WEBVTT - Improving Accessibility with AI

0:00:03.320 --> 0:00:05.080
<v Speaker 1>If you are hearing the sound of my voice, then

0:00:05.080 --> 0:00:08.840
<v Speaker 1>you are not actually hearing my voice at all. What

0:00:08.880 --> 0:00:10.520
<v Speaker 1>I mean is that the voice you are hearing is

0:00:10.560 --> 0:00:13.160
<v Speaker 1>actually an assistive text of voice cloning tool that my

0:00:13.240 --> 0:00:17.760
<v Speaker 1>company created, and it is completely powered by AI. There

0:00:17.760 --> 0:00:20.120
<v Speaker 1>are many different types of AI tools to help people

0:00:20.120 --> 0:00:23.600
<v Speaker 1>who are differently abled. For me, it has restored my voice,

0:00:23.600 --> 0:00:26.280
<v Speaker 1>but there is so much more it can do. I'm

0:00:26.320 --> 0:00:31.880
<v Speaker 1>excited to see how it grows. Hey, there, I'm grain

0:00:31.960 --> 0:00:35.920
<v Speaker 1>class and this is technically speaking an Intel podcast. The

0:00:35.960 --> 0:00:39.199
<v Speaker 1>show is dedicated to highlighting the way technology is revolutionizing

0:00:39.320 --> 0:00:43.120
<v Speaker 1>the way we live, work and move. In every episode,

0:00:43.240 --> 0:00:46.400
<v Speaker 1>we'll connect with innovators in areas like artificial intelligence to

0:00:46.479 --> 0:00:50.440
<v Speaker 1>better understand the human centered technology they've developed. As early

0:00:50.479 --> 0:00:53.120
<v Speaker 1>as the discovery of fire and the invention of the wheel,

0:00:53.640 --> 0:00:57.280
<v Speaker 1>technology has always been an innovation to improve people's lives. However,

0:00:57.920 --> 0:01:01.760
<v Speaker 1>sometimes leaders and technology unintentionally exclude those who may deal

0:01:01.760 --> 0:01:07.759
<v Speaker 1>with uncommon issues such as physical immobility, neurodivergence, visual impairments,

0:01:07.880 --> 0:01:11.320
<v Speaker 1>or even old age. While governments usually put systems in

0:01:11.360 --> 0:01:13.959
<v Speaker 1>place to acknowledge and care for these communities, it has

0:01:14.000 --> 0:01:16.840
<v Speaker 1>been the role of technology to create advancements necessary for

0:01:16.880 --> 0:01:20.080
<v Speaker 1>those dealing with disabilities to thrive just as much as

0:01:20.080 --> 0:01:24.280
<v Speaker 1>they abled counterparts. With the revelation of artificial intelligence, there

0:01:24.280 --> 0:01:27.400
<v Speaker 1>are many new advancements that are providing accessibility to these

0:01:27.440 --> 0:01:30.920
<v Speaker 1>communities in the ways we never thought possible until now.

0:01:31.480 --> 0:01:33.399
<v Speaker 1>And I have two experts with me who are leading

0:01:33.440 --> 0:01:38.319
<v Speaker 1>the charge to more accessible future. Lama and Ackmann is

0:01:38.360 --> 0:01:42.440
<v Speaker 1>a visionary leader at the intersection of technology and human experience.

0:01:43.080 --> 0:01:46.720
<v Speaker 1>With a distinguished career spanning academia and industry, Lama has

0:01:46.800 --> 0:01:50.200
<v Speaker 1>consistently pushed the boundaries of technology to enhance our daily

0:01:50.200 --> 0:01:53.840
<v Speaker 1>lives and redefine the way we interact with computers. Her

0:01:53.840 --> 0:01:56.840
<v Speaker 1>innovative work has not only advanced the field of AI,

0:01:57.320 --> 0:01:59.600
<v Speaker 1>but has also paved the way for more intuitive and

0:02:00.440 --> 0:02:04.200
<v Speaker 1>human machine interfaces. As an Intel Fellow and director of

0:02:04.240 --> 0:02:07.240
<v Speaker 1>its Human and AI Systems Research Lab, she leads the

0:02:07.280 --> 0:02:10.880
<v Speaker 1>team defining and executing the research for contextually aware and

0:02:10.919 --> 0:02:15.920
<v Speaker 1>personalized computing, developing sensing systems, algorithms, and applications to make

0:02:15.960 --> 0:02:16.840
<v Speaker 1>it all possible.

0:02:17.160 --> 0:02:19.880
<v Speaker 2>Welcome Lama, thank you very nice to be here.

0:02:20.320 --> 0:02:24.040
<v Speaker 1>We're also joined by Jagadesh Mahendram, a visionary entrepreneur and

0:02:24.080 --> 0:02:27.240
<v Speaker 1>tech innovator who has made significant contributions to the fields

0:02:27.240 --> 0:02:32.080
<v Speaker 1>of artificial intelligence, renewable energy, and sustainable development. With a

0:02:32.120 --> 0:02:36.080
<v Speaker 1>relentless passion for cutting edge technology to address global challenges,

0:02:36.560 --> 0:02:39.520
<v Speaker 1>Jakodesh has emerged as a driving force in shaping a

0:02:39.560 --> 0:02:44.040
<v Speaker 1>more sustainable and inconnected world. Most recently, he joined Camera

0:02:44.520 --> 0:02:47.040
<v Speaker 1>LLC with his founding partners and a team of visually

0:02:47.080 --> 0:02:51.000
<v Speaker 1>impaired volunteers. He uses AI to develop solutions and assistance

0:02:51.040 --> 0:02:55.760
<v Speaker 1>for those dealing with site lost and low visibility. Welcome, Jaggedesh,

0:02:56.040 --> 0:03:02.240
<v Speaker 1>Thank you very much. It's so honor to be here. Okay,

0:03:02.280 --> 0:03:05.520
<v Speaker 1>I've just start with Lama. Lama, how did you get

0:03:05.520 --> 0:03:07.639
<v Speaker 1>your start in tech and AI?

0:03:08.480 --> 0:03:12.480
<v Speaker 2>And I would say that I've been in love with

0:03:12.600 --> 0:03:17.120
<v Speaker 2>tech probably since I was like two years old, you know.

0:03:17.160 --> 0:03:19.640
<v Speaker 2>I've always been into kind of like the latest and

0:03:19.680 --> 0:03:24.400
<v Speaker 2>the greatest technology growing up. But then after I graduated

0:03:24.720 --> 0:03:29.639
<v Speaker 2>from UW Madison, I actually joined Intel out of college

0:03:29.919 --> 0:03:32.240
<v Speaker 2>and then I worked there for a while. I went

0:03:32.280 --> 0:03:35.680
<v Speaker 2>out and did a few startups and then came back

0:03:35.720 --> 0:03:40.920
<v Speaker 2>to Intel specifically focused on that intersection of sensing and

0:03:41.080 --> 0:03:45.240
<v Speaker 2>understanding the world through that to create really compelling technology.

0:03:45.320 --> 0:03:47.240
<v Speaker 2>So that's been kind of like almost like a very

0:03:47.320 --> 0:03:50.080
<v Speaker 2>long career progression that brought me back to what I

0:03:50.160 --> 0:03:51.000
<v Speaker 2>was excited about.

0:03:51.560 --> 0:03:54.760
<v Speaker 1>Excellent, and then in terms of the AI component, how

0:03:54.760 --> 0:03:56.360
<v Speaker 1>did you start to get involved in that?

0:03:57.040 --> 0:03:59.600
<v Speaker 2>Yeah, so early on when I went back to Intel,

0:03:59.640 --> 0:04:02.680
<v Speaker 2>actually in two thousand and three, I started to look at,

0:04:02.800 --> 0:04:04.640
<v Speaker 2>you know, how do we make sense out of the

0:04:04.680 --> 0:04:08.480
<v Speaker 2>world around us? So to be able to understand a

0:04:08.480 --> 0:04:11.320
<v Speaker 2>lot of that sensor data that we were processing, whether

0:04:11.360 --> 0:04:15.040
<v Speaker 2>it's vision or audio or tex or whatever, right, that

0:04:15.200 --> 0:04:18.240
<v Speaker 2>really required work and AI to actually make sense out

0:04:18.240 --> 0:04:20.200
<v Speaker 2>of that data. So that's where it kind of started

0:04:20.240 --> 0:04:23.200
<v Speaker 2>around two thousand and four, and then since then it's

0:04:23.279 --> 0:04:28.000
<v Speaker 2>kind of looking at different ways of intersecting AI and

0:04:28.560 --> 0:04:33.840
<v Speaker 2>HCI to actually bring about really compelling experiences for users

0:04:33.920 --> 0:04:36.839
<v Speaker 2>and helping them perform all sorts of things in their lives.

0:04:37.240 --> 0:04:42.039
<v Speaker 1>Excellent and Jackets, how did you get your start in technology?

0:04:43.120 --> 0:04:45.800
<v Speaker 3>I have a very different story here. I was not

0:04:45.880 --> 0:04:49.440
<v Speaker 3>interested in technology or all. Actually wanted to become a doctor,

0:04:49.760 --> 0:04:52.760
<v Speaker 3>but you know, it's very competitive in India, so I

0:04:52.760 --> 0:04:55.279
<v Speaker 3>didn't really get good ranking, so I couldn't join the

0:04:55.320 --> 0:04:58.559
<v Speaker 3>college that I wanted to join, and the second option

0:04:58.720 --> 0:05:03.000
<v Speaker 3>was engineering, and I chose to do the computer science.

0:05:03.320 --> 0:05:05.520
<v Speaker 3>I think the very best turn out to me is

0:05:05.600 --> 0:05:10.920
<v Speaker 3>good that we're enjoying artificial intelligence and more than doctor.

0:05:11.040 --> 0:05:13.520
<v Speaker 3>I think I'm a bettery engineer than a doctor.

0:05:16.000 --> 0:05:18.800
<v Speaker 1>Oh that kind of makes sense with your I guess

0:05:18.839 --> 0:05:21.559
<v Speaker 1>love of medicine and the type of projects that you've

0:05:21.839 --> 0:05:23.680
<v Speaker 1>come up with. So we'll talk about that in a

0:05:23.680 --> 0:05:26.279
<v Speaker 1>little bit. But one thing I'll go back to Lama

0:05:26.560 --> 0:05:30.440
<v Speaker 1>in terms of AI improving the human experience you've mentioned HI.

0:05:30.520 --> 0:05:32.200
<v Speaker 1>First of all, if you can define for the audience

0:05:32.200 --> 0:05:37.680
<v Speaker 1>what HDI is and also how do you address solutions

0:05:38.120 --> 0:05:40.279
<v Speaker 1>and maybe you could educate me around what's the difference

0:05:40.279 --> 0:05:43.880
<v Speaker 1>between accessibility and accommodation when you're designing a system.

0:05:44.720 --> 0:05:48.400
<v Speaker 2>Yeah, so, first of all, HDI is a human computer interaction,

0:05:48.880 --> 0:05:53.120
<v Speaker 2>So it's really trying to understand how would people directly

0:05:53.160 --> 0:05:57.160
<v Speaker 2>interact with technology. And sometimes that technology is something that's

0:05:57.200 --> 0:06:01.719
<v Speaker 2>actually physical that you're picking something on a computer or whatever.

0:06:01.800 --> 0:06:03.359
<v Speaker 2>But a lot of times, you know, some of the

0:06:03.400 --> 0:06:06.560
<v Speaker 2>work that we really work on is embedding it into

0:06:06.600 --> 0:06:11.120
<v Speaker 2>the environment so that it almost becomes like invisible. And

0:06:11.160 --> 0:06:13.120
<v Speaker 2>that's kind of one of the most interesting things is

0:06:13.120 --> 0:06:17.640
<v Speaker 2>like to really architect for interactions of things that are invisible. Honestly,

0:06:17.640 --> 0:06:19.960
<v Speaker 2>if you think about any technology that you're developing, you

0:06:20.040 --> 0:06:23.320
<v Speaker 2>have to think about how you're making it accessible, how

0:06:23.320 --> 0:06:26.400
<v Speaker 2>the interfaces are accessible, how different people with different types

0:06:26.440 --> 0:06:30.919
<v Speaker 2>of disabilities and abilities can actually interact with your technology

0:06:31.400 --> 0:06:36.520
<v Speaker 2>throughout like the development cycle. In some sense, part of

0:06:36.560 --> 0:06:41.200
<v Speaker 2>what I've really been focused on is creating technologies for

0:06:41.320 --> 0:06:46.479
<v Speaker 2>people who are severely disabled, where you really need very

0:06:46.480 --> 0:06:50.640
<v Speaker 2>different ways of interacting with the technology to enable that

0:06:50.720 --> 0:06:54.200
<v Speaker 2>to happen. Really, that focus specifically on the work with

0:06:54.240 --> 0:06:57.800
<v Speaker 2>ACAT and the workforce even Hawking, has really been about

0:06:57.839 --> 0:07:01.280
<v Speaker 2>how do you get around these on strengths to enable

0:07:01.600 --> 0:07:05.200
<v Speaker 2>people to access the technologies just like all of us.

0:07:06.800 --> 0:07:11.080
<v Speaker 1>Lama mentions Intel's ACAT, which stands for Assistive Context Aware talkit.

0:07:11.720 --> 0:07:14.880
<v Speaker 1>This technology was key in enabling Stephen Hawking's ability to

0:07:14.920 --> 0:07:19.440
<v Speaker 1>continue to communicate and inspire people around the world. Listening

0:07:19.520 --> 0:07:23.200
<v Speaker 1>to Lama speak about the human computer interaction processes, she

0:07:23.280 --> 0:07:27.000
<v Speaker 1>sounds less like a tech person and more like an anthropologist.

0:07:27.760 --> 0:07:30.240
<v Speaker 1>We often think of data and algorithms as being this

0:07:30.480 --> 0:07:34.120
<v Speaker 1>cold and personal assessment of people. But Lama has such

0:07:34.120 --> 0:07:37.360
<v Speaker 1>a passion for her programs, it makes me wonder just

0:07:37.400 --> 0:07:40.520
<v Speaker 1>how impactful that passion is to the way AI tools

0:07:40.680 --> 0:07:43.920
<v Speaker 1>interpret how to assist us. While she has spent so

0:07:44.000 --> 0:07:47.040
<v Speaker 1>much time learning how to program and manage computers, it

0:07:47.120 --> 0:07:51.040
<v Speaker 1>seems her real passion is in trying to understand humanity.

0:07:52.520 --> 0:07:57.280
<v Speaker 2>My passion has really been focused on how do we

0:07:57.360 --> 0:08:02.400
<v Speaker 2>bring more equity with technology. The work towards specifically extreme

0:08:02.440 --> 0:08:07.280
<v Speaker 2>disability really came about from my interaction with Professor Hawking.

0:08:07.800 --> 0:08:09.680
<v Speaker 2>So before that, a lot of the work that I

0:08:09.720 --> 0:08:13.080
<v Speaker 2>had focused on in terms of accessibility is really bridging

0:08:13.840 --> 0:08:17.480
<v Speaker 2>where people's needs were as they're doing different aspects of

0:08:17.520 --> 0:08:20.320
<v Speaker 2>their lives. Right you're driving, for example, how can that

0:08:20.480 --> 0:08:23.920
<v Speaker 2>be contextually aware so it can help support you without

0:08:24.200 --> 0:08:27.440
<v Speaker 2>assuming that you have all of your abilities there. But

0:08:27.600 --> 0:08:31.600
<v Speaker 2>once I started working with Professor Stephen Hawking, it became

0:08:31.760 --> 0:08:36.319
<v Speaker 2>very obvious to me that to bridge that extreme disability

0:08:36.440 --> 0:08:39.880
<v Speaker 2>you really have to think very differently about how technology

0:08:39.920 --> 0:08:42.280
<v Speaker 2>comes in. And that's what really got me excited about

0:08:42.320 --> 0:08:42.760
<v Speaker 2>that work.

0:08:43.240 --> 0:08:46.320
<v Speaker 1>Okay, and in terms of the involvement you had with

0:08:46.920 --> 0:08:50.559
<v Speaker 1>Professor Hawking's technology to help him interact with the world,

0:08:50.920 --> 0:08:53.000
<v Speaker 1>What were the areas that you looked at?

0:08:53.360 --> 0:08:56.000
<v Speaker 2>The lab I lead is actually a multi disciplinary lab, right,

0:08:56.040 --> 0:08:59.719
<v Speaker 2>so we bring social science, design, and AI together. So

0:09:00.240 --> 0:09:03.280
<v Speaker 2>the first place you start is we needed to understand

0:09:03.360 --> 0:09:06.559
<v Speaker 2>how Stephen interacts with the world, what he is trying

0:09:06.559 --> 0:09:10.160
<v Speaker 2>to accomplish, and where are his bottomnecks in terms of

0:09:10.200 --> 0:09:12.440
<v Speaker 2>being able to do that with existing technology that he

0:09:12.559 --> 0:09:14.760
<v Speaker 2>was using. So there was a lot of observation to

0:09:14.800 --> 0:09:18.280
<v Speaker 2>try to understand how do we define the problem and

0:09:18.400 --> 0:09:21.160
<v Speaker 2>from there for people who are not aware of this, right,

0:09:21.280 --> 0:09:25.080
<v Speaker 2>Professor Hawking really didn't have an ability to speak, and

0:09:25.160 --> 0:09:27.800
<v Speaker 2>he didn't really have an ability to move, so he

0:09:27.840 --> 0:09:31.160
<v Speaker 2>couldn't really utilize many of the technologies that are available.

0:09:31.559 --> 0:09:35.200
<v Speaker 2>You couldn't do, for example, ASR where he could speak

0:09:35.240 --> 0:09:38.240
<v Speaker 2>and then the computer could be controlled by speech, Nor

0:09:38.280 --> 0:09:41.240
<v Speaker 2>could he type because he had no control over his hands.

0:09:41.720 --> 0:09:44.920
<v Speaker 2>So then we started to basically look at if we

0:09:45.080 --> 0:09:48.600
<v Speaker 2>really had a very very tiny signal, and in this

0:09:48.640 --> 0:09:51.480
<v Speaker 2>specific case for Professor Hawking, it was actually his ability

0:09:51.520 --> 0:09:55.160
<v Speaker 2>to move his cheek. Can we get access to that

0:09:55.400 --> 0:10:00.160
<v Speaker 2>one signal and then turn that into a complete access

0:10:00.200 --> 0:10:02.719
<v Speaker 2>for his whole machine? And then we went onto that

0:10:02.840 --> 0:10:07.959
<v Speaker 2>path of essentially building a software platform and a sensing

0:10:08.040 --> 0:10:11.319
<v Speaker 2>subsystem that allowed for that to happen. All he can

0:10:11.360 --> 0:10:13.839
<v Speaker 2>do is confirm something with the movement of a cheek,

0:10:13.880 --> 0:10:17.360
<v Speaker 2>and now he can type, he can email, he can

0:10:17.800 --> 0:10:20.000
<v Speaker 2>serve the web, he can give lectures, he could do all.

0:10:19.960 --> 0:10:22.719
<v Speaker 1>Of that in What year was that that you were

0:10:22.720 --> 0:10:23.160
<v Speaker 1>working on?

0:10:24.080 --> 0:10:27.560
<v Speaker 2>So we started our interaction with Stephen and twenty eleven

0:10:28.200 --> 0:10:31.240
<v Speaker 2>and it kind of lasted throughout his life until he

0:10:31.280 --> 0:10:35.320
<v Speaker 2>passed away, which was twenty eighteen. So we after a

0:10:35.320 --> 0:10:37.560
<v Speaker 2>couple of years we were able to put together a

0:10:37.600 --> 0:10:39.720
<v Speaker 2>system that he could use that he could switch to,

0:10:39.880 --> 0:10:42.640
<v Speaker 2>and then over the years we just continued to enhance

0:10:42.679 --> 0:10:45.160
<v Speaker 2>it and add more capabilities. We open sourced it so

0:10:45.200 --> 0:10:46.560
<v Speaker 2>that we could take it into the world.

0:10:46.600 --> 0:10:48.800
<v Speaker 1>And yeah, that was my next question in terms of

0:10:48.840 --> 0:10:52.120
<v Speaker 1>the technology that was developed. Have you seen it applied

0:10:52.559 --> 0:10:54.120
<v Speaker 1>more broadly to others.

0:10:54.800 --> 0:10:58.240
<v Speaker 2>Yeah, And initially we were hoping that we could find

0:10:58.679 --> 0:11:02.280
<v Speaker 2>some technology out there we could take and modify slightly

0:11:02.360 --> 0:11:04.800
<v Speaker 2>so that he could use it. And then after being

0:11:04.840 --> 0:11:08.120
<v Speaker 2>proven wrong, we then went onto this path to go

0:11:08.160 --> 0:11:10.880
<v Speaker 2>and develop something from scratch. But from the get go

0:11:11.120 --> 0:11:14.200
<v Speaker 2>our goal was to develop it so that it could

0:11:14.280 --> 0:11:17.760
<v Speaker 2>support a lot of different users and be a platform

0:11:17.840 --> 0:11:21.079
<v Speaker 2>for developers to build on top of, because we realized

0:11:21.080 --> 0:11:24.319
<v Speaker 2>that there was that gap in what existed out there

0:11:24.400 --> 0:11:27.920
<v Speaker 2>in the open world. And Stephen was a huge contributor

0:11:27.960 --> 0:11:30.600
<v Speaker 2>to this project, right he he you know, he was

0:11:31.280 --> 0:11:33.800
<v Speaker 2>a designer, he was a validator, he was you know,

0:11:33.840 --> 0:11:36.080
<v Speaker 2>he gave a lot of his insights. So throughout all

0:11:36.080 --> 0:11:39.559
<v Speaker 2>of this he was really focused on ensuring that that

0:11:39.640 --> 0:11:43.240
<v Speaker 2>actually went to open source because you know, people reached

0:11:43.280 --> 0:11:45.720
<v Speaker 2>out to him all the time because he was, you know,

0:11:45.800 --> 0:11:49.320
<v Speaker 2>an own figure with that extreme disability, and everybody was

0:11:49.360 --> 0:11:52.040
<v Speaker 2>asking him, like, what technology is available to us to

0:11:52.120 --> 0:11:54.959
<v Speaker 2>actually use. So he's been like really focused quite a

0:11:55.000 --> 0:11:57.600
<v Speaker 2>bit on making that available to the world.

0:11:59.120 --> 0:12:01.960
<v Speaker 1>You can really sense how dedicated Lama is to helping

0:12:01.960 --> 0:12:06.280
<v Speaker 1>those with disabilities communicate with others. However, talking is just

0:12:06.480 --> 0:12:09.880
<v Speaker 1>one way we communicate, and moreover, there are a combination

0:12:09.960 --> 0:12:13.000
<v Speaker 1>of ways that we engage and interact with our environments.

0:12:14.160 --> 0:12:16.320
<v Speaker 1>As a way to help people that may struggle with

0:12:16.360 --> 0:12:21.600
<v Speaker 1>another sense is our other guest Jagged Dish originally designing

0:12:21.600 --> 0:12:24.240
<v Speaker 1>a backpack that uses AI to help guide the blind.

0:12:24.920 --> 0:12:28.280
<v Speaker 1>His project expanded into really dissecting what it means to

0:12:28.320 --> 0:12:32.040
<v Speaker 1>be visually impaired. Lama and Jagged Dish both have different

0:12:32.040 --> 0:12:35.559
<v Speaker 1>approaches to their missions. The work compliments each other so well.

0:12:38.600 --> 0:12:42.440
<v Speaker 1>Jagsh I'd like to get you into conversation now, in

0:12:42.480 --> 0:12:47.280
<v Speaker 1>particular the AI powered backpack that has been developed by

0:12:47.280 --> 0:12:49.880
<v Speaker 1>yourself and others. Can you just tell me a little

0:12:49.920 --> 0:12:52.880
<v Speaker 1>bit about I guess the genesis of that idea.

0:12:53.600 --> 0:12:58.040
<v Speaker 3>I've always wanted to do something using the technology that

0:12:58.120 --> 0:13:00.720
<v Speaker 3>can help the society in what way or the other.

0:13:01.360 --> 0:13:05.360
<v Speaker 3>And when it came to Masters in twenty thirteen, one

0:13:05.400 --> 0:13:07.120
<v Speaker 3>of the first things that occurred to me was like,

0:13:07.160 --> 0:13:10.439
<v Speaker 3>you know, we should use EI and use a bunch

0:13:10.480 --> 0:13:13.720
<v Speaker 3>of sensors to help the usually impaired see the world,

0:13:13.800 --> 0:13:18.200
<v Speaker 3>like how sighted people see. And one of the primary

0:13:18.280 --> 0:13:20.600
<v Speaker 3>visions that I used to occur to me was when

0:13:20.640 --> 0:13:24.640
<v Speaker 3>somebody is standing in public places like buz stop, there

0:13:24.640 --> 0:13:27.160
<v Speaker 3>should be a solution in such a way that the

0:13:27.200 --> 0:13:31.400
<v Speaker 3>person who is blind should get totally unnoticed. And around

0:13:31.440 --> 0:13:34.800
<v Speaker 3>that time the technology was not as good as how

0:13:34.840 --> 0:13:37.960
<v Speaker 3>it is now. The real inspiration occurred to me when

0:13:37.960 --> 0:13:41.000
<v Speaker 3>I met my friend. The day when I met her,

0:13:41.360 --> 0:13:43.840
<v Speaker 3>she had a black mark in her face, and I

0:13:43.920 --> 0:13:46.440
<v Speaker 3>was like, you know, what happened to your face? And

0:13:46.480 --> 0:13:49.640
<v Speaker 3>she's usually impair and she was saying, as she was

0:13:49.679 --> 0:13:52.760
<v Speaker 3>walking outside in the sidewalk, she ran into a tree

0:13:52.760 --> 0:13:55.520
<v Speaker 3>branch and then that left a mark in her face.

0:13:56.360 --> 0:14:00.520
<v Speaker 3>And that was such an ironical for me because by

0:14:00.600 --> 0:14:04.920
<v Speaker 3>then I was already a perception engineer, teaching robots to

0:14:04.960 --> 0:14:07.800
<v Speaker 3>see things, you know, do complex us. But at the

0:14:07.800 --> 0:14:11.040
<v Speaker 3>same time, there are so many people who cannot see

0:14:11.240 --> 0:14:17.520
<v Speaker 3>right and that sort of spart my desire to work

0:14:17.600 --> 0:14:21.080
<v Speaker 3>on this project sooner than later. Around the same time,

0:14:21.480 --> 0:14:24.400
<v Speaker 3>this competition of Open Sea Special ai I was going on,

0:14:24.560 --> 0:14:28.960
<v Speaker 3>sponsored by Intel, and I submitted this idea and the

0:14:29.000 --> 0:14:31.640
<v Speaker 3>project ended up winning the first price. And this friend

0:14:31.640 --> 0:14:34.360
<v Speaker 3>has been helping you throughout how to develop a system

0:14:34.360 --> 0:14:38.680
<v Speaker 3>that is more user friendly and actually solves important use cases.

0:14:39.440 --> 0:14:43.520
<v Speaker 3>And through this competition received a lot of attention, and

0:14:43.600 --> 0:14:45.280
<v Speaker 3>this is when we started to think, you know, we

0:14:45.280 --> 0:14:49.560
<v Speaker 3>should probably you know, get incorporated and try to create

0:14:49.680 --> 0:14:52.880
<v Speaker 3>a full fledged open source system so that anybody in

0:14:52.880 --> 0:14:56.040
<v Speaker 3>the world can use it and help in improving the

0:14:56.040 --> 0:14:59.320
<v Speaker 3>lives of the visually impaired. Currently, we are supported by

0:15:00.280 --> 0:15:04.800
<v Speaker 3>Intel's irt A program Intel Rise Technological Initiative program, and

0:15:05.200 --> 0:15:08.840
<v Speaker 3>we are receiving an in collaboration with Accenture. Through this partnership,

0:15:08.840 --> 0:15:11.760
<v Speaker 3>you're able to gain a lot of support both on

0:15:11.800 --> 0:15:14.400
<v Speaker 3>the technical and non technical side. And soon we will

0:15:14.440 --> 0:15:18.600
<v Speaker 3>be releasing our improved version of the system, which we

0:15:18.640 --> 0:15:20.640
<v Speaker 3>call as Phoenix in a few months.

0:15:21.240 --> 0:15:24.480
<v Speaker 1>Okay, excellent, looking forward to it. And can you tell

0:15:24.520 --> 0:15:26.360
<v Speaker 1>me what I've seen a little bit of a video

0:15:26.400 --> 0:15:28.680
<v Speaker 1>on it where you've got a backpack. Maybe you could

0:15:28.680 --> 0:15:30.640
<v Speaker 1>just describe some of the main system elements.

0:15:31.280 --> 0:15:34.800
<v Speaker 3>Yeah, the physical system mainly consists of a backpack that

0:15:34.920 --> 0:15:38.640
<v Speaker 3>has Intel Look with a couple of new compute sticks

0:15:38.880 --> 0:15:42.040
<v Speaker 3>and this is the sort of the compute resource. And

0:15:42.160 --> 0:15:45.360
<v Speaker 3>at the front we have a camera. It's Obi camera

0:15:45.400 --> 0:15:47.800
<v Speaker 3>that is is put in the front and connect it

0:15:47.800 --> 0:15:52.720
<v Speaker 3>to the system behind and whatever this sensor collects the data.

0:15:53.200 --> 0:15:57.359
<v Speaker 3>We run some AA processing behind using deep learning techniques

0:15:57.800 --> 0:16:01.920
<v Speaker 3>and the system will infer useful information about the environment

0:16:02.000 --> 0:16:05.320
<v Speaker 3>and update the user such as where are the obstacles

0:16:05.680 --> 0:16:08.160
<v Speaker 3>and what are the common objects seen in the scenario,

0:16:08.400 --> 0:16:11.760
<v Speaker 3>what are the moving objects, what are the traffic conditions?

0:16:11.800 --> 0:16:16.400
<v Speaker 3>And more similar features for communicating there is audio interface

0:16:16.680 --> 0:16:21.680
<v Speaker 3>through bluetooth headphones, and we're also working on a haptic

0:16:21.760 --> 0:16:25.640
<v Speaker 3>band to communicate the same sort of information in form

0:16:25.640 --> 0:16:27.680
<v Speaker 3>of vibrations through tacktail information.

0:16:28.520 --> 0:16:31.160
<v Speaker 1>Lama was just wondering if you had any comments or

0:16:31.240 --> 0:16:33.440
<v Speaker 1>thoughts on this AI backpack.

0:16:34.120 --> 0:16:37.240
<v Speaker 2>I mean, it's a fantastic idea, and you know, if

0:16:37.280 --> 0:16:41.640
<v Speaker 2>you think about what is actually now possible with perception

0:16:42.040 --> 0:16:44.400
<v Speaker 2>and AI, I mean, it's it's just kind of like

0:16:44.440 --> 0:16:48.360
<v Speaker 2>the most natural thing to do to actually empower users

0:16:48.400 --> 0:16:52.280
<v Speaker 2>with such a capability that are vision impaired. I was

0:16:52.320 --> 0:16:56.760
<v Speaker 2>actually also quite intrigued by the haptic aspect of what

0:16:56.800 --> 0:16:59.800
<v Speaker 2>you mentioned, and I think it's something that tends to

0:16:59.800 --> 0:17:03.560
<v Speaker 2>be un they're utilized, but really kind of a natural

0:17:03.680 --> 0:17:06.720
<v Speaker 2>thing for this type of application, especially if you're trying

0:17:06.720 --> 0:17:08.959
<v Speaker 2>to kind of guide somebody in a direction. So I

0:17:08.960 --> 0:17:11.280
<v Speaker 2>was wondering maybe you can say a few words about that.

0:17:11.359 --> 0:17:12.760
<v Speaker 2>I was really intrigued by that.

0:17:13.560 --> 0:17:17.840
<v Speaker 3>Yeah, So if the first prototype contained mainly the audio interface,

0:17:18.440 --> 0:17:22.199
<v Speaker 3>all the information is actually shared via the wireless headphones

0:17:22.760 --> 0:17:26.679
<v Speaker 3>and not all the users prefer that main reason is

0:17:26.720 --> 0:17:30.320
<v Speaker 3>because they usually impair people rely on audio cues when

0:17:30.320 --> 0:17:34.800
<v Speaker 3>they're wearing earphones. We are sort of blocking a lot

0:17:34.840 --> 0:17:37.840
<v Speaker 3>of environmental cues, which is why we wanted to introduce

0:17:38.000 --> 0:17:43.560
<v Speaker 3>another modality for user interface, which is haptic bands. Basically,

0:17:43.680 --> 0:17:48.920
<v Speaker 3>using a combination of motors and vibration patterns, we can

0:17:49.000 --> 0:17:53.520
<v Speaker 3>communicate tons of information just using few motors, like even

0:17:53.600 --> 0:17:57.440
<v Speaker 3>less than ten morrors. And the current prototype that we're

0:17:57.440 --> 0:18:01.400
<v Speaker 3>working on is pretty simple version. It can be put

0:18:01.480 --> 0:18:06.480
<v Speaker 3>on the wrist and this can communicate potentially hundreds of

0:18:06.480 --> 0:18:10.880
<v Speaker 3>combinations of vibrations, and at some point we're really aiming

0:18:10.960 --> 0:18:14.240
<v Speaker 3>for a set of where we can communicate pretty much

0:18:14.280 --> 0:18:18.760
<v Speaker 3>everything the system sees through the happic vibrations. If a

0:18:18.880 --> 0:18:23.359
<v Speaker 3>user prefers completely one hundred percent haptic bands, that is

0:18:23.359 --> 0:18:26.080
<v Speaker 3>something that we are targeting for. At the same time,

0:18:26.119 --> 0:18:29.119
<v Speaker 3>some users might prefer, you know what, I want this

0:18:29.240 --> 0:18:32.200
<v Speaker 3>sort of information to be communicated via audio and some

0:18:32.240 --> 0:18:35.800
<v Speaker 3>sort of information with the haptics. We're also are working

0:18:35.840 --> 0:18:39.080
<v Speaker 3>with the combination of system as well, but having hapic

0:18:39.119 --> 0:18:42.919
<v Speaker 3>bands in a solution like this opens a sort of

0:18:42.920 --> 0:18:47.240
<v Speaker 3>a different dimension for the users here, especially we're visually impaired.

0:18:48.960 --> 0:18:52.399
<v Speaker 1>What Jagadish hints at with his explanation of haptic bands

0:18:52.480 --> 0:18:56.520
<v Speaker 1>versus audio interface is a very fascinating, multi pronged approach

0:18:56.560 --> 0:18:59.920
<v Speaker 1>to the solution. In technology terms, haptics is all about

0:19:00.040 --> 0:19:03.240
<v Speaker 1>how your device interacts with you through touch. Think of

0:19:03.320 --> 0:19:06.080
<v Speaker 1>the times when your phone vibrates in your pocket, or

0:19:06.359 --> 0:19:08.359
<v Speaker 1>when you play a video game and the controller shakes

0:19:08.359 --> 0:19:12.080
<v Speaker 1>in response to you taking damage from the endgame boss. Oftentimes,

0:19:12.080 --> 0:19:15.720
<v Speaker 1>in developing solutions for disabilities, there is a one size

0:19:15.720 --> 0:19:18.320
<v Speaker 1>fits all approach that seeks to do an adequate job

0:19:18.400 --> 0:19:21.359
<v Speaker 1>for the most number of people. This strategy fails to

0:19:21.400 --> 0:19:24.960
<v Speaker 1>take into account the nuances of the human experience in

0:19:25.000 --> 0:19:27.840
<v Speaker 1>the same way that some people are audio learners while

0:19:27.880 --> 0:19:30.679
<v Speaker 1>others are visual. When it comes to aiding someone with

0:19:30.720 --> 0:19:34.720
<v Speaker 1>a disability, it is important to consider what methods complement

0:19:34.760 --> 0:19:38.680
<v Speaker 1>their strengths and experiences. The beauty of how jagger Dish

0:19:38.960 --> 0:19:41.600
<v Speaker 1>seeks to develop this AI tool is that it is

0:19:41.720 --> 0:19:45.840
<v Speaker 1>constantly studying and creating more specialized options for the users,

0:19:46.880 --> 0:19:49.640
<v Speaker 1>from audio to haptics. It has the potential to grow

0:19:49.680 --> 0:19:52.240
<v Speaker 1>in a number of ways to accommodate the visually impaired

0:19:52.920 --> 0:19:55.800
<v Speaker 1>ways we never thought to supplement them, and maybe these

0:19:55.840 --> 0:19:58.880
<v Speaker 1>developments will even have an impact on those with perfect sight.

0:20:00.040 --> 0:20:03.479
<v Speaker 1>Leans into the human computer interface component that Lama mentioned earlier,

0:20:03.880 --> 0:20:06.840
<v Speaker 1>the constant study and assessment of how people will actually

0:20:06.920 --> 0:20:14.000
<v Speaker 1>use the tools. Given you're listening to technically Speaking an

0:20:14.000 --> 0:20:27.200
<v Speaker 1>Intel podcast will be right back. Welcome back to technically

0:20:27.240 --> 0:20:34.720
<v Speaker 1>speaking an Intel podcast. I'd just like to get more

0:20:34.760 --> 0:20:39.280
<v Speaker 1>broadly into Intel and it's AI efforts now, Jaged you

0:20:39.320 --> 0:20:42.879
<v Speaker 1>have a partnership with Intel. You've mentioned it before, how

0:20:43.200 --> 0:20:45.800
<v Speaker 1>you're working with Intel and how they've come to the party,

0:20:45.840 --> 0:20:49.160
<v Speaker 1>so to speak. What's it like working with their team

0:20:49.640 --> 0:20:52.159
<v Speaker 1>in terms of their support and assistance they've given you.

0:20:52.600 --> 0:20:55.960
<v Speaker 1>It's fantastic. The amount of exposure and the support that

0:20:56.000 --> 0:20:59.600
<v Speaker 1>we've received from Intel is really amazing. What we admire

0:20:59.600 --> 0:21:03.560
<v Speaker 1>about Intel is how open they are in developing the

0:21:03.800 --> 0:21:08.440
<v Speaker 1>solutions for accessibility, and they have a dedicated team who's

0:21:08.440 --> 0:21:10.920
<v Speaker 1>purely working on solutions like this. And we also got

0:21:10.960 --> 0:21:14.840
<v Speaker 1>opportunity to look into the projects that Lama's team has

0:21:14.840 --> 0:21:18.439
<v Speaker 1>been working on. They're simply superb. I think these solutions

0:21:18.480 --> 0:21:22.520
<v Speaker 1>like this are transformative and it's going to change lives

0:21:22.520 --> 0:21:26.040
<v Speaker 1>for people. And in terms of the support that we've received,

0:21:26.400 --> 0:21:30.760
<v Speaker 1>they have been helping us on many aspects all the

0:21:30.760 --> 0:21:35.280
<v Speaker 1>way from helping with putting up a process training the

0:21:35.320 --> 0:21:38.600
<v Speaker 1>model assets, you know, creating a platform for training models,

0:21:38.680 --> 0:21:42.920
<v Speaker 1>and also sharing the connections, and also with funding. So

0:21:43.160 --> 0:21:45.520
<v Speaker 1>a lot of the features that is going to come

0:21:45.560 --> 0:21:49.000
<v Speaker 1>as part of Phoenix is coming out of the IrDA project,

0:21:49.440 --> 0:21:53.520
<v Speaker 1>and the sort of feedbacks that would risk in improving

0:21:53.560 --> 0:21:57.720
<v Speaker 1>the solution is something that we don't get outset easily.

0:21:59.080 --> 0:22:01.800
<v Speaker 2>Yeah, and it's one of the things that I believe

0:22:02.280 --> 0:22:04.440
<v Speaker 2>we're really all about at Intel, right If you look

0:22:04.480 --> 0:22:07.440
<v Speaker 2>at our mission, it's really enriched the life of every

0:22:07.560 --> 0:22:10.280
<v Speaker 2>person on the planet, and every person, right, not just

0:22:10.359 --> 0:22:14.000
<v Speaker 2>able people. So it's really wonderful that you're seeing that

0:22:14.119 --> 0:22:17.479
<v Speaker 2>support and the diversity of the type of platforms and

0:22:17.480 --> 0:22:20.000
<v Speaker 2>solutions that we have, right. So I'm really just very

0:22:20.040 --> 0:22:23.240
<v Speaker 2>heartened by what you said, Drakhandesh. One of the things

0:22:23.280 --> 0:22:27.320
<v Speaker 2>maybe that is a top of mind project something called Omnibridge,

0:22:27.960 --> 0:22:32.080
<v Speaker 2>and Omnibridge is essentially a software that is meant to

0:22:32.520 --> 0:22:36.440
<v Speaker 2>bridge again the silence gap, but for people who are

0:22:36.960 --> 0:22:40.439
<v Speaker 2>hearing impaired, so that you know, essentially you're translating in

0:22:40.480 --> 0:22:43.720
<v Speaker 2>and out of like sign language, so you know, people

0:22:43.800 --> 0:22:47.400
<v Speaker 2>can sign into their PC and then the PC can

0:22:47.440 --> 0:22:51.959
<v Speaker 2>actually translate that into language on the other end, and

0:22:52.000 --> 0:22:54.679
<v Speaker 2>then vice versa. Right, So it's like, you know, what

0:22:54.720 --> 0:22:59.000
<v Speaker 2>you're really enabling again is to enable people in their

0:22:59.320 --> 0:23:01.720
<v Speaker 2>everyday life life to actually be able to do that.

0:23:01.840 --> 0:23:04.280
<v Speaker 2>And to be able to do that, you need a

0:23:04.320 --> 0:23:07.439
<v Speaker 2>lot of the AI support and AI compute on these platforms.

0:23:07.440 --> 0:23:09.119
<v Speaker 2>So one of the reasons again what I was saying,

0:23:09.400 --> 0:23:12.280
<v Speaker 2>it's really nice to see it at these platforms and

0:23:12.359 --> 0:23:14.840
<v Speaker 2>at the lowest cost that you can actually bring it.

0:23:14.920 --> 0:23:17.520
<v Speaker 2>You start to really democratize AI in ways that really

0:23:17.560 --> 0:23:18.800
<v Speaker 2>improve people's lives.

0:23:19.000 --> 0:23:21.200
<v Speaker 1>Yeah, for me, I mean one of the key things

0:23:21.200 --> 0:23:25.640
<v Speaker 1>you've just said is democratizing technology, and I think that's

0:23:25.680 --> 0:23:28.440
<v Speaker 1>the real power of it is. Yes, we can have

0:23:28.480 --> 0:23:33.320
<v Speaker 1>those really fancy solutions that Professor Hawking has, but for me,

0:23:33.480 --> 0:23:37.000
<v Speaker 1>it's about trying to get that cost down so that

0:23:37.080 --> 0:23:40.880
<v Speaker 1>it makes it so much easier for people to use.

0:23:41.320 --> 0:23:44.600
<v Speaker 2>So actually, just as a correction, so Professor Hawking didn't

0:23:44.640 --> 0:23:47.199
<v Speaker 2>have a fancy system, I was actually at PC with

0:23:47.280 --> 0:23:50.840
<v Speaker 2>a very lightweight sensor and in fact, like a big

0:23:50.880 --> 0:23:52.800
<v Speaker 2>part of what we've been really trying to do with

0:23:52.880 --> 0:23:55.920
<v Speaker 2>BCI is also democratize that because the problem with BCI

0:23:56.240 --> 0:23:58.840
<v Speaker 2>is if you want something with really high fidelity, you're

0:23:58.880 --> 0:24:02.320
<v Speaker 2>paying fifteen to two thousand dollars on a headset versus

0:24:02.359 --> 0:24:04.960
<v Speaker 2>what we're really trying to do is like use OpenBCI,

0:24:05.160 --> 0:24:08.159
<v Speaker 2>so it's you know, a really low cost, but you know,

0:24:08.280 --> 0:24:12.399
<v Speaker 2>compensate for the fidelity constraints with a lot of machine learning.

0:24:12.720 --> 0:24:16.679
<v Speaker 1>Okay, great, and jacket ashly did say it was relatively

0:24:17.320 --> 0:24:21.520
<v Speaker 1>low cost. Is that one of the primary motivating factors

0:24:21.520 --> 0:24:24.960
<v Speaker 1>for you? And how do you go about designing systems

0:24:24.960 --> 0:24:27.600
<v Speaker 1>to try and get that cost down.

0:24:28.240 --> 0:24:31.879
<v Speaker 3>Absolutely, it's a major restricting factor. Just a bit of context,

0:24:32.560 --> 0:24:36.320
<v Speaker 3>The unemployment rate in the visually impaired people community is

0:24:36.359 --> 0:24:39.880
<v Speaker 3>extremely high. I think it's more than sixty percent, so

0:24:40.000 --> 0:24:44.679
<v Speaker 3>it's hard for them to afford any product that is expensive.

0:24:45.160 --> 0:24:47.600
<v Speaker 3>And this is something that we want to change by

0:24:47.920 --> 0:24:51.439
<v Speaker 3>one making it completely open source, so that anybody in

0:24:51.480 --> 0:24:54.760
<v Speaker 3>the world they can just if they have the technical skills,

0:24:54.800 --> 0:24:57.240
<v Speaker 3>they can just assemble the system and they can get

0:24:57.240 --> 0:25:00.000
<v Speaker 3>the system. If not, we can help them assemble the system.

0:25:00.520 --> 0:25:03.400
<v Speaker 3>The complete solution is going to be open source. Two

0:25:03.640 --> 0:25:09.600
<v Speaker 3>is building the product using the hardware systems that are cheap.

0:25:10.280 --> 0:25:13.160
<v Speaker 3>At the same time, that are efficient, and that's where

0:25:13.240 --> 0:25:17.200
<v Speaker 3>products like Intelook stands apart one because it has very

0:25:17.200 --> 0:25:19.719
<v Speaker 3>good capability for running a lot of models in bartle

0:25:20.359 --> 0:25:24.600
<v Speaker 3>and also using accelerators like neural computer stick. So things

0:25:24.680 --> 0:25:27.359
<v Speaker 3>like that help us in shrinking the form factor and

0:25:27.400 --> 0:25:30.760
<v Speaker 3>also the cost quite a bit. And at the same time,

0:25:31.680 --> 0:25:34.919
<v Speaker 3>at the software design level, if you are putting in

0:25:34.960 --> 0:25:39.040
<v Speaker 3>a modular based design, where if somebody wants to use

0:25:39.080 --> 0:25:42.280
<v Speaker 3>a cheaper sensor, they could plug in a different sensor.

0:25:42.880 --> 0:25:46.600
<v Speaker 3>The rest of the robotics or stack will remain intact

0:25:46.880 --> 0:25:49.360
<v Speaker 3>as far as they take care of the sensor obstruction layer.

0:25:49.440 --> 0:25:53.080
<v Speaker 3>And same thing goes for probably for haptic interface, probably

0:25:53.080 --> 0:25:56.680
<v Speaker 3>audio interface, and potentially for computer interface. So we want

0:25:56.720 --> 0:25:59.159
<v Speaker 3>to modularize it as much as possible and shrink the

0:25:59.240 --> 0:26:01.400
<v Speaker 3>cost as much as possible.

0:26:02.960 --> 0:26:06.679
<v Speaker 1>Jagadish mentions something I had never really considered, which is

0:26:06.680 --> 0:26:10.679
<v Speaker 1>the difficulty in finding gainful employment for those with visual impairments.

0:26:11.760 --> 0:26:14.560
<v Speaker 1>In the US and other developed nations, there are protocols

0:26:14.560 --> 0:26:19.200
<v Speaker 1>to provide reasonable accommodations to workers with disabilities, but globally

0:26:19.320 --> 0:26:22.359
<v Speaker 1>that has yet to become a common practice. With an

0:26:22.359 --> 0:26:26.240
<v Speaker 1>AI tool such as Jagadees being open source. It really

0:26:26.280 --> 0:26:28.199
<v Speaker 1>helps move the needle in terms of what those with

0:26:28.320 --> 0:26:32.920
<v Speaker 1>visual impairments can do for themselves. Lama also mentions BCI,

0:26:33.359 --> 0:26:37.760
<v Speaker 1>or brain computer interfaces. Most brain computer interfaces use electrical

0:26:37.880 --> 0:26:41.240
<v Speaker 1>energy of the brain to directly interface with computers or machines.

0:26:42.080 --> 0:26:44.760
<v Speaker 1>The best way to imagine BCI is the character Cyborg

0:26:45.200 --> 0:26:48.040
<v Speaker 1>from the Teen Titan series, where he developed superpowers from

0:26:48.040 --> 0:26:52.800
<v Speaker 1>interfacing computing technology with his biological self. I'm wondering what

0:26:52.840 --> 0:26:56.280
<v Speaker 1>accommodations are considered when the user has ADHD or some

0:26:56.400 --> 0:26:58.440
<v Speaker 1>other form of cognitive processing disorder.

0:27:01.800 --> 0:27:07.359
<v Speaker 2>So we've been looking specifically at utilizing BCI for communication

0:27:07.920 --> 0:27:11.240
<v Speaker 2>for locked in patients, right, And really, you don't want

0:27:11.240 --> 0:27:14.240
<v Speaker 2>to use BCI for communication unless you have to, because

0:27:14.280 --> 0:27:18.360
<v Speaker 2>it's not i mean, unless you're actually have something that's

0:27:18.400 --> 0:27:21.639
<v Speaker 2>implanted in your brain. If you're going outside of the skull,

0:27:21.840 --> 0:27:25.040
<v Speaker 2>you have a very very noisy signal. So that's in

0:27:25.080 --> 0:27:27.360
<v Speaker 2>some sense you can think of it as a last resort. However,

0:27:27.680 --> 0:27:30.200
<v Speaker 2>what you just mentioned is something very different, right, which

0:27:30.280 --> 0:27:35.160
<v Speaker 2>is utilizing BCI as another sensing modality for all sorts

0:27:35.240 --> 0:27:39.879
<v Speaker 2>of other inferences, not to communicate your intention, but to

0:27:40.000 --> 0:27:43.960
<v Speaker 2>actually understand your state, and that is something that is

0:27:44.240 --> 0:27:48.680
<v Speaker 2>you know, yes, can be totally utilized for understanding, for example,

0:27:49.200 --> 0:27:54.080
<v Speaker 2>things like emotional state and concentration and focus and all

0:27:54.119 --> 0:27:57.400
<v Speaker 2>sorts of things like that that can help in cases

0:27:57.440 --> 0:28:00.240
<v Speaker 2>where you have people with autism, for example, and they're

0:28:00.280 --> 0:28:04.480
<v Speaker 2>having a hard time expressing emotional state as it's actually

0:28:04.480 --> 0:28:05.119
<v Speaker 2>getting worse.

0:28:05.240 --> 0:28:05.440
<v Speaker 1>Right.

0:28:05.520 --> 0:28:07.920
<v Speaker 2>There has been actually quite a lot of interesting research

0:28:08.080 --> 0:28:11.399
<v Speaker 2>out of Jojia Tech, for example, specifically looking at that

0:28:11.560 --> 0:28:15.000
<v Speaker 2>as an interesting modality for these type of settings.

0:28:15.440 --> 0:28:20.040
<v Speaker 1>In terms of other individuals and organizations contributing, you've mentioned

0:28:20.240 --> 0:28:24.480
<v Speaker 1>both of you mentioned the open source initiative that Intel's pushing.

0:28:25.160 --> 0:28:29.639
<v Speaker 1>If individuals and organizations want to be involved, what's the

0:28:29.640 --> 0:28:33.160
<v Speaker 1>best way for them to get in and start contributing.

0:28:33.920 --> 0:28:37.920
<v Speaker 2>So basically with ACAT, we have essentially it's an open

0:28:37.960 --> 0:28:41.239
<v Speaker 2>source project, right, and it's open to developers. We have

0:28:41.280 --> 0:28:44.800
<v Speaker 2>different people contributing all sorts of different things, right. I mean,

0:28:44.840 --> 0:28:48.400
<v Speaker 2>for example, we've seen a lot of interest in having

0:28:48.560 --> 0:28:52.120
<v Speaker 2>ACAT beyond in different languages for people around the world, Right,

0:28:52.160 --> 0:28:54.960
<v Speaker 2>So we have a way for having people easily contribute

0:28:55.000 --> 0:28:58.560
<v Speaker 2>to extend it to other languages. As an example, extending

0:28:58.640 --> 0:29:00.600
<v Speaker 2>it to other sensing modalities and so on, so you

0:29:00.600 --> 0:29:03.560
<v Speaker 2>can go through that project and then just kind of

0:29:03.560 --> 0:29:06.959
<v Speaker 2>communicate and submit what you want and communicate with us

0:29:06.840 --> 0:29:09.560
<v Speaker 2>as the people who are still kind of overseeing the project.

0:29:10.040 --> 0:29:13.120
<v Speaker 2>There are also like specific groups that we work with

0:29:13.160 --> 0:29:15.760
<v Speaker 2>because we're trying to also kind of get access to

0:29:16.600 --> 0:29:19.600
<v Speaker 2>users that we can test that technology with. So for example,

0:29:19.640 --> 0:29:21.760
<v Speaker 2>you know the M and D or the ALS groups

0:29:21.800 --> 0:29:25.200
<v Speaker 2>and things like that. So depending on usually some of

0:29:25.240 --> 0:29:27.760
<v Speaker 2>these groups have access to a lot of the different

0:29:27.800 --> 0:29:30.719
<v Speaker 2>solutions and open source systems that exist there. So that

0:29:30.880 --> 0:29:33.120
<v Speaker 2>also is a way I mean not necessarily just for ACAT,

0:29:33.160 --> 0:29:34.720
<v Speaker 2>but more broadly.

0:29:35.000 --> 0:29:37.840
<v Speaker 3>We are seeing the very strong trend of a lot

0:29:37.840 --> 0:29:43.120
<v Speaker 3>of projects being open sourced, and because of this trend,

0:29:43.160 --> 0:29:46.640
<v Speaker 3>we're seeing a lot of powerful projects being democratized and

0:29:46.800 --> 0:29:51.280
<v Speaker 3>reaching people much easily than before. In fact, a lot

0:29:51.320 --> 0:29:54.320
<v Speaker 3>of companies are actually following this model, starting to switch

0:29:54.360 --> 0:29:56.680
<v Speaker 3>from a different model to open sourcing model, which is

0:29:56.720 --> 0:30:01.440
<v Speaker 3>fantastic for the community. It's just fantastic for the world. However,

0:30:01.480 --> 0:30:04.680
<v Speaker 3>there are certain things needs to be considered when developing

0:30:04.680 --> 0:30:07.719
<v Speaker 3>open source solution. One of the most important things is

0:30:08.720 --> 0:30:11.920
<v Speaker 3>how an open source project is defined, how can it

0:30:12.000 --> 0:30:15.280
<v Speaker 3>evolve by itself at some point. Initiatively there is going

0:30:15.320 --> 0:30:17.280
<v Speaker 3>to be primary contributors, but at some point there is

0:30:17.320 --> 0:30:18.560
<v Speaker 3>going to be a lot of people. You're going to

0:30:18.560 --> 0:30:22.160
<v Speaker 3>get contribution from all over the world, and this can

0:30:22.200 --> 0:30:26.400
<v Speaker 3>be both good and bad. If the response is very high,

0:30:26.520 --> 0:30:29.480
<v Speaker 3>then the initial contributors cannot handle it, right, it might

0:30:29.560 --> 0:30:32.240
<v Speaker 3>end up pretty damaging, right, But at the same time,

0:30:32.320 --> 0:30:35.640
<v Speaker 3>you need those responses, So it's important to know that

0:30:35.800 --> 0:30:39.160
<v Speaker 3>balance and also come up with how do we address

0:30:39.200 --> 0:30:41.000
<v Speaker 3>this as a process in general?

0:30:41.080 --> 0:30:41.360
<v Speaker 1>Right?

0:30:41.520 --> 0:30:43.600
<v Speaker 3>How can somebody contribue, but how can somebody create a

0:30:43.600 --> 0:30:46.160
<v Speaker 3>pr It's going to be completely democratized and there will

0:30:46.200 --> 0:30:49.040
<v Speaker 3>be more reviewers distributed throughout the world.

0:30:49.720 --> 0:30:53.040
<v Speaker 2>One of the things that I'm really happy to see

0:30:53.440 --> 0:30:57.680
<v Speaker 2>is really the amount of contribution in the open source

0:30:58.240 --> 0:31:02.720
<v Speaker 2>on all sorts of AI capabilities and language models, and

0:31:02.880 --> 0:31:07.760
<v Speaker 2>which really I think is enabling a lot of democratization

0:31:07.920 --> 0:31:12.160
<v Speaker 2>of AI, specifically to all of these different usages. Right,

0:31:12.200 --> 0:31:14.600
<v Speaker 2>because if you think about a sist of computing in

0:31:14.680 --> 0:31:17.760
<v Speaker 2>some sense, in many cases you're trying to compensate for

0:31:17.880 --> 0:31:21.840
<v Speaker 2>some sort of a sense impairment. Right, So if you're

0:31:21.880 --> 0:31:25.280
<v Speaker 2>able to actually use AI to help extract that sense

0:31:25.320 --> 0:31:29.840
<v Speaker 2>automatically from the world. Having access to that democratization in

0:31:29.920 --> 0:31:33.800
<v Speaker 2>AI models and algorithms is something that is really transformational

0:31:33.920 --> 0:31:36.440
<v Speaker 2>for this space. And if I remember, for example, like

0:31:36.480 --> 0:31:39.480
<v Speaker 2>in the past, right even getting something access to something

0:31:39.520 --> 0:31:42.320
<v Speaker 2>like an ASR was really hard to do right in

0:31:42.640 --> 0:31:44.640
<v Speaker 2>the open source, at least the level of quality that

0:31:44.680 --> 0:31:48.520
<v Speaker 2>you would see. But now lately, because of that quick movement,

0:31:49.080 --> 0:31:51.640
<v Speaker 2>you're seeing a lot of capability in the open source

0:31:51.680 --> 0:31:55.320
<v Speaker 2>that actually rivals that of the really you know, big companies,

0:31:55.320 --> 0:31:57.800
<v Speaker 2>which is I think is absolutely transformational.

0:31:58.280 --> 0:32:01.240
<v Speaker 1>Yeah, that's great, no question for both of you. Are

0:32:01.320 --> 0:32:04.360
<v Speaker 1>start with jaggedesh. You know, we're seeing AI being used

0:32:04.400 --> 0:32:08.640
<v Speaker 1>for accessibility efforts. Looking forward ten years, what's the number

0:32:08.640 --> 0:32:12.680
<v Speaker 1>one area which you would want AI to help in

0:32:12.720 --> 0:32:13.760
<v Speaker 1>this industry.

0:32:14.400 --> 0:32:17.400
<v Speaker 3>I'd be really pleased to see a system that is

0:32:17.480 --> 0:32:20.720
<v Speaker 3>really small that somebody can put in like a glass

0:32:20.840 --> 0:32:24.720
<v Speaker 3>or any firm, that goes totally unnoticed and it provides

0:32:24.840 --> 0:32:27.959
<v Speaker 3>all the capabilities of human eye. I think that'll be fantastic,

0:32:28.360 --> 0:32:31.480
<v Speaker 3>And same thing goes for other forms of disabilities. I

0:32:31.480 --> 0:32:35.920
<v Speaker 3>think that will be fantastic to see and in tenuous timeline,

0:32:35.960 --> 0:32:37.320
<v Speaker 3>I think it might be possible.

0:32:38.040 --> 0:32:40.480
<v Speaker 2>My number one area that I want to see solved,

0:32:40.680 --> 0:32:43.920
<v Speaker 2>not necessarily in a sist of computing, but actually climate change.

0:32:44.000 --> 0:32:46.800
<v Speaker 2>That's where I think, like we all need this otherwise

0:32:46.800 --> 0:32:48.040
<v Speaker 2>I'm not sure we're going to have a world to

0:32:48.080 --> 0:32:50.640
<v Speaker 2>actually do anything else. And in the area of asis

0:32:50.640 --> 0:32:53.160
<v Speaker 2>of computing, it's really what I was saying earlier, which

0:32:53.200 --> 0:32:58.080
<v Speaker 2>is I envision being able to compensate for every single

0:32:58.120 --> 0:33:03.240
<v Speaker 2>sense that the human is missing, and that, to Jaggediesh's point,

0:33:03.760 --> 0:33:06.240
<v Speaker 2>is only going to be possible if that is meeting

0:33:06.280 --> 0:33:08.880
<v Speaker 2>people where there are in the world, which means they

0:33:08.880 --> 0:33:11.680
<v Speaker 2>have to be sustainable, they have to be extremely power efficient,

0:33:11.720 --> 0:33:13.640
<v Speaker 2>they need to be robust enough to everything that it

0:33:13.680 --> 0:33:16.200
<v Speaker 2>hasn't seen in the world right, so, which is really

0:33:16.240 --> 0:33:19.480
<v Speaker 2>not necessarily where things are today, but you know, given

0:33:19.520 --> 0:33:22.360
<v Speaker 2>that appid improvement, I would really hope that that's where

0:33:22.360 --> 0:33:23.680
<v Speaker 2>we would be in ten years from now.

0:33:24.000 --> 0:33:27.479
<v Speaker 1>Excellent. Okay, thank you very much, thank you, thank you.

0:33:30.160 --> 0:33:32.760
<v Speaker 1>I would like to thank my guests Jaggedish Mahindra and

0:33:32.840 --> 0:33:36.400
<v Speaker 1>Lama Nachman for joining me on this episode of Technically Speaking,

0:33:36.600 --> 0:33:41.200
<v Speaker 1>an Intel podcast. I really enjoyed this conversation with Jagged

0:33:41.240 --> 0:33:43.480
<v Speaker 1>Ash and Lama. I love being able to delve into

0:33:43.480 --> 0:33:46.760
<v Speaker 1>the motivations of the why, but also the how. You

0:33:46.800 --> 0:33:49.200
<v Speaker 1>heard from Jaggedesh and the story of his visually impaired

0:33:49.240 --> 0:33:51.800
<v Speaker 1>friend being struck by a tree branch, and that was

0:33:51.800 --> 0:33:54.880
<v Speaker 1>the seed for his idea for an AI assistant backpack.

0:33:55.560 --> 0:33:59.240
<v Speaker 1>For me, this is the true technological empowerment, the ability

0:33:59.280 --> 0:34:01.680
<v Speaker 1>for individuals to use their skills and talent to make

0:34:01.720 --> 0:34:05.240
<v Speaker 1>a difference, taking action rather than just talking about it.

0:34:05.880 --> 0:34:08.880
<v Speaker 1>These are the true innovators. It was great to hear

0:34:08.920 --> 0:34:11.880
<v Speaker 1>of Lama's work with Professor Stephen Hawking and the context

0:34:11.920 --> 0:34:14.800
<v Speaker 1>of where system her team developed. What is so pleasing

0:34:14.800 --> 0:34:17.080
<v Speaker 1>to me was that it wasn't a Rolls Royce design,

0:34:17.440 --> 0:34:21.440
<v Speaker 1>but rather an elegant yet simple system of sensors connected

0:34:21.440 --> 0:34:24.160
<v Speaker 1>to a PC to allow Professor Hawking to interact and

0:34:24.160 --> 0:34:29.080
<v Speaker 1>communicate with others. Because of this relatively inexpensive solution, it

0:34:29.120 --> 0:34:31.920
<v Speaker 1>can be used by a wider range of people. This

0:34:32.080 --> 0:34:36.000
<v Speaker 1>is what democratization of technology does for the world. I

0:34:36.040 --> 0:34:38.839
<v Speaker 1>hope that Lama and Jaggedish's stories inspire you to take

0:34:38.840 --> 0:34:41.600
<v Speaker 1>the leap and contribute to improving the lives of people,

0:34:41.640 --> 0:34:46.400
<v Speaker 1>regardless of their background. Please join us on Tuesday, November

0:34:46.400 --> 0:34:50.160
<v Speaker 1>fourteenth for the next episode of Technically Speaking, an Intel podcast.

0:34:54.480 --> 0:34:57.960
<v Speaker 1>Technically Speaking was produced by Ruby Studios from iHeartRadio in

0:34:58.000 --> 0:35:01.879
<v Speaker 1>partnership with Intel, and hosted by me Graham Class. Our

0:35:01.920 --> 0:35:05.360
<v Speaker 1>executive producer is Molly Sosher, our EP of Post Production

0:35:05.480 --> 0:35:09.040
<v Speaker 1>is James Foster, and our Supervising producer is Nikair Swinton.

0:35:09.880 --> 0:35:13.080
<v Speaker 1>This episode was edited by Sierra Spreen and written and

0:35:13.120 --> 0:35:23.719
<v Speaker 1>produced by Tyree Rush.