WEBVTT - The Siri Story: Joining Apple

0:00:04.400 --> 0:00:07.800
<v Speaker 1>Welcome to tech Stuff, a production from I Heart Radio.

0:00:12.240 --> 0:00:15.040
<v Speaker 1>Hey there, and welcome to tech Stuff. I'm your host,

0:00:15.200 --> 0:00:18.080
<v Speaker 1>Jonathan Strickland. I'm an executive producer with I Heart Radio

0:00:18.120 --> 0:00:20.920
<v Speaker 1>and I love all things tech, and today we are

0:00:21.000 --> 0:00:27.760
<v Speaker 1>continuing our exploration of Apple's Virtual Assistant SIRIE, which originally

0:00:27.840 --> 0:00:31.840
<v Speaker 1>debuted as an official Apple feature on October four, two

0:00:31.840 --> 0:00:36.520
<v Speaker 1>th eleven. If you haven't listened to the previous episode,

0:00:36.560 --> 0:00:39.080
<v Speaker 1>I recommend you do that, as I covered a lot

0:00:39.120 --> 0:00:41.760
<v Speaker 1>of the backstory to SIRI, as well as some of

0:00:41.760 --> 0:00:45.640
<v Speaker 1>the technical challenges engineers faced as they built on the

0:00:45.720 --> 0:00:48.960
<v Speaker 1>underlying technologies that you know makes SIRIE a possibility in

0:00:49.000 --> 0:00:53.160
<v Speaker 1>the first place. However, I will do a super quick

0:00:53.240 --> 0:00:57.320
<v Speaker 1>summary of the important points just in case. So SIRIE

0:00:57.400 --> 0:01:00.880
<v Speaker 1>really started off as an offshoot of a Department of

0:01:00.960 --> 0:01:05.520
<v Speaker 1>Defense funded project to create a virtual assistant that military

0:01:05.560 --> 0:01:09.759
<v Speaker 1>commanders would be able to rely upon. S r I International,

0:01:10.120 --> 0:01:15.959
<v Speaker 1>a nonprofit scientific research organization, landed the contract and ultimately

0:01:16.000 --> 0:01:19.800
<v Speaker 1>would spin off a company called SIRI s I r

0:01:19.920 --> 0:01:23.840
<v Speaker 1>I that would explore how to take a subset of

0:01:23.880 --> 0:01:27.679
<v Speaker 1>the technologies that were developed for this Department of Defense

0:01:27.720 --> 0:01:31.200
<v Speaker 1>project and then use it in the consumer market with

0:01:31.360 --> 0:01:34.559
<v Speaker 1>an eye on mobile devices. Now, that was in two

0:01:34.560 --> 0:01:38.600
<v Speaker 1>thousand seven, just as consumer smartphones were becoming a thing,

0:01:38.680 --> 0:01:43.080
<v Speaker 1>primarily with of course, the Apple iPhone. By the company

0:01:43.120 --> 0:01:45.959
<v Speaker 1>had a working app that they submitted to the Apple

0:01:46.040 --> 0:01:50.600
<v Speaker 1>App Store, and a few weeks later, Apple CEO Steve

0:01:50.760 --> 0:01:55.200
<v Speaker 1>Jobs called up Dog Kittlause, the head of Sirie, to

0:01:55.440 --> 0:01:59.680
<v Speaker 1>talk about the app and ultimately to make an acquisition offer,

0:02:00.280 --> 0:02:04.600
<v Speaker 1>which the company ultimately accepted. A year later, and one

0:02:04.720 --> 0:02:08.959
<v Speaker 1>day before Steve Jobs's death, Apple would unveil Siri as

0:02:09.000 --> 0:02:12.440
<v Speaker 1>an Apple product, though as we know, the bulk of

0:02:12.480 --> 0:02:16.080
<v Speaker 1>the work on Siri had been done elsewhere. Now that's

0:02:16.120 --> 0:02:21.200
<v Speaker 1>the super fast summary of the actually really interesting backstory

0:02:21.320 --> 0:02:24.280
<v Speaker 1>on Sirie. You can listen to the previous episode to

0:02:24.360 --> 0:02:26.840
<v Speaker 1>get the full thing. But we've got a few other

0:02:26.880 --> 0:02:29.480
<v Speaker 1>things to clear up before we can continue down the

0:02:29.560 --> 0:02:35.240
<v Speaker 1>path of series development. As I mentioned in the previous episode, Sirie,

0:02:35.400 --> 0:02:40.720
<v Speaker 1>the company before Apple's acquisition, had arranged a deal with

0:02:40.919 --> 0:02:44.920
<v Speaker 1>the cell phone carrier company Verizon, and the deal was

0:02:45.000 --> 0:02:48.239
<v Speaker 1>that Siri was going to create an Android app version

0:02:48.440 --> 0:02:52.320
<v Speaker 1>of its service that would be a flagship feature on

0:02:52.480 --> 0:02:56.040
<v Speaker 1>Verizon Android phones. And this is a good time to

0:02:56.080 --> 0:02:59.760
<v Speaker 1>point out one of the major differences between Apple and Android.

0:03:00.360 --> 0:03:03.960
<v Speaker 1>Apple only allows iOS, you know, the operating system of

0:03:04.000 --> 0:03:06.720
<v Speaker 1>the iPhone and the iPad and stuff like that. They

0:03:06.720 --> 0:03:10.079
<v Speaker 1>only allow that to run on Apple products. So if

0:03:10.120 --> 0:03:14.440
<v Speaker 1>you want an iPhone or something running the iPhone operating system,

0:03:14.639 --> 0:03:17.119
<v Speaker 1>you have to go through Apple. I mean, you could

0:03:17.520 --> 0:03:21.000
<v Speaker 1>try to make some other device run iOS on top

0:03:21.080 --> 0:03:23.480
<v Speaker 1>of that device, but my point is that if you

0:03:23.480 --> 0:03:25.600
<v Speaker 1>want the real thing, you have to go to Apple.

0:03:25.639 --> 0:03:29.840
<v Speaker 1>There's nowhere else to go. Google when a totally different route.

0:03:29.960 --> 0:03:33.560
<v Speaker 1>They built out Android, and they made it available for

0:03:33.600 --> 0:03:37.440
<v Speaker 1>installation on all different types of handsets from different companies,

0:03:37.920 --> 0:03:40.600
<v Speaker 1>and so you had a lot of flavors of Android.

0:03:40.800 --> 0:03:43.560
<v Speaker 1>And I'm not just referring to the fact that Google

0:03:43.960 --> 0:03:48.280
<v Speaker 1>uses dessert names for various versions of Android. What I

0:03:48.280 --> 0:03:52.120
<v Speaker 1>actually mean is that you could find one handset manufacturer

0:03:52.520 --> 0:03:56.800
<v Speaker 1>that would preinstall a certain suite of apps on their phones,

0:03:57.360 --> 0:04:00.200
<v Speaker 1>or a cell phone service provider like Verizon might do

0:04:00.320 --> 0:04:03.400
<v Speaker 1>the same thing. So while you could shop around with

0:04:03.480 --> 0:04:07.400
<v Speaker 1>different handsets and providers for an Android phone, the versions

0:04:07.480 --> 0:04:10.600
<v Speaker 1>might look a little different and have different flagship apps

0:04:10.840 --> 0:04:14.480
<v Speaker 1>on the home page. In some cases, you might come

0:04:14.520 --> 0:04:17.680
<v Speaker 1>across a lot of bloatwear. Now those are apps that

0:04:17.720 --> 0:04:21.240
<v Speaker 1>you don't really want or maybe you never even use them,

0:04:21.240 --> 0:04:25.760
<v Speaker 1>but they're pre installed and sometimes they're impossible to remove

0:04:25.839 --> 0:04:30.880
<v Speaker 1>from the phone without going to some pretty big effort. Anyway,

0:04:31.040 --> 0:04:34.440
<v Speaker 1>Verizon wanted Sirie to be an intrinsic app on the

0:04:34.520 --> 0:04:40.480
<v Speaker 1>Android smartphones moving forward, essentially making siri uh an Android

0:04:40.520 --> 0:04:42.800
<v Speaker 1>feature the way it would turn out to be an

0:04:42.800 --> 0:04:47.479
<v Speaker 1>iPhone feature. So Verizon and Sirie began making arrangements in

0:04:47.480 --> 0:04:51.320
<v Speaker 1>the fall of two thousand nine, but Verizon did not

0:04:52.120 --> 0:04:56.039
<v Speaker 1>actually acquire Sirie the company. If it had, our story

0:04:56.080 --> 0:05:00.279
<v Speaker 1>would be very different. Verizon reportedly planned to make Srie

0:05:00.360 --> 0:05:03.800
<v Speaker 1>exclusive to Android, but the companies were doing this through

0:05:03.839 --> 0:05:08.440
<v Speaker 1>a planned partnership, and during the meanwhile, there just was

0:05:08.480 --> 0:05:12.279
<v Speaker 1>no exclusivity in place, so Sirie the company continued to

0:05:12.320 --> 0:05:16.040
<v Speaker 1>develop Sirie the app for the iPhone. The app debut

0:05:16.120 --> 0:05:19.919
<v Speaker 1>a few months after the Verizon deal, and that you

0:05:19.960 --> 0:05:22.480
<v Speaker 1>know was that A couple of weeks later, Steve Jobs

0:05:22.480 --> 0:05:25.559
<v Speaker 1>was calling up kit Loss to talk about bringing SyRI

0:05:25.680 --> 0:05:29.880
<v Speaker 1>on board Apple as an acquisition. And Apple doesn't typically

0:05:29.920 --> 0:05:31.960
<v Speaker 1>acquire that many companies, so this was kind of a

0:05:31.960 --> 0:05:35.680
<v Speaker 1>big deal. That acquisition would take a few months to complete,

0:05:35.960 --> 0:05:41.279
<v Speaker 1>and business journals reported on it around April. The financial

0:05:41.320 --> 0:05:45.240
<v Speaker 1>details of the deal were not made public. Business insider

0:05:45.320 --> 0:05:48.320
<v Speaker 1>analysts estimated that the deal may have fallen somewhere in

0:05:48.360 --> 0:05:51.719
<v Speaker 1>the one hundred to two hundred million dollar range, which

0:05:51.760 --> 0:05:55.640
<v Speaker 1>is a princely sum. Indeed, they based that off the

0:05:55.680 --> 0:05:59.480
<v Speaker 1>fact that Siri had successfully raised twenty four million dollars

0:05:59.480 --> 0:06:02.480
<v Speaker 1>and invest monts up to that point. Now, one of

0:06:02.520 --> 0:06:06.880
<v Speaker 1>the conditions of this acquisition was that Sirie was going

0:06:06.920 --> 0:06:10.520
<v Speaker 1>to be an Apple exclusive feature. It would be baked

0:06:10.520 --> 0:06:14.599
<v Speaker 1>into the basic functionality of iOS moving forward, and a

0:06:14.680 --> 0:06:18.479
<v Speaker 1>consequence of this is that the Verizon deal that Siri

0:06:18.600 --> 0:06:22.080
<v Speaker 1>had made would be null and void and Siri would

0:06:22.160 --> 0:06:24.920
<v Speaker 1>go dark for more than a year as Apple would

0:06:24.920 --> 0:06:28.400
<v Speaker 1>bring the serie team over to Cupertino at Apple HQ

0:06:29.240 --> 0:06:31.719
<v Speaker 1>and then began making some pretty big changes to the

0:06:31.760 --> 0:06:35.000
<v Speaker 1>app in order to integrate it more closely with iOS.

0:06:35.839 --> 0:06:39.840
<v Speaker 1>Not all of those tweaks were additions. For example, the

0:06:39.920 --> 0:06:43.719
<v Speaker 1>original Sirie app, the one that could still make some

0:06:43.880 --> 0:06:49.000
<v Speaker 1>pretty sassy responses to requests, could make restaurant reservations on

0:06:49.080 --> 0:06:52.720
<v Speaker 1>behalf of the user. When Apple would debut Siri as

0:06:52.720 --> 0:06:57.680
<v Speaker 1>an integral iOS feature, it would not have that capability,

0:06:57.920 --> 0:07:00.920
<v Speaker 1>though the service would eventually get the feature back by

0:07:01.040 --> 0:07:04.599
<v Speaker 1>two thousand twelve. My point being that some of the

0:07:04.680 --> 0:07:08.880
<v Speaker 1>changes that Apple was making was to remove functionality from Siri.

0:07:09.279 --> 0:07:12.320
<v Speaker 1>The serie team had worked to secure agreements with various

0:07:12.400 --> 0:07:16.960
<v Speaker 1>other services which allowed Sirie the app to interoperate with

0:07:17.080 --> 0:07:21.600
<v Speaker 1>those services. But those agreements were, you know, with a

0:07:21.720 --> 0:07:27.560
<v Speaker 1>much smaller, nimble and private company, right, Siri was this tiny, entiteam.

0:07:27.760 --> 0:07:31.840
<v Speaker 1>Once Apple entered the picture, all of that changed. Apple

0:07:32.000 --> 0:07:36.720
<v Speaker 1>is Well, let's say that Apple is particular in how

0:07:36.760 --> 0:07:40.640
<v Speaker 1>its technology can interact with stuff that is made by

0:07:40.720 --> 0:07:45.760
<v Speaker 1>other companies. Not all of those earlier relationships that Sirie

0:07:45.880 --> 0:07:49.480
<v Speaker 1>had made when it was an independent company would survive

0:07:49.560 --> 0:07:52.960
<v Speaker 1>the transition over winds SII became part of Apple. On

0:07:53.080 --> 0:07:57.040
<v Speaker 1>top of that, Apple's focus was much more broad than

0:07:57.400 --> 0:08:01.240
<v Speaker 1>series focus had been. I mentioned in the previous episode

0:08:01.280 --> 0:08:05.280
<v Speaker 1>that the discipline of natural language processing, that is, trying

0:08:05.320 --> 0:08:09.800
<v Speaker 1>to make systems that can handle the way we typically

0:08:09.920 --> 0:08:13.840
<v Speaker 1>speak or communicate with one another, as opposed to us

0:08:13.880 --> 0:08:16.000
<v Speaker 1>having to learn how to speak in a way that

0:08:16.040 --> 0:08:19.800
<v Speaker 1>a computer can, you know, follow what we're saying. Well, anyway,

0:08:19.800 --> 0:08:22.960
<v Speaker 1>it's particularly tricky because we humans have different ways to

0:08:23.000 --> 0:08:25.840
<v Speaker 1>express things, and none of them tend to be in

0:08:25.880 --> 0:08:28.800
<v Speaker 1>the way that machines process information. And one of the

0:08:28.840 --> 0:08:32.720
<v Speaker 1>big complicating factors that I mentioned in our previous episode

0:08:32.760 --> 0:08:36.840
<v Speaker 1>is that there are thousands of languages and it's hard

0:08:36.840 --> 0:08:40.960
<v Speaker 1>to develop programs that have good natural language processing for

0:08:41.120 --> 0:08:45.520
<v Speaker 1>a single language like English, but it becomes exponentially more

0:08:45.559 --> 0:08:49.480
<v Speaker 1>difficult as you try to support additional languages, each with

0:08:49.520 --> 0:08:54.239
<v Speaker 1>their own peculiarities. Apple, a global company, was selling iPhones

0:08:54.240 --> 0:08:57.600
<v Speaker 1>in more than seventy countries at that point. For Siri

0:08:57.960 --> 0:09:02.360
<v Speaker 1>to be a flagship feature would be across all those phones.

0:09:02.480 --> 0:09:04.280
<v Speaker 1>It would need to be able to respond to a

0:09:04.320 --> 0:09:07.760
<v Speaker 1>lot of different languages. Now, upon its initial release as

0:09:07.800 --> 0:09:12.160
<v Speaker 1>an Apple feature, Siri would just be limited to English, French,

0:09:12.360 --> 0:09:16.280
<v Speaker 1>and German, but it had the goal of expanding those

0:09:16.320 --> 0:09:20.599
<v Speaker 1>capabilities to other languages as well. Building in that capability

0:09:20.920 --> 0:09:23.560
<v Speaker 1>would take a lot of effort and time, and so

0:09:23.800 --> 0:09:26.200
<v Speaker 1>while the serie team might have been able to go

0:09:26.240 --> 0:09:30.280
<v Speaker 1>on deeper dives and you know, give Sirie more incredible abilities,

0:09:30.679 --> 0:09:33.839
<v Speaker 1>a lot of their work had to shift towards localization.

0:09:34.120 --> 0:09:36.520
<v Speaker 1>And this is not meant to be a knock against

0:09:36.640 --> 0:09:40.840
<v Speaker 1>Sirie or Apple. It's just an acknowledgement that the job

0:09:40.880 --> 0:09:45.400
<v Speaker 1>of the designers was really challenging and there were only

0:09:45.440 --> 0:09:47.640
<v Speaker 1>so many hours in a day to make it all work.

0:09:48.320 --> 0:09:52.320
<v Speaker 1>So the acquisition happened. In April twenty ten, the Verizon

0:09:52.400 --> 0:09:57.600
<v Speaker 1>deal died as a result. The serie team moved to Cupertino, California,

0:09:57.679 --> 0:10:00.120
<v Speaker 1>and a lot of work was being done behind in

0:10:00.240 --> 0:10:04.120
<v Speaker 1>the scenes now At the same time, Steve Jobs's health

0:10:04.520 --> 0:10:08.240
<v Speaker 1>was on the decline. He had been diagnosed with pancreatic

0:10:08.280 --> 0:10:11.520
<v Speaker 1>cancer back in two thousand four. In two thousand nine,

0:10:11.600 --> 0:10:15.360
<v Speaker 1>he had temporarily stepped down as CEO to recover from

0:10:15.400 --> 0:10:19.760
<v Speaker 1>what he originally reported as a hormone imbalance, though it

0:10:19.840 --> 0:10:23.880
<v Speaker 1>was later revealed he had undergone a liver transplant. Tim Cook,

0:10:24.000 --> 0:10:27.400
<v Speaker 1>Apple's chief operating officer at the time, would take over

0:10:27.480 --> 0:10:31.160
<v Speaker 1>as sort of an interim CEO, and then Jobs returned

0:10:31.160 --> 0:10:34.120
<v Speaker 1>to work six months later. Jobs was leading the company

0:10:34.120 --> 0:10:36.839
<v Speaker 1>when he made the move to acquire Sirie, but he

0:10:36.880 --> 0:10:41.960
<v Speaker 1>would subsequently suffer more health setbacks. In January eleven, Jobs

0:10:42.000 --> 0:10:46.400
<v Speaker 1>would again take medical leave from Apple, but he would

0:10:46.400 --> 0:10:50.840
<v Speaker 1>continue to Services CEO. During his leave, Cook would take

0:10:50.920 --> 0:10:54.480
<v Speaker 1>over the day to day operations of the company. Jobs

0:10:54.480 --> 0:10:57.719
<v Speaker 1>would then take the stage one final time during the

0:10:57.760 --> 0:11:01.439
<v Speaker 1>two thousand eleven Worldwide Developed First Conference or w w

0:11:01.720 --> 0:11:06.320
<v Speaker 1>d C that took place from June six to June tenth,

0:11:06.360 --> 0:11:09.680
<v Speaker 1>two thousand eleven. He received a standing ovation from the

0:11:09.720 --> 0:11:13.040
<v Speaker 1>crowd before going into the introduction for the week's events,

0:11:13.120 --> 0:11:15.880
<v Speaker 1>and at that event, he talked about the new versions

0:11:15.960 --> 0:11:19.760
<v Speaker 1>of the Apple Mac operating system as well as the

0:11:19.840 --> 0:11:24.680
<v Speaker 1>latest build of iOS, but SIRIE was not quite ready

0:11:24.800 --> 0:11:27.640
<v Speaker 1>to debut, and so there was no mention of the

0:11:27.679 --> 0:11:31.360
<v Speaker 1>innovative service that was going to be reserved for October.

0:11:32.040 --> 0:11:36.000
<v Speaker 1>While the serie team prepared for the official debut of

0:11:36.160 --> 0:11:39.480
<v Speaker 1>SIRIE as an Apple feature, Jobs had to make a

0:11:39.520 --> 0:11:44.040
<v Speaker 1>tough call. On August twenty four, two thousand eleven, Steve

0:11:44.120 --> 0:11:48.640
<v Speaker 1>Jobs resigned as Chief executive Officer of Apple. Tim Cook

0:11:48.679 --> 0:11:53.000
<v Speaker 1>would become the new CEO of the company. Jobs's health

0:11:53.080 --> 0:11:55.280
<v Speaker 1>had declined to a point where he felt he could

0:11:55.360 --> 0:11:58.120
<v Speaker 1>no longer devote energy to running the company that he

0:11:58.200 --> 0:12:01.319
<v Speaker 1>had co founded back in the ninetees seventies, and there

0:12:01.360 --> 0:12:04.920
<v Speaker 1>was a lot of speculation at the time that Apple

0:12:05.120 --> 0:12:08.640
<v Speaker 1>might fall apart without him. After all, when Jobs had

0:12:08.760 --> 0:12:12.120
<v Speaker 1>left Apple in the mid nineteen eighties, the company went

0:12:12.160 --> 0:12:15.920
<v Speaker 1>on to make some pretty awful mistakes that would lead

0:12:15.920 --> 0:12:19.720
<v Speaker 1>it to the brink of bankruptcy. And when Jobs returned

0:12:19.760 --> 0:12:23.480
<v Speaker 1>to Apple in the late nineteen nineties, he would lead

0:12:23.520 --> 0:12:26.280
<v Speaker 1>the company to turn things around and then reach a

0:12:26.360 --> 0:12:29.960
<v Speaker 1>dominant position that no other company had really seen before

0:12:30.040 --> 0:12:34.560
<v Speaker 1>in the consumer marketplace. Steve Jobs seemed integral to the

0:12:34.600 --> 0:12:38.040
<v Speaker 1>corporate identity of Apple, kind of the same way that

0:12:38.400 --> 0:12:42.920
<v Speaker 1>you know, Walt Disney seems inseparable from the Disney Company.

0:12:43.160 --> 0:12:45.679
<v Speaker 1>So people would still say, you know, what would Walt

0:12:45.720 --> 0:12:47.520
<v Speaker 1>do well? A lot of people would say at Apple,

0:12:47.559 --> 0:12:51.319
<v Speaker 1>what would Steve Jobs do well. In October four, two

0:12:51.360 --> 0:12:53.800
<v Speaker 1>thousand and eleven, Apple would hold a press conference to

0:12:53.960 --> 0:12:59.160
<v Speaker 1>launch the iPhone for S. This was technically the fifth iPhone,

0:12:59.600 --> 0:13:02.800
<v Speaker 1>following the original iPhone. Then we had the three G

0:13:03.240 --> 0:13:07.199
<v Speaker 1>so the iPhone two was the three g UM. There's

0:13:07.240 --> 0:13:10.920
<v Speaker 1>reasons for all these numbers and why they are kind

0:13:10.920 --> 0:13:13.360
<v Speaker 1>of confusing, but I won't go into all of them. Anyway.

0:13:13.360 --> 0:13:16.000
<v Speaker 1>After the three G we got the three G S.

0:13:16.520 --> 0:13:19.920
<v Speaker 1>Then we got the iPhone four and that's all because

0:13:20.000 --> 0:13:22.480
<v Speaker 1>numbers are hard. But to be more fair to Apple,

0:13:22.840 --> 0:13:26.320
<v Speaker 1>the iPhone four S would end up bridging a gap

0:13:26.640 --> 0:13:30.960
<v Speaker 1>between the older iPhone four and the future iPhone five.

0:13:31.440 --> 0:13:34.720
<v Speaker 1>The iPhone four S looked a lot like the iPhone four,

0:13:34.760 --> 0:13:38.239
<v Speaker 1>but it did have several hardware improvements, including a faster processor,

0:13:38.800 --> 0:13:42.160
<v Speaker 1>and it also had something that Apple said justified adding

0:13:42.200 --> 0:13:46.640
<v Speaker 1>the letter S to the phone it had, Siri. At

0:13:46.640 --> 0:13:51.000
<v Speaker 1>the October fourth event, which was just called a special event,

0:13:51.400 --> 0:13:54.120
<v Speaker 1>Tim Cook took the stage for the first time as

0:13:54.200 --> 0:13:58.400
<v Speaker 1>CEO to announce a product launch. He handed the presentation

0:13:58.440 --> 0:14:01.200
<v Speaker 1>over to Phil Schiller, who had frequently stood in for

0:14:01.240 --> 0:14:05.239
<v Speaker 1>Steve Jobs at various keynotes during jobs as leaves of absence,

0:14:05.559 --> 0:14:09.080
<v Speaker 1>and Schiller, in turn would bring up Scott Forstall, head

0:14:09.200 --> 0:14:14.720
<v Speaker 1>of iOS. Shortly after announcing Sirie and Forestall would give

0:14:14.760 --> 0:14:18.719
<v Speaker 1>the audience an overview of series capabilities a demo of them,

0:14:18.720 --> 0:14:22.080
<v Speaker 1>as well as well as stress multiple times that Siri

0:14:22.400 --> 0:14:25.840
<v Speaker 1>was in beta mode and just in case you don't

0:14:25.840 --> 0:14:29.640
<v Speaker 1>know what that means, a beta build is a stage

0:14:29.640 --> 0:14:33.680
<v Speaker 1>in development where you're trying to refine a product before

0:14:33.720 --> 0:14:36.120
<v Speaker 1>you get to the point where you you know you

0:14:36.120 --> 0:14:39.600
<v Speaker 1>you release it. And so if you have an alpha build,

0:14:39.840 --> 0:14:42.680
<v Speaker 1>that's really more of a work in progress, and it's

0:14:42.720 --> 0:14:46.520
<v Speaker 1>one that could have potentially enormous changes made to the

0:14:46.560 --> 0:14:49.560
<v Speaker 1>product before it ever goes to production. So really you're

0:14:49.560 --> 0:14:52.960
<v Speaker 1>just trying to make stuff work in alpha. A beta

0:14:53.040 --> 0:14:55.520
<v Speaker 1>build is meant to be one that's fairly close to

0:14:55.560 --> 0:14:58.960
<v Speaker 1>what a company plans to release, but it may still

0:14:59.040 --> 0:15:01.400
<v Speaker 1>need some tweak to make it work just right. So

0:15:01.720 --> 0:15:03.360
<v Speaker 1>you can think of a beta build as being like

0:15:03.560 --> 0:15:08.400
<v Speaker 1>almost ready. And Apple doesn't typically allow people to get

0:15:08.400 --> 0:15:11.720
<v Speaker 1>a look at stuff that's in beta. The company has

0:15:11.720 --> 0:15:15.920
<v Speaker 1>a history of locking things down pretty tightly before showing

0:15:15.960 --> 0:15:18.920
<v Speaker 1>it off. So perhaps the fact that this was an

0:15:18.920 --> 0:15:23.600
<v Speaker 1>exception points at how Series integration into iOS didn't go

0:15:23.720 --> 0:15:28.440
<v Speaker 1>without some setbacks. Now, all that being said, the demo

0:15:28.800 --> 0:15:31.600
<v Speaker 1>at this special event went off without a hitch. Siri

0:15:31.760 --> 0:15:36.880
<v Speaker 1>didn't make any embarrassing mistakes and interpreted Forstall's requests correctly.

0:15:37.520 --> 0:15:40.520
<v Speaker 1>It retrieved relevant information, it didn't have to ask for

0:15:40.680 --> 0:15:44.240
<v Speaker 1>Stall to repeat anything, and it also didn't sass back

0:15:44.640 --> 0:15:49.320
<v Speaker 1>the personal assistant with an attitude had been toned down significantly.

0:15:49.880 --> 0:15:53.840
<v Speaker 1>Siri was ready for its journey with Apple. I've got

0:15:53.880 --> 0:15:55.880
<v Speaker 1>a lot more to say about Sirie, but before I

0:15:55.960 --> 0:16:06.520
<v Speaker 1>get to that, let's take a quick break. At the

0:16:06.560 --> 0:16:09.680
<v Speaker 1>special event, Scott Forstall would walk through some of the

0:16:09.720 --> 0:16:13.560
<v Speaker 1>initial capabilities Siri would have with the launch of the

0:16:13.640 --> 0:16:18.240
<v Speaker 1>new iPhone for s That included looking up information on Wikipedia,

0:16:18.880 --> 0:16:23.640
<v Speaker 1>or providing directions to a landmark, or looking up restaurant

0:16:23.640 --> 0:16:26.360
<v Speaker 1>options that are near you, and that kind of thing,

0:16:26.400 --> 0:16:31.720
<v Speaker 1>including a really neat geo fencing feature. Geo fencing involves

0:16:31.840 --> 0:16:36.400
<v Speaker 1>location data. Essentially, you create an instruction that's dependent upon

0:16:36.560 --> 0:16:41.120
<v Speaker 1>your physical location on the planet. So you might say,

0:16:41.520 --> 0:16:44.120
<v Speaker 1>remind me when I leave the house to swing by

0:16:44.120 --> 0:16:47.680
<v Speaker 1>the post office to your virtual assistant. Now that command

0:16:48.040 --> 0:16:52.000
<v Speaker 1>would then have to be interpreted by the assistants system.

0:16:52.040 --> 0:16:55.440
<v Speaker 1>So if it were Sirie, Siri would monitor your phone's

0:16:55.480 --> 0:16:59.720
<v Speaker 1>location and if your phone indicated that it was leaving

0:17:00.120 --> 0:17:04.160
<v Speaker 1>the area generally known as being home, Siri would then

0:17:04.240 --> 0:17:07.639
<v Speaker 1>send a push notification to the phone to remind you

0:17:07.720 --> 0:17:10.480
<v Speaker 1>that you need to go to the post office. Geo

0:17:10.560 --> 0:17:12.760
<v Speaker 1>fencing is one of those features I think is really neat,

0:17:13.480 --> 0:17:16.720
<v Speaker 1>and yet I don't actually make that much use out

0:17:16.760 --> 0:17:19.000
<v Speaker 1>of it. But maybe I'm the odd one out do

0:17:19.040 --> 0:17:21.479
<v Speaker 1>you guys use a lot of geo fencing features. I

0:17:21.480 --> 0:17:24.680
<v Speaker 1>remember I looked at them as the possibility of setting

0:17:24.720 --> 0:17:26.639
<v Speaker 1>them up so when I was in the office, I

0:17:26.680 --> 0:17:30.480
<v Speaker 1>would get fewer notifications, for example, But I never really

0:17:30.520 --> 0:17:33.439
<v Speaker 1>got to do what I was hoping it would do.

0:17:33.680 --> 0:17:35.280
<v Speaker 1>And I don't know if that was a failure on

0:17:35.320 --> 0:17:38.600
<v Speaker 1>the part of the technology or more likely user failure.

0:17:38.640 --> 0:17:40.359
<v Speaker 1>So if you guys have a lot of, you know,

0:17:40.400 --> 0:17:42.960
<v Speaker 1>experience doing geo fencing type stuff, I'd be curious to

0:17:43.000 --> 0:17:47.879
<v Speaker 1>hear your thoughts. Anyway, the presentation went pretty well. People

0:17:48.080 --> 0:17:50.920
<v Speaker 1>seemed interested in SIRIE. However, I would say this wasn't

0:17:50.960 --> 0:17:53.240
<v Speaker 1>the sort of reaction that really makes it into clip

0:17:53.320 --> 0:17:56.520
<v Speaker 1>montages or anything. You didn't see, you know, people losing

0:17:56.560 --> 0:18:01.399
<v Speaker 1>their minds over the the implications of SIRIE. But it

0:18:01.520 --> 0:18:06.480
<v Speaker 1>was definitely positive. It wasn't like a clunker or anything.

0:18:06.920 --> 0:18:09.720
<v Speaker 1>I'm just not sure how many people grasped how complicated

0:18:09.760 --> 0:18:13.480
<v Speaker 1>the accomplishment actually was. It didn't really get how hard

0:18:14.400 --> 0:18:17.840
<v Speaker 1>these problems were to solve in the back end. Now,

0:18:17.880 --> 0:18:20.920
<v Speaker 1>one other thing they likely did not grasp is that

0:18:21.080 --> 0:18:23.880
<v Speaker 1>Siri didn't actually start out as an Apple product, which

0:18:23.960 --> 0:18:27.000
<v Speaker 1>we know because of the episodes I've just done. But

0:18:27.880 --> 0:18:31.920
<v Speaker 1>during that presentation there really wasn't an indication that it

0:18:32.000 --> 0:18:35.480
<v Speaker 1>was from some other developer. You would walk away thinking

0:18:35.920 --> 0:18:39.880
<v Speaker 1>that Apple developed the whole thing in full at Apple HQ,

0:18:40.119 --> 0:18:43.240
<v Speaker 1>although that wasn't the case now. As I mentioned, this

0:18:43.320 --> 0:18:46.479
<v Speaker 1>event happened on October four, two thousand eleven, and the

0:18:46.520 --> 0:18:52.240
<v Speaker 1>following day, on October five, Steve Jobs passed away. Understandably,

0:18:52.359 --> 0:18:57.000
<v Speaker 1>that piece of news eclipsed all other Apple news that week.

0:18:57.040 --> 0:19:00.240
<v Speaker 1>Heck it it eclipse pretty much all tech news that week.

0:19:00.880 --> 0:19:03.399
<v Speaker 1>Jobs had been a pivotal figure in the world of

0:19:03.400 --> 0:19:08.320
<v Speaker 1>technology in general and Silicon Valley in particular. Under his leadership,

0:19:08.359 --> 0:19:11.639
<v Speaker 1>Apple had gone from teetering on bankruptcy to a company

0:19:11.680 --> 0:19:14.960
<v Speaker 1>that was defining the next generation of hardware with products

0:19:15.000 --> 0:19:18.800
<v Speaker 1>like the iPod, the iPhone, and the iPad. His loss

0:19:19.080 --> 0:19:22.679
<v Speaker 1>was felt around the world, and Sirie perhaps got a

0:19:22.680 --> 0:19:26.760
<v Speaker 1>little less attention as a result. The iPhone Forest would

0:19:26.760 --> 0:19:30.480
<v Speaker 1>go on sale the following week, on October four. Within

0:19:30.520 --> 0:19:33.680
<v Speaker 1>the first weekend of sales, Apple would sell more than

0:19:33.720 --> 0:19:37.119
<v Speaker 1>four million units, which meant that the iPhone Forest broke

0:19:37.200 --> 0:19:41.080
<v Speaker 1>previous records of the fastest selling product ever at that point.

0:19:41.680 --> 0:19:45.399
<v Speaker 1>And about a week after that, Kit Loss Dog kit Loss,

0:19:45.440 --> 0:19:49.280
<v Speaker 1>one of the co founders of Sirie the Company, would

0:19:49.359 --> 0:19:53.080
<v Speaker 1>leave Apple Now. According to reports, the parting was on

0:19:53.200 --> 0:19:55.919
<v Speaker 1>good terms, it was amicable, and it was one that

0:19:56.040 --> 0:19:59.280
<v Speaker 1>Dog had been preparing for over the second half of

0:19:59.359 --> 0:20:01.720
<v Speaker 1>two thousand all eleven. He said he wanted to spend

0:20:01.720 --> 0:20:03.879
<v Speaker 1>more time with his family, and they were living in

0:20:04.000 --> 0:20:06.720
<v Speaker 1>Chicago than For those of you who are not familiar

0:20:06.760 --> 0:20:10.600
<v Speaker 1>with US geography, Chicago, Illinois is about two thousand, one

0:20:11.040 --> 0:20:14.879
<v Speaker 1>fifty miles or three thousand, four hundred sixty kilometers away

0:20:14.880 --> 0:20:20.160
<v Speaker 1>from Cupertino, California. It is not close. He also wanted

0:20:20.280 --> 0:20:25.159
<v Speaker 1>to pursue new entrepreneurial opportunities, so his initial motivation was

0:20:25.200 --> 0:20:28.119
<v Speaker 1>to get Sirie the Company up and running, and that

0:20:28.240 --> 0:20:30.040
<v Speaker 1>had turned out to be a huge success, so he

0:20:30.080 --> 0:20:32.720
<v Speaker 1>was ready to move on to a new challenge. One

0:20:32.800 --> 0:20:36.119
<v Speaker 1>thing I haven't really talked about in these episodes, however,

0:20:36.280 --> 0:20:38.639
<v Speaker 1>is series voice, and I feel like I should at

0:20:38.680 --> 0:20:42.280
<v Speaker 1>least touch on that. So the original voices for Sirie

0:20:42.400 --> 0:20:45.400
<v Speaker 1>came from clips of audio that were recorded by a

0:20:45.480 --> 0:20:48.880
<v Speaker 1>voice actor named Susan Bennett. And when I say original voice,

0:20:48.880 --> 0:20:53.440
<v Speaker 1>I mean the American English voice of Sirie and Bennett

0:20:54.040 --> 0:20:56.159
<v Speaker 1>actually lived in Atlanta at the time. I have no

0:20:56.200 --> 0:20:58.639
<v Speaker 1>idea if she still does. But when she made the

0:20:58.640 --> 0:21:01.760
<v Speaker 1>recordings she did, and here's the kicker, she didn't even

0:21:01.800 --> 0:21:04.840
<v Speaker 1>know she was going to be the voice of Syrie.

0:21:05.080 --> 0:21:08.680
<v Speaker 1>She landed a gig through a voice acting service called

0:21:08.880 --> 0:21:12.200
<v Speaker 1>GM Voices kind of like a talent agency for voice

0:21:12.240 --> 0:21:16.280
<v Speaker 1>over actors, and they specialize in finding voice talent to

0:21:16.320 --> 0:21:21.560
<v Speaker 1>provide audio clips for various automated services. Typically a company

0:21:21.600 --> 0:21:25.040
<v Speaker 1>called scan Soft wanted a voice actor to say a

0:21:25.080 --> 0:21:29.080
<v Speaker 1>ton of phrases, mostly nonsense stuff that made you know,

0:21:29.240 --> 0:21:34.520
<v Speaker 1>no real connective tissue sense or anything. And they wanted

0:21:34.560 --> 0:21:36.600
<v Speaker 1>to make all these different words sounds to build out

0:21:36.640 --> 0:21:41.679
<v Speaker 1>a computerized speech database, essentially to build out the building

0:21:41.720 --> 0:21:44.800
<v Speaker 1>blocks for American English speech, all the little phonemes that

0:21:44.960 --> 0:21:48.159
<v Speaker 1>make up the sounds that then in turn makeup the

0:21:48.240 --> 0:21:53.000
<v Speaker 1>spoken word. Scan Soft took the recorded audio and broke

0:21:53.040 --> 0:21:56.639
<v Speaker 1>it down into those different based phonemes of English, and

0:21:56.680 --> 0:22:01.880
<v Speaker 1>through a process called concatenation, which is simply means that

0:22:01.880 --> 0:22:06.359
<v Speaker 1>that you're stringing together units to make a whole, they

0:22:06.440 --> 0:22:09.640
<v Speaker 1>could construct speech that way, taking all the little basic

0:22:09.760 --> 0:22:14.840
<v Speaker 1>sounds and making up words and sentences that way. During development,

0:22:14.880 --> 0:22:18.359
<v Speaker 1>the serie team turned to a company called Nuance for

0:22:18.600 --> 0:22:22.720
<v Speaker 1>technologies related to voice generation and speech recognition. Nuance had

0:22:22.800 --> 0:22:27.040
<v Speaker 1>merged with scan Soft, and so Bennett's voice, which she

0:22:27.080 --> 0:22:30.560
<v Speaker 1>had recorded way back in two thousand five, was in

0:22:31.400 --> 0:22:35.520
<v Speaker 1>nuances speech database tool set, and someone on the project,

0:22:35.600 --> 0:22:38.600
<v Speaker 1>either at Sirie or at Nuance, chose Bennett's voice for

0:22:38.640 --> 0:22:41.360
<v Speaker 1>the Sirie app and that would make the leap over

0:22:41.400 --> 0:22:44.760
<v Speaker 1>to Apple when Sirie the company was acquired, and so

0:22:44.800 --> 0:22:48.840
<v Speaker 1>in October two thousand eleven, six years after she had

0:22:48.920 --> 0:22:52.919
<v Speaker 1>made the original recordings, she would unknowingly become the voice

0:22:53.080 --> 0:22:56.880
<v Speaker 1>of Sirie. Over in the UK, John Briggs would become

0:22:56.880 --> 0:23:00.560
<v Speaker 1>the first British male voice for Sirie, and down under

0:23:00.640 --> 0:23:05.480
<v Speaker 1>Karen Jacobson would become the Ausee Serie and series voice.

0:23:05.720 --> 0:23:08.679
<v Speaker 1>Or voices I guess I should say would change a

0:23:08.720 --> 0:23:12.879
<v Speaker 1>few times, once in with the introduction of iOS seven,

0:23:13.400 --> 0:23:16.639
<v Speaker 1>and it changed again in twenty nineteen with the introduction

0:23:16.680 --> 0:23:19.879
<v Speaker 1>of iOS thirteen, but in each case the goal was

0:23:19.920 --> 0:23:23.160
<v Speaker 1>to move towards a more natural tone and delivery. As

0:23:23.200 --> 0:23:26.080
<v Speaker 1>the state of the art in voice generation improved and

0:23:26.119 --> 0:23:30.480
<v Speaker 1>became less robotic. If you listen to those early examples

0:23:30.520 --> 0:23:34.080
<v Speaker 1>of Siri, it does get a little robotic in places.

0:23:34.560 --> 0:23:37.600
<v Speaker 1>Back in eleven series, speech was pretty good, but it

0:23:37.680 --> 0:23:41.520
<v Speaker 1>wasn't perfect, and occasionally Siri would mispronounce something as it

0:23:41.560 --> 0:23:44.840
<v Speaker 1>would use one set of phone emes instead of another.

0:23:45.440 --> 0:23:48.439
<v Speaker 1>And it actually makes me think of a joke in

0:23:48.520 --> 0:23:52.080
<v Speaker 1>the show The Play that Goes Wrong Uh, in which

0:23:52.119 --> 0:23:55.160
<v Speaker 1>the characters of the show are putting on a play

0:23:55.200 --> 0:24:00.320
<v Speaker 1>and one of the actors has a tendency to mispronounce words,

0:24:00.400 --> 0:24:05.679
<v Speaker 1>so in one example, he pronounces the word philanthropist as philanthropist,

0:24:06.480 --> 0:24:08.840
<v Speaker 1>And that sort of thing could happen with Syria on

0:24:08.880 --> 0:24:12.119
<v Speaker 1>occasion where there would just be a string of sounds

0:24:12.520 --> 0:24:15.120
<v Speaker 1>to represent a word, but it would be slightly wrong,

0:24:15.560 --> 0:24:19.680
<v Speaker 1>and it would often just be funny. Upon the features

0:24:19.720 --> 0:24:23.560
<v Speaker 1>debut with the iPhone for US, Sirie received some fairly

0:24:23.760 --> 0:24:28.359
<v Speaker 1>positive reviews. Over at Wired, Brian x Chen wrote, quote

0:24:28.680 --> 0:24:33.240
<v Speaker 1>the fifth generation iPhones, superb camera and speedy dual core

0:24:33.320 --> 0:24:37.680
<v Speaker 1>processor are classy additions, but Siri is the reason people

0:24:37.680 --> 0:24:42.080
<v Speaker 1>should buy this phone end quote. Chen praised the apps utility,

0:24:42.320 --> 0:24:44.520
<v Speaker 1>pointing out that it took less time for him to

0:24:44.520 --> 0:24:48.480
<v Speaker 1>do certain tasks using Siri than it would have if

0:24:48.520 --> 0:24:51.600
<v Speaker 1>he had to use his phone and type things in manually.

0:24:51.920 --> 0:24:54.600
<v Speaker 1>He also praised the versatility of the app, including its

0:24:54.600 --> 0:24:58.159
<v Speaker 1>ability to interpret a fairly wide range of commands and

0:24:58.200 --> 0:25:02.960
<v Speaker 1>return with relevant responses, and the positive comments didn't stop there.

0:25:02.960 --> 0:25:05.280
<v Speaker 1>Others praise the app for having what appeared to be

0:25:05.440 --> 0:25:09.280
<v Speaker 1>a wry sense of humor. Hailey Suka Yama. The Washington

0:25:09.359 --> 0:25:12.520
<v Speaker 1>Post actually collected a few examples of people attempting to

0:25:12.520 --> 0:25:15.640
<v Speaker 1>be a little cheeky with Siri and how the assistant

0:25:15.680 --> 0:25:18.920
<v Speaker 1>would often respond in kind, though not with the sort

0:25:18.920 --> 0:25:21.800
<v Speaker 1>of foul language of the original Sirie app from back

0:25:21.840 --> 0:25:26.439
<v Speaker 1>in so For example, when Jason Snell of Macworld asked

0:25:26.520 --> 0:25:29.600
<v Speaker 1>Sirie what was the meaning of life, the universe, and everything,

0:25:30.119 --> 0:25:34.560
<v Speaker 1>Sirie didn't hesitate. Siri responded with forty two, of course,

0:25:34.880 --> 0:25:37.320
<v Speaker 1>which is a reference to the classic sci fi comedy

0:25:37.560 --> 0:25:41.240
<v Speaker 1>The Hitchhiker's Guide to the Galaxy by Douglas Adams. The

0:25:41.280 --> 0:25:46.119
<v Speaker 1>Harvard Business Review proclaimed that Siri quote is as revolutionary

0:25:46.160 --> 0:25:51.040
<v Speaker 1>as the mac end quote, which is a pretty grand statement.

0:25:51.400 --> 0:25:54.760
<v Speaker 1>The journal pointed out that Sirie represented a real step

0:25:54.800 --> 0:25:59.600
<v Speaker 1>forward and creating a natural language interface for computers. For decades,

0:26:00.000 --> 0:26:03.960
<v Speaker 1>computer users had to learn how best to use computers.

0:26:04.200 --> 0:26:06.080
<v Speaker 1>We had to learn how to use a mouse and

0:26:06.160 --> 0:26:09.000
<v Speaker 1>a keyboard. We had to learn how to navigate file

0:26:09.160 --> 0:26:12.600
<v Speaker 1>systems and hunt down programs and type things in in

0:26:12.640 --> 0:26:15.080
<v Speaker 1>a way that a computer could process it. But Siri

0:26:15.640 --> 0:26:19.320
<v Speaker 1>represented a big change in that dynamic. With a system

0:26:19.400 --> 0:26:22.320
<v Speaker 1>like Sirie, you could potentially interact with a computer in

0:26:22.359 --> 0:26:25.640
<v Speaker 1>a much more natural way. The machine would learn how

0:26:25.680 --> 0:26:28.280
<v Speaker 1>to work best with you, not the other way around.

0:26:28.359 --> 0:26:30.720
<v Speaker 1>It would be like the computers in Star Trek the

0:26:30.760 --> 0:26:33.800
<v Speaker 1>next generation. You could speak out what it was you

0:26:33.880 --> 0:26:36.960
<v Speaker 1>wanted to do. The computer would then interpret it and

0:26:37.080 --> 0:26:40.560
<v Speaker 1>do whatever it was you were asking. And this change

0:26:40.600 --> 0:26:43.639
<v Speaker 1>in interfaces could open up a lot of opportunities for

0:26:43.680 --> 0:26:47.040
<v Speaker 1>people who might find it difficult or even impossible to

0:26:47.080 --> 0:26:51.080
<v Speaker 1>interact with computers using the more traditional mouse and keyboard approach,

0:26:51.600 --> 0:26:54.520
<v Speaker 1>or touch screens for that matter. Having something like Siri

0:26:54.720 --> 0:26:59.280
<v Speaker 1>as an alternative to these other interfaces would improve accessibility.

0:26:59.320 --> 0:27:01.680
<v Speaker 1>And that's own that's no small thing. That's not to

0:27:01.720 --> 0:27:05.080
<v Speaker 1>be dismissed. There are millions of people who either cannot

0:27:05.119 --> 0:27:08.439
<v Speaker 1>fully take advantage of technology or they have to go

0:27:08.520 --> 0:27:12.080
<v Speaker 1>to much greater efforts to access it due to challenges

0:27:12.160 --> 0:27:16.920
<v Speaker 1>like vision impairments or mobility issues, and systems like Siri

0:27:17.320 --> 0:27:21.680
<v Speaker 1>could potentially help people gain a higher level of autonomy.

0:27:21.800 --> 0:27:25.320
<v Speaker 1>Of course, with SIRIE, that process was a bit more

0:27:25.359 --> 0:27:29.280
<v Speaker 1>complicated on the back end than you might otherwise imagine.

0:27:30.040 --> 0:27:33.320
<v Speaker 1>The phone wasn't really doing all or even most of

0:27:33.400 --> 0:27:37.520
<v Speaker 1>the work for SIRIE. The voice input was being recorded.

0:27:37.720 --> 0:27:40.399
<v Speaker 1>That audio clip would then be sent to a remote server,

0:27:40.960 --> 0:27:43.959
<v Speaker 1>which would analyze the voice recording, you know, and and

0:27:44.080 --> 0:27:48.440
<v Speaker 1>transcribe it. Other servers would activate to pull relevant information

0:27:48.480 --> 0:27:52.240
<v Speaker 1>from whatever sources were needed, and then that information would

0:27:52.240 --> 0:27:55.320
<v Speaker 1>make its way back to the end users phone and

0:27:55.480 --> 0:27:59.159
<v Speaker 1>series response. So to the user, it seems like the

0:27:59.160 --> 0:28:01.960
<v Speaker 1>phone is doing all the work for you, but in reality,

0:28:02.040 --> 0:28:05.359
<v Speaker 1>the phone is more like a conduit and there's a

0:28:05.400 --> 0:28:08.800
<v Speaker 1>sophisticated network of computers that we're doing the heavy lifting

0:28:08.800 --> 0:28:12.119
<v Speaker 1>in the background. The reason I even mentioned that is

0:28:12.160 --> 0:28:14.960
<v Speaker 1>because it's really good to remember the power that sits

0:28:15.080 --> 0:28:18.760
<v Speaker 1>behind stuff like Siri. It's a pretty big jump to

0:28:18.880 --> 0:28:22.760
<v Speaker 1>go from a relatively thin device that is sending data

0:28:22.840 --> 0:28:26.680
<v Speaker 1>to a large system that's behind the scenes, and then

0:28:26.800 --> 0:28:29.800
<v Speaker 1>make that move to a standalone computer that can interact

0:28:29.840 --> 0:28:33.080
<v Speaker 1>with a person through natural language. The fact that most

0:28:33.119 --> 0:28:36.440
<v Speaker 1>of us aren't interfacing with our electronics through voice might

0:28:36.440 --> 0:28:39.760
<v Speaker 1>be an indicator that this is still a pretty hard

0:28:39.840 --> 0:28:43.800
<v Speaker 1>problem to solve, even when we have a story as

0:28:43.840 --> 0:28:48.480
<v Speaker 1>successful as Siri. Although that success is you know, it's

0:28:48.520 --> 0:28:52.360
<v Speaker 1>not it's not a total success, as we will learn

0:28:52.400 --> 0:28:56.320
<v Speaker 1>as we go on. Uh So, Yeah, the phrase success

0:28:56.360 --> 0:28:59.479
<v Speaker 1>story is a little dangerous to use because it's important

0:28:59.520 --> 0:29:02.760
<v Speaker 1>to acknowledge that not everyone was thrilled with the apps performance.

0:29:03.200 --> 0:29:06.960
<v Speaker 1>In the spring of twelve, a group of Apple customers

0:29:06.960 --> 0:29:10.320
<v Speaker 1>began to file lawsuits against the company. They would have

0:29:10.440 --> 0:29:14.440
<v Speaker 1>ultimately become a class action lawsuit, and the lawsuit alleged

0:29:14.680 --> 0:29:20.440
<v Speaker 1>that the plaintiffs had been falsely advertised to regarding series capabilities.

0:29:20.480 --> 0:29:23.920
<v Speaker 1>Apple had released a bunch of different commercials that were

0:29:23.920 --> 0:29:27.160
<v Speaker 1>showcasing what Siri should be able to do, and these

0:29:27.160 --> 0:29:30.080
<v Speaker 1>people were saying, Sirie doesn't do that, or at least

0:29:30.080 --> 0:29:31.880
<v Speaker 1>it doesn't do it the way you said it does,

0:29:32.880 --> 0:29:38.000
<v Speaker 1>so that became the crux of this lawsuit. They complained

0:29:38.040 --> 0:29:42.360
<v Speaker 1>that reality was falling far short of what had been promised.

0:29:43.200 --> 0:29:47.360
<v Speaker 1>Apple's response to the lawsuits was understandable, but also a

0:29:47.440 --> 0:29:51.040
<v Speaker 1>little bit caddy. Not that I'm judging, but it just

0:29:51.120 --> 0:29:55.280
<v Speaker 1>comes across kind of that way. So here's an excerpt

0:29:55.400 --> 0:29:59.040
<v Speaker 1>from Apple's response to the complaints, as was reported on

0:29:59.200 --> 0:30:03.280
<v Speaker 1>by the Wall Street Journal. Quote they and the thing

0:30:03.320 --> 0:30:08.040
<v Speaker 1>in this case means the plaintiffs offer only general descriptions

0:30:08.080 --> 0:30:12.880
<v Speaker 1>of Apple's advertisements, in complete summaries of Apple's website materials,

0:30:12.920 --> 0:30:18.120
<v Speaker 1>and vague descriptions of their alleged and highly individualized disappointment

0:30:18.160 --> 0:30:23.320
<v Speaker 1>with Sirie Telling Lee. Although plaintiff's claim they became dissatisfied

0:30:23.320 --> 0:30:27.920
<v Speaker 1>with Series performance soon after purchasing their iPhones, they made

0:30:27.960 --> 0:30:31.800
<v Speaker 1>no attempt to avail themselves of Apple's thirty day return

0:30:31.920 --> 0:30:37.360
<v Speaker 1>policy or one year warranty, which remains in effect. Instead,

0:30:37.760 --> 0:30:41.080
<v Speaker 1>they seek to take an alleged personal grievance about the

0:30:41.120 --> 0:30:45.080
<v Speaker 1>purported performance of a popular product and turn it into

0:30:45.160 --> 0:30:50.880
<v Speaker 1>a nationwide class action under California's consumer protection statutes. The

0:30:50.920 --> 0:30:53.640
<v Speaker 1>complaint does not come close to meeting the heavy burden

0:30:53.720 --> 0:30:59.520
<v Speaker 1>necessary to sustain such claims end quote. Now, judges would

0:31:00.120 --> 0:31:05.840
<v Speaker 1>subsequently agree with Apple that the plaintiffs claims really lacked

0:31:05.960 --> 0:31:09.160
<v Speaker 1>specificity and they were just too vague to constitute a

0:31:09.240 --> 0:31:13.440
<v Speaker 1>valid argument. As a result, the class action lawsuit would

0:31:13.440 --> 0:31:19.480
<v Speaker 1>be dismissed in and again in But this would not

0:31:19.520 --> 0:31:23.320
<v Speaker 1>be the only case of legal trouble, and Sirie will

0:31:23.480 --> 0:31:26.680
<v Speaker 1>touch on that again a little bit later. But when

0:31:26.680 --> 0:31:29.920
<v Speaker 1>we come back, we'll talk about when some other members

0:31:29.960 --> 0:31:33.600
<v Speaker 1>of the original serie team took their leave from Apple.

0:31:34.160 --> 0:31:44.080
<v Speaker 1>But first let's take a quick break. By late spring,

0:31:45.520 --> 0:31:48.680
<v Speaker 1>Series Shine was starting to wear off a little bit

0:31:48.720 --> 0:31:53.000
<v Speaker 1>after its debut. The journal nine to five MAC, dedicated

0:31:53.080 --> 0:31:57.240
<v Speaker 1>to All Things Apple, ran a survey that found of

0:31:57.280 --> 0:31:59.720
<v Speaker 1>those who responded said that Sirie needed a bit more

0:31:59.760 --> 0:32:03.320
<v Speaker 1>were but overall was helpful. So they saw the potential,

0:32:03.920 --> 0:32:07.680
<v Speaker 1>but they agreed that it needed a little more time

0:32:07.680 --> 0:32:10.360
<v Speaker 1>in the oven now. To be fair to Apple, for

0:32:10.560 --> 0:32:14.520
<v Speaker 1>Stall had stressed at the October event that Sirie was

0:32:14.520 --> 0:32:17.120
<v Speaker 1>still in beta. He said it a few times, so

0:32:17.200 --> 0:32:20.320
<v Speaker 1>the company was certainly aware that while the tool was

0:32:20.360 --> 0:32:22.640
<v Speaker 1>really interesting and had a lot of potential, it was

0:32:22.720 --> 0:32:27.000
<v Speaker 1>not perfect. Whether consumers were unaware of the caveats that

0:32:27.080 --> 0:32:30.920
<v Speaker 1>Forestall had made, or their expectations were just ultra high

0:32:31.080 --> 0:32:35.120
<v Speaker 1>due to Apple's string of monumental successes. Some folks ended

0:32:35.200 --> 0:32:38.960
<v Speaker 1>up being a little less impressed than others. In September

0:32:38.960 --> 0:32:42.800
<v Speaker 1>twenty twelve, tech journalists reported that Adam Share, another co

0:32:42.960 --> 0:32:47.760
<v Speaker 1>founder of the serie company, had left Apple. Actually he

0:32:47.840 --> 0:32:54.000
<v Speaker 1>had departed Apple in June of but wasn't reported until September.

0:32:54.120 --> 0:32:57.680
<v Speaker 1>He had been serving as an engineer director for iOS.

0:32:58.440 --> 0:33:02.360
<v Speaker 1>He left the company to quote pursue other projects end quote.

0:33:02.720 --> 0:33:05.680
<v Speaker 1>One of those projects was change dot org, the site

0:33:05.680 --> 0:33:09.480
<v Speaker 1>that allows people to create and distribute various petitions. Another

0:33:09.560 --> 0:33:13.080
<v Speaker 1>project was viv Labs, which Samsung would later acquire and

0:33:13.080 --> 0:33:17.800
<v Speaker 1>which also centered around the voice assistant features. And he

0:33:17.880 --> 0:33:21.360
<v Speaker 1>also occasionally performs magic, you know, like in stadium shows

0:33:21.400 --> 0:33:24.000
<v Speaker 1>and for Penn and Teller that's not a joke. He

0:33:24.200 --> 0:33:30.160
<v Speaker 1>is a literal magician. Outside of a relatively small text circle,

0:33:30.960 --> 0:33:33.720
<v Speaker 1>not a lot of folks really knew about him in

0:33:33.840 --> 0:33:37.120
<v Speaker 1>his role of developing Sirie, however, and so I'm not

0:33:37.200 --> 0:33:40.600
<v Speaker 1>sure that many people registered his departure. It wasn't like

0:33:40.920 --> 0:33:45.000
<v Speaker 1>the general public associated Sirie with a specific real life person.

0:33:45.720 --> 0:33:48.440
<v Speaker 1>Another person who did leave Apple around that time, and

0:33:48.480 --> 0:33:52.200
<v Speaker 1>also not this time, not by choice, was Scott Forstall.

0:33:52.520 --> 0:33:54.840
<v Speaker 1>That was the man who had introduced Siri at that

0:33:54.920 --> 0:33:57.720
<v Speaker 1>Apple special event in October two thousand and eleven, he

0:33:57.800 --> 0:34:01.320
<v Speaker 1>had headed up iOS. But a year later in October,

0:34:02.560 --> 0:34:05.200
<v Speaker 1>Apple announced that four Sol would be leaving the company

0:34:05.200 --> 0:34:07.760
<v Speaker 1>by the end of the year. And in this case,

0:34:08.239 --> 0:34:10.680
<v Speaker 1>we kind of know what was going on that led

0:34:10.719 --> 0:34:14.520
<v Speaker 1>to four Stalls departure. So when Syrie debuted, it was

0:34:14.520 --> 0:34:18.280
<v Speaker 1>with the iPhone four S and iOS five. The next

0:34:18.320 --> 0:34:22.400
<v Speaker 1>build of iOS, which thank goodness, was just called iOS six,

0:34:22.960 --> 0:34:28.359
<v Speaker 1>came with a very nasty albatross around its metaphorical neck,

0:34:28.960 --> 0:34:34.000
<v Speaker 1>and that albatross was Apple Maps. Now you may, oh,

0:34:34.040 --> 0:34:37.720
<v Speaker 1>my Drew Gees have forgotten what a mess Apple Maps

0:34:37.880 --> 0:34:41.640
<v Speaker 1>was when it first came out, So let me remind you.

0:34:42.560 --> 0:34:45.880
<v Speaker 1>At the time Apple was cutting ties with Google, the

0:34:45.920 --> 0:34:48.960
<v Speaker 1>Android operating system was beginning to get some real traction,

0:34:49.480 --> 0:34:53.319
<v Speaker 1>and Google's strategy to aim at more moderate markets than

0:34:53.560 --> 0:34:57.680
<v Speaker 1>Apple's prestige products allowed Android to get a big user

0:34:57.719 --> 0:35:01.839
<v Speaker 1>base pretty quickly. One thing Apple did in response was

0:35:02.000 --> 0:35:05.640
<v Speaker 1>to give Google Maps the boot off of iOS. It

0:35:05.640 --> 0:35:09.279
<v Speaker 1>would eventually come back, but Apple famously said, nope, We're

0:35:09.280 --> 0:35:15.120
<v Speaker 1>gonna disable that functionality and replace it with Apple Maps only.

0:35:15.360 --> 0:35:18.440
<v Speaker 1>The trouble was that Apple Maps wasn't as reliable as

0:35:18.480 --> 0:35:21.520
<v Speaker 1>Google Maps, and Google Maps wasn't perfect by any stretch

0:35:21.560 --> 0:35:26.240
<v Speaker 1>of the imagination, but Apple Maps was notably worse. Apple's

0:35:26.320 --> 0:35:29.719
<v Speaker 1>version wasn't as detailed as Google Maps and didn't have

0:35:29.800 --> 0:35:33.640
<v Speaker 1>some of the information that Google included in its applications. So,

0:35:33.719 --> 0:35:37.920
<v Speaker 1>for example, if you were visiting a new city, you

0:35:37.960 --> 0:35:40.520
<v Speaker 1>could use Google Maps to see how to take public

0:35:40.560 --> 0:35:44.080
<v Speaker 1>transit to get around town. Apple Maps did not have

0:35:44.160 --> 0:35:48.520
<v Speaker 1>that functionality when it launched. Worse than that, for some cities,

0:35:48.560 --> 0:35:53.479
<v Speaker 1>particularly cities outside of the United States, major landmarks would

0:35:53.480 --> 0:35:56.920
<v Speaker 1>be mislabeled or would appear in the wrong location on

0:35:56.920 --> 0:36:01.880
<v Speaker 1>a map. It just wasn't good. Apple would ultimately issue

0:36:01.920 --> 0:36:05.560
<v Speaker 1>an apology, which is a huge thing. I mean, Apple

0:36:05.640 --> 0:36:08.759
<v Speaker 1>just doesn't do that typically. I mean it was a

0:36:08.800 --> 0:36:11.680
<v Speaker 1>really rare move for the company to make, but for

0:36:11.880 --> 0:36:16.760
<v Speaker 1>Stall refused to sign the apology. Essentially, he wasn't admitting

0:36:16.800 --> 0:36:20.720
<v Speaker 1>any accountability for the failures of iOS six and Apple

0:36:20.800 --> 0:36:25.520
<v Speaker 1>Maps in particular. Tim Cook would fire Forestall partly because

0:36:25.560 --> 0:36:28.120
<v Speaker 1>of that and for a few other reasons. Tim Cook

0:36:28.680 --> 0:36:33.160
<v Speaker 1>said he wanted a harmonious workplace, which implied that Forstall

0:36:33.920 --> 0:36:36.560
<v Speaker 1>was not the easiest person to work with, and also

0:36:36.600 --> 0:36:40.680
<v Speaker 1>that he wanted a workplace without politics, also implying that

0:36:40.760 --> 0:36:45.480
<v Speaker 1>Forrestall perhaps was playing the little politics in the office

0:36:45.520 --> 0:36:49.040
<v Speaker 1>as well. There were some who argued that Forrestall was

0:36:49.120 --> 0:36:53.280
<v Speaker 1>likely to claim credit where perhaps he wasn't fully justified

0:36:53.320 --> 0:36:55.440
<v Speaker 1>in doing so. I don't know if any of that

0:36:55.600 --> 0:36:58.960
<v Speaker 1>is true, but that's kind of the reporting around it.

0:36:59.400 --> 0:37:03.279
<v Speaker 1>For Stall wasn't one of series founders. He rather was

0:37:03.320 --> 0:37:06.839
<v Speaker 1>sort of the liaison between the serie team and the

0:37:06.840 --> 0:37:10.200
<v Speaker 1>public once Serrie made the move to Apple, because Forestall

0:37:10.360 --> 0:37:15.560
<v Speaker 1>was overseeing iOS in general. Tom Gruber, the third co founder,

0:37:16.000 --> 0:37:18.840
<v Speaker 1>would actually stick with Apple for several more years. He

0:37:18.960 --> 0:37:22.000
<v Speaker 1>was the last one to remain with the company, the

0:37:22.000 --> 0:37:24.880
<v Speaker 1>other two co founders had already left. He would retire

0:37:24.920 --> 0:37:29.719
<v Speaker 1>in ten to pursue his interests, which included ocean conservation

0:37:30.000 --> 0:37:34.720
<v Speaker 1>and photography. But let's get back to twelve. As Apple

0:37:34.800 --> 0:37:38.680
<v Speaker 1>continued to change and adjust to a post Steve Jobs world,

0:37:39.360 --> 0:37:42.360
<v Speaker 1>other companies began to bring products out in an attempt

0:37:42.480 --> 0:37:47.200
<v Speaker 1>to compete with Sirie. Google had its own virtual assistant,

0:37:47.480 --> 0:37:49.800
<v Speaker 1>which sort of grew out of a voice to text

0:37:49.880 --> 0:37:53.840
<v Speaker 1>search feature. It would eventually be known as Google Assistant,

0:37:54.400 --> 0:37:58.520
<v Speaker 1>and Samsung brought out the s Voice Assistant. Amazon would

0:37:58.560 --> 0:38:03.640
<v Speaker 1>introduce Alexa in that's the same year that Microsoft would

0:38:03.640 --> 0:38:07.719
<v Speaker 1>bring its voice assistant Cortana to Windows. Though these days

0:38:07.719 --> 0:38:11.120
<v Speaker 1>I think Alexa, Syrie, and Google Assistant are really the

0:38:11.160 --> 0:38:15.040
<v Speaker 1>three best known virtual assistance. I think Alexa has the

0:38:15.160 --> 0:38:19.400
<v Speaker 1>clear edge on the other two. Cortana has seen a

0:38:19.400 --> 0:38:25.080
<v Speaker 1>lot of quote unquote her functionality reduced since twenty nineteen.

0:38:25.160 --> 0:38:29.040
<v Speaker 1>Microsoft is kind of removed a lot of Cortana's features.

0:38:30.400 --> 0:38:33.560
<v Speaker 1>Apparently the deep integration into Windows just didn't work out,

0:38:34.239 --> 0:38:36.399
<v Speaker 1>and that kind of comes back to Apple to One

0:38:36.400 --> 0:38:39.840
<v Speaker 1>of the big criticisms about SIRIE was that once Apple

0:38:40.200 --> 0:38:44.000
<v Speaker 1>brought it on board, once they acquired the company, Apple

0:38:44.080 --> 0:38:47.520
<v Speaker 1>made a lot of decisions that ultimately either reduced series

0:38:47.560 --> 0:38:51.440
<v Speaker 1>functionality or cut it off from growing the way it

0:38:51.480 --> 0:38:56.440
<v Speaker 1>could have otherwise. Before Sirie moved over to Apple, the

0:38:56.480 --> 0:39:00.120
<v Speaker 1>team was hard at work integrating the assistance functionality with

0:39:00.160 --> 0:39:04.440
<v Speaker 1>a whole suite of apps from different developers, and ideally

0:39:04.719 --> 0:39:06.680
<v Speaker 1>you would reach a point where you could just tell

0:39:06.719 --> 0:39:10.080
<v Speaker 1>your device what you wanted and the assistant would work

0:39:10.080 --> 0:39:14.560
<v Speaker 1>out the rest, pulling information or accessing whatever apps are necessary.

0:39:14.560 --> 0:39:16.479
<v Speaker 1>In order to do the thing you wanted it to do.

0:39:17.040 --> 0:39:20.440
<v Speaker 1>In fact, you can imagine an implementation of this in

0:39:20.480 --> 0:39:24.600
<v Speaker 1>which the assistant could even compare different ways to accomplish

0:39:24.680 --> 0:39:27.359
<v Speaker 1>the same task you give it and then go with

0:39:27.400 --> 0:39:30.120
<v Speaker 1>whatever is the best option. So let me give you

0:39:30.160 --> 0:39:34.600
<v Speaker 1>an example. Since the start of the pandemic, I've been

0:39:34.640 --> 0:39:37.960
<v Speaker 1>staying at home, and on occasion I will order food

0:39:38.160 --> 0:39:41.160
<v Speaker 1>from a nearby restaurant for delivery. You'll get a craving

0:39:41.200 --> 0:39:42.879
<v Speaker 1>and I'll say, all right, I want to eat such

0:39:42.880 --> 0:39:44.920
<v Speaker 1>and such, so I'm gonna order from so and so.

0:39:45.600 --> 0:39:49.759
<v Speaker 1>Only not every restaurant works with every delivery service, right,

0:39:50.120 --> 0:39:51.799
<v Speaker 1>So that kind of means that I have to make

0:39:51.840 --> 0:39:56.120
<v Speaker 1>accounts with all these different services like Postmates or door

0:39:56.239 --> 0:39:58.560
<v Speaker 1>Dash or grub Hub, or here in Atlanta we have

0:39:58.600 --> 0:40:02.480
<v Speaker 1>one called zifty. You get the point. And then let's

0:40:02.480 --> 0:40:05.560
<v Speaker 1>say I get a craving for some amazing Thai food

0:40:05.600 --> 0:40:07.640
<v Speaker 1>and I have a specific restaurant I have in mind.

0:40:08.000 --> 0:40:10.160
<v Speaker 1>I have to figure out which of the services works

0:40:10.200 --> 0:40:13.399
<v Speaker 1>with that restaurant, right uh. And if more than one

0:40:13.680 --> 0:40:16.120
<v Speaker 1>of the services works with the restaurant, I might even

0:40:16.120 --> 0:40:18.480
<v Speaker 1>want to compare them to find out who's gonna get

0:40:18.480 --> 0:40:21.600
<v Speaker 1>me the food the quickest, or who's going to do

0:40:21.600 --> 0:40:24.400
<v Speaker 1>it the cheapest, like who's gonna have the lowest service fees.

0:40:24.640 --> 0:40:28.800
<v Speaker 1>So in an ideal implementation of a virtual assistant, the

0:40:28.840 --> 0:40:31.040
<v Speaker 1>assistant would do all of that work for me. It

0:40:31.040 --> 0:40:34.120
<v Speaker 1>would compare the different options, and it might give me

0:40:34.160 --> 0:40:36.920
<v Speaker 1>an informed choice. You know, maybe I just tell the

0:40:36.960 --> 0:40:39.919
<v Speaker 1>assistant that I just want the food and I wanted

0:40:39.960 --> 0:40:42.080
<v Speaker 1>as fast as possible, and that's what matters to me.

0:40:42.160 --> 0:40:44.440
<v Speaker 1>So just find the services going to get it here

0:40:44.480 --> 0:40:48.480
<v Speaker 1>the fastest based on their estimations. Or maybe I might say,

0:40:48.920 --> 0:40:52.360
<v Speaker 1>you know what, money's tight, I want the least expensive

0:40:52.440 --> 0:40:56.960
<v Speaker 1>way to get that order here, and so on. But

0:40:57.040 --> 0:41:01.240
<v Speaker 1>now expand that single use case across us, all apps

0:41:01.400 --> 0:41:05.560
<v Speaker 1>and all potential uses of the assistant, and you see

0:41:05.560 --> 0:41:08.799
<v Speaker 1>where it could really come in handy. Now, I'm not

0:41:08.880 --> 0:41:12.760
<v Speaker 1>the worst when it comes to installing tons of apps

0:41:12.760 --> 0:41:15.960
<v Speaker 1>on my phone and then forgetting about them and never

0:41:16.080 --> 0:41:18.680
<v Speaker 1>using them. But I'm not the best about it either.

0:41:19.000 --> 0:41:22.960
<v Speaker 1>About once every five or six months, I'll go through

0:41:23.000 --> 0:41:25.400
<v Speaker 1>my phone and just start uninstalling apps where I'm like,

0:41:25.440 --> 0:41:28.399
<v Speaker 1>I don't remember the last time I've used this. So

0:41:28.600 --> 0:41:31.680
<v Speaker 1>there are undoubtedly apps that would be helpful in various

0:41:31.680 --> 0:41:34.799
<v Speaker 1>situations that I get into throughout the day, but I

0:41:34.840 --> 0:41:37.120
<v Speaker 1>never think about them even though I do have them

0:41:37.120 --> 0:41:41.359
<v Speaker 1>on my phone. However, a virtual assistant, if implemented well,

0:41:41.920 --> 0:41:44.799
<v Speaker 1>might potentially make better use of the apps that are

0:41:44.840 --> 0:41:47.719
<v Speaker 1>on my phone than I do, and then I get

0:41:47.760 --> 0:41:51.480
<v Speaker 1>to benefit from that. But Apple cut off a lot

0:41:51.480 --> 0:41:56.840
<v Speaker 1>of that functionality when it bought Siri and limited Series capabilities,

0:41:57.200 --> 0:42:00.480
<v Speaker 1>and in a way that's understandable. With Series being an

0:42:00.480 --> 0:42:04.400
<v Speaker 1>official Apple product, it would be tricky to implement the

0:42:04.480 --> 0:42:09.000
<v Speaker 1>service with some apps and not others. So for example,

0:42:09.040 --> 0:42:13.600
<v Speaker 1>if you integrated Siri with one rideshare service but it

0:42:13.680 --> 0:42:17.960
<v Speaker 1>didn't integrate with another one, would that open up Apple

0:42:18.080 --> 0:42:21.400
<v Speaker 1>to accusations that it was playing favorites and thus creating

0:42:21.400 --> 0:42:26.040
<v Speaker 1>an unfair advantage for one company versus another company. It

0:42:26.080 --> 0:42:31.520
<v Speaker 1>gets tricky. Sirie has also attracted some unwanted attention from

0:42:31.600 --> 0:42:35.840
<v Speaker 1>companies that accused Apple of infringing on intellectual property. In

0:42:35.880 --> 0:42:39.360
<v Speaker 1>ten Apple would settle a lawsuit from rensel Or Polytechnic

0:42:39.400 --> 0:42:43.480
<v Speaker 1>Institute r p I in New York that claimed that

0:42:43.560 --> 0:42:46.080
<v Speaker 1>Apple had infringed upon a patent that r p I

0:42:46.200 --> 0:42:51.200
<v Speaker 1>had licensed to a Dallas company called Dynamic Advances back

0:42:51.239 --> 0:42:53.960
<v Speaker 1>in two thousand seven, so if you recall that's the

0:42:53.960 --> 0:42:57.640
<v Speaker 1>same year that SIRIE the company was founded. The settlement

0:42:57.719 --> 0:43:01.200
<v Speaker 1>was for about twenty five million dollars. Is The specific

0:43:01.239 --> 0:43:05.520
<v Speaker 1>allegation had to do with natural language interface, and just

0:43:05.640 --> 0:43:08.840
<v Speaker 1>this year in twenty Apple was hit with another lawsuit.

0:43:08.960 --> 0:43:12.319
<v Speaker 1>This one is a true doozy. The lawsuit comes in

0:43:12.400 --> 0:43:16.320
<v Speaker 1>at a one point four three billion with a B dollars.

0:43:16.920 --> 0:43:20.200
<v Speaker 1>The plaintiff in this case is a Chinese AI company

0:43:20.239 --> 0:43:26.040
<v Speaker 1>called the Shanghai Shijin Intelligent Network Technology or also known

0:43:26.160 --> 0:43:30.240
<v Speaker 1>as shao I Robot, and the claim is that Siri

0:43:30.400 --> 0:43:34.440
<v Speaker 1>infringes upon a patented technology that the company filed for

0:43:34.680 --> 0:43:38.080
<v Speaker 1>way back in two thousand four, and then received a

0:43:38.160 --> 0:43:41.479
<v Speaker 1>patent for that filing five years later in two thousand nine.

0:43:41.600 --> 0:43:44.760
<v Speaker 1>So for those keeping track, the project that would ultimately

0:43:44.880 --> 0:43:48.520
<v Speaker 1>spawn Siri started in two thousand three, but Syria itself

0:43:48.560 --> 0:43:51.160
<v Speaker 1>wasn't a thing until two thousand seven. So an Apple

0:43:51.239 --> 0:43:55.760
<v Speaker 1>spokesperson downplayed this accusation, saying quote, SIRIE does not contain

0:43:55.880 --> 0:43:59.440
<v Speaker 1>features included in their patent, which relates to games and

0:43:59.560 --> 0:44:03.600
<v Speaker 1>instant messaging and we are disappointed. Shall I Robot has

0:44:03.640 --> 0:44:08.640
<v Speaker 1>filed another lawsuit. Independent appraisers certified by the Supreme People's

0:44:08.680 --> 0:44:12.080
<v Speaker 1>Court have also concluded that Apple does not infringe shall

0:44:12.239 --> 0:44:16.520
<v Speaker 1>I Robots technology end quote. It turns out that this

0:44:16.680 --> 0:44:21.040
<v Speaker 1>same Chinese company sued Apple back in Apple filed a

0:44:21.120 --> 0:44:24.879
<v Speaker 1>motion to have the company's patent invalidated. I didn't even

0:44:24.880 --> 0:44:28.880
<v Speaker 1>know that was a thing. The Beijing High Court initially

0:44:28.920 --> 0:44:32.560
<v Speaker 1>agreed with Apple, but more recently the Supreme People's Court

0:44:32.600 --> 0:44:36.640
<v Speaker 1>in China has overturned that decision. And so now we're

0:44:36.719 --> 0:44:38.879
<v Speaker 1>kind of back to where we were when the first

0:44:38.960 --> 0:44:41.359
<v Speaker 1>lawsuit was filed eight years ago, but now for one

0:44:41.360 --> 0:44:45.120
<v Speaker 1>point four billion dollars. Now, that story is still ongoing.

0:44:45.440 --> 0:44:47.720
<v Speaker 1>I don't have any other updates to give you guys

0:44:47.719 --> 0:44:50.839
<v Speaker 1>about it, but I will say that natural language processing

0:44:50.880 --> 0:44:53.040
<v Speaker 1>has been a field that a lot of people have

0:44:53.120 --> 0:44:56.360
<v Speaker 1>been working on independently over the years. Now, is it

0:44:56.440 --> 0:44:59.120
<v Speaker 1>possible that the serie team learned of work being done

0:44:59.160 --> 0:45:03.440
<v Speaker 1>in China and perhaps lifted some of it? Maybe? Is

0:45:03.480 --> 0:45:06.480
<v Speaker 1>it possible we have two groups that independently arrived at

0:45:06.480 --> 0:45:10.719
<v Speaker 1>a similar solution. No, that's also possible, But it's even

0:45:10.760 --> 0:45:14.160
<v Speaker 1>possible that the lawsuit itself is totally without grounds, as

0:45:14.200 --> 0:45:19.000
<v Speaker 1>Apple claims it is. I honestly just don't know. What

0:45:19.160 --> 0:45:22.320
<v Speaker 1>I do know is that voice assistants have in large

0:45:22.320 --> 0:45:25.760
<v Speaker 1>part failed to be the massive see change that people

0:45:25.800 --> 0:45:31.000
<v Speaker 1>expected when they first began to emerge around We've seen

0:45:31.040 --> 0:45:34.920
<v Speaker 1>improvements including the ability to follow a thread of questions

0:45:35.120 --> 0:45:38.680
<v Speaker 1>or commands about a single topic without having to restate

0:45:38.840 --> 0:45:41.640
<v Speaker 1>the topic each time. So, in other words, the assistance

0:45:41.719 --> 0:45:44.560
<v Speaker 1>now kind of have short term memory, so if you

0:45:44.640 --> 0:45:48.080
<v Speaker 1>ask a follow up question, they can figure out that

0:45:48.200 --> 0:45:50.200
<v Speaker 1>it's a follow up question to the thing you just

0:45:50.400 --> 0:45:53.160
<v Speaker 1>asked about a moment earlier. And there have been some

0:45:53.239 --> 0:45:57.320
<v Speaker 1>other controversies to like the fear that these assistants Sirie

0:45:57.400 --> 0:46:00.600
<v Speaker 1>among them, are listening to way more stuff than what

0:46:00.680 --> 0:46:04.399
<v Speaker 1>we're aware of, or that by analyzing our behaviors such

0:46:04.440 --> 0:46:08.399
<v Speaker 1>as when and how we use certain apps, they could

0:46:08.440 --> 0:46:11.839
<v Speaker 1>be building out enormous dossiers on who we are and

0:46:11.880 --> 0:46:14.880
<v Speaker 1>what we like, and information that can be really valuable

0:46:14.920 --> 0:46:17.880
<v Speaker 1>to third parties out there. I do like the thing

0:46:18.000 --> 0:46:21.319
<v Speaker 1>that the potential of voice assistance ties in with another

0:46:21.480 --> 0:46:24.480
<v Speaker 1>concept that I've covered in the past, that of the

0:46:24.520 --> 0:46:27.920
<v Speaker 1>semantic Web. The idea of the semantic web is that

0:46:27.960 --> 0:46:30.759
<v Speaker 1>you would have a version of the web, a new

0:46:30.880 --> 0:46:34.040
<v Speaker 1>version of the web, where you have kind of this

0:46:34.160 --> 0:46:37.640
<v Speaker 1>personal assistant built into the web itself, and then it

0:46:37.719 --> 0:46:41.680
<v Speaker 1>learns from your behaviors and can anticipate what you need

0:46:41.760 --> 0:46:45.200
<v Speaker 1>when you ask questions and get you the most relevant results.

0:46:45.239 --> 0:46:48.400
<v Speaker 1>So the more you use it, the more it learns

0:46:48.719 --> 0:46:51.719
<v Speaker 1>who you are and how you operate, and thus can

0:46:51.880 --> 0:46:56.239
<v Speaker 1>conform better to your preferences and make it a more

0:46:56.280 --> 0:47:00.239
<v Speaker 1>seamless experience to use the web. Voice assistants follow a

0:47:00.400 --> 0:47:04.480
<v Speaker 1>very similar philosophy. Uh, you can't do it without it,

0:47:04.560 --> 0:47:07.360
<v Speaker 1>also feeling a little creepy, I just don't think it's possible.

0:47:07.880 --> 0:47:10.719
<v Speaker 1>I also don't think that anyone has made um the

0:47:10.840 --> 0:47:13.960
<v Speaker 1>killer app version of it yet. Sirie, I think is

0:47:13.960 --> 0:47:17.879
<v Speaker 1>still one that has really high name recognition. And I'm

0:47:17.920 --> 0:47:20.280
<v Speaker 1>curious how many of you out there who have iPhones

0:47:20.320 --> 0:47:24.400
<v Speaker 1>how often you use it? If ever, my wife rarely

0:47:24.560 --> 0:47:27.040
<v Speaker 1>uses it. She'll use it if she's driving in the car,

0:47:27.120 --> 0:47:30.640
<v Speaker 1>for example, so it's hands free, but other than that,

0:47:30.800 --> 0:47:33.759
<v Speaker 1>I don't think she uses it that often. And to

0:47:33.840 --> 0:47:37.560
<v Speaker 1>be fair, I have an Android phone. I've got Google Assistant.

0:47:37.719 --> 0:47:39.960
<v Speaker 1>I don't use that on my phone either. I will

0:47:40.000 --> 0:47:43.080
<v Speaker 1>occasionally use the smart speaker we have and use Google

0:47:43.120 --> 0:47:47.399
<v Speaker 1>Assistant on that, but typically it's just to turn on

0:47:47.520 --> 0:47:50.920
<v Speaker 1>or off the lights or maybe play some music or something.

0:47:51.000 --> 0:47:53.080
<v Speaker 1>I don't really use it to its full potential, so

0:47:53.160 --> 0:47:54.920
<v Speaker 1>I also ask what the weather is going to be.

0:47:55.080 --> 0:47:57.640
<v Speaker 1>That's the other thing other than that kind of goes

0:47:57.680 --> 0:48:01.799
<v Speaker 1>to waste. So this one as an example of technology

0:48:01.840 --> 0:48:05.400
<v Speaker 1>that has and still to this day, has incredible potential,

0:48:05.960 --> 0:48:08.359
<v Speaker 1>but I don't think it's living up to it yet,

0:48:08.600 --> 0:48:12.280
<v Speaker 1>and that is in part the fault of the technology,

0:48:12.600 --> 0:48:15.200
<v Speaker 1>and in part it's just the fault of our behaviors,

0:48:15.239 --> 0:48:18.480
<v Speaker 1>people like me who have access to stuff like this

0:48:18.920 --> 0:48:21.920
<v Speaker 1>and yet wasted by asking if I need to wear

0:48:21.960 --> 0:48:24.600
<v Speaker 1>a raincoat the next day. But I'm curious to hear

0:48:24.640 --> 0:48:26.200
<v Speaker 1>what you guys think. You can reach out to me.

0:48:26.280 --> 0:48:28.160
<v Speaker 1>The best way to do it actually is on Twitter.

0:48:28.600 --> 0:48:31.719
<v Speaker 1>The handle for the show is text Stuff hs W.

0:48:31.960 --> 0:48:34.040
<v Speaker 1>I look forward to hearing from you, and I'll tell

0:48:34.080 --> 0:48:42.560
<v Speaker 1>to you again really soon. Text Stuff is an I

0:48:42.640 --> 0:48:46.160
<v Speaker 1>Heart Radio production. For more podcasts from my Heart Radio,

0:48:46.480 --> 0:48:49.680
<v Speaker 1>visit the I Heart Radio app, Apple Podcasts, or wherever

0:48:49.760 --> 0:48:51.280
<v Speaker 1>you listen to your favorite shows