WEBVTT - Nest Audio smart speaker uses data to make audio sound its best

0:00:00.320 --> 0:00:02.759
<v Speaker 1>Rich Demiro here, Rich On Tech here with Mark Spates,

0:00:02.759 --> 0:00:05.680
<v Speaker 1>product manager for Google Next. Mark, thanks so much for

0:00:05.680 --> 0:00:06.560
<v Speaker 1>being on the podcast.

0:00:07.320 --> 0:00:08.520
<v Speaker 2>Oh Rich, happy to be here.

0:00:08.680 --> 0:00:12.560
<v Speaker 1>All right, let's talk about your brand new product. You've

0:00:12.600 --> 0:00:16.239
<v Speaker 1>got the Nest Audio smart speaker. This is a ninety

0:00:16.320 --> 0:00:19.400
<v Speaker 1>nine dollars smart speaker that kind of replaces a speaker

0:00:19.400 --> 0:00:22.680
<v Speaker 1>that came out many years ago, the original Google Home.

0:00:23.160 --> 0:00:25.920
<v Speaker 1>So tell me about this new speaker. I've got one

0:00:26.000 --> 0:00:27.720
<v Speaker 1>right here, and it has a whole new look and

0:00:27.720 --> 0:00:28.240
<v Speaker 1>feel to it.

0:00:28.400 --> 0:00:32.400
<v Speaker 2>Yeah, Rich, I think the first visual that you saw

0:00:32.640 --> 0:00:35.519
<v Speaker 2>is part of the advancements that we made. And so

0:00:35.560 --> 0:00:38.479
<v Speaker 2>when we launched the original Google Home, I think what

0:00:38.720 --> 0:00:42.080
<v Speaker 2>was really interesting where people were surprised by its sound

0:00:42.159 --> 0:00:44.879
<v Speaker 2>and they actually appreciated that for the value in the

0:00:44.960 --> 0:00:47.320
<v Speaker 2>sound that they got. And so when we thought about

0:00:47.600 --> 0:00:50.360
<v Speaker 2>the next generation of it, Nest Audio, there are a

0:00:50.360 --> 0:00:54.120
<v Speaker 2>few things that we want to prove. On one, the

0:00:54.240 --> 0:00:57.680
<v Speaker 2>sound quality itself, and for us, what we wanted to

0:00:57.680 --> 0:01:01.600
<v Speaker 2>make sure is that there's a theoristic that users felt

0:01:01.640 --> 0:01:03.880
<v Speaker 2>when they had this sound with Nest Audio. It should

0:01:03.880 --> 0:01:06.560
<v Speaker 2>be full balanced with the lows and the mids and

0:01:06.600 --> 0:01:09.440
<v Speaker 2>the highs. It should be very clear as well, that

0:01:09.480 --> 0:01:12.720
<v Speaker 2>the vocals really come through and there's no distortion and

0:01:12.760 --> 0:01:16.120
<v Speaker 2>it's natural. Really, what the users should feel is what

0:01:16.160 --> 0:01:19.160
<v Speaker 2>the artists created and what they wanted to come through.

0:01:19.600 --> 0:01:21.480
<v Speaker 2>And there's a few ways that we achieved this rich

0:01:21.880 --> 0:01:25.200
<v Speaker 2>One is the Nest Audio actually has a seventy five

0:01:25.240 --> 0:01:28.920
<v Speaker 2>milimeter Wolfer and a nineteen milimeter Twitter and this is

0:01:29.319 --> 0:01:34.319
<v Speaker 2>really important because that allows us to really get better

0:01:34.360 --> 0:01:38.160
<v Speaker 2>base and allows us to even be awesome with the

0:01:38.240 --> 0:01:43.199
<v Speaker 2>vocals at extremely high values. But I think the key part,

0:01:43.520 --> 0:01:49.520
<v Speaker 2>and I think the natural, like how Google solves problems,

0:01:49.880 --> 0:01:52.880
<v Speaker 2>was we went about the tuning in a very software

0:01:53.080 --> 0:01:56.520
<v Speaker 2>driven way, a very data driven way. And I'll tell

0:01:56.520 --> 0:02:00.200
<v Speaker 2>you what that means. First, the team actually creates it

0:02:00.480 --> 0:02:04.240
<v Speaker 2>with nes Mini, our own proprietary tuning software, and what

0:02:04.280 --> 0:02:07.160
<v Speaker 2>that allows us to do is we can simulate over

0:02:07.280 --> 0:02:11.240
<v Speaker 2>two thousand and five hundred possible listening positions for the device.

0:02:11.760 --> 0:02:14.640
<v Speaker 2>And why is that important. It's because people have very

0:02:14.680 --> 0:02:18.040
<v Speaker 2>different homes. They put these devices everywhere, and we want

0:02:18.080 --> 0:02:20.359
<v Speaker 2>to make sure that it sounds great no matter where

0:02:20.400 --> 0:02:24.359
<v Speaker 2>you put it. And the features that really come out

0:02:24.440 --> 0:02:28.959
<v Speaker 2>of this data driven approach are three things. One, it's

0:02:29.000 --> 0:02:32.400
<v Speaker 2>called base extension. What we're able to do is extend

0:02:32.480 --> 0:02:35.280
<v Speaker 2>the base even when the device is at lower values,

0:02:35.680 --> 0:02:38.000
<v Speaker 2>and so if you're listening to it on valume two

0:02:38.080 --> 0:02:40.480
<v Speaker 2>or three, you still don't lose that emotional feeling of

0:02:40.520 --> 0:02:43.480
<v Speaker 2>the base coming through. The next one is ambient IQ.

0:02:44.320 --> 0:02:47.919
<v Speaker 2>What that allows us to do is leverage the background noise.

0:02:48.040 --> 0:02:51.800
<v Speaker 2>Let's say the faucet is running or the dryer is running.

0:02:51.960 --> 0:02:54.960
<v Speaker 2>We can make sure that the assistant voice actually adjusts

0:02:55.080 --> 0:02:57.000
<v Speaker 2>so that you can hear it no matter where the

0:02:57.040 --> 0:02:59.720
<v Speaker 2>device is in the room. And last, and I actually

0:02:59.720 --> 0:03:04.359
<v Speaker 2>thinks important for Nest Audio is media EQ. What we're

0:03:04.400 --> 0:03:07.600
<v Speaker 2>able to do is actually a just the tuning of

0:03:07.760 --> 0:03:10.200
<v Speaker 2>nests Audio to the music you're listening to, be it

0:03:10.480 --> 0:03:15.640
<v Speaker 2>hip hop or jazz or classical or the podcast as well.

0:03:15.919 --> 0:03:18.960
<v Speaker 2>And so what you're really seeing is is Google really

0:03:19.040 --> 0:03:22.640
<v Speaker 2>starting to develop a really clear way that we believe

0:03:22.720 --> 0:03:25.320
<v Speaker 2>things should sound, and then the technology behind it to

0:03:25.360 --> 0:03:26.120
<v Speaker 2>actually achieve it.

0:03:26.480 --> 0:03:29.440
<v Speaker 1>Now, because you have so many speakers in the lineup

0:03:29.480 --> 0:03:32.080
<v Speaker 1>at this point, everything from the Mini to the Max

0:03:32.160 --> 0:03:36.160
<v Speaker 1>to the different displays, and this of course new Nest Audio,

0:03:37.120 --> 0:03:39.720
<v Speaker 1>there's almost a reason to have one in every room.

0:03:39.720 --> 0:03:42.880
<v Speaker 1>And you can at these price points. So tell me

0:03:42.920 --> 0:03:46.240
<v Speaker 1>about the idea of building a whole house system. You've

0:03:46.240 --> 0:03:50.520
<v Speaker 1>got this stream transfer feature and also the stereo pair feature,

0:03:50.760 --> 0:03:53.880
<v Speaker 1>so explain how those sort of work in the realm

0:03:53.920 --> 0:03:54.440
<v Speaker 1>of a house.

0:03:54.840 --> 0:03:56.960
<v Speaker 2>Oh, this is this is my favorite part. You know,

0:03:57.200 --> 0:03:59.480
<v Speaker 2>if I think about the things that I've been using

0:03:59.680 --> 0:04:03.080
<v Speaker 2>the most since we've all been working from home, these

0:04:03.080 --> 0:04:07.360
<v Speaker 2>are the features. And so the thing about creating that

0:04:07.440 --> 0:04:10.080
<v Speaker 2>whole home audio experience is that first we want to

0:04:10.080 --> 0:04:12.720
<v Speaker 2>make sure all our devices work together. And they all

0:04:12.880 --> 0:04:16.880
<v Speaker 2>are using Cast, which is awesome, and so it can

0:04:16.920 --> 0:04:19.839
<v Speaker 2>work with your hub, your hub Max, I can even

0:04:19.880 --> 0:04:22.080
<v Speaker 2>work with your Crome cast devices that are connected to

0:04:22.120 --> 0:04:24.880
<v Speaker 2>your TV right, So it really gives a pretty broad

0:04:24.960 --> 0:04:28.520
<v Speaker 2>range of things that work together. But the real outcome

0:04:28.680 --> 0:04:32.400
<v Speaker 2>is fill your home with music. And first it's about grouping,

0:04:32.800 --> 0:04:34.400
<v Speaker 2>and so you can set up a group and I

0:04:34.520 --> 0:04:37.560
<v Speaker 2>do this all the time. I literally have a group

0:04:37.600 --> 0:04:41.440
<v Speaker 2>called home and it's like Yo playing music on home.

0:04:41.839 --> 0:04:43.560
<v Speaker 2>But then I think there are other features that are

0:04:43.560 --> 0:04:46.479
<v Speaker 2>about the context of what you're doing in that day.

0:04:46.680 --> 0:04:48.360
<v Speaker 2>And so let's just say it's a Saturday and you're

0:04:48.360 --> 0:04:52.599
<v Speaker 2>cleaning a house. Something like stream transfer is perfect, right,

0:04:52.920 --> 0:04:54.560
<v Speaker 2>because what you can do is you're cleaning up in

0:04:54.560 --> 0:04:56.760
<v Speaker 2>the kitchen as I do, then I actually start to

0:04:56.800 --> 0:04:59.680
<v Speaker 2>move to the bathroom and I can say, hey, move

0:04:59.760 --> 0:05:02.560
<v Speaker 2>the move the music to the bathroom. And then you

0:05:02.560 --> 0:05:04.280
<v Speaker 2>get down clean up the bathroom. You got to go,

0:05:04.480 --> 0:05:06.040
<v Speaker 2>you know, finish up, clean up the living room, and

0:05:06.040 --> 0:05:08.039
<v Speaker 2>say hey, move the music to the living room. And

0:05:08.080 --> 0:05:08.919
<v Speaker 2>that allows the.

0:05:08.960 --> 0:05:11.600
<v Speaker 1>Music to say you just say those that term to it.

0:05:11.640 --> 0:05:13.880
<v Speaker 1>And you sort of have to know what your speakers

0:05:13.880 --> 0:05:15.440
<v Speaker 1>are named. I'm assuming right.

0:05:16.240 --> 0:05:17.800
<v Speaker 2>Yeah, if you only have one speaker in our room,

0:05:17.800 --> 0:05:19.840
<v Speaker 2>you can you can just use the room. But it

0:05:19.839 --> 0:05:22.960
<v Speaker 2>also works if you say, hey, GI, move the room,

0:05:23.040 --> 0:05:25.320
<v Speaker 2>move the music to the living room speaker, which is

0:05:25.320 --> 0:05:28.559
<v Speaker 2>the name of the speaker. And so uh, this this, this,

0:05:28.560 --> 0:05:31.200
<v Speaker 2>this easiness of being able to just use it with

0:05:31.279 --> 0:05:34.240
<v Speaker 2>your voice is awesome. And if you have a Hub

0:05:34.320 --> 0:05:37.440
<v Speaker 2>Max or you have a hub, you can also just

0:05:37.839 --> 0:05:40.560
<v Speaker 2>do it with with touch you can go to the interface.

0:05:40.600 --> 0:05:42.880
<v Speaker 2>You actually can do the same thing using the interface

0:05:43.360 --> 0:05:45.680
<v Speaker 2>on a Hub or a Hub Max. Uh. The next

0:05:45.680 --> 0:05:49.479
<v Speaker 2>feature which we just recently introduced, which is multi room expansion.

0:05:49.960 --> 0:05:52.000
<v Speaker 2>This is the thought that sometimes you want to create

0:05:52.000 --> 0:05:55.839
<v Speaker 2>a group dynamically on the on the fly that you

0:05:55.880 --> 0:05:59.279
<v Speaker 2>haven't created before. And so the core things for this is,

0:05:59.320 --> 0:06:01.880
<v Speaker 2>like you know, all of us sometimes we get into it,

0:06:01.880 --> 0:06:04.320
<v Speaker 2>we get into the point where we're working out and

0:06:04.360 --> 0:06:07.320
<v Speaker 2>so like this is what I do. I'm like, Okay,

0:06:07.440 --> 0:06:09.360
<v Speaker 2>I want the sound to come through a little bit more,

0:06:09.480 --> 0:06:12.200
<v Speaker 2>and so let me put it on my Nest hub

0:06:12.200 --> 0:06:14.960
<v Speaker 2>in the kitchen, let me put it on the audio

0:06:15.160 --> 0:06:17.080
<v Speaker 2>and the bedroom, and also let me put it on

0:06:17.320 --> 0:06:20.960
<v Speaker 2>my stereo pair that's out in the living room, and automatically,

0:06:21.000 --> 0:06:22.880
<v Speaker 2>in real time, I can create that group by just

0:06:22.880 --> 0:06:26.240
<v Speaker 2>selecting those devices. And so that's really awesome. And then

0:06:26.279 --> 0:06:28.920
<v Speaker 2>the last one that you mentioned, Rich is stereo pair.

0:06:29.320 --> 0:06:33.120
<v Speaker 2>And what's cool about Nests Audio from the ground up,

0:06:33.279 --> 0:06:35.800
<v Speaker 2>it was designed for stereo pair. You put two of

0:06:35.800 --> 0:06:38.960
<v Speaker 2>these devices together, man, they look beautiful, and it's because

0:06:39.000 --> 0:06:41.159
<v Speaker 2>we thought that people would actually set them up for

0:06:41.200 --> 0:06:43.560
<v Speaker 2>stereo pairs. And what this allows you to do is

0:06:43.560 --> 0:06:47.000
<v Speaker 2>give stereo sound out of both devices. And it's really cool.

0:06:47.080 --> 0:06:49.440
<v Speaker 2>Especially when you were listening to a song that has

0:06:49.720 --> 0:06:52.320
<v Speaker 2>like vary ways of how the audio comes in or

0:06:52.360 --> 0:06:54.880
<v Speaker 2>the sound comes in, you can hear the device is

0:06:54.920 --> 0:06:58.560
<v Speaker 2>actually working together to add this extra level of dynamic

0:06:58.880 --> 0:07:00.280
<v Speaker 2>to the actual sound experience.

0:07:01.040 --> 0:07:04.000
<v Speaker 1>So what are people doing on these smart speakers? I mean,

0:07:04.080 --> 0:07:07.200
<v Speaker 1>music is clearly probably one of the top things, I

0:07:07.200 --> 0:07:09.600
<v Speaker 1>mean questions whether what are some of the things that

0:07:09.640 --> 0:07:11.560
<v Speaker 1>your research tells you that people are using these for.

0:07:12.920 --> 0:07:18.160
<v Speaker 2>Well, you know what's cool is things like routines are

0:07:18.160 --> 0:07:22.720
<v Speaker 2>becoming extremely important, right because now our routine is all

0:07:22.760 --> 0:07:25.720
<v Speaker 2>in the same place, and you have to figure out

0:07:25.760 --> 0:07:28.720
<v Speaker 2>how to either make those easier or break them up.

0:07:28.880 --> 0:07:30.760
<v Speaker 2>And so I'll give you a good example what I do.

0:07:31.000 --> 0:07:33.280
<v Speaker 2>I have a routine that starts every day around five

0:07:33.280 --> 0:07:35.840
<v Speaker 2>point thirty and all it does is it cuts on

0:07:35.880 --> 0:07:38.600
<v Speaker 2>a few lights and it starts playing music. Rich. It's

0:07:38.600 --> 0:07:40.560
<v Speaker 2>a really simple thing to remind me to get up

0:07:40.560 --> 0:07:43.600
<v Speaker 2>and stop working. Right. It's like cause you can easily

0:07:43.640 --> 0:07:46.400
<v Speaker 2>keep rolling if you don't have something that breaks it up, right,

0:07:46.480 --> 0:07:48.880
<v Speaker 2>And so that's one thing that we're seeing a lot.

0:07:49.040 --> 0:07:51.680
<v Speaker 2>And also like for children, Hey, there's like, hey, this

0:07:51.760 --> 0:07:54.200
<v Speaker 2>is the worktime routine. This is like, okay, this is

0:07:54.200 --> 0:07:57.040
<v Speaker 2>your playtime routine. And so I think the Google Assistant

0:07:57.200 --> 0:07:59.400
<v Speaker 2>and our smart speakers are being super help on the

0:07:59.440 --> 0:08:01.040
<v Speaker 2>home around those areas.

0:08:01.480 --> 0:08:04.400
<v Speaker 1>Tell me about launching a product during a pandemic. I mean,

0:08:04.600 --> 0:08:09.920
<v Speaker 1>planning for this probably began before COVID. And so when

0:08:09.920 --> 0:08:12.000
<v Speaker 1>you realize that you have to continue this through this,

0:08:12.400 --> 0:08:13.080
<v Speaker 1>how do you do it?

0:08:14.280 --> 0:08:17.440
<v Speaker 2>Well? You take a deep breath, right and then and

0:08:17.440 --> 0:08:19.640
<v Speaker 2>then you get back to work. And I think what

0:08:19.680 --> 0:08:23.360
<v Speaker 2>the team did was was very valid. Like So what

0:08:23.360 --> 0:08:26.680
<v Speaker 2>it was amazing to me is I thought about, Hey,

0:08:26.680 --> 0:08:29.040
<v Speaker 2>there's things that in the testing where we have like

0:08:30.120 --> 0:08:32.760
<v Speaker 2>echo chambers, and we have lots of things where we

0:08:32.880 --> 0:08:36.960
<v Speaker 2>tested fabric and all these labs were actually on campus.

0:08:37.559 --> 0:08:41.280
<v Speaker 2>Rich people took trucks and rented trucks and actually got

0:08:41.320 --> 0:08:44.080
<v Speaker 2>the equipment from the campus and they put it in

0:08:44.120 --> 0:08:47.560
<v Speaker 2>places like their garage or their extra their extra room.

0:08:47.920 --> 0:08:49.839
<v Speaker 2>And so I would say in a couple of weeks

0:08:49.840 --> 0:08:52.480
<v Speaker 2>we were back up and running. When you think about

0:08:52.520 --> 0:08:55.839
<v Speaker 2>things like tuning, I actually think because of work for

0:08:55.960 --> 0:08:58.760
<v Speaker 2>home ness, audio sounds better. And the reason why is

0:08:59.040 --> 0:09:02.680
<v Speaker 2>our engineers we're spending so much time with the devices

0:09:02.679 --> 0:09:05.600
<v Speaker 2>at home and they've got to experience the sound and

0:09:05.760 --> 0:09:09.120
<v Speaker 2>real spaces on a constant basis. And I actually think

0:09:09.440 --> 0:09:13.160
<v Speaker 2>things like the tuning and things like making the assistant

0:09:13.200 --> 0:09:16.400
<v Speaker 2>faster got way better because we were living with these

0:09:16.440 --> 0:09:18.760
<v Speaker 2>devices on a day in and day out basis.

0:09:19.080 --> 0:09:21.640
<v Speaker 1>All right, now we're talking smart speakers, and some of

0:09:21.640 --> 0:09:24.480
<v Speaker 1>the feedback I get is privacy. People are worried about

0:09:24.559 --> 0:09:28.040
<v Speaker 1>something or someone or Google listening to them, tell me

0:09:28.080 --> 0:09:31.360
<v Speaker 1>about privacy, and as that front and center on this device.

0:09:31.520 --> 0:09:35.040
<v Speaker 2>Privacy is core to the experience that we want to

0:09:35.080 --> 0:09:37.800
<v Speaker 2>deliver it to users in. It stems from our privacy

0:09:37.800 --> 0:09:40.920
<v Speaker 2>commitments that we made a few years ago around how

0:09:40.960 --> 0:09:43.680
<v Speaker 2>our devices work and how they're going to expect users privacy.

0:09:44.160 --> 0:09:47.559
<v Speaker 2>And similar to our other devices, Nets Audio has a

0:09:47.640 --> 0:09:49.760
<v Speaker 2>mute switch on the back, so you can easily turn

0:09:49.800 --> 0:09:52.400
<v Speaker 2>it off if you want, and you can easily turn

0:09:52.440 --> 0:09:54.360
<v Speaker 2>it back on when you need to. And then we

0:09:54.400 --> 0:09:56.679
<v Speaker 2>also have the assistant privacy features where you can tell

0:09:56.679 --> 0:09:59.320
<v Speaker 2>the system to forget the things that you said, or

0:09:59.360 --> 0:10:01.240
<v Speaker 2>you can also go to your my activity and you

0:10:01.240 --> 0:10:03.959
<v Speaker 2>can delete those items as well, And so we have

0:10:04.080 --> 0:10:07.319
<v Speaker 2>multiple ways to make sure that we're respecting users' privacy.

0:10:07.440 --> 0:10:10.280
<v Speaker 2>It is a core fundamental value that we want to

0:10:10.280 --> 0:10:11.840
<v Speaker 2>make sure that we deliver to users.

0:10:12.040 --> 0:10:13.839
<v Speaker 1>All right, before we go, can you give me any

0:10:13.920 --> 0:10:16.439
<v Speaker 1>tips or tricks, any commands to try that I haven't

0:10:16.440 --> 0:10:16.840
<v Speaker 1>thought of.

0:10:17.040 --> 0:10:19.960
<v Speaker 2>Oh. I think one of the coolest things is actually

0:10:20.360 --> 0:10:24.160
<v Speaker 2>starting to use your phone as a way to start

0:10:24.200 --> 0:10:26.800
<v Speaker 2>to interact with the device. So casting some of your

0:10:26.800 --> 0:10:30.000
<v Speaker 2>favorite apps to the device I think is really cool.

0:10:30.320 --> 0:10:33.120
<v Speaker 2>I think actually going to the Google Home app and

0:10:33.160 --> 0:10:35.440
<v Speaker 2>starting to use the interface to control how you're moving

0:10:35.440 --> 0:10:37.840
<v Speaker 2>your music is really awesome. The last thing that I

0:10:37.840 --> 0:10:41.720
<v Speaker 2>will say from an assistance standpoint that it's kind of

0:10:41.720 --> 0:10:44.720
<v Speaker 2>exciting to me is now we're just playing a little

0:10:44.720 --> 0:10:47.840
<v Speaker 2>game of trivia at home where we use the assistant

0:10:47.840 --> 0:10:50.839
<v Speaker 2>as kind of like the game show hosts. And it's

0:10:50.840 --> 0:10:53.160
<v Speaker 2>pretty fun, right, And so we'll sit there and right

0:10:53.160 --> 0:10:55.520
<v Speaker 2>now we're doing it because the NBA Finals have started,

0:10:55.800 --> 0:10:59.200
<v Speaker 2>Like it's all about the status, and so we'll sit

0:10:59.240 --> 0:11:02.360
<v Speaker 2>there and well, you know, have a fun time going

0:11:02.400 --> 0:11:06.080
<v Speaker 2>back and forth about like, hey, what set player percentage

0:11:06.120 --> 0:11:10.360
<v Speaker 2>for shooting? What set player? Hey what set player this

0:11:10.400 --> 0:11:13.760
<v Speaker 2>and that, And it's a really cool, fun way to

0:11:13.840 --> 0:11:16.360
<v Speaker 2>expand the way that you're interacting and change the way

0:11:16.360 --> 0:11:18.560
<v Speaker 2>that you're interacting with the people in your home. But

0:11:18.640 --> 0:11:20.439
<v Speaker 2>it's also using the smart speaker, and I think in

0:11:20.520 --> 0:11:21.479
<v Speaker 2>a really unique.

0:11:21.200 --> 0:11:25.160
<v Speaker 1>Way awesome mark Space, product manager for Google Nest. Thanks

0:11:25.200 --> 0:11:27.719
<v Speaker 1>so much for joining me today. I do appreciate it.

0:11:27.760 --> 0:11:31.959
<v Speaker 1>Talking about the new Google Nest Audio smart speaker. My

0:11:32.040 --> 0:11:34.440
<v Speaker 1>name is rich dmiro Rich on tech dot TV. Thanks

0:11:34.480 --> 0:11:34.960
<v Speaker 1>for listening.