WEBVTT - Audiophiles and the Perfect Sound

0:00:04.400 --> 0:00:07.800
<v Speaker 1>Welcome to tech Stuff, a production from I Heart Radio.

0:00:12.119 --> 0:00:14.800
<v Speaker 1>Hey there, and welcome to tech Stuff. I'm your host,

0:00:14.880 --> 0:00:18.079
<v Speaker 1>Jonathan Strickland. I'm an executive producer with I Heart Radio.

0:00:18.120 --> 0:00:21.480
<v Speaker 1>And how the tech are you? You know? Recently I

0:00:21.520 --> 0:00:24.560
<v Speaker 1>did an episode on the rise and fall of san Sui,

0:00:24.800 --> 0:00:29.880
<v Speaker 1>a Japanese company best known for its audio receivers and amplifiers,

0:00:29.880 --> 0:00:32.839
<v Speaker 1>though it did make other things, and San sue had

0:00:32.880 --> 0:00:35.760
<v Speaker 1>its heyday in the late sixties and into the seventies

0:00:36.080 --> 0:00:39.600
<v Speaker 1>and became a sought after brand associated with hi fi

0:00:39.720 --> 0:00:42.960
<v Speaker 1>or high fidelity. So today I thought we should talk

0:00:43.000 --> 0:00:45.200
<v Speaker 1>a little bit more about hi fi and the tech

0:00:45.280 --> 0:00:50.560
<v Speaker 1>that feeds into an obsession. So first, what is that obsession? Well,

0:00:50.600 --> 0:00:56.120
<v Speaker 1>generally speaking, audio files seek out a listening experience that

0:00:56.240 --> 0:01:01.040
<v Speaker 1>gets them as close to the originally produced sound as possible.

0:01:01.600 --> 0:01:05.240
<v Speaker 1>So in an ideal world, it would sound as you

0:01:05.319 --> 0:01:08.880
<v Speaker 1>listen to playback, it would sound as though you were

0:01:08.920 --> 0:01:12.440
<v Speaker 1>in the recording studio as a producer was putting together

0:01:12.840 --> 0:01:16.840
<v Speaker 1>a master tape you know, of a recording of a session,

0:01:17.080 --> 0:01:20.920
<v Speaker 1>or alternatively, that you were actually in the studio when

0:01:21.080 --> 0:01:25.120
<v Speaker 1>say a band was laying down the recording tracks, because

0:01:25.480 --> 0:01:27.840
<v Speaker 1>you know even the act of producing can emphasize some

0:01:27.880 --> 0:01:30.679
<v Speaker 1>stuff all de emphasizing other stuff. So you could make

0:01:30.680 --> 0:01:33.839
<v Speaker 1>a good argument that even the master recording could muck

0:01:33.920 --> 0:01:36.160
<v Speaker 1>things up all down the rest of the line. But

0:01:36.240 --> 0:01:39.960
<v Speaker 1>the idea is, how do you create a system. What

0:01:40.040 --> 0:01:44.360
<v Speaker 1>components do you need and in what combination and um

0:01:44.360 --> 0:01:47.400
<v Speaker 1>and at what quality in order to get the ideal

0:01:47.760 --> 0:01:53.080
<v Speaker 1>listening experience. And you want to be able to accurately

0:01:53.120 --> 0:01:56.440
<v Speaker 1>reproduce the full range of sound frequencies and harmonics that

0:01:56.520 --> 0:02:00.680
<v Speaker 1>were present in the original sound, and preferably you want

0:02:00.680 --> 0:02:04.160
<v Speaker 1>to be able to also boost that to higher levels

0:02:04.160 --> 0:02:07.760
<v Speaker 1>of volume without losing any of the precision. Right, So

0:02:07.800 --> 0:02:10.440
<v Speaker 1>the idea being that the louder you turn the system,

0:02:10.480 --> 0:02:13.760
<v Speaker 1>you don't lose it, Like things don't distort as you

0:02:13.800 --> 0:02:16.440
<v Speaker 1>turn up the volume, until you get to a point

0:02:16.480 --> 0:02:18.240
<v Speaker 1>where you know you don't want to go any higher

0:02:18.919 --> 0:02:23.120
<v Speaker 1>because you're gonna end up in a twisted sister music

0:02:23.200 --> 0:02:26.760
<v Speaker 1>video if you do so. This is tricky, not the

0:02:26.760 --> 0:02:30.080
<v Speaker 1>part about avoiding being in a twisted twisted sister music video.

0:02:30.400 --> 0:02:33.880
<v Speaker 1>I've successfully done that all my life. But no, creating

0:02:33.919 --> 0:02:35.679
<v Speaker 1>a system like this is tricky because there are a

0:02:35.760 --> 0:02:39.519
<v Speaker 1>lot of different components to a sound system, and one

0:02:39.600 --> 0:02:44.639
<v Speaker 1>bad component can mess up pretty much everything else. It's

0:02:44.639 --> 0:02:48.080
<v Speaker 1>not always true that the more expensive something is the

0:02:48.120 --> 0:02:52.680
<v Speaker 1>better it is, though that frequently can seem to be

0:02:52.720 --> 0:02:58.360
<v Speaker 1>the case in the stereo like the high fidelity stereo world. Uh. However,

0:02:58.400 --> 0:03:01.200
<v Speaker 1>it is fairly safe to bet that the cheapest stuff

0:03:01.240 --> 0:03:03.200
<v Speaker 1>on the market is not going to be the best.

0:03:03.760 --> 0:03:06.160
<v Speaker 1>You can be pretty sure that the cheapest stuff is

0:03:06.200 --> 0:03:08.920
<v Speaker 1>not going to be great. You cannot be sure that

0:03:08.960 --> 0:03:11.560
<v Speaker 1>the most expensive stuff is going to be the best

0:03:11.560 --> 0:03:14.800
<v Speaker 1>of the best. Now, when you boil down a sound system,

0:03:14.880 --> 0:03:17.560
<v Speaker 1>which I should add you should never do because most

0:03:17.560 --> 0:03:20.760
<v Speaker 1>of the time they are not waterproof, what you have

0:03:21.080 --> 0:03:25.440
<v Speaker 1>is a signal chain. So on one end, you've got

0:03:25.440 --> 0:03:29.440
<v Speaker 1>a component designed to playback some form of media or,

0:03:29.480 --> 0:03:31.359
<v Speaker 1>in the case of a radio, to pick up a

0:03:31.400 --> 0:03:36.800
<v Speaker 1>transmission of a broadcasted media, and the playback is in

0:03:36.840 --> 0:03:41.440
<v Speaker 1>the form of an electrical signal. And ultimately that electrical

0:03:41.480 --> 0:03:44.160
<v Speaker 1>signal has to make its way to some form of speakers,

0:03:44.200 --> 0:03:48.840
<v Speaker 1>whether they are like a standalone speaker system or headphones

0:03:48.960 --> 0:03:53.080
<v Speaker 1>or something along those lines. Now, the whole chain from

0:03:53.120 --> 0:03:55.680
<v Speaker 1>that component to the speakers, every part of that chain

0:03:55.760 --> 0:04:00.240
<v Speaker 1>is important, and just like physical chains, your experience tipically

0:04:00.360 --> 0:04:03.480
<v Speaker 1>is only as good as whatever the weakest link in

0:04:03.560 --> 0:04:06.920
<v Speaker 1>that chain is able to manage. Like, if there's one

0:04:07.000 --> 0:04:10.480
<v Speaker 1>component in that in that chain that is not up

0:04:10.520 --> 0:04:14.080
<v Speaker 1>to snuff, then it doesn't matter how good this everything

0:04:14.160 --> 0:04:16.160
<v Speaker 1>is on either side of it, you're not going to

0:04:16.279 --> 0:04:19.480
<v Speaker 1>get better on the other end than what's going through

0:04:19.520 --> 0:04:24.039
<v Speaker 1>that weak link. So this is an idea that an

0:04:24.200 --> 0:04:28.240
<v Speaker 1>entire industry is built upon. So let's consider some of

0:04:28.240 --> 0:04:31.640
<v Speaker 1>these components. And we'll start with the speakers, because that's

0:04:31.680 --> 0:04:35.000
<v Speaker 1>the part that plays the sounds that we hear. I

0:04:35.000 --> 0:04:36.960
<v Speaker 1>could have started from the other end, but I figured

0:04:37.040 --> 0:04:39.760
<v Speaker 1>let's go to the output. So a speaker, when you

0:04:39.800 --> 0:04:43.000
<v Speaker 1>really get down to it, consists of a diaphragm, at

0:04:43.040 --> 0:04:47.720
<v Speaker 1>least one diaphragm that vibrates according to magnetic signals, and

0:04:47.760 --> 0:04:52.400
<v Speaker 1>those magnetic signals correspond with that electrical signal that represents

0:04:52.400 --> 0:04:57.800
<v Speaker 1>an encoded recorded sound. So the electrical signal powers and

0:04:57.880 --> 0:05:01.719
<v Speaker 1>electro magnet which interacts with a permanent magnet in the speaker.

0:05:01.960 --> 0:05:05.760
<v Speaker 1>This creates the attractive and repulsive forces that caused the

0:05:05.880 --> 0:05:11.280
<v Speaker 1>diaphragm inside the speaker to vibrate, the vibrating diaphragm begins

0:05:11.320 --> 0:05:15.960
<v Speaker 1>to move air at specific frequencies and amplitudes, and this

0:05:16.040 --> 0:05:19.600
<v Speaker 1>generates the sounds we hear when those air fluctuations here

0:05:19.760 --> 0:05:23.839
<v Speaker 1>hit our ear drums. So human hearing spans a pretty

0:05:23.960 --> 0:05:28.400
<v Speaker 1>wide range of sound frequencies. At the low end is

0:05:28.440 --> 0:05:32.400
<v Speaker 1>twenty hurts. That's twenty full vibrations per second. So imagine

0:05:32.920 --> 0:05:36.120
<v Speaker 1>a very long string and you pluck the string and

0:05:36.320 --> 0:05:38.560
<v Speaker 1>you're able to slow down time and count how many

0:05:38.600 --> 0:05:41.720
<v Speaker 1>times it does a full vibration that is, from the

0:05:41.760 --> 0:05:43.800
<v Speaker 1>starting point all the way to the other side and

0:05:43.839 --> 0:05:46.800
<v Speaker 1>back to the starting point again twenty times a second.

0:05:46.839 --> 0:05:50.280
<v Speaker 1>That's the lowest end of pitches that humans typically can hear.

0:05:50.760 --> 0:05:53.440
<v Speaker 1>We have to use words like typical because not everyone

0:05:53.480 --> 0:05:55.400
<v Speaker 1>can hear sounds that low, and some may be able

0:05:55.400 --> 0:05:58.520
<v Speaker 1>to sense them a little bit lower. We can actually

0:05:58.520 --> 0:06:02.400
<v Speaker 1>sense lower vibration, but typically we are not hearing them.

0:06:02.400 --> 0:06:08.360
<v Speaker 1>We feel them, because again we're talking about fluctuations vibrations, uh,

0:06:08.400 --> 0:06:11.159
<v Speaker 1>and you can feel vibrations, particularly if they're at pretty

0:06:11.200 --> 0:06:15.000
<v Speaker 1>high amplitudes. So then on the opposite side of the range,

0:06:15.040 --> 0:06:19.080
<v Speaker 1>we have the highest pitches within human hearing typical human hearing,

0:06:19.760 --> 0:06:22.760
<v Speaker 1>and we usually say that this tops out around twenty

0:06:22.880 --> 0:06:26.480
<v Speaker 1>thousand hurts, also known as twenty killer hurts, so that

0:06:26.520 --> 0:06:31.159
<v Speaker 1>would be twenty thousand full vibrations per second. As we

0:06:31.240 --> 0:06:34.840
<v Speaker 1>get older, our ability to perceive the highest pitches diminishes

0:06:35.640 --> 0:06:38.039
<v Speaker 1>mine certainly has. I think I mentioned this in the

0:06:38.120 --> 0:06:41.440
<v Speaker 1>San Sui episode. But this means that if you want

0:06:41.440 --> 0:06:44.760
<v Speaker 1>to keep those punk kids away from your convenience store,

0:06:45.320 --> 0:06:47.880
<v Speaker 1>you can play a piercing high pitch sound over the

0:06:47.880 --> 0:06:50.960
<v Speaker 1>store's speakers and only the kids will be able to

0:06:51.000 --> 0:06:53.600
<v Speaker 1>hear it. All the adults will have lost that ability.

0:06:54.040 --> 0:06:59.240
<v Speaker 1>Fun with physics and ages, um, I guess anyway, twenty

0:06:59.360 --> 0:07:03.200
<v Speaker 1>hurts to twenty thousand hurts, that's a pretty big range

0:07:03.200 --> 0:07:08.920
<v Speaker 1>of frequencies, and that doesn't factor in harmonics. Harmonic is

0:07:08.960 --> 0:07:11.920
<v Speaker 1>a sound wave that has an integer multiple of a

0:07:12.000 --> 0:07:16.080
<v Speaker 1>fundamental tone. And you might think, what are you talking about?

0:07:16.120 --> 0:07:18.520
<v Speaker 1>All right, So this is easier if we talk about

0:07:18.640 --> 0:07:21.600
<v Speaker 1>musical notes. Let's say we've got a musical instrument like

0:07:21.680 --> 0:07:25.400
<v Speaker 1>a guitar, and let's say we pluck a string, and

0:07:25.560 --> 0:07:30.120
<v Speaker 1>the lowest frequency sound produced by that string vibrating is

0:07:30.160 --> 0:07:34.560
<v Speaker 1>what we call the fundamental tone frequency, and if we

0:07:34.560 --> 0:07:38.160
<v Speaker 1>were to multiply that frequency by two, we would get

0:07:38.200 --> 0:07:41.000
<v Speaker 1>what we call the second harmonic. If we multiplied it

0:07:41.040 --> 0:07:43.880
<v Speaker 1>by three, we get the third harmonic, and so on

0:07:43.960 --> 0:07:47.080
<v Speaker 1>and so forth, the fourth harmonic, the fifth harmonic, each

0:07:47.120 --> 0:07:52.960
<v Speaker 1>time multiplying by the next highest integer. So if you

0:07:53.000 --> 0:07:57.360
<v Speaker 1>pluck a guitar tuned to middle C the second harmonic,

0:07:57.440 --> 0:08:00.720
<v Speaker 1>you would multiply the frequency of middle C by two,

0:08:01.040 --> 0:08:03.440
<v Speaker 1>and you would end up with C again, but one

0:08:03.480 --> 0:08:07.680
<v Speaker 1>octave up or C ish, because it ends up being

0:08:07.960 --> 0:08:11.080
<v Speaker 1>a little more wibbly wobbly than that. But you're essentially

0:08:11.080 --> 0:08:15.320
<v Speaker 1>playing another C note, but it's an octave higher. The

0:08:15.320 --> 0:08:19.320
<v Speaker 1>third harmonic would be a G, the fourth harmonic would

0:08:19.320 --> 0:08:22.960
<v Speaker 1>be another even higher C. The fifth harmonic will be E,

0:08:23.160 --> 0:08:26.160
<v Speaker 1>and so on and so forth. Now, a musical instrument's

0:08:26.200 --> 0:08:30.360
<v Speaker 1>tone or tambre is largely shaped by the relative strength

0:08:30.360 --> 0:08:34.400
<v Speaker 1>of each harmonic to the fundamental frequency. This is why

0:08:34.440 --> 0:08:36.760
<v Speaker 1>we've got lots of different musical instruments that can produce

0:08:36.800 --> 0:08:40.320
<v Speaker 1>a C note, but they don't all sound exactly the same.

0:08:40.320 --> 0:08:44.199
<v Speaker 1>They're playing the same note, but they don't sound the same.

0:08:44.280 --> 0:08:46.959
<v Speaker 1>If you played the same note on a clarinet as

0:08:46.960 --> 0:08:49.200
<v Speaker 1>you did on a piano. Well, you would still be

0:08:49.200 --> 0:08:51.400
<v Speaker 1>able to tell which was which, right, you'd say, oh,

0:08:51.400 --> 0:08:53.480
<v Speaker 1>they're playing the same note. But this is a clarinet

0:08:53.520 --> 0:08:56.319
<v Speaker 1>and that's a piano, and that's largely due to harmonics.

0:08:56.400 --> 0:08:58.760
<v Speaker 1>Now there's other stuff too that plays into tamber, but

0:08:59.760 --> 0:09:01.320
<v Speaker 1>to it into all that would get us off track.

0:09:01.360 --> 0:09:04.400
<v Speaker 1>Plus I've done an episode about it. So we perceive

0:09:04.520 --> 0:09:10.400
<v Speaker 1>these harmonics as collectively being a single sensation. So when

0:09:10.400 --> 0:09:13.439
<v Speaker 1>you hit a C on a piano, what we're essentially

0:09:13.440 --> 0:09:17.480
<v Speaker 1>hearing is that fundamental frequency of C. And rather than

0:09:17.520 --> 0:09:21.679
<v Speaker 1>perceiving all the harmonics distinctly, like saying, oh, I also

0:09:21.800 --> 0:09:24.520
<v Speaker 1>hear a higher C and a G and an E,

0:09:25.080 --> 0:09:30.000
<v Speaker 1>rather than saying that these harmonics combine into what we

0:09:30.080 --> 0:09:32.720
<v Speaker 1>think of as the timbre of a piano, we perceive

0:09:32.800 --> 0:09:36.560
<v Speaker 1>it as a single experience. And if you were to

0:09:36.600 --> 0:09:38.800
<v Speaker 1>play that same C on a clarinet or on a

0:09:38.840 --> 0:09:42.080
<v Speaker 1>guitar or on a xylophone, we'd be able to tell

0:09:42.280 --> 0:09:46.359
<v Speaker 1>the same note. But difference in which harmonics are emphasized

0:09:46.360 --> 0:09:48.880
<v Speaker 1>that would determine the timbre of each instrument. We'd be

0:09:48.880 --> 0:09:51.640
<v Speaker 1>able to tell them apart, all right, Now, the reason

0:09:52.080 --> 0:09:55.840
<v Speaker 1>all that is important is that really good speakers will

0:09:55.880 --> 0:09:58.520
<v Speaker 1>be able to play back a wide range of frequencies

0:09:58.520 --> 0:10:02.360
<v Speaker 1>with incredible accuracy. And it's important to represent not just

0:10:02.880 --> 0:10:06.520
<v Speaker 1>the individual notes, but those harmonics Otherwise, like your piano

0:10:06.520 --> 0:10:09.079
<v Speaker 1>wouldn't sound like a piano right if it couldn't represent

0:10:09.120 --> 0:10:12.040
<v Speaker 1>all the harmonics that the piano was creating when it

0:10:12.120 --> 0:10:16.480
<v Speaker 1>was being played, then it wouldn't sound right. So this

0:10:16.640 --> 0:10:19.679
<v Speaker 1>range can stretch well beyond human hearing, because the idea

0:10:19.720 --> 0:10:23.040
<v Speaker 1>is that harmonics that can even go above where the

0:10:23.080 --> 0:10:26.800
<v Speaker 1>typical human can hear can still shape the notes we

0:10:26.960 --> 0:10:30.760
<v Speaker 1>do here, and it's hard for for speakers to be

0:10:30.840 --> 0:10:34.480
<v Speaker 1>able to represent the full range, even really good speakers.

0:10:34.800 --> 0:10:38.480
<v Speaker 1>Often high end speakers will have specific voice coils dedicated

0:10:38.480 --> 0:10:42.120
<v Speaker 1>to replicating a certain subset of frequencies in order to

0:10:42.160 --> 0:10:45.120
<v Speaker 1>cover the full spectrum. And this is where we can

0:10:45.200 --> 0:10:49.440
<v Speaker 1>get into sub whoffers, whoffers and tweeters. Subwhiffers have the

0:10:49.559 --> 0:10:54.320
<v Speaker 1>largest diaphragms and replicate the lowest frequency sounds, so they

0:10:54.320 --> 0:10:57.560
<v Speaker 1>need those big diaphragms in order to move enough air

0:10:57.720 --> 0:11:01.240
<v Speaker 1>at low frequencies and large enough and plitudes so that

0:11:01.280 --> 0:11:04.880
<v Speaker 1>we can actually experience them, hear them, or feel them.

0:11:04.920 --> 0:11:07.160
<v Speaker 1>They also tend to require their own power source in

0:11:07.240 --> 0:11:09.800
<v Speaker 1>order to make this happen. Like they can't just draw

0:11:09.920 --> 0:11:13.360
<v Speaker 1>power through speaker wires the way some other speakers do.

0:11:13.440 --> 0:11:16.040
<v Speaker 1>You typically have to plug these into a wall socket

0:11:16.080 --> 0:11:19.320
<v Speaker 1>on their own. Whoofers tend to cover most of the

0:11:19.400 --> 0:11:22.320
<v Speaker 1>frequencies we experience when we're listening to music from the

0:11:22.320 --> 0:11:25.680
<v Speaker 1>low tones. They're not the lowest to replicating most of

0:11:25.679 --> 0:11:29.080
<v Speaker 1>the tones we hear from musical instruments. Human voices also

0:11:29.120 --> 0:11:32.280
<v Speaker 1>fall into this frequency range covered by whoofers, so these

0:11:32.280 --> 0:11:35.400
<v Speaker 1>are really important for the vast majority of the sounds

0:11:35.480 --> 0:11:39.880
<v Speaker 1>we can distinctly hear. And the tweeters those are speakers

0:11:39.880 --> 0:11:43.319
<v Speaker 1>that are designed to replicate sounds above the threshold of whoofers.

0:11:43.760 --> 0:11:47.040
<v Speaker 1>They can also replicate sounds that go beyond human hearing. Again,

0:11:47.080 --> 0:11:49.559
<v Speaker 1>you might need to have those in order to represent

0:11:49.600 --> 0:11:52.800
<v Speaker 1>those harmonics I was talking about. Now. A combination of

0:11:52.800 --> 0:11:55.920
<v Speaker 1>these speakers is usually necessary to create a true high

0:11:55.920 --> 0:11:59.480
<v Speaker 1>fidelity experience, and UH where you put the speakers ends

0:11:59.520 --> 0:12:02.839
<v Speaker 1>up mattering to both because of stuff like stereo or

0:12:02.960 --> 0:12:06.880
<v Speaker 1>quadrophonic sound or surround sound. UH in that you're sending

0:12:06.920 --> 0:12:10.080
<v Speaker 1>specific channels of sound to specific directions, So you want

0:12:10.080 --> 0:12:13.960
<v Speaker 1>to have your speaker set up to two replicate that, right,

0:12:14.480 --> 0:12:16.560
<v Speaker 1>It wouldn't make sense to have your left and right

0:12:16.640 --> 0:12:19.680
<v Speaker 1>speakers all just directly in front of you. You wouldn't

0:12:19.679 --> 0:12:22.760
<v Speaker 1>really benefit from the stereo sound that way. Uh. And

0:12:22.840 --> 0:12:25.720
<v Speaker 1>also some speakers, like the tweeters in particular, can be

0:12:25.840 --> 0:12:29.200
<v Speaker 1>very highly directional, so you need to have them positioned

0:12:29.280 --> 0:12:32.800
<v Speaker 1>just right. So for true audio files, where you put

0:12:32.840 --> 0:12:37.119
<v Speaker 1>speakers ends up being a very big part of the equation.

0:12:37.679 --> 0:12:40.880
<v Speaker 1>And suffice it to say, cheap speakers typically can't do

0:12:41.040 --> 0:12:43.800
<v Speaker 1>all of this. Many of the cheaper speakers are kind

0:12:43.800 --> 0:12:46.800
<v Speaker 1>of marketed as full range speakers, meaning they're meant to

0:12:46.840 --> 0:12:50.520
<v Speaker 1>replicate sounds throughout the range of human hearing. But these

0:12:50.559 --> 0:12:53.640
<v Speaker 1>speakers are limited by their size. If the diaphragm and

0:12:53.679 --> 0:12:56.440
<v Speaker 1>the speakers is like of a medium size, well the

0:12:56.440 --> 0:12:59.280
<v Speaker 1>speakers should be able to cover most, but not all,

0:12:59.360 --> 0:13:02.240
<v Speaker 1>of the frequent these we can experience. If it's a

0:13:02.320 --> 0:13:04.679
<v Speaker 1>larger diaphragm, well then you might lose some of the

0:13:04.760 --> 0:13:07.040
<v Speaker 1>higher pitches. You might not be able to replicate those

0:13:07.080 --> 0:13:10.079
<v Speaker 1>as well. If it's a smaller speaker, then you might

0:13:10.200 --> 0:13:13.720
<v Speaker 1>end up losing some of the base tones. Materials, wiring,

0:13:13.720 --> 0:13:17.960
<v Speaker 1>and engineering all matter when it comes to speaker quality. However,

0:13:19.000 --> 0:13:21.520
<v Speaker 1>you can reach a point of diminishing returns, and in fact,

0:13:21.720 --> 0:13:23.959
<v Speaker 1>that is going to be true for pretty much every

0:13:24.000 --> 0:13:26.679
<v Speaker 1>component we're going to be talking about today. And by

0:13:26.720 --> 0:13:29.920
<v Speaker 1>that I mean you might listen to a dinky, cheap

0:13:30.000 --> 0:13:33.160
<v Speaker 1>speaker and then listen to the same music on a

0:13:33.240 --> 0:13:36.600
<v Speaker 1>moderately priced speaker, and you could hear a huge difference

0:13:36.600 --> 0:13:39.120
<v Speaker 1>in quality. You could say, all right, this, this more

0:13:39.160 --> 0:13:43.319
<v Speaker 1>expensive speaker is clearly leagues better than the cheap one.

0:13:43.960 --> 0:13:46.440
<v Speaker 1>But then you might go from the moderate speaker to

0:13:46.720 --> 0:13:49.800
<v Speaker 1>an expensive speaker and you say, yeah, I can tell

0:13:49.920 --> 0:13:52.720
<v Speaker 1>there's a difference, but it's not nearly as dramatic a

0:13:52.920 --> 0:13:55.920
<v Speaker 1>jump as the first one. Then you might go to

0:13:56.440 --> 0:14:00.080
<v Speaker 1>like a luxury speaker, some ludicrously expensive speakers, and they

0:14:00.120 --> 0:14:03.000
<v Speaker 1>are some really expensive ones, and you might not be

0:14:03.040 --> 0:14:06.160
<v Speaker 1>able to tell the difference between the you know, expensive

0:14:06.160 --> 0:14:10.080
<v Speaker 1>speaker and the ludicrously expensive speaker. Uh. There are a

0:14:10.080 --> 0:14:11.760
<v Speaker 1>lot of people who suggest that if you were to

0:14:11.800 --> 0:14:15.559
<v Speaker 1>do a double blind study where neither the person administering

0:14:15.600 --> 0:14:18.880
<v Speaker 1>the test nor the person experiencing it knows which system

0:14:18.960 --> 0:14:21.920
<v Speaker 1>is being played. You probably wouldn't be able to pick

0:14:21.960 --> 0:14:24.680
<v Speaker 1>out too many differences between some of these, you know,

0:14:25.000 --> 0:14:28.400
<v Speaker 1>high end and ultra high end speaker systems. Now, it

0:14:28.480 --> 0:14:29.840
<v Speaker 1>might mean that if you were to take some very

0:14:29.840 --> 0:14:34.200
<v Speaker 1>sensitive electronic equipment to measure stuff like signal loss or

0:14:34.240 --> 0:14:37.800
<v Speaker 1>the purity of tones being omitted, that maybe on those

0:14:37.840 --> 0:14:39.920
<v Speaker 1>instruments you might be able to see a difference, like

0:14:40.120 --> 0:14:43.120
<v Speaker 1>these very highly tuned instruments might indicate that, But that

0:14:43.160 --> 0:14:45.800
<v Speaker 1>doesn't mean your ears and brain would pick up on that.

0:14:46.400 --> 0:14:49.080
<v Speaker 1>So that's another red flag for audio files. Just because

0:14:49.080 --> 0:14:52.360
<v Speaker 1>something might technically perform quote unquote better doesn't mean you

0:14:52.400 --> 0:14:56.200
<v Speaker 1>to actually perceive any meaningful improvements. So don't just jump

0:14:56.200 --> 0:14:58.520
<v Speaker 1>out there and by the most expensive stuff on the market.

0:14:59.280 --> 0:15:02.600
<v Speaker 1>All right, where we've got a lot more ground to cover.

0:15:02.680 --> 0:15:04.840
<v Speaker 1>But before we get into any of that, let's take

0:15:04.920 --> 0:15:15.320
<v Speaker 1>a quick break. Okay, we're back now. For speakers to

0:15:15.360 --> 0:15:19.080
<v Speaker 1>be able to play anything at all, you need an amplifier.

0:15:19.360 --> 0:15:22.600
<v Speaker 1>So an amplifier's job is to take a weak incoming

0:15:22.720 --> 0:15:26.760
<v Speaker 1>electrical signal and boost it to a more powerful signal

0:15:27.080 --> 0:15:30.880
<v Speaker 1>that can drive speakers all the while avoiding the introduction

0:15:31.280 --> 0:15:35.160
<v Speaker 1>of unwanted changes to the signal. In other words, you

0:15:35.200 --> 0:15:38.880
<v Speaker 1>want to turn up the power without adding in noise

0:15:39.560 --> 0:15:43.680
<v Speaker 1>or distortion. This is easier said than done. Now it

0:15:43.720 --> 0:15:45.720
<v Speaker 1>helps if we think about the nature of the signal

0:15:45.800 --> 0:15:49.040
<v Speaker 1>we need to boost. Uh. And let's take the example

0:15:49.080 --> 0:15:53.040
<v Speaker 1>of a microphone, which is typically, you know, the opposite

0:15:53.040 --> 0:15:55.680
<v Speaker 1>of a speaker. You can think of like a microphone

0:15:55.680 --> 0:15:59.200
<v Speaker 1>and a speaker working on the same principle, just in reverse.

0:15:59.560 --> 0:16:02.360
<v Speaker 1>So with a speaker, we have an electrical signal going

0:16:02.400 --> 0:16:08.560
<v Speaker 1>to a device that that electrical signal is essentially encoded

0:16:08.600 --> 0:16:12.320
<v Speaker 1>audio is what it really boils down to, and it

0:16:12.400 --> 0:16:15.160
<v Speaker 1>goes to a voice coil that ends up causing a

0:16:15.240 --> 0:16:22.400
<v Speaker 1>diaphragm to vibrate. Those vibrations end up causing air to fluctuate.

0:16:22.680 --> 0:16:25.400
<v Speaker 1>Air molecules start to fluctuate, and our ears pick up

0:16:25.400 --> 0:16:28.359
<v Speaker 1>on that and we hear the sound. With a microphone,

0:16:28.880 --> 0:16:31.560
<v Speaker 1>we have a device that has a tiny little diaphragm

0:16:31.600 --> 0:16:35.520
<v Speaker 1>inside it that vibrates when sound hits that diaphragm, and

0:16:36.440 --> 0:16:40.440
<v Speaker 1>the diaphragm's vibrations generate and a very weak electrical signal.

0:16:41.000 --> 0:16:44.000
<v Speaker 1>Uh And and this is all thanks to electromagnetic physics

0:16:44.040 --> 0:16:46.400
<v Speaker 1>that I'm not going to get into here because I've

0:16:46.440 --> 0:16:49.200
<v Speaker 1>covered it to death now. As you might imagine, the

0:16:49.240 --> 0:16:53.720
<v Speaker 1>diaphragm and a microphone is really small and it does vibrate,

0:16:53.800 --> 0:16:57.880
<v Speaker 1>but the movements are very small as well. So what

0:16:57.920 --> 0:17:01.760
<v Speaker 1>you're generating is an incredibly weak electric signal with a

0:17:01.800 --> 0:17:06.639
<v Speaker 1>microphone that represents those vibrations, and it's too weak to

0:17:06.720 --> 0:17:09.679
<v Speaker 1>do anything really useful with it. Right, you wouldn't have

0:17:09.840 --> 0:17:13.959
<v Speaker 1>enough umph there to drive a speaker. The speaker wouldn't

0:17:14.000 --> 0:17:17.280
<v Speaker 1>have enough electricity to make the magnets move the diaphragm

0:17:17.359 --> 0:17:19.399
<v Speaker 1>to a point where you could play an audible sound.

0:17:20.000 --> 0:17:23.439
<v Speaker 1>So typically you would then pass this very weak signal

0:17:24.000 --> 0:17:27.440
<v Speaker 1>to what is called a preamp. That's an amplifier that

0:17:27.480 --> 0:17:31.200
<v Speaker 1>can take a very weak signal and make it less weak,

0:17:31.560 --> 0:17:36.359
<v Speaker 1>like to align level feed. This would then move along

0:17:36.400 --> 0:17:40.280
<v Speaker 1>to a power amplifier, which would take this slightly stronger

0:17:40.359 --> 0:17:42.200
<v Speaker 1>signal and then boost it to a point where it

0:17:42.240 --> 0:17:46.119
<v Speaker 1>could drive a speaker system. But if you have a

0:17:46.160 --> 0:17:51.080
<v Speaker 1>bad amplifier, one that's prone to introducing noise and distortion, well,

0:17:51.200 --> 0:17:55.080
<v Speaker 1>that signal gets corrupted as it continues down. The signal chain,

0:17:55.600 --> 0:17:59.359
<v Speaker 1>and you know, anywhere could be the point where you

0:17:59.440 --> 0:18:02.560
<v Speaker 1>have a problem. If the preamp is bad, then a

0:18:02.560 --> 0:18:06.520
<v Speaker 1>little change in that week's signal will get boosted into

0:18:06.560 --> 0:18:10.920
<v Speaker 1>bigger changes further down the line. Now, numerous companies have

0:18:11.040 --> 0:18:14.840
<v Speaker 1>developed different circuit designs to reduce the possibility of introducing

0:18:15.400 --> 0:18:20.239
<v Speaker 1>background noise and distortion into a signal, or you know,

0:18:20.600 --> 0:18:24.040
<v Speaker 1>introducing disharmonics and things of that nature. There are far

0:18:24.080 --> 0:18:28.080
<v Speaker 1>too many different types of circuits and components to really

0:18:28.119 --> 0:18:31.120
<v Speaker 1>go into here. Uh And it does get very technical

0:18:31.320 --> 0:18:35.920
<v Speaker 1>about how you align these things in order to accurately

0:18:36.119 --> 0:18:40.959
<v Speaker 1>boost the signal without altering it otherwise. Uh. And amplifiers

0:18:40.960 --> 0:18:43.640
<v Speaker 1>are kind of like speakers. Some are really really good

0:18:43.680 --> 0:18:47.359
<v Speaker 1>at boosting signals that represent certain frequency ranges and not

0:18:47.520 --> 0:18:51.080
<v Speaker 1>as great at other frequency ranges. So you occasionally end

0:18:51.160 --> 0:18:56.160
<v Speaker 1>up with amplifiers or receivers which very frequently have amplifiers

0:18:56.160 --> 0:18:59.520
<v Speaker 1>built into them, that get a reputation for being really

0:18:59.560 --> 0:19:03.480
<v Speaker 1>good with certain ranges of sound representation, but not so

0:19:03.520 --> 0:19:08.680
<v Speaker 1>good at others. Also, amplifiers can introduce harmonic distortion. There's

0:19:08.760 --> 0:19:12.760
<v Speaker 1>usually a measure of an amplifier's total harmonic distortion plus noise,

0:19:13.520 --> 0:19:16.760
<v Speaker 1>so if you ever are looking at hi fi equipment

0:19:16.840 --> 0:19:20.240
<v Speaker 1>and you find that there's a number that represents total

0:19:20.320 --> 0:19:23.040
<v Speaker 1>harmonic distortion plus noise, you want that number to be

0:19:23.080 --> 0:19:27.240
<v Speaker 1>as low as it can be. The lower that number,

0:19:27.440 --> 0:19:30.359
<v Speaker 1>the better it is. Higher numbers indicate that the amplifier

0:19:30.440 --> 0:19:35.399
<v Speaker 1>is more prone to introducing unwanted signal. Meanwhile, there's another

0:19:35.600 --> 0:19:39.040
<v Speaker 1>metric called signal to noise ratio. In that case, you

0:19:39.040 --> 0:19:41.679
<v Speaker 1>want the number to be very big because indicates that

0:19:41.720 --> 0:19:45.320
<v Speaker 1>the amplifier is is very effective at boosting signal and

0:19:46.280 --> 0:19:50.680
<v Speaker 1>uh suppressing background noise. Then you have a thing called

0:19:50.720 --> 0:19:55.119
<v Speaker 1>cross talk. So with stereo systems, you've got a left

0:19:55.320 --> 0:19:58.560
<v Speaker 1>channel for sound and a right channel for sound. Now,

0:19:58.600 --> 0:20:02.520
<v Speaker 1>ideally you would have perfect isolation for those two channels,

0:20:02.560 --> 0:20:05.200
<v Speaker 1>so that only the stuff that's intended for the left

0:20:05.200 --> 0:20:08.320
<v Speaker 1>side goes to the left and only the stuff intended

0:20:08.359 --> 0:20:11.399
<v Speaker 1>for the right side goes to the right. And you

0:20:11.440 --> 0:20:13.639
<v Speaker 1>could even have it where you've got two instruments that

0:20:13.680 --> 0:20:16.399
<v Speaker 1>are essentially doing like a call and response and have

0:20:16.520 --> 0:20:19.760
<v Speaker 1>them perfectly isolated so that you're hearing like a guitar

0:20:19.880 --> 0:20:22.520
<v Speaker 1>on your right and a violin on your left or something.

0:20:23.280 --> 0:20:26.280
<v Speaker 1>But there's this effect called cross talk, which describes to

0:20:26.440 --> 0:20:29.240
<v Speaker 1>what extent the stuff that was meant for the right

0:20:29.320 --> 0:20:33.240
<v Speaker 1>channel bleed over into the left channel, and vice versa.

0:20:33.400 --> 0:20:35.840
<v Speaker 1>So you don't want a lot of cross talk. You

0:20:35.880 --> 0:20:37.760
<v Speaker 1>want cross talk to be at a minimum so that

0:20:37.800 --> 0:20:41.720
<v Speaker 1>you have as pure and isolated experience as you can

0:20:41.880 --> 0:20:44.880
<v Speaker 1>for the stuff that was recorded for that purpose. Keep

0:20:44.920 --> 0:20:48.280
<v Speaker 1>in mind, what I'm describing here is completely dependent upon

0:20:48.320 --> 0:20:51.560
<v Speaker 1>the media you're playing. Right, If you're playing back media

0:20:51.640 --> 0:20:55.440
<v Speaker 1>that's say mono, well none of that matters because the

0:20:55.520 --> 0:20:57.960
<v Speaker 1>same signal is going to go to every single speaker.

0:20:58.320 --> 0:21:01.800
<v Speaker 1>That's what mono is. If it stereo, but the mix

0:21:02.040 --> 0:21:05.439
<v Speaker 1>doesn't go to that level of separation, then it doesn't

0:21:05.480 --> 0:21:09.359
<v Speaker 1>matter quite as much there either. But generally speaking, you

0:21:09.440 --> 0:21:12.400
<v Speaker 1>don't want a lot of cross talk, so you want

0:21:12.440 --> 0:21:15.760
<v Speaker 1>that number to be low. Uh and we typically measure

0:21:15.800 --> 0:21:19.119
<v Speaker 1>that in negative deciples, So the lower the number, like

0:21:19.200 --> 0:21:22.520
<v Speaker 1>negative one decibles, is better than negative sixty deciples for

0:21:22.600 --> 0:21:25.360
<v Speaker 1>cross talk. Now, you might also have a separate receiver,

0:21:25.680 --> 0:21:28.960
<v Speaker 1>which is the device that manages all the input connections

0:21:29.000 --> 0:21:33.119
<v Speaker 1>to your speakers, which would be the output, or you

0:21:33.200 --> 0:21:35.960
<v Speaker 1>might have a receiver that has its own amplifier, So

0:21:36.000 --> 0:21:38.520
<v Speaker 1>the receiver and amplifier are the same piece of equipment.

0:21:39.119 --> 0:21:42.639
<v Speaker 1>That simplifies the signal chain a little bit. But generally speaking,

0:21:42.720 --> 0:21:46.360
<v Speaker 1>the stuff I refer to with amplifiers also largely applies

0:21:46.480 --> 0:21:51.720
<v Speaker 1>to receivers. And again the signal chain is very important here.

0:21:51.760 --> 0:21:56.439
<v Speaker 1>The simpler it is, to an extent, the better it

0:21:56.480 --> 0:21:59.120
<v Speaker 1>can be. The more stuff you're adding in, the more

0:21:59.160 --> 0:22:03.080
<v Speaker 1>filters and things adding in, uh, the more opportunities you

0:22:03.119 --> 0:22:05.800
<v Speaker 1>have for a point of failure. However, you could also

0:22:05.880 --> 0:22:09.719
<v Speaker 1>argue that these are things that could potentially compensate should

0:22:10.240 --> 0:22:12.600
<v Speaker 1>there be another issue with the signal chain somewhere else.

0:22:13.400 --> 0:22:16.160
<v Speaker 1>And then beyond that you have your actual music sources

0:22:16.200 --> 0:22:20.800
<v Speaker 1>like a turntable or a tape deck or CD player, radio,

0:22:20.920 --> 0:22:24.920
<v Speaker 1>that kind of thing. These also range in quality. There

0:22:24.920 --> 0:22:27.959
<v Speaker 1>are folks who will drop more than ten thousand dollars

0:22:28.000 --> 0:22:32.840
<v Speaker 1>on a turntable alone. So does it ten grand turntable

0:22:33.080 --> 0:22:36.199
<v Speaker 1>sound a hundred times better than a hundred dollar turntable.

0:22:37.160 --> 0:22:40.920
<v Speaker 1>That's hard to quantify, especially if you aren't otherwise using

0:22:40.960 --> 0:22:44.119
<v Speaker 1>the exact same system from that point forward. Right, if

0:22:44.160 --> 0:22:47.119
<v Speaker 1>you're talking about an all in one turntable that has

0:22:47.160 --> 0:22:50.680
<v Speaker 1>its own little speaker. Obviously that's going to sound different

0:22:51.200 --> 0:22:55.359
<v Speaker 1>from a high end turntable that's part of a finely

0:22:55.520 --> 0:22:59.119
<v Speaker 1>tuned audio system. But it does get to this point

0:22:59.160 --> 0:23:03.840
<v Speaker 1>again of diminished returns, of at what point is the

0:23:03.920 --> 0:23:10.280
<v Speaker 1>improvement perceptible or not perceptible. Obviously, components can range from

0:23:10.359 --> 0:23:13.840
<v Speaker 1>excellent to terrible. A turntable that isn't weighted properly might

0:23:13.880 --> 0:23:16.840
<v Speaker 1>introduce unwanted motion while playing back albums. You might get

0:23:16.880 --> 0:23:19.800
<v Speaker 1>some wobble in there that can end up coming through

0:23:19.840 --> 0:23:22.200
<v Speaker 1>with the playback and you're not going to get an

0:23:22.240 --> 0:23:25.439
<v Speaker 1>ideal listening experience. Or it could be that the stylists

0:23:26.040 --> 0:23:31.159
<v Speaker 1>isn't picking up vibrations as since as sensitive as you

0:23:31.200 --> 0:23:33.679
<v Speaker 1>wanted to, so you might be missing some of the

0:23:33.760 --> 0:23:37.200
<v Speaker 1>subtle parts of the recording that you would otherwise get

0:23:37.240 --> 0:23:40.840
<v Speaker 1>if you had a better quality cartridge in your in

0:23:40.920 --> 0:23:45.280
<v Speaker 1>your turntable. So these are all things that matter, but

0:23:45.359 --> 0:23:50.800
<v Speaker 1>again diminishing returns do come into play, and it also

0:23:50.840 --> 0:23:53.440
<v Speaker 1>again depends upon matching the turntable up with the rest

0:23:53.480 --> 0:23:57.560
<v Speaker 1>of your components. If everything else isn't tuned properly, then

0:23:57.600 --> 0:23:59.160
<v Speaker 1>you could have the best turn table in the world

0:23:59.240 --> 0:24:02.480
<v Speaker 1>and still not it an ideal experience. At the other

0:24:02.600 --> 0:24:06.199
<v Speaker 1>end of it, um, I will say that I've I

0:24:06.240 --> 0:24:08.840
<v Speaker 1>looked into turntables a lot. I ended up buying a

0:24:08.960 --> 0:24:13.000
<v Speaker 1>very simple one like audio files would immediately consider me

0:24:13.320 --> 0:24:17.119
<v Speaker 1>a total pleab which I am. That's fine, But I

0:24:17.160 --> 0:24:19.159
<v Speaker 1>didn't go for one that would need its own like

0:24:19.280 --> 0:24:22.919
<v Speaker 1>preamp or amplifier. Uh. It was not that kind of thing.

0:24:22.960 --> 0:24:25.719
<v Speaker 1>It was sort of an all in one little radio

0:24:26.280 --> 0:24:30.560
<v Speaker 1>CD player turntable system, which from my purposes is fine.

0:24:30.600 --> 0:24:34.200
<v Speaker 1>I know I'm not getting the highest quality audio experience

0:24:34.200 --> 0:24:36.520
<v Speaker 1>out of it, but for what I like, it's fine.

0:24:37.560 --> 0:24:41.840
<v Speaker 1>But if you really want that rich audio experience, obviously

0:24:41.880 --> 0:24:44.240
<v Speaker 1>you have to take even more time and put more

0:24:44.320 --> 0:24:47.640
<v Speaker 1>money toward it in order to achieve that. And then

0:24:47.680 --> 0:24:49.960
<v Speaker 1>finally we've got all the wires that we use to

0:24:50.000 --> 0:24:53.439
<v Speaker 1>connect all these different pieces together. There are companies like

0:24:53.560 --> 0:24:57.040
<v Speaker 1>Monster Incorporated that have built up an entire industry around

0:24:57.040 --> 0:25:00.119
<v Speaker 1>the idea of high end cables, some of them with

0:25:00.200 --> 0:25:03.560
<v Speaker 1>like precious metals that are said to provide the absolute

0:25:03.640 --> 0:25:08.679
<v Speaker 1>best connection so you have no signal loss. And here's

0:25:08.680 --> 0:25:12.240
<v Speaker 1>the thing. It is true that really bad cables can

0:25:12.280 --> 0:25:15.800
<v Speaker 1>have a negative impact on a high fidelity experience. A

0:25:15.880 --> 0:25:19.679
<v Speaker 1>cable with poor shielding might pick up interference from radio

0:25:19.760 --> 0:25:23.919
<v Speaker 1>signals or other electrical wires, for example, and that will

0:25:23.960 --> 0:25:27.640
<v Speaker 1>affect output. You will get noise in the output if

0:25:27.720 --> 0:25:32.400
<v Speaker 1>in fact that's what you're using. But that's like the

0:25:32.480 --> 0:25:35.640
<v Speaker 1>worst cables that you can find. Most cables these days

0:25:35.640 --> 0:25:39.600
<v Speaker 1>are decently shielded, and anything better than like the bargain

0:25:39.600 --> 0:25:42.440
<v Speaker 1>bin stuff is likely to work just fine and last

0:25:42.480 --> 0:25:45.960
<v Speaker 1>a good long time. Uh. In fact, there are home

0:25:46.000 --> 0:25:47.879
<v Speaker 1>theater sites out there that go so far as to

0:25:47.880 --> 0:25:50.800
<v Speaker 1>say that once you get over a thresholder around fifty

0:25:50.880 --> 0:25:54.440
<v Speaker 1>dollars for high end cables, you're not really going to

0:25:54.520 --> 0:25:57.720
<v Speaker 1>see any perceptible difference in output quality, and arguably you

0:25:57.760 --> 0:26:01.080
<v Speaker 1>won't even see it before you hit that d dollar threshold.

0:26:01.520 --> 0:26:04.280
<v Speaker 1>And again, you might be able to tell a difference

0:26:04.280 --> 0:26:08.000
<v Speaker 1>if you're using sensitive electrical instruments that can detect things

0:26:08.080 --> 0:26:12.000
<v Speaker 1>like a signal leak or anything alone those lines, but

0:26:13.000 --> 0:26:17.080
<v Speaker 1>that's again a very finely tuned piece of equipment. The

0:26:17.160 --> 0:26:20.800
<v Speaker 1>human ear would not necessarily be able to register any

0:26:20.880 --> 0:26:26.800
<v Speaker 1>difference or any decline in quality, So buyer beware when

0:26:26.880 --> 0:26:30.119
<v Speaker 1>it comes to high end cables. In particular, I feel

0:26:30.119 --> 0:26:34.520
<v Speaker 1>like that, out of all the different high thigh world stuff,

0:26:34.640 --> 0:26:38.120
<v Speaker 1>has got the most snake oil around it. Now, I'm

0:26:38.119 --> 0:26:41.440
<v Speaker 1>gonna have some really tough news to deliver to audio

0:26:41.520 --> 0:26:44.359
<v Speaker 1>files and to people who are interested in getting into

0:26:44.400 --> 0:26:49.719
<v Speaker 1>like a high fidelity situation. Uh, but you know, I

0:26:49.760 --> 0:26:52.200
<v Speaker 1>feel like we need to brace ourselves. So let's take

0:26:52.240 --> 0:26:55.960
<v Speaker 1>another quick break and when we come back then hopefully

0:26:56.000 --> 0:27:06.439
<v Speaker 1>we'll all be ready for it. All right, here's that

0:27:06.560 --> 0:27:09.960
<v Speaker 1>harsh news I was talking about. I am sad to

0:27:10.000 --> 0:27:15.840
<v Speaker 1>tell you there is no best audio set up. There

0:27:15.920 --> 0:27:22.680
<v Speaker 1>is no ultimate collection of receivers, amplifiers, speakers, audio components

0:27:22.720 --> 0:27:28.600
<v Speaker 1>and wires that categorically makes that system superior over every

0:27:28.680 --> 0:27:31.640
<v Speaker 1>other one on the market. The reason I feel confident

0:27:31.720 --> 0:27:37.560
<v Speaker 1>saying this is because of the gray matter between our ears. Yep,

0:27:37.640 --> 0:27:42.240
<v Speaker 1>we're talking brains, the stuff what processes our thoughts and senses.

0:27:42.760 --> 0:27:45.960
<v Speaker 1>It's also Zombie's favorite snack food. And we're gonna talk

0:27:45.960 --> 0:27:48.680
<v Speaker 1>about some really obvious stuff here, but it's also stuff

0:27:48.760 --> 0:27:51.520
<v Speaker 1>that we often take for granted or just outright forget.

0:27:52.040 --> 0:27:54.520
<v Speaker 1>So one of the things we tend to forget is

0:27:54.520 --> 0:27:59.480
<v Speaker 1>that our sensory experience is filtered through our brains. You've

0:27:59.480 --> 0:28:03.040
<v Speaker 1>probably heard the philosophical question, if a tree falls in

0:28:03.080 --> 0:28:05.400
<v Speaker 1>the forest and there's no one around to hear, it

0:28:05.440 --> 0:28:08.359
<v Speaker 1>doesn't make a sound. Well, on the one hand, you

0:28:08.359 --> 0:28:10.760
<v Speaker 1>could say, well, of course it makes a sound because

0:28:10.760 --> 0:28:14.760
<v Speaker 1>sound is vibration. Sound travels through the air because air

0:28:14.800 --> 0:28:18.520
<v Speaker 1>molecules vibrate. We get these little pockets of air pressure fluctuation,

0:28:19.000 --> 0:28:22.320
<v Speaker 1>and when those fluctuations interact with the tympanic membrane that

0:28:22.480 --> 0:28:26.399
<v Speaker 1>is our ear drum in our ears, we then experience

0:28:26.520 --> 0:28:30.159
<v Speaker 1>this as sound. We perceive it as sound, we hear

0:28:30.440 --> 0:28:33.560
<v Speaker 1>the vibrations. And of course it doesn't just travel through

0:28:33.600 --> 0:28:36.879
<v Speaker 1>the air. Sound can travel through solids and liquids as well.

0:28:37.560 --> 0:28:40.680
<v Speaker 1>This is why sound doesn't exist out in space, or

0:28:40.720 --> 0:28:43.600
<v Speaker 1>at least sound doesn't travel in space, rather because you

0:28:43.600 --> 0:28:45.800
<v Speaker 1>don't have enough particles out in space that are close

0:28:45.880 --> 0:28:49.680
<v Speaker 1>enough together to allow vibration to propagate through a medium.

0:28:50.200 --> 0:28:52.440
<v Speaker 1>So that's why there's no sound in space. There's not

0:28:52.600 --> 0:28:56.360
<v Speaker 1>enough stuff to wiggle against each other to make sound go.

0:28:57.080 --> 0:29:00.240
<v Speaker 1>So we can be confident that if a tree falls

0:29:00.240 --> 0:29:03.720
<v Speaker 1>in the forest, there will be vibrations. Stuff will be vibrating.

0:29:04.160 --> 0:29:07.200
<v Speaker 1>But then you get to the more philosophical question, if

0:29:07.240 --> 0:29:10.400
<v Speaker 1>there is no one to hear those vibrations, do we

0:29:10.480 --> 0:29:14.720
<v Speaker 1>really call it sound? Or is sound actually what we

0:29:14.800 --> 0:29:20.480
<v Speaker 1>call the experience of perceiving of hearing vibrations, Because if

0:29:20.520 --> 0:29:23.200
<v Speaker 1>there is no one to experience it, well, then maybe

0:29:23.240 --> 0:29:27.240
<v Speaker 1>there is no sound. Not that there isn't something to hear,

0:29:27.320 --> 0:29:29.360
<v Speaker 1>but rather there is no one there to hear it.

0:29:29.480 --> 0:29:32.600
<v Speaker 1>And it's the hearing that is sound rather than the

0:29:32.720 --> 0:29:35.240
<v Speaker 1>vibration that is sound. So it all depends on how

0:29:35.320 --> 0:29:38.400
<v Speaker 1>you define it. Now why would I even bother to

0:29:38.520 --> 0:29:43.240
<v Speaker 1>bring up this philosophical conundrum, Well, it's because it kind

0:29:43.240 --> 0:29:45.680
<v Speaker 1>of gets to the heart of the matter. The experience

0:29:45.680 --> 0:29:50.800
<v Speaker 1>of sound is a personal one, a subjective one. It

0:29:51.000 --> 0:29:55.640
<v Speaker 1>isn't just physics. It's a point where physics and our

0:29:55.680 --> 0:29:59.800
<v Speaker 1>minds interact with each other. The vibrations are ear jumps

0:29:59.840 --> 0:30:03.320
<v Speaker 1>pick up, get translated into electrical signals that our brains

0:30:03.400 --> 0:30:07.760
<v Speaker 1>process and interpret, and we experience it as sound. So

0:30:08.280 --> 0:30:13.080
<v Speaker 1>at least part of what makes sound happen is in

0:30:13.160 --> 0:30:16.440
<v Speaker 1>our heads, and that gets right to the point. Sound

0:30:16.560 --> 0:30:19.480
<v Speaker 1>is somewhat subjective, and I have no way of knowing

0:30:19.640 --> 0:30:24.600
<v Speaker 1>if what you hear and what I hear sounds the same,

0:30:24.720 --> 0:30:27.200
<v Speaker 1>even if it's coming from the same source. I don't

0:30:27.240 --> 0:30:31.080
<v Speaker 1>know that your experience is exactly the same as mine,

0:30:31.200 --> 0:30:33.520
<v Speaker 1>and because I have no way of experiencing the way

0:30:33.840 --> 0:30:37.720
<v Speaker 1>you experience it, I'll never know. And that means that

0:30:37.760 --> 0:30:40.479
<v Speaker 1>I could theoretically build out a sound system that I

0:30:40.520 --> 0:30:43.440
<v Speaker 1>find to be a perfect one. You know, to me,

0:30:43.600 --> 0:30:47.240
<v Speaker 1>it reproduces sounds in a way that I find flawless,

0:30:47.800 --> 0:30:52.720
<v Speaker 1>that all the levels are perfect, the volume from each

0:30:52.880 --> 0:30:58.480
<v Speaker 1>component is perfect, and the tone is sublime. But it's

0:30:58.520 --> 0:31:01.560
<v Speaker 1>possible that to you listening to it, that the experience

0:31:01.560 --> 0:31:04.760
<v Speaker 1>would seem a little long. Maybe there's some harmonics that

0:31:04.800 --> 0:31:07.719
<v Speaker 1>you can sense that I can't just because of like

0:31:07.840 --> 0:31:11.200
<v Speaker 1>hearing loss that I have, Or maybe there are certain

0:31:11.200 --> 0:31:14.800
<v Speaker 1>frequencies that you just hear better than I do. Because again,

0:31:14.840 --> 0:31:17.560
<v Speaker 1>I'm getting older, so those higher frequencies are starting to

0:31:17.640 --> 0:31:19.360
<v Speaker 1>drop out for me, and I may not be able

0:31:19.400 --> 0:31:22.640
<v Speaker 1>to hear them the way you can, So I don't

0:31:22.720 --> 0:31:25.640
<v Speaker 1>realize that I don't have my system tuned properly to

0:31:25.880 --> 0:31:28.880
<v Speaker 1>deal with those. There are many factors that might mean

0:31:28.920 --> 0:31:32.280
<v Speaker 1>what sounds good to me doesn't sound good to you.

0:31:32.800 --> 0:31:36.680
<v Speaker 1>Now complicating this is the vocabulary used by audio files,

0:31:36.720 --> 0:31:40.440
<v Speaker 1>and they rely on words like round and warm and

0:31:40.600 --> 0:31:45.480
<v Speaker 1>full and brittle to describe sound, And there they've essentially

0:31:45.520 --> 0:31:49.960
<v Speaker 1>had to appropriate words from other contexts in order to

0:31:50.000 --> 0:31:54.560
<v Speaker 1>describe sound properly, because we just never really coined words

0:31:54.840 --> 0:31:58.920
<v Speaker 1>specific to sounds qualities, at least not beyond some pretty

0:31:58.920 --> 0:32:02.240
<v Speaker 1>simple ones like loud or soft, or low pitched or

0:32:02.320 --> 0:32:05.000
<v Speaker 1>high pitched, that kind of thing. Now, I have read

0:32:05.960 --> 0:32:11.440
<v Speaker 1>enthusiastic reviews of high end sound systems, and after reading them,

0:32:11.440 --> 0:32:14.800
<v Speaker 1>I've walked away not knowing what the heck the reviewer

0:32:14.960 --> 0:32:19.160
<v Speaker 1>meant in their review, Like, clearly the reviewer loved the system,

0:32:19.200 --> 0:32:21.959
<v Speaker 1>but the words the reviewer used to describe the sound

0:32:22.920 --> 0:32:26.080
<v Speaker 1>didn't have any real meaning for me. So in my experience,

0:32:26.080 --> 0:32:30.280
<v Speaker 1>and I admit this is from my perspective, I have

0:32:30.360 --> 0:32:34.320
<v Speaker 1>to actually hear something for myself to kind of understand

0:32:34.360 --> 0:32:37.400
<v Speaker 1>what people are getting at. I don't think I could

0:32:37.440 --> 0:32:40.800
<v Speaker 1>adequately describe what it means when a sound is warm,

0:32:41.160 --> 0:32:44.360
<v Speaker 1>for example, but I know it when I hear it,

0:32:44.440 --> 0:32:48.040
<v Speaker 1>so it creates a certain feeling, and that's where perception

0:32:48.120 --> 0:32:51.000
<v Speaker 1>is coming in, right. So the collection of components you

0:32:51.040 --> 0:32:54.920
<v Speaker 1>gather will determine the range of tones and volume that

0:32:55.040 --> 0:32:58.200
<v Speaker 1>you can get out of your sound system, and high

0:32:58.240 --> 0:33:01.960
<v Speaker 1>fidelity systems typically have several controls you can tweak to

0:33:02.120 --> 0:33:05.640
<v Speaker 1>play with the output to get it just the way

0:33:05.840 --> 0:33:08.360
<v Speaker 1>you want it. It might be that you know, out

0:33:08.400 --> 0:33:11.360
<v Speaker 1>of the box it doesn't sound that great, but then

0:33:11.400 --> 0:33:14.959
<v Speaker 1>you spend some time tweaking levels. Maybe you change how

0:33:15.040 --> 0:33:18.160
<v Speaker 1>much base is coming through to the speakers, how much

0:33:18.200 --> 0:33:20.800
<v Speaker 1>of the troubles coming through. You might have have to

0:33:20.840 --> 0:33:24.360
<v Speaker 1>deal with the balance slightly because maybe you've set up

0:33:24.360 --> 0:33:27.120
<v Speaker 1>a listening environment where let's say you've got a couch

0:33:27.160 --> 0:33:30.560
<v Speaker 1>set up, and your couch is just slightly closer to

0:33:30.600 --> 0:33:32.840
<v Speaker 1>the left speakers than to the right speakers, and so

0:33:33.280 --> 0:33:35.120
<v Speaker 1>with the balance, you tweak it just a bit so

0:33:35.160 --> 0:33:38.000
<v Speaker 1>that the right speakers get a little more juice to

0:33:38.320 --> 0:33:40.200
<v Speaker 1>make up for the fact that they're a little further

0:33:40.280 --> 0:33:43.400
<v Speaker 1>away from you. Tiny things like this do matter, especially

0:33:43.440 --> 0:33:47.320
<v Speaker 1>if you're trying to recreate the feeling that you were

0:33:47.400 --> 0:33:49.760
<v Speaker 1>there when something was being recorded. A lot of people

0:33:49.800 --> 0:33:54.960
<v Speaker 1>talk about orchestral recordings because with a really good orchestral

0:33:55.160 --> 0:33:58.880
<v Speaker 1>recording in stereo, you get the sensation that you are

0:33:59.280 --> 0:34:03.240
<v Speaker 1>in the middle of the orchestra that you can hear

0:34:03.280 --> 0:34:06.200
<v Speaker 1>the you can hear the position of the instruments like

0:34:06.240 --> 0:34:09.960
<v Speaker 1>they sound like they are ahead of you, into your

0:34:10.040 --> 0:34:12.600
<v Speaker 1>right or off to the left or whatever when you

0:34:12.640 --> 0:34:15.840
<v Speaker 1>have it set up really well. But that requires putting

0:34:15.840 --> 0:34:18.000
<v Speaker 1>in that work right, and a lot of Hi fi

0:34:18.080 --> 0:34:23.120
<v Speaker 1>systems have tons of different controls so that you can

0:34:23.160 --> 0:34:26.920
<v Speaker 1>manage this down at a micro level in order to

0:34:26.960 --> 0:34:30.279
<v Speaker 1>get exactly the experience you one out of it. And

0:34:30.320 --> 0:34:33.000
<v Speaker 1>when you do hit upon an experience that appeals to you,

0:34:33.400 --> 0:34:37.239
<v Speaker 1>it can be magical, like it can feel amazing to

0:34:37.520 --> 0:34:41.920
<v Speaker 1>experience music at that level. It's transcendent in some cases.

0:34:42.320 --> 0:34:45.399
<v Speaker 1>But keep in mind that if it feels transcendent to you,

0:34:45.560 --> 0:34:48.759
<v Speaker 1>it doesn't necessarily translate like that to someone else. Right.

0:34:49.080 --> 0:34:51.400
<v Speaker 1>That's why I go back to saying there's no universal

0:34:51.560 --> 0:34:55.680
<v Speaker 1>perfect sound system out there. Uh. There are lots of

0:34:55.800 --> 0:35:00.440
<v Speaker 1>really expensive systems and components out there. Some of them

0:35:00.440 --> 0:35:04.080
<v Speaker 1>are very very good. They have a very high build quality,

0:35:04.280 --> 0:35:09.480
<v Speaker 1>They have tried and true circuitry that reduces the level

0:35:09.520 --> 0:35:13.120
<v Speaker 1>of distortion and noise that get introduced to a signal. Uh.

0:35:13.160 --> 0:35:17.239
<v Speaker 1>There are speaker systems that are known to be able

0:35:17.280 --> 0:35:22.520
<v Speaker 1>to replicate sounds faithfully and with minimum distortion, even at

0:35:22.640 --> 0:35:26.480
<v Speaker 1>high volume. But there are plenty of them out there

0:35:26.480 --> 0:35:29.719
<v Speaker 1>that are overhyped considering what they're capable of. I mean,

0:35:29.960 --> 0:35:33.080
<v Speaker 1>I took a quick peek online just to find, you know,

0:35:33.600 --> 0:35:35.799
<v Speaker 1>what some of the most expensive speaker systems were, and

0:35:35.840 --> 0:35:39.000
<v Speaker 1>I found some that were over a million dollars just

0:35:39.120 --> 0:35:42.560
<v Speaker 1>for the speakers, that's without anything else. And I feel

0:35:42.560 --> 0:35:45.720
<v Speaker 1>pretty confident to say that I personally would be unable

0:35:45.760 --> 0:35:48.560
<v Speaker 1>to tell the difference in quality between a million dollars

0:35:48.680 --> 0:35:51.680
<v Speaker 1>of speakers and a two thousand dollars set of speakers,

0:35:52.080 --> 0:35:56.040
<v Speaker 1>maybe even less. And that's me, like, that's that's because

0:35:56.080 --> 0:36:02.000
<v Speaker 1>of my capabilities and my limitations. I also will admit

0:36:02.080 --> 0:36:03.840
<v Speaker 1>that there could be people out there who could be

0:36:03.880 --> 0:36:05.719
<v Speaker 1>able to tell the difference. I don't know that it

0:36:05.719 --> 0:36:08.200
<v Speaker 1>would be such a dramatic difference that you'd say, oh,

0:36:08.239 --> 0:36:11.440
<v Speaker 1>it's night and day um. And I do think that

0:36:11.480 --> 0:36:13.200
<v Speaker 1>if we were to do a lot of double blind

0:36:13.239 --> 0:36:17.160
<v Speaker 1>tests with some of these components, you would find that

0:36:17.239 --> 0:36:20.040
<v Speaker 1>most people would not be able to tell significant differences,

0:36:20.160 --> 0:36:25.240
<v Speaker 1>assuming that each system was set up with comparable settings.

0:36:25.680 --> 0:36:27.360
<v Speaker 1>And by that, I mean like if you had the

0:36:27.400 --> 0:36:30.440
<v Speaker 1>two thousand dollars set of speakers set up, but you

0:36:30.520 --> 0:36:33.440
<v Speaker 1>turn the base way down for those, and you didn't

0:36:33.440 --> 0:36:35.680
<v Speaker 1>do that for the million dollars, says speakers. Of course

0:36:36.160 --> 0:36:38.480
<v Speaker 1>there'd be a more perceptible difference. You have to make

0:36:38.480 --> 0:36:41.960
<v Speaker 1>sure that everything else is as equal as it can be,

0:36:42.360 --> 0:36:45.120
<v Speaker 1>and then test to see if there's a perceptible difference.

0:36:45.120 --> 0:36:48.080
<v Speaker 1>And I don't think, especially at the high high end,

0:36:48.719 --> 0:36:53.160
<v Speaker 1>that you come across that much. Anyway, A true audio

0:36:53.200 --> 0:36:56.480
<v Speaker 1>file seeks out the system and settings then most closely

0:36:56.520 --> 0:37:01.840
<v Speaker 1>matches their perception of acoustic perfection. This is also a

0:37:01.920 --> 0:37:05.880
<v Speaker 1>point of debate in the audio file world. I mentioned

0:37:05.880 --> 0:37:08.960
<v Speaker 1>in San Sui that you know, you have amplifiers out

0:37:08.960 --> 0:37:13.000
<v Speaker 1>there that still work with vacuum tubes, and you've got

0:37:13.000 --> 0:37:17.080
<v Speaker 1>amplifiers that work with solid state transistors, And there are

0:37:17.160 --> 0:37:20.480
<v Speaker 1>proponents in both camps. There are some who say the

0:37:20.520 --> 0:37:25.440
<v Speaker 1>acoustic amplifiers are really the best because they create a round,

0:37:25.520 --> 0:37:29.760
<v Speaker 1>warm sound. There are others who say no transistor based

0:37:29.760 --> 0:37:33.560
<v Speaker 1>amplifiers are best because they are the most accurate in

0:37:34.719 --> 0:37:39.319
<v Speaker 1>representing the original sounds signal. Right, you you get the

0:37:39.360 --> 0:37:44.520
<v Speaker 1>most true copy or boosted copy of the original signal

0:37:44.560 --> 0:37:48.880
<v Speaker 1>with transistor based amplifiers, and this debate goes back and forth,

0:37:49.160 --> 0:37:51.759
<v Speaker 1>and I think it really just comes down to your

0:37:51.760 --> 0:37:56.239
<v Speaker 1>own personal preference. And again me, I'm simple. I'm a plebium.

0:37:56.239 --> 0:37:58.560
<v Speaker 1>I love a good sound system, but I have so

0:37:58.640 --> 0:38:01.520
<v Speaker 1>far resisted the sirens all to become obsessed and to

0:38:01.800 --> 0:38:04.799
<v Speaker 1>dive into the world of high end audio. It's not

0:38:04.840 --> 0:38:08.319
<v Speaker 1>to say that I won't ever do it. I think

0:38:08.440 --> 0:38:10.959
<v Speaker 1>as far as hobbies go, it's one that I would

0:38:11.000 --> 0:38:15.280
<v Speaker 1>probably find a lot of pleasure in, assuming that I wasn't,

0:38:16.320 --> 0:38:20.240
<v Speaker 1>you know, spending an irresponsible amount of money to pursue it,

0:38:20.280 --> 0:38:22.680
<v Speaker 1>which is a big assumption because again, some of these

0:38:22.680 --> 0:38:26.520
<v Speaker 1>components get into like tens of thousands of dollars. I

0:38:26.560 --> 0:38:29.799
<v Speaker 1>can't ever imagine spending money on a sound system, you know,

0:38:30.000 --> 0:38:33.879
<v Speaker 1>at that level. But um, you know, I I could

0:38:33.920 --> 0:38:36.680
<v Speaker 1>see myself getting kind of into this hobby. I just

0:38:36.760 --> 0:38:40.560
<v Speaker 1>worry that there is no kind of I feel like

0:38:41.040 --> 0:38:43.399
<v Speaker 1>once you start, it starts to pull you in, kind

0:38:43.400 --> 0:38:46.319
<v Speaker 1>of like tattoos. I got one, said that was it,

0:38:46.520 --> 0:38:49.400
<v Speaker 1>and none. I've got two more after that, and I

0:38:49.520 --> 0:38:52.319
<v Speaker 1>keep thinking about getting another one. So I already know

0:38:53.200 --> 0:38:58.279
<v Speaker 1>that pathway would lead me to a dangerous place. But anyway,

0:38:58.480 --> 0:39:00.799
<v Speaker 1>I thought it was important to talk about because it's

0:39:00.800 --> 0:39:05.120
<v Speaker 1>not that the audio files are delusional or anything like that.

0:39:05.680 --> 0:39:12.680
<v Speaker 1>It's that it's difficult to talk about an experience so

0:39:12.800 --> 0:39:20.000
<v Speaker 1>subjective as the perfect listening experience and then pourt that

0:39:20.200 --> 0:39:24.520
<v Speaker 1>over into a discussion about what technical equipment allows you

0:39:24.560 --> 0:39:28.400
<v Speaker 1>to achieve that, because it's two different things. It's almost

0:39:28.480 --> 0:39:32.120
<v Speaker 1>like trying to make an art a science or vice versa.

0:39:32.600 --> 0:39:34.160
<v Speaker 1>And that I think is where a lot of the

0:39:34.200 --> 0:39:37.600
<v Speaker 1>frustration for me comes in when I look into things

0:39:37.640 --> 0:39:41.319
<v Speaker 1>like the audio file world. But that's me. I thought

0:39:41.360 --> 0:39:43.480
<v Speaker 1>that would be great for a tech Stuff tidbit. There's

0:39:43.520 --> 0:39:46.799
<v Speaker 1>another long tidbit episode. Hope you enjoyed it. If you

0:39:46.840 --> 0:39:49.280
<v Speaker 1>have suggestions for topics I should cover in future episodes

0:39:49.280 --> 0:39:52.040
<v Speaker 1>of tech Stuff, please reach out to me. You can

0:39:52.040 --> 0:39:54.080
<v Speaker 1>do so on Twitter the handle for the show's tech

0:39:54.120 --> 0:39:56.960
<v Speaker 1>Stuff h s W and I'll talk to you again

0:39:57.640 --> 0:40:06.080
<v Speaker 1>really soon. Text Stuff is an I Heart Radio production.

0:40:06.280 --> 0:40:09.120
<v Speaker 1>For more podcasts from I Heart Radio, visit the I

0:40:09.239 --> 0:40:12.480
<v Speaker 1>Heart Radio app, Apple Podcasts, or wherever you listen to

0:40:12.520 --> 0:40:13.440
<v Speaker 1>your favorite shows.