1
00:00:04,400 --> 00:00:07,800
Speaker 1: Welcome to tech Stuff, a production from I Heart Radio.

2
00:00:12,160 --> 00:00:15,200
Speaker 1: Hey there, and welcome to tech Stuff. I'm your host,

3
00:00:15,360 --> 00:00:18,240
Speaker 1: Jonathan Strickland. I'm an executive producer with I Heart Radio

4
00:00:18,320 --> 00:00:21,640
Speaker 1: and I love all things tech and recently I received

5
00:00:21,720 --> 00:00:26,200
Speaker 1: a tweet from Twitter user Salvatore del Knock, a k

6
00:00:26,400 --> 00:00:30,320
Speaker 1: A non juror, asking if I would do a breakdown

7
00:00:30,720 --> 00:00:33,840
Speaker 1: on how analog to digital and digital to analog audio

8
00:00:33,920 --> 00:00:38,040
Speaker 1: converters work. And that's a great request. Um, it is

9
00:00:38,200 --> 00:00:41,440
Speaker 1: incredibly technical when you really get down to it. So

10
00:00:41,560 --> 00:00:45,360
Speaker 1: I'm going to do a very high level view of

11
00:00:45,400 --> 00:00:49,360
Speaker 1: the concept because otherwise we're gonna have to get into

12
00:00:49,960 --> 00:00:54,560
Speaker 1: the various methodologies that DAC and a d c's work,

13
00:00:55,120 --> 00:00:58,560
Speaker 1: and uh, it would quickly become like a technical manual.

14
00:00:58,680 --> 00:01:01,600
Speaker 1: But if people want that, then I can do a

15
00:01:01,680 --> 00:01:04,880
Speaker 1: subsequent episode and go into more detail. But one of

16
00:01:04,880 --> 00:01:08,119
Speaker 1: the things about this is that lets us talk about

17
00:01:08,160 --> 00:01:13,160
Speaker 1: the differences between analog and digital audio and why converters

18
00:01:13,200 --> 00:01:15,720
Speaker 1: are necessary in the first place, and to open up

19
00:01:15,760 --> 00:01:19,640
Speaker 1: the eternal argument about whether one is inherently better than

20
00:01:19,800 --> 00:01:22,640
Speaker 1: the other. This one goes out to all you audio

21
00:01:22,720 --> 00:01:26,120
Speaker 1: files out there, so get ready to send me angry messages,

22
00:01:26,120 --> 00:01:28,679
Speaker 1: because no matter what I say, some of y'all are

23
00:01:28,720 --> 00:01:32,040
Speaker 1: going to get upset. Anyway, let's start with what it

24
00:01:32,080 --> 00:01:35,920
Speaker 1: means to be analog versus digital. Now, when I was

25
00:01:35,959 --> 00:01:38,640
Speaker 1: a young boy, nobody loved me. I was a poor

26
00:01:38,680 --> 00:01:43,280
Speaker 1: boy from a poor family. No, hang on, that's now,

27
00:01:43,280 --> 00:01:46,440
Speaker 1: that's Queen's bohemian rhapsty Now, when I was a young boy,

28
00:01:46,560 --> 00:01:51,200
Speaker 1: analog was the standard. Digital did not even enter into

29
00:01:51,200 --> 00:01:54,720
Speaker 1: my awareness until I was a teenager, when compact discs

30
00:01:54,720 --> 00:01:56,640
Speaker 1: were starting to become popular. They had been around for

31
00:01:56,640 --> 00:01:59,160
Speaker 1: a while before I was a teenager, but I was

32
00:01:59,240 --> 00:02:02,680
Speaker 1: not really aware of them, because I mean, I grew

33
00:02:02,720 --> 00:02:05,120
Speaker 1: up in rural Georgia. We would get technology a few

34
00:02:05,200 --> 00:02:09,560
Speaker 1: years behind everybody else. Anyway, I grew up thinking analog

35
00:02:09,680 --> 00:02:13,680
Speaker 1: essentially meant old and digital meant new, Like that was

36
00:02:13,760 --> 00:02:16,280
Speaker 1: the sort of the abstract distinction between the two in

37
00:02:16,360 --> 00:02:20,000
Speaker 1: my head. But the differences are obviously more complicated than that,

38
00:02:20,400 --> 00:02:23,560
Speaker 1: and we need to understand how sound works, which I

39
00:02:23,600 --> 00:02:26,720
Speaker 1: know I've covered many many times, but it's important so

40
00:02:26,800 --> 00:02:29,760
Speaker 1: that we know how the analog and digital methods of

41
00:02:29,840 --> 00:02:35,120
Speaker 1: recording and thus reproducing and eventually playing back sound. You

42
00:02:35,160 --> 00:02:38,760
Speaker 1: know how they work with relation to the original sound

43
00:02:39,000 --> 00:02:43,280
Speaker 1: that existed. So sound is, when you really get down

44
00:02:43,320 --> 00:02:48,720
Speaker 1: to it, vibration or pressure waves. Now, we mostly experienced

45
00:02:48,720 --> 00:02:52,320
Speaker 1: sound by hearing these vibrations travel through the air, but

46
00:02:52,520 --> 00:02:55,840
Speaker 1: you can also experience this underwater. Sound can move through

47
00:02:55,880 --> 00:02:59,600
Speaker 1: different media, including solid material. Like if you put your

48
00:03:00,040 --> 00:03:02,639
Speaker 1: ear against a table, a really long table, and some

49
00:03:02,720 --> 00:03:06,280
Speaker 1: one on the other end is tapping very lightly on

50
00:03:06,360 --> 00:03:09,560
Speaker 1: that table, you'll hear it. And it's not because the

51
00:03:09,560 --> 00:03:12,400
Speaker 1: sound is traveling effectively through the air, though it is

52
00:03:12,760 --> 00:03:15,480
Speaker 1: doing a little bit of that too, but that it

53
00:03:15,560 --> 00:03:19,320
Speaker 1: travels through the table to you. Sound also travels at

54
00:03:19,360 --> 00:03:23,000
Speaker 1: different speeds through different media, and in fact, stuff like

55
00:03:23,080 --> 00:03:26,520
Speaker 1: air temperature can affect how quickly sound travels, which is

56
00:03:26,560 --> 00:03:30,240
Speaker 1: why when we talk about the speed of sound, we

57
00:03:30,400 --> 00:03:33,440
Speaker 1: technically actually need to be a little more specific than that.

58
00:03:33,800 --> 00:03:36,480
Speaker 1: So the standard way of describing the speed of sound

59
00:03:36,680 --> 00:03:39,960
Speaker 1: is to say that it moves at three per second

60
00:03:40,160 --> 00:03:44,840
Speaker 1: in dry air at twenty celsius, that's about sixty eight fahrenheit.

61
00:03:45,240 --> 00:03:47,960
Speaker 1: And if you start changing those parameters, you know, if

62
00:03:47,960 --> 00:03:50,480
Speaker 1: you introduce, say a lot of humidity into the air,

63
00:03:50,960 --> 00:03:53,840
Speaker 1: or you change the air temperature like it goes up

64
00:03:53,960 --> 00:03:56,520
Speaker 1: or it goes down. Well, sound will travel at a

65
00:03:56,600 --> 00:04:00,840
Speaker 1: slightly different speed than at that standard I was talking about.

66
00:04:01,160 --> 00:04:04,160
Speaker 1: Now I could get into how the vibrations cause air

67
00:04:04,160 --> 00:04:06,880
Speaker 1: molecules to move back and forth, creating little changes in

68
00:04:06,960 --> 00:04:11,480
Speaker 1: air pressure. And it's these pressure waves, these air fluctuation changes,

69
00:04:11,520 --> 00:04:13,520
Speaker 1: that our ear drums pick up and transfer to our

70
00:04:13,560 --> 00:04:17,200
Speaker 1: inner ears. That's where special nerves pick up these fluctuations

71
00:04:17,200 --> 00:04:19,760
Speaker 1: in our inner ears, and then our brains process those

72
00:04:20,200 --> 00:04:23,120
Speaker 1: those nerve signals as sound. But most of this isn't

73
00:04:23,160 --> 00:04:27,280
Speaker 1: important for the rest of this episode, so instead, let's

74
00:04:27,320 --> 00:04:31,960
Speaker 1: talk about sound waves, all right. So we can think

75
00:04:31,960 --> 00:04:34,599
Speaker 1: of a vibration as something in which a particle is

76
00:04:34,640 --> 00:04:37,960
Speaker 1: moved out of its usual place and then it snaps

77
00:04:38,040 --> 00:04:40,640
Speaker 1: back to its usual place, and it might do this

78
00:04:40,880 --> 00:04:44,279
Speaker 1: several times. Think of a guitar string. If you pluck

79
00:04:44,320 --> 00:04:47,280
Speaker 1: a guitar string, you're pulling the string out of where

80
00:04:47,320 --> 00:04:50,719
Speaker 1: it usually sits, and then it snaps back and forth

81
00:04:51,000 --> 00:04:55,880
Speaker 1: and oscillates around its normal position until it settles down again.

82
00:04:56,240 --> 00:04:58,919
Speaker 1: So we can describe the number of times that a

83
00:04:59,000 --> 00:05:02,440
Speaker 1: particle does as a frequency, you know, or the number

84
00:05:02,440 --> 00:05:05,800
Speaker 1: of times a string goes from one point all the

85
00:05:05,839 --> 00:05:08,960
Speaker 1: way across and back to that starting point over the

86
00:05:08,960 --> 00:05:11,520
Speaker 1: course of a second. So with sound, we usually use

87
00:05:11,560 --> 00:05:16,080
Speaker 1: the unit hurts to measure frequency. If a particle only

88
00:05:16,120 --> 00:05:19,680
Speaker 1: did one cycle of vibration per second, if it took

89
00:05:19,680 --> 00:05:22,560
Speaker 1: a full second for it to go from the you know,

90
00:05:23,120 --> 00:05:29,240
Speaker 1: the one crest to the next crest, uh, then it

91
00:05:29,279 --> 00:05:31,640
Speaker 1: would be one hurts. That would also, by the way,

92
00:05:31,680 --> 00:05:33,560
Speaker 1: be a frequency that was way too low for us

93
00:05:33,600 --> 00:05:36,760
Speaker 1: to hear. Typical human hearing has a range of around

94
00:05:36,839 --> 00:05:40,480
Speaker 1: twenty hurts at the low end, to twenty thousand hurts

95
00:05:40,560 --> 00:05:43,160
Speaker 1: or twenty killer hurts, in other words, on the high end.

96
00:05:43,640 --> 00:05:46,880
Speaker 1: So for stuff vibrating in a cycle that's twenty times

97
00:05:46,880 --> 00:05:50,119
Speaker 1: a second all the way up to twenty thousand times

98
00:05:50,120 --> 00:05:53,320
Speaker 1: a second, that's something we could potentially hear. Now. I

99
00:05:53,360 --> 00:05:57,280
Speaker 1: say potentially because that is typical human hearing. There are

100
00:05:57,279 --> 00:05:59,640
Speaker 1: people who can hear outside of that range a little bit,

101
00:06:00,120 --> 00:06:02,760
Speaker 1: and then there are some of us, especially as we

102
00:06:02,800 --> 00:06:07,080
Speaker 1: get older, who can hear a more narrow range of frequencies.

103
00:06:08,120 --> 00:06:11,080
Speaker 1: But frequency is just one part of how we describe sound.

104
00:06:11,440 --> 00:06:14,600
Speaker 1: We can also describe sound by how loud it is.

105
00:06:15,160 --> 00:06:18,640
Speaker 1: The volume of sound. So from a physics perspective, we

106
00:06:18,680 --> 00:06:21,159
Speaker 1: can think of this is how much pressure the sound

107
00:06:21,200 --> 00:06:24,200
Speaker 1: places upon our ear drums. You know how dramatic those

108
00:06:24,200 --> 00:06:28,240
Speaker 1: fluctuations and air pressure are. In other words, But loudness

109
00:06:28,400 --> 00:06:31,680
Speaker 1: isn't just down to physics. The way we experience loudness

110
00:06:31,680 --> 00:06:35,880
Speaker 1: depends not just on that sound pressure itself, but stuff

111
00:06:35,920 --> 00:06:39,479
Speaker 1: like psychoacoustics. That's how our brains perceive sound in the

112
00:06:39,520 --> 00:06:43,360
Speaker 1: first place. But now we've got two criteria we can

113
00:06:43,480 --> 00:06:46,200
Speaker 1: use to assign to any sound correct Like, we can

114
00:06:46,279 --> 00:06:49,440
Speaker 1: talk about the frequency of that sound, you know, how

115
00:06:49,480 --> 00:06:53,120
Speaker 1: frequently that those particles are vibrating, And then we can

116
00:06:53,160 --> 00:06:56,200
Speaker 1: also talk about the displacement of those particles vibrating, or

117
00:06:56,240 --> 00:06:58,839
Speaker 1: what we might think of as the loudness or volume

118
00:06:58,920 --> 00:07:02,520
Speaker 1: of that sound. We could then plot a sound wave

119
00:07:02,800 --> 00:07:06,200
Speaker 1: as a transverse wave on a graph, and we could

120
00:07:06,200 --> 00:07:09,400
Speaker 1: have the X axis, you know, the horizontal axis of

121
00:07:09,440 --> 00:07:13,080
Speaker 1: this graph representing the passage of time. So on the

122
00:07:13,160 --> 00:07:15,800
Speaker 1: left side we might say zero, and we say time

123
00:07:15,840 --> 00:07:18,679
Speaker 1: increases as you go to the right. The y axis

124
00:07:18,760 --> 00:07:22,400
Speaker 1: we could have being displacement, which kind of you know,

125
00:07:22,440 --> 00:07:25,960
Speaker 1: amplitude or volume in other words, and we could then

126
00:07:26,000 --> 00:07:29,360
Speaker 1: plot all the points where a particular vibrating particle would

127
00:07:29,360 --> 00:07:33,400
Speaker 1: occupy over a given span of time. If we had

128
00:07:33,560 --> 00:07:36,880
Speaker 1: a sound of a steady frequency, then we would end

129
00:07:36,920 --> 00:07:38,440
Speaker 1: up with a wave that would look a lot like

130
00:07:38,480 --> 00:07:42,240
Speaker 1: a sign or cosign wave. The distance between two consecutive

131
00:07:42,320 --> 00:07:47,000
Speaker 1: crests of this wave would be the wavelength for that sound,

132
00:07:47,280 --> 00:07:52,080
Speaker 1: and the sounds wavelength has an inversely proportional relationship with

133
00:07:52,160 --> 00:07:56,120
Speaker 1: the sounds frequency, So the higher the frequency of sound,

134
00:07:56,760 --> 00:08:00,160
Speaker 1: the shorter the wavelength will be. So deep bay Ace

135
00:08:00,280 --> 00:08:03,920
Speaker 1: notes would have sound waves that have much longer wavelengths

136
00:08:04,240 --> 00:08:09,120
Speaker 1: than very high pitched high frequency notes. Uh frequency relates

137
00:08:09,160 --> 00:08:12,840
Speaker 1: to pitch. There isn't like an easy mathematical way we

138
00:08:12,920 --> 00:08:17,000
Speaker 1: can kind of relate pitch, by the way, There are

139
00:08:17,120 --> 00:08:20,160
Speaker 1: easy ways we can relate frequencies, but it gets a

140
00:08:20,160 --> 00:08:25,640
Speaker 1: little tricky anyway. The reason I even talk about plotting

141
00:08:25,720 --> 00:08:28,560
Speaker 1: sound waves at all is that it makes us easier

142
00:08:28,600 --> 00:08:31,720
Speaker 1: for us to consider the differences between analog and digital

143
00:08:31,760 --> 00:08:34,760
Speaker 1: audio recording. Keep in mind, if we plotted that sound wave,

144
00:08:35,200 --> 00:08:38,640
Speaker 1: that's not that's not the physical sound wave that we've

145
00:08:38,679 --> 00:08:43,160
Speaker 1: just plotted. That's our description of that sound wave, its frequency,

146
00:08:43,200 --> 00:08:47,439
Speaker 1: and its loudness. Um The classic sign wave like depiction

147
00:08:47,480 --> 00:08:49,840
Speaker 1: of the sound wave shows us that there's a continuous

148
00:08:49,960 --> 00:08:55,199
Speaker 1: representation of sound across time. It is unbroken. We can

149
00:08:55,480 --> 00:08:58,360
Speaker 1: put plot, you know, even complicated sounds with changes in

150
00:08:58,400 --> 00:09:01,640
Speaker 1: amplitude and frequency, and the shape of the waves tells

151
00:09:01,720 --> 00:09:05,320
Speaker 1: us a little bit about the tambre or quality of sound. Now,

152
00:09:05,320 --> 00:09:08,440
Speaker 1: by quality, I don't mean, oh, this sound is very

153
00:09:08,480 --> 00:09:12,440
Speaker 1: good quality or this sound is really bad quality. Instead,

154
00:09:12,480 --> 00:09:17,960
Speaker 1: I'm talking about the elements that differentiate say piano playing

155
00:09:18,160 --> 00:09:22,480
Speaker 1: middle C from a guitar playing that same note middle C.

156
00:09:23,040 --> 00:09:26,880
Speaker 1: Both instruments are producing the same note at the same frequency,

157
00:09:27,000 --> 00:09:29,719
Speaker 1: assuming both instruments are you know, properly tuned, and both

158
00:09:29,800 --> 00:09:33,000
Speaker 1: of them are using the same pitch tuning, but you

159
00:09:33,040 --> 00:09:36,560
Speaker 1: would hear a difference in the type of sound between them, right,

160
00:09:36,640 --> 00:09:41,199
Speaker 1: A piano and a guitar sound different. Otherwise all instruments

161
00:09:41,200 --> 00:09:44,560
Speaker 1: would produce exactly the same kind of sound as each other.

162
00:09:45,080 --> 00:09:46,959
Speaker 1: But you know, you can tell the difference between a

163
00:09:47,000 --> 00:09:50,439
Speaker 1: piano and a guitar, or a clarinet or a flute

164
00:09:50,520 --> 00:09:54,480
Speaker 1: or whatever. The tambre is different, even if the instruments

165
00:09:54,480 --> 00:09:58,000
Speaker 1: are all producing you know, technically the same frequency, even

166
00:09:58,040 --> 00:10:01,040
Speaker 1: at the same volume. This leads us to the fact

167
00:10:01,120 --> 00:10:05,040
Speaker 1: that sound is this continuous thing for us. It isn't

168
00:10:05,080 --> 00:10:08,720
Speaker 1: happening in discrete units. It's kind of like the difference

169
00:10:08,760 --> 00:10:12,480
Speaker 1: between jumping into a pool filled filled with water, which

170
00:10:12,559 --> 00:10:15,320
Speaker 1: is you know, continuous to us because we can't you know,

171
00:10:15,600 --> 00:10:19,480
Speaker 1: experience it down on the molecular level, or jumping into

172
00:10:19,480 --> 00:10:23,199
Speaker 1: a pool that's filled with plastic balls. So to us,

173
00:10:23,600 --> 00:10:27,440
Speaker 1: sound is kind of like a fluid, and analog recording

174
00:10:27,600 --> 00:10:33,200
Speaker 1: captures that. The analog approach to recording is older than digital.

175
00:10:33,400 --> 00:10:37,320
Speaker 1: So way way back in the nineteenth century, folks like

176
00:10:37,360 --> 00:10:39,480
Speaker 1: Alexander Graham Bell, we're trying to figure out how to

177
00:10:39,520 --> 00:10:43,600
Speaker 1: transmit the human voice across great distances using electricity, and

178
00:10:43,720 --> 00:10:46,679
Speaker 1: the microphone was one half of what was needed to

179
00:10:46,720 --> 00:10:50,000
Speaker 1: do this, the loud speaker being the other half. And

180
00:10:50,200 --> 00:10:54,240
Speaker 1: the basic way a standard microphone works is to convert

181
00:10:54,559 --> 00:10:59,160
Speaker 1: sound that continuous you know phenomena of pressure wave changes

182
00:11:00,080 --> 00:11:03,600
Speaker 1: to a varying electric signal, an electric signal that has

183
00:11:04,280 --> 00:11:08,880
Speaker 1: varying voltage. This is another continuous phenomena, right, it's unbroken,

184
00:11:09,000 --> 00:11:12,440
Speaker 1: it's it's like another wave. Here's how it works. So

185
00:11:12,520 --> 00:11:17,559
Speaker 1: inside an analog microphone is a tiny little diaphragm, typically

186
00:11:17,640 --> 00:11:20,000
Speaker 1: made of very thin plastic, and it behaves in a

187
00:11:20,040 --> 00:11:23,960
Speaker 1: way similar to how our ear drums work in our ears.

188
00:11:23,960 --> 00:11:29,480
Speaker 1: So when sound, you know, these pressure waves hit that microphone,

189
00:11:29,840 --> 00:11:33,240
Speaker 1: it moves the diaphragm back and forth, and the diaphragm

190
00:11:33,280 --> 00:11:37,320
Speaker 1: is actually attached to an electro magnet. A simple microphone

191
00:11:37,400 --> 00:11:40,679
Speaker 1: could have a permanent magnet inside it, and wrapped around

192
00:11:40,679 --> 00:11:43,680
Speaker 1: this permanent magnet is a little coil of metal wire

193
00:11:43,920 --> 00:11:47,160
Speaker 1: that connects to the diaphragm. So the diaphragm moves the coil,

194
00:11:47,320 --> 00:11:50,560
Speaker 1: which then moves along the length of this permanent magnet.

195
00:11:51,280 --> 00:11:54,640
Speaker 1: That introduces a fluctuating magnetic field, or rather, you know

196
00:11:54,720 --> 00:11:58,240
Speaker 1: the effect of a fluctuating magnetic field. The permanent magnets

197
00:11:58,280 --> 00:12:01,920
Speaker 1: magnetic field is stable, but moving a coil through a

198
00:12:01,960 --> 00:12:04,520
Speaker 1: magnetic field, it's the same thing as if you were

199
00:12:04,559 --> 00:12:08,680
Speaker 1: to fluctuate a magnetic field around a you know, non

200
00:12:08,720 --> 00:12:12,640
Speaker 1: moving coil, you get the same effect. Now, the laws

201
00:12:12,640 --> 00:12:16,160
Speaker 1: of electromagnetism tell us that if you have a conductive

202
00:12:16,200 --> 00:12:21,200
Speaker 1: material and it encounters a fluctuating magnetic field, that field

203
00:12:21,640 --> 00:12:25,840
Speaker 1: will then induce an electric current in the conductive material.

204
00:12:25,960 --> 00:12:29,920
Speaker 1: So now you've got the microphone producing an electric current,

205
00:12:30,400 --> 00:12:33,640
Speaker 1: and again the voltage of this current varies depending upon

206
00:12:33,720 --> 00:12:37,679
Speaker 1: the sound hitting the microphone. That means the microphone is

207
00:12:37,720 --> 00:12:41,040
Speaker 1: a type of transducer. That's a device that converts one

208
00:12:41,120 --> 00:12:44,880
Speaker 1: form of energy, in this case acoustic pressure, into another

209
00:12:44,920 --> 00:12:49,040
Speaker 1: form electric signals. Now, you could send this electric current

210
00:12:49,280 --> 00:12:52,680
Speaker 1: with varying voltage somewhere to do something else interesting, like

211
00:12:53,160 --> 00:12:57,040
Speaker 1: you could have it go directly to allowed speaker for playback. Now,

212
00:12:57,080 --> 00:13:01,560
Speaker 1: of course, this electric current is really uh there are

213
00:13:01,880 --> 00:13:04,880
Speaker 1: you know, very small elements in your microphone, right, so

214
00:13:05,840 --> 00:13:10,600
Speaker 1: it cannot produce an incredibly strong electric current. So typically

215
00:13:11,200 --> 00:13:14,520
Speaker 1: you would first pass this electric current through an amplifier,

216
00:13:15,040 --> 00:13:17,840
Speaker 1: which increases the strength of the signal. I'm not going

217
00:13:17,920 --> 00:13:20,560
Speaker 1: to go into how amplifiers work. I've talked about in

218
00:13:20,640 --> 00:13:23,720
Speaker 1: other episodes, and it would mean that this this episode

219
00:13:23,760 --> 00:13:25,760
Speaker 1: would go like an hour and a half long if

220
00:13:25,760 --> 00:13:28,640
Speaker 1: I were to to dive into that. The important thing

221
00:13:28,679 --> 00:13:32,839
Speaker 1: to think of is that amplifiers take incoming week signals

222
00:13:33,200 --> 00:13:36,680
Speaker 1: and then push out a stronger version of that same signal.

223
00:13:36,760 --> 00:13:40,760
Speaker 1: Assuming the amplifiers working properly, then that signal could go

224
00:13:40,880 --> 00:13:44,360
Speaker 1: to a speaker and you would have the same process

225
00:13:44,400 --> 00:13:47,000
Speaker 1: that you had with the microphone, only in reverse. The

226
00:13:47,080 --> 00:13:51,600
Speaker 1: speaker also has a voice coil inside it, a coil

227
00:13:51,760 --> 00:13:56,040
Speaker 1: of you know, conductive of metal wire, and also a

228
00:13:56,040 --> 00:13:59,920
Speaker 1: magnet inside the loudspeaker. So the incoming current goes to

229
00:14:00,120 --> 00:14:02,920
Speaker 1: the wire, and we know by the laws of electro

230
00:14:02,960 --> 00:14:06,960
Speaker 1: magnetism that this means the flowing current through the wire

231
00:14:07,000 --> 00:14:09,280
Speaker 1: will also produce a magnetic field. I mean, this is

232
00:14:09,320 --> 00:14:12,839
Speaker 1: how electro magnetism works, and that this magnetic field will

233
00:14:12,880 --> 00:14:16,439
Speaker 1: then pull and push against the magnetic field generated by

234
00:14:16,480 --> 00:14:20,080
Speaker 1: the permanent magnet that's already inside the speaker, and this

235
00:14:20,160 --> 00:14:24,240
Speaker 1: in turn creates the force that pushes and pulls the

236
00:14:24,400 --> 00:14:28,600
Speaker 1: cone inside the speaker that connects to another diaphragm. This

237
00:14:28,680 --> 00:14:31,480
Speaker 1: is a much larger diaphragm than the one that's on

238
00:14:31,520 --> 00:14:34,280
Speaker 1: the microphone on the other side. Right, Because you've boosted

239
00:14:34,280 --> 00:14:37,360
Speaker 1: the electric signal, it can then have enough power to

240
00:14:37,520 --> 00:14:41,120
Speaker 1: move this larger diaphragm. So this larger diaphragm begins to

241
00:14:41,120 --> 00:14:43,640
Speaker 1: move in and out, and it's pushing and pulling air

242
00:14:44,200 --> 00:14:49,960
Speaker 1: and it's just recreating the acoustic pressure waves that we're

243
00:14:50,120 --> 00:14:53,040
Speaker 1: used to go into the microphone and generate the electric

244
00:14:53,040 --> 00:14:55,360
Speaker 1: signal in the first place, so you're kind of preserved

245
00:14:55,560 --> 00:15:00,800
Speaker 1: this experience from sound going into a microphone. The microphone

246
00:15:01,080 --> 00:15:06,120
Speaker 1: as a transducer, transforming that acoustic pressure into an electric

247
00:15:06,160 --> 00:15:10,520
Speaker 1: current with varying voltage, sending that to an amplifier, and

248
00:15:10,560 --> 00:15:13,760
Speaker 1: then a speaker, which then does the opposite. It's also

249
00:15:13,760 --> 00:15:17,200
Speaker 1: a transducer. It takes this electric current with varying voltage

250
00:15:17,480 --> 00:15:20,520
Speaker 1: and converts it back into acoustic pressure and we get

251
00:15:20,520 --> 00:15:24,680
Speaker 1: the playback. That's an analog chain from start to finish. Now,

252
00:15:24,720 --> 00:15:27,160
Speaker 1: if you've got a good quality microphone and a good

253
00:15:27,200 --> 00:15:31,160
Speaker 1: amplifier and a good speaker, you can transmit sound pretty effectively.

254
00:15:31,560 --> 00:15:34,120
Speaker 1: And because the whole process is using that continuous and

255
00:15:34,200 --> 00:15:39,120
Speaker 1: varying signal, it is analogous to the experience of hearing

256
00:15:39,160 --> 00:15:43,200
Speaker 1: the sound itself. We've transformed the energy from one kind

257
00:15:43,240 --> 00:15:49,200
Speaker 1: to another, but apart from that, it is an unbroken chain. Now,

258
00:15:49,240 --> 00:15:53,640
Speaker 1: analog media includes stuff like magnetic tape and vinyl records,

259
00:15:54,160 --> 00:15:58,360
Speaker 1: which are produced in a way where you are transmitting

260
00:15:58,400 --> 00:16:02,520
Speaker 1: analog signals and they are effectively carved into a surface

261
00:16:03,320 --> 00:16:07,000
Speaker 1: that then can be picked up with a stylus on

262
00:16:07,120 --> 00:16:10,920
Speaker 1: a turntable and then converted back into an electric signal

263
00:16:11,000 --> 00:16:13,800
Speaker 1: that then can be sent to speakers. So either way

264
00:16:14,240 --> 00:16:20,000
Speaker 1: you are preserving that analog signal with magnetic tape. You've

265
00:16:20,000 --> 00:16:22,320
Speaker 1: got a recording device set up that takes that varying

266
00:16:22,320 --> 00:16:26,320
Speaker 1: electric signal from the recording and then creates a magnetic

267
00:16:26,440 --> 00:16:30,920
Speaker 1: field with the the writer the right head. Uh, And

268
00:16:31,000 --> 00:16:33,760
Speaker 1: you've got a little electro magnet in this thing, and

269
00:16:33,840 --> 00:16:38,120
Speaker 1: that magnetic field rearranges particles that aren't a strip of

270
00:16:38,320 --> 00:16:42,000
Speaker 1: plastic tape. That's how cassette tapes work. That's all VHS

271
00:16:42,080 --> 00:16:46,160
Speaker 1: tapes work. So attached to this strip of plastic that

272
00:16:46,400 --> 00:16:50,040
Speaker 1: is the actual tape in a tape, are these tiny

273
00:16:50,120 --> 00:16:53,920
Speaker 1: magnetic particles that are bound to that plastic. And by

274
00:16:53,960 --> 00:16:56,640
Speaker 1: applying the magnetic field to the tape, using in a

275
00:16:56,640 --> 00:16:59,520
Speaker 1: tiny electro magnet, you can change the direction that these

276
00:16:59,560 --> 00:17:03,840
Speaker 1: particles are facing on the tape itself. So this process

277
00:17:03,920 --> 00:17:07,280
Speaker 1: arranges particles on magnetic tape in a specific way to

278
00:17:07,440 --> 00:17:11,360
Speaker 1: record that original electric signal you were using. The magnetic

279
00:17:11,400 --> 00:17:15,000
Speaker 1: particles represent the original signal and then in turn represents

280
00:17:15,000 --> 00:17:18,320
Speaker 1: the sound that was used to generate the electric signal

281
00:17:18,480 --> 00:17:21,200
Speaker 1: during the recording process. So when you play a tape back,

282
00:17:22,160 --> 00:17:26,400
Speaker 1: the tape passes underneath an electro magnet at a distance

283
00:17:26,440 --> 00:17:29,280
Speaker 1: that's close enough that the electro magnet is picking up

284
00:17:29,280 --> 00:17:32,600
Speaker 1: the magnetic fields of all those tiny particles, and the

285
00:17:32,680 --> 00:17:35,960
Speaker 1: particles have been arranged in patterns because of that, you know,

286
00:17:36,040 --> 00:17:39,960
Speaker 1: recording process, right. So the fluctuating magnetic field that is

287
00:17:40,040 --> 00:17:43,080
Speaker 1: created because these particles are now passing by an electro

288
00:17:43,160 --> 00:17:47,800
Speaker 1: magnet are again reversing that process. The electro magnet starts

289
00:17:47,840 --> 00:17:50,919
Speaker 1: to generate an electric signal because of that magnetic field,

290
00:17:51,400 --> 00:17:53,600
Speaker 1: and then can go to an amplifier and then go

291
00:17:53,640 --> 00:17:55,960
Speaker 1: out to speakers. So again we use a lot of

292
00:17:56,000 --> 00:18:00,480
Speaker 1: transformational processes to record this sound, right, because you're in

293
00:18:00,520 --> 00:18:05,080
Speaker 1: this case, we took pressure waves, vibrations, The sound went

294
00:18:05,080 --> 00:18:08,360
Speaker 1: into a microphone, creates an electric current with varying voltage.

295
00:18:08,480 --> 00:18:12,840
Speaker 1: That electric current then goes to a tape recorder essentially

296
00:18:13,359 --> 00:18:17,560
Speaker 1: that uses magnetic fields to record onto tape. We take

297
00:18:17,560 --> 00:18:20,639
Speaker 1: that tape, we put that tape into a tape player,

298
00:18:21,160 --> 00:18:25,879
Speaker 1: and that magnetic record then produces an electric current in

299
00:18:25,920 --> 00:18:28,760
Speaker 1: our tape player, which goes to an amplifier and then

300
00:18:28,800 --> 00:18:31,240
Speaker 1: goes to drive speakers and replicate the sound that we

301
00:18:31,320 --> 00:18:33,920
Speaker 1: record in the first place. So again we transformed things

302
00:18:34,000 --> 00:18:41,200
Speaker 1: multiple times, but the analogous sound process has remained stable. Now,

303
00:18:41,400 --> 00:18:44,560
Speaker 1: there's a lot in this process that I have not covered.

304
00:18:44,760 --> 00:18:47,480
Speaker 1: The equipment and methods you use in recording and playback

305
00:18:48,080 --> 00:18:50,240
Speaker 1: determine whether or not the copy you have is a

306
00:18:50,320 --> 00:18:54,200
Speaker 1: really like accurate representation of the original sound like does

307
00:18:54,240 --> 00:18:57,560
Speaker 1: it sound like you were actually there? Or is the

308
00:18:57,640 --> 00:19:00,320
Speaker 1: nuance lost? And the same is true for a back.

309
00:19:00,400 --> 00:19:04,119
Speaker 1: Playback on a really sophisticated system will likely sound better

310
00:19:04,320 --> 00:19:07,920
Speaker 1: than one that's played on some super cheap stereo. Though

311
00:19:08,280 --> 00:19:11,080
Speaker 1: pretty quickly you do reach a point where the returns

312
00:19:11,240 --> 00:19:14,520
Speaker 1: are harder to detect, right like where you might listen

313
00:19:14,560 --> 00:19:16,640
Speaker 1: to something on a good system, and then you might

314
00:19:16,680 --> 00:19:19,720
Speaker 1: listen to that same thing on what's considered like the

315
00:19:19,800 --> 00:19:22,800
Speaker 1: highest of high end systems, and you might not be

316
00:19:22,920 --> 00:19:26,000
Speaker 1: able to tell a whole lot of difference. But the

317
00:19:26,040 --> 00:19:29,119
Speaker 1: basics for analog recording and playback are all there. Now.

318
00:19:29,200 --> 00:19:32,200
Speaker 1: When we come back, we'll talk about the digital approach,

319
00:19:32,480 --> 00:19:42,720
Speaker 1: but first let's take a quick break. Okay, So now,

320
00:19:42,760 --> 00:19:45,959
Speaker 1: we've got an idea of how the analog process of

321
00:19:46,040 --> 00:19:49,880
Speaker 1: recording and playback works. We transform stuff, but we still

322
00:19:49,920 --> 00:19:53,960
Speaker 1: have a continuous signal that represents sound, which is, you know,

323
00:19:54,000 --> 00:19:57,959
Speaker 1: a continuous phenomena as sound changes, as the pitch and

324
00:19:58,040 --> 00:20:01,120
Speaker 1: the frequency shifts, or as the volume changes, or as

325
00:20:01,160 --> 00:20:05,080
Speaker 1: different instruments or voices produced sounds. All those subtle and

326
00:20:05,160 --> 00:20:08,680
Speaker 1: maybe not so subtle shifts are part of that recording method.

327
00:20:09,040 --> 00:20:13,840
Speaker 1: It's an unbroken wave. Digital recording uses a different approach

328
00:20:14,119 --> 00:20:17,760
Speaker 1: in a way. Digital recording is like taking snapshots of

329
00:20:17,800 --> 00:20:21,480
Speaker 1: what is going on during a recording session. And I

330
00:20:21,560 --> 00:20:23,720
Speaker 1: thought of a kind of goofy analogy to sort of

331
00:20:23,760 --> 00:20:27,119
Speaker 1: explain what I mean. So imagine for a moment that

332
00:20:27,200 --> 00:20:30,720
Speaker 1: you are in a soundproofed room and you cannot hear

333
00:20:30,760 --> 00:20:34,760
Speaker 1: anything that's going on outside of this room. However, you

334
00:20:34,760 --> 00:20:37,200
Speaker 1: do have a little panel like almost like a hatch

335
00:20:37,440 --> 00:20:39,959
Speaker 1: in this room, and it happens to be facing a

336
00:20:40,000 --> 00:20:43,840
Speaker 1: really big orchestra pit, and the orchestra is playing. And

337
00:20:43,880 --> 00:20:45,600
Speaker 1: you know this because there's a light in the room

338
00:20:45,640 --> 00:20:47,439
Speaker 1: that lights up when the orchestra is playing. But you

339
00:20:47,480 --> 00:20:50,960
Speaker 1: can't hear anything because the rooms sound proved However, next

340
00:20:51,000 --> 00:20:52,840
Speaker 1: to the panel is a button, and if you press

341
00:20:52,880 --> 00:20:55,280
Speaker 1: the button, the panel opens up, but only for a

342
00:20:55,320 --> 00:20:58,560
Speaker 1: split second. Next to the panel, you have a table,

343
00:20:58,680 --> 00:21:01,080
Speaker 1: you get some paper, you got a pen, and your

344
00:21:01,200 --> 00:21:04,560
Speaker 1: job is to press the button, listen for that split second,

345
00:21:05,000 --> 00:21:06,959
Speaker 1: and then write down what you think is going on

346
00:21:07,040 --> 00:21:10,480
Speaker 1: in the orchestra. You know, like you could write down

347
00:21:10,560 --> 00:21:15,840
Speaker 1: everything from the specific instruments that you're hearing, the relative

348
00:21:15,920 --> 00:21:19,359
Speaker 1: volume of those instruments, any sort of harmonies you're hearing.

349
00:21:19,640 --> 00:21:23,080
Speaker 1: Maybe you're even just trying to play name that tune. Now,

350
00:21:23,160 --> 00:21:26,360
Speaker 1: let's say there's some other rules in place too. If

351
00:21:26,359 --> 00:21:28,440
Speaker 1: you push the button, you are not allowed to push

352
00:21:28,440 --> 00:21:31,680
Speaker 1: it again until five seconds have passed. So every five

353
00:21:31,680 --> 00:21:34,919
Speaker 1: seconds you get another instant of sound as the panel

354
00:21:34,960 --> 00:21:39,080
Speaker 1: opens and closes. This is that little snapshot of what's happening.

355
00:21:39,080 --> 00:21:43,240
Speaker 1: It would be really hard to accurately describe the music

356
00:21:43,320 --> 00:21:46,640
Speaker 1: because you wouldn't have a lot of information to go by, right,

357
00:21:47,119 --> 00:21:50,360
Speaker 1: you would just have this instant of sound every five seconds.

358
00:21:50,520 --> 00:21:53,119
Speaker 1: It might as well be noise at that point. But

359
00:21:53,240 --> 00:21:56,480
Speaker 1: then let's say we start to decrease the delay, where

360
00:21:56,600 --> 00:21:59,000
Speaker 1: you get to have the panel open so that you're

361
00:21:59,040 --> 00:22:03,960
Speaker 1: getting these instants is of sound more close together. As

362
00:22:04,040 --> 00:22:06,200
Speaker 1: that gets closer and closer, it will start to sound

363
00:22:06,200 --> 00:22:10,600
Speaker 1: more like uninterrupted music. Maybe we even rig up the button.

364
00:22:10,640 --> 00:22:13,120
Speaker 1: We tape down the button so it's always pressed down,

365
00:22:13,560 --> 00:22:15,600
Speaker 1: and the panel still has to open and close, but

366
00:22:16,040 --> 00:22:19,240
Speaker 1: it can open immediately after it shuts, so it's effectively

367
00:22:19,280 --> 00:22:22,840
Speaker 1: a shutter. At a fast enough rate, you wouldn't necessarily

368
00:22:22,920 --> 00:22:26,359
Speaker 1: even notice the shutters effect on the music. To you.

369
00:22:26,560 --> 00:22:30,240
Speaker 1: It would sound unbroken if it were fast enough, And

370
00:22:30,280 --> 00:22:33,600
Speaker 1: then you could accurately describe the music you could write down,

371
00:22:34,040 --> 00:22:35,919
Speaker 1: you know, depending on how quickly you can write, you

372
00:22:35,920 --> 00:22:39,080
Speaker 1: can write down a really accurate explanation of what is

373
00:22:39,080 --> 00:22:42,080
Speaker 1: going on with the music, or maybe you're just identifying

374
00:22:42,320 --> 00:22:46,040
Speaker 1: what pieces playing. But uh, you know, in this case,

375
00:22:46,200 --> 00:22:49,000
Speaker 1: if you've got that shutter going at a high enough rate,

376
00:22:49,720 --> 00:22:53,080
Speaker 1: it's almost like you're not in a soundproof room at all. Well,

377
00:22:53,160 --> 00:22:57,800
Speaker 1: this kind of is how digital recording works. So rather

378
00:22:57,840 --> 00:23:02,760
Speaker 1: than preserving an unbroken sign Knoll, the digital process breaks

379
00:23:02,840 --> 00:23:07,160
Speaker 1: up a signal into discrete units. It has to because digital,

380
00:23:07,160 --> 00:23:10,040
Speaker 1: when we get down to it, we're talking about binary

381
00:23:10,160 --> 00:23:14,320
Speaker 1: data zeros and ones. You cannot use zeros and ones

382
00:23:14,880 --> 00:23:18,560
Speaker 1: to uh to to do anything other than talk about

383
00:23:18,760 --> 00:23:22,320
Speaker 1: discrete units. It can't be a continuous thing. Now. As

384
00:23:22,359 --> 00:23:24,400
Speaker 1: I mentioned earlier in this episode, there are a lot

385
00:23:24,440 --> 00:23:27,680
Speaker 1: of quantifiable elements we can look at when it comes

386
00:23:27,720 --> 00:23:30,800
Speaker 1: to sound. We can describe how loud it is, or

387
00:23:30,840 --> 00:23:34,320
Speaker 1: what frequency or pitch it is. We can describe the

388
00:23:34,359 --> 00:23:36,760
Speaker 1: timbre or quality of the sound. That that kind of

389
00:23:36,760 --> 00:23:39,360
Speaker 1: gets us into areas that are a little less concrete

390
00:23:39,400 --> 00:23:43,480
Speaker 1: at least in human language and digital equipment like computers

391
00:23:44,119 --> 00:23:48,359
Speaker 1: are pretty good at handling things that are discrete and quantifiable.

392
00:23:48,520 --> 00:23:52,480
Speaker 1: This is the realm of computers. And remember, ultimately computers

393
00:23:52,480 --> 00:23:55,360
Speaker 1: are relying on those zeros and ones to describe everything.

394
00:23:55,680 --> 00:23:58,159
Speaker 1: Just to be clear, to get to this point, we

395
00:23:58,200 --> 00:24:01,600
Speaker 1: would need to use an analog to digital converter, but

396
00:24:01,640 --> 00:24:04,720
Speaker 1: I'm actually gonna circle round back to that later on.

397
00:24:05,000 --> 00:24:07,040
Speaker 1: For now, we're just going to focus on the basics

398
00:24:07,040 --> 00:24:11,119
Speaker 1: of digital recording because understanding that makes the whole you know,

399
00:24:11,320 --> 00:24:14,600
Speaker 1: a D C and d a C stuff way more

400
00:24:14,600 --> 00:24:18,960
Speaker 1: easy to understand. So, the way digital recording systems work

401
00:24:19,440 --> 00:24:23,960
Speaker 1: is that they take snapshots of a continuous wave. They're

402
00:24:24,000 --> 00:24:28,960
Speaker 1: measuring precisely all the elements at that moment in time

403
00:24:29,600 --> 00:24:33,359
Speaker 1: of the wave and it's or signal signal is probably

404
00:24:33,359 --> 00:24:35,760
Speaker 1: a better word than wave. Really, we're talking about the

405
00:24:35,800 --> 00:24:40,320
Speaker 1: electric signal generated as you're using a transducer to pick

406
00:24:40,400 --> 00:24:45,399
Speaker 1: up sound from you know, wherever. So in this way,

407
00:24:45,440 --> 00:24:48,480
Speaker 1: they're like that panel in that soundproof room. If the

408
00:24:48,520 --> 00:24:51,760
Speaker 1: sample rate is too low, if you are not sampling

409
00:24:51,880 --> 00:24:55,520
Speaker 1: the signal frequently enough, then you do not get an

410
00:24:55,520 --> 00:24:59,480
Speaker 1: accurate representation of that original signal. You know, You're you're

411
00:24:59,480 --> 00:25:01,640
Speaker 1: having to make a lot of guesses of what's happening

412
00:25:01,680 --> 00:25:05,400
Speaker 1: between each snapshot. Just like if you had a camera

413
00:25:05,880 --> 00:25:10,280
Speaker 1: and you were taking pictures of a fast moving, you know,

414
00:25:10,440 --> 00:25:14,000
Speaker 1: scenario in front of you. If the rate of which

415
00:25:14,000 --> 00:25:17,119
Speaker 1: you're taking pictures is pretty slow, you've got to make

416
00:25:17,160 --> 00:25:19,720
Speaker 1: a lot of interpretation of what happened between picture one

417
00:25:19,760 --> 00:25:22,440
Speaker 1: and picture two, and picture two in picture three. Same

418
00:25:22,480 --> 00:25:26,439
Speaker 1: thing with these digital recording systems. If you were to

419
00:25:26,480 --> 00:25:29,240
Speaker 1: try and play a recording like that back, it would

420
00:25:29,240 --> 00:25:32,199
Speaker 1: not sound very good because it would not be a

421
00:25:32,240 --> 00:25:35,520
Speaker 1: good representation of the original signal. So you need a

422
00:25:35,560 --> 00:25:39,280
Speaker 1: really fast sample rate to get an accurate representation of

423
00:25:39,320 --> 00:25:43,679
Speaker 1: what was really happening. This is the major difference between

424
00:25:43,720 --> 00:25:49,560
Speaker 1: analog and digital. Analog is continuous and unbroken. Digital is discreet.

425
00:25:50,160 --> 00:25:53,600
Speaker 1: But if you are using a very fast sample rate,

426
00:25:53,800 --> 00:25:56,919
Speaker 1: you can create a digital record of a continuous signal

427
00:25:57,680 --> 00:26:02,240
Speaker 1: that to human ears appears to be continuous itself. Again,

428
00:26:02,760 --> 00:26:05,720
Speaker 1: if that shutter is opening and closing fast enough, it's

429
00:26:05,720 --> 00:26:09,119
Speaker 1: almost like it's not even there. Now, let's imagine that

430
00:26:09,200 --> 00:26:14,280
Speaker 1: we've got two graphs that are showing the same signal, right,

431
00:26:14,440 --> 00:26:18,240
Speaker 1: and on the left side we've got the analog signal represented,

432
00:26:18,600 --> 00:26:21,480
Speaker 1: and on the rights that we've got the digital signal. Now,

433
00:26:22,080 --> 00:26:25,080
Speaker 1: let's say at first glance, these two are identical. They

434
00:26:25,080 --> 00:26:27,960
Speaker 1: both look like, you know, a typical sign wave. But

435
00:26:28,000 --> 00:26:31,080
Speaker 1: then you zoom into the analog representation. But no matter

436
00:26:31,119 --> 00:26:33,000
Speaker 1: how how far you zoom in, you see it's just

437
00:26:33,040 --> 00:26:39,120
Speaker 1: a continuous, unbroken line that's representing this this signe wave. Now,

438
00:26:39,200 --> 00:26:41,920
Speaker 1: let's say we take the digital one and we zoom

439
00:26:41,960 --> 00:26:43,760
Speaker 1: way in. Well, as we zoom wag in, we and

440
00:26:43,840 --> 00:26:46,639
Speaker 1: we get closer, we see that rather than being continuous,

441
00:26:46,680 --> 00:26:50,679
Speaker 1: it's actually a series of discrete moments, like almost like

442
00:26:50,760 --> 00:26:54,880
Speaker 1: steps or stairs. That's kind of what we're talking about here.

443
00:26:54,920 --> 00:26:57,640
Speaker 1: The question is how many stairs do we use? Like,

444
00:26:57,680 --> 00:27:01,960
Speaker 1: what's the resolution that we're using here. You can kind

445
00:27:01,960 --> 00:27:05,280
Speaker 1: of think of it like megapixels in a picture. If

446
00:27:05,280 --> 00:27:08,080
Speaker 1: you don't have a lot of megapixels, then you might

447
00:27:08,160 --> 00:27:11,480
Speaker 1: see some blockiness in a photo once you get to

448
00:27:11,520 --> 00:27:15,359
Speaker 1: a certain density, depending on you know, the size of

449
00:27:15,400 --> 00:27:17,440
Speaker 1: the image you're looking at, Like if you're looking at

450
00:27:17,520 --> 00:27:19,119
Speaker 1: on the side of a building, you're gonna need a

451
00:27:19,160 --> 00:27:22,240
Speaker 1: lot of megapixels so it doesn't look blocky. But depending

452
00:27:22,280 --> 00:27:25,200
Speaker 1: on that, uh, it may look really smooth. Same sort

453
00:27:25,240 --> 00:27:28,359
Speaker 1: of thing with sound. Now, if you've ever played with

454
00:27:28,480 --> 00:27:33,440
Speaker 1: digital audio recorders, you've probably seen something labeled sample rate

455
00:27:33,760 --> 00:27:37,320
Speaker 1: or project rate. This refers to the number of samples

456
00:27:37,359 --> 00:27:41,000
Speaker 1: that the recording is taking every second, and to record

457
00:27:41,000 --> 00:27:43,840
Speaker 1: a sound, that sample rate has to be fast enough

458
00:27:44,080 --> 00:27:48,600
Speaker 1: to take two samples within one wavelength of every sound

459
00:27:48,680 --> 00:27:52,359
Speaker 1: that's appearing in that in that recording. And remember I

460
00:27:52,400 --> 00:27:56,520
Speaker 1: said that it sounds wavelength is inversely proportional or has

461
00:27:56,560 --> 00:28:00,919
Speaker 1: an inverse proportional relationship to the sounds frequency, So the

462
00:28:01,040 --> 00:28:05,240
Speaker 1: higher frequency sounds have shorter wavelengths, and you do need

463
00:28:05,320 --> 00:28:09,320
Speaker 1: two samples per wavelength to capture the data necessary. To

464
00:28:09,440 --> 00:28:12,040
Speaker 1: have a recording of that sound. If the wavelength is

465
00:28:12,080 --> 00:28:15,080
Speaker 1: too small, then your sample rate will not be sufficient

466
00:28:15,119 --> 00:28:17,680
Speaker 1: to get all, you know, the full information about that

467
00:28:17,880 --> 00:28:20,080
Speaker 1: sound wave. You won't be able to record it, at

468
00:28:20,119 --> 00:28:23,960
Speaker 1: least not accurately. So remember I said the typical range

469
00:28:23,960 --> 00:28:27,840
Speaker 1: of human hearing is between twenty hurts to twenty killer

470
00:28:27,920 --> 00:28:32,399
Speaker 1: hurts or twenty thousand hurts. That's twenty thousand cycles per second,

471
00:28:32,720 --> 00:28:35,040
Speaker 1: and you have to gather two samples per wavelength or

472
00:28:35,320 --> 00:28:37,760
Speaker 1: or cycle. So that means you need a sampling rate

473
00:28:37,800 --> 00:28:41,040
Speaker 1: of at least forty thousand times per second or forty

474
00:28:41,160 --> 00:28:44,040
Speaker 1: killer hurts to be able to sample everything that's within

475
00:28:44,120 --> 00:28:48,000
Speaker 1: the typical human hearing range. Well, a basic sample rate

476
00:28:48,040 --> 00:28:50,960
Speaker 1: that a lot of people will use for various recording

477
00:28:50,960 --> 00:28:55,160
Speaker 1: projects is forty four point one killer hurts, uh, and

478
00:28:55,200 --> 00:28:57,000
Speaker 1: then they go up from there. In fact, we use

479
00:28:57,200 --> 00:29:00,600
Speaker 1: forty eight killer hurts when we're recording our episodes. I'm

480
00:29:00,680 --> 00:29:02,880
Speaker 1: using forty eight killer hurts right now. I had to

481
00:29:02,960 --> 00:29:06,280
Speaker 1: check because I did accidentally do forty four point one

482
00:29:06,360 --> 00:29:09,600
Speaker 1: for an episode a few weeks back, and Sary needed

483
00:29:09,600 --> 00:29:12,040
Speaker 1: to gently remind me that I need to fix that.

484
00:29:12,480 --> 00:29:14,960
Speaker 1: So it's a forty eight Killer Hurts. So I also

485
00:29:15,040 --> 00:29:18,719
Speaker 1: mentioned that we're quantifying all those elements about the sound.

486
00:29:18,800 --> 00:29:22,440
Speaker 1: We want as accurate a representation of the original sound

487
00:29:23,120 --> 00:29:25,920
Speaker 1: as possible. That means we're not just concerned with the

488
00:29:26,040 --> 00:29:30,000
Speaker 1: number of snapshots that we're taking every second. We're also

489
00:29:30,160 --> 00:29:33,960
Speaker 1: concerned with the quality of each of those snapshots. If

490
00:29:34,000 --> 00:29:36,640
Speaker 1: we were using a literal camera to take pictures, we

491
00:29:36,680 --> 00:29:38,880
Speaker 1: would want stuff like the lighting and the lens to

492
00:29:38,920 --> 00:29:42,080
Speaker 1: be perfect so that every single photo we got was

493
00:29:42,120 --> 00:29:45,320
Speaker 1: an accurate representation of what we were seeing when we

494
00:29:45,320 --> 00:29:48,400
Speaker 1: were there. Well, with digital recording, you know, we're not

495
00:29:48,440 --> 00:29:51,120
Speaker 1: talking about lights and cameras. We're talking about how much

496
00:29:51,280 --> 00:29:55,760
Speaker 1: data we're using to describe the original signal. This is

497
00:29:55,800 --> 00:29:59,920
Speaker 1: called bit depth. Bit depth refers to how many potential

498
00:30:00,160 --> 00:30:03,440
Speaker 1: values we can assign to a signal in an effort

499
00:30:03,480 --> 00:30:06,480
Speaker 1: to describe it. The more potential values we can use,

500
00:30:07,000 --> 00:30:10,640
Speaker 1: the more accurately we can describe the signal. So let's

501
00:30:10,640 --> 00:30:13,920
Speaker 1: do another analogy. All right, Let's say that we're in

502
00:30:13,960 --> 00:30:16,960
Speaker 1: a room. It's you and your best friend. Your best

503
00:30:16,960 --> 00:30:18,920
Speaker 1: friends all the way, I across the other side of

504
00:30:18,920 --> 00:30:22,800
Speaker 1: the room, and I hand you a picture. Your job

505
00:30:23,040 --> 00:30:26,520
Speaker 1: is to subscribe that picture to your best friend who's

506
00:30:26,520 --> 00:30:28,959
Speaker 1: across the room. Your best friend cannot see the picture,

507
00:30:29,280 --> 00:30:32,400
Speaker 1: They can only hear your description. Their job is to

508
00:30:32,480 --> 00:30:35,520
Speaker 1: try and recreate the picture, to draw it as you

509
00:30:35,560 --> 00:30:38,640
Speaker 1: describe it. However, I give you some more restrictions. I say,

510
00:30:39,080 --> 00:30:42,040
Speaker 1: you can only use five adjectives. Uh, you can only

511
00:30:42,120 --> 00:30:44,720
Speaker 1: use five sentences, and they have to be simple sentences.

512
00:30:44,720 --> 00:30:48,680
Speaker 1: They can't be compound or complex or anything. Five simple

513
00:30:48,720 --> 00:30:52,680
Speaker 1: short sentences with a maximum of five adjectives to describe

514
00:30:52,720 --> 00:30:56,720
Speaker 1: that picture. Well, chances are your best friend would draw

515
00:30:56,760 --> 00:30:59,800
Speaker 1: something that's kind of similar to the picture I gave you,

516
00:31:00,360 --> 00:31:02,840
Speaker 1: but it wouldn't be an accurate copy of it. Right.

517
00:31:02,960 --> 00:31:04,480
Speaker 1: You might be like, Okay, I can see where you

518
00:31:04,560 --> 00:31:07,520
Speaker 1: got that based upon the description. But let's say we

519
00:31:07,560 --> 00:31:09,880
Speaker 1: repeat this task, and each time we repeat it, I

520
00:31:09,960 --> 00:31:12,320
Speaker 1: give you a little more freedom and how you can

521
00:31:12,360 --> 00:31:15,320
Speaker 1: describe the picture you're looking at to your friend. So

522
00:31:15,360 --> 00:31:17,760
Speaker 1: you get to use more adjectives, you get to use

523
00:31:17,760 --> 00:31:21,280
Speaker 1: more complex sentences, and each time you're given a larger

524
00:31:21,360 --> 00:31:24,320
Speaker 1: set of potential values that you can express to your

525
00:31:24,360 --> 00:31:27,880
Speaker 1: best friend. Well, that's kind of like bit depths. If

526
00:31:27,880 --> 00:31:32,080
Speaker 1: you're using sixteen bit bit depths. That means you're using

527
00:31:32,120 --> 00:31:35,239
Speaker 1: sixteen bits to determine the range of values that can

528
00:31:35,320 --> 00:31:39,040
Speaker 1: describe the signal. So a bit is either a zero

529
00:31:39,160 --> 00:31:42,160
Speaker 1: or a one. With sixteen bits, you can represent up

530
00:31:42,160 --> 00:31:46,400
Speaker 1: to sixty five thousand, five hundred thirty six values. However,

531
00:31:46,880 --> 00:31:48,640
Speaker 1: let's say you were to go to thirty two bit,

532
00:31:48,960 --> 00:31:52,680
Speaker 1: so sixteen to thirty two, you would think, oh, you

533
00:31:52,680 --> 00:31:55,400
Speaker 1: could do twice as many. That's not that's not the case.

534
00:31:55,640 --> 00:31:58,160
Speaker 1: With thirty two bit depth, you wouldn't be talking about

535
00:31:58,200 --> 00:32:01,040
Speaker 1: twice as many as sixteen bit. With thirty two bits,

536
00:32:01,080 --> 00:32:04,040
Speaker 1: you would be able to describe up to four billion,

537
00:32:04,200 --> 00:32:08,360
Speaker 1: two million, nine d sixty seven thousand, two hundred nineties

538
00:32:08,440 --> 00:32:12,400
Speaker 1: six values. So the greater the bit depth, the more

539
00:32:12,560 --> 00:32:18,960
Speaker 1: accurately you can describe something. Essentially, uh So, it's both

540
00:32:19,000 --> 00:32:22,320
Speaker 1: the sample rate and bit depth together that can allow

541
00:32:22,400 --> 00:32:26,880
Speaker 1: a digital system to create a digital recording that represents

542
00:32:27,320 --> 00:32:32,160
Speaker 1: that continuous signal. It was sampling. Again, the digital recording

543
00:32:32,520 --> 00:32:35,200
Speaker 1: is not continuous. If we zoomed way in, we would

544
00:32:35,200 --> 00:32:36,959
Speaker 1: see it's a bunch of these little steps that are

545
00:32:37,000 --> 00:32:39,560
Speaker 1: all linked together. But if the sample rate is high

546
00:32:39,680 --> 00:32:42,080
Speaker 1: enough and the bit depth is great, enough, we can

547
00:32:42,120 --> 00:32:44,920
Speaker 1: reach a point where the human ear really can't discern

548
00:32:45,000 --> 00:32:48,800
Speaker 1: the difference. Does this mean at lower settings we would

549
00:32:48,800 --> 00:32:52,640
Speaker 1: actually notice a difference if you go low enough. Yeah,

550
00:32:52,760 --> 00:32:56,120
Speaker 1: but really, most of the time even sixteen bit is

551
00:32:56,160 --> 00:33:00,520
Speaker 1: sufficient for just plain old recording and playback. However, if

552
00:33:00,560 --> 00:33:02,520
Speaker 1: you want to work on a project. Let's say you're

553
00:33:02,840 --> 00:33:06,880
Speaker 1: an editor and you're you're trying to edit together music

554
00:33:06,960 --> 00:33:10,760
Speaker 1: files or audio files, larger bit depth gives you much

555
00:33:10,760 --> 00:33:14,160
Speaker 1: more space to work in without introducing stuff like distortion.

556
00:33:14,680 --> 00:33:18,040
Speaker 1: This is called headroom. And if you remember the character

557
00:33:18,120 --> 00:33:21,440
Speaker 1: Max Headroom, that name is a pun on this very

558
00:33:21,480 --> 00:33:25,880
Speaker 1: sort of thing. Technically, at the lower rates you get

559
00:33:26,000 --> 00:33:30,719
Speaker 1: deviations from the true sound. You're essentially inserting errors into

560
00:33:31,000 --> 00:33:35,160
Speaker 1: the digital file. Uh As you increase semple rate and

561
00:33:35,200 --> 00:33:38,320
Speaker 1: bit depth, you can decrease those errors until you reach

562
00:33:38,360 --> 00:33:42,160
Speaker 1: a point where any errors that exist are are impossible

563
00:33:42,200 --> 00:33:45,840
Speaker 1: to detect, at least with our natural equipment. Maybe you

564
00:33:45,840 --> 00:33:49,720
Speaker 1: could detect them if you had supersensitive electronic equipment to

565
00:33:49,920 --> 00:33:52,680
Speaker 1: indicate it, but it wouldn't be something that would be

566
00:33:52,760 --> 00:33:57,600
Speaker 1: necessarily perceptible to human ears. One other interesting thing, or

567
00:33:57,600 --> 00:34:00,080
Speaker 1: a couple of interesting things that I should mention with

568
00:34:00,120 --> 00:34:03,040
Speaker 1: sample rates. So I said, like, your sample rate has

569
00:34:03,080 --> 00:34:05,360
Speaker 1: to be fast enough to capture two points of data

570
00:34:05,560 --> 00:34:08,279
Speaker 1: along the wavelength of every sound, and for most of us,

571
00:34:08,320 --> 00:34:11,560
Speaker 1: that hearing range caps out at twenty killer hurts. That

572
00:34:11,680 --> 00:34:13,600
Speaker 1: might lead you to the question, well, why would you

573
00:34:13,600 --> 00:34:16,560
Speaker 1: bother to go higher than forty killer hurts? Now, if

574
00:34:16,600 --> 00:34:19,560
Speaker 1: twenty killer hurts is the limit of human hearing, typical

575
00:34:19,600 --> 00:34:23,360
Speaker 1: human hearing, why go to forty four point one? Well,

576
00:34:24,280 --> 00:34:26,160
Speaker 1: there are some other things that we need to think

577
00:34:26,200 --> 00:34:28,960
Speaker 1: about that play a factor in this. One of those

578
00:34:28,960 --> 00:34:33,200
Speaker 1: are harmonics. Uh Now, harmonics are way too complicated for

579
00:34:33,239 --> 00:34:36,120
Speaker 1: me to really fully get into in this episode. But

580
00:34:36,239 --> 00:34:39,800
Speaker 1: harmonics can actually exist above the range of human hearing

581
00:34:40,120 --> 00:34:43,960
Speaker 1: and yet still shape how we experience a sound. You

582
00:34:43,960 --> 00:34:48,319
Speaker 1: can almost think of it as the harmonics are sculpting

583
00:34:49,080 --> 00:34:51,440
Speaker 1: the sounds we hear. So even harmonics that are outside

584
00:34:51,440 --> 00:34:53,880
Speaker 1: of our hearing range might be affecting the sounds we

585
00:34:54,000 --> 00:34:57,640
Speaker 1: still can here. So we're not hearing the harmonics directly,

586
00:34:58,040 --> 00:35:00,960
Speaker 1: We're rather experiencing how they are affecting the rest of

587
00:35:01,000 --> 00:35:04,080
Speaker 1: the stuff we can perceive. If that makes sense, Well,

588
00:35:04,920 --> 00:35:07,279
Speaker 1: if you're sampling at a rate that's too low to

589
00:35:07,360 --> 00:35:10,640
Speaker 1: capture those harmonics. Those harmonics are not going to be

590
00:35:10,680 --> 00:35:13,160
Speaker 1: in the digital recording, so they won't be in the playback.

591
00:35:13,200 --> 00:35:16,279
Speaker 1: When you listen to it, you lose that sound. So

592
00:35:16,560 --> 00:35:19,040
Speaker 1: when you do listen back, you're gonna be losing those

593
00:35:19,080 --> 00:35:23,879
Speaker 1: effects and you're not going to experience the sound as

594
00:35:23,920 --> 00:35:25,840
Speaker 1: you would had you been in the place when it

595
00:35:25,920 --> 00:35:28,839
Speaker 1: was being recorded. Also, one thing that we can do

596
00:35:28,880 --> 00:35:31,600
Speaker 1: with recordings is we can change the pitch when we

597
00:35:31,640 --> 00:35:34,200
Speaker 1: record stuff. You know, like if you have a digital recording,

598
00:35:34,760 --> 00:35:39,319
Speaker 1: you can digitally change the pitch. In fact, Tari, if

599
00:35:39,360 --> 00:35:43,160
Speaker 1: you would like to digitally alter the pitch of my voice,

600
00:35:43,280 --> 00:35:47,479
Speaker 1: maybe increase the pitch so that I get that kind

601
00:35:47,520 --> 00:35:51,160
Speaker 1: of chipmunk sound to it. That's you know, boosting the

602
00:35:51,239 --> 00:35:55,960
Speaker 1: frequency up or maybe bringing that frequency way down and

603
00:35:56,000 --> 00:36:00,759
Speaker 1: giving me that deep, bass, booming voice that I know

604
00:36:00,840 --> 00:36:04,720
Speaker 1: I'll never have and I'll never be able to really

605
00:36:05,160 --> 00:36:10,279
Speaker 1: play like a baritone in a musical. Feel free to

606
00:36:10,320 --> 00:36:13,960
Speaker 1: do it. The world is your plaything. So you can

607
00:36:14,080 --> 00:36:16,759
Speaker 1: record audio with a sample rate of forty four point

608
00:36:16,760 --> 00:36:20,000
Speaker 1: one killer hurts. Then on playback, maybe you decide you

609
00:36:20,040 --> 00:36:23,960
Speaker 1: want to pitch everything down well, you'll hit a ceiling

610
00:36:24,320 --> 00:36:27,319
Speaker 1: of the sounds that you'll have in that recording once

611
00:36:27,320 --> 00:36:29,719
Speaker 1: you get to killer hurts or so. So, if there

612
00:36:29,760 --> 00:36:34,200
Speaker 1: were sounds that were above killer hurts, you're not really

613
00:36:34,200 --> 00:36:36,000
Speaker 1: going to be able to hear them with the pitched

614
00:36:36,080 --> 00:36:39,680
Speaker 1: down recording. Remember that pitch down recording will bring stuff

615
00:36:39,719 --> 00:36:43,200
Speaker 1: that is outside human hearing into the range of human

616
00:36:43,200 --> 00:36:46,000
Speaker 1: hearing because you've pitched it down. But if your sample

617
00:36:46,120 --> 00:36:50,160
Speaker 1: rate is too slow, too low, in other words, you

618
00:36:50,160 --> 00:36:55,040
Speaker 1: won't have captured those higher pitches. So let's say that

619
00:36:55,080 --> 00:36:58,520
Speaker 1: you're recording something that's in a very very high frequency,

620
00:36:58,680 --> 00:37:01,239
Speaker 1: like beyond the range of human hearing. But then you

621
00:37:01,280 --> 00:37:03,799
Speaker 1: want to do a pitch adjustment so that people can

622
00:37:03,840 --> 00:37:08,640
Speaker 1: actually hear a sound, even though you know normally they

623
00:37:08,640 --> 00:37:10,000
Speaker 1: wouldn't be able to hear it at all because it

624
00:37:10,000 --> 00:37:11,759
Speaker 1: would be outside their range. Maybe you're doing like a

625
00:37:11,840 --> 00:37:16,160
Speaker 1: nature documentary and there's a critter that makes sounds that

626
00:37:16,200 --> 00:37:18,600
Speaker 1: typically we cannot hear, but by pitching it down, you

627
00:37:18,600 --> 00:37:20,920
Speaker 1: can say this is what it sounds like once we

628
00:37:20,960 --> 00:37:23,200
Speaker 1: reduce the pitch. Well, you have to have a sample

629
00:37:23,280 --> 00:37:27,520
Speaker 1: rate that's high enough so that you capture that range

630
00:37:27,520 --> 00:37:30,319
Speaker 1: of sound in the first place. Right, So that's one

631
00:37:30,400 --> 00:37:33,120
Speaker 1: reason why you might want a very high sample rate.

632
00:37:34,000 --> 00:37:36,239
Speaker 1: I just thought that was neat all. Right, we need

633
00:37:36,280 --> 00:37:38,799
Speaker 1: to take another break. When we come back, we'll talk

634
00:37:38,840 --> 00:37:40,759
Speaker 1: about the process we need to follow in order to

635
00:37:40,800 --> 00:37:43,799
Speaker 1: go from analog to digital and back again. It's gonna

636
00:37:43,840 --> 00:37:45,160
Speaker 1: be a lot of us talking about some of the

637
00:37:45,160 --> 00:37:47,919
Speaker 1: stuff we just chatted about, and we'll also talk about

638
00:37:48,000 --> 00:37:51,160
Speaker 1: audio files a little bit. But first let's take another

639
00:37:51,239 --> 00:38:03,080
Speaker 1: quick break. Now, before I dive into the converter's part,

640
00:38:03,160 --> 00:38:06,719
Speaker 1: I should add there are some outliers, right There are

641
00:38:06,719 --> 00:38:10,560
Speaker 1: digital microphones. For example. Now there's some digital microphones that

642
00:38:10,719 --> 00:38:13,399
Speaker 1: are analog at the front end, so in other words,

643
00:38:13,440 --> 00:38:16,440
Speaker 1: they still have the diaphragm, they still have the electromagnet,

644
00:38:17,360 --> 00:38:21,560
Speaker 1: they're still generating an electric current with varying voltage. But

645
00:38:21,680 --> 00:38:25,160
Speaker 1: then they'll have an analog to digital converter built into

646
00:38:25,200 --> 00:38:27,400
Speaker 1: the microphone itself. So you have an A D C

647
00:38:28,080 --> 00:38:31,400
Speaker 1: and it's right there in the device, and then you

648
00:38:31,480 --> 00:38:36,480
Speaker 1: have the signal go to other elements of your recording studio.

649
00:38:37,400 --> 00:38:41,720
Speaker 1: There are other digital microphones that use the pressure waves

650
00:38:41,800 --> 00:38:47,600
Speaker 1: to move elements that immediately convert into digital data, getting

651
00:38:47,600 --> 00:38:50,960
Speaker 1: into that is pretty complicated. They are not super common.

652
00:38:51,080 --> 00:38:53,840
Speaker 1: It's not like that's the type of microphone that everyone

653
00:38:53,960 --> 00:38:59,239
Speaker 1: is using. Um, they're important, but you could argue that

654
00:38:59,280 --> 00:39:02,600
Speaker 1: it's a microphon own and a d C all in

655
00:39:02,680 --> 00:39:07,880
Speaker 1: one because you're taking audio, which is an analog you know, signal,

656
00:39:08,840 --> 00:39:12,680
Speaker 1: and you're converting it immediately into binary or digital information.

657
00:39:13,719 --> 00:39:17,320
Speaker 1: But we're really going to talk about analog to digital

658
00:39:17,360 --> 00:39:21,279
Speaker 1: and digital to analog, which is what most equipment is

659
00:39:21,440 --> 00:39:24,360
Speaker 1: dealing with. When we're speaking about this kind of stuff,

660
00:39:24,360 --> 00:39:28,160
Speaker 1: We're not gonna worry about stuff that's native digital because

661
00:39:28,200 --> 00:39:32,600
Speaker 1: it's just it's not that common. Um like digital speakers

662
00:39:33,760 --> 00:39:37,399
Speaker 1: are a different thing altogether as well, and um yeah,

663
00:39:37,480 --> 00:39:40,120
Speaker 1: we're just gonna wipe those out. We're gonna look at

664
00:39:40,160 --> 00:39:42,480
Speaker 1: what most people use, which is that you know, your

665
00:39:42,480 --> 00:39:48,640
Speaker 1: typical stereo system or your typical audio recording setup. So again,

666
00:39:48,719 --> 00:39:53,960
Speaker 1: typically the end equipment that you use to either record

667
00:39:54,080 --> 00:39:56,640
Speaker 1: or listen to audio, the stuff at the very ends

668
00:39:56,640 --> 00:40:01,080
Speaker 1: of that chain are typically analog in nature. Again, there

669
00:40:01,080 --> 00:40:03,600
Speaker 1: are outliers, but for the vast majority of cases, we're

670
00:40:03,640 --> 00:40:08,280
Speaker 1: talking about an analog device that generates an analog signal

671
00:40:08,480 --> 00:40:12,279
Speaker 1: or plays back an analog signal. So we take an

672
00:40:12,280 --> 00:40:15,759
Speaker 1: analog phenomena, the pressure waves that make up sound. We

673
00:40:15,920 --> 00:40:18,960
Speaker 1: feed that through a transducer to create a different but

674
00:40:19,120 --> 00:40:22,080
Speaker 1: still analog signal, in this case, an electric current with

675
00:40:22,200 --> 00:40:25,680
Speaker 1: variable voltage. But now we get to a point where

676
00:40:25,680 --> 00:40:28,000
Speaker 1: we say, all right, we want to transform that into

677
00:40:28,040 --> 00:40:33,800
Speaker 1: a digital file that quantifies this signal. When we play

678
00:40:33,880 --> 00:40:37,400
Speaker 1: the digital file back, that signal ultimately needs to go

679
00:40:37,480 --> 00:40:40,520
Speaker 1: through some kind of loud speaker for us to hear it.

680
00:40:41,160 --> 00:40:44,200
Speaker 1: Maybe that loud speakers in our headphones, maybe it's a

681
00:40:44,239 --> 00:40:47,640
Speaker 1: stereo system, maybe it's you know, the speaker on your smartphone.

682
00:40:48,600 --> 00:40:50,839
Speaker 1: Maybe it's a sound system in a stadium. But we

683
00:40:50,880 --> 00:40:53,759
Speaker 1: need a way to transform that digital information, all those

684
00:40:53,800 --> 00:40:57,560
Speaker 1: zeros and ones into an electric signal with variable voltage,

685
00:40:58,000 --> 00:41:00,640
Speaker 1: and we probably have to amplify that nal so that

686
00:41:00,680 --> 00:41:03,719
Speaker 1: it's strong enough to drive whatever speakers were using to

687
00:41:03,760 --> 00:41:08,360
Speaker 1: create the sound, which again we experience as an analog phenomena. Now,

688
00:41:08,640 --> 00:41:11,520
Speaker 1: if there was some way that we can interface directly

689
00:41:11,600 --> 00:41:15,640
Speaker 1: with machines and have those digital signals interact with our brains,

690
00:41:16,640 --> 00:41:18,879
Speaker 1: maybe we wouldn't need to do this kind of transformation.

691
00:41:19,160 --> 00:41:21,600
Speaker 1: But as it stands we do have to do this,

692
00:41:21,880 --> 00:41:25,520
Speaker 1: and this is where converters come into play. The converters

693
00:41:25,719 --> 00:41:29,839
Speaker 1: could be standalone devices, or frequently they're worked into the

694
00:41:29,880 --> 00:41:34,040
Speaker 1: design of various pieces of equipment. So for example, a

695
00:41:34,160 --> 00:41:36,480
Speaker 1: USB microphone, if you have one of those that you

696
00:41:36,480 --> 00:41:39,040
Speaker 1: plug into your computer, like I'm using one right now

697
00:41:39,080 --> 00:41:44,080
Speaker 1: to record this, they have that a d C converter

698
00:41:44,320 --> 00:41:47,080
Speaker 1: built into them. And that I'm being repetitive because that's

699
00:41:47,160 --> 00:41:50,360
Speaker 1: analog to digital converter. And then I said a DC converter.

700
00:41:50,400 --> 00:41:53,560
Speaker 1: It's like saying a t M machine. Anyway, the microphone

701
00:41:53,600 --> 00:41:56,920
Speaker 1: still acts just as a traditional analog mic on that end,

702
00:41:57,360 --> 00:41:59,279
Speaker 1: but then the electric signal has to go through a

703
00:41:59,280 --> 00:42:02,840
Speaker 1: converter converts into a digital signal, and that's what transmits

704
00:42:02,880 --> 00:42:04,960
Speaker 1: through the USB cable too, you know, whatever you got

705
00:42:04,960 --> 00:42:08,239
Speaker 1: hooked up to, like in my case, it's my work laptop.

706
00:42:08,719 --> 00:42:13,000
Speaker 1: Now here's the thing. There's more than one way to

707
00:42:13,120 --> 00:42:18,319
Speaker 1: convert an analog signal into a digital one. All of

708
00:42:18,360 --> 00:42:22,640
Speaker 1: these ways get pretty technical talking about the way it's sampled,

709
00:42:23,120 --> 00:42:28,439
Speaker 1: the way it ends up taking these measurements of the signal. So,

710
00:42:28,560 --> 00:42:31,200
Speaker 1: for example, with analog to digital converters or a d

711
00:42:31,280 --> 00:42:34,720
Speaker 1: c s. There are several popular methodologies, but generally speaking,

712
00:42:35,600 --> 00:42:39,720
Speaker 1: they all do the same thing on a big picture scale.

713
00:42:39,840 --> 00:42:43,200
Speaker 1: They all sample a signal. This is the snapshots that

714
00:42:43,239 --> 00:42:45,920
Speaker 1: I was talking about earlier. They look at a signal

715
00:42:46,440 --> 00:42:49,680
Speaker 1: and a specific frequency, like a specific They look at

716
00:42:49,680 --> 00:42:52,960
Speaker 1: the signal a specific number of times every second, and

717
00:42:53,000 --> 00:42:57,200
Speaker 1: they quantified the signal. They measure the signal, which determines

718
00:42:57,239 --> 00:43:01,239
Speaker 1: the resolution that you get of the signal. Obviously, if

719
00:43:01,239 --> 00:43:03,640
Speaker 1: you want high quality sound, you need both a good

720
00:43:03,640 --> 00:43:06,759
Speaker 1: sample rate and a good resolution, which we can think

721
00:43:06,800 --> 00:43:09,720
Speaker 1: of as you know, the accuracy in capturing the nature

722
00:43:09,880 --> 00:43:13,120
Speaker 1: of that signal. You can think of it as an

723
00:43:13,120 --> 00:43:17,120
Speaker 1: A d C is measuring the electric current many many

724
00:43:17,160 --> 00:43:21,760
Speaker 1: times per second and quantifies that measurement as digital data.

725
00:43:21,960 --> 00:43:26,960
Speaker 1: And it's not just like how important is this signal

726
00:43:27,080 --> 00:43:30,520
Speaker 1: at this specific moment in time, but also how important

727
00:43:30,560 --> 00:43:35,480
Speaker 1: are the changes in that signal over greater lengths of time. Now,

728
00:43:35,520 --> 00:43:37,680
Speaker 1: the bit depth we can think of is how detailed

729
00:43:37,719 --> 00:43:40,759
Speaker 1: these measurements can be. So the number of measurements and

730
00:43:40,800 --> 00:43:43,759
Speaker 1: the detail we get together determine the quality of the

731
00:43:43,800 --> 00:43:48,240
Speaker 1: digital signal compared to the original analog signal. And again

732
00:43:48,960 --> 00:43:52,760
Speaker 1: we're talking about digitally describing an electric current. At this point,

733
00:43:52,800 --> 00:43:57,680
Speaker 1: we're not talking about describing the sound necessarily. We're describing

734
00:43:57,680 --> 00:44:01,640
Speaker 1: the electric current that the transducer created after the sound

735
00:44:01,760 --> 00:44:04,920
Speaker 1: went through the transducer. Now, if the sample rate of

736
00:44:04,920 --> 00:44:07,359
Speaker 1: an A d C is too low, you get what's

737
00:44:07,360 --> 00:44:10,720
Speaker 1: called alias sing. Now, this means that the digital signal

738
00:44:10,760 --> 00:44:14,680
Speaker 1: will differ greatly from the original signal. Uh. And that

739
00:44:14,760 --> 00:44:16,959
Speaker 1: means that you're not going to have a good representation

740
00:44:17,040 --> 00:44:20,000
Speaker 1: of what was originally creating that signal in the first place,

741
00:44:20,160 --> 00:44:24,560
Speaker 1: in this case, whatever the sound was. UH. So that

742
00:44:24,560 --> 00:44:28,000
Speaker 1: that's what alias sing means in this context. Now, a

743
00:44:28,000 --> 00:44:30,440
Speaker 1: A DOCK or d a C is a digital to

744
00:44:30,480 --> 00:44:33,440
Speaker 1: audio converter, and it's basically the same thing we just

745
00:44:33,480 --> 00:44:37,160
Speaker 1: talked about, but in reverse. The d a C takes

746
00:44:37,480 --> 00:44:42,520
Speaker 1: digital information, which essentially is describing an analog signal an

747
00:44:42,600 --> 00:44:47,840
Speaker 1: electric current of variable voltage. Then it produces that analog signal.

748
00:44:48,760 --> 00:44:51,560
Speaker 1: The way it does again depends upon the type of

749
00:44:51,719 --> 00:44:54,520
Speaker 1: d A C. Just as A d C s have

750
00:44:54,640 --> 00:44:58,400
Speaker 1: different methodologies, so do d a C s. H. I

751
00:44:58,520 --> 00:45:01,239
Speaker 1: might do an episode that goes into more detail, like

752
00:45:01,280 --> 00:45:03,719
Speaker 1: I mentioned at the top of this episode, But honestly,

753
00:45:03,800 --> 00:45:07,719
Speaker 1: once you really start diving in there, it gets incredibly

754
00:45:07,760 --> 00:45:13,200
Speaker 1: technical very quickly. Generally speaking, we're talking about sophisticated circuit

755
00:45:13,239 --> 00:45:16,520
Speaker 1: boards that are designed to convert digital to analog or

756
00:45:16,600 --> 00:45:20,239
Speaker 1: vice versa, to switch between the data made up of

757
00:45:20,360 --> 00:45:25,439
Speaker 1: zeros and ones and a continuous electric signal. And again,

758
00:45:25,480 --> 00:45:27,760
Speaker 1: if there's interest, I'll go into more about how that works,

759
00:45:27,800 --> 00:45:31,200
Speaker 1: but believe me, it gets really complicated, and without visual

760
00:45:31,480 --> 00:45:37,200
Speaker 1: aids it's really hard to kind of get it across. Anyway,

761
00:45:37,440 --> 00:45:40,040
Speaker 1: Now let's talk about audio files. Also, I should mention

762
00:45:40,320 --> 00:45:42,319
Speaker 1: there's a ton of stuff I did not talk about, right,

763
00:45:42,360 --> 00:45:46,600
Speaker 1: I didn't talk about multiplexing or anything like that, So

764
00:45:46,680 --> 00:45:48,600
Speaker 1: there is a lot more to it than just the

765
00:45:49,360 --> 00:45:52,880
Speaker 1: general information I'm giving anyway. Audio files. So, back in

766
00:45:52,920 --> 00:45:56,279
Speaker 1: the day when c d s were fairly new, there

767
00:45:56,280 --> 00:46:00,000
Speaker 1: were audio files who just despised digital media. The process

768
00:46:00,040 --> 00:46:03,880
Speaker 1: us of converting an analog signal into a digital file

769
00:46:04,239 --> 00:46:08,239
Speaker 1: and then back again to analog. Well, that represented a

770
00:46:08,280 --> 00:46:12,080
Speaker 1: potential loss in quality, right, the playback experience might not

771
00:46:12,239 --> 00:46:16,520
Speaker 1: be as vibrant. Audio files typically use words like warm

772
00:46:16,680 --> 00:46:20,080
Speaker 1: or full to describe sound. These are words that are

773
00:46:20,120 --> 00:46:24,719
Speaker 1: hard to quantify they are experiential, I guess, and they

774
00:46:24,719 --> 00:46:29,600
Speaker 1: would lament that digitization removed some of those elements from recordings.

775
00:46:30,239 --> 00:46:33,840
Speaker 1: The thing is, depending upon how you're digitally sampling a signal,

776
00:46:34,000 --> 00:46:37,360
Speaker 1: some of that could be actually happening. You could be

777
00:46:37,480 --> 00:46:40,800
Speaker 1: losing harmonics. And this isn't even touching on the issue

778
00:46:40,840 --> 00:46:43,600
Speaker 1: that you start getting if you're if you're doing stuff

779
00:46:43,640 --> 00:46:47,279
Speaker 1: like compression file compression in this sense. I'll talk about

780
00:46:47,280 --> 00:46:51,400
Speaker 1: audio compression in a bit, but file compression can involve

781
00:46:51,520 --> 00:46:56,239
Speaker 1: using what are called lossy formats. A lossy format discards

782
00:46:56,440 --> 00:47:01,600
Speaker 1: part of a digital file that describe a signal, and

783
00:47:01,719 --> 00:47:04,480
Speaker 1: typically the way it does this is that the encoding

784
00:47:04,600 --> 00:47:08,439
Speaker 1: process is getting rid of information that it deems as

785
00:47:08,480 --> 00:47:13,040
Speaker 1: being irrelevant. So let me explain that last bit. I

786
00:47:13,040 --> 00:47:15,840
Speaker 1: did a full series of episodes about MP three's that

787
00:47:15,920 --> 00:47:18,879
Speaker 1: goes into this into far more detail, but i'll give

788
00:47:18,880 --> 00:47:21,480
Speaker 1: it down in dirty version for this episode. So, the

789
00:47:21,600 --> 00:47:25,880
Speaker 1: MP three method of compressing a file takes a psycho

790
00:47:25,960 --> 00:47:30,200
Speaker 1: acoustic approach in part when figuring out how to make

791
00:47:30,239 --> 00:47:34,920
Speaker 1: an audio file size smaller, because raw audio files can

792
00:47:34,960 --> 00:47:40,000
Speaker 1: be huge if you're really using a very high sample

793
00:47:40,120 --> 00:47:44,040
Speaker 1: rate and a big bit depth. During your recording process,

794
00:47:44,080 --> 00:47:48,680
Speaker 1: you're generating enormous files, right because the system is taking

795
00:47:48,760 --> 00:47:53,160
Speaker 1: data many many many times, many thousands of times every

796
00:47:53,200 --> 00:47:56,480
Speaker 1: second and using an enormous amount of information to try

797
00:47:56,520 --> 00:48:01,879
Speaker 1: and describe that signal each time, every single snapshot. That's

798
00:48:01,920 --> 00:48:05,839
Speaker 1: a lot of information and that isn't really convenient if

799
00:48:05,880 --> 00:48:08,480
Speaker 1: you want to store that file on like an old

800
00:48:08,560 --> 00:48:11,160
Speaker 1: MP three player. You know, if you remember those where

801
00:48:11,200 --> 00:48:13,440
Speaker 1: you had to like in the old old days, you

802
00:48:13,480 --> 00:48:15,839
Speaker 1: had to connect them physically to your computer. You would

803
00:48:15,920 --> 00:48:20,279
Speaker 1: download or rip music and you would then send that

804
00:48:20,680 --> 00:48:24,480
Speaker 1: music file to your device. These devices had very limited

805
00:48:24,760 --> 00:48:27,719
Speaker 1: storage space on them, so you couldn't really hold a

806
00:48:27,719 --> 00:48:30,640
Speaker 1: lot of raw audio. Like a single file might end

807
00:48:30,680 --> 00:48:33,000
Speaker 1: up taking up the entire storage on your m P

808
00:48:33,120 --> 00:48:36,640
Speaker 1: three player. And maybe you really like journeys Don't stop Believing,

809
00:48:36,920 --> 00:48:39,200
Speaker 1: but you might want some other songs on there too.

810
00:48:39,320 --> 00:48:41,120
Speaker 1: This is also more complicated if you want to do

811
00:48:41,200 --> 00:48:45,440
Speaker 1: something like stream music. You don't want to have enormous

812
00:48:45,480 --> 00:48:49,719
Speaker 1: files that would require like a gigabit Internet connection in

813
00:48:49,800 --> 00:48:52,200
Speaker 1: order to be able to stream it, So you have

814
00:48:52,280 --> 00:48:55,040
Speaker 1: to have a way to compress files down to sizes

815
00:48:55,040 --> 00:48:57,879
Speaker 1: that are easier to handle well. The way the MP

816
00:48:58,080 --> 00:49:01,840
Speaker 1: three algorithm does this is that once you set some

817
00:49:01,920 --> 00:49:05,719
Speaker 1: general parameters, like you decide how compressed you want to

818
00:49:05,760 --> 00:49:10,040
Speaker 1: make this file, essentially you're telling the MP three algorithm

819
00:49:10,080 --> 00:49:13,080
Speaker 1: how hard it needs to go. When it's starting to

820
00:49:13,120 --> 00:49:16,239
Speaker 1: cut stuff, well, then the algorithm begins to toss out

821
00:49:16,320 --> 00:49:19,040
Speaker 1: data that, at least in theory, should not affect your

822
00:49:19,120 --> 00:49:23,600
Speaker 1: experience when you listen back to the audio playback. So,

823
00:49:23,640 --> 00:49:27,239
Speaker 1: for example, let's say you've got a sound file and

824
00:49:27,320 --> 00:49:29,960
Speaker 1: in that sound file you have a very soft sound

825
00:49:30,400 --> 00:49:34,000
Speaker 1: that immediately follows a very loud sound. So loud sound happens,

826
00:49:34,239 --> 00:49:38,840
Speaker 1: soft sound happens immediately after that. Well, you wouldn't actually

827
00:49:38,840 --> 00:49:41,719
Speaker 1: hear that really soft sound, and that's just because of

828
00:49:41,719 --> 00:49:46,200
Speaker 1: how our ears work. The loud sound effectively masks the

829
00:49:46,239 --> 00:49:49,120
Speaker 1: softer one, so it's almost like the soft one didn't

830
00:49:49,120 --> 00:49:51,640
Speaker 1: exist at all. Well, if it's like the soft one

831
00:49:51,719 --> 00:49:55,799
Speaker 1: didn't exist, then there's no reason to keep it right.

832
00:49:55,960 --> 00:49:58,920
Speaker 1: If you couldn't hear it anyway, there's no reason that

833
00:49:58,920 --> 00:50:02,719
Speaker 1: that should be in uh the file, right, So the

834
00:50:02,840 --> 00:50:08,080
Speaker 1: algorithm effectively, through analyzing this data says, ah, that doesn't

835
00:50:08,120 --> 00:50:09,879
Speaker 1: need to be in there. No one would hear it,

836
00:50:09,960 --> 00:50:12,560
Speaker 1: so it tosses the data out. That's why it's a

837
00:50:12,640 --> 00:50:17,000
Speaker 1: lossy file format. The same goes for frequencies that would

838
00:50:17,040 --> 00:50:20,439
Speaker 1: be outside the range of human hearing. The logic is, well,

839
00:50:20,480 --> 00:50:24,080
Speaker 1: you can't hear something that's at killer hurts, so we're

840
00:50:24,120 --> 00:50:26,560
Speaker 1: just gonna get rid of anything that's occurring at that

841
00:50:26,600 --> 00:50:31,120
Speaker 1: frequency because there's no reason to keep it. However, depending

842
00:50:31,120 --> 00:50:33,200
Speaker 1: on how much you want to compress that file, those

843
00:50:33,239 --> 00:50:36,920
Speaker 1: cuts can really start to affect the quality of the

844
00:50:37,000 --> 00:50:40,400
Speaker 1: playback audio when you put it back through you know,

845
00:50:40,440 --> 00:50:42,279
Speaker 1: a decode er and you get the audio on the

846
00:50:42,320 --> 00:50:46,719
Speaker 1: other end. By the way, as I mentioned, file compression

847
00:50:46,880 --> 00:50:50,640
Speaker 1: is not the same thing as audio compression. I'll explain

848
00:50:51,239 --> 00:50:53,640
Speaker 1: what I mean by that, but first let's take one

849
00:50:53,760 --> 00:51:04,360
Speaker 1: last break. Okay, before the break, I said that audio

850
00:51:04,400 --> 00:51:06,879
Speaker 1: compression and file compression are two different things. It does

851
00:51:06,960 --> 00:51:10,439
Speaker 1: get confusing, and I myself have been guilty of kind

852
00:51:10,440 --> 00:51:13,960
Speaker 1: of interchanging the words or not clarifying enough while talking

853
00:51:14,000 --> 00:51:18,000
Speaker 1: about compression and uh, thus I have been guilty of

854
00:51:18,040 --> 00:51:21,279
Speaker 1: confusing it even more so. My apologies for that, but

855
00:51:21,360 --> 00:51:25,680
Speaker 1: let's get to it. Audio compression refers to reducing the

856
00:51:25,760 --> 00:51:30,680
Speaker 1: dynamic range of volume in a recording. Uh So, in

857
00:51:30,719 --> 00:51:34,960
Speaker 1: other words, it's about reducing the volume distance between the

858
00:51:35,080 --> 00:51:38,480
Speaker 1: softest sounds and the loudest sounds. Now, this can be

859
00:51:38,520 --> 00:51:41,600
Speaker 1: really important for certain types of recording. I'll give you

860
00:51:41,600 --> 00:51:44,840
Speaker 1: an example that I frequently run into that drives me nuts.

861
00:51:44,880 --> 00:51:48,520
Speaker 1: And this happens a lot with like streaming media for me,

862
00:51:49,120 --> 00:51:52,440
Speaker 1: so movies and television. Have you ever watched like an

863
00:51:52,440 --> 00:51:56,120
Speaker 1: action movie where you can barely hear some of the dialogue,

864
00:51:56,200 --> 00:51:59,160
Speaker 1: especially if people are speaking in like low voices and

865
00:51:59,239 --> 00:52:01,759
Speaker 1: you know they're trying to be secretive or whatever. And

866
00:52:01,800 --> 00:52:03,440
Speaker 1: then so you turn the volume up so that you

867
00:52:03,440 --> 00:52:05,439
Speaker 1: can hear what people are saying. But then the next

868
00:52:05,480 --> 00:52:09,359
Speaker 1: time something explodes, you're worried that you've just destroyed all

869
00:52:09,400 --> 00:52:12,880
Speaker 1: your speakers, or maybe you've caused yourself permanent hearing damage.

870
00:52:13,320 --> 00:52:16,160
Speaker 1: This happens to me all the time, where the softest

871
00:52:16,200 --> 00:52:20,799
Speaker 1: sounds and the loudest sounds are so far apart that

872
00:52:20,880 --> 00:52:24,480
Speaker 1: there's no comfortable volume. I can select where I can

873
00:52:24,480 --> 00:52:28,399
Speaker 1: hear everything and not feel like one I'm missing out

874
00:52:28,440 --> 00:52:31,239
Speaker 1: on some dialogue, or two my neighbors are going to

875
00:52:31,280 --> 00:52:33,919
Speaker 1: come over and complain that I've got my volume turned

876
00:52:34,000 --> 00:52:37,080
Speaker 1: up too loud. So compression in a case like that

877
00:52:37,239 --> 00:52:40,799
Speaker 1: can narrow the gap between the softest parts and the

878
00:52:40,840 --> 00:52:43,840
Speaker 1: loudest parts so that you can find that kind of

879
00:52:43,880 --> 00:52:49,600
Speaker 1: comfortable volume where you can hear everything. However, going overboard

880
00:52:50,000 --> 00:52:53,400
Speaker 1: with audio compression will reduce the dynamic range in a

881
00:52:53,440 --> 00:52:56,399
Speaker 1: recorded piece of audio, and if you do that too much,

882
00:52:56,640 --> 00:52:59,959
Speaker 1: it can make the audio sound flat and uninteresting, where

883
00:53:00,040 --> 00:53:03,000
Speaker 1: everything is just coming out at exactly the same volume.

884
00:53:03,719 --> 00:53:07,120
Speaker 1: If there's no real volume range, then your ears just

885
00:53:07,200 --> 00:53:10,120
Speaker 1: kind of get tired of hearing everything played back at

886
00:53:10,160 --> 00:53:15,719
Speaker 1: that same level. Some digital recordings really suffered from this

887
00:53:16,160 --> 00:53:19,520
Speaker 1: kind of processing, Like there was an era of music

888
00:53:20,000 --> 00:53:24,480
Speaker 1: where audio files in particular were really complaining that everything

889
00:53:24,520 --> 00:53:27,480
Speaker 1: that was being laid down had so much compression in

890
00:53:27,520 --> 00:53:30,920
Speaker 1: it that there was no real dynamic range and audio,

891
00:53:31,920 --> 00:53:34,320
Speaker 1: and it just meant that the music wasn't as interesting,

892
00:53:34,440 --> 00:53:39,640
Speaker 1: like there wasn't there wasn't enough variation, and it makes

893
00:53:39,920 --> 00:53:43,680
Speaker 1: music kind of boring. Uh, it wouldn't matter if you

894
00:53:43,760 --> 00:53:47,600
Speaker 1: had an analog pressing of a digital recording session, because

895
00:53:47,640 --> 00:53:52,160
Speaker 1: analog does not magically fix the problems of the recording process.

896
00:53:52,200 --> 00:53:55,640
Speaker 1: So if you're recording digitally, and then you make a

897
00:53:55,719 --> 00:53:59,839
Speaker 1: vinyl record pressing of that digital recording. I mean, all

898
00:53:59,840 --> 00:54:02,640
Speaker 1: that processing you did on the digital side, that's still

899
00:54:02,680 --> 00:54:06,960
Speaker 1: going to be part of what ends up being recorded

900
00:54:07,000 --> 00:54:11,240
Speaker 1: on the vinyl. It's it's not like vinyl suddenly cures

901
00:54:11,280 --> 00:54:15,280
Speaker 1: all sins of digital. So even if you were to

902
00:54:15,280 --> 00:54:19,400
Speaker 1: to go with analog audio media, you would still have

903
00:54:19,440 --> 00:54:23,920
Speaker 1: the same problems that were introduced in the digital processing. Now,

904
00:54:24,360 --> 00:54:27,040
Speaker 1: this does not mean that all digital to audio is

905
00:54:27,080 --> 00:54:31,560
Speaker 1: inherently flawed. Even if we just look at the analog chain,

906
00:54:31,840 --> 00:54:35,200
Speaker 1: we have to acknowledge that the process of recording in

907
00:54:35,280 --> 00:54:38,880
Speaker 1: playback means, you know, you're taking pressure waves of the

908
00:54:38,880 --> 00:54:42,160
Speaker 1: original sound, you pass them through a system that converts

909
00:54:42,200 --> 00:54:45,520
Speaker 1: those pressure waves into an analog electric signal, and then

910
00:54:45,520 --> 00:54:48,880
Speaker 1: you've got to reverse that process during playback, and stuff

911
00:54:48,960 --> 00:54:52,840
Speaker 1: can happen along that pathway that could affect either the

912
00:54:52,880 --> 00:54:56,600
Speaker 1: recording process or the playback process or both. So, in

913
00:54:56,640 --> 00:55:01,800
Speaker 1: other words, analog does not necessarily mean better, because flaws

914
00:55:01,880 --> 00:55:05,000
Speaker 1: can exist in the analog approach just as they can

915
00:55:05,080 --> 00:55:08,400
Speaker 1: with the digital approach. And there are other elements as well,

916
00:55:08,520 --> 00:55:12,359
Speaker 1: such as low level noise. Analog systems can introduce a

917
00:55:12,400 --> 00:55:17,600
Speaker 1: low level noise into a signal. Digital avoids that. Now,

918
00:55:17,640 --> 00:55:20,200
Speaker 1: that does not mean that digital is better, mind you,

919
00:55:20,280 --> 00:55:23,120
Speaker 1: because there are other ways to reduce and eliminate noise

920
00:55:23,120 --> 00:55:27,239
Speaker 1: and analog systems and digital can introduce other artifacts that

921
00:55:27,360 --> 00:55:30,560
Speaker 1: didn't exist in the original signal, and then that comes

922
00:55:30,560 --> 00:55:34,719
Speaker 1: across as like errors in your playback, Like you might

923
00:55:34,760 --> 00:55:37,480
Speaker 1: hear some weird blip noise and think, what the heck

924
00:55:37,560 --> 00:55:40,840
Speaker 1: was that, and it wasn't necessarily present in the original

925
00:55:40,880 --> 00:55:45,359
Speaker 1: recording session, but was introduced as a digital artifact. This

926
00:55:45,440 --> 00:55:49,359
Speaker 1: is just another example of how one format is not

927
00:55:49,600 --> 00:55:52,839
Speaker 1: necessarily superior to the other. It depends on way too

928
00:55:52,840 --> 00:55:58,040
Speaker 1: many other factors. They're just different. And honestly, I'm fairly

929
00:55:58,160 --> 00:56:02,040
Speaker 1: confident that if you were to do a double blind test,

930
00:56:02,600 --> 00:56:05,840
Speaker 1: and just in case you're unfamiliar with that term, double

931
00:56:05,840 --> 00:56:09,560
Speaker 1: blind is a type of scientific test where neither the

932
00:56:09,600 --> 00:56:12,560
Speaker 1: subject that's going through the test nor the person who

933
00:56:12,680 --> 00:56:18,280
Speaker 1: is in charge of administering the test knows which version

934
00:56:18,480 --> 00:56:20,480
Speaker 1: anyone is getting. So if you have a control group,

935
00:56:20,719 --> 00:56:24,680
Speaker 1: the person administering the test doesn't know if that's a

936
00:56:24,719 --> 00:56:27,920
Speaker 1: control group or if it's an actual test group. That

937
00:56:28,000 --> 00:56:31,240
Speaker 1: they're testing at any given time. That way, the person

938
00:56:31,280 --> 00:56:35,000
Speaker 1: administering the test does not give bias to the person

939
00:56:35,040 --> 00:56:38,040
Speaker 1: who's experiencing the test. The thought is, if I know

940
00:56:38,200 --> 00:56:41,960
Speaker 1: as an administrator, then I might give a tell to

941
00:56:42,080 --> 00:56:44,839
Speaker 1: the test subject. Right. So let's say it's a double

942
00:56:44,840 --> 00:56:47,520
Speaker 1: blind test and the audio files are going into a

943
00:56:47,600 --> 00:56:51,279
Speaker 1: room and they're going to experience the same piece of

944
00:56:51,320 --> 00:56:56,440
Speaker 1: audio recording, but it's over different systems. So like some

945
00:56:56,520 --> 00:57:00,279
Speaker 1: of the pieces of the same stretch of audio are

946
00:57:00,360 --> 00:57:04,120
Speaker 1: analog sources, some of them are digital sources. They might

947
00:57:04,160 --> 00:57:07,520
Speaker 1: even include different like systems like premium systems, like super

948
00:57:07,680 --> 00:57:11,000
Speaker 1: super high end systems that cost maybe upwards of hundreds

949
00:57:11,040 --> 00:57:14,040
Speaker 1: of thousands of dollars, and maybe just on some that

950
00:57:14,040 --> 00:57:18,040
Speaker 1: are really good systems, like you know, they're still expensive.

951
00:57:18,120 --> 00:57:21,280
Speaker 1: Maybe it's a few thousand dollars, but they're not, you know,

952
00:57:21,440 --> 00:57:25,880
Speaker 1: monumentally expensive. My bet is that most audio files would

953
00:57:25,880 --> 00:57:30,200
Speaker 1: have trouble picking out which ones are analog systems versus

954
00:57:30,240 --> 00:57:33,680
Speaker 1: digital systems unless there's some giveaway, like if you hear

955
00:57:33,720 --> 00:57:38,080
Speaker 1: the scratch of a needle hitting record, then that's kind

956
00:57:38,120 --> 00:57:40,400
Speaker 1: of a dead giveaway. But let's say you know you're

957
00:57:40,440 --> 00:57:43,560
Speaker 1: you're talking about, like the highest of high ends. I

958
00:57:43,600 --> 00:57:45,280
Speaker 1: don't think they would be able to tell the difference

959
00:57:45,480 --> 00:57:48,800
Speaker 1: very easily. And that's because our approach to digital processing

960
00:57:48,880 --> 00:57:52,480
Speaker 1: has become sophisticated enough that to our human ears, it's

961
00:57:52,480 --> 00:57:57,840
Speaker 1: pretty close to an analog signal. And that you know. Also,

962
00:57:57,840 --> 00:58:01,000
Speaker 1: I want to mention the returns on those high end

963
00:58:01,080 --> 00:58:05,880
Speaker 1: audio equipment, like the differences that you start to see

964
00:58:05,920 --> 00:58:09,920
Speaker 1: when you're really hitting that upper echelon of audio equipment.

965
00:58:10,280 --> 00:58:13,440
Speaker 1: Some of those returns are so minor that after you

966
00:58:13,480 --> 00:58:17,840
Speaker 1: reach a certain point, they are largely meaningless. Like like,

967
00:58:18,160 --> 00:58:20,720
Speaker 1: as far as perception goes, you wouldn't be able to

968
00:58:20,760 --> 00:58:24,320
Speaker 1: tell the difference. Uh, And for people like me, people

969
00:58:24,360 --> 00:58:27,400
Speaker 1: who have had some hearing loss, it matters even less

970
00:58:27,400 --> 00:58:30,920
Speaker 1: than that, right because like for me, like you could

971
00:58:30,920 --> 00:58:33,760
Speaker 1: almost say, it's like I have an unsophisticated palette, Like

972
00:58:33,920 --> 00:58:36,960
Speaker 1: you could serve me an amazing meal, but I'm not

973
00:58:37,080 --> 00:58:39,920
Speaker 1: likely to notice it being any better than you know,

974
00:58:40,400 --> 00:58:46,360
Speaker 1: a cheeseburger. But again, this is my hypothesis. I I

975
00:58:46,440 --> 00:58:50,760
Speaker 1: believe this is probably true. It is entirely possible, and

976
00:58:50,840 --> 00:58:53,840
Speaker 1: I admit this that if I actually were to conduct

977
00:58:54,040 --> 00:58:56,600
Speaker 1: this kind of study, I might find that I'm totally

978
00:58:56,600 --> 00:58:59,240
Speaker 1: wrong that the audio files are like no, that is

979
00:58:59,280 --> 00:59:02,880
Speaker 1: clearly the premium, and maybe the differences are subtle, but

980
00:59:03,000 --> 00:59:06,120
Speaker 1: maybe they're detectable. Right, It could be that I'm wrong

981
00:59:06,160 --> 00:59:09,120
Speaker 1: about that. Uh. I just think that there gets to

982
00:59:09,160 --> 00:59:14,919
Speaker 1: be a point where people start to buy into a philosophy,

983
00:59:15,080 --> 00:59:20,840
Speaker 1: especially with audio files that isn't necessarily supportable by you know,

984
00:59:21,160 --> 00:59:25,360
Speaker 1: quantifiable evidence. It becomes so subjective that once you remove

985
00:59:25,400 --> 00:59:28,240
Speaker 1: the subjective element, like you remove the ability for them

986
00:59:28,320 --> 00:59:31,040
Speaker 1: to know whether or not they're listening to their preferred

987
00:59:31,080 --> 00:59:35,000
Speaker 1: set up, that it starts to disappear. Maybe I'm wrong

988
00:59:35,040 --> 00:59:38,400
Speaker 1: about that. I don't think I am, But that's the

989
00:59:38,480 --> 00:59:41,560
Speaker 1: overview of analog and digital and why you have to

990
00:59:41,600 --> 00:59:44,200
Speaker 1: have the converters. As I said, to get into specifics

991
00:59:44,640 --> 00:59:47,320
Speaker 1: would take more time, you know, talking about delta Sigma

992
00:59:47,360 --> 00:59:50,640
Speaker 1: processing and that kind of stuff. But if you want it,

993
00:59:51,160 --> 00:59:53,600
Speaker 1: let me know and I will try and put that

994
00:59:53,680 --> 00:59:58,080
Speaker 1: episode together. It will just be far more niche oriented

995
00:59:58,120 --> 01:00:02,000
Speaker 1: than even this one was. B It is a fascinating subject,

996
01:00:02,120 --> 01:00:06,160
Speaker 1: like there's some really cool technology that goes into making

997
01:00:06,200 --> 01:00:10,400
Speaker 1: this all work, and the fact that that technology does

998
01:00:10,480 --> 01:00:12,920
Speaker 1: work and that it has become so sophisticated is why

999
01:00:12,960 --> 01:00:16,880
Speaker 1: I feel pretty confident in saying that with a sufficiently

1000
01:00:16,960 --> 01:00:19,280
Speaker 1: good system, you wouldn't be able to tell the difference.

1001
01:00:19,840 --> 01:00:22,920
Speaker 1: But that's it for this episode. If you would like

1002
01:00:23,080 --> 01:00:25,760
Speaker 1: me to cover any kind of topic, whatever it might

1003
01:00:25,800 --> 01:00:28,200
Speaker 1: be in the tech world, let me know. The best

1004
01:00:28,200 --> 01:00:30,560
Speaker 1: way to get in touch is on Twitter. The handle

1005
01:00:30,600 --> 01:00:34,040
Speaker 1: for the show is tech Stuff h SW and I

1006
01:00:34,040 --> 01:00:37,439
Speaker 1: greatly appreciate it. I'm getting some wonderful suggestions. Really makes

1007
01:00:37,440 --> 01:00:39,640
Speaker 1: my job easier because I know exactly what people want

1008
01:00:39,680 --> 01:00:43,360
Speaker 1: to hear um. So yeah, reach out and let me

1009
01:00:43,360 --> 01:00:46,360
Speaker 1: know what you think and I'll talk to you again

1010
01:00:47,240 --> 01:00:55,160
Speaker 1: really soon. Tech Stuff is an I Heart Radio production.

1011
01:00:55,400 --> 01:00:58,240
Speaker 1: For more podcasts from my Heart Radio, visit the i

1012
01:00:58,320 --> 01:01:01,439
Speaker 1: Heart Radio app, Apple pine Casts, or wherever you listen

1013
01:01:01,480 --> 01:01:02,520
Speaker 1: to your favorite shows.