WEBVTT - Musical Analysis at Moogfest

0:00:04.519 --> 0:00:12.760
<v Speaker 1>Technology with tech Stuff from stuff works dot com. Hey there,

0:00:12.800 --> 0:00:17.080
<v Speaker 1>and welcome to tech Stuff. I'm your host, Jonathan Strickland.

0:00:17.200 --> 0:00:20.279
<v Speaker 1>I'm a senior writer with how stuff works dot com.

0:00:20.360 --> 0:00:23.360
<v Speaker 1>I talk about all things tech and today we're gonna

0:00:23.400 --> 0:00:26.840
<v Speaker 1>get a little musical with things and get a little

0:00:26.880 --> 0:00:31.920
<v Speaker 1>help from our buddy Noel. Noel, who is the producer extraordinary.

0:00:32.000 --> 0:00:36.440
<v Speaker 1>He's the head of of of podcast production here at

0:00:36.440 --> 0:00:39.159
<v Speaker 1>how stuff Works, also one of the co hosts of

0:00:39.200 --> 0:00:41.800
<v Speaker 1>Stuff they Don't Want You to Know. Noel went to

0:00:41.880 --> 0:00:45.599
<v Speaker 1>mog Fest in and and got the chance to talk

0:00:45.600 --> 0:00:48.600
<v Speaker 1>to a whole bunch of really cool people, including Alexander

0:00:48.720 --> 0:00:51.960
<v Speaker 1>Lurch and we'll hear more about that a little bit

0:00:52.000 --> 0:00:56.920
<v Speaker 1>later in this podcast. Mog Fest ostensibly is about music

0:00:56.960 --> 0:01:00.600
<v Speaker 1>and technology, but it actually involves lot lots of other

0:01:00.680 --> 0:01:05.080
<v Speaker 1>stuff to not just not just those two already broad fields,

0:01:05.120 --> 0:01:09.680
<v Speaker 1>but other ones as well, including elements of philosophy and

0:01:09.680 --> 0:01:13.160
<v Speaker 1>and even particle physics. Will have an episode in the

0:01:13.200 --> 0:01:18.200
<v Speaker 1>near future that will include some elements from uh interviews

0:01:18.240 --> 0:01:21.360
<v Speaker 1>we had with folks from the Large Hadron Collider. So

0:01:21.480 --> 0:01:24.800
<v Speaker 1>mog Fest has all sorts of really smart, talented people

0:01:24.840 --> 0:01:30.320
<v Speaker 1>getting together and having these incredible symposia and and and performances,

0:01:30.360 --> 0:01:32.880
<v Speaker 1>And so Noel was able to go and talk with

0:01:32.920 --> 0:01:36.880
<v Speaker 1>someone about some really cool stuff, and that kind of

0:01:36.880 --> 0:01:40.320
<v Speaker 1>ties into what I wanted to chat about today. You know,

0:01:40.360 --> 0:01:42.920
<v Speaker 1>once upon a time here at How Stuff Works, we

0:01:42.959 --> 0:01:45.520
<v Speaker 1>had a show called Stuff from the B Side, and

0:01:45.640 --> 0:01:49.360
<v Speaker 1>this was a podcast all about music. Episodes focused on

0:01:49.800 --> 0:01:53.760
<v Speaker 1>everything musical, including elements that are more general concepts or

0:01:53.800 --> 0:01:58.080
<v Speaker 1>philosophical ideas. And music and technology are two things that

0:01:58.200 --> 0:02:03.080
<v Speaker 1>really do closely tied together. After all, almost every musical

0:02:03.160 --> 0:02:07.600
<v Speaker 1>instrument is some form of technology, ranging from the relatively

0:02:07.720 --> 0:02:11.720
<v Speaker 1>primitive versions of certain percussive instruments all the way up

0:02:11.760 --> 0:02:14.840
<v Speaker 1>to high tech digital rigs. So I thought it might

0:02:14.840 --> 0:02:16.720
<v Speaker 1>be cool to revisit music and tech and look at

0:02:16.720 --> 0:02:22.400
<v Speaker 1>a particular subset of it, musical analysis and music generation. Now,

0:02:22.480 --> 0:02:26.520
<v Speaker 1>music analysis and technology are also related in that we

0:02:26.639 --> 0:02:30.600
<v Speaker 1>now have various automated recommendation engines that will suggest music

0:02:30.680 --> 0:02:33.320
<v Speaker 1>for us to listen to based upon what we've already

0:02:33.320 --> 0:02:36.400
<v Speaker 1>said we enjoy. Now these engines look for new pieces

0:02:36.440 --> 0:02:39.560
<v Speaker 1>of music that in some way match criteria we seem

0:02:39.639 --> 0:02:43.400
<v Speaker 1>to find appealing. We have indicated to that service that

0:02:43.480 --> 0:02:46.079
<v Speaker 1>we like that particular type of music, so it starts

0:02:46.080 --> 0:02:48.720
<v Speaker 1>to try and find matches that kind of follow in

0:02:48.760 --> 0:02:51.600
<v Speaker 1>the same lines. As they become more adept at figuring

0:02:51.600 --> 0:02:54.600
<v Speaker 1>out what qualities we really enjoy, they can hone in

0:02:54.639 --> 0:02:57.400
<v Speaker 1>on songs that appeal to us, perhaps even changing them

0:02:57.520 --> 0:03:00.080
<v Speaker 1>up based upon other criterias, which is the time of

0:03:00.160 --> 0:03:03.079
<v Speaker 1>day or an activity. We're doing so, for example, with

0:03:03.200 --> 0:03:06.920
<v Speaker 1>Google Music, and this show is not sponsored by Google

0:03:07.000 --> 0:03:10.280
<v Speaker 1>Music or anything of that nature, but it will detect

0:03:10.360 --> 0:03:14.160
<v Speaker 1>if I'm on my way somewhere. It might suggest music

0:03:14.240 --> 0:03:17.480
<v Speaker 1>that would be conducive to a trip, or if it

0:03:17.560 --> 0:03:20.840
<v Speaker 1>knows that I'm at the gym, it may suggest music

0:03:20.919 --> 0:03:23.799
<v Speaker 1>that's good for keeping my heart rate up, stuff like that.

0:03:24.600 --> 0:03:28.639
<v Speaker 1>So we'll just imagine a hypothetical situation. I've just woken

0:03:28.720 --> 0:03:31.640
<v Speaker 1>up and the recommendation engine might find some peppy music

0:03:31.680 --> 0:03:34.800
<v Speaker 1>to get me on my way. So Google Music is saying, hey,

0:03:34.840 --> 0:03:37.440
<v Speaker 1>it's Monday morning, you need all the help you can get.

0:03:37.600 --> 0:03:40.200
<v Speaker 1>Here's a radio station based off the song Walking on

0:03:40.320 --> 0:03:43.640
<v Speaker 1>Sunshine by Katrina and the Waves. And then my phone

0:03:43.680 --> 0:03:46.280
<v Speaker 1>detects that I'm going to the gem, so then the

0:03:46.360 --> 0:03:49.840
<v Speaker 1>music engine switches to the song's meant to keep me

0:03:49.880 --> 0:03:51.960
<v Speaker 1>moving at a particular pace while I desperately try to

0:03:52.000 --> 0:03:55.760
<v Speaker 1>find the exit to the gym. I'm sorry, I'm uh

0:03:55.960 --> 0:03:59.160
<v Speaker 1>to actually work out. So in that case, it's probably

0:03:59.600 --> 0:04:03.200
<v Speaker 1>you know, something with a nice driving beat a good

0:04:03.240 --> 0:04:07.040
<v Speaker 1>tempo to it. These are basic things that music engines

0:04:07.080 --> 0:04:08.920
<v Speaker 1>can do now, but the reason they can do them

0:04:08.960 --> 0:04:12.720
<v Speaker 1>at all is because of music analysis. This isn't always

0:04:12.800 --> 0:04:16.560
<v Speaker 1>done in an automated fashion. In fact, automating music analysis

0:04:16.680 --> 0:04:19.919
<v Speaker 1>is pretty tricky. Sometimes it relies instead on just a

0:04:20.000 --> 0:04:23.440
<v Speaker 1>lot of work, and that's work done by real, live,

0:04:23.720 --> 0:04:29.280
<v Speaker 1>human beings. So let's take the Music Genome Project for example.

0:04:29.520 --> 0:04:33.160
<v Speaker 1>This is the database that the internet radio service Pandora

0:04:33.320 --> 0:04:36.040
<v Speaker 1>relies upon when it creates a radio station based off

0:04:36.080 --> 0:04:39.320
<v Speaker 1>an artist or a song that you've submitted as the

0:04:39.440 --> 0:04:43.200
<v Speaker 1>seed for a new channel. For more than ten years,

0:04:43.240 --> 0:04:47.640
<v Speaker 1>Pandora's staff have analyzed and categorized music, breaking down songs

0:04:47.880 --> 0:04:51.400
<v Speaker 1>into all the basic components, which they call genes. These

0:04:51.440 --> 0:04:56.440
<v Speaker 1>are the elements that make songs what they are. And

0:04:56.960 --> 0:05:00.799
<v Speaker 1>I find this approach both fascinating and and a little odd,

0:05:01.720 --> 0:05:04.719
<v Speaker 1>because in a way, it seems a little weird to

0:05:04.760 --> 0:05:09.000
<v Speaker 1>take a really awesome song. Let's say it's um Blue

0:05:09.000 --> 0:05:12.279
<v Speaker 1>Oyster Cults, Don't Fear the Reaper, one of the best

0:05:12.279 --> 0:05:15.320
<v Speaker 1>songs ever written. And then you have to sift it

0:05:15.400 --> 0:05:19.320
<v Speaker 1>down to all those little basic components, those genes that

0:05:19.400 --> 0:05:22.760
<v Speaker 1>make up that song. It also reinforces this notion that

0:05:22.800 --> 0:05:25.520
<v Speaker 1>a song is more than just the sum of all

0:05:25.600 --> 0:05:28.680
<v Speaker 1>its parts. If you were to look at those components

0:05:28.680 --> 0:05:31.240
<v Speaker 1>and attempt to make a song that included all of them,

0:05:31.279 --> 0:05:33.400
<v Speaker 1>I bet it wouldn't be half as awesome as Don't

0:05:33.400 --> 0:05:37.880
<v Speaker 1>Fear the Reaper. So you take a song, you identify

0:05:38.000 --> 0:05:41.200
<v Speaker 1>all these different qualities of it, and may involve things

0:05:41.320 --> 0:05:46.120
<v Speaker 1>like the tempo of the song, the the the structure

0:05:46.160 --> 0:05:49.040
<v Speaker 1>of it, as far as versus and choruses are concerned,

0:05:49.640 --> 0:05:52.520
<v Speaker 1>the whether what kind of vocalists there are, what kind

0:05:52.520 --> 0:05:57.040
<v Speaker 1>of instruments are used, all of these different individual, tiny

0:05:57.120 --> 0:06:00.680
<v Speaker 1>components of the song, and you put them into say spreadsheet,

0:06:01.120 --> 0:06:06.719
<v Speaker 1>and that represents the collection of genes that are possessed

0:06:06.800 --> 0:06:10.040
<v Speaker 1>by Don't Fear the Reaper. You take that same collection,

0:06:10.160 --> 0:06:11.960
<v Speaker 1>you give them to a musician and say, I want

0:06:12.000 --> 0:06:14.200
<v Speaker 1>you to write me a song that has all of

0:06:14.240 --> 0:06:18.600
<v Speaker 1>these components in it. Well, again, probably not gonna get

0:06:18.640 --> 0:06:21.039
<v Speaker 1>Don't Fear the Reaper. You'll get something, and maybe it

0:06:21.040 --> 0:06:23.159
<v Speaker 1>will be good. Maybe it'll even be better than Don't

0:06:23.160 --> 0:06:27.960
<v Speaker 1>Fear the Reaper. I doubt it, but yeah, there's there's

0:06:27.960 --> 0:06:35.080
<v Speaker 1>something magical or apparently magical about music that transcends the

0:06:35.200 --> 0:06:40.400
<v Speaker 1>quantitative elements that we can list now. Pandora's Music Genome

0:06:40.480 --> 0:06:45.400
<v Speaker 1>project identifies four hundred fifty different musical attributes or genes.

0:06:45.880 --> 0:06:49.039
<v Speaker 1>They include lots of different types of data. Some of

0:06:49.080 --> 0:06:51.719
<v Speaker 1>them are relatively straightforward, such as does the song have

0:06:51.800 --> 0:06:54.480
<v Speaker 1>a vocalist? If it does have a vocalist, is it

0:06:54.520 --> 0:06:58.040
<v Speaker 1>a male vocalist or a female vocalist? Are there multiple vocalists?

0:06:58.640 --> 0:07:01.360
<v Speaker 1>Then starts getting way more granular. So if a song

0:07:01.400 --> 0:07:04.720
<v Speaker 1>has electric guitar, for example, there might be a subset

0:07:04.800 --> 0:07:08.279
<v Speaker 1>of information about that, such as how much distortion is

0:07:08.360 --> 0:07:10.800
<v Speaker 1>on that guitar? Does it have a lot of distortion

0:07:10.840 --> 0:07:13.520
<v Speaker 1>in this song or not a lot? And so you

0:07:13.600 --> 0:07:17.920
<v Speaker 1>start to subdivide down the line. Same thing is true

0:07:17.920 --> 0:07:20.880
<v Speaker 1>for other instruments as well. Now, not all songs have

0:07:21.160 --> 0:07:24.320
<v Speaker 1>the same number of genes, meaning some genres of music

0:07:24.320 --> 0:07:28.000
<v Speaker 1>are actually easier to describe with a fewer terms than others.

0:07:28.480 --> 0:07:32.720
<v Speaker 1>For example, rock songs have about one fifty genes. You

0:07:32.720 --> 0:07:34.920
<v Speaker 1>can break down your rock song into about a hundred

0:07:34.960 --> 0:07:39.280
<v Speaker 1>fifty different little individual components. Rap songs are more like

0:07:39.360 --> 0:07:42.440
<v Speaker 1>three d fifty. So that indicates that there are gradations

0:07:42.440 --> 0:07:47.560
<v Speaker 1>and variations between different songs within the same genre. Uh So,

0:07:47.840 --> 0:07:51.119
<v Speaker 1>to make a recommendation engine, you first have to put

0:07:51.160 --> 0:07:55.920
<v Speaker 1>all the music within the library. Through this process, you

0:07:55.960 --> 0:07:59.120
<v Speaker 1>need to identify the important qualities that make the music

0:07:59.280 --> 0:08:01.760
<v Speaker 1>what it is is. And you could use something like

0:08:01.800 --> 0:08:03.880
<v Speaker 1>a spreadsheet and you lay it all out, and then

0:08:03.920 --> 0:08:06.400
<v Speaker 1>when someone wants to make a new radio station off

0:08:06.440 --> 0:08:09.880
<v Speaker 1>of a song, you can use that song's genome all

0:08:09.920 --> 0:08:13.680
<v Speaker 1>the jenes listed for that specific song to guide a

0:08:13.760 --> 0:08:17.120
<v Speaker 1>decision engine to pick other songs that are similar to

0:08:17.160 --> 0:08:20.440
<v Speaker 1>the first one within a certain degree. So you could

0:08:20.480 --> 0:08:23.920
<v Speaker 1>set this dynamically in your search engine. Right Like, let's

0:08:23.920 --> 0:08:26.880
<v Speaker 1>say that you are the one designing the new, latest

0:08:26.920 --> 0:08:30.720
<v Speaker 1>and greatest version of Pandora, and you've got this enormous

0:08:30.840 --> 0:08:34.600
<v Speaker 1>database of music that's all been analyzed by professionals. We're

0:08:34.600 --> 0:08:38.040
<v Speaker 1>talking about actual musicians and musicologists who have listened to

0:08:38.040 --> 0:08:41.559
<v Speaker 1>the music, broken it down into its basic elements identified

0:08:41.559 --> 0:08:45.640
<v Speaker 1>all of them, and someone has joined your service and

0:08:45.679 --> 0:08:48.960
<v Speaker 1>they say, I'm going to make a radio station based

0:08:49.040 --> 0:08:52.640
<v Speaker 1>off the song, Uh, the statue got me high by

0:08:52.640 --> 0:08:58.520
<v Speaker 1>they might be giants. You would end up accessing the database,

0:08:59.520 --> 0:09:02.160
<v Speaker 1>pulling the record for the statue that got me high,

0:09:02.360 --> 0:09:05.840
<v Speaker 1>looking at all the genes that are associated with that,

0:09:06.240 --> 0:09:08.560
<v Speaker 1>and then you would look for a certain percentage of

0:09:08.600 --> 0:09:11.640
<v Speaker 1>similarity with other songs, like are there other songs that

0:09:11.720 --> 0:09:17.600
<v Speaker 1>have the same genes as this song does? If so,

0:09:18.040 --> 0:09:21.000
<v Speaker 1>serve it up see if the person likes it. You

0:09:21.080 --> 0:09:23.960
<v Speaker 1>might set the threshold higher or lower. If it's a

0:09:24.040 --> 0:09:27.960
<v Speaker 1>song that's particularly avant garde. There may not be a

0:09:28.000 --> 0:09:33.240
<v Speaker 1>lot of other songs that strongly resemble your original, so

0:09:33.360 --> 0:09:36.600
<v Speaker 1>you have to kind of play fast and loose with this. Now,

0:09:37.280 --> 0:09:42.560
<v Speaker 1>an important component of this service is user feedback. Services

0:09:42.600 --> 0:09:46.120
<v Speaker 1>like Pandora nearly always include a method for users to

0:09:46.200 --> 0:09:49.720
<v Speaker 1>indicate if they like or don't like a particular song.

0:09:50.320 --> 0:09:54.480
<v Speaker 1>The recommendation engine uses that data to fine tune its selections.

0:09:54.520 --> 0:09:57.679
<v Speaker 1>No two songs are going to be exactly alike, so

0:09:57.720 --> 0:10:01.160
<v Speaker 1>it may be that the ways the news song deviated

0:10:01.200 --> 0:10:05.000
<v Speaker 1>from your seed songs format were the parts that made

0:10:05.040 --> 0:10:09.199
<v Speaker 1>you detest it, So it could have been that the

0:10:09.200 --> 0:10:13.400
<v Speaker 1>the the engine said, well, this song resembles the seed song,

0:10:13.480 --> 0:10:17.319
<v Speaker 1>the original tune of the way. Let's serve it up

0:10:17.440 --> 0:10:19.959
<v Speaker 1>and you listen to it for like three seconds, you say, no,

0:10:20.360 --> 0:10:22.719
<v Speaker 1>this is this is not what I want. You give

0:10:22.760 --> 0:10:25.960
<v Speaker 1>it a thumbs down. The algorithm might say, all right, well,

0:10:26.040 --> 0:10:28.720
<v Speaker 1>I'm gonna keep note of where it was the same

0:10:28.760 --> 0:10:31.560
<v Speaker 1>and where it was different from that original song. Meanwhile,

0:10:31.600 --> 0:10:35.920
<v Speaker 1>I'll serve up this next song that has similarity. And

0:10:35.960 --> 0:10:37.760
<v Speaker 1>if you say, yeah, that's a good song. I really

0:10:37.760 --> 0:10:39.640
<v Speaker 1>like it, and you give it the thumbs up, then

0:10:39.679 --> 0:10:43.000
<v Speaker 1>the recommendation engine starts looking at the differences between the

0:10:43.040 --> 0:10:46.280
<v Speaker 1>song you said no two and the song you said

0:10:46.360 --> 0:10:50.760
<v Speaker 1>yeah too, and it starts to identify stuff that you

0:10:50.880 --> 0:10:54.079
<v Speaker 1>might not even be aware you don't like. It might

0:10:54.120 --> 0:10:57.280
<v Speaker 1>be certain elements of songs, and the recommendation engine has

0:10:57.280 --> 0:11:01.080
<v Speaker 1>figured it out. Maybe it's figured out, oh uh, Jonathan

0:11:01.200 --> 0:11:05.840
<v Speaker 1>really doesn't like it when there's a clarinet in the

0:11:05.960 --> 0:11:10.319
<v Speaker 1>song for no reason, but he isn't able to vocalize

0:11:10.360 --> 0:11:13.760
<v Speaker 1>that he doesn't he's not aware of it consciously, but

0:11:13.880 --> 0:11:16.439
<v Speaker 1>every time it's popping up he's saying no to that song,

0:11:16.520 --> 0:11:19.280
<v Speaker 1>So we're gonna We're gonna put the kai bosh on

0:11:19.320 --> 0:11:21.959
<v Speaker 1>the clarinet from here on out. That was just a

0:11:22.320 --> 0:11:25.360
<v Speaker 1>random example. I don't I don't have a hatred of

0:11:25.400 --> 0:11:29.600
<v Speaker 1>the clarinet, but it is a way for the engine

0:11:29.600 --> 0:11:32.040
<v Speaker 1>to work with the user in order to get a

0:11:32.080 --> 0:11:34.840
<v Speaker 1>better understanding of the type of songs that it should

0:11:34.920 --> 0:11:38.560
<v Speaker 1>serve up to you. Now, there are plenty of other

0:11:38.600 --> 0:11:45.640
<v Speaker 1>ways to analyze and describe music besides this genetic approach.

0:11:45.720 --> 0:11:49.880
<v Speaker 1>There are entire courses dedicated to this. Musicology is a

0:11:50.000 --> 0:11:53.720
<v Speaker 1>rich and interesting field, and some of these approaches go

0:11:53.800 --> 0:11:58.280
<v Speaker 1>beyond the components that are directly perceptible. These analytic methods

0:11:58.320 --> 0:12:01.720
<v Speaker 1>try to capture the essence of the feel of music.

0:12:02.000 --> 0:12:05.640
<v Speaker 1>For example, if you take a bunch of components individually,

0:12:06.080 --> 0:12:10.640
<v Speaker 1>you might quantitatively describe the music with accuracy, but you

0:12:10.720 --> 0:12:15.679
<v Speaker 1>can't capture how they collectively create a particular effect. Perceptual

0:12:15.720 --> 0:12:19.680
<v Speaker 1>analysis attempts to bring human perception and emotional reaction into

0:12:19.679 --> 0:12:24.240
<v Speaker 1>account with everything else. But why is the Music Genome

0:12:24.280 --> 0:12:27.720
<v Speaker 1>project powered by humans? Why is Pandora using actual human

0:12:27.760 --> 0:12:30.120
<v Speaker 1>beings to listen to music and then write out all

0:12:30.160 --> 0:12:34.760
<v Speaker 1>these genes, couldn't you find some easier way? Well? Listening

0:12:34.800 --> 0:12:37.439
<v Speaker 1>to music and being able to describe its structure beyond

0:12:37.520 --> 0:12:42.720
<v Speaker 1>some relatively simple angles is a particularly tricky computational problem.

0:12:42.840 --> 0:12:46.360
<v Speaker 1>It's something that's easy for humans and hard for machines.

0:12:46.960 --> 0:12:50.160
<v Speaker 1>In two thousand five, Way Chai of m I T

0:12:50.400 --> 0:12:54.640
<v Speaker 1>wrote a paper titled Automated Analysis of Musical Structure in

0:12:54.679 --> 0:12:57.800
<v Speaker 1>which she laid out the challenges of creating an automatic

0:12:57.800 --> 0:13:01.679
<v Speaker 1>approach to analyzing music. Her pay Earth is nineties six

0:13:01.720 --> 0:13:03.760
<v Speaker 1>pages long, and that kind of gives you an idea

0:13:03.760 --> 0:13:06.600
<v Speaker 1>of how complicated a problem this is that we're talking

0:13:06.640 --> 0:13:12.400
<v Speaker 1>about here. China's team relied on music cognition, machine learning,

0:13:12.480 --> 0:13:16.280
<v Speaker 1>and signal processing to segment and analyze pieces of music,

0:13:16.640 --> 0:13:20.720
<v Speaker 1>with the goal of isolating and analyzing the recurrent structures

0:13:20.760 --> 0:13:23.040
<v Speaker 1>of a piece. You know, the whole verse, course, verse,

0:13:23.920 --> 0:13:27.559
<v Speaker 1>all my fellow Pixies fans out there, the chord progression

0:13:27.840 --> 0:13:31.800
<v Speaker 1>or key changes that are present in music. Identifying parts

0:13:31.920 --> 0:13:34.960
<v Speaker 1>of a piece that make it representative of the whole.

0:13:35.000 --> 0:13:38.280
<v Speaker 1>In other words, finding that hook or finding that element

0:13:38.280 --> 0:13:41.000
<v Speaker 1>of a song that make it stand out. China's team

0:13:41.000 --> 0:13:42.720
<v Speaker 1>had to figure out how to make a machine do

0:13:42.880 --> 0:13:45.960
<v Speaker 1>stuff that we tend to do naturally, even without the

0:13:45.960 --> 0:13:49.600
<v Speaker 1>benefit of formal musical training. So, for example, I have

0:13:49.720 --> 0:13:55.360
<v Speaker 1>never taken any class beyond music appreciation, which is about

0:13:55.440 --> 0:13:58.280
<v Speaker 1>as one oh one as you get, and yet I

0:13:58.320 --> 0:14:03.280
<v Speaker 1>am able to voke realize certain things about music easily.

0:14:03.320 --> 0:14:07.000
<v Speaker 1>I can recognize these differences, things that a computer cannot

0:14:07.080 --> 0:14:10.040
<v Speaker 1>natively do without all and it requires a whole lot

0:14:10.080 --> 0:14:13.400
<v Speaker 1>of work. The whole paper is available to read online.

0:14:13.640 --> 0:14:16.840
<v Speaker 1>It's really interesting. I recommend checking it out. There's a

0:14:16.880 --> 0:14:19.240
<v Speaker 1>PDF you can just download for free and read over it,

0:14:19.280 --> 0:14:22.720
<v Speaker 1>and it's fascinating. It delves into not just the programming

0:14:22.800 --> 0:14:26.960
<v Speaker 1>challenge of creating this analysis software, but also the peculiarities

0:14:27.080 --> 0:14:30.280
<v Speaker 1>of music itself. For example, what makes one piece of

0:14:30.360 --> 0:14:35.280
<v Speaker 1>music more memorable than another piece? What element does repetition

0:14:35.360 --> 0:14:38.160
<v Speaker 1>play when it comes to making a masterpiece? Was the

0:14:38.200 --> 0:14:41.160
<v Speaker 1>relationship between music, which, when you get down to it,

0:14:41.200 --> 0:14:44.960
<v Speaker 1>really is just math and motion and human perception. And

0:14:45.000 --> 0:14:47.360
<v Speaker 1>I could do an entire episode on Chi's work and

0:14:47.440 --> 0:14:50.400
<v Speaker 1>what her team developed and how they set out to

0:14:50.440 --> 0:14:53.080
<v Speaker 1>design this automated system to analyze music, but that's gonna

0:14:53.080 --> 0:14:55.440
<v Speaker 1>have to wait for a later episode. For now, it's

0:14:55.480 --> 0:14:58.040
<v Speaker 1>just important to understand the music is something that we're

0:14:58.080 --> 0:15:02.880
<v Speaker 1>able to experience in a level that machine just cannot. Now,

0:15:03.120 --> 0:15:05.240
<v Speaker 1>when we come back from the break, we're going to

0:15:05.360 --> 0:15:08.240
<v Speaker 1>listen in on an interview that Noel Brown had with

0:15:08.360 --> 0:15:12.680
<v Speaker 1>Alexander Lurch and learn more about musical analysis and music generation.

0:15:12.920 --> 0:15:23.080
<v Speaker 1>But first let's take a quick break to thank our sponsor. Now,

0:15:23.120 --> 0:15:25.600
<v Speaker 1>Like I said the top of the show, earlier this year,

0:15:25.600 --> 0:15:29.800
<v Speaker 1>in Producer Extraordinary, Noel Brown took a trip to mog Fest,

0:15:29.840 --> 0:15:32.040
<v Speaker 1>which was a you know, it's a conference about music

0:15:32.080 --> 0:15:34.840
<v Speaker 1>and technology and science and lots of other awesome stuff,

0:15:35.120 --> 0:15:37.440
<v Speaker 1>and he got to speak with a music analysis expert,

0:15:37.520 --> 0:15:42.480
<v Speaker 1>Alexander Larch And what follows is their conversation. So as

0:15:42.480 --> 0:15:45.000
<v Speaker 1>a bit of a layman, I interpret a lot of

0:15:45.040 --> 0:15:47.600
<v Speaker 1>what you do in the field of like generative music.

0:15:47.800 --> 0:15:52.800
<v Speaker 1>Is that kind of along the right lines. So um,

0:15:52.840 --> 0:15:55.600
<v Speaker 1>I would say my book may kind of lead to

0:15:55.720 --> 0:15:58.760
<v Speaker 1>generative music, but what I'm actually currently focusing on is

0:15:58.800 --> 0:16:02.880
<v Speaker 1>more analyzing music, so figuring out what's going on in

0:16:02.920 --> 0:16:06.520
<v Speaker 1>the music. So, um, it might start with you just

0:16:06.720 --> 0:16:09.120
<v Speaker 1>have an audio signal and you want to know, okay,

0:16:09.160 --> 0:16:11.360
<v Speaker 1>what is the temple, what is the what is the key,

0:16:11.400 --> 0:16:13.200
<v Speaker 1>what is the hook line, what is the base doing?

0:16:13.600 --> 0:16:16.920
<v Speaker 1>What is the mood of this piece of music? And

0:16:16.960 --> 0:16:20.720
<v Speaker 1>that is when trying to apply artificial intelligence and signal

0:16:20.760 --> 0:16:24.920
<v Speaker 1>processing methods to get this information to extract this inflammation

0:16:25.400 --> 0:16:28.840
<v Speaker 1>from the signal. So that's something like the hit factories

0:16:28.880 --> 0:16:31.720
<v Speaker 1>in Sweden would be all about, you know what they're

0:16:31.720 --> 0:16:33.480
<v Speaker 1>all about, Like it seems that they take a very

0:16:33.520 --> 0:16:36.680
<v Speaker 1>analytical approach to writing pop songs, where you know, they've

0:16:36.680 --> 0:16:38.520
<v Speaker 1>got people that are experts in hooks, they have people

0:16:38.560 --> 0:16:40.680
<v Speaker 1>that are experts in versus, and they have all these

0:16:40.800 --> 0:16:44.240
<v Speaker 1>kind of human algorithms on like how long everything needs

0:16:44.280 --> 0:16:46.800
<v Speaker 1>to play for in order to elicit the proper response.

0:16:47.240 --> 0:16:49.360
<v Speaker 1>Is it sort of along those lines as well, yes,

0:16:49.480 --> 0:16:53.000
<v Speaker 1>and so so you you want to find out, um,

0:16:53.120 --> 0:16:56.720
<v Speaker 1>what kind of makes the songs successful and this might

0:16:56.800 --> 0:17:00.800
<v Speaker 1>have really many many different factors impacting that. Right. So

0:17:00.840 --> 0:17:04.160
<v Speaker 1>there's the structure, of course, but there's there's so many

0:17:04.160 --> 0:17:07.320
<v Speaker 1>other dimensions here that it's really hard to nail it down.

0:17:07.760 --> 0:17:12.320
<v Speaker 1>So using using the computer to analyze this, we try

0:17:12.359 --> 0:17:15.200
<v Speaker 1>to find out more about what's going on and maybe

0:17:15.320 --> 0:17:21.280
<v Speaker 1>identifying these little things that might make something popular or

0:17:21.400 --> 0:17:24.639
<v Speaker 1>might give you goose bumps, or something that an example

0:17:24.680 --> 0:17:28.320
<v Speaker 1>or something that maybe one wouldn't expect might accomplish something

0:17:28.359 --> 0:17:29.960
<v Speaker 1>like that, or just just like an element that maybe

0:17:30.440 --> 0:17:36.480
<v Speaker 1>isn't so obvious to the average listener. It's okay, let

0:17:36.480 --> 0:17:39.400
<v Speaker 1>me let me think. Like it's it's hard to come

0:17:39.480 --> 0:17:42.080
<v Speaker 1>up with a very good example that would be surprising

0:17:42.160 --> 0:17:47.800
<v Speaker 1>to everybody. But it's definitely the combination of tiny things

0:17:47.880 --> 0:17:52.560
<v Speaker 1>like maybe intonation that is somehow a little bit off,

0:17:52.800 --> 0:17:57.040
<v Speaker 1>so you would say, or timing is a very obvious thing.

0:17:57.280 --> 0:17:59.800
<v Speaker 1>If something grooves or not it might have the same rhythm,

0:18:00.160 --> 0:18:03.400
<v Speaker 1>it might really impact you on a on a completely

0:18:03.440 --> 0:18:07.200
<v Speaker 1>different level. Right, So these are examples that are maybe

0:18:07.280 --> 0:18:11.439
<v Speaker 1>not surprising, but but still um point to the direction. Yeah,

0:18:11.560 --> 0:18:15.440
<v Speaker 1>is it maybe an element of human human human interaction?

0:18:15.520 --> 0:18:19.000
<v Speaker 1>Like I think things are too quantized, it's maybe less emotional,

0:18:19.200 --> 0:18:21.560
<v Speaker 1>whereas when people enter the notes by hand and they're

0:18:21.600 --> 0:18:25.159
<v Speaker 1>a little bit imperfect, or for example, the singer Adele,

0:18:25.200 --> 0:18:27.240
<v Speaker 1>there was an article about how she sort of slides

0:18:27.280 --> 0:18:29.800
<v Speaker 1>into her notes and that gives you goose bumps because

0:18:29.840 --> 0:18:33.080
<v Speaker 1>it's got this human quality where you sense that raw

0:18:33.240 --> 0:18:36.120
<v Speaker 1>human emotion in the same way. Maybe someone who does

0:18:36.200 --> 0:18:39.520
<v Speaker 1>electronic music makes mistakes and leaves them in and that's

0:18:39.560 --> 0:18:44.080
<v Speaker 1>what kind of makes it more approachable. Absolutely. I mean,

0:18:44.160 --> 0:18:45.679
<v Speaker 1>one thing you have to keep in mind is that

0:18:45.760 --> 0:18:48.800
<v Speaker 1>it's all jover and artists dependent as well, right, so

0:18:48.840 --> 0:18:51.800
<v Speaker 1>there's there will definitely never be a formula. So if

0:18:51.800 --> 0:18:53.560
<v Speaker 1>you want to have goose bumps, just do that and

0:18:53.600 --> 0:18:57.480
<v Speaker 1>then it looks right, So you can always analyze in retrospect. Okay,

0:18:57.520 --> 0:19:01.000
<v Speaker 1>this artist has this specific thing thing that he or

0:19:01.040 --> 0:19:06.680
<v Speaker 1>she does and that makes things so so um fascinating

0:19:06.800 --> 0:19:12.119
<v Speaker 1>or also that makes you hooked on that, But that

0:19:12.280 --> 0:19:14.840
<v Speaker 1>might not work for a different genre or for a

0:19:14.880 --> 0:19:19.080
<v Speaker 1>new song, right, especially because it's also about expectation and

0:19:19.119 --> 0:19:22.560
<v Speaker 1>what you already know. So um, I can maybe let

0:19:22.560 --> 0:19:26.040
<v Speaker 1>a computer compose something in Mozart style, right, and it

0:19:26.160 --> 0:19:29.119
<v Speaker 1>might be a really good motor piece, but that doesn't

0:19:29.119 --> 0:19:34.280
<v Speaker 1>mean it really gets you as a listener because you

0:19:34.320 --> 0:19:37.240
<v Speaker 1>have heard so many Mozart pieces and the original will

0:19:37.400 --> 0:19:41.200
<v Speaker 1>still be better. It's it's always an imitation, right, so

0:19:41.200 --> 0:19:44.399
<v Speaker 1>so then it might actually miss something there, right, Even

0:19:44.440 --> 0:19:48.920
<v Speaker 1>if the composition itself is very much like Mozart did it, well,

0:19:49.320 --> 0:19:52.160
<v Speaker 1>so is the end product of your research to make

0:19:52.200 --> 0:19:55.360
<v Speaker 1>computers better at doing this or are you just interested

0:19:55.359 --> 0:19:58.679
<v Speaker 1>in kind of you know, breaking down pieces of music

0:19:58.720 --> 0:20:01.760
<v Speaker 1>and to their based elements. So at the moment, I'm

0:20:01.800 --> 0:20:04.480
<v Speaker 1>doing exactly that, I'm breaking it down. I I want

0:20:04.520 --> 0:20:08.200
<v Speaker 1>to be able to let a computer transcribe what's going

0:20:08.240 --> 0:20:11.199
<v Speaker 1>on in the music. I want to understand maybe on

0:20:11.200 --> 0:20:14.679
<v Speaker 1>a perceptional level. So what makes what parameters that you

0:20:14.680 --> 0:20:18.840
<v Speaker 1>can objectively extract from the audio signal? Um? What impact

0:20:19.080 --> 0:20:22.080
<v Speaker 1>might they have on the listener? Right? So so how

0:20:22.119 --> 0:20:28.359
<v Speaker 1>does the listener react to certain um specific characteristics of

0:20:28.560 --> 0:20:34.680
<v Speaker 1>the music. But this knowledge is then also can most

0:20:34.720 --> 0:20:39.960
<v Speaker 1>definitely be used to actually generate new music, um, following

0:20:40.000 --> 0:20:44.320
<v Speaker 1>specific rules that you have extracted from the music and

0:20:44.359 --> 0:20:47.640
<v Speaker 1>then create something new. And this is what my colleague

0:20:48.160 --> 0:20:52.720
<v Speaker 1>Gil Weinberg woks a lot on with his robots that

0:20:52.760 --> 0:20:55.680
<v Speaker 1>make music. Okay, tell me more about that. And let's

0:20:55.720 --> 0:20:57.960
<v Speaker 1>not he the mr what it was interested? Right? Yeah?

0:20:58.000 --> 0:21:00.600
<v Speaker 1>So so there's um he has a robot called him On.

0:21:01.240 --> 0:21:08.320
<v Speaker 1>So she's a marimba playing robot. Um. So what Also,

0:21:08.480 --> 0:21:11.400
<v Speaker 1>my my colleague is a lot into jazz, so Simon

0:21:11.520 --> 0:21:14.280
<v Speaker 1>plays also a lot of jazz UM. So there's a

0:21:14.320 --> 0:21:18.760
<v Speaker 1>lot of um interaction on the stage with the live musicians,

0:21:18.880 --> 0:21:22.800
<v Speaker 1>and the question answer games between what what Simon plays

0:21:22.840 --> 0:21:26.520
<v Speaker 1>on the marimba and what the musician then plays, and

0:21:26.800 --> 0:21:31.159
<v Speaker 1>so it's it's constantly analyzed what's being what's being played,

0:21:31.200 --> 0:21:35.920
<v Speaker 1>and then the robot improvises or tries to um give

0:21:36.000 --> 0:21:38.600
<v Speaker 1>some answers to that jazz. I mean, you have to listen,

0:21:38.640 --> 0:21:40.439
<v Speaker 1>you have to be able to follow the leads that

0:21:40.520 --> 0:21:43.600
<v Speaker 1>you're you know, fellow musicians are putting out there, otherwise

0:21:43.640 --> 0:21:46.680
<v Speaker 1>you're not any good exactly. This whole interaction thing is

0:21:46.680 --> 0:21:49.120
<v Speaker 1>is part of the of the research obviously, and it's

0:21:49.119 --> 0:21:52.639
<v Speaker 1>not only the music, right, it's only it's also just

0:21:52.640 --> 0:21:55.400
<v Speaker 1>just it's eye contact and so on. So that's why

0:21:55.440 --> 0:21:58.680
<v Speaker 1>this robot, even if it doesn't make any sound, has

0:21:58.720 --> 0:22:03.520
<v Speaker 1>actually ahead where where she can look at specific musicians

0:22:04.080 --> 0:22:07.480
<v Speaker 1>um and not her head and so on. So you see,

0:22:07.720 --> 0:22:10.400
<v Speaker 1>you kind of can interact with the robot. So this,

0:22:10.400 --> 0:22:14.080
<v Speaker 1>this human robot interaction is part of the research as well. Fascinating.

0:22:14.520 --> 0:22:18.000
<v Speaker 1>What can you describe the difference between an algorithm that

0:22:18.520 --> 0:22:20.960
<v Speaker 1>does what you're talking about and analyzes music and one

0:22:21.040 --> 0:22:23.840
<v Speaker 1>that might create generative music. It seems like there's sort

0:22:23.840 --> 0:22:25.920
<v Speaker 1>of a crossover between the two, and I'm just I

0:22:26.000 --> 0:22:27.840
<v Speaker 1>just was probably you could kind of like spell that

0:22:27.840 --> 0:22:31.800
<v Speaker 1>out a little bit for us. So, UM, in essence,

0:22:32.040 --> 0:22:36.240
<v Speaker 1>the the algorithm that analyzes music is kind of the

0:22:36.280 --> 0:22:39.359
<v Speaker 1>information you gain from that algorithm has to feed the

0:22:39.440 --> 0:22:43.720
<v Speaker 1>generative algorithm. So, for example, you cannot compose something in

0:22:43.760 --> 0:22:46.919
<v Speaker 1>classical style if you don't know classical style, right, so

0:22:47.000 --> 0:22:49.040
<v Speaker 1>you have to learn it from data. That is the

0:22:49.040 --> 0:22:55.040
<v Speaker 1>analysis part, and then you try to infer models from that. Right.

0:22:55.080 --> 0:22:58.840
<v Speaker 1>So you you have all this data, you have you know, um,

0:22:58.880 --> 0:23:01.960
<v Speaker 1>you have structural data, you have voice leading, you have

0:23:02.440 --> 0:23:05.680
<v Speaker 1>maybe intonation if it's about performance, and then you try

0:23:05.720 --> 0:23:09.880
<v Speaker 1>to fix this data into rules, and these rules then

0:23:10.280 --> 0:23:15.600
<v Speaker 1>would generate music, for example, jazz improvisation or something that.

0:23:16.280 --> 0:23:18.960
<v Speaker 1>So Brian you know, has has been kind of delving

0:23:18.960 --> 0:23:22.240
<v Speaker 1>into generative music lately, and it's actually really interesting. There's

0:23:22.240 --> 0:23:24.760
<v Speaker 1>a BBC documentary of him kind of showing his methods

0:23:24.760 --> 0:23:27.359
<v Speaker 1>and he's just using logic and he has these little

0:23:27.440 --> 0:23:29.000
<v Speaker 1>kind of nodes I guess you could call on the

0:23:29.040 --> 0:23:32.480
<v Speaker 1>scripts or whatever that can set rules for like a

0:23:32.560 --> 0:23:34.800
<v Speaker 1>drum part or something like that where it will say,

0:23:34.840 --> 0:23:38.080
<v Speaker 1>subdivide every other whatever, like any number of things that

0:23:38.119 --> 0:23:42.359
<v Speaker 1>you could input like that. Um, I guess are we

0:23:42.440 --> 0:23:44.679
<v Speaker 1>at a place where that's still just kind of a

0:23:44.720 --> 0:23:47.600
<v Speaker 1>gimmick or are we Are we really trying to recreate

0:23:48.680 --> 0:23:51.480
<v Speaker 1>a human mind creating music or is it just kind

0:23:51.480 --> 0:23:54.520
<v Speaker 1>of a different animal altogether, you know what I mean? Like,

0:23:54.600 --> 0:23:57.119
<v Speaker 1>I'm wondering, are we really trying to have AI that

0:23:57.200 --> 0:24:01.320
<v Speaker 1>can compose mozart, or that can place a producer or

0:24:01.320 --> 0:24:03.720
<v Speaker 1>replace a songwriter, or is it just sort of like

0:24:03.800 --> 0:24:07.480
<v Speaker 1>its own thing that's fascinating in and of itself. So

0:24:07.560 --> 0:24:12.359
<v Speaker 1>I don't think that the goal here is to replace musicians,

0:24:12.520 --> 0:24:16.480
<v Speaker 1>but I think it's um from a research perspective, Um,

0:24:16.520 --> 0:24:20.800
<v Speaker 1>giving a machine creativity is a really fascinating topic, right,

0:24:20.840 --> 0:24:27.120
<v Speaker 1>So is it possible if you just have something algorithm driven, um,

0:24:27.160 --> 0:24:30.880
<v Speaker 1>that it actually creates something new that it hasn't seen before. Right?

0:24:31.440 --> 0:24:37.919
<v Speaker 1>So um I UM, I wouldn't be worried about being replaced,

0:24:38.440 --> 0:24:41.719
<v Speaker 1>although I mean I could see in the future, like

0:24:41.800 --> 0:24:47.399
<v Speaker 1>for example, generating elevator music, right, UM that that I

0:24:47.440 --> 0:24:51.600
<v Speaker 1>can easily see being automatically generated in the future. Um.

0:24:51.640 --> 0:24:55.879
<v Speaker 1>And there, yes, you would actually the AI would actually

0:24:55.880 --> 0:24:59.600
<v Speaker 1>replace the human composer and that in that area. But

0:25:00.040 --> 0:25:04.960
<v Speaker 1>I don't. I don't think that. Um. I think the

0:25:05.160 --> 0:25:09.520
<v Speaker 1>the phenomenon of creativity is still not completely understood. Um.

0:25:09.560 --> 0:25:13.240
<v Speaker 1>And it's with current technologies, it's I think it's really

0:25:13.280 --> 0:25:16.200
<v Speaker 1>hard to get there. I mean, we do use some

0:25:16.520 --> 0:25:20.040
<v Speaker 1>random randomizations and so on, so it generates something that

0:25:20.080 --> 0:25:24.639
<v Speaker 1>you haven't heard before, but well it's random, right, so

0:25:24.720 --> 0:25:29.239
<v Speaker 1>it's not necessarily an act of creativity here. So so

0:25:29.359 --> 0:25:31.520
<v Speaker 1>we're trying to get there, but I think it's still

0:25:31.520 --> 0:25:34.600
<v Speaker 1>a long way to have to create something that is

0:25:34.640 --> 0:25:37.920
<v Speaker 1>really creative. It's not getting a Creativity seems to be

0:25:38.000 --> 0:25:40.520
<v Speaker 1>sort of subjective in and of itself. It's like, does

0:25:40.640 --> 0:25:43.760
<v Speaker 1>creativity mean that it was created by a human? You know,

0:25:43.840 --> 0:25:47.199
<v Speaker 1>like is that exclusively what creativity is? And if we

0:25:47.280 --> 0:25:50.760
<v Speaker 1>have something that is somewhat sentient, can it be creative?

0:25:51.119 --> 0:25:54.840
<v Speaker 1>You know? I would say that the definition of creativity

0:25:54.920 --> 0:26:00.720
<v Speaker 1>is mostly subject based. So there's no godlike instance who says, Okay,

0:26:00.720 --> 0:26:03.000
<v Speaker 1>this is creative and this is not creative. But what

0:26:03.000 --> 0:26:10.080
<v Speaker 1>what it depends on is what the listeners thinks of this, right, um? So,

0:26:11.400 --> 0:26:14.720
<v Speaker 1>which is then in a way makes it really difficult

0:26:14.760 --> 0:26:18.800
<v Speaker 1>to do research because as there's no clear definition of

0:26:18.800 --> 0:26:22.080
<v Speaker 1>what we're measuring. Um, it's it's all the subject driven.

0:26:22.400 --> 0:26:25.440
<v Speaker 1>It's really hard to say, Okay, this is something where

0:26:25.440 --> 0:26:27.359
<v Speaker 1>it's going in the right direction and this is not

0:26:27.400 --> 0:26:31.040
<v Speaker 1>so much. Yeah, but I mean it's so that the

0:26:31.119 --> 0:26:35.399
<v Speaker 1>problem is kind of mentioned learning about official intelligence algorithms,

0:26:35.720 --> 0:26:39.720
<v Speaker 1>they all try to they learned from data, and they

0:26:39.960 --> 0:26:43.280
<v Speaker 1>essentially always try to reproduce something that they learned from

0:26:43.320 --> 0:26:47.680
<v Speaker 1>the data. Right, while real creativity is always thinking all

0:26:47.720 --> 0:26:51.560
<v Speaker 1>of the box. I wanted to be unexpected, like you know,

0:26:51.800 --> 0:26:54.879
<v Speaker 1>uses these algorithms because he wants to surprise himself, but

0:26:54.920 --> 0:26:57.920
<v Speaker 1>he likes to set certain conditions that are appealing to him.

0:26:58.240 --> 0:27:00.240
<v Speaker 1>It's sort of like being the prime mover in the

0:27:00.280 --> 0:27:02.840
<v Speaker 1>situation and then just sort of letting the pieces fall

0:27:02.880 --> 0:27:04.399
<v Speaker 1>where they may at the end of the day. But

0:27:04.480 --> 0:27:07.200
<v Speaker 1>you are sort of still putting yourself into the equation.

0:27:07.440 --> 0:27:10.679
<v Speaker 1>But then you are hoping for unexpected results to surprise yourself.

0:27:11.040 --> 0:27:14.560
<v Speaker 1>And this is definitely one very good way of dealing

0:27:14.640 --> 0:27:17.440
<v Speaker 1>with that. Right because you you have some kind of

0:27:17.680 --> 0:27:21.280
<v Speaker 1>random components there, Um, you don't trust everything that is

0:27:21.280 --> 0:27:24.920
<v Speaker 1>being output. It right, So, but something might be good,

0:27:25.040 --> 0:27:27.879
<v Speaker 1>So you generate a lot of variations of of what

0:27:28.000 --> 0:27:30.760
<v Speaker 1>you might want to achieve, and then you just pick

0:27:30.840 --> 0:27:34.320
<v Speaker 1>something that that really bokes and then you um use

0:27:34.400 --> 0:27:36.399
<v Speaker 1>this as a starting point from where you want to

0:27:36.440 --> 0:27:39.520
<v Speaker 1>go to where you want to go. I mentioned elevator music,

0:27:39.560 --> 0:27:41.119
<v Speaker 1>and I get that for sure, but aren't they already

0:27:41.160 --> 0:27:43.320
<v Speaker 1>using generative music and video games where they have to

0:27:43.400 --> 0:27:47.040
<v Speaker 1>have music constantly playing And obviously it would take ages

0:27:47.119 --> 0:27:50.080
<v Speaker 1>for a single person to compose, you know, hundreds of

0:27:50.080 --> 0:27:52.439
<v Speaker 1>hours of music. And I know there are cues in

0:27:52.600 --> 0:27:55.000
<v Speaker 1>games that are composed, but then there are parts where

0:27:55.000 --> 0:27:58.000
<v Speaker 1>you're maybe wandering around and like the you know RPG

0:27:58.160 --> 0:28:00.520
<v Speaker 1>type game and it's sort of ambient music. It just

0:28:00.560 --> 0:28:03.840
<v Speaker 1>seems to morph and change, right, I mean, this is

0:28:03.880 --> 0:28:05.600
<v Speaker 1>but this is rule based as far as I know.

0:28:05.640 --> 0:28:07.720
<v Speaker 1>I'm I'm far from being an expert in in what

0:28:08.240 --> 0:28:11.719
<v Speaker 1>really happens in these game engines, but my understanding is

0:28:12.160 --> 0:28:16.919
<v Speaker 1>that they define specific states, um and then they have

0:28:17.080 --> 0:28:22.480
<v Speaker 1>certain rules for either looping something, looping specific loops or

0:28:22.640 --> 0:28:27.200
<v Speaker 1>just generating some some more atmospheric background tones within a

0:28:27.320 --> 0:28:30.760
<v Speaker 1>palet or within like a scale or something that's you know,

0:28:30.880 --> 0:28:33.640
<v Speaker 1>but I'm I'm I'm pretty sure that this is not

0:28:33.680 --> 0:28:37.120
<v Speaker 1>necessarily automatically generated. I mean, there might be randomness in there,

0:28:37.119 --> 0:28:40.120
<v Speaker 1>but I think it's basically rule based. So somebody during

0:28:40.160 --> 0:28:45.120
<v Speaker 1>the development specified, okay in this state do something like this.

0:28:46.120 --> 0:28:49.120
<v Speaker 1>How do you think that technology will shape music over

0:28:49.160 --> 0:28:51.800
<v Speaker 1>the next ten twenty years. I mean, obviously, we're at

0:28:51.800 --> 0:28:54.680
<v Speaker 1>a conference festival that is very much involved in the

0:28:55.680 --> 0:28:59.000
<v Speaker 1>connection between technology and music. I love it. I think

0:28:59.080 --> 0:29:01.040
<v Speaker 1>it's amazing. There are some people that are kind of

0:29:01.040 --> 0:29:02.800
<v Speaker 1>freaks out, But I wonder what you think about, like,

0:29:02.920 --> 0:29:06.880
<v Speaker 1>where's it going? Oh, well, that's that's obviously very hard

0:29:06.920 --> 0:29:12.240
<v Speaker 1>to answer. I mean, I mean, so, okay, let me

0:29:12.280 --> 0:29:15.520
<v Speaker 1>start historically, right, so, so technology and music they have

0:29:15.680 --> 0:29:20.360
<v Speaker 1>always interacted very closely. Right, So, there's actually genres who

0:29:20.400 --> 0:29:25.680
<v Speaker 1>would not which would not be there without the technology

0:29:26.800 --> 0:29:30.360
<v Speaker 1>technology exactly, so the electric guitar rock and roll wouldn't

0:29:30.360 --> 0:29:32.640
<v Speaker 1>have happened with all the electric guitar, and the electric

0:29:32.680 --> 0:29:37.320
<v Speaker 1>guitar was in essence and engineering effort, right synthesizer. Obviously,

0:29:37.360 --> 0:29:40.920
<v Speaker 1>we are here at mook Fest. Um so, so there

0:29:41.000 --> 0:29:47.800
<v Speaker 1>was always close interaction between technology, um so. Um what

0:29:48.440 --> 0:29:50.840
<v Speaker 1>the trends that I currently see, and they are not

0:29:50.920 --> 0:29:55.800
<v Speaker 1>really surprising, I guess, but I think that, um, the

0:29:55.840 --> 0:29:59.480
<v Speaker 1>interaction of the performer with any kind of sound generation

0:29:59.480 --> 0:30:06.840
<v Speaker 1>of music generation will will um grow more cohesive. So

0:30:07.240 --> 0:30:11.560
<v Speaker 1>any kind of controller will be easier to use and

0:30:11.560 --> 0:30:14.600
<v Speaker 1>and uh, it will also be easier to use for

0:30:14.640 --> 0:30:17.600
<v Speaker 1>everybody to create music. And this is definitely a trend

0:30:17.640 --> 0:30:19.640
<v Speaker 1>you already see with d J apps and so on,

0:30:20.320 --> 0:30:23.400
<v Speaker 1>where they automatically create matchups for you and and all

0:30:23.480 --> 0:30:27.400
<v Speaker 1>this stuff. Um, it's this is this is definitely going

0:30:27.440 --> 0:30:31.600
<v Speaker 1>to happen that the user will be even if they

0:30:31.600 --> 0:30:36.160
<v Speaker 1>have no music background, will be able to create music

0:30:37.080 --> 0:30:39.960
<v Speaker 1>in a way that that makes sense. It might only

0:30:39.960 --> 0:30:44.240
<v Speaker 1>be loop based for now, but there's a lot of

0:30:44.280 --> 0:30:48.960
<v Speaker 1>possibilities here. Um, I see all the possibilities in more

0:30:49.000 --> 0:30:53.840
<v Speaker 1>crowd based approaches to this. Right, So, um, what happens

0:30:53.880 --> 0:30:56.280
<v Speaker 1>if you put a hundred people into a room and

0:30:56.320 --> 0:30:58.840
<v Speaker 1>give them, I don't know, an app or something that

0:30:58.880 --> 0:31:01.760
<v Speaker 1>they can control and then they make music together? Neural

0:31:01.880 --> 0:31:07.959
<v Speaker 1>network music exactly. And and there's also in this context

0:31:08.040 --> 0:31:11.360
<v Speaker 1>there's new forms how artists can communicate with their fans. Right,

0:31:11.440 --> 0:31:14.560
<v Speaker 1>so you could release something that is actually interactive, So

0:31:14.920 --> 0:31:18.360
<v Speaker 1>so fans could, in the easiest form could vote on something,

0:31:18.400 --> 0:31:21.400
<v Speaker 1>but maybe some more complex input would shape the music

0:31:21.440 --> 0:31:24.160
<v Speaker 1>and outcome there. So I think these are very very

0:31:24.200 --> 0:31:27.840
<v Speaker 1>interesting forms where you already see the seats in what's

0:31:27.880 --> 0:31:31.600
<v Speaker 1>currently happening, um, and I think this will definitely evolve.

0:31:32.080 --> 0:31:35.200
<v Speaker 1>Knowl and Lurch makes some great points about the subtleties

0:31:35.240 --> 0:31:38.040
<v Speaker 1>of music and analysis as well as the potential for

0:31:38.120 --> 0:31:40.920
<v Speaker 1>their future. And when we come back, we'll talk more

0:31:40.960 --> 0:31:52.280
<v Speaker 1>about generating music from a computational standpoint. Generating music, like

0:31:52.640 --> 0:31:55.480
<v Speaker 1>musical analysis, is a non trivial task. How do you

0:31:55.520 --> 0:31:59.640
<v Speaker 1>program a computer so that it might dynamically create, esthetically

0:32:00.080 --> 0:32:03.520
<v Speaker 1>leasing measures of music without becoming too repetitive or boring,

0:32:04.040 --> 0:32:06.600
<v Speaker 1>or straying too far away from a melody line to

0:32:06.680 --> 0:32:10.200
<v Speaker 1>sound like anything other than just random series of notes. Now,

0:32:10.400 --> 0:32:14.360
<v Speaker 1>some music, maybe even a lot of music, is written

0:32:14.520 --> 0:32:19.360
<v Speaker 1>very deliberately. You know, your painstakingly sitting down and figuring

0:32:19.400 --> 0:32:22.480
<v Speaker 1>out what chord comes next, when should you put in

0:32:22.520 --> 0:32:26.040
<v Speaker 1>the key change, how many times should you repeat the chorus.

0:32:26.560 --> 0:32:28.880
<v Speaker 1>It's not as if some mythical muse has reached down

0:32:28.960 --> 0:32:32.160
<v Speaker 1>to touch the musician's brain and create the song fully formed.

0:32:32.560 --> 0:32:35.080
<v Speaker 1>But there have been attempts by humans to create music

0:32:35.120 --> 0:32:39.640
<v Speaker 1>from an almost engineering perspective, so that it almost it

0:32:39.680 --> 0:32:42.040
<v Speaker 1>almost feels like you're taking the artistry out. That's not

0:32:42.200 --> 0:32:45.240
<v Speaker 1>entirely fair. I don't really believe that is so, but

0:32:45.640 --> 0:32:48.720
<v Speaker 1>there do. There are some songs out there that were

0:32:48.760 --> 0:32:52.240
<v Speaker 1>created by committee, and you could argue that some of

0:32:52.280 --> 0:32:58.880
<v Speaker 1>them perhaps seem to have less merit to them than others. Now,

0:32:59.160 --> 0:33:04.600
<v Speaker 1>there's some commit the design music that is amazing for

0:33:04.760 --> 0:33:07.360
<v Speaker 1>reasons that are difficult to put into words. For example,

0:33:07.400 --> 0:33:11.960
<v Speaker 1>in n Dave Soldier, a composer, worked with two artists,

0:33:12.720 --> 0:33:17.040
<v Speaker 1>Komar and Melamid, to create what they titled the Most

0:33:17.280 --> 0:33:21.360
<v Speaker 1>Unwanted Song. They conducted a public survey to find out

0:33:21.400 --> 0:33:24.720
<v Speaker 1>what people most liked and hated in music, and then

0:33:24.760 --> 0:33:28.880
<v Speaker 1>they created two different songs that incorporated many of those elements.

0:33:28.960 --> 0:33:33.600
<v Speaker 1>The ones that included the lowest scoring elements became part

0:33:33.680 --> 0:33:36.560
<v Speaker 1>of the Most Unwanted Song. And it's a song that

0:33:36.640 --> 0:33:41.440
<v Speaker 1>lasts about twenty minutes. It's incredibly long. It's a song

0:33:41.520 --> 0:33:47.160
<v Speaker 1>that includes accordion, bagpipes, children's voices, and opera singer rapping,

0:33:47.400 --> 0:33:53.840
<v Speaker 1>and also incorporated advertising. It's gloriously awful and it sounds

0:33:53.880 --> 0:34:26.480
<v Speaker 1>like this now. They also did the most Wanted Music,

0:34:26.840 --> 0:34:29.160
<v Speaker 1>and they created a song that incorporated the elements that

0:34:29.200 --> 0:34:32.719
<v Speaker 1>the survey takers identified as being the most pleasant components

0:34:33.040 --> 0:34:35.839
<v Speaker 1>of music. The result is something that would likely put

0:34:35.920 --> 0:34:39.799
<v Speaker 1>Kenny g into a coma. It's listening so easy you

0:34:39.840 --> 0:34:43.279
<v Speaker 1>don't even know you're listening. It's a shout out to

0:34:43.320 --> 0:34:45.920
<v Speaker 1>Peter shik Ali right there. I actually think that this

0:34:45.960 --> 0:34:49.239
<v Speaker 1>song is worse than the most Unwanted Song, but take

0:34:49.239 --> 0:35:18.920
<v Speaker 1>a listen the world. Both examples illustrate the power of

0:35:19.000 --> 0:35:22.200
<v Speaker 1>music analysis, as well as how it can easily be

0:35:22.320 --> 0:35:25.560
<v Speaker 1>misinterpreted or misused, which can create I think we can

0:35:25.600 --> 0:35:30.359
<v Speaker 1>all agree horrific results. But neither of those pieces were

0:35:30.440 --> 0:35:33.440
<v Speaker 1>actually generated by computers. That was all the work of

0:35:33.520 --> 0:35:37.080
<v Speaker 1>human beings. Human beings with a wonky sense of humor,

0:35:37.200 --> 0:35:39.439
<v Speaker 1>but still human. And you might think that the first

0:35:39.440 --> 0:35:42.319
<v Speaker 1>computer generated music must have come a decade or so later.

0:35:42.400 --> 0:35:45.359
<v Speaker 1>I mean, the Unwanted Song and Wanted Song both came

0:35:45.360 --> 0:35:50.799
<v Speaker 1>out nine, but now was late for computer generated music.

0:35:50.880 --> 0:35:54.560
<v Speaker 1>The first actual piece written by computer was the Iliac

0:35:54.800 --> 0:35:59.440
<v Speaker 1>Sweet for String Quartet, created in nineteen fifty seven. This

0:35:59.520 --> 0:36:03.319
<v Speaker 1>was the work of Learn Hiller, a composer, and Leonard Isaacson,

0:36:03.480 --> 0:36:07.440
<v Speaker 1>a mathematician, and their approach was fairly straightforward. They created

0:36:07.440 --> 0:36:11.440
<v Speaker 1>a program that would generate pseudo random integers, which in

0:36:11.480 --> 0:36:15.319
<v Speaker 1>turn would represent important information with regards to musical composition

0:36:15.480 --> 0:36:21.000
<v Speaker 1>such as pitch, rhythm, dynamics, and other factors. This processed

0:36:21.080 --> 0:36:24.000
<v Speaker 1>information would then go through a pass on a filter,

0:36:24.200 --> 0:36:26.840
<v Speaker 1>and that filter would force the data to follow rules

0:36:26.840 --> 0:36:31.040
<v Speaker 1>of composition, so it sort out anything that went outside

0:36:31.040 --> 0:36:34.279
<v Speaker 1>of the rules of composition and anything that was when

0:36:34.320 --> 0:36:37.520
<v Speaker 1>then the rules would get a pass and the resulting

0:36:37.560 --> 0:36:41.640
<v Speaker 1>piece of music for a string quartet sounds a bit experimental,

0:36:41.920 --> 0:36:45.239
<v Speaker 1>but it doesn't exactly sound mechanical. It sounds kind of

0:36:45.280 --> 0:37:02.640
<v Speaker 1>like this. Other experiments and music generation followed, but they

0:37:02.680 --> 0:37:07.120
<v Speaker 1>all depended pretty heavily on computers working within relatively strict

0:37:07.239 --> 0:37:10.200
<v Speaker 1>sets of rules, with a good deal of human guidance

0:37:10.200 --> 0:37:12.759
<v Speaker 1>along the way, and of course the computers had no

0:37:12.800 --> 0:37:17.080
<v Speaker 1>actual understanding of music. You could program in rules for

0:37:17.120 --> 0:37:20.560
<v Speaker 1>different musical genres and computers can do that. That's what

0:37:20.640 --> 0:37:23.680
<v Speaker 1>computers do. They're really good at following rules, but the

0:37:23.719 --> 0:37:26.520
<v Speaker 1>machines have no way of knowing why those rules exist

0:37:26.680 --> 0:37:29.680
<v Speaker 1>or what sort of effect those rules have on the

0:37:29.760 --> 0:37:34.279
<v Speaker 1>music itself. Computer scientists have created some interesting experiments to

0:37:34.360 --> 0:37:38.800
<v Speaker 1>build music generators. For example, Matt Vitelli and Erin Naiebe

0:37:39.160 --> 0:37:42.360
<v Speaker 1>built software that analyzed a piece of music by Medean,

0:37:42.960 --> 0:37:47.120
<v Speaker 1>a French DJ, from the day on I suppose I

0:37:47.160 --> 0:37:52.759
<v Speaker 1>apologize my Francie is uh not very good. The software

0:37:52.840 --> 0:37:56.759
<v Speaker 1>analyzed Medeans work and then attempted to replicate it. It

0:37:56.960 --> 0:38:00.400
<v Speaker 1>used recurrent neural networks an attempt to capture the essence

0:38:00.440 --> 0:38:03.680
<v Speaker 1>of the music and make something similar. The neural network

0:38:03.800 --> 0:38:07.279
<v Speaker 1>learned with every iteration of music uh, and learned how

0:38:07.320 --> 0:38:10.800
<v Speaker 1>to more closely mimic the style, So when it first

0:38:10.840 --> 0:38:18.680
<v Speaker 1>started it sounded like pure noise. It took two thousand

0:38:18.760 --> 0:38:22.240
<v Speaker 1>iterations before it generated something that resembled a song more

0:38:22.600 --> 0:38:31.359
<v Speaker 1>than noise. But it shows that these learning algorithms are

0:38:31.400 --> 0:38:35.719
<v Speaker 1>able to start focusing on what those elements are that

0:38:35.800 --> 0:38:42.720
<v Speaker 1>represent meaningful information versus meaningless information. So would this eventually

0:38:42.760 --> 0:38:45.120
<v Speaker 1>be able to create its own music if you were

0:38:45.239 --> 0:38:48.840
<v Speaker 1>to say, said it to listening to a radio station

0:38:48.880 --> 0:38:52.600
<v Speaker 1>for long enough. Who's to say? Over at Google, the

0:38:52.680 --> 0:38:55.240
<v Speaker 1>Brain team is working on a ton of different projects

0:38:55.239 --> 0:38:59.799
<v Speaker 1>related to machine learning and artificial intelligence, including exploring opportunities

0:38:59.800 --> 0:39:03.880
<v Speaker 1>for computer generated music. This falls under something called the

0:39:03.880 --> 0:39:07.640
<v Speaker 1>Magenta Project, and the project has two purposes. The first

0:39:07.719 --> 0:39:11.239
<v Speaker 1>is to experiment with machines creating different forms of art automatically,

0:39:11.400 --> 0:39:15.640
<v Speaker 1>including music. The second purpose is to foster a community

0:39:15.719 --> 0:39:18.560
<v Speaker 1>of artists and programmers to find new and interesting ways

0:39:18.600 --> 0:39:23.160
<v Speaker 1>to use this technology that Google has created. On the

0:39:23.200 --> 0:39:27.360
<v Speaker 1>official page for Magenta, Douglas Eck points out that artists

0:39:27.360 --> 0:39:30.239
<v Speaker 1>have always found innovative ways to put technology to use

0:39:30.320 --> 0:39:33.319
<v Speaker 1>beyond what the creators had in mind, and that's where

0:39:33.360 --> 0:39:36.760
<v Speaker 1>true innovation lies. So in other words, when you create

0:39:36.800 --> 0:39:39.920
<v Speaker 1>an electric guitar for the first time, you're probably not

0:39:40.040 --> 0:39:43.040
<v Speaker 1>anticipating the way Jimmy Hendrix is going to play that

0:39:43.400 --> 0:39:46.959
<v Speaker 1>decades later. So artists have been able to take things

0:39:46.960 --> 0:39:51.120
<v Speaker 1>that people have created and move it beyond even the

0:39:51.120 --> 0:39:53.960
<v Speaker 1>creator's expectations. That's kind of what they're hoping over at

0:39:53.960 --> 0:39:57.840
<v Speaker 1>the Magenta Project. Ck goes on to point out that

0:39:57.880 --> 0:40:00.759
<v Speaker 1>short form machine generated music can be quite effective, and

0:40:00.800 --> 0:40:03.839
<v Speaker 1>it's been around for a while. There are generators out

0:40:03.840 --> 0:40:08.000
<v Speaker 1>there that can make short songs essentially are short pieces

0:40:08.000 --> 0:40:13.120
<v Speaker 1>of music. But if you increase the duration requirement, if

0:40:13.120 --> 0:40:16.880
<v Speaker 1>you require the music to last longer, you start running

0:40:16.920 --> 0:40:19.399
<v Speaker 1>into the limitations of the technology. They start to become

0:40:19.440 --> 0:40:22.360
<v Speaker 1>more apparent, and it becomes clear that machines aren't really

0:40:22.400 --> 0:40:25.840
<v Speaker 1>good at sustaining a long term narrative in any format.

0:40:26.560 --> 0:40:30.200
<v Speaker 1>The Magenta project isn't just a single approach. It's not

0:40:30.280 --> 0:40:33.040
<v Speaker 1>like a group of folks who are just working on

0:40:33.040 --> 0:40:36.400
<v Speaker 1>one set of algorithms. Think of it more like a

0:40:36.440 --> 0:40:41.480
<v Speaker 1>platform or a list of assets, a list of available

0:40:42.000 --> 0:40:46.759
<v Speaker 1>uh bits and pieces other people can use, and programmers

0:40:46.760 --> 0:40:49.680
<v Speaker 1>and musicians can build tools out of those pieces for

0:40:49.719 --> 0:40:52.960
<v Speaker 1>generating music. Now, some of those tools may end up

0:40:52.960 --> 0:40:56.919
<v Speaker 1>being way more effective than other tools. Just figuring out

0:40:56.920 --> 0:41:00.480
<v Speaker 1>how to evaluate the abilities of the software could end

0:41:00.520 --> 0:41:02.960
<v Speaker 1>up becoming a challenge. How can you tell if one

0:41:03.000 --> 0:41:07.080
<v Speaker 1>autonomous music generator is quote unquote better than another one.

0:41:07.719 --> 0:41:10.879
<v Speaker 1>Music is pretty subjective and what I might like might

0:41:10.920 --> 0:41:13.960
<v Speaker 1>not be what you like, And there are some qualitative

0:41:13.960 --> 0:41:17.840
<v Speaker 1>elements that we can look at that are pretty difficult

0:41:17.880 --> 0:41:20.719
<v Speaker 1>to to get a conversation going, because if you have

0:41:20.800 --> 0:41:24.400
<v Speaker 1>a very different set of of pros and cons or

0:41:24.440 --> 0:41:27.719
<v Speaker 1>or set of preferences I should say about music than

0:41:27.760 --> 0:41:31.000
<v Speaker 1>I do. Then we might hit a wall. But there's

0:41:31.040 --> 0:41:34.360
<v Speaker 1>some quantitative elements such as the amount of variation in

0:41:34.400 --> 0:41:38.160
<v Speaker 1>a piece and whether the music generated fits whatever genre

0:41:38.239 --> 0:41:41.440
<v Speaker 1>you're aiming for, that you can use those. That's a

0:41:41.480 --> 0:41:44.719
<v Speaker 1>little bit easier because it's a quantitative or more or

0:41:44.800 --> 0:41:47.399
<v Speaker 1>less a quantitative element. But pretty soon you get into

0:41:47.440 --> 0:41:49.680
<v Speaker 1>more subjective territory, and that's where it all breaks down.

0:41:50.200 --> 0:41:53.480
<v Speaker 1>At the moment, machines are better at interpreting and combining

0:41:53.560 --> 0:41:57.480
<v Speaker 1>musical pieces than they aren't creating something entirely new. For example,

0:41:57.840 --> 0:42:00.239
<v Speaker 1>David Cope, who is a professor emerit us at the

0:42:00.320 --> 0:42:04.000
<v Speaker 1>University of California, Santa Cruz, is also a composer, launched

0:42:04.000 --> 0:42:07.520
<v Speaker 1>a project called Experiments in Musical Intelligence many years ago

0:42:07.920 --> 0:42:11.480
<v Speaker 1>and use the computer program to analyze various classical composers

0:42:11.560 --> 0:42:15.920
<v Speaker 1>musical work. Then the program would construct new pieces using

0:42:16.160 --> 0:42:19.920
<v Speaker 1>the elements it had analyzed as building blocks for that piece.

0:42:20.040 --> 0:42:23.799
<v Speaker 1>So the program wasn't really writing something entirely new, but

0:42:23.920 --> 0:42:28.480
<v Speaker 1>rather combining found elements in new ways. Now, perhaps in

0:42:28.520 --> 0:42:31.000
<v Speaker 1>the future machines will be able to make art on

0:42:31.040 --> 0:42:34.440
<v Speaker 1>their own with minimal human input, and if that happens,

0:42:34.440 --> 0:42:37.560
<v Speaker 1>we'll likely have to face some tough philosophical questions about

0:42:37.600 --> 0:42:40.440
<v Speaker 1>the nature of art. If a machine doesn't possess self

0:42:40.480 --> 0:42:44.359
<v Speaker 1>awareness or consciousness and really is just a complicated set

0:42:44.400 --> 0:42:48.520
<v Speaker 1>of equations that generate data according to some general rules,

0:42:49.280 --> 0:42:54.319
<v Speaker 1>is its production actually art? Is intent required for it

0:42:54.400 --> 0:42:56.680
<v Speaker 1>to be art? Does the artist have to intend something

0:42:56.719 --> 0:42:59.520
<v Speaker 1>in order for it to be art? If people enjoy

0:42:59.600 --> 0:43:04.080
<v Speaker 1>the work and find it intellectually or emotionally stimulating, does

0:43:04.160 --> 0:43:08.320
<v Speaker 1>that make it real music? If if I like something

0:43:08.600 --> 0:43:10.920
<v Speaker 1>and I find out later on that a computer generated

0:43:10.920 --> 0:43:13.919
<v Speaker 1>it completely from start to finish, does that at all

0:43:14.040 --> 0:43:17.239
<v Speaker 1>lesson the value of that music? Or does the fact

0:43:17.280 --> 0:43:19.719
<v Speaker 1>that I like it mean that it's quote unquote real.

0:43:20.760 --> 0:43:23.080
<v Speaker 1>We're none at the stage right now where those questions

0:43:23.080 --> 0:43:25.719
<v Speaker 1>need urgent answers, but I do think they're really interesting,

0:43:25.960 --> 0:43:28.760
<v Speaker 1>and now it's time that we play our own music

0:43:29.000 --> 0:43:31.000
<v Speaker 1>and get the heck out of here. So if you

0:43:31.000 --> 0:43:34.520
<v Speaker 1>guys have any suggestions for future episodes of tech Stuff,

0:43:35.440 --> 0:43:37.919
<v Speaker 1>right to me. Let me know what you think. Our

0:43:38.040 --> 0:43:41.680
<v Speaker 1>email address is tech Stuff at how stuff works dot com,

0:43:41.960 --> 0:43:44.520
<v Speaker 1>or you can drop me a line on Twitter or Facebook.

0:43:44.840 --> 0:43:46.520
<v Speaker 1>The handle for the show at both of those is

0:43:46.560 --> 0:43:50.880
<v Speaker 1>tech Stuff hsw on Wednesdays and Friday's. I record in

0:43:51.000 --> 0:43:54.080
<v Speaker 1>the studio and you can watch me live on twitch

0:43:54.160 --> 0:43:57.920
<v Speaker 1>dot tv slash tech Stuff. Watch as I struggle for

0:43:58.000 --> 0:44:03.120
<v Speaker 1>words and fail and then head desk and then tell

0:44:03.200 --> 0:44:05.520
<v Speaker 1>Dylan to pause the recording so I can come up

0:44:05.560 --> 0:44:07.680
<v Speaker 1>with something and then start the recording again. You get

0:44:07.719 --> 0:44:10.040
<v Speaker 1>to see the whole thing, so all the stuff that

0:44:10.080 --> 0:44:12.440
<v Speaker 1>gets cut out of the podcasts, you can watch it

0:44:12.520 --> 0:44:17.320
<v Speaker 1>happen live. Sometimes I dance. I hope to see you

0:44:17.400 --> 0:44:21.080
<v Speaker 1>Wednesdays and Fridays at twitch dot tv slash text Stuff

0:44:21.080 --> 0:44:30.440
<v Speaker 1>and I'll talk to you again really soon. For more

0:44:30.480 --> 0:44:32.799
<v Speaker 1>on this and thousands of other topics, because it has

0:44:32.800 --> 0:44:43.480
<v Speaker 1>stop works dot com