WEBVTT - Chocolate Chicken Chicken Cake

0:00:04.600 --> 0:00:08.360
<v Speaker 1>Sleepwalkers is a production of iHeartRadio and Unusual Productions.

0:00:12.360 --> 0:00:16.079
<v Speaker 1>So I'm here for a surprise poetry reading. It's about

0:00:16.120 --> 0:00:21.120
<v Speaker 1>to start. The silence is hardly final. Somewhere in the street,

0:00:21.400 --> 0:00:24.600
<v Speaker 1>I can see the trees begin to rise and fall

0:00:24.720 --> 0:00:27.319
<v Speaker 1>for the light of the dark thing above me. The

0:00:27.400 --> 0:00:29.600
<v Speaker 1>dream is like a shiny black hair, and the sun

0:00:29.760 --> 0:00:32.800
<v Speaker 1>is like a dream. I stand up and watch the

0:00:32.800 --> 0:00:35.800
<v Speaker 1>sun shine on a single day, and the sun has

0:00:35.840 --> 0:00:39.320
<v Speaker 1>a chance to accomplish from the springs of my own delight.

0:00:40.760 --> 0:00:45.960
<v Speaker 1>Kind of haunting, abstract, yes, but beautiful too. And crucially,

0:00:47.400 --> 0:00:50.360
<v Speaker 1>when I read this, I felt, as you just did.

0:00:50.400 --> 0:00:53.120
<v Speaker 1>I hope that it is beautiful. I found it evocative

0:00:53.240 --> 0:00:55.280
<v Speaker 1>of experiences that I've had in the past. I

0:00:55.280 --> 0:00:58.279
<v Speaker 1>found it nostalgic, um, and I'm like, oh my god,

0:00:58.320 --> 0:01:01.720
<v Speaker 1>I'm having real human being emotions. That was filmmaker Oscar Sharp,

0:01:02.000 --> 0:01:05.639
<v Speaker 1>and that poem wasn't written by him or by anyone else.

0:01:06.200 --> 0:01:10.119
<v Speaker 1>It was written by a computer, a machine poet. We're

0:01:10.120 --> 0:01:13.160
<v Speaker 1>more and more worried about robots coming to take our jobs.

0:01:13.400 --> 0:01:16.160
<v Speaker 1>And though perhaps few would regret less hipster poets

0:01:16.160 --> 0:01:20.280
<v Speaker 1>shambling around Brooklyn, somehow machines in the creative world are

0:01:20.480 --> 0:01:24.800
<v Speaker 1>especially uncanny, even frightening, because poetry and music and humor

0:01:24.840 --> 0:01:27.560
<v Speaker 1>are supposed to be the things that define our humanity,

0:01:27.600 --> 0:01:30.720
<v Speaker 1>aren't they? In this episode, we look at how AI

0:01:30.920 --> 0:01:33.520
<v Speaker 1>is being used in the creative arts, and in doing so,

0:01:33.959 --> 0:01:36.920
<v Speaker 1>we understand a lot more about how this often intimidating

0:01:36.959 --> 0:01:55.520
<v Speaker 1>technology actually works. I'm Oz Woloshyn, and welcome to Sleepwalkers. Hi,

0:01:56.040 --> 0:02:01.120
<v Speaker 1>Hi, Kara. So did Oscar's poem spring your delight? I

0:02:01.160 --> 0:02:04.040
<v Speaker 1>would have preferred it in your British accent, I think. Well,

0:02:04.040 --> 0:02:06.040
<v Speaker 1>I can't blame you for that. It did remind me

0:02:06.080 --> 0:02:08.360
<v Speaker 1>of, um, my friend is in a band called the

0:02:08.520 --> 0:02:12.040
<v Speaker 1>X Ambassadors and they partnered with this producer called Alex

0:02:12.160 --> 0:02:15.040
<v Speaker 1>da Kid, who was asked by IBM to make a

0:02:15.080 --> 0:02:17.799
<v Speaker 1>song using Watson. And the song is not bad. The

0:02:17.840 --> 0:02:21.200
<v Speaker 1>song is not bad. Um, it sounds good. But basically,

0:02:21.840 --> 0:02:24.519
<v Speaker 1>the way in which they used Watson was that they

0:02:24.680 --> 0:02:29.640
<v Speaker 1>crunched the twenty six thousand songs from the top one hundred charts.

0:02:30.400 --> 0:02:32.359
<v Speaker 1>It's from over what time period, like over a few

0:02:32.400 --> 0:02:35.960
<v Speaker 1>years, I presume? I don't know. But the point is that

0:02:36.000 --> 0:02:39.640
<v Speaker 1>they used them to discover patterns in the songs. What

0:02:39.800 --> 0:02:42.120
<v Speaker 1>makes a top hundred song, basically. That's right, and then

0:02:42.160 --> 0:02:44.560
<v Speaker 1>they reproduce it. Right, which is interesting because I think

0:02:44.560 --> 0:02:46.640
<v Speaker 1>anybody could kind of tell you what makes a top

0:02:46.680 --> 0:02:49.040
<v Speaker 1>one hundred song, right? They're called earworms. But when you

0:02:49.040 --> 0:02:52.200
<v Speaker 1>think about a data set of twenty six thousand, like,

0:02:52.320 --> 0:02:54.200
<v Speaker 1>no human being can listen to that many songs and

0:02:54.400 --> 0:02:57.320
<v Speaker 1>do anything productive after they hear it. So in this episode,

0:02:57.360 --> 0:03:00.359
<v Speaker 1>we're going to look at AI and art and different

0:03:00.400 --> 0:03:03.800
<v Speaker 1>kinds of fields to really understand how computers crunch data

0:03:03.919 --> 0:03:06.840
<v Speaker 1>to crack open this creative code. But first I want

0:03:06.880 --> 0:03:09.400
<v Speaker 1>to go back to the algorithm who wrote the poem

0:03:09.440 --> 0:03:11.920
<v Speaker 1>we heard at the beginning of the episode, because that

0:03:12.000 --> 0:03:15.239
<v Speaker 1>algorithm also wrote a film. It's the Steven Spielberg of the

0:03:15.280 --> 0:03:18.799
<v Speaker 1>machine learning world, and the algorithm is called Benjamin. So

0:03:18.880 --> 0:03:22.080
<v Speaker 1>we're going to meet Benjamin and a few people not

0:03:22.240 --> 0:03:26.480
<v Speaker 1>named Benjamin. I was a speechwriter for John Kerry,

0:03:27.040 --> 0:03:30.000
<v Speaker 1>Tim Geithner, and Barack Obama, um, not in that order,

0:03:30.080 --> 0:03:32.640
<v Speaker 1>And I'm essentially a ghost writer and a photographer who

0:03:32.760 --> 0:03:37.040
<v Speaker 1>learned to code one giant Frankenstein monster. That's Ross Goodwin

0:03:37.480 --> 0:03:41.240
<v Speaker 1>and that Frankenstein monster it goes by Benjamin and it's

0:03:41.280 --> 0:03:44.400
<v Speaker 1>the work of Ross Goodwin and Oscar Sharp, who you'll

0:03:44.400 --> 0:03:47.680
<v Speaker 1>remember from that poetry reading, but neither of them actually

0:03:47.800 --> 0:03:51.800
<v Speaker 1>named Benjamin. Well, it named itself, or rather, there was

0:03:51.840 --> 0:03:53.520
<v Speaker 1>a piece of paper that came out of it that

0:03:53.600 --> 0:03:55.840
<v Speaker 1>said my name is Benjamin on it. I read that

0:03:56.040 --> 0:03:58.360
<v Speaker 1>in response to a question that was put to it,

0:03:58.760 --> 0:04:02.480
<v Speaker 1>and a room full of people went, oh, a program

0:04:02.520 --> 0:04:06.080
<v Speaker 1>that names itself is rather uncanny. And Oscar had been

0:04:06.120 --> 0:04:08.400
<v Speaker 1>chasing that uncanniness ever since he was at NYU

0:04:08.560 --> 0:04:11.280
<v Speaker 1>for graduate school. Whenever I met anyone who

0:04:11.280 --> 0:04:14.160
<v Speaker 1>could program, I would grab them by the lapels and

0:04:14.280 --> 0:04:16.440
<v Speaker 1>yell into their face, can you, can you build something

0:04:16.480 --> 0:04:19.520
<v Speaker 1>that can write like people talk in some way? And

0:04:19.600 --> 0:04:22.720
<v Speaker 1>one day in class Oscar notices there's this sneakerhead and

0:04:22.720 --> 0:04:25.000
<v Speaker 1>he's sitting on his laptop and his laptop is writing

0:04:25.160 --> 0:04:30.080
<v Speaker 1>without him touching it. And I'm like, oh, so we go,

0:04:30.320 --> 0:04:33.360
<v Speaker 1>So we go for coffee, and that coffee was...

0:04:33.400 --> 0:04:35.560
<v Speaker 1>It was a lengthy coffee. We're still having that coffee.

0:04:35.560 --> 0:04:39.719
<v Speaker 1>We're still having such a cold coffee by now. You

0:04:39.800 --> 0:04:41.960
<v Speaker 1>might not believe it if it happened in a film,

0:04:42.080 --> 0:04:44.800
<v Speaker 1>but Oscar had stumbled on exactly the person he was

0:04:44.839 --> 0:04:47.680
<v Speaker 1>looking for. Oscar came to me and he said, I

0:04:47.720 --> 0:04:50.239
<v Speaker 1>want to make a movie from a computer generated screenplay.

0:04:50.360 --> 0:04:52.120
<v Speaker 1>And I said, you know, of course that sounds amazing.

0:04:52.200 --> 0:04:54.280
<v Speaker 1>Let's do it. But let's figure out how we're going

0:04:54.279 --> 0:04:57.480
<v Speaker 1>to generate the screenplay, because that's a nuanced process with

0:04:57.520 --> 0:04:59.679
<v Speaker 1>lots of steps, and we need to consider, like, every

0:04:59.680 --> 0:05:03.760
<v Speaker 1>part of it. So Oscar volunteered himself to teach me all

0:05:03.880 --> 0:05:07.800
<v Speaker 1>the things about storytelling and narrative and filmmaking. He turned

0:05:07.800 --> 0:05:11.360
<v Speaker 1>me onto, like, Vladimir Propp, Joseph Campbell, um, all these

0:05:11.400 --> 0:05:16.000
<v Speaker 1>theories of storytelling, and so they begin to experiment. I

0:05:16.040 --> 0:05:19.400
<v Speaker 1>tried a bunch of prototypes that used like various structures

0:05:19.440 --> 0:05:23.159
<v Speaker 1>that had been postulated by these theorists over time, and

0:05:23.360 --> 0:05:27.680
<v Speaker 1>the output was not interesting. Despite following the rules laid

0:05:27.680 --> 0:05:31.680
<v Speaker 1>out by narrative theorists, Ross couldn't get anything good just

0:05:31.760 --> 0:05:35.599
<v Speaker 1>telling his programs what a story should contain. So a

0:05:35.680 --> 0:05:39.000
<v Speaker 1>year passes, Oscar moves to LA, and when I

0:05:39.040 --> 0:05:42.320
<v Speaker 1>get this email from Ross and it's the results of

0:05:42.360 --> 0:05:43.960
<v Speaker 1>one of those experiments that he wants me to read,

0:05:44.200 --> 0:05:47.240
<v Speaker 1>and read it he did. Ross had emailed the poem from

0:05:47.240 --> 0:05:50.480
<v Speaker 1>the beginning of the episode, the room is blown away

0:05:50.480 --> 0:05:53.240
<v Speaker 1>from the door and the stones are beginning to shine.

0:05:53.760 --> 0:05:55.839
<v Speaker 1>I immediately was like, oh my god, I don't know

0:05:55.839 --> 0:05:57.400
<v Speaker 1>how he's doing this. But he said, I don't know

0:05:57.400 --> 0:06:00.120
<v Speaker 1>what technology you're using right now, but can we use it

0:06:00.200 --> 0:06:03.680
<v Speaker 1>for a screenplay? And so they did, and not just a screenplay,

0:06:03.720 --> 0:06:06.680
<v Speaker 1>they actually produced a short film called Sunspring and they

0:06:06.720 --> 0:06:09.839
<v Speaker 1>even got Thomas Middleditch, the lead on HBO's Silicon Valley,

0:06:10.000 --> 0:06:15.599
<v Speaker 1>to star in it. Principle is completely constructed. Of the

0:06:15.600 --> 0:06:18.320
<v Speaker 1>same time, it's all about you. To be true, you

0:06:18.360 --> 0:06:20.279
<v Speaker 1>didn't even watch the movie with the rest of the base.

0:06:20.360 --> 0:06:22.880
<v Speaker 1>I don't know, I don't care. I know it's a

0:06:22.920 --> 0:06:26.279
<v Speaker 1>consequence whatever you need to know about the presence of

0:06:26.320 --> 0:06:29.120
<v Speaker 1>the story. I'm a little bit of a boy on

0:06:29.160 --> 0:06:32.680
<v Speaker 1>the floor. So what do you think, Kara? It kind

0:06:32.680 --> 0:06:34.440
<v Speaker 1>of reminds me of when my parents used to take

0:06:34.480 --> 0:06:37.520
<v Speaker 1>me to like a bad production of Macbeth or As

0:06:37.560 --> 0:06:41.880
<v Speaker 1>You Like It. Traumatic. You're there and you're seven or eight,

0:06:42.200 --> 0:06:44.880
<v Speaker 1>and you want to understand what's going on, and so

0:06:45.160 --> 0:06:47.640
<v Speaker 1>you kind of pay as close attention as you possibly

0:06:47.680 --> 0:06:50.520
<v Speaker 1>can to what the actors are doing because you have

0:06:50.720 --> 0:06:53.840
<v Speaker 1>no idea what the dialogue means. Yeah, I mean that

0:06:53.920 --> 0:06:57.279
<v Speaker 1>to me is quite impressive because a machine can create

0:06:57.360 --> 0:07:00.120
<v Speaker 1>something which has enough of the elements in common with a

0:07:00.240 --> 0:07:02.400
<v Speaker 1>film that we can talk about it as a real film. You

0:07:02.440 --> 0:07:05.080
<v Speaker 1>can't say it's not a film. Absolutely. Of course, what's

0:07:05.080 --> 0:07:07.680
<v Speaker 1>different is it didn't take Benjamin very long at all

0:07:07.800 --> 0:07:10.800
<v Speaker 1>to make it. Once you press the button fraction of

0:07:10.800 --> 0:07:12.760
<v Speaker 1>a second. There was a couple of seconds per page, maybe,

0:07:12.800 --> 0:07:14.480
<v Speaker 1>maybe a couple of seconds total, actually a fraction of

0:07:14.480 --> 0:07:17.880
<v Speaker 1>a second per page. That's right. After months of agonizing

0:07:17.920 --> 0:07:21.720
<v Speaker 1>over centuries of storytelling theory, the final output only took

0:07:21.800 --> 0:07:25.640
<v Speaker 1>a couple of seconds. So what was Ross's breakthrough? To

0:07:25.760 --> 0:07:28.080
<v Speaker 1>understand, we turned to one of the most famous AI

0:07:28.120 --> 0:07:32.760
<v Speaker 1>scientists in the world, Sebastian Thrun. Recently, something magical had

0:07:32.760 --> 0:07:37.800
<v Speaker 1>happened recently. The field has discovered what's called machine learning.

0:07:38.080 --> 0:07:41.680
<v Speaker 1>With AI, computers can now find their own rules. They

0:07:41.720 --> 0:07:45.440
<v Speaker 1>are called neural networks. They're comprised of hundreds of millions

0:07:45.440 --> 0:07:48.280
<v Speaker 1>of little, very simple processing units, and those units are

0:07:48.360 --> 0:07:51.800
<v Speaker 1>modeled after what neurons do in our physical brains.

0:07:52.040 --> 0:07:54.640
<v Speaker 1>You just give them examples, very much like the way

0:07:54.680 --> 0:07:58.600
<v Speaker 1>we raise children. We don't give our children rules

0:07:58.640 --> 0:08:01.560
<v Speaker 1>for every contingency in life. In the first eight years

0:08:01.560 --> 0:08:04.720
<v Speaker 1>of education. We let them learn, They experience the world,

0:08:05.120 --> 0:08:07.880
<v Speaker 1>and lo and behold, they make their own rules. And

0:08:07.880 --> 0:08:09.560
<v Speaker 1>we are now in the world where computers can do

0:08:09.680 --> 0:08:12.360
<v Speaker 1>the same thing. And this means machine learning can be

0:08:12.440 --> 0:08:16.160
<v Speaker 1>used in all kinds of different fields. Sebastian himself applied

0:08:16.160 --> 0:08:19.240
<v Speaker 1>the technology at Google, where he led the initial development

0:08:19.280 --> 0:08:21.840
<v Speaker 1>of their self-driving car. When you want to write

0:08:21.880 --> 0:08:24.720
<v Speaker 1>a book, a book on like what the car should

0:08:24.760 --> 0:08:27.640
<v Speaker 1>do in every situation, that rule book is really complicated

0:08:27.680 --> 0:08:29.680
<v Speaker 1>and I can promise you no matter how many years

0:08:29.760 --> 0:08:33.040
<v Speaker 1>you spend writing it, it's not gonna work. But when

0:08:33.080 --> 0:08:36.360
<v Speaker 1>you give the machine the ability to learn its own rules,

0:08:36.640 --> 0:08:40.040
<v Speaker 1>it is actually able to surpass how people can drive.

0:08:41.200 --> 0:08:45.079
<v Speaker 1>We'll hear more from Sebastian later, but machine learning, ML,

0:08:45.520 --> 0:08:48.040
<v Speaker 1>is the engine that drives almost all of the excitement

0:08:48.080 --> 0:08:52.280
<v Speaker 1>about AI today, from identifying targets on the battlefield to

0:08:52.600 --> 0:08:56.640
<v Speaker 1>understanding genetic diseases. And it's also what allowed Ross and

0:08:56.640 --> 0:08:59.959
<v Speaker 1>Oscar to create a usable movie script. Rather than laying

0:09:00.000 --> 0:09:04.199
<v Speaker 1>down storytelling rules, they simply showed Benjamin hundreds of examples

0:09:04.520 --> 0:09:12.160
<v Speaker 1>and the algorithm found patterns and learned for itself. More Sleepwalkers

0:09:12.360 --> 0:09:23.160
<v Speaker 1>after the break. You're like, oh, did we, essentially, we

0:09:23.240 --> 0:09:27.000
<v Speaker 1>teach this algorithm anything else about screenplay other than just

0:09:27.040 --> 0:09:30.280
<v Speaker 1>putting in a bunch of screenplays, right, And that's the

0:09:30.320 --> 0:09:33.680
<v Speaker 1>way that machine learning works. What is happening in a

0:09:33.760 --> 0:09:36.439
<v Speaker 1>deep learning algorithm of this kind is it's building an

0:09:36.480 --> 0:09:41.199
<v Speaker 1>extraordinarily complicated mathematical formula by reading all of this stuff

0:09:41.240 --> 0:09:43.439
<v Speaker 1>over and over again, like the auto complete on your phone.

0:09:43.440 --> 0:09:46.160
<v Speaker 1>The neural net is actually sampling from a probability distribution

0:09:46.200 --> 0:09:49.640
<v Speaker 1>of which letters, spaces, or punctuation come next. So the

0:09:49.679 --> 0:09:53.240
<v Speaker 1>script for Sunspring was essentially the most mathematically probable

0:09:53.280 --> 0:09:56.800
<v Speaker 1>sci-fi script, except Ross and Oscar did have one

0:09:56.880 --> 0:10:01.040
<v Speaker 1>important lever of creative control. The other parameters that

0:10:01.080 --> 0:10:04.480
<v Speaker 1>you're probably wondering about, there's one called the temperature, which is

0:10:04.480 --> 0:10:08.240
<v Speaker 1>the riskiness of those next letter predictions. What Ross is

0:10:08.280 --> 0:10:12.200
<v Speaker 1>describing is almost like a dial for creativity. Turn it

0:10:12.240 --> 0:10:14.680
<v Speaker 1>up to a really high temperature, and the neural net

0:10:14.720 --> 0:10:17.880
<v Speaker 1>is going to be extra creative and start making up words,

0:10:17.960 --> 0:10:22.600
<v Speaker 1>babbling at a very high temperature. It's essentially drunk. Low temperature,

0:10:22.600 --> 0:10:25.200
<v Speaker 1>it's going to be very repetitive and possibly even begin

0:10:25.240 --> 0:10:28.760
<v Speaker 1>to plagiarize its source material. So it'll be very repetitive.

0:10:28.760 --> 0:10:30.480
<v Speaker 1>It'll be like the streets and the streets, and the

0:10:30.520 --> 0:10:34.400
<v Speaker 1>streets and the streets. It's essentially what's working for network television. Yeah, exactly.

0:10:34.480 --> 0:10:36.280
<v Speaker 1>So we wanted it to be sort of in the middle.

0:10:37.000 --> 0:10:39.280
<v Speaker 1>In the middle is where we found the best output

0:10:39.440 --> 0:10:44.280
<v Speaker 1>and the most I think usable output, and Sunspring was born.

0:10:45.960 --> 0:10:49.320
<v Speaker 1>So Benjamin, Ross, and Oscar write together now. They write

0:10:49.320 --> 0:10:53.080
<v Speaker 1>poetry and movies and sometimes what Benjamin spits out is good.

0:10:53.600 --> 0:10:55.360
<v Speaker 1>Often they have to sift through it to find the

0:10:55.400 --> 0:10:59.440
<v Speaker 1>best stuff. But he's prolific and he never ever suffers

0:10:59.440 --> 0:11:04.520
<v Speaker 1>from writer's block. So Kara was telling us earlier about

0:11:04.559 --> 0:11:07.320
<v Speaker 1>Alex da Kid and using AI to make music, and

0:11:07.320 --> 0:11:09.199
<v Speaker 1>that's something I want to understand a bit more about.

0:11:09.559 --> 0:11:13.640
<v Speaker 1>So Julian went on a little bit of an expedition. Yes,

0:11:13.720 --> 0:11:15.920
<v Speaker 1>I did. I've been seeing a lot of articles lately

0:11:15.960 --> 0:11:18.040
<v Speaker 1>about AI and the arts, and I've been pretty curious

0:11:18.040 --> 0:11:20.760
<v Speaker 1>about music specifically. We might take it for granted, but

0:11:20.920 --> 0:11:24.040
<v Speaker 1>music is this primal emotional thing that's been with us forever.

0:11:24.080 --> 0:11:27.240
<v Speaker 1>It might even predate language. But now Warner Music Group

0:11:27.320 --> 0:11:30.360
<v Speaker 1>made history in April twenty nineteen as the first major

0:11:30.440 --> 0:11:33.800
<v Speaker 1>label to sign an AI to a record deal. Yeah,

0:11:33.840 --> 0:11:37.720
<v Speaker 1>they signed this bot called Endel, which makes ambient noises

0:11:37.760 --> 0:11:39.840
<v Speaker 1>based on where you are and what the weather is

0:11:39.880 --> 0:11:41.760
<v Speaker 1>and what time of day it is. When I think

0:11:41.800 --> 0:11:44.240
<v Speaker 1>of this kind of music, I think of those Spotify

0:11:44.320 --> 0:11:48.880
<v Speaker 1>playlists like Peaceful Piano and Blissed Out Dinner Party, which

0:11:48.920 --> 0:11:52.440
<v Speaker 1>have become extremely popular. It's not the same thing as Beyoncé. No,

0:11:52.640 --> 0:11:56.280
<v Speaker 1>definitely not. But Warner Music Group signed Endel to generate

0:11:56.320 --> 0:11:59.600
<v Speaker 1>twenty albums of ambient music. And now that we live

0:11:59.600 --> 0:12:03.120
<v Speaker 1>in a world where AIs can get record deals, what

0:12:03.240 --> 0:12:05.559
<v Speaker 1>does this mean for artists? What does this mean for

0:12:05.600 --> 0:12:08.920
<v Speaker 1>even just music as we know it? Well, in my

0:12:09.000 --> 0:12:12.000
<v Speaker 1>quest to find out, I visited this company called Amper.

0:12:12.640 --> 0:12:15.840
<v Speaker 1>My name is Drew Silverstein. I am the co founder

0:12:15.920 --> 0:12:18.720
<v Speaker 1>and CEO of Amper Music. Amper is an AI music

0:12:18.760 --> 0:12:22.120
<v Speaker 1>company that Drew says will enable anyone to create music.

0:12:22.360 --> 0:12:24.320
<v Speaker 1>In fact, the only things you need to know are

0:12:24.600 --> 0:12:27.160
<v Speaker 1>the genre of music you want to create, the mood

0:12:27.440 --> 0:12:29.400
<v Speaker 1>you'd like to convey, and the length of your piece

0:12:29.400 --> 0:12:31.760
<v Speaker 1>of music. That's all you know. You can create a

0:12:31.760 --> 0:12:34.160
<v Speaker 1>brand new, unique piece of music in a matter of seconds.

0:12:34.280 --> 0:12:39.679
<v Speaker 1>So the big question is should musicians worry about computers

0:12:39.720 --> 0:12:43.400
<v Speaker 1>taking their job? Well, let's try it and see. So

0:12:43.760 --> 0:12:48.200
<v Speaker 1>what do you want to do? Cinematic, documentary, folk? Cinematic. Cinematic,

0:12:48.640 --> 0:12:53.960
<v Speaker 1>minimal percussion or quirky percussion. It's rendering a song right now.

0:12:54.000 --> 0:13:00.520
<v Speaker 1>And here we go. We've got something. Um, we're deep

0:13:00.559 --> 0:13:04.439
<v Speaker 1>in the forest of Nicaragua. There's a breed of jaguar.

0:13:05.360 --> 0:13:07.960
<v Speaker 1>You might have heard of it. It's called the take

0:13:08.000 --> 0:13:12.480
<v Speaker 1>a Killer panther. Look, here's the thing. I don't know

0:13:12.760 --> 0:13:17.439
<v Speaker 1>what the difference between that and music is. I really don't. Yeah,

0:13:17.440 --> 0:13:20.760
<v Speaker 1>so you're wowed? Yeah. All right, so there's that. And

0:13:21.000 --> 0:13:24.520
<v Speaker 1>this isn't the only AI music app out there. Another

0:13:24.600 --> 0:13:28.440
<v Speaker 1>major player is called Magenta, and big surprise, they're at Google.

0:13:28.720 --> 0:13:31.160
<v Speaker 1>Magenta are using AI to create a ton of new tools.

0:13:31.240 --> 0:13:34.360
<v Speaker 1>From a piano genie that makes it impossible to play

0:13:34.400 --> 0:13:37.439
<v Speaker 1>bad notes to something that can generate drum loops, or

0:13:37.520 --> 0:13:40.560
<v Speaker 1>something that can even play piano duets with you. You

0:13:40.559 --> 0:13:44.560
<v Speaker 1>can even translate raw audio to a piano score. Raw audio,

0:13:44.640 --> 0:13:47.480
<v Speaker 1>like if I play just something raw on the piano,

0:13:47.880 --> 0:13:53.760
<v Speaker 1>raw audio like oh, literal raw audio, literal raw audio.

0:13:54.520 --> 0:13:57.040
<v Speaker 1>And Magenta has also trained a neural network just like

0:13:57.360 --> 0:14:00.240
<v Speaker 1>Ross and Oscar, only instead of sci fi scripts, they

0:14:00.320 --> 0:14:04.559
<v Speaker 1>trained on over four hundred performances by skilled pianists. They

0:14:04.600 --> 0:14:07.080
<v Speaker 1>fed it into the neural network and let me play

0:14:07.120 --> 0:14:09.480
<v Speaker 1>one of the piano experts. First. This is a real

0:14:09.640 --> 0:14:17.440
<v Speaker 1>piano player, all right, so nice? Right? Yeah, okay, ready

0:14:17.440 --> 0:14:28.680
<v Speaker 1>for the AI. What that's a computer that was all

0:14:28.680 --> 0:14:30.920
<v Speaker 1>a computer. I didn't ever play a human one. That's

0:14:30.920 --> 0:14:33.800
<v Speaker 1>a computer that was trained by a human playing piano.

0:14:34.480 --> 0:14:37.080
<v Speaker 1>And then how do you make a computer come up

0:14:37.120 --> 0:14:39.560
<v Speaker 1>with that? Right? So even though it's not a screenplay,

0:14:39.600 --> 0:14:41.520
<v Speaker 1>it's still data that you can feed a neural network

0:14:41.560 --> 0:14:44.440
<v Speaker 1>with to find patterns. And in this case, Magenta used

0:14:44.440 --> 0:14:47.680
<v Speaker 1>a data set from the Yamaha e-Piano Competition. So human

0:14:47.720 --> 0:14:50.480
<v Speaker 1>pianists played on these digital keyboards which recorded the nuances

0:14:50.480 --> 0:14:52.640
<v Speaker 1>of their performance, like how long they hit notes, and

0:14:52.680 --> 0:14:54.960
<v Speaker 1>it recorded all that information into a digital score that

0:14:55.000 --> 0:14:57.600
<v Speaker 1>a computer could interpret. And we've actually had that technology

0:14:57.640 --> 0:14:59.840
<v Speaker 1>for a while now, it's called MIDI. But training a neural

0:15:00.000 --> 0:15:02.080
<v Speaker 1>network on the data is new. See, the thing

0:15:02.120 --> 0:15:05.520
<v Speaker 1>that I come back to is that a computer doesn't

0:15:05.600 --> 0:15:09.040
<v Speaker 1>know it's playing music. So much of watching a musical

0:15:09.080 --> 0:15:13.480
<v Speaker 1>performance is knowing that this is coming from someone who

0:15:13.560 --> 0:15:18.280
<v Speaker 1>is emoting. Right, Yeah, there's actually there's an emotional communication happening, right,

0:15:18.360 --> 0:15:21.960
<v Speaker 1>that's right. I do think though the future is not

0:15:22.200 --> 0:15:26.240
<v Speaker 1>rejecting this. It's better to imagine what would Stravinsky have

0:15:26.440 --> 0:15:29.040
<v Speaker 1>done with this kind of technology, because Stravinsky is still

0:15:29.080 --> 0:15:42.600
<v Speaker 1>a musical genius. Right? Yeah, definitely. It's cool to listen

0:15:42.640 --> 0:15:45.840
<v Speaker 1>to those musical examples of machine learning because you can

0:15:45.880 --> 0:15:50.720
<v Speaker 1>really hear how the algorithm is reinterpreting existing material. Of course,

0:15:51.200 --> 0:15:54.040
<v Speaker 1>listening to the output is one thing. Tasting it is

0:15:54.120 --> 0:15:58.440
<v Speaker 1>quite another. The problem was that somebody had told me

0:15:58.480 --> 0:16:00.520
<v Speaker 1>that they had made the recipe for Stan, that it

0:16:00.640 --> 0:16:03.360
<v Speaker 1>was good, and what it was was a recipe called

0:16:03.440 --> 0:16:08.160
<v Speaker 1>Chocolate Baked and Serves. That's Janelle Shane. She's a research

0:16:08.200 --> 0:16:11.240
<v Speaker 1>scientist and the author of a blog called AI Weirdness.

0:16:11.800 --> 0:16:14.080
<v Speaker 1>She's talking about a recipe written by AI that she

0:16:14.120 --> 0:16:19.680
<v Speaker 1>actually cooked and ate. It starts out as a perfectly ordinary,

0:16:19.760 --> 0:16:24.000
<v Speaker 1>flourless chocolate brownie all the way until the very last ingredient,

0:16:24.400 --> 0:16:26.760
<v Speaker 1>which is a cup of horseradish. I knew I

0:16:26.800 --> 0:16:28.480
<v Speaker 1>was in trouble when I opened the oven door and

0:16:28.520 --> 0:16:33.760
<v Speaker 1>my eyes just started watering. It was yeah, it was terrible.

0:16:34.080 --> 0:16:36.960
<v Speaker 1>On her blog, Janelle experiments with putting AI to a

0:16:37.080 --> 0:16:40.800
<v Speaker 1>range of tasks, from writing new pickup lines to naming

0:16:40.840 --> 0:16:44.720
<v Speaker 1>Halloween costumes, and often her experiments with machine learning are

0:16:44.720 --> 0:16:49.640
<v Speaker 1>pretty revealing about us. It plays into this thought experiment,

0:16:49.760 --> 0:16:52.560
<v Speaker 1>what would an alien think of our world? It takes

0:16:52.560 --> 0:16:57.400
<v Speaker 1>something that's very ordinary and mixes it up into this

0:16:57.520 --> 0:17:01.560
<v Speaker 1>thing that sounds like the original, but the meaning has

0:17:01.600 --> 0:17:05.240
<v Speaker 1>been completely changed. Chopped whipping cream may be an ingredient

0:17:05.320 --> 0:17:08.840
<v Speaker 1>and fold water and roll it into cubes, or spread the

0:17:08.840 --> 0:17:12.080
<v Speaker 1>butter in the refrigerator. That's another direction that it came up with.

0:17:12.640 --> 0:17:15.560
<v Speaker 1>Remember Ross and Oscar playing with the creativity setting for

0:17:15.600 --> 0:17:19.439
<v Speaker 1>their scripts? Janelle plays with her bot's temperature too, so I

0:17:19.480 --> 0:17:22.160
<v Speaker 1>can turn it up and the neural net may choose

0:17:22.160 --> 0:17:24.880
<v Speaker 1>its second best or third best guess as to what

0:17:25.080 --> 0:17:28.200
<v Speaker 1>letter comes next. And if I turn the creativity all

0:17:28.200 --> 0:17:31.440
<v Speaker 1>the way down, then everything may be something like the the

0:17:32.560 --> 0:17:36.200
<v Speaker 1>the or recipes may be just you know, one teaspoon

0:17:36.240 --> 0:17:38.960
<v Speaker 1>of vanilla over and over and over again, because that's

0:17:39.040 --> 0:17:44.640
<v Speaker 1>just a very likely ingredient. It's really interesting with the

0:17:44.680 --> 0:17:48.879
<v Speaker 1>recipes to turn down the creativity and see what it

0:17:48.920 --> 0:17:52.320
<v Speaker 1>comes up with as the most quintessential recipes. At the

0:17:52.359 --> 0:17:55.320
<v Speaker 1>lowest setting, you may not get horseradish brownies, but

0:17:55.400 --> 0:17:57.320
<v Speaker 1>you do get a clear picture of what we eat

0:17:57.640 --> 0:18:00.399
<v Speaker 1>and who we are. I look at what kinds of

0:18:00.760 --> 0:18:03.200
<v Speaker 1>recipe titles it comes up with. There are things like

0:18:03.760 --> 0:18:07.919
<v Speaker 1>chocolate chicken chicken cake, and another one that's chocolate chocolate

0:18:08.000 --> 0:18:10.520
<v Speaker 1>chocolate chocolate cake. And there was a lot of cheese

0:18:10.560 --> 0:18:14.080
<v Speaker 1>in these recipes too, so it's kind of revealing about

0:18:14.119 --> 0:18:18.880
<v Speaker 1>what sorts of things we cook with. Then we like chocolate,

0:18:18.920 --> 0:18:21.960
<v Speaker 1>cheese and chicken apparently. But then I did the same

0:18:22.000 --> 0:18:26.920
<v Speaker 1>experiment with recipes from Bon Appétit, and then the most

0:18:26.920 --> 0:18:31.280
<v Speaker 1>common ingredients that it kept using were cilantro and pomegranate juice.

0:18:32.160 --> 0:18:35.000
<v Speaker 1>So these algorithms essentially hold up a mirror to the

0:18:35.080 --> 0:18:37.840
<v Speaker 1>data sets that we give them. They do, yeah, they

0:18:37.880 --> 0:18:41.800
<v Speaker 1>reflect the data sets back to us in really weird ways,

0:18:42.160 --> 0:18:45.280
<v Speaker 1>and they can absolutely pick up whatever bias there is

0:18:45.320 --> 0:18:48.240
<v Speaker 1>in an input data set. And I think what we're

0:18:48.280 --> 0:18:52.359
<v Speaker 1>discovering is just how prevalent that bias is and how

0:18:52.400 --> 0:18:55.840
<v Speaker 1>easy it is for neural networks to latch onto that

0:18:55.880 --> 0:18:59.720
<v Speaker 1>bias and copy it as a handy tool toward copying

0:18:59.720 --> 0:19:02.800
<v Speaker 1>whatever the humans are doing. They say that the

0:19:02.800 --> 0:19:05.679
<v Speaker 1>way to a person's heart is through their stomach. But

0:19:05.760 --> 0:19:09.160
<v Speaker 1>Janelle didn't stop at Chocolate Baked and Serves. She's also

0:19:09.200 --> 0:19:12.680
<v Speaker 1>turned AI onto some more direct routes. I really liked

0:19:12.720 --> 0:19:15.000
<v Speaker 1>the pickup lines. And there are all these puns and

0:19:15.040 --> 0:19:17.879
<v Speaker 1>all this wordplay that it didn't have any way to

0:19:17.880 --> 0:19:20.159
<v Speaker 1>grab hold of and figure out how to use. But

0:19:20.600 --> 0:19:25.880
<v Speaker 1>I think what it produced, this sort of charming surrealism

0:19:26.000 --> 0:19:30.480
<v Speaker 1>and kind of garbled nonsense, I think it's an improvement

0:19:30.640 --> 0:19:34.080
<v Speaker 1>on every single one of the originals. My very favorite

0:19:34.080 --> 0:19:37.240
<v Speaker 1>one is you look like a thing and I love you.

0:19:37.240 --> 0:19:39.400
<v Speaker 1>You are so beautiful that you make me feel better

0:19:39.480 --> 0:19:42.679
<v Speaker 1>to see you. Or you must be a tringle because

0:19:42.680 --> 0:19:46.800
<v Speaker 1>you're the only thing here. Are you a camera? Because

0:19:46.840 --> 0:19:50.480
<v Speaker 1>I want to see the most beautiful than you. Yeah,

0:19:49.640 --> 0:19:54.400
<v Speaker 1>I'll definitely lie with you. No one's ever used a

0:19:54.480 --> 0:19:56.960
<v Speaker 1>real pickup line on me. Used one on you? No.

0:19:57.640 --> 0:20:01.280
<v Speaker 1>Do I look like someone who would receive a pickup? Well,

0:20:01.320 --> 0:20:04.880
<v Speaker 1>here's one of them: I don't know you. That's good,

0:20:05.119 --> 0:20:07.760
<v Speaker 1>a lot of girls are into that. Are you a candle?

0:20:07.880 --> 0:20:12.639
<v Speaker 1>Because you're so hot of the looks with you. So

0:20:12.640 --> 0:20:15.680
<v Speaker 1>in effect, what the algorithm is doing is highlighting patterns

0:20:15.680 --> 0:20:18.360
<v Speaker 1>in the data. I mean, they sound structurally like pickup

0:20:18.400 --> 0:20:20.960
<v Speaker 1>lines, but the words themselves don't make any sense. The

0:20:21.000 --> 0:20:24.440
<v Speaker 1>machines are reflecting their creators and spitting back something which

0:20:24.880 --> 0:20:27.560
<v Speaker 1>resembles pickup lines and makes us think a little

0:20:27.560 --> 0:20:31.320
<v Speaker 1>bit more carefully about what a pickup line is. And

0:20:31.359 --> 0:20:34.280
<v Speaker 1>while training Benjamin, Ross and Oscar found the same thing:

0:20:34.840 --> 0:20:38.680
<v Speaker 1>as the algorithm learned, patterns revealed bias present in our cinema.

0:20:40.600 --> 0:20:43.199
<v Speaker 1>When you train an algorithm like Benjamin on millions and

0:20:43.240 --> 0:20:45.879
<v Speaker 1>millions, in this case, of synopses from the Internet of

0:20:45.920 --> 0:20:49.119
<v Speaker 1>the movies, the synopses that come out have certain patterns

0:20:49.160 --> 0:20:51.520
<v Speaker 1>in them. For example, they mention men four times more

0:20:51.520 --> 0:20:54.680
<v Speaker 1>often than they mention women. But you learn other

0:20:54.760 --> 0:20:57.120
<v Speaker 1>things about it than that. You learn that the most

0:20:57.119 --> 0:20:59.080
<v Speaker 1>common phrase in the output is a young

0:20:59.119 --> 0:21:02.040
<v Speaker 1>man in a small town. So what does a filmmaker

0:21:02.080 --> 0:21:04.600
<v Speaker 1>like Oscar learn from this? I used to call this

0:21:04.640 --> 0:21:06.960
<v Speaker 1>project the average movie project. And the reason I called

0:21:06.960 --> 0:21:09.359
<v Speaker 1>it that is the theory was for me, if you

0:21:09.359 --> 0:21:11.520
<v Speaker 1>could make the right kind of algorithm that the movie

0:21:11.560 --> 0:21:14.200
<v Speaker 1>that you would make that would be the theoretically perfect

0:21:14.280 --> 0:21:17.240
<v Speaker 1>movie would also be the most boring movie ever made,

0:21:17.480 --> 0:21:19.120
<v Speaker 1>and that it would be, by

0:21:19.119 --> 0:21:21.399
<v Speaker 1>definition, all of the things that were the most cliché,

0:21:21.640 --> 0:21:23.399
<v Speaker 1>because that's what cliché means, is the thing that

0:21:23.480 --> 0:21:26.400
<v Speaker 1>you can rely on to work. And why do that?

0:21:26.560 --> 0:21:28.679
<v Speaker 1>Because the thing I'm most interested in is doing the

0:21:28.720 --> 0:21:30.360
<v Speaker 1>thing that we haven't done yet. I want to move

0:21:30.400 --> 0:21:33.440
<v Speaker 1>the form forward. Seeing all of these biases and assumptions

0:21:33.480 --> 0:21:36.639
<v Speaker 1>that are baked into our movies and our snacks doesn't

0:21:36.680 --> 0:21:39.560
<v Speaker 1>mean we're doomed to repeat them. In fact, the awareness

0:21:39.560 --> 0:21:42.800
<v Speaker 1>can be liberating. That's what's helped me, I think, is

0:21:42.840 --> 0:21:46.600
<v Speaker 1>seeing Benjamin's capacity to show me more directly what it

0:21:46.680 --> 0:21:49.160
<v Speaker 1>is that are our patterns, our habits, and then

0:21:49.359 --> 0:21:51.600
<v Speaker 1>I can ask more easily how to move forward from that.

0:21:53.720 --> 0:22:05.000
<v Speaker 1>We'll get there after the break. So we've heard about

0:22:05.080 --> 0:22:08.240
<v Speaker 1>Janelle Shane using AI to reveal bias, and Ross and

0:22:08.280 --> 0:22:10.879
<v Speaker 1>Oscar using it to help them think more creatively about

0:22:10.920 --> 0:22:13.919
<v Speaker 1>filmmaking as well as how it can be applied to music.

0:22:14.520 --> 0:22:17.200
<v Speaker 1>And that's the great promise of AI. We may worry

0:22:17.200 --> 0:22:20.159
<v Speaker 1>about replacing jobs, but it can augment our lives in

0:22:20.200 --> 0:22:23.520
<v Speaker 1>so many ways. At least that's how Sebastian Thrun sees it.

0:22:24.480 --> 0:22:27.359
<v Speaker 1>I would say the term AI is a bit deceptive

0:22:27.359 --> 0:22:29.760
<v Speaker 1>because it sets up computers to be on equal par

0:22:29.760 --> 0:22:33.000
<v Speaker 1>with people. I see it to be stronger where we

0:22:33.040 --> 0:22:36.480
<v Speaker 1>are weak, and weaker where we're strong. It's not a

0:22:36.680 --> 0:22:40.680
<v Speaker 1>technology that will replace us, it's one that really empowers us.

0:22:40.680 --> 0:22:44.080
<v Speaker 1>But what might that empowerment look like beyond bias detection

0:22:44.240 --> 0:22:48.800
<v Speaker 1>and piano playing? Well, in 2017, Sebastian published a paper

0:22:48.840 --> 0:22:52.840
<v Speaker 1>in Nature on using AI to diagnose skin cancer using

0:22:52.840 --> 0:22:56.600
<v Speaker 1>just an iPhone. So in medicine, you can think of

0:22:56.640 --> 0:23:00.960
<v Speaker 1>your iPhone that can find skin cancer as turning regular

0:23:01.400 --> 0:23:04.879
<v Speaker 1>physicians or anybody in the world into an expert on

0:23:04.960 --> 0:23:07.320
<v Speaker 1>day one, because now they have the superpower to be

0:23:07.359 --> 0:23:10.359
<v Speaker 1>able to distinguish something that previously would have taken tens

0:23:10.359 --> 0:23:13.199
<v Speaker 1>of years to learn. The same is true for the

0:23:13.240 --> 0:23:16.239
<v Speaker 1>self-driving car. Now children can drive and

0:23:16.320 --> 0:23:18.240
<v Speaker 1>blind people can drive. I've had blind people drive around in

0:23:18.240 --> 0:23:21.800
<v Speaker 1>self-driving cars. So for me, the real opportunity is to

0:23:22.160 --> 0:23:24.800
<v Speaker 1>use AI to extract the knowledge from some human

0:23:24.840 --> 0:23:27.760
<v Speaker 1>experts that are well trained and transpose this knowledge to

0:23:27.840 --> 0:23:30.480
<v Speaker 1>other brains, of people not so well trained. This is

0:23:30.480 --> 0:23:33.560
<v Speaker 1>what I personally find so fascinating about AI and a

0:23:33.600 --> 0:23:37.400
<v Speaker 1>big reason I wanted to do Sleepwalkers. The same technology

0:23:37.560 --> 0:23:42.719
<v Speaker 1>underlies Ross and Oscar's films, Janelle's recipes and self-driving cars

0:23:42.720 --> 0:23:46.959
<v Speaker 1>and cancer diagnostics and so much more. The training for

0:23:47.359 --> 0:23:50.800
<v Speaker 1>skin cancer detection, or cancer detection in radiology, and the training

0:23:50.800 --> 0:23:55.040
<v Speaker 1>for the self-driving car are amazingly similar. In both cases,

0:23:55.080 --> 0:23:57.840
<v Speaker 1>what you do is you compile a data set, typically

0:23:57.920 --> 0:24:01.120
<v Speaker 1>hundreds of thousands up to hundreds of millions of images.

0:24:01.600 --> 0:24:04.760
<v Speaker 1>In skin cancer, we use biopsies. We had a database

0:24:04.800 --> 0:24:07.399
<v Speaker 1>of a hundred twenty nine thousand images that a lab

0:24:07.480 --> 0:24:11.159
<v Speaker 1>had biopsied and provided. In the self-driving car, it

0:24:11.560 --> 0:24:13.880
<v Speaker 1>could be as easy as having a human driver provide

0:24:13.920 --> 0:24:16.480
<v Speaker 1>inputs with their steering wheel and the brake

0:24:16.800 --> 0:24:19.119
<v Speaker 1>as to what the right thing is to do, and

0:24:19.200 --> 0:24:22.600
<v Speaker 1>then the network mimics human behavior, it mimics the diagnostics

0:24:22.680 --> 0:24:25.560
<v Speaker 1>of a physician, or it mimics the style of a driver.

0:24:25.800 --> 0:24:29.800
<v Speaker 1>The underlying algorithms are amazingly similar. What makes this moment

0:24:29.800 --> 0:24:32.119
<v Speaker 1>all the more interesting is that AI is in the

0:24:32.200 --> 0:24:37.160
<v Speaker 1>process of being consumerized. Like Sebastian said, your iPhone can

0:24:37.200 --> 0:24:41.199
<v Speaker 1>diagnose skin cancer. Self driving cars are already on the roads,

0:24:41.240 --> 0:24:44.639
<v Speaker 1>and as these tools become more and more accessible, society

0:24:44.800 --> 0:24:48.840
<v Speaker 1>will start to change. These technologies become closer and closer

0:24:48.880 --> 0:24:51.320
<v Speaker 1>to us. The fact that you carry your cell phone is

0:24:51.320 --> 0:24:53.960
<v Speaker 1>a big deal. You might not see it this way, but

0:24:54.119 --> 0:24:57.679
<v Speaker 1>what it does is put the computer seamlessly into your life.

0:24:58.040 --> 0:25:01.280
<v Speaker 1>Your texting app, your SMS is so close to you

0:25:01.520 --> 0:25:04.200
<v Speaker 1>that you can now talk to people thousands of miles

0:25:04.200 --> 0:25:06.600
<v Speaker 1>away on a button press or on a microphone.

0:25:06.960 --> 0:25:11.400
<v Speaker 1>That makes you effectively superhuman without the actual physical implant.

0:25:11.680 --> 0:25:14.400
<v Speaker 1>But when it comes to AI, for now, leading edge

0:25:14.400 --> 0:25:17.200
<v Speaker 1>algorithms are off limits to those of us who can't

0:25:17.240 --> 0:25:20.040
<v Speaker 1>code or who don't have the means to learn. Oscar

0:25:20.080 --> 0:25:24.000
<v Speaker 1>wouldn't have been able to make Sunspring without Ross's technical expertise,

0:25:24.680 --> 0:25:28.119
<v Speaker 1>but that's all starting to change. Julian, you spoke with

0:25:28.200 --> 0:25:31.480
<v Speaker 1>somebody making AI more accessible. Yeah, while we were looking

0:25:31.480 --> 0:25:34.040
<v Speaker 1>into AI and music, we came across Runway ML.

0:25:34.359 --> 0:25:37.680
<v Speaker 1>They're a lab based in Bushwick, and they feel strongly about

0:25:37.720 --> 0:25:41.399
<v Speaker 1>letting more people in to work with AI creatively. I spoke

0:25:41.400 --> 0:25:43.840
<v Speaker 1>with Cristóbal Valenzuela, the co-founder of Runway,

0:25:43.880 --> 0:25:46.560
<v Speaker 1>which is basically like the Adobe Creative Suite for AI.

0:25:46.760 --> 0:25:49.320
<v Speaker 1>So think of a program that looks like Photoshop. They're

0:25:49.320 --> 0:25:51.920
<v Speaker 1>adding tons of different AI models to the Runway app,

0:25:51.960 --> 0:25:54.199
<v Speaker 1>where instead of having to know how to code, you

0:25:54.200 --> 0:25:57.199
<v Speaker 1>can just manipulate some sliders and dials and still have

0:25:57.240 --> 0:25:59.800
<v Speaker 1>AI generate something. If we really think of this as

0:26:00.080 --> 0:26:02.439
<v Speaker 1>the game-changing technology that will impact us for, like,

0:26:02.560 --> 0:26:05.160
<v Speaker 1>years to come, we need to have more people from

0:26:05.160 --> 0:26:08.960
<v Speaker 1>different backgrounds and disciplines jumping into that discussion and proposing

0:26:09.000 --> 0:26:11.640
<v Speaker 1>ways of looking at algorithms that those researchers and those

0:26:11.640 --> 0:26:13.960
<v Speaker 1>scientists are not thinking of. This is gonna impact us,

0:26:14.040 --> 0:26:15.679
<v Speaker 1>and it's going to change the way we see not

0:26:15.840 --> 0:26:18.879
<v Speaker 1>just the world, but ourselves. Not just ourselves, but how

0:26:18.960 --> 0:26:21.440
<v Speaker 1>we think about our creativity. And the list of things

0:26:21.520 --> 0:26:24.440
<v Speaker 1>that Runway can help you do is frankly crazy. We'll

0:26:24.480 --> 0:26:28.280
<v Speaker 1>post some on our Instagram at Sleepwalkers podcast, but just

0:26:28.359 --> 0:26:31.480
<v Speaker 1>one example: you can take video of anyone and

0:26:31.520 --> 0:26:34.199
<v Speaker 1>have their body copy the poses that you make in

0:26:34.240 --> 0:26:37.639
<v Speaker 1>your webcam. So Cristóbal actually tweeted one of him controlling

0:26:37.680 --> 0:26:40.879
<v Speaker 1>Stephen Colbert's body on The Late Show just with his

0:26:40.920 --> 0:26:44.400
<v Speaker 1>webcam moving his arms. Stephen moves his arms. It's nuts, right,

0:26:44.560 --> 0:26:47.120
<v Speaker 1>And imagine that kind of technology in the hands of artists.

0:26:47.440 --> 0:26:50.440
<v Speaker 1>Start thinking about them as not like something that's gonna

0:26:50.480 --> 0:26:53.760
<v Speaker 1>destroy our creativity or gonna replace writers and artists or whatever.

0:26:53.880 --> 0:26:55.440
<v Speaker 1>This is gonna be a typewriter, This is gonna be

0:26:55.480 --> 0:26:58.239
<v Speaker 1>a paintbrush, and people will start building and using it

0:26:58.359 --> 0:27:02.600
<v Speaker 1>to understand their own creativity in a new way. Of course,

0:27:02.680 --> 0:27:05.440
<v Speaker 1>as always, it's up to us to make sure we

0:27:05.560 --> 0:27:09.879
<v Speaker 1>use these new tools for good. If I build a shovel, okay,

0:27:09.920 --> 0:27:12.280
<v Speaker 1>and you decide to go to the beach and dig in sand,

0:27:12.880 --> 0:27:15.200
<v Speaker 1>you're biased. You're digging in sand, and guess what? Your

0:27:15.200 --> 0:27:17.440
<v Speaker 1>shovel will only turn up sand. The same is true

0:27:17.480 --> 0:27:19.280
<v Speaker 1>for AI. If you give it a certain type of

0:27:19.359 --> 0:27:21.800
<v Speaker 1>data set, I can promise you whatever you get out

0:27:21.960 --> 0:27:24.600
<v Speaker 1>reflects the data you put in. It's up to us,

0:27:24.720 --> 0:27:27.960
<v Speaker 1>the people, to make responsible decisions. And if we want

0:27:28.000 --> 0:27:31.800
<v Speaker 1>to create equal opportunity and eradicate certain biases in society that

0:27:31.920 --> 0:27:34.480
<v Speaker 1>exist today, it's up to us to do it, and

0:27:34.520 --> 0:27:37.320
<v Speaker 1>I promise you, if you work hard on this, technologies

0:27:37.560 --> 0:27:41.280
<v Speaker 1>will reflect that. But even Sebastian, one of Silicon Valley's

0:27:41.320 --> 0:27:47.119
<v Speaker 1>great optimists, recognizes the risks. All technologies can harm people.

0:27:47.520 --> 0:27:50.480
<v Speaker 1>In fact, technologies can be abused to harm people. Like

0:27:50.920 --> 0:27:53.360
<v Speaker 1>my kitchen knife, which serves me a great purpose every

0:27:53.359 --> 0:27:56.280
<v Speaker 1>time I have guests over, in chopping my produce, can also

0:27:56.359 --> 0:28:01.240
<v Speaker 1>be abused to harm people. In the next episode of Sleepwalkers,

0:28:01.320 --> 0:28:04.760
<v Speaker 1>we dive deep into the ability of algorithms to cause harm.

0:28:04.800 --> 0:28:07.800
<v Speaker 1>We travel from China and the social credit system to

0:28:08.119 --> 0:28:10.800
<v Speaker 1>a parole board in New York and we speak with

0:28:10.840 --> 0:28:15.120
<v Speaker 1>people building technology they believe will make us safer. I'm

0:28:15.160 --> 0:28:31.440
<v Speaker 1>Oz Woloshyn. See you next time. Sleepwalkers is a production

0:28:31.480 --> 0:28:34.920
<v Speaker 1>of iHeartRadio and Unusual Productions. There's so much

0:28:34.960 --> 0:28:37.080
<v Speaker 1>we don't have time for in our episodes, but that

0:28:37.119 --> 0:28:39.680
<v Speaker 1>we'd love to share with you. So for the latest

0:28:39.720 --> 0:28:43.200
<v Speaker 1>AI news, live interviews, and behind the scenes footage, find

0:28:43.280 --> 0:28:47.440
<v Speaker 1>us on Instagram, at Sleepwalkers podcast or at sleepwalkers podcast

0:28:47.480 --> 0:28:50.360
<v Speaker 1>dot com. Special thanks on this episode to paw Suris,

0:28:50.440 --> 0:28:53.960
<v Speaker 1>who introduced us to Oscar Ross and Benjamin and to

0:28:54.200 --> 0:28:58.160
<v Speaker 1>artificial intelligence, which composed over half of the music in this episode.

0:28:58.520 --> 0:29:02.479
<v Speaker 1>Could you tell which was which? Sleepwalkers is hosted by

0:29:02.520 --> 0:29:05.640
<v Speaker 1>me, Oz Woloshyn, and co-hosted by me, Kara Price. We're

0:29:05.640 --> 0:29:08.600
<v Speaker 1>produced by Julian Weller with help from Jacopo Penzo and

0:29:08.680 --> 0:29:12.280
<v Speaker 1>Taylor Chacog. Mixing by Tristan McNeil and Julian Weller. Our

0:29:12.320 --> 0:29:15.800
<v Speaker 1>story editor is Matthew Riddle. Recording assistance this episode from

0:29:15.920 --> 0:29:20.160
<v Speaker 1>Joanne de Luna. Sleepwalkers is executive produced by me, Oz Woloshyn,

0:29:20.280 --> 0:29:23.600
<v Speaker 1>and Mangesh Hattikudur. For more podcasts from iHeartRadio,

0:29:23.680 --> 0:29:26.600
<v Speaker 1>visit the iHeartRadio app, Apple Podcasts, or wherever

0:29:26.640 --> 0:29:27.960
<v Speaker 1>you listen to your favorite shows.