WEBVTT - Creative Writing Machines, with Mike Sharples

0:00:03.040 --> 0:00:05.360
<v Speaker 1>Welcome to Stuff to Blow Your Mind production of My

0:00:05.480 --> 0:00:14.920
<v Speaker 1>Heart Radio. Hey you welcome to Stuff to Blow Your Mind.

0:00:15.160 --> 0:00:18.320
<v Speaker 1>My name is Robert Lamb and I'm Joe McCormick. Joe.

0:00:18.360 --> 0:00:21.400
<v Speaker 1>While you were out, I conducted an interview with educational

0:00:21.480 --> 0:00:25.759
<v Speaker 1>technologist Mike Sharple's on the book he wrote with Rafael

0:00:25.840 --> 0:00:31.040
<v Speaker 1>Perezi Perez, Story Machines, How Computers Have Become Creative Writers.

0:00:31.480 --> 0:00:34.919
<v Speaker 1>It's a fascinating read about the nature of storytelling, our

0:00:35.080 --> 0:00:38.320
<v Speaker 1>history of attempting to instill the spirit of storytelling into

0:00:38.360 --> 0:00:41.360
<v Speaker 1>the machine, where we seem to be going with this technology,

0:00:41.520 --> 0:00:45.360
<v Speaker 1>and quite remarkably where we are already. Well, I can't

0:00:45.400 --> 0:00:48.200
<v Speaker 1>wait to hear this interview. All right, Well, without further ado,

0:00:48.320 --> 0:00:52.200
<v Speaker 1>let's jump right in. Hi, Mike, welcome to the show.

0:00:52.479 --> 0:00:55.360
<v Speaker 1>I'm here. It's good to be at. The book, co

0:00:55.520 --> 0:01:00.560
<v Speaker 1>authored with Rafael Perez Perez is Story Machines, How Computers

0:01:00.560 --> 0:01:05.640
<v Speaker 1>Have Become Creative Writers, Publishing July five. It's a terrific read,

0:01:05.840 --> 0:01:07.679
<v Speaker 1>but it has to be stressed from the outside. That's

0:01:07.720 --> 0:01:10.800
<v Speaker 1>the storytelling isn't just a pastime that humans engage in.

0:01:11.319 --> 0:01:17.160
<v Speaker 1>Storytelling is something greater? Right? Yeah, So storytelling we suggest

0:01:17.200 --> 0:01:20.360
<v Speaker 1>in the book is something that's fundamental to human existence

0:01:21.000 --> 0:01:25.640
<v Speaker 1>and has been for millennia. It suggested that instead of

0:01:26.120 --> 0:01:30.440
<v Speaker 1>language coming first and then storytelling involving after that, perhaps

0:01:30.440 --> 0:01:35.880
<v Speaker 1>it's the other way around. That perhaps storytelling started as

0:01:35.920 --> 0:01:41.000
<v Speaker 1>a way of human communication through mine, through expressive gestures,

0:01:41.160 --> 0:01:44.520
<v Speaker 1>and then language followed from that. So we want to

0:01:44.520 --> 0:01:49.160
<v Speaker 1>make the point that storytelling is fundamental at a neural level,

0:01:49.560 --> 0:01:52.160
<v Speaker 1>it's how we make sense of the world. It's at

0:01:52.160 --> 0:01:56.200
<v Speaker 1>a cognitive level about how we create narratives to explain

0:01:56.200 --> 0:01:59.840
<v Speaker 1>our existence, and in the social world, it's the stories

0:02:00.000 --> 0:02:02.480
<v Speaker 1>we tell to each other that makes us who we are.

0:02:02.760 --> 0:02:06.240
<v Speaker 1>So when we're talking about machines engaging in storytelling or

0:02:06.320 --> 0:02:09.600
<v Speaker 1>story creation, we're really getting deep into human creativity and

0:02:09.680 --> 0:02:15.440
<v Speaker 1>human identity. Then, yeah, we are. And it's both a

0:02:15.480 --> 0:02:21.560
<v Speaker 1>fascinating insight into human creativity and it's also something the threat.

0:02:21.919 --> 0:02:27.120
<v Speaker 1>So for centuries, writers have been fascinated about this idea

0:02:27.200 --> 0:02:31.040
<v Speaker 1>that a that a machine might take over their craft,

0:02:31.400 --> 0:02:35.160
<v Speaker 1>that a machine might become a storyteller. It goes right

0:02:35.160 --> 0:02:40.040
<v Speaker 1>back to Jonathan Swift, for example, in Gulliver's Travels Coming

0:02:40.080 --> 0:02:46.320
<v Speaker 1>Across a Weird Academy, where apprentices were manipulating a story

0:02:46.360 --> 0:02:51.440
<v Speaker 1>machine which churned out academic texts right through to modern

0:02:51.440 --> 0:02:54.720
<v Speaker 1>writers such as Roll Dow. He wrote a short story

0:02:55.080 --> 0:02:58.880
<v Speaker 1>about how an author was selling his soul to a

0:02:58.960 --> 0:03:03.480
<v Speaker 1>machine that generated short stories for him. So it's become

0:03:03.560 --> 0:03:07.600
<v Speaker 1>something that particularly professional authors have been fascinated by this

0:03:07.720 --> 0:03:10.760
<v Speaker 1>idea that a machine might be as creative as a

0:03:10.880 --> 0:03:15.240
<v Speaker 1>human and also giving us insight into human creativity. Now,

0:03:15.760 --> 0:03:18.480
<v Speaker 1>of course it's one thing to imagine these machines. You

0:03:18.480 --> 0:03:23.120
<v Speaker 1>point out that they're they're necessary precursors in linguistics, literary analysis,

0:03:23.120 --> 0:03:25.480
<v Speaker 1>and other fields to even get to the point of

0:03:25.520 --> 0:03:29.560
<v Speaker 1>considering asking a machine or or making machine they can

0:03:29.600 --> 0:03:32.360
<v Speaker 1>write a story or novel for you. Yeah, and I

0:03:32.400 --> 0:03:34.960
<v Speaker 1>think there are there are a number of ways we

0:03:35.000 --> 0:03:38.240
<v Speaker 1>can approach this. So one of them is to look

0:03:38.280 --> 0:03:42.160
<v Speaker 1>at language. So we are language machines. We we have

0:03:42.280 --> 0:03:46.320
<v Speaker 1>been trained in how to manipulate language as humans, and

0:03:47.000 --> 0:03:50.400
<v Speaker 1>we can now design machines that could copy that in

0:03:50.400 --> 0:03:53.040
<v Speaker 1>a very expressive way. So I'm sure we'll come onto that.

0:03:53.160 --> 0:03:57.960
<v Speaker 1>But AI systems such as GPT three that are expert wordsmiths.

0:03:58.000 --> 0:04:02.760
<v Speaker 1>So that's one route and other route is simulating characters.

0:04:03.120 --> 0:04:08.200
<v Speaker 1>So some of the newer computer games now have stories

0:04:08.280 --> 0:04:13.480
<v Speaker 1>nonplayer characters that can't tell stories to the human players.

0:04:14.120 --> 0:04:16.960
<v Speaker 1>And then the third way is to build models of

0:04:17.200 --> 0:04:21.000
<v Speaker 1>the creative mind. And that's what my colleague Raphael pre

0:04:21.279 --> 0:04:24.960
<v Speaker 1>Perez has been doing for many years in his Mexican program,

0:04:25.080 --> 0:04:28.039
<v Speaker 1>is trying to build a model of the human creative mind.

0:04:28.240 --> 0:04:32.080
<v Speaker 1>So the different routes for coming at modeling storytelling and

0:04:32.160 --> 0:04:35.680
<v Speaker 1>understanding storytelling with machines. And the one that you mentioned

0:04:35.720 --> 0:04:40.000
<v Speaker 1>that I think is very fascinating is Vladimir props morphology

0:04:40.040 --> 0:04:44.080
<v Speaker 1>of the folk tale. This idea of taking apart of

0:04:44.080 --> 0:04:46.840
<v Speaker 1>folk tale tradition and in figuring out like what are

0:04:46.839 --> 0:04:50.120
<v Speaker 1>the basic strokes, what are the basic elements, and thus

0:04:50.200 --> 0:04:54.080
<v Speaker 1>creating I guess, sort of the palette for recreating stories. Yeah,

0:04:54.120 --> 0:04:59.160
<v Speaker 1>this was back in the early twentieth century that Vladimir

0:04:59.200 --> 0:05:04.200
<v Speaker 1>prop was part of a group of Russian linguists who

0:05:04.279 --> 0:05:10.760
<v Speaker 1>and folk tale academics who became interested in the morphology

0:05:10.920 --> 0:05:15.359
<v Speaker 1>the structure of folk tales, and he realized that Russian

0:05:15.360 --> 0:05:19.440
<v Speaker 1>folk tales had this very similar structure, just as fairy

0:05:19.480 --> 0:05:23.719
<v Speaker 1>tales in other Western traditions have, and so he set

0:05:23.720 --> 0:05:26.960
<v Speaker 1>about trying to ride a set of rules that would

0:05:27.000 --> 0:05:31.360
<v Speaker 1>both analyze these folk tales and show their underlying structure,

0:05:31.440 --> 0:05:34.000
<v Speaker 1>but also could be turned around to generate them. That

0:05:34.160 --> 0:05:37.839
<v Speaker 1>if you use these rules essentially as what we now

0:05:37.880 --> 0:05:42.760
<v Speaker 1>call programs or algorithms, they could be employed to create

0:05:42.839 --> 0:05:47.039
<v Speaker 1>new folk tales. And so he set the foundations for

0:05:47.120 --> 0:05:52.520
<v Speaker 1>the structural analysis of stories way back what a hundred

0:05:52.560 --> 0:05:56.560
<v Speaker 1>and twenty years ago now, And interestingly, some of the

0:05:56.680 --> 0:06:02.640
<v Speaker 1>earliest computer programs to generate story were based on props formalism.

0:06:02.680 --> 0:06:04.559
<v Speaker 1>It was really if you look at it, it's really

0:06:04.600 --> 0:06:08.880
<v Speaker 1>like a very pair down computer algorithm. Now answer some

0:06:08.960 --> 0:06:13.840
<v Speaker 1>of the first actual text generating machines. You mentioned a

0:06:13.880 --> 0:06:17.920
<v Speaker 1>few different examples of this. Which are the most important

0:06:17.960 --> 0:06:20.160
<v Speaker 1>to mention or which one is the most important to mention?

0:06:20.520 --> 0:06:23.320
<v Speaker 1>So one of the earliest ones was by Christopher Stracci,

0:06:23.480 --> 0:06:27.760
<v Speaker 1>who was a colleague of volenteering working in Manchester, and

0:06:28.000 --> 0:06:31.920
<v Speaker 1>he developed a very very early computer program on one

0:06:31.960 --> 0:06:38.640
<v Speaker 1>of the first prototype computers that generated love letters Victorian

0:06:38.760 --> 0:06:42.000
<v Speaker 1>love letters. Um, and we can speculate on why he

0:06:42.080 --> 0:06:45.400
<v Speaker 1>would want to write a program to generate love letters,

0:06:45.400 --> 0:06:47.400
<v Speaker 1>but he did and pin them up on the wall

0:06:47.440 --> 0:06:51.120
<v Speaker 1>of his lab. And then since then there's been a

0:06:51.279 --> 0:06:55.000
<v Speaker 1>number of people who have particularly taken a linguistic approach,

0:06:55.600 --> 0:06:59.160
<v Speaker 1>and so one of the pioneers was a linguist called

0:06:59.160 --> 0:07:05.560
<v Speaker 1>Sheldon Clade, and Sheldon Klein had this big, grandiose project

0:07:05.920 --> 0:07:13.160
<v Speaker 1>to try and model human language production as a as

0:07:13.160 --> 0:07:16.200
<v Speaker 1>a as an algorithm, as a computer program, and then

0:07:16.560 --> 0:07:21.000
<v Speaker 1>through this trying to understand the origins of language. And so,

0:07:21.400 --> 0:07:24.920
<v Speaker 1>along with a group of colleagues, many PhD students, he

0:07:25.080 --> 0:07:28.800
<v Speaker 1>produced one of the earliest programs that generated stories. It

0:07:28.960 --> 0:07:32.960
<v Speaker 1>generated murder mysteries, a sort of country house Agatha Christie

0:07:32.960 --> 0:07:37.040
<v Speaker 1>type mysteries, and it was part of this big project

0:07:37.240 --> 0:07:41.120
<v Speaker 1>to formalize language in a generative way. The problem was

0:07:41.160 --> 0:07:47.640
<v Speaker 1>the program the stories that generated were pretty trivial. There

0:07:47.800 --> 0:07:50.640
<v Speaker 1>was people gathered at the country house, there was a murder,

0:07:51.480 --> 0:07:56.120
<v Speaker 1>somebody tried to investigate it the end, and so although

0:07:56.160 --> 0:07:58.640
<v Speaker 1>it was seen as a novelty at the time, it

0:07:58.760 --> 0:08:02.560
<v Speaker 1>wasn't really risk acted for the Great linguistic project that

0:08:02.680 --> 0:08:06.800
<v Speaker 1>lasted for twenty or thirty years in the nine nineties,

0:08:07.600 --> 0:08:10.160
<v Speaker 1>and there were a number of attempts to write entire

0:08:10.280 --> 0:08:14.600
<v Speaker 1>novels by computer and probably The most interesting one was

0:08:14.640 --> 0:08:20.600
<v Speaker 1>by Geckll Scott French, who programmed a Mac computer to

0:08:21.280 --> 0:08:25.360
<v Speaker 1>generate a novel in the style of Jacqueline Susan. I've

0:08:25.400 --> 0:08:28.760
<v Speaker 1>got it here. It's called Justice Once, and it is

0:08:28.960 --> 0:08:33.200
<v Speaker 1>a complete published novel in the style of the pot

0:08:33.240 --> 0:08:39.160
<v Speaker 1>boilant author called Jacqueline Susan, and it seems to be genuine.

0:08:39.200 --> 0:08:41.560
<v Speaker 1>There's a picture of him with his Mac computer on

0:08:41.600 --> 0:08:45.160
<v Speaker 1>the cover of the book. And it took him eight

0:08:45.280 --> 0:08:50.000
<v Speaker 1>years to design this very early AI system that would

0:08:50.040 --> 0:08:54.640
<v Speaker 1>mimic the style of this author, and he engaged that

0:08:54.760 --> 0:09:00.160
<v Speaker 1>eyed dialogue with the program to generate an entire three

0:09:00.200 --> 0:09:04.640
<v Speaker 1>hundred page novel. So that was probably the first, I'm

0:09:05.040 --> 0:09:08.439
<v Speaker 1>right up to this day, greatest example of story writing

0:09:08.440 --> 0:09:12.600
<v Speaker 1>with machine. What's fascinating now is that what took him

0:09:12.720 --> 0:09:15.720
<v Speaker 1>eight years to do could now be done in a

0:09:15.720 --> 0:09:22.960
<v Speaker 1>few seconds with the most recent AI UM generator programs. Yeah,

0:09:23.080 --> 0:09:25.080
<v Speaker 1>in the book you you write them just to read

0:09:25.120 --> 0:09:27.400
<v Speaker 1>a quick quote quote. It took just twenty years ago

0:09:27.440 --> 0:09:29.520
<v Speaker 1>from a program that wrote love letters to one that

0:09:29.559 --> 0:09:33.319
<v Speaker 1>created complete short stories. Uh, and a further twenty years

0:09:33.480 --> 0:09:36.120
<v Speaker 1>to a published three page novel in partnership with the

0:09:36.120 --> 0:09:39.959
<v Speaker 1>computer Uh, it's fascinating to think about. You know this,

0:09:40.200 --> 0:09:45.080
<v Speaker 1>this the technological advancement during that time in broad strokes,

0:09:45.120 --> 0:09:47.920
<v Speaker 1>like what I mean, what were the key advancements going

0:09:47.960 --> 0:09:50.760
<v Speaker 1>on here that made this possible. Well, the first advancement

0:09:51.000 --> 0:09:55.600
<v Speaker 1>was to be able to have interactive computer systems that

0:09:55.760 --> 0:10:00.440
<v Speaker 1>you could program in a high level programming which and

0:10:00.480 --> 0:10:04.120
<v Speaker 1>that's what Christopher Straitchy and Alan Turing were working on,

0:10:05.640 --> 0:10:10.120
<v Speaker 1>and that they could then demonstrate this in generating very

0:10:10.200 --> 0:10:16.080
<v Speaker 1>simple love letters. The next stage was to be able

0:10:16.160 --> 0:10:21.079
<v Speaker 1>to produce grammars, generative grammars, and this goes back to

0:10:21.120 --> 0:10:25.000
<v Speaker 1>the work of prop who realized that you could have

0:10:25.040 --> 0:10:30.360
<v Speaker 1>grammars that didn't just generate individual sentences, but could generate

0:10:30.520 --> 0:10:33.320
<v Speaker 1>entire stories. That you could describe the structure of a

0:10:33.360 --> 0:10:37.280
<v Speaker 1>story in terms of something like a formal grammar, and

0:10:37.640 --> 0:10:41.439
<v Speaker 1>that in THEES there were a number of projects to

0:10:42.160 --> 0:10:46.200
<v Speaker 1>put that grammar into a program that would generate a

0:10:46.200 --> 0:10:48.880
<v Speaker 1>short story. That's what Sheldon Klein and his team did.

0:10:49.840 --> 0:10:53.720
<v Speaker 1>And then the next step beyond that was to write

0:10:53.920 --> 0:10:59.000
<v Speaker 1>symbolic AI programs that modeled the style of a particular writer,

0:10:59.480 --> 0:11:03.040
<v Speaker 1>and that was the great achievement of um Scott French.

0:11:03.320 --> 0:11:08.800
<v Speaker 1>It was a symbolic AI expert system of style. And

0:11:08.840 --> 0:11:12.040
<v Speaker 1>then we come right up to date. And there are

0:11:12.080 --> 0:11:19.200
<v Speaker 1>programs like GPT three, which are hugely competent and well

0:11:19.280 --> 0:11:22.760
<v Speaker 1>trained language systems, So they aren't rule based systems. They

0:11:22.800 --> 0:11:29.480
<v Speaker 1>don't have something like the Sheldon client grammar inside them.

0:11:29.600 --> 0:11:34.480
<v Speaker 1>They have been trained on billions of pieces of text

0:11:34.960 --> 0:11:40.280
<v Speaker 1>and they have many millions of interconnections that create an

0:11:40.320 --> 0:11:45.000
<v Speaker 1>internal language model which they then use to generate texts

0:11:45.040 --> 0:11:48.240
<v Speaker 1>in particular styles. You can start it in a style

0:11:48.440 --> 0:11:50.559
<v Speaker 1>and it will continue in that style. You can give

0:11:50.600 --> 0:11:52.840
<v Speaker 1>it an instruction of what sort of story to write

0:11:53.000 --> 0:11:57.240
<v Speaker 1>and it will continue in that story. But the really

0:11:57.280 --> 0:11:59.600
<v Speaker 1>important thing to say is that there are two different

0:11:59.640 --> 0:12:02.560
<v Speaker 1>sorts of AI. So the sort of AI that's got

0:12:02.559 --> 0:12:06.480
<v Speaker 1>French used was writing explicit rules. That's why it took

0:12:06.520 --> 0:12:10.760
<v Speaker 1>him eight years to code these rules to imitate one

0:12:10.840 --> 0:12:17.880
<v Speaker 1>person style. The GPT three type AI transformer programs induce

0:12:18.480 --> 0:12:22.440
<v Speaker 1>infer those rules from being trained on billions and billions

0:12:22.440 --> 0:12:24.800
<v Speaker 1>of pieces of texts. So there are two different sorts

0:12:24.800 --> 0:12:29.240
<v Speaker 1>of AI. Both had their strengths and weaknesses. UH And

0:12:29.640 --> 0:12:31.800
<v Speaker 1>one of the fascinating things of the future is whether

0:12:31.840 --> 0:12:34.200
<v Speaker 1>we can put them together, whether we can merge those

0:12:34.240 --> 0:12:37.400
<v Speaker 1>two different sorts of AI into the sort of universal

0:12:37.520 --> 0:12:43.080
<v Speaker 1>story machine. Wow. And so GPT three is the current model,

0:12:43.200 --> 0:12:46.720
<v Speaker 1>Is that correct? Or are we had four yet? So

0:12:46.800 --> 0:12:49.800
<v Speaker 1>there isn't a GPT four yet, although I'm sure there's

0:12:49.840 --> 0:12:54.080
<v Speaker 1>one in the pipeline that continually revising GPT three. And

0:12:54.200 --> 0:12:57.480
<v Speaker 1>just for those who don't know GPT, the GPT models

0:12:57.640 --> 0:13:01.720
<v Speaker 1>were developed by and still being developed by company called

0:13:01.760 --> 0:13:05.959
<v Speaker 1>open Ai that was founded by a group of entrepreneurs

0:13:06.000 --> 0:13:10.160
<v Speaker 1>including Elon Musk and others, and that company was set

0:13:10.280 --> 0:13:16.280
<v Speaker 1>up to explore the opportunities of AI for social good.

0:13:16.800 --> 0:13:20.520
<v Speaker 1>It has developed a number of different programs. There's one

0:13:20.559 --> 0:13:24.240
<v Speaker 1>that it's developed for art called Dally, which can do

0:13:24.320 --> 0:13:28.520
<v Speaker 1>the same for art and images as GPT three can

0:13:28.640 --> 0:13:34.040
<v Speaker 1>for for words and stories. But the GPT three now

0:13:34.080 --> 0:13:40.760
<v Speaker 1>it's its third generation. In essence, what it's been, what's

0:13:40.760 --> 0:13:43.560
<v Speaker 1>how it works is it's been trained on billions of

0:13:43.559 --> 0:13:48.200
<v Speaker 1>pieces of text. It uses those texts to form an

0:13:48.240 --> 0:13:54.000
<v Speaker 1>internal model of both the surface structure of language but

0:13:54.080 --> 0:13:58.319
<v Speaker 1>also the internal structure language essentially how the world works.

0:13:58.840 --> 0:14:04.760
<v Speaker 1>And then initially the early GPT models were sentenced computers

0:14:04.880 --> 0:14:08.520
<v Speaker 1>like very highly trained, suped up sentenced computers of the

0:14:08.559 --> 0:14:11.240
<v Speaker 1>sort that you've got on your mobile phone. But they

0:14:11.240 --> 0:14:14.160
<v Speaker 1>can look back at the last five words or so.

0:14:14.640 --> 0:14:19.280
<v Speaker 1>They haven't a big attention window, so they know what

0:14:19.440 --> 0:14:22.320
<v Speaker 1>they've written before, and they use this to continue in

0:14:22.360 --> 0:14:25.040
<v Speaker 1>the same style, the same structure. So they give a

0:14:25.280 --> 0:14:35.600
<v Speaker 1>very plausible simulation, a very plausible indication of coherent language,

0:14:35.960 --> 0:14:39.680
<v Speaker 1>as if it were being written by a human. Now,

0:14:40.160 --> 0:14:44.440
<v Speaker 1>the problem is it's highly believable, but it doesn't have

0:14:44.600 --> 0:14:49.160
<v Speaker 1>any sense of itself. So these systems can't reflect on

0:14:49.240 --> 0:14:53.320
<v Speaker 1>what they've written. They can't look back and say, does

0:14:53.360 --> 0:14:57.520
<v Speaker 1>this makes sense? Does this fit with a good model

0:14:57.520 --> 0:15:01.840
<v Speaker 1>of the world? Is it legal on his decent? It doesn't.

0:15:02.440 --> 0:15:05.680
<v Speaker 1>In essence, it's they're a moral They don't have any

0:15:05.760 --> 0:15:10.560
<v Speaker 1>internal sense of what's right and morality. And so as

0:15:10.600 --> 0:15:15.880
<v Speaker 1>story machines, they're great because they can tell fascinating, plausible, engaging,

0:15:16.280 --> 0:15:21.320
<v Speaker 1>sometimes even poignant stories. But for other purposes like writing

0:15:21.320 --> 0:15:25.520
<v Speaker 1>newspaper articles or writing student essays, then they can be

0:15:25.600 --> 0:15:28.400
<v Speaker 1>dangerous because they don't know what they've written, and they

0:15:28.440 --> 0:15:32.280
<v Speaker 1>don't know whether what they've written is decent, honest, and

0:15:32.360 --> 0:15:41.560
<v Speaker 1>truthful than now. Our listeners can can actually get a

0:15:41.560 --> 0:15:45.720
<v Speaker 1>taste of this by going to story slash machines dot net.

0:15:46.800 --> 0:15:49.640
<v Speaker 1>You have an interface here where you can put in

0:15:49.800 --> 0:15:53.040
<v Speaker 1>just a title or also a title and in some

0:15:53.120 --> 0:15:55.440
<v Speaker 1>text like an opening line of a of a story.

0:15:56.040 --> 0:15:59.280
<v Speaker 1>And this is powered by GPT three. Correct. Yeah, that's right.

0:15:59.440 --> 0:16:04.080
<v Speaker 1>So what I did was wrote a website with an

0:16:04.120 --> 0:16:09.360
<v Speaker 1>interface to the most recent GPT three language model. So

0:16:09.400 --> 0:16:12.920
<v Speaker 1>it basically provides a very simple way for you to

0:16:13.160 --> 0:16:16.120
<v Speaker 1>write the title of a story, a few opening words,

0:16:17.000 --> 0:16:21.480
<v Speaker 1>press the create button, and it comes back with about

0:16:21.480 --> 0:16:25.760
<v Speaker 1>a two d and fifty word introduction to a short story.

0:16:25.920 --> 0:16:28.640
<v Speaker 1>So it gives you a taste of what GPT three

0:16:28.680 --> 0:16:31.120
<v Speaker 1>can do. And yeah, I really recommend that you go

0:16:31.160 --> 0:16:34.880
<v Speaker 1>and try it. Um. I launched it just a couple

0:16:34.920 --> 0:16:38.720
<v Speaker 1>of days ago, so people have already been generating fascinating

0:16:38.760 --> 0:16:42.400
<v Speaker 1>short stories using that, So so go and give it

0:16:42.440 --> 0:16:47.600
<v Speaker 1>a try. It's story hyphen machines dot net. Yeah I was.

0:16:47.680 --> 0:16:50.040
<v Speaker 1>I was firing this up when I was reading the

0:16:50.080 --> 0:16:53.720
<v Speaker 1>book yesterday, and yeah, eventually I was having to bring

0:16:53.760 --> 0:16:55.800
<v Speaker 1>my son over and showed to him. I was really

0:16:56.160 --> 0:17:00.680
<v Speaker 1>impressed toying around with it like trying having a title

0:17:00.800 --> 0:17:04.119
<v Speaker 1>that sort of suggested a certain genre, maybe throwing in

0:17:04.200 --> 0:17:08.720
<v Speaker 1>a first sentence that that was the record that the

0:17:08.760 --> 0:17:11.880
<v Speaker 1>reference to certain genre. And then also I was really

0:17:11.920 --> 0:17:14.560
<v Speaker 1>impressed when I would say, put in a title and

0:17:14.600 --> 0:17:17.200
<v Speaker 1>then put in the first line of an existing short

0:17:17.240 --> 0:17:19.960
<v Speaker 1>story by an author. I think I used Clark Ashton

0:17:20.000 --> 0:17:24.400
<v Speaker 1>Smith as an example, and I used um Corny McCarthy.

0:17:24.400 --> 0:17:26.240
<v Speaker 1>I put in a line of Corny McCarthy's work, and

0:17:26.240 --> 0:17:28.800
<v Speaker 1>I was impressed at how well it seemed to model

0:17:29.400 --> 0:17:34.480
<v Speaker 1>a portion of writing as if it were by these authors. Yeah, exactly. Um,

0:17:34.520 --> 0:17:36.800
<v Speaker 1>I'll give you an example, because it's quite quite good

0:17:36.840 --> 0:17:38.760
<v Speaker 1>to show an example. You can cut it out if

0:17:38.760 --> 0:17:41.560
<v Speaker 1>you want to. So so here's an example one that

0:17:41.640 --> 0:17:45.440
<v Speaker 1>I did on Ernest Hemingway in London. The fog came

0:17:45.480 --> 0:17:47.920
<v Speaker 1>in from the river and it was cold for April.

0:17:48.480 --> 0:17:51.320
<v Speaker 1>I walked up past the British Museum and turned down

0:17:51.359 --> 0:17:55.000
<v Speaker 1>towards Holburn. I had a lunch date with Harriet the Criterion,

0:17:55.200 --> 0:17:57.919
<v Speaker 1>and I was early. It was one of those heavy

0:17:58.080 --> 0:18:00.760
<v Speaker 1>grade days that seemed to press down on the city,

0:18:01.160 --> 0:18:04.919
<v Speaker 1>weighing it down. The traffic moved slowly through the streets,

0:18:04.960 --> 0:18:08.119
<v Speaker 1>and the people hurried along with their heads down trying

0:18:08.160 --> 0:18:10.720
<v Speaker 1>to get out of the cold. I glanced in the

0:18:10.720 --> 0:18:14.040
<v Speaker 1>windows of the shops as I walked, looking for something

0:18:14.080 --> 0:18:17.560
<v Speaker 1>to buy Harry for his birthday. I knew he'd appreciate

0:18:17.640 --> 0:18:20.239
<v Speaker 1>a good bottle of scotch, but I wanted to get

0:18:20.320 --> 0:18:23.240
<v Speaker 1>him something else as well. That's just the start of

0:18:23.320 --> 0:18:26.880
<v Speaker 1>as a story in Ernest Hemingway style, and you can

0:18:26.880 --> 0:18:29.960
<v Speaker 1>give it whatever style you want. It's I found it

0:18:30.000 --> 0:18:35.120
<v Speaker 1>works best when you do a weird juxtaposition. So there

0:18:35.240 --> 0:18:38.399
<v Speaker 1>was someone just a few hours ago did one on

0:18:38.960 --> 0:18:43.520
<v Speaker 1>the Sad Sandwich, and it was a really poignant, sad

0:18:43.560 --> 0:18:48.199
<v Speaker 1>story about a poor, neglected sandwich. So try doing some

0:18:48.200 --> 0:18:51.440
<v Speaker 1>some juxtapositions at words like that. Yes, absolutely, and I'd

0:18:51.480 --> 0:18:54.840
<v Speaker 1>love to hear back from from listeners after they've toyed

0:18:54.880 --> 0:18:58.960
<v Speaker 1>around with us and explored it. But of course, recognizing

0:18:59.000 --> 0:19:03.000
<v Speaker 1>the power of of this technology, Uh yeah, we we

0:19:03.080 --> 0:19:07.000
<v Speaker 1>certainly get into this, this area of of anxiety perhaps,

0:19:07.000 --> 0:19:12.040
<v Speaker 1>but also hope and opportunity. Um I guess on the

0:19:12.080 --> 0:19:14.960
<v Speaker 1>anxiety side of things. The first place my mind went

0:19:15.000 --> 0:19:18.760
<v Speaker 1>as I remember seeing Max Tegmark reference a kind of

0:19:19.600 --> 0:19:22.400
<v Speaker 1>an illustration that was like a topography of human abilities

0:19:22.440 --> 0:19:25.960
<v Speaker 1>and jobs with the idea that at the higher elevations

0:19:25.960 --> 0:19:28.760
<v Speaker 1>were going to be more protected from the rising sea

0:19:28.880 --> 0:19:32.600
<v Speaker 1>levels of AI. So chess and jeopardy were already in

0:19:32.640 --> 0:19:36.680
<v Speaker 1>the water, uh, speech recognition, investment, and social interaction there

0:19:36.800 --> 0:19:40.320
<v Speaker 1>in the lowlands they're going next. And then in this

0:19:40.400 --> 0:19:43.919
<v Speaker 1>particular image, we had science as the highest peak, just

0:19:44.080 --> 0:19:47.439
<v Speaker 1>above the peaks of book writing and AI design. So

0:19:47.520 --> 0:19:50.680
<v Speaker 1>I was just wondering, do you feel like this representation

0:19:51.720 --> 0:19:54.679
<v Speaker 1>was accurate or and or have the water has just

0:19:54.840 --> 0:19:58.600
<v Speaker 1>risen so high already? I think every risen high already.

0:19:58.640 --> 0:20:01.240
<v Speaker 1>I think the ones that are going to be protected

0:20:01.359 --> 0:20:05.120
<v Speaker 1>from the rising tide of AI are probably the caring professions, um,

0:20:05.480 --> 0:20:13.120
<v Speaker 1>the nurses, the child careers. But any profession that works

0:20:13.200 --> 0:20:16.760
<v Speaker 1>with words, he's going to find both an opportunity and

0:20:16.760 --> 0:20:20.440
<v Speaker 1>a threat, I think in generative AI. So the opportunity

0:20:21.000 --> 0:20:24.800
<v Speaker 1>is that it's a new kind of tool. The tools

0:20:24.800 --> 0:20:27.359
<v Speaker 1>that we've had up to now have tended to be

0:20:27.560 --> 0:20:34.000
<v Speaker 1>ones that slow down writing, um, like the saurus or

0:20:35.080 --> 0:20:40.679
<v Speaker 1>spell corrector that they make you pause and check. You

0:20:40.760 --> 0:20:42.760
<v Speaker 1>can do them at the end, but there's always a

0:20:42.800 --> 0:20:46.560
<v Speaker 1>temptation to look up a word to slow down. What's

0:20:46.600 --> 0:20:49.320
<v Speaker 1>different about these is that they can be used to

0:20:49.359 --> 0:20:52.160
<v Speaker 1>speed up your writing. You can, and I've tried doing this,

0:20:52.359 --> 0:20:57.080
<v Speaker 1>writing a paragraph when it starts to dry up, them

0:20:57.080 --> 0:20:59.680
<v Speaker 1>handing it over to the machine to write the next paragraph.

0:21:00.119 --> 0:21:04.800
<v Speaker 1>I'm not a fiction writer. I found it quite empowering

0:21:05.359 --> 0:21:11.080
<v Speaker 1>to use GPT three as a writing buddy. So I

0:21:11.080 --> 0:21:13.680
<v Speaker 1>would write a paragraph, it would write the next paragraph

0:21:13.760 --> 0:21:16.639
<v Speaker 1>and probably take it off in some unexpected direction that

0:21:16.680 --> 0:21:19.520
<v Speaker 1>I would then have to follow, and it might introduce

0:21:19.560 --> 0:21:24.120
<v Speaker 1>a new character, a plot twist, and so both as

0:21:24.160 --> 0:21:27.760
<v Speaker 1>a tool for budding writers and also perhaps a prop

0:21:28.080 --> 0:21:33.440
<v Speaker 1>for professional writers, particularly ones with deadlines to meet, then

0:21:33.480 --> 0:21:38.120
<v Speaker 1>that's the opportunity. I think the threat is the inverse

0:21:38.160 --> 0:21:40.479
<v Speaker 1>side of that, that if you either see it as

0:21:40.480 --> 0:21:45.320
<v Speaker 1>a crutch that rather than trying to do your own writing,

0:21:45.359 --> 0:21:47.840
<v Speaker 1>you just hand it over to the machine, then it's

0:21:47.920 --> 0:21:51.560
<v Speaker 1>very easy to become lazy. And also, as I've said,

0:21:51.720 --> 0:21:54.439
<v Speaker 1>they are a moral machine. So if you're trying to

0:21:54.480 --> 0:22:00.200
<v Speaker 1>do scientific writing, or you're trying to do accurate journalists them,

0:22:00.920 --> 0:22:05.879
<v Speaker 1>then beware because they may well throw in some entirely

0:22:06.000 --> 0:22:11.520
<v Speaker 1>fake research study, some entirely inaccurate, fake reference, um, perhaps

0:22:11.640 --> 0:22:19.960
<v Speaker 1>reference to some completely non nonsensical or inaccurate event that's

0:22:19.960 --> 0:22:22.679
<v Speaker 1>happened in the world. So you've always got to be

0:22:22.720 --> 0:22:25.560
<v Speaker 1>aware of the facts that it throws up. And there's

0:22:25.560 --> 0:22:28.560
<v Speaker 1>a good reason for that that it isn't a fact checker.

0:22:29.320 --> 0:22:32.840
<v Speaker 1>It isn't a Wikipedia or even a Google search. It

0:22:33.000 --> 0:22:38.359
<v Speaker 1>is a language machine. It's loves, you know, anthropomorphic sense.

0:22:38.359 --> 0:22:42.720
<v Speaker 1>It loves playing with words, but those words don't necessarily

0:22:42.760 --> 0:22:47.120
<v Speaker 1>make sense. So if you're going to use tools like

0:22:47.280 --> 0:22:50.600
<v Speaker 1>GPT three as AIGs for writing, then you have to

0:22:50.640 --> 0:22:56.119
<v Speaker 1>be very careful that you cross check the facts and

0:22:56.600 --> 0:23:01.560
<v Speaker 1>the output of that machine to make sure that it

0:23:01.720 --> 0:23:06.800
<v Speaker 1>is accurate and honest and truthful. My mind also went

0:23:06.880 --> 0:23:11.920
<v Speaker 1>to um the possibility of of of almost accidental plagiarism

0:23:12.720 --> 0:23:14.960
<v Speaker 1>because I put in the first line of a Clark

0:23:15.000 --> 0:23:18.520
<v Speaker 1>Ashton Smith story and it it threw in a fascinating

0:23:18.560 --> 0:23:21.960
<v Speaker 1>plot twist that was not in the original story, and

0:23:22.240 --> 0:23:24.600
<v Speaker 1>I don't think I've ever seen in a story, and

0:23:24.600 --> 0:23:26.679
<v Speaker 1>so part of me was wondering like, well, well, you know,

0:23:26.760 --> 0:23:28.640
<v Speaker 1>I should I should latch onto this, maybe I could

0:23:28.760 --> 0:23:31.040
<v Speaker 1>use this. But then the other part, you know, the

0:23:31.119 --> 0:23:33.320
<v Speaker 1>lights coming on in my mind were saying, but hold

0:23:33.359 --> 0:23:36.439
<v Speaker 1>on this. Just because I haven't read it doesn't mean

0:23:36.480 --> 0:23:39.600
<v Speaker 1>it doesn't already exist out there. Might Uh we run

0:23:39.640 --> 0:23:44.720
<v Speaker 1>into situations where, uh, the AI is is reproducing something

0:23:45.200 --> 0:23:47.600
<v Speaker 1>you know, perhaps you honestly, if we want to use

0:23:47.680 --> 0:23:51.399
<v Speaker 1>that term when in in in actuality, it may exist

0:23:51.400 --> 0:23:53.600
<v Speaker 1>out there in some story or another. Yeah, I mean,

0:23:53.640 --> 0:23:55.960
<v Speaker 1>I think the first thing to note is that it's

0:23:56.119 --> 0:24:00.600
<v Speaker 1>not working at this sort of word and sad tense level.

0:24:00.800 --> 0:24:03.520
<v Speaker 1>So it's not copying bits of text from the web

0:24:03.680 --> 0:24:08.359
<v Speaker 1>or from published books. It's working below that, basically at

0:24:08.400 --> 0:24:12.000
<v Speaker 1>the phone name level. It's putting together pieces of words,

0:24:12.880 --> 0:24:16.920
<v Speaker 1>but it's putting together these pieces of words in a

0:24:17.000 --> 0:24:22.000
<v Speaker 1>hugely proficient way. So I've tried taking the output of

0:24:22.040 --> 0:24:26.920
<v Speaker 1>GPT three and doing Google search on phrases and sentences,

0:24:27.600 --> 0:24:32.640
<v Speaker 1>and you don't find them. So it seems like they

0:24:32.760 --> 0:24:41.680
<v Speaker 1>are genuinely producing novel pieces of text. So, for example,

0:24:41.920 --> 0:24:44.400
<v Speaker 1>if students are going to use these for writing essays,

0:24:44.840 --> 0:24:48.960
<v Speaker 1>which is already happening, they're already companies that are advertising

0:24:49.000 --> 0:24:52.159
<v Speaker 1>the services of AI generators for students to write their essays.

0:24:52.960 --> 0:24:57.520
<v Speaker 1>Plagiarism checkers won't detect them. I've tried putting them through

0:24:57.560 --> 0:25:04.920
<v Speaker 1>plagiarism checkers and they of sort of nineties five originality.

0:25:05.280 --> 0:25:08.520
<v Speaker 1>So they're not copying bits of text from the web.

0:25:08.960 --> 0:25:13.679
<v Speaker 1>They are genuinely generating new language. Now, of course, there

0:25:13.720 --> 0:25:18.000
<v Speaker 1>are phrases that may pre exist, and if you give

0:25:18.440 --> 0:25:25.080
<v Speaker 1>highly constrained styles like Shakespeare's sonnets, then it may come

0:25:25.200 --> 0:25:29.800
<v Speaker 1>up with previous lines from other Shakespeare sonnets. But providing

0:25:29.840 --> 0:25:33.040
<v Speaker 1>you give it a broad enough brief, providing you give

0:25:33.080 --> 0:25:35.239
<v Speaker 1>it a general enough style, or even if it's from

0:25:35.240 --> 0:25:41.199
<v Speaker 1>a particular author, then it will generate original text. And

0:25:41.240 --> 0:25:44.720
<v Speaker 1>it's still a bit scariest too. You know, I've talked

0:25:44.720 --> 0:25:46.720
<v Speaker 1>about this with other people and said, but surely it's

0:25:46.760 --> 0:25:50.000
<v Speaker 1>copying from the web. No, it isn't. It's generating. It's

0:25:50.080 --> 0:25:53.720
<v Speaker 1>generating new text in the style of that author, or

0:25:53.760 --> 0:25:58.399
<v Speaker 1>in the style of that piece of fiction or piece

0:25:58.400 --> 0:26:02.280
<v Speaker 1>of journalism. So you already touched on like this the

0:26:02.840 --> 0:26:08.400
<v Speaker 1>collaborative possibilities here. But having touched on in school papers

0:26:08.480 --> 0:26:11.320
<v Speaker 1>and such, what do you think are the the educational

0:26:11.440 --> 0:26:16.600
<v Speaker 1>opportunities with this technology. I think the main educational opportunities

0:26:16.640 --> 0:26:23.600
<v Speaker 1>are for beginning writers. It's a way to explore expressivity

0:26:23.640 --> 0:26:27.199
<v Speaker 1>and creativity. One of the problems when you're beginning writing

0:26:27.280 --> 0:26:32.240
<v Speaker 1>is you tend to see everything as being a linear process.

0:26:32.640 --> 0:26:35.720
<v Speaker 1>You write some words, you write some more words. There's

0:26:35.720 --> 0:26:38.520
<v Speaker 1>a flow of writing. It's very difficult to get out

0:26:38.520 --> 0:26:40.919
<v Speaker 1>of that flow and to think about alternative ways of

0:26:40.960 --> 0:26:44.800
<v Speaker 1>expressing something, how it might be different. And what machines

0:26:45.760 --> 0:26:48.560
<v Speaker 1>light GPT three can do is help you to see

0:26:48.600 --> 0:26:52.760
<v Speaker 1>another way of continuing, another way of expressing your ideas.

0:26:53.080 --> 0:26:55.600
<v Speaker 1>It will look back over the last five words or

0:26:55.640 --> 0:26:59.000
<v Speaker 1>so that you've written and perhaps take it in new directions.

0:26:59.600 --> 0:27:04.760
<v Speaker 1>So it's a way for buddying writers to explore possibilities.

0:27:05.119 --> 0:27:08.679
<v Speaker 1>And you can take what you've written so far and

0:27:08.800 --> 0:27:11.520
<v Speaker 1>press the create button a number of times, and each

0:27:11.560 --> 0:27:13.880
<v Speaker 1>time it will take your writing in a different direction.

0:27:14.200 --> 0:27:18.359
<v Speaker 1>So that's one way. Another way is for in a

0:27:18.440 --> 0:27:21.719
<v Speaker 1>class situation, for a teacher to generate a number of

0:27:21.760 --> 0:27:25.080
<v Speaker 1>different articles on a topic. So to give a topic

0:27:25.160 --> 0:27:29.960
<v Speaker 1>like what's the effect of climate change on rising sea

0:27:30.040 --> 0:27:32.919
<v Speaker 1>levels and get it to generate a number of different

0:27:33.000 --> 0:27:35.920
<v Speaker 1>articles and then to critique them because as I say,

0:27:35.960 --> 0:27:39.040
<v Speaker 1>it doesn't always get its facts right. And so to

0:27:39.160 --> 0:27:42.359
<v Speaker 1>look on these as pieces of journalism you might find

0:27:42.359 --> 0:27:45.720
<v Speaker 1>on the web and to take a critical stance. So

0:27:45.800 --> 0:27:50.120
<v Speaker 1>it's it's a good tool for a teacher to give

0:27:50.359 --> 0:27:54.679
<v Speaker 1>some generated articles to students and say, criticize these. We

0:27:54.760 --> 0:27:58.920
<v Speaker 1>know they're written by machine, so what's wrong with them?

0:27:59.080 --> 0:28:02.600
<v Speaker 1>Generally the ear face structure is pretty good, the spellings correct,

0:28:03.000 --> 0:28:06.359
<v Speaker 1>the style is good. But the deeper you go into

0:28:06.960 --> 0:28:11.200
<v Speaker 1>these machines generated texts, the more you find problems with them.

0:28:11.320 --> 0:28:15.240
<v Speaker 1>So it's it's a good class exercise. And then lastly,

0:28:15.280 --> 0:28:19.919
<v Speaker 1>I think is that it's going to be another tool

0:28:20.080 --> 0:28:24.440
<v Speaker 1>companion that writers use. Just as in the early days

0:28:24.440 --> 0:28:27.720
<v Speaker 1>of word processes, there was a lot of criticism that

0:28:27.960 --> 0:28:32.440
<v Speaker 1>it was slowing down writing, that you were reading from

0:28:32.440 --> 0:28:37.280
<v Speaker 1>the screen rather than from the page, that style checkers

0:28:37.440 --> 0:28:43.240
<v Speaker 1>were making writing more conformists. There will be quite rightly

0:28:43.360 --> 0:28:48.640
<v Speaker 1>people who say these new tools are forcing a machine

0:28:48.720 --> 0:28:54.440
<v Speaker 1>type creativity. But I think if we use them wisely

0:28:54.800 --> 0:28:59.560
<v Speaker 1>to extend and to critique our own creativity, then their

0:28:59.640 --> 0:29:03.360
<v Speaker 1>own resting her and exciting opportunities. Do you think we

0:29:03.480 --> 0:29:08.320
<v Speaker 1>better understand human creativity for having gone through this technological journey.

0:29:08.360 --> 0:29:11.480
<v Speaker 1>That's why I started on this h this journey with

0:29:11.600 --> 0:29:18.680
<v Speaker 1>my colligraph I am. It's because I started work as

0:29:18.680 --> 0:29:23.480
<v Speaker 1>a PhD student on trying to understand children's creative writing

0:29:24.080 --> 0:29:27.640
<v Speaker 1>and to develop tools for children to develop their creativity.

0:29:27.880 --> 0:29:33.400
<v Speaker 1>I became fascinated by machine creativity to try and explore

0:29:33.680 --> 0:29:36.760
<v Speaker 1>what is it that a machine can do in terms

0:29:36.800 --> 0:29:41.840
<v Speaker 1>of creativity and where does that stop? So what are

0:29:41.840 --> 0:29:47.240
<v Speaker 1>the limits of machine creativity? And beyond those limits, how

0:29:47.320 --> 0:29:49.680
<v Speaker 1>does that relate to human creativity? What is that the

0:29:49.720 --> 0:29:54.520
<v Speaker 1>week we can do that a machine can't. And now,

0:29:54.600 --> 0:29:58.360
<v Speaker 1>over the years, perhaps the gap between machine creativity and

0:29:58.440 --> 0:30:03.160
<v Speaker 1>human creativity is narrowing, but it's still there, and it

0:30:03.200 --> 0:30:07.000
<v Speaker 1>gives us insights into the way in which we write,

0:30:07.040 --> 0:30:10.120
<v Speaker 1>in the way in which we think. And because these

0:30:10.240 --> 0:30:16.000
<v Speaker 1>new generative AI programs don't work in a human light way,

0:30:16.680 --> 0:30:22.560
<v Speaker 1>then it becomes a really interesting challenge to say, what's

0:30:22.720 --> 0:30:28.240
<v Speaker 1>alien about them, what's different about the writing they produce

0:30:28.920 --> 0:30:32.480
<v Speaker 1>that shows they aren't human? And what does that say

0:30:32.520 --> 0:30:43.200
<v Speaker 1>about human experience and human creativity? Than now, looking into

0:30:43.200 --> 0:30:48.240
<v Speaker 1>the future and getting more speculative, UM say, I'm a

0:30:48.240 --> 0:30:51.120
<v Speaker 1>fan of Frank Herbert's done novels. Do you see for

0:30:51.600 --> 0:30:54.320
<v Speaker 1>see a future in which one would just be able

0:30:54.360 --> 0:30:58.520
<v Speaker 1>to ask an AI to generate the final books in

0:30:58.520 --> 0:31:01.880
<v Speaker 1>the series that Frank would have written, Uh, perhaps more

0:31:01.920 --> 0:31:05.320
<v Speaker 1>novels in this universe he created that sort of thing.

0:31:05.480 --> 0:31:08.000
<v Speaker 1>Or say you're a fan of a particular short story

0:31:08.040 --> 0:31:10.040
<v Speaker 1>author and you're you're like, why would I read anything

0:31:10.080 --> 0:31:13.440
<v Speaker 1>other than stories by this this particular author. I'm just

0:31:13.480 --> 0:31:16.000
<v Speaker 1>going to ask the AI to generate more of them

0:31:16.000 --> 0:31:20.680
<v Speaker 1>for me. Um, it might we easily arrived at such

0:31:20.680 --> 0:31:22.640
<v Speaker 1>a future. And if and if so, like, what does

0:31:22.680 --> 0:31:26.959
<v Speaker 1>that mean for us as both consumers and producers of

0:31:27.120 --> 0:31:31.200
<v Speaker 1>creative writing. I think we'll arrive at that at that

0:31:31.280 --> 0:31:36.640
<v Speaker 1>space pretty soon. I think they will be pastichious, and

0:31:37.520 --> 0:31:43.880
<v Speaker 1>but they may be pastigious that you can't tell from

0:31:44.160 --> 0:31:47.880
<v Speaker 1>the original and that there will be certainly fans of

0:31:48.680 --> 0:31:53.880
<v Speaker 1>authors like Frank Herbert who will be happy to accept

0:31:53.960 --> 0:31:58.920
<v Speaker 1>them as generated in Frank Herbert style, particularly if they

0:31:58.960 --> 0:32:03.320
<v Speaker 1>have interests the new characters, interesting new plots that will

0:32:03.360 --> 0:32:07.800
<v Speaker 1>happen with short stories and Neil Gaiman type short stories.

0:32:07.840 --> 0:32:11.840
<v Speaker 1>I'm sure that will happen quite soon. Um, if it

0:32:11.880 --> 0:32:16.480
<v Speaker 1>hasn't happened already, there may be plan fiction forums where

0:32:16.520 --> 0:32:21.080
<v Speaker 1>those sorts of AI generated pastiches are already circulating, but

0:32:21.160 --> 0:32:24.160
<v Speaker 1>I think there's the future is more likely to be

0:32:24.240 --> 0:32:31.680
<v Speaker 1>around interactive fiction. So um. At the moment, computer games

0:32:31.680 --> 0:32:35.000
<v Speaker 1>are kind of reaching a plateau that the graphics are

0:32:35.040 --> 0:32:40.440
<v Speaker 1>becoming more and more realistic, the interaction is becoming more

0:32:40.520 --> 0:32:44.800
<v Speaker 1>and more engaging, but the AI is lagging behind. Soon

0:32:44.920 --> 0:32:50.760
<v Speaker 1>you'll be able to have AI based characters in games

0:32:50.800 --> 0:32:56.280
<v Speaker 1>that can tell stories that you're not only asked to

0:32:56.320 --> 0:33:00.080
<v Speaker 1>solve a problem or guide you to the treasure, but

0:33:00.960 --> 0:33:04.080
<v Speaker 1>you can engage with them as conversational partners. They will

0:33:04.120 --> 0:33:07.880
<v Speaker 1>take the story forwards, and once you do that, then

0:33:07.920 --> 0:33:12.840
<v Speaker 1>you can get onto interactive soap operas, interactive worlds where

0:33:13.120 --> 0:33:17.400
<v Speaker 1>you've got both human and machine partners now that can

0:33:17.440 --> 0:33:20.720
<v Speaker 1>take you into all sorts of dark areas, but also

0:33:21.160 --> 0:33:29.040
<v Speaker 1>into all sorts of engaging aspects of new interaction, new

0:33:29.080 --> 0:33:34.280
<v Speaker 1>immersive fiction, new types of social interaction that involved both

0:33:34.320 --> 0:33:37.640
<v Speaker 1>machines and humans. So I think rather than trying to

0:33:37.640 --> 0:33:44.640
<v Speaker 1>emulate a particular writer, I think developing interactive fiction where

0:33:45.040 --> 0:33:49.560
<v Speaker 1>you have a continual um story that you can dip

0:33:49.600 --> 0:33:52.560
<v Speaker 1>in and out of with other characters human and machine

0:33:52.960 --> 0:33:57.800
<v Speaker 1>will likely to be the the most engaging, and probably

0:33:57.840 --> 0:34:01.360
<v Speaker 1>the most influential use is of story machines in the

0:34:01.400 --> 0:34:04.920
<v Speaker 1>near future. In the book, you go into several different

0:34:05.160 --> 0:34:09.640
<v Speaker 1>wonderful examples of of how we've reached this point, you know,

0:34:10.080 --> 0:34:13.759
<v Speaker 1>along the line along the road with video games examples

0:34:13.800 --> 0:34:20.480
<v Speaker 1>I wasn't familiar with, like Colossal Cave Adventure, Dwarf Fortress, UM. So,

0:34:20.520 --> 0:34:23.600
<v Speaker 1>I guess most of these examples of these been These

0:34:23.880 --> 0:34:26.680
<v Speaker 1>haven't necessarily been part of, like saying, the mainstream of

0:34:26.800 --> 0:34:30.400
<v Speaker 1>video game culture. No, they they haven't been part of

0:34:30.440 --> 0:34:33.080
<v Speaker 1>the mainstream. There's been a kind of tributary. So the

0:34:33.160 --> 0:34:37.840
<v Speaker 1>mainstream has been sort of from Pong and Space Invaders

0:34:37.880 --> 0:34:44.239
<v Speaker 1>onwards in terms of graphics and interactivity, and then we

0:34:44.320 --> 0:34:47.360
<v Speaker 1>get to Grand Theft or to where you have hugely

0:34:47.680 --> 0:34:52.200
<v Speaker 1>realistic simulated worlds m and it's the gameplay, the action,

0:34:52.560 --> 0:34:55.880
<v Speaker 1>the game mechanics that's really important. But there's been another

0:34:55.920 --> 0:35:02.960
<v Speaker 1>tributary that been mainly followed by people who are fascinated

0:35:03.000 --> 0:35:06.160
<v Speaker 1>by stories and words and storytelling. And it started with

0:35:06.480 --> 0:35:09.960
<v Speaker 1>Colossal Cave Adventure, which, for those of you who don't know,

0:35:10.400 --> 0:35:16.600
<v Speaker 1>was in the late nineties seventies, um by UM a

0:35:16.680 --> 0:35:25.120
<v Speaker 1>couple who were cavers and Um the Crowder, I think

0:35:25.160 --> 0:35:27.480
<v Speaker 1>that's his name. I would have to check it. Um

0:35:27.560 --> 0:35:34.120
<v Speaker 1>he developed this program which generated a world that you

0:35:34.200 --> 0:35:38.800
<v Speaker 1>could explore. You could go down and explore a cave system,

0:35:38.920 --> 0:35:44.560
<v Speaker 1>and it was all done entirely through text. So you

0:35:44.600 --> 0:35:47.200
<v Speaker 1>are going through a dark forest, you find a great

0:35:47.760 --> 0:35:52.239
<v Speaker 1>in the forest floor. What do you do next? Your

0:35:52.320 --> 0:35:55.799
<v Speaker 1>type go down. It then comes back with a description

0:35:55.880 --> 0:35:59.480
<v Speaker 1>of where you are. You are underneath the forest floor,

0:35:59.560 --> 0:36:04.319
<v Speaker 1>in the spring of a stream, and then you can

0:36:04.400 --> 0:36:07.800
<v Speaker 1>go left, you can go right, you can go down.

0:36:08.239 --> 0:36:13.120
<v Speaker 1>So you are guiding this character through a textual world,

0:36:14.440 --> 0:36:17.200
<v Speaker 1>and the more you get into it, the more you

0:36:17.239 --> 0:36:21.160
<v Speaker 1>engage with not only descriptions but also characters in that world,

0:36:21.440 --> 0:36:24.480
<v Speaker 1>and you could collect things, do things, So it's a

0:36:24.560 --> 0:36:29.600
<v Speaker 1>textual world that you're exploring. UM. Since then there have

0:36:29.719 --> 0:36:35.280
<v Speaker 1>been other extensions of that which make these textual characters

0:36:35.320 --> 0:36:39.160
<v Speaker 1>more believable, so actually interact with them. They cannot only

0:36:39.239 --> 0:36:45.160
<v Speaker 1>give you things, but they can behave as real agents

0:36:45.160 --> 0:36:48.719
<v Speaker 1>in the world would real humans would. And Dwarf Fortress

0:36:49.040 --> 0:36:53.160
<v Speaker 1>is an example of that where you've got a hugely

0:36:53.320 --> 0:36:57.399
<v Speaker 1>realistic world. And there was an example I gave him

0:36:57.400 --> 0:37:03.840
<v Speaker 1>the book about Drunk Cat, and the designers of Dwarf

0:37:03.880 --> 0:37:07.840
<v Speaker 1>Fortress had created all sorts of properties of animals, but

0:37:08.360 --> 0:37:13.440
<v Speaker 1>the ability of cats to drink alcohol had deliberately not

0:37:13.520 --> 0:37:18.440
<v Speaker 1>been programmed him. But in the tavern and war Fortress,

0:37:18.560 --> 0:37:22.960
<v Speaker 1>there were these cats that were lying dead, and only

0:37:23.000 --> 0:37:27.040
<v Speaker 1>through interacting with the code did they understand it. Did

0:37:27.040 --> 0:37:30.400
<v Speaker 1>they realize that what had happened was that the cats

0:37:30.440 --> 0:37:34.560
<v Speaker 1>went into this tavern, that people in the tavern had

0:37:34.600 --> 0:37:38.000
<v Speaker 1>spilt alcohol in the floor, the cats had walked through,

0:37:38.160 --> 0:37:40.760
<v Speaker 1>the cats had licked their paws, that cats have become

0:37:40.800 --> 0:37:44.160
<v Speaker 1>poisoned by alcohol. So you have these hugely rich and

0:37:44.239 --> 0:37:50.040
<v Speaker 1>realistic worlds that are realized through text. And so it's

0:37:50.120 --> 0:37:53.360
<v Speaker 1>become as a bit of a tributary of game playing

0:37:54.000 --> 0:37:58.280
<v Speaker 1>because you do have to interact with with text with words.

0:37:59.040 --> 0:38:03.640
<v Speaker 1>But as they big into merge now with the mainstream games,

0:38:04.040 --> 0:38:07.800
<v Speaker 1>then you will have spoken dialogue. You will be able

0:38:07.880 --> 0:38:16.279
<v Speaker 1>to meet your favorite characters in soap operas in streaming

0:38:16.360 --> 0:38:22.440
<v Speaker 1>series and you could talk to them, you can go

0:38:22.480 --> 0:38:27.480
<v Speaker 1>on dates with them, you can go holidays with them, um,

0:38:28.000 --> 0:38:31.920
<v Speaker 1>you can be part of their story. So it's bringing

0:38:31.960 --> 0:38:36.560
<v Speaker 1>together those two streams of game design these rich visual

0:38:36.600 --> 0:38:41.759
<v Speaker 1>worlds and now these textual, believable story worlds that I

0:38:41.800 --> 0:38:45.800
<v Speaker 1>think is going to be the next generation of interactive games.

0:38:46.200 --> 0:38:49.080
<v Speaker 1>What's gonna be really exciting to see this come together. Yeah,

0:38:49.120 --> 0:38:53.640
<v Speaker 1>I think it will, and I think one of the

0:38:53.680 --> 0:38:55.920
<v Speaker 1>things in the future. One of the opportunities in the

0:38:56.000 --> 0:38:59.719
<v Speaker 1>future is that you will be able to live in

0:38:59.760 --> 0:39:03.319
<v Speaker 1>the worlds for an extended period of time. So you

0:39:03.360 --> 0:39:06.239
<v Speaker 1>don't just play the game for forty minutes just like

0:39:06.360 --> 0:39:09.719
<v Speaker 1>you have a TV series. You'll have a TV series

0:39:09.880 --> 0:39:15.520
<v Speaker 1>where you live in this world. Now, that's both scary

0:39:15.600 --> 0:39:19.680
<v Speaker 1>and exciting to be able to live for extended time

0:39:20.200 --> 0:39:23.640
<v Speaker 1>in a virtual world where you can talk to and

0:39:23.760 --> 0:39:26.680
<v Speaker 1>engage with the characters. So what do you think the

0:39:26.719 --> 0:39:30.759
<v Speaker 1>machines are going to want to tell stories about? So

0:39:30.800 --> 0:39:36.920
<v Speaker 1>it's been suggested that because machines don't have human experience,

0:39:37.360 --> 0:39:42.080
<v Speaker 1>they will never be able to tell stories that are

0:39:42.320 --> 0:39:47.400
<v Speaker 1>experientially rich. They don't know the human condition, They've never

0:39:47.440 --> 0:39:50.120
<v Speaker 1>been there, they've never fallen in love, they've never seen

0:39:50.160 --> 0:39:54.839
<v Speaker 1>a sunset. So they will reach a plateau where they

0:39:54.920 --> 0:39:59.000
<v Speaker 1>may produce pastiches, but they won't be able to describe

0:39:59.560 --> 0:40:02.080
<v Speaker 1>or to we have the human condition, they won't be

0:40:02.120 --> 0:40:05.360
<v Speaker 1>able to engage you in any deep human way in

0:40:05.400 --> 0:40:09.600
<v Speaker 1>a story. Now, I'm sure that's right. It's possible you

0:40:09.640 --> 0:40:14.280
<v Speaker 1>could get around that by having embodied storytellers, so embodied

0:40:14.360 --> 0:40:16.560
<v Speaker 1>robots that can go out into the world, that can

0:40:16.600 --> 0:40:19.560
<v Speaker 1>gaze at the sunset, that can go for walks, that

0:40:19.680 --> 0:40:24.719
<v Speaker 1>can feel the wind in their metal phases. But there's

0:40:24.719 --> 0:40:32.000
<v Speaker 1>another possibility, which is that already programs have something approaching

0:40:32.000 --> 0:40:36.760
<v Speaker 1>a social life, that they are connected to other entities

0:40:36.760 --> 0:40:40.800
<v Speaker 1>on the web. They are part of a social network.

0:40:41.640 --> 0:40:47.359
<v Speaker 1>And if they can tell stories about their worlds, their

0:40:47.440 --> 0:40:51.960
<v Speaker 1>worlds of being entities on the web, their worlds as

0:40:52.040 --> 0:40:57.879
<v Speaker 1>being part of a connected Internet system where there are viruses,

0:40:58.120 --> 0:41:05.080
<v Speaker 1>where there are software breakdowns, breakthroughs, entities that interact with

0:41:05.120 --> 0:41:11.880
<v Speaker 1>each other in a computational way that we can't express,

0:41:12.000 --> 0:41:14.279
<v Speaker 1>then they could be valuable in two ways. One is

0:41:14.320 --> 0:41:18.320
<v Speaker 1>that they could help us to understand this complex system

0:41:18.400 --> 0:41:20.920
<v Speaker 1>that is the World Wide Web. They could help to

0:41:21.000 --> 0:41:27.760
<v Speaker 1>interpret the growing, changing nature of the World Wide Web

0:41:28.040 --> 0:41:33.480
<v Speaker 1>in human language. But also they could tell stories. They

0:41:33.480 --> 0:41:37.600
<v Speaker 1>could tell stories of their travels through the Internet. They

0:41:37.600 --> 0:41:43.560
<v Speaker 1>could tell stories of how they became beings that were

0:41:43.680 --> 0:41:47.160
<v Speaker 1>taught and learned through interaction with other objects in the web.

0:41:47.560 --> 0:41:50.600
<v Speaker 1>So I don't think it's necessary to be embodied to

0:41:50.719 --> 0:41:55.360
<v Speaker 1>have human type experience in order to tell interesting stories.

0:41:56.600 --> 0:42:01.080
<v Speaker 1>That they may tell quite alien stories of life on

0:42:01.120 --> 0:42:05.640
<v Speaker 1>the web. And to me, that's much more exciting and

0:42:05.719 --> 0:42:11.240
<v Speaker 1>interesting than just spouting a pastiche of a human story.

0:42:11.800 --> 0:42:13.880
<v Speaker 1>This reminds me of something you you bring up in

0:42:13.880 --> 0:42:16.759
<v Speaker 1>in the book that that that i've that they really

0:42:16.800 --> 0:42:18.920
<v Speaker 1>ring true and uh and also made me, you know,

0:42:18.960 --> 0:42:21.000
<v Speaker 1>rethink a number of things, and that is that the

0:42:21.120 --> 0:42:23.440
<v Speaker 1>uncanny valley, which is a concept that most of its

0:42:23.520 --> 0:42:27.319
<v Speaker 1>familiar with when it comes to um robots made in

0:42:27.360 --> 0:42:29.960
<v Speaker 1>the in the likeness of a human being, or or

0:42:30.000 --> 0:42:32.880
<v Speaker 1>certainly when we get into computer generated imagery and films.

0:42:32.960 --> 0:42:36.440
<v Speaker 1>But you point out that that this that the uncanny

0:42:36.480 --> 0:42:40.480
<v Speaker 1>valley isn't really a thing in storytelling. In one sense,

0:42:40.480 --> 0:42:44.640
<v Speaker 1>it isn't because stories are meant to be disturbing and unsettling.

0:42:45.080 --> 0:42:47.799
<v Speaker 1>That's why we read science fiction, that's why we read

0:42:47.880 --> 0:42:51.880
<v Speaker 1>crime novels because they're meant to be disturbing. So having

0:42:51.920 --> 0:42:56.399
<v Speaker 1>a machine that is in some sense uncanny or disturbing

0:42:56.880 --> 0:42:59.759
<v Speaker 1>in the language that it produces in the stories. That

0:43:00.000 --> 0:43:03.560
<v Speaker 1>else I think we'll only add to the richness of storytelling.

0:43:04.480 --> 0:43:07.120
<v Speaker 1>But where I think there is an uncanny valley is

0:43:07.560 --> 0:43:12.600
<v Speaker 1>if we then say, was this written by a machine?

0:43:13.280 --> 0:43:17.520
<v Speaker 1>And if so, what kind of machine? So is it

0:43:17.560 --> 0:43:21.360
<v Speaker 1>a machine that was trained on billions of words from

0:43:21.400 --> 0:43:26.600
<v Speaker 1>her the web that has had no experience of the world.

0:43:27.680 --> 0:43:31.240
<v Speaker 1>In that case, how can it be reporting on the world?

0:43:31.360 --> 0:43:35.120
<v Speaker 1>How can it pretend to have a human like experience?

0:43:35.600 --> 0:43:38.520
<v Speaker 1>So if we know that the author is a machine,

0:43:39.440 --> 0:43:44.160
<v Speaker 1>then it becomes unsettling because we then start to judge

0:43:44.760 --> 0:43:49.360
<v Speaker 1>perhaps it's very plausible, very poignant, very evocative prose against

0:43:49.480 --> 0:43:51.960
<v Speaker 1>human experience, and we realize that it hasn't had that

0:43:52.200 --> 0:43:57.200
<v Speaker 1>human experience. So how do we know whether other stories

0:43:57.600 --> 0:44:01.200
<v Speaker 1>are really, you know, the product of human experience? And

0:44:01.239 --> 0:44:04.400
<v Speaker 1>what does it mean to tell a story based on experience?

0:44:05.080 --> 0:44:09.000
<v Speaker 1>So the stories themselves I don't think need to worry, yes,

0:44:09.120 --> 0:44:15.160
<v Speaker 1>because stories do have an uncanniness to them. But I

0:44:15.200 --> 0:44:17.920
<v Speaker 1>think once we start to question the author of those

0:44:17.960 --> 0:44:22.440
<v Speaker 1>stories and whether that story is based on genuine experience,

0:44:22.800 --> 0:44:25.920
<v Speaker 1>then it becomes unsettling. All right, Well, the book again

0:44:26.040 --> 0:44:30.399
<v Speaker 1>is story Machines How Computers Have Become Creative Writers should

0:44:30.440 --> 0:44:33.880
<v Speaker 1>come out July five, and Mike thanks once more for

0:44:34.000 --> 0:44:35.640
<v Speaker 1>taking time out of your day and chatting with us

0:44:35.640 --> 0:44:38.480
<v Speaker 1>on the show. It was a pleasure. I've really enjoyed it,

0:44:38.640 --> 0:44:42.440
<v Speaker 1>so thank you for asking me all right so that

0:44:42.600 --> 0:44:44.920
<v Speaker 1>you have it. Thanks again to Mike Sharple's for a

0:44:45.000 --> 0:44:47.680
<v Speaker 1>taking time out of his day to chat with me again.

0:44:47.719 --> 0:44:51.160
<v Speaker 1>That book is Story Machines, How Computers Have Become Creative

0:44:51.160 --> 0:44:54.719
<v Speaker 1>Writers is available now wherever you get your books. And

0:44:54.760 --> 0:44:57.560
<v Speaker 1>if you want to get a taste of this for yourself,

0:44:57.640 --> 0:45:01.600
<v Speaker 1>you can go to story hyphen machine dot net and

0:45:01.800 --> 0:45:05.360
<v Speaker 1>uh it's experiment a little bit like we've been experimenting

0:45:05.840 --> 0:45:07.279
<v Speaker 1>in the meantime. If you would like to check out

0:45:07.320 --> 0:45:09.600
<v Speaker 1>other episodes of Stuff to Blow Your Mind, you can

0:45:09.600 --> 0:45:11.520
<v Speaker 1>find it in the Stuff to Blow your Mind podcast feed.

0:45:11.520 --> 0:45:14.279
<v Speaker 1>We have Core episodes on Tuesdays and Thursday's, Artifact or

0:45:14.320 --> 0:45:17.279
<v Speaker 1>Monster Fact episodes on Wednesdays, Listener Mail on Mondays, and

0:45:17.280 --> 0:45:19.839
<v Speaker 1>on Friday, we set aside most serious concerns and just

0:45:19.880 --> 0:45:22.480
<v Speaker 1>talk about a weird film. Huge thanks as always to

0:45:22.520 --> 0:45:26.279
<v Speaker 1>our excellent audio producer Seth Nicholas Johnson If you would

0:45:26.320 --> 0:45:28.440
<v Speaker 1>like to get in touch with us with feedback on

0:45:28.480 --> 0:45:30.920
<v Speaker 1>this episode or any other, to suggest a topic for

0:45:30.920 --> 0:45:33.400
<v Speaker 1>the future, or just to say hello, you can email

0:45:33.480 --> 0:45:44.200
<v Speaker 1>us at contact at stuff to Blow your Mind dot com.

0:45:44.280 --> 0:45:46.760
<v Speaker 1>Stuff to Blow Your Mind is production of I Heart Radio.

0:45:47.120 --> 0:45:49.440
<v Speaker 1>For more podcasts for My Heart Radio, visit the iHeart

0:45:49.480 --> 0:45:52.239
<v Speaker 1>Radio app, Apple Podcasts, or wherever you're listening to your

0:45:52.239 --> 0:46:01.040
<v Speaker 1>favorite shows. B B B bl bl bl, Blunt Bust

0:46:01.440 --> 0:46:08.680
<v Speaker 1>presided at the Foo