WEBVTT - Week in Tech: Can a Woolly Mouse Make a Mammoth? 

0:00:00.280 --> 0:00:03.240
<v Speaker 1>Welcome to Tech Stuff, a production of iHeart Podcasts and

0:00:03.279 --> 0:00:06.840
<v Speaker 1>Kaleidoscope IMA's Veloscian, and today will bring you the headlines

0:00:06.840 --> 0:00:11.200
<v Speaker 1>of the week, including a genetically edited rodent, the Wally Mouse. Then,

0:00:11.280 --> 0:00:13.760
<v Speaker 1>on today's Tech Supports segment, we'll talk to four of

0:00:13.840 --> 0:00:16.960
<v Speaker 1>Form Media's Jason Kebler about what the future of AI

0:00:17.079 --> 0:00:22.360
<v Speaker 1>movies could look like. All of that on the weekend

0:00:22.400 --> 0:00:30.520
<v Speaker 1>Tech Is Friday. It's March seventh. I'm excited to be

0:00:30.560 --> 0:00:33.240
<v Speaker 1>back in the studio this week with our producer Eliza Dennis.

0:00:33.280 --> 0:00:34.640
<v Speaker 2>We're glad to have you Stateside.

0:00:34.880 --> 0:00:36.480
<v Speaker 1>Yes, it's felt like I was away for a long time.

0:00:36.600 --> 0:00:38.440
<v Speaker 2>I'm wondering if that had something to do with this

0:00:38.560 --> 0:00:39.440
<v Speaker 2>news cycle though.

0:00:39.720 --> 0:00:43.120
<v Speaker 1>Yeah, there's a lot, lot lot to cover, so should

0:00:43.120 --> 0:00:43.639
<v Speaker 1>we jump in.

0:00:43.800 --> 0:00:45.000
<v Speaker 2>Yeah? Absolutely. So.

0:00:45.120 --> 0:00:46.879
<v Speaker 1>It was a bit of a confusing week when it

0:00:46.920 --> 0:00:50.440
<v Speaker 1>comes to chips and semiconductors, and I'll come back to

0:00:50.479 --> 0:00:54.120
<v Speaker 1>why it was confusing. But Monday saw President Trump hold

0:00:54.160 --> 0:01:01.120
<v Speaker 1>a press conference with the Taiwan Semiconductor Manufacturing Company aka TSMC.

0:01:01.640 --> 0:01:05.800
<v Speaker 1>The clues in the name the company manufacture semiconductors and

0:01:06.120 --> 0:01:10.959
<v Speaker 1>they produce ninety percent of the world's super advanced semiconductor chips.

0:01:11.000 --> 0:01:14.039
<v Speaker 1>These are the chips that power AI training models, but

0:01:14.120 --> 0:01:17.679
<v Speaker 1>also devices and basically are the backbone of the new

0:01:17.800 --> 0:01:22.000
<v Speaker 1>global economy. However, the vast majority of the manufacturing takes

0:01:22.040 --> 0:01:26.240
<v Speaker 1>place in Taiwan, and so many in Washington and beyond

0:01:26.280 --> 0:01:29.559
<v Speaker 1>have worn that TSMC's dominance in the chip industry could

0:01:29.560 --> 0:01:33.720
<v Speaker 1>create a national security risk, given that Taiwan is squarely

0:01:33.880 --> 0:01:38.120
<v Speaker 1>in the bullseye of China's territorial ambitions. But this week,

0:01:38.319 --> 0:01:42.240
<v Speaker 1>the Taiwanese company pledged to invest one hundred billion dollars

0:01:42.280 --> 0:01:44.360
<v Speaker 1>in manufacturing chips on US soil.

0:01:44.760 --> 0:01:46.600
<v Speaker 2>You know, this is so interesting to me because it

0:01:46.640 --> 0:01:50.160
<v Speaker 2>comes after multiple announcements over the last couple of months

0:01:50.200 --> 0:01:54.240
<v Speaker 2>about investments in things like data centers and AI infrastructure.

0:01:54.440 --> 0:01:57.720
<v Speaker 2>And that was with Stargate, and then Apple actually recently

0:01:57.720 --> 0:02:01.040
<v Speaker 2>made a pledge to make more products domestically with themestic contractors.

0:02:01.240 --> 0:02:03.520
<v Speaker 1>Yes, I think they talked about five hundred billion dollars.

0:02:03.640 --> 0:02:06.559
<v Speaker 1>But what was really interesting was that as soon as Tuesday,

0:02:06.720 --> 0:02:10.919
<v Speaker 1>when Trump addressed Congress, he talked about his aggressive desire

0:02:11.000 --> 0:02:14.880
<v Speaker 1>to dismantle the Act that actually TSMC is using in

0:02:14.960 --> 0:02:19.560
<v Speaker 1>part to fund its semiconductor manufacturing in the US. The

0:02:19.680 --> 0:02:23.880
<v Speaker 1>Chips Act was biden error legislation that basically created a

0:02:23.919 --> 0:02:29.320
<v Speaker 1>platform for manufacturing semiconductor chips in the US. I don't know,

0:02:29.400 --> 0:02:31.080
<v Speaker 1>I don't know how to square those two things, but

0:02:31.160 --> 0:02:33.560
<v Speaker 1>that actually brings us to our next headline, which is

0:02:33.760 --> 0:02:39.160
<v Speaker 1>a breakthrough indirectly interpreting and reading brain waves and converting

0:02:39.200 --> 0:02:39.680
<v Speaker 1>them to text.

0:02:40.000 --> 0:02:41.480
<v Speaker 2>The superpower I want.

0:02:41.639 --> 0:02:43.880
<v Speaker 1>Yes, exactly what. You may be able to buy it

0:02:44.080 --> 0:02:46.040
<v Speaker 1>if met had anything to do with it, because they

0:02:46.120 --> 0:02:49.800
<v Speaker 1>announced that in partnership with the Basque Center on Cognition,

0:02:50.000 --> 0:02:53.960
<v Speaker 1>Brain and Language in Spain, researchers have been able to

0:02:54.000 --> 0:02:59.960
<v Speaker 1>decode unspoken language, often reconstructing full sentences directly from brainwaves

0:03:00.480 --> 0:03:03.600
<v Speaker 1>and not even requiring any surgical intervention. This is all

0:03:03.639 --> 0:03:05.600
<v Speaker 1>stuff which can be measured outside the head.

0:03:05.800 --> 0:03:08.840
<v Speaker 2>Yeah, and that's really the breakthrough here, right, because other

0:03:08.919 --> 0:03:14.400
<v Speaker 2>research from companies like Neurolink have been extremely invasive, you know,

0:03:14.520 --> 0:03:17.840
<v Speaker 2>electrodes being implanted into the brain. Invasive.

0:03:18.320 --> 0:03:20.639
<v Speaker 1>Yeah, that's right. And this research is all about kind

0:03:20.680 --> 0:03:24.000
<v Speaker 1>of putting monitors on the skull or around the head

0:03:24.280 --> 0:03:26.520
<v Speaker 1>to be able to read brain waves without having to

0:03:26.800 --> 0:03:30.040
<v Speaker 1>directly hook into the brain, which is obviously much less scary,

0:03:30.360 --> 0:03:33.760
<v Speaker 1>and there's an amazing promise for people with cognitive impairments or

0:03:33.760 --> 0:03:37.560
<v Speaker 1>brain injuries to be able to convert their thoughts into

0:03:37.800 --> 0:03:41.280
<v Speaker 1>text and therefore speech. But there are also some concerns.

0:03:41.360 --> 0:03:45.560
<v Speaker 1>Right the Vox headline was Meta's brain to text tech

0:03:45.760 --> 0:03:49.040
<v Speaker 1>is here. We are not remotely ready, And of course

0:03:49.040 --> 0:03:52.800
<v Speaker 1>the big concern here is privacy if private companies can

0:03:52.840 --> 0:03:55.520
<v Speaker 1>actually read our thoughts. But there's actually a long way

0:03:55.520 --> 0:03:58.600
<v Speaker 1>to go before this research leaves the lab. Nonetheless, the

0:03:58.680 --> 0:04:02.320
<v Speaker 1>experiment was kind of a maze. So thirty five volunteers

0:04:02.400 --> 0:04:06.640
<v Speaker 1>sat under magnetic brain imaging scanners and typed on a keyboard.

0:04:07.040 --> 0:04:10.000
<v Speaker 1>Based on prior training, an AI model was able to

0:04:10.040 --> 0:04:13.640
<v Speaker 1>predict what they were writing, and meture research is accurately

0:04:13.680 --> 0:04:17.080
<v Speaker 1>decoded between seventy and eighty percent of what people typed.

0:04:17.120 --> 0:04:20.479
<v Speaker 1>In other words, with seventy to eighty percent certainty, it

0:04:20.560 --> 0:04:23.280
<v Speaker 1>could know before I clicked a T that I was

0:04:23.279 --> 0:04:25.960
<v Speaker 1>about to click the T. And so the real promise

0:04:26.040 --> 0:04:29.880
<v Speaker 1>here is actually a data from this research is beginning

0:04:29.920 --> 0:04:34.799
<v Speaker 1>to give neuroscientists a path to understanding how abstract thoughts

0:04:35.120 --> 0:04:37.559
<v Speaker 1>are converted into language by the human brain.

0:04:38.080 --> 0:04:39.840
<v Speaker 2>Then I think the other part of this is that

0:04:39.880 --> 0:04:42.680
<v Speaker 2>we're getting closer and closer to this idea that we

0:04:42.720 --> 0:04:45.120
<v Speaker 2>can have wearables that do this kind.

0:04:45.000 --> 0:04:48.000
<v Speaker 1>Of tech totally. But of course, a wearable headset that

0:04:48.080 --> 0:04:50.640
<v Speaker 1>can can actually read your thoughts and translate them into

0:04:50.720 --> 0:04:54.960
<v Speaker 1>language is something that you know, conceivably could change a

0:04:55.000 --> 0:04:58.279
<v Speaker 1>lot of people's lives. In another kind of science fiction

0:04:58.480 --> 0:05:02.320
<v Speaker 1>becomes science fact story, it's about the wooly mammoths. The

0:05:02.360 --> 0:05:07.039
<v Speaker 1>headline from MPR was just irresistible hoping to revive mammoths,

0:05:07.360 --> 0:05:12.560
<v Speaker 1>scientists create wooly mice. Yeah, and I think one of

0:05:12.560 --> 0:05:14.200
<v Speaker 1>the scientists that we knew we could do it, but

0:05:14.400 --> 0:05:17.720
<v Speaker 1>we didn't know they would be this cute and they're

0:05:17.720 --> 0:05:20.040
<v Speaker 1>worth a look. But the story is about a company

0:05:20.080 --> 0:05:23.880
<v Speaker 1>called Colossal Biosciences, and they are, by their own account,

0:05:24.120 --> 0:05:26.880
<v Speaker 1>the first and only de extinction company.

0:05:27.360 --> 0:05:29.520
<v Speaker 2>Okay, this was a concept I had never heard of

0:05:29.600 --> 0:05:30.400
<v Speaker 2>until this week.

0:05:30.720 --> 0:05:33.280
<v Speaker 1>Yeah, this one's been one that's I've been intrigued by

0:05:33.320 --> 0:05:34.600
<v Speaker 1>for it for a long time, and I hope we'll

0:05:34.600 --> 0:05:36.000
<v Speaker 1>be able to cover it on an episode of the

0:05:36.040 --> 0:05:39.560
<v Speaker 1>story before too long. But Colossal's website points out that

0:05:39.680 --> 0:05:42.880
<v Speaker 1>nine hundred and two species are extinct and more than

0:05:42.960 --> 0:05:47.520
<v Speaker 1>nine two hundred are critically endangered, and their mission is

0:05:47.560 --> 0:05:52.240
<v Speaker 1>to restore extinct species to preserve biodiversity. It's a little controversial.

0:05:52.440 --> 0:05:54.960
<v Speaker 1>Some people think there are more efficient ways to do

0:05:55.040 --> 0:06:00.400
<v Speaker 1>conservation than reviving extinct species, you know, But to that

0:06:00.480 --> 0:06:02.479
<v Speaker 1>I would say, I mean, look at the wooly mouse.

0:06:02.520 --> 0:06:05.240
<v Speaker 1>This is whether or not you think this is the

0:06:05.240 --> 0:06:09.880
<v Speaker 1>most efficient investment. It is absolutely wild. So picture a

0:06:09.920 --> 0:06:13.599
<v Speaker 1>mouse with fluffy, orange tan fur that looks like it

0:06:13.640 --> 0:06:15.440
<v Speaker 1>got very wet and then got a blow dry at

0:06:15.440 --> 0:06:17.360
<v Speaker 1>the salon. You've got the picture.

0:06:18.000 --> 0:06:19.800
<v Speaker 2>They are extremely cute, and.

0:06:19.920 --> 0:06:23.160
<v Speaker 1>The way Colossal made them was first studying the wooly

0:06:23.200 --> 0:06:28.440
<v Speaker 1>mammoth genome and then genetically engineering mice by modifying seven

0:06:28.520 --> 0:06:32.039
<v Speaker 1>key genes to make them more like wooly mammoths. You know,

0:06:32.120 --> 0:06:35.440
<v Speaker 1>the wool obviously being the most visible element, but also

0:06:35.480 --> 0:06:38.359
<v Speaker 1>some things that were invisible, like the way the mice

0:06:38.400 --> 0:06:40.920
<v Speaker 1>store fat and their fat metabolism makes them much more

0:06:40.920 --> 0:06:44.080
<v Speaker 1>able to survive in the cold. And according to Colossal,

0:06:44.560 --> 0:06:49.839
<v Speaker 1>the plan is to implant wooly mammoth esque modified embryos

0:06:50.040 --> 0:06:54.040
<v Speaker 1>to Asian elephants by twenty twenty eight. This week, was

0:06:54.040 --> 0:06:58.040
<v Speaker 1>also the Oscars and we both saw the movie that

0:06:58.120 --> 0:06:59.760
<v Speaker 1>won Best Live Action.

0:07:00.640 --> 0:07:02.840
<v Speaker 2>Please tell people about it. It's wonderful.

0:07:03.320 --> 0:07:07.000
<v Speaker 1>So it's a Belgian Dutch copro called I'm Not a Robot?

0:07:07.120 --> 0:07:07.880
<v Speaker 1>What did you make of it?

0:07:08.320 --> 0:07:10.480
<v Speaker 2>I was extremely tickled by this promise.

0:07:11.200 --> 0:07:13.040
<v Speaker 1>So, for those who haven't seen it, the film was

0:07:13.040 --> 0:07:16.720
<v Speaker 1>written and directed by Victoria Wamadam and it's about a

0:07:16.760 --> 0:07:21.480
<v Speaker 1>music producer who fails a series of capture tests and

0:07:22.000 --> 0:07:25.400
<v Speaker 1>in so doing us to question whether she's in fact human.

0:07:26.320 --> 0:07:28.120
<v Speaker 2>I mean the minute I knew that we were having

0:07:28.160 --> 0:07:30.840
<v Speaker 2>a capture test as part of the plot to this movie,

0:07:30.920 --> 0:07:33.080
<v Speaker 2>I was all in. I don't know if you have

0:07:33.160 --> 0:07:37.040
<v Speaker 2>this feeling, but I hate failing captured tests, especially when

0:07:37.040 --> 0:07:40.160
<v Speaker 2>you have to click I'm not a robot and all

0:07:40.200 --> 0:07:42.840
<v Speaker 2>you have to do is choose squares that show images

0:07:42.920 --> 0:07:46.320
<v Speaker 2>of street lights or motorcycles or bikes. How can I

0:07:46.360 --> 0:07:46.960
<v Speaker 2>get that wrong?

0:07:47.240 --> 0:07:49.800
<v Speaker 1>Yeah? So she's failing the tests again and again, even

0:07:49.800 --> 0:07:51.920
<v Speaker 1>though it looks like she's doing it right. And then

0:07:51.920 --> 0:07:55.120
<v Speaker 1>she gets to pop up with another quiz and one

0:07:55.120 --> 0:07:58.120
<v Speaker 1>of the questions is did your parents die before you

0:07:58.160 --> 0:08:02.480
<v Speaker 1>met them? And she answers, she answers, yes, and I

0:08:02.480 --> 0:08:04.760
<v Speaker 1>don't want to spoil the whole plot. It gets pretty eerie,

0:08:05.120 --> 0:08:07.040
<v Speaker 1>but it's a fascinating film well worth a watch. You

0:08:07.080 --> 0:08:08.920
<v Speaker 1>can check it out actually on the New Yorker website

0:08:08.920 --> 0:08:12.040
<v Speaker 1>because they were involved in releasing the film and on YouTube,

0:08:12.280 --> 0:08:15.160
<v Speaker 1>and as a tech nerd, I was rooting for them

0:08:15.200 --> 0:08:17.320
<v Speaker 1>to win the Best Live Action Short and did.

0:08:18.040 --> 0:08:18.280
<v Speaker 3>Yes.

0:08:18.680 --> 0:08:22.000
<v Speaker 2>Congratulations team. I'm not a robot, So.

0:08:22.040 --> 0:08:24.920
<v Speaker 1>Stick around as well after the break for a look

0:08:24.920 --> 0:08:28.720
<v Speaker 1>at how AI was used in this year's OSCAR nominated

0:08:28.760 --> 0:08:32.320
<v Speaker 1>feature films, including The Brutalist, and for a conversation with

0:08:32.400 --> 0:08:35.160
<v Speaker 1>Jason Kebler about what it's like to attend an AI

0:08:35.320 --> 0:08:45.640
<v Speaker 1>film festival. Stay with us, Welcome back. The Oscars were

0:08:45.640 --> 0:08:48.600
<v Speaker 1>on Sunday, so we're going to stick with movies. Back

0:08:48.600 --> 0:08:51.839
<v Speaker 1>in twenty twenty three, the Hollywood Writers' Strike was this

0:08:52.080 --> 0:08:57.000
<v Speaker 1>fascinating early example of a very public negotiation over how

0:08:57.160 --> 0:09:02.760
<v Speaker 1>AI might could, and even would disrupt and displace human labour. Ultimately,

0:09:02.760 --> 0:09:05.280
<v Speaker 1>the Writers Guild of America signed an agreement with the

0:09:05.280 --> 0:09:09.439
<v Speaker 1>Alliance of Motion Picture and Television Producers that Generative AI

0:09:09.559 --> 0:09:13.240
<v Speaker 1>would not reduce or eliminate writers and their pay. But

0:09:13.320 --> 0:09:15.920
<v Speaker 1>this was not a commitment by the industry not to

0:09:16.040 --> 0:09:19.920
<v Speaker 1>use generative AI in filmmaking, far from it. In fact,

0:09:19.920 --> 0:09:23.600
<v Speaker 1>this January, the editor of the Triple Oscar winning movie

0:09:23.679 --> 0:09:27.240
<v Speaker 1>The Brutalist told an industry publication that he had used

0:09:27.240 --> 0:09:31.040
<v Speaker 1>generative AI a few times in post production. Some of

0:09:31.040 --> 0:09:35.160
<v Speaker 1>the actors in The Brutalist, namely Felicity Jones and Adrian Brody,

0:09:35.640 --> 0:09:39.240
<v Speaker 1>performed their roles with a heavy Hungarian accent, and they

0:09:39.240 --> 0:09:42.920
<v Speaker 1>even had some dialogue in Hungarian. To prepare for the roles,

0:09:43.000 --> 0:09:45.839
<v Speaker 1>Brody and Jones spent months with a dialect coach to

0:09:45.880 --> 0:09:50.480
<v Speaker 1>perfect their accents, but as The Brutalist editor David Joncho,

0:09:50.920 --> 0:09:54.720
<v Speaker 1>a native Hungarian speaker, pointed out, English speakers can have

0:09:54.720 --> 0:09:58.880
<v Speaker 1>a hard time pronouncing certain sounds. In post he tried

0:09:58.920 --> 0:10:01.760
<v Speaker 1>to perfect the Hungarian in dialogue, and first the team

0:10:01.800 --> 0:10:03.600
<v Speaker 1>had the actor's reader of the lines in the studio.

0:10:04.080 --> 0:10:06.840
<v Speaker 1>Then they tried having other actors say the lines, but

0:10:07.080 --> 0:10:10.560
<v Speaker 1>that also didn't sound right, so Yoncho turned to AI.

0:10:11.200 --> 0:10:15.079
<v Speaker 1>He fared Brody and Jones's voices into the program respeecher

0:10:15.600 --> 0:10:19.199
<v Speaker 1>and then using his own voice, Yoncho refined certain vowels

0:10:19.200 --> 0:10:22.000
<v Speaker 1>and letters for accuracy, a process that could have been

0:10:22.000 --> 0:10:24.880
<v Speaker 1>done without generative AI, like in an audio editors such

0:10:24.880 --> 0:10:28.120
<v Speaker 1>as pro Tools, but Respeecher made the process much more

0:10:28.120 --> 0:10:31.720
<v Speaker 1>efficient and of course Adrian Brody won the Oscar for

0:10:31.760 --> 0:10:35.800
<v Speaker 1>Best Actor. As us Say Today reported, not all viewers

0:10:35.840 --> 0:10:39.040
<v Speaker 1>would pleased with the news. Don't think it's too reactionary

0:10:39.080 --> 0:10:41.240
<v Speaker 1>to say this movie should lose the Academy buzz. It

0:10:41.280 --> 0:10:44.880
<v Speaker 1>was getting one person posted on eggs. But the manipulation

0:10:44.960 --> 0:10:48.559
<v Speaker 1>of vocal tracks is not uncommon in movies. Deadline noted

0:10:48.559 --> 0:10:51.600
<v Speaker 1>that combinations of vocal tracks will use in performances like

0:10:51.720 --> 0:10:55.800
<v Speaker 1>Romy Mallock's Oscar winning portrayal of Freddie Mercury, and Respeecher

0:10:55.840 --> 0:10:57.880
<v Speaker 1>may have been used in another film nominated for Best

0:10:57.880 --> 0:11:01.440
<v Speaker 1>Picture this year, Amelia Perez. The rise of generative AI

0:11:01.559 --> 0:11:04.440
<v Speaker 1>has been remarkably fast in creative industries. But one big

0:11:04.520 --> 0:11:07.640
<v Speaker 1>question I have is how far could this go and

0:11:07.720 --> 0:11:10.880
<v Speaker 1>how soon? And to answer that, we want to turn

0:11:10.960 --> 0:11:13.679
<v Speaker 1>to our friend Jason Kebler at four or four Media,

0:11:13.760 --> 0:11:16.959
<v Speaker 1>who not too long ago attended a film festival of

0:11:17.080 --> 0:11:20.040
<v Speaker 1>AI generated movies. Jason, welcome back to the show.

0:11:20.120 --> 0:11:21.120
<v Speaker 3>Hey, thanks for having me.

0:11:21.360 --> 0:11:23.320
<v Speaker 1>Before we get into that film festival you went to

0:11:23.600 --> 0:11:26.839
<v Speaker 1>could you just explain how respeech it works and how

0:11:26.880 --> 0:11:29.439
<v Speaker 1>it was used in the editing process for the Brutalist.

0:11:29.760 --> 0:11:34.720
<v Speaker 3>Yeah. So, respeacher is an AI voice synthesizer, and so

0:11:35.160 --> 0:11:39.600
<v Speaker 3>it takes training data of an actor's voice and runs

0:11:39.600 --> 0:11:42.240
<v Speaker 3>it against a large language model. So in this case,

0:11:42.240 --> 0:11:45.800
<v Speaker 3>it would probably be examples of the Hungarian language, et cetera.

0:11:46.280 --> 0:11:49.520
<v Speaker 3>And it would take Adrian Brodie's voice and make it

0:11:49.760 --> 0:11:56.440
<v Speaker 3>more closely match other examples of Hungarian language. And it's

0:11:56.520 --> 0:11:59.120
<v Speaker 3>very interesting because this technology is sort of one of

0:11:59.160 --> 0:12:04.360
<v Speaker 3>the first native AI technologies that was widely used commercially,

0:12:04.920 --> 0:12:08.800
<v Speaker 3>not just Respeecher, but another company called eleven Labs has

0:12:09.640 --> 0:12:13.520
<v Speaker 3>become really famous for like Eric Adams, the mayor of

0:12:13.600 --> 0:12:18.520
<v Speaker 3>New York City, did a calling campaign to various communities

0:12:18.520 --> 0:12:21.240
<v Speaker 3>in New York City where he spoke English, but then

0:12:21.360 --> 0:12:24.920
<v Speaker 3>eleven Labs translated his voice into like fifteen different languages.

0:12:25.320 --> 0:12:28.920
<v Speaker 3>And it's not just like a robot voice reading it

0:12:29.080 --> 0:12:33.239
<v Speaker 3>sounds like Eric Adams speaking Mandarin or Eric Adams speaking Hungarian.

0:12:33.760 --> 0:12:37.600
<v Speaker 3>And so increasingly this is being used in movies, not

0:12:37.640 --> 0:12:40.600
<v Speaker 3>just Respeecher, but also eleven Labs and other tools like it,

0:12:41.160 --> 0:12:44.080
<v Speaker 3>and it really is like one of the first big

0:12:44.120 --> 0:12:47.160
<v Speaker 3>commercial uses of generative AI in movies.

0:12:47.720 --> 0:12:49.560
<v Speaker 1>To me, it feels like it's not that far away

0:12:49.600 --> 0:12:52.520
<v Speaker 1>from other post production tools that have been super charged

0:12:52.559 --> 0:12:55.679
<v Speaker 1>by AI, like description, podcast editing, or other tools like that.

0:12:56.040 --> 0:12:58.880
<v Speaker 3>Yeah, I mean it's really interesting because I think that

0:12:59.160 --> 0:13:02.280
<v Speaker 3>music had this a long time ago, with things like autotune,

0:13:02.600 --> 0:13:06.880
<v Speaker 3>and it's like many, many, many popular artists use autotune,

0:13:06.920 --> 0:13:10.439
<v Speaker 3>and this is a very similar technology. I mean it's

0:13:10.679 --> 0:13:13.840
<v Speaker 3>it's in the same family of technologies at least. So

0:13:13.920 --> 0:13:17.800
<v Speaker 3>it just becomes a question of how much post can

0:13:17.880 --> 0:13:21.000
<v Speaker 3>there be for the human performance to still be there.

0:13:21.040 --> 0:13:24.160
<v Speaker 3>And I think it's a really open question at this point.

0:13:24.280 --> 0:13:27.480
<v Speaker 3>I think if you asked me a while ago, I

0:13:27.480 --> 0:13:31.120
<v Speaker 3>would say they're changing the performance in some fundamental way.

0:13:31.200 --> 0:13:35.080
<v Speaker 3>But I think everything in a movie is so carefully edited,

0:13:35.160 --> 0:13:38.600
<v Speaker 3>so carefully shot. They do hundreds of takes for certain

0:13:38.679 --> 0:13:42.000
<v Speaker 3>scenes and then splice together different takes and cuts, and

0:13:42.040 --> 0:13:44.280
<v Speaker 3>so I think it really is a spectrum of what

0:13:44.360 --> 0:13:48.240
<v Speaker 3>you are willing to accept if you're in the Academy

0:13:48.240 --> 0:13:50.679
<v Speaker 3>and need to decide whether someone is worthy of an

0:13:50.679 --> 0:13:54.040
<v Speaker 3>award for this, I think audiences sort of have to

0:13:54.120 --> 0:13:56.240
<v Speaker 3>accept it because it's being done, and it's been done

0:13:56.280 --> 0:13:57.960
<v Speaker 3>for a long time. And I think that if you

0:13:58.040 --> 0:14:01.679
<v Speaker 3>start like having purity tests about this sort of thing,

0:14:01.679 --> 0:14:03.800
<v Speaker 3>I think it's going to be pretty difficult to know

0:14:03.880 --> 0:14:06.040
<v Speaker 3>which movies to see and which are not to see,

0:14:06.080 --> 0:14:08.840
<v Speaker 3>because ye, honestly, the only reason we know that this

0:14:09.040 --> 0:14:11.400
<v Speaker 3>was used at all was because the editor talked about

0:14:11.400 --> 0:14:12.320
<v Speaker 3>it to the media.

0:14:12.800 --> 0:14:15.280
<v Speaker 1>Yeah. And also, I mean, to be fair to Adrian Brody,

0:14:15.360 --> 0:14:18.439
<v Speaker 1>I doubt that many Academy members would have voted against

0:14:18.520 --> 0:14:21.600
<v Speaker 1>him on the basis to his owncungarian accident wasn't quite perfect,

0:14:21.720 --> 0:14:23.960
<v Speaker 1>So I'm not sure that this was like the key

0:14:24.000 --> 0:14:27.320
<v Speaker 1>input to his victory. But what you said about like

0:14:27.880 --> 0:14:30.600
<v Speaker 1>the role of post production and what that means visa

0:14:30.720 --> 0:14:33.600
<v Speaker 1>v like the original product made me think about this

0:14:33.760 --> 0:14:36.960
<v Speaker 1>AI generated film festival that you went to. So, first

0:14:37.000 --> 0:14:40.160
<v Speaker 1>of all, what made this an AI generated film festival?

0:14:40.240 --> 0:14:42.760
<v Speaker 1>How much of the films were AI generated?

0:14:43.440 --> 0:14:46.520
<v Speaker 3>Yeah, so it varied for each movie, but I think

0:14:46.560 --> 0:14:49.320
<v Speaker 3>that if you walked in off the street, you would say, oh,

0:14:49.400 --> 0:14:51.720
<v Speaker 3>these films were made with AI And what I mean

0:14:51.760 --> 0:14:56.480
<v Speaker 3>by that is each movie had visuals that were clearly

0:14:56.520 --> 0:14:59.680
<v Speaker 3>AI generated, like a lot of the backgrounds were constantly

0:14:59.720 --> 0:15:02.920
<v Speaker 3>changed in a way that if you were using a camera,

0:15:03.560 --> 0:15:07.800
<v Speaker 3>they wouldn't happen. A lot of people had like faces

0:15:07.840 --> 0:15:10.600
<v Speaker 3>that were morphing from scene to scene. One thing I

0:15:10.640 --> 0:15:14.280
<v Speaker 3>will say though, is that TCL was very clear that

0:15:14.320 --> 0:15:17.040
<v Speaker 3>all of the scripts were written by humans, and all

0:15:17.160 --> 0:15:19.320
<v Speaker 3>the voices were done by humans, and all of the

0:15:19.440 --> 0:15:24.000
<v Speaker 3>music was done by humans. The artificial intelligence was limited

0:15:24.040 --> 0:15:26.440
<v Speaker 3>to the visuals in different movies.

0:15:27.080 --> 0:15:28.640
<v Speaker 1>Can you just take me back to kind of how

0:15:28.640 --> 0:15:31.520
<v Speaker 1>you got invited and what questions you had going in?

0:15:32.200 --> 0:15:35.920
<v Speaker 3>Yeah, So I went to the Chinese Theater in Hollywood,

0:15:36.320 --> 0:15:39.520
<v Speaker 3>which is ironically where the oscars are. It's like the

0:15:39.560 --> 0:15:44.040
<v Speaker 3>same complex. And that theater is owned by TCL, which

0:15:44.160 --> 0:15:48.760
<v Speaker 3>is a Chinese TV manufacturer, and like a lot of

0:15:48.800 --> 0:15:52.040
<v Speaker 3>other TV manufacturers at this point, they have their own

0:15:52.200 --> 0:15:55.920
<v Speaker 3>free streaming TV service if you buy a TCL TV,

0:15:56.360 --> 0:16:00.000
<v Speaker 3>And TCL is the first company to put fully AI

0:16:00.160 --> 0:16:03.600
<v Speaker 3>generated movies on its streaming service. And so this was

0:16:03.640 --> 0:16:08.920
<v Speaker 3>a premiere of five films that were created using generative AI.

0:16:09.120 --> 0:16:12.200
<v Speaker 3>And so I had been writing basically about this technology

0:16:12.200 --> 0:16:14.400
<v Speaker 3>for a while and they invited me to come watch them.

0:16:14.720 --> 0:16:16.680
<v Speaker 1>So, despite the fact that you'll perhaps more on the

0:16:16.680 --> 0:16:19.520
<v Speaker 1>skeptical side, they welcome you into the film festival.

0:16:19.720 --> 0:16:22.320
<v Speaker 3>I was pretty shocked that they invited me, because honestly,

0:16:22.360 --> 0:16:25.360
<v Speaker 3>I had written about a trailer that they released for

0:16:25.440 --> 0:16:28.160
<v Speaker 3>an AI generated film and I kind of dunked on it.

0:16:28.200 --> 0:16:31.640
<v Speaker 3>I said, it was really terrible. It's called Last Train Paris,

0:16:31.680 --> 0:16:34.800
<v Speaker 3>and it was like an AI generated rom com. And

0:16:35.320 --> 0:16:38.320
<v Speaker 3>in the YouTube video, it's like the lip syncing of

0:16:38.360 --> 0:16:42.000
<v Speaker 3>the audio and the lips is like really bad. The

0:16:42.120 --> 0:16:46.960
<v Speaker 3>characters move incredibly robotically, and it has this very dreamlike

0:16:47.400 --> 0:16:52.200
<v Speaker 3>quality to it that is very common with AI generated visuals,

0:16:52.280 --> 0:16:54.640
<v Speaker 3>where it's not like a cool effect. It's like, wow,

0:16:54.680 --> 0:16:57.760
<v Speaker 3>this is really distracting because the background is constantly swirling

0:16:57.800 --> 0:17:00.400
<v Speaker 3>and changing and things are popping in and out. And

0:17:00.480 --> 0:17:03.680
<v Speaker 3>after I wrote that article, they still decided to invite me,

0:17:03.760 --> 0:17:05.760
<v Speaker 3>So I thought that was brave of them.

0:17:05.920 --> 0:17:07.159
<v Speaker 1>But what did you think, I mean, what were you

0:17:07.200 --> 0:17:08.400
<v Speaker 1>kind of expecting going into it?

0:17:08.880 --> 0:17:11.440
<v Speaker 3>Going in? I thought that they would be pretty bad,

0:17:11.560 --> 0:17:14.600
<v Speaker 3>to be totally honest with you, just because the state

0:17:14.640 --> 0:17:18.200
<v Speaker 3>of the art at the time. This was back in December,

0:17:18.520 --> 0:17:22.840
<v Speaker 3>which it was only three months ago, but at the time,

0:17:23.040 --> 0:17:28.920
<v Speaker 3>AI video generators were pretty bad, and I didn't think

0:17:28.960 --> 0:17:32.600
<v Speaker 3>that TCL had access to some proprietary system that we

0:17:32.680 --> 0:17:35.119
<v Speaker 3>hadn't seen before. I figured that they would be using

0:17:35.560 --> 0:17:38.040
<v Speaker 3>the state of the art that you can find on

0:17:38.119 --> 0:17:41.600
<v Speaker 3>the internet, and I think that those tools are not

0:17:41.800 --> 0:17:43.680
<v Speaker 3>very good, and so I thought that they would be bad,

0:17:43.800 --> 0:17:46.280
<v Speaker 3>to be totally honest with you, and they were bad.

0:17:48.640 --> 0:17:51.120
<v Speaker 1>Can you describe some of the highlights on the low Lights?

0:17:51.400 --> 0:17:54.760
<v Speaker 3>Yeah? I thought that the films themselves were just they

0:17:54.760 --> 0:17:57.199
<v Speaker 3>felt pretty rushed. So one of them was called The

0:17:57.240 --> 0:18:00.600
<v Speaker 3>Slug and it's about a woman who turns into a slug.

0:18:00.760 --> 0:18:02.719
<v Speaker 3>She has a disease that turns her into a slug

0:18:02.760 --> 0:18:05.600
<v Speaker 3>and it feels like The Substance, which is another you know,

0:18:05.640 --> 0:18:10.160
<v Speaker 3>Oscar nominated film. The visuals on it are wild. Things

0:18:10.200 --> 0:18:12.800
<v Speaker 3>are just like constantly changing. Her face is changing, the

0:18:13.080 --> 0:18:15.600
<v Speaker 3>you know, the food is changing. There's a lot of

0:18:15.640 --> 0:18:19.840
<v Speaker 3>like weird screams that happen that are not super well

0:18:19.880 --> 0:18:23.439
<v Speaker 3>timed with the dialogue. And then also there's like a

0:18:23.480 --> 0:18:26.240
<v Speaker 3>scene where the woman takes a bath and there's like

0:18:26.240 --> 0:18:29.040
<v Speaker 3>a close up on some bath salts and like the

0:18:29.160 --> 0:18:33.639
<v Speaker 3>text on that label is like an alien language because

0:18:33.680 --> 0:18:37.800
<v Speaker 3>AI has like a really bad time generating text, and

0:18:38.359 --> 0:18:39.800
<v Speaker 3>I guess you can take it with a grain of

0:18:39.840 --> 0:18:42.560
<v Speaker 3>salt or say like, hey, this is early technology. But

0:18:42.840 --> 0:18:45.200
<v Speaker 3>when you're watching something as a viewer in a movie

0:18:45.280 --> 0:18:48.439
<v Speaker 3>theater on this giant screen and the text is completely

0:18:48.800 --> 0:18:51.360
<v Speaker 3>not even in English, it's like, wow, it really takes

0:18:51.359 --> 0:18:52.360
<v Speaker 3>you out of the narrative.

0:18:52.520 --> 0:18:54.800
<v Speaker 1>I would say, I mean it's a weird idea, right,

0:18:54.880 --> 0:18:57.040
<v Speaker 1>because I mean you mentioned this is for TCL, the

0:18:57.119 --> 0:19:01.040
<v Speaker 1>Chinese TV manufacturer, and the assumption be like, they don't

0:19:01.080 --> 0:19:02.639
<v Speaker 1>want you to change the channel, right, they want you

0:19:02.680 --> 0:19:04.800
<v Speaker 1>to have their own channel on kind of in the

0:19:04.840 --> 0:19:07.240
<v Speaker 1>background so that you know your attention is with them

0:19:07.280 --> 0:19:08.800
<v Speaker 1>and they can sell you ads whatever it may be.

0:19:09.320 --> 0:19:11.800
<v Speaker 1>But that's very different to like putting hundreds of people

0:19:11.960 --> 0:19:14.359
<v Speaker 1>in a movie theater and kind of fulcing them to

0:19:14.400 --> 0:19:16.160
<v Speaker 1>watch with full attention, right, yeah.

0:19:16.280 --> 0:19:16.520
<v Speaker 2>Yeah.

0:19:16.520 --> 0:19:21.399
<v Speaker 3>And it's very interesting because before the movies played, two

0:19:21.440 --> 0:19:25.119
<v Speaker 3>TCL executives addressed the audience, and it was very interesting

0:19:25.280 --> 0:19:27.520
<v Speaker 3>the difference between what they were saying and what the

0:19:27.560 --> 0:19:31.600
<v Speaker 3>filmmakers were saying, because the TCL executives were business people

0:19:32.000 --> 0:19:35.639
<v Speaker 3>and they were saying our research shows that almost no

0:19:35.680 --> 0:19:38.800
<v Speaker 3>one changes the channel once they're watching something like this,

0:19:38.960 --> 0:19:42.000
<v Speaker 3>like they are watching it in the background usually, and

0:19:42.119 --> 0:19:44.520
<v Speaker 3>so their hope is that you're just going to be

0:19:44.520 --> 0:19:46.199
<v Speaker 3>too lazy to change the channel.

0:19:46.280 --> 0:19:48.800
<v Speaker 1>So inspiring creative brief.

0:19:49.440 --> 0:19:51.920
<v Speaker 3>Right, right, And then the other executives said, like, we're

0:19:51.920 --> 0:19:55.240
<v Speaker 3>going to use this as part of our targeted advertising strategy,

0:19:56.440 --> 0:20:00.280
<v Speaker 3>which was pretty dystopian. And then the actual filmmakers came

0:20:00.280 --> 0:20:02.040
<v Speaker 3>on and said, you know, we put our heart and

0:20:02.080 --> 0:20:03.960
<v Speaker 3>soul into this, and we think this is the future

0:20:03.960 --> 0:20:06.680
<v Speaker 3>of the industry. So that was kind of like a

0:20:06.680 --> 0:20:09.200
<v Speaker 3>whiplash situation for me in the audience.

0:20:12.240 --> 0:20:14.720
<v Speaker 1>When we come back, more from Jason Kebler about the

0:20:14.800 --> 0:20:19.240
<v Speaker 1>rapid advances in generative AI video technology and how the

0:20:19.280 --> 0:20:22.440
<v Speaker 1>state of the art is evolving in real time, stay

0:20:22.440 --> 0:20:37.320
<v Speaker 1>with us. Welcome back to our conversation with Jason Kebler

0:20:37.359 --> 0:20:40.639
<v Speaker 1>from four or four Media, where we continue our conversation

0:20:40.920 --> 0:20:45.359
<v Speaker 1>about a recent AI film festival he attended. There was

0:20:45.400 --> 0:20:47.240
<v Speaker 1>one film though, which I think was like a kind

0:20:47.240 --> 0:20:51.600
<v Speaker 1>of blended documentary and AI film that you thought was

0:20:51.640 --> 0:20:52.960
<v Speaker 1>potentially a bit more interesting.

0:20:53.560 --> 0:20:56.560
<v Speaker 3>Yeah, I thought it was pretty cool. I mean, it

0:20:56.640 --> 0:20:58.640
<v Speaker 3>still had a lot of problems, but It was called

0:20:58.640 --> 0:21:02.040
<v Speaker 3>The Best Day of My Life, and it was mountaineering

0:21:02.119 --> 0:21:07.840
<v Speaker 3>documentary where a mountaineer who got trapped in an avalanche

0:21:08.520 --> 0:21:11.600
<v Speaker 3>is talking directly to the camera, like the actual person

0:21:11.640 --> 0:21:14.119
<v Speaker 3>is talking directly to the camera recounting his story, and

0:21:14.280 --> 0:21:18.680
<v Speaker 3>as he is telling his story, they flashed to generative

0:21:18.720 --> 0:21:22.760
<v Speaker 3>AI depictions of what he is saying, And so I

0:21:22.840 --> 0:21:24.920
<v Speaker 3>thought that was kind of interesting because this is something

0:21:24.960 --> 0:21:27.040
<v Speaker 3>that happened to the guy. He obviously didn't bring a

0:21:27.080 --> 0:21:30.639
<v Speaker 3>camera with him at the time, and you were able

0:21:30.640 --> 0:21:34.600
<v Speaker 3>to sort of like see what he was describing.

0:21:34.440 --> 0:21:37.960
<v Speaker 1>In a way that was actually viscerally compelling, or in

0:21:37.960 --> 0:21:40.560
<v Speaker 1>a way that's still felt a bit uncanny and jarring.

0:21:40.640 --> 0:21:42.800
<v Speaker 3>In a way that made me think that maybe this

0:21:42.880 --> 0:21:45.920
<v Speaker 3>has potential in the future, but this isn't quite there yet,

0:21:45.960 --> 0:21:50.719
<v Speaker 3>because it similarly like the there's various scenes in the film,

0:21:51.359 --> 0:21:55.600
<v Speaker 3>and the guy who's happening to changes in each scene.

0:21:55.640 --> 0:21:59.719
<v Speaker 3>It's like his face looks different in different scenes. He

0:21:59.800 --> 0:22:02.800
<v Speaker 3>was under snow because it was an avalanche, and then

0:22:02.840 --> 0:22:05.080
<v Speaker 3>in the next scene all of the snow had turned

0:22:05.119 --> 0:22:07.639
<v Speaker 3>to mud, and then it turned back to snow, and

0:22:08.160 --> 0:22:12.480
<v Speaker 3>it was like, similarly took you out of the narrative,

0:22:12.680 --> 0:22:16.879
<v Speaker 3>but I thought that the idea behind it was pretty

0:22:16.880 --> 0:22:20.239
<v Speaker 3>interesting and I could see that being a direction that

0:22:20.920 --> 0:22:22.640
<v Speaker 3>future documentaries go.

0:22:23.280 --> 0:22:25.280
<v Speaker 1>And was what was the feeling like in the room?

0:22:25.320 --> 0:22:27.080
<v Speaker 1>I mean, who else was in the audience? What was

0:22:27.119 --> 0:22:30.040
<v Speaker 1>the general takeaway from this experience?

0:22:30.480 --> 0:22:34.280
<v Speaker 3>The mood in the theater was one of incredible optimism

0:22:34.440 --> 0:22:37.399
<v Speaker 3>and excitement. It was a mix of people who had

0:22:37.440 --> 0:22:40.760
<v Speaker 3>worked on these films and people who have like a

0:22:40.800 --> 0:22:44.000
<v Speaker 3>lot of money invested in the idea that this is

0:22:44.040 --> 0:22:46.640
<v Speaker 3>going to be the next big thing in Hollywood. And

0:22:46.720 --> 0:22:50.760
<v Speaker 3>so the mood in the theater was one of incredible

0:22:50.800 --> 0:22:55.800
<v Speaker 3>optimism and excitement. Meanwhile, the films like Objectively are not good.

0:22:55.840 --> 0:22:58.439
<v Speaker 3>They're really They're all on YouTube now and if you

0:22:58.480 --> 0:23:01.840
<v Speaker 3>go watch them, like the comments brutal, there's not a

0:23:01.840 --> 0:23:04.439
<v Speaker 3>lot of views on them. I think on some of them,

0:23:04.480 --> 0:23:06.720
<v Speaker 3>the comments that you even been turned off because people

0:23:06.760 --> 0:23:09.719
<v Speaker 3>are like, how could you dare put this on my television.

0:23:10.520 --> 0:23:13.440
<v Speaker 3>So I did think it was interesting because it reminded

0:23:13.440 --> 0:23:16.639
<v Speaker 3>me of things that I had been to in the past,

0:23:16.760 --> 0:23:20.639
<v Speaker 3>for like virtual reality or for cryptocurrency, things like that,

0:23:21.160 --> 0:23:24.480
<v Speaker 3>And a lot of people have said like generative AI

0:23:24.640 --> 0:23:26.879
<v Speaker 3>is the new crypto, it's the new metaverse, it's the

0:23:26.960 --> 0:23:31.480
<v Speaker 3>new virtual reality. And I think that AI there's like

0:23:31.520 --> 0:23:35.679
<v Speaker 3>a lot of snake oil out there, but undeniably companies

0:23:35.680 --> 0:23:37.679
<v Speaker 3>are leaning into it in a way that's going to

0:23:37.760 --> 0:23:41.720
<v Speaker 3>affect us and affect workers and affect people in the industry.

0:23:42.200 --> 0:23:47.000
<v Speaker 1>It's also interesting where companies fall in terms of how

0:23:47.600 --> 0:23:50.600
<v Speaker 1>vocal they want to be about how they see the

0:23:50.640 --> 0:23:54.639
<v Speaker 1>AI future unfolding. Right, Like, obviously for Chinese TV manufacturer,

0:23:55.040 --> 0:23:58.560
<v Speaker 1>alienating Hollywood doesn't really matter that much, right, whereas like

0:23:58.720 --> 0:24:01.320
<v Speaker 1>full Hollywood studios had to behave very differently.

0:24:01.920 --> 0:24:04.280
<v Speaker 3>Yeah, it's super interesting, and that's a great point because,

0:24:04.320 --> 0:24:07.800
<v Speaker 3>as you said, like the Writer's Guild strike was partially

0:24:07.880 --> 0:24:11.520
<v Speaker 3>about generative AI in the writer's rooms, a lot of

0:24:11.600 --> 0:24:15.680
<v Speaker 3>voice actors, going back to Respeecher, voice actors in both

0:24:15.680 --> 0:24:20.280
<v Speaker 3>the video game world and the animation world are really

0:24:20.320 --> 0:24:24.160
<v Speaker 3>worried that AI voices are going to replace their jobs

0:24:24.280 --> 0:24:27.240
<v Speaker 3>or that they're going to get less work because AI

0:24:27.359 --> 0:24:30.280
<v Speaker 3>is going to be used to generate voices for animation

0:24:30.400 --> 0:24:33.560
<v Speaker 3>and video games. And then, of course, like you said,

0:24:33.600 --> 0:24:37.320
<v Speaker 3>a lot of companies are laying off their workers in

0:24:37.359 --> 0:24:40.080
<v Speaker 3>a bunch of industries and then realizing, oh wait, the

0:24:40.080 --> 0:24:42.320
<v Speaker 3>AI is not good enough to do these jobs yet.

0:24:42.359 --> 0:24:45.200
<v Speaker 3>And so there's a real tension about it because fundamentally,

0:24:45.240 --> 0:24:49.520
<v Speaker 3>this is an automation technology. It's designed to replace human

0:24:49.600 --> 0:24:53.800
<v Speaker 3>labor or do things that sometimes humans can't do. And

0:24:54.040 --> 0:24:56.760
<v Speaker 3>I do think that a lot of companies are going

0:24:56.840 --> 0:25:00.920
<v Speaker 3>to be able to differentiate themselves by saying we do

0:25:01.000 --> 0:25:03.720
<v Speaker 3>not use AI, we respect human artists, we don't want

0:25:03.760 --> 0:25:05.959
<v Speaker 3>to do that. And then some companies are going their

0:25:06.000 --> 0:25:09.960
<v Speaker 3>total opposite way, like TCL, which has very little original programming,

0:25:10.080 --> 0:25:13.400
<v Speaker 3>very little relationships in Hollywood. They don't care if they

0:25:13.400 --> 0:25:16.080
<v Speaker 3>piss off directors and actors and things like that because

0:25:16.359 --> 0:25:18.240
<v Speaker 3>they're just trying to make a name for themselves, so

0:25:18.280 --> 0:25:20.360
<v Speaker 3>they're able to be more aggressive about this.

0:25:21.080 --> 0:25:22.919
<v Speaker 1>So I guess, on the one hand, you have like

0:25:23.080 --> 0:25:28.080
<v Speaker 1>TCL and more or less fully AI generated films. On

0:25:28.119 --> 0:25:30.159
<v Speaker 1>the other hand, you have the brutalist where you know

0:25:30.200 --> 0:25:33.040
<v Speaker 1>at the margins AI was used and respeech, she was

0:25:33.119 --> 0:25:35.959
<v Speaker 1>used to do some accent correction. Do you see like

0:25:36.080 --> 0:25:38.800
<v Speaker 1>ultimately a convergence between those two things, or do you

0:25:38.840 --> 0:25:42.080
<v Speaker 1>think it will remain that like AI is either used

0:25:42.119 --> 0:25:45.800
<v Speaker 1>in like premium productions for optimizing posts, shall we say.

0:25:45.880 --> 0:25:47.880
<v Speaker 1>And on the other hand, you have like this kind

0:25:47.880 --> 0:25:50.800
<v Speaker 1>of wild west of full AI generation, which is a

0:25:50.840 --> 0:25:53.040
<v Speaker 1>long way off from being consumable.

0:25:53.359 --> 0:25:55.520
<v Speaker 3>Yeah, I mean, I do think it's a spectrum and

0:25:56.600 --> 0:26:00.639
<v Speaker 3>slippery slope, if you will. And Special Effects have in

0:26:00.720 --> 0:26:03.680
<v Speaker 3>general been incorporating a lot more AI over the last

0:26:03.760 --> 0:26:07.000
<v Speaker 3>few years. I think one that was really interesting to

0:26:07.040 --> 0:26:11.280
<v Speaker 3>me was when the first deep fakes were sort of invented,

0:26:11.359 --> 0:26:13.640
<v Speaker 3>maybe like five or six years ago, where you can

0:26:13.720 --> 0:26:17.879
<v Speaker 3>like replace someone's face with another face. Star Wars had

0:26:18.240 --> 0:26:21.879
<v Speaker 3>tried to generate like Carrie Fisher after she had died

0:26:21.920 --> 0:26:25.119
<v Speaker 3>for one of the Star Wars films, and apparently they

0:26:25.160 --> 0:26:28.240
<v Speaker 3>spent like millions of dollars doing this. And then someone

0:26:28.359 --> 0:26:32.359
<v Speaker 3>on Reddit using deep fake technology was able to do

0:26:32.440 --> 0:26:36.359
<v Speaker 3>something that was almost indistinguishable from what Lucasfilms had done,

0:26:36.800 --> 0:26:39.679
<v Speaker 3>like on their computer at home, for free. And so

0:26:40.119 --> 0:26:42.560
<v Speaker 3>I do think that we're going to see a lot

0:26:42.600 --> 0:26:45.080
<v Speaker 3>more of this stuff in films, but you may not

0:26:45.160 --> 0:26:50.760
<v Speaker 3>even notice that's happening when they start replacing artists, replacing musicians,

0:26:50.840 --> 0:26:55.440
<v Speaker 3>replacing actors with AI. I think that's I personally think

0:26:55.480 --> 0:26:57.639
<v Speaker 3>that's a problem, and I think that that's when you

0:26:57.720 --> 0:27:00.480
<v Speaker 3>end up with a lesser product. Yeah, I don't know.

0:27:00.560 --> 0:27:02.480
<v Speaker 3>I hope that AI is going to be used to

0:27:02.520 --> 0:27:08.400
<v Speaker 3>make films better, not to create tons of low budget,

0:27:08.560 --> 0:27:11.960
<v Speaker 3>poorly made films that are designed to scratch a specific

0:27:12.040 --> 0:27:15.360
<v Speaker 3>itch or perform an algorithm, which we're definitely gonna see

0:27:15.400 --> 0:27:15.600
<v Speaker 3>a lot.

0:27:15.520 --> 0:27:19.679
<v Speaker 1>Of itist you're a humanist at HUT, Yeah, yeah, And

0:27:19.760 --> 0:27:22.600
<v Speaker 1>I mean you mentioned that this film festival was a

0:27:22.600 --> 0:27:25.119
<v Speaker 1>couple of months ago. Has the state of the art

0:27:25.240 --> 0:27:28.159
<v Speaker 1>change since then? I was playing around with this Google

0:27:28.200 --> 0:27:31.560
<v Speaker 1>deep Mind product called vo two. At least on like

0:27:31.560 --> 0:27:34.320
<v Speaker 1>a scene by scene basis, you can make pretty good

0:27:34.359 --> 0:27:37.560
<v Speaker 1>photo realistic depictions, but then like a couple of seconds each.

0:27:37.600 --> 0:27:39.399
<v Speaker 1>I don't think they've figured out that any means how

0:27:39.440 --> 0:27:42.480
<v Speaker 1>to stitch them together or make continuity. But how is

0:27:42.520 --> 0:27:43.760
<v Speaker 1>the state of the art devolving?

0:27:44.000 --> 0:27:46.639
<v Speaker 3>It's changed a lot in the last three months. There's

0:27:46.680 --> 0:27:50.160
<v Speaker 3>been a lot of Chinese companies that have released video

0:27:50.280 --> 0:27:53.080
<v Speaker 3>models in the last just a couple of weeks, like

0:27:53.200 --> 0:27:56.639
<v Speaker 3>ten Cent, which is a massive Chinese company, released a

0:27:56.680 --> 0:28:00.280
<v Speaker 3>new video model that seems to be better than most

0:28:00.560 --> 0:28:03.760
<v Speaker 3>publicly released video models. You know, it was sort of

0:28:03.800 --> 0:28:07.840
<v Speaker 3>immediately used by people to create non consensual pornography, which

0:28:07.880 --> 0:28:10.800
<v Speaker 3>is quite upsetting and is what a lot of people

0:28:10.800 --> 0:28:13.640
<v Speaker 3>are using these tools for on the internet. But basically

0:28:13.680 --> 0:28:16.040
<v Speaker 3>it's like every week there's a new model and they're

0:28:16.080 --> 0:28:19.000
<v Speaker 3>they're constantly leapfrogging each other. So you know, one will

0:28:19.040 --> 0:28:21.560
<v Speaker 3>be able to generate hands better than another, one will

0:28:21.560 --> 0:28:24.840
<v Speaker 3>be able to generate faces better than another, one will

0:28:24.880 --> 0:28:28.040
<v Speaker 3>have like better movement when you try to make these

0:28:28.080 --> 0:28:32.320
<v Speaker 3>people move, or they require less training data, meaning you

0:28:32.359 --> 0:28:36.560
<v Speaker 3>can make videos based on one input image versus having

0:28:36.600 --> 0:28:39.600
<v Speaker 3>to feed hours of footage into a model to create

0:28:39.640 --> 0:28:42.080
<v Speaker 3>something else. And so you know, these are things that

0:28:42.120 --> 0:28:45.160
<v Speaker 3>like AI nerds spend a lot of time caring about,

0:28:45.640 --> 0:28:49.240
<v Speaker 3>and I would say that there is a big generational

0:28:49.280 --> 0:28:52.800
<v Speaker 3>difference between them. But as like a consumer of these things,

0:28:52.800 --> 0:28:55.160
<v Speaker 3>you might not know that this is happening behind the scenes.

0:28:55.200 --> 0:28:58.959
<v Speaker 3>But the short version is basically it's getting easier to

0:28:59.080 --> 0:29:02.360
<v Speaker 3>make a generated video, it's getting cheaper to do it,

0:29:02.440 --> 0:29:05.920
<v Speaker 3>and the quality is getting better and it's changing on

0:29:05.960 --> 0:29:14.640
<v Speaker 3>like a day to day basis. At this point, Jason,

0:29:14.640 --> 0:29:16.600
<v Speaker 3>thank you so much. Thank you so much for having me.

0:29:20.360 --> 0:29:23.080
<v Speaker 1>That's it for this week. For tech Stuff, I'm oz Voloshin.

0:29:23.560 --> 0:29:26.920
<v Speaker 1>This episode was produced by Eliza Dennis and Victoria Dominguez.

0:29:27.160 --> 0:29:30.040
<v Speaker 1>It was executive produced by me Carrot Price and Kate

0:29:30.080 --> 0:29:34.360
<v Speaker 1>Osborne for Kaleidoscope and Katrina Norvell for iHeart Podcasts. The

0:29:34.480 --> 0:29:38.080
<v Speaker 1>Heath Fraser is our engineer. Kyle Murdoch mixed this episode

0:29:38.160 --> 0:29:40.760
<v Speaker 1>and he also wrote our theme song. Join us next

0:29:40.800 --> 0:29:43.440
<v Speaker 1>Wednesday for tech stuff The Story, when we'll share an

0:29:43.480 --> 0:29:47.560
<v Speaker 1>in depth conversation with the neuroscientist David Eagleman about people

0:29:47.680 --> 0:29:52.720
<v Speaker 1>who develop romantic relationships with AI. Please rate, review, and

0:29:52.800 --> 0:29:55.360
<v Speaker 1>reach out to us at tech Stuff podcast at gmail

0:29:55.360 --> 0:29:55.760
<v Speaker 1>dot com.

0:30:00.040 --> 0:30:00.200
<v Speaker 2>Eight