WEBVTT - Hallucinating with AI

0:00:04.440 --> 0:00:12.399
<v Speaker 1>Welcome to tech Stuff, a production from iHeartRadio. Hey there,

0:00:12.440 --> 0:00:16.240
<v Speaker 1>and welcome to tech Stuff. I'm your host, Jonathan Strickland.

0:00:16.280 --> 0:00:19.760
<v Speaker 1>I'm an executive producer with iHeartRadio and how the tech

0:00:19.800 --> 0:00:23.480
<v Speaker 1>are you. At the beginning of this year, that being

0:00:23.920 --> 0:00:26.360
<v Speaker 1>twenty twenty three, I said like it felt like it

0:00:26.400 --> 0:00:28.520
<v Speaker 1>was going to be the year of AI, and so

0:00:28.600 --> 0:00:31.319
<v Speaker 1>far I think I'm pretty much on the money. But

0:00:31.800 --> 0:00:35.680
<v Speaker 1>more specifically, twenty twenty three has been the year of

0:00:36.200 --> 0:00:43.320
<v Speaker 1>generative AI. That is artificial intelligence that creates or generates something,

0:00:43.600 --> 0:00:47.199
<v Speaker 1>whether it's an image, a sound, or as we're going

0:00:47.280 --> 0:00:51.680
<v Speaker 1>to talk about today, text in response to some sort

0:00:51.840 --> 0:00:55.480
<v Speaker 1>of input. Now, before we go any further, this is

0:00:55.480 --> 0:00:57.800
<v Speaker 1>where we need to remind ourselves that while this is

0:00:57.920 --> 0:01:03.120
<v Speaker 1>a type of artificial intelligence, it's not all of AI.

0:01:03.760 --> 0:01:09.600
<v Speaker 1>Not every AI application involves generative processes. And while generative

0:01:09.600 --> 0:01:15.479
<v Speaker 1>AI can seem fascinating, exciting, surprising, or creepy, I believe

0:01:15.520 --> 0:01:19.520
<v Speaker 1>that largely stems from how generative AI appears to be

0:01:19.680 --> 0:01:26.080
<v Speaker 1>mimicking humans, and it's not an indication of how sophisticated, advanced,

0:01:26.240 --> 0:01:30.680
<v Speaker 1>or dangerous it really is. It's kind of an uncanny

0:01:31.000 --> 0:01:35.679
<v Speaker 1>Valley thing because it appears to be behaving like a human,

0:01:36.360 --> 0:01:41.720
<v Speaker 1>we start to project things on it that aren't necessarily

0:01:42.400 --> 0:01:45.319
<v Speaker 1>accurate or realistic. I think of it kind of like

0:01:45.520 --> 0:01:48.360
<v Speaker 1>the way we can be with our pets, where we

0:01:48.400 --> 0:01:51.360
<v Speaker 1>will project things on our pets that may not reflect

0:01:51.440 --> 0:01:54.800
<v Speaker 1>what the pet is actually experiencing, but that's how we're

0:01:54.840 --> 0:01:58.040
<v Speaker 1>perceiving it. So the reason I say all of this

0:01:58.160 --> 0:01:59.840
<v Speaker 1>up at the very top of this episode is that

0:01:59.840 --> 0:02:04.760
<v Speaker 1>we're also seeing a lot of people expressing concern about AI,

0:02:05.000 --> 0:02:09.239
<v Speaker 1>which is understandable. You know about how it could potentially

0:02:09.360 --> 0:02:15.959
<v Speaker 1>lead to harm, and these are legitimate and rational concerns. However,

0:02:16.560 --> 0:02:21.160
<v Speaker 1>with the focus on stuff like chat GPT for example,

0:02:21.320 --> 0:02:25.280
<v Speaker 1>or Google Bard, I would argue the concern is far

0:02:25.360 --> 0:02:29.800
<v Speaker 1>too narrowly focused on just one aspect of AI, and

0:02:30.040 --> 0:02:34.200
<v Speaker 1>in my opinion, it's not even the most dangerous implementation

0:02:34.280 --> 0:02:38.280
<v Speaker 1>of AI. I mean, we have cars on the road

0:02:38.720 --> 0:02:44.040
<v Speaker 1>right now that use AI for driver assists and autonomous operations.

0:02:44.520 --> 0:02:47.800
<v Speaker 1>If we're worried about the robots taking us down, maybe

0:02:47.840 --> 0:02:51.639
<v Speaker 1>we shouldn't make them our chauffeurs. But really that's a

0:02:51.680 --> 0:02:55.120
<v Speaker 1>topic for another episode. Today, I wanted to take a

0:02:55.160 --> 0:02:58.280
<v Speaker 1>look at an issue that crops up in AI chat

0:02:58.320 --> 0:03:01.960
<v Speaker 1>bots like open ai or goole Bard and similar products.

0:03:02.560 --> 0:03:05.560
<v Speaker 1>This is one that is concerning because it's an issue

0:03:05.560 --> 0:03:09.800
<v Speaker 1>that leads these tools to create false or misleading information

0:03:10.360 --> 0:03:13.920
<v Speaker 1>while presenting that info in a way that seems authoritative

0:03:14.000 --> 0:03:17.680
<v Speaker 1>and trustworthy. And in the field of AI, the term

0:03:18.000 --> 0:03:22.640
<v Speaker 1>hallucination is used to describe this situation. At least a

0:03:22.680 --> 0:03:25.400
<v Speaker 1>lot of folks will use the word hallucination. As it

0:03:25.440 --> 0:03:29.160
<v Speaker 1>turns out, there's actually some debate in AI circles about

0:03:29.160 --> 0:03:32.799
<v Speaker 1>whether or not that should be the appropriate term. Now

0:03:33.200 --> 0:03:36.960
<v Speaker 1>for we mirror mortals, a hallucination is when we have

0:03:37.000 --> 0:03:42.360
<v Speaker 1>an experience in which we perceive something that isn't reflected

0:03:42.520 --> 0:03:46.480
<v Speaker 1>in reality. Maybe we hear a sound but there was

0:03:46.520 --> 0:03:50.640
<v Speaker 1>actually no sound present. Maybe it was that tree falling

0:03:50.680 --> 0:03:53.120
<v Speaker 1>in the woods and no one was around or something,

0:03:53.720 --> 0:03:57.680
<v Speaker 1>or we see something that's not really there. It can

0:03:57.720 --> 0:04:03.320
<v Speaker 1>be really darn disconcerting, and sometimes it can be absolutely terrifying.

0:04:03.680 --> 0:04:07.760
<v Speaker 1>I'm reminded of how many people who experience sleep paralysis

0:04:07.800 --> 0:04:13.880
<v Speaker 1>often will also have hallucinations accompany this period where they're

0:04:13.960 --> 0:04:17.599
<v Speaker 1>awake but they cannot move, and it's probably because sleep

0:04:17.600 --> 0:04:21.640
<v Speaker 1>paralysis occurs when you're kind of caught between being asleep

0:04:21.800 --> 0:04:25.120
<v Speaker 1>and being awake, so there's still some dream like activity

0:04:25.160 --> 0:04:28.760
<v Speaker 1>going on in your brain that's trying to explain things

0:04:28.839 --> 0:04:31.719
<v Speaker 1>like why you're unable to move. Oh, it's because you

0:04:31.839 --> 0:04:35.960
<v Speaker 1>have this witch perched on your chest and she's pinning

0:04:36.000 --> 0:04:41.200
<v Speaker 1>you to the bed. Tools like chat GPT are not dreaming,

0:04:41.680 --> 0:04:45.279
<v Speaker 1>you know, they're not perceiving anything at all. They have

0:04:46.040 --> 0:04:52.160
<v Speaker 1>no senses to trigger, so they cannot hallucinate in that sense. Instead,

0:04:52.240 --> 0:04:57.440
<v Speaker 1>what they are doing is mistakenly assigning high confidence to

0:04:57.560 --> 0:05:01.200
<v Speaker 1>something that they just plane made up. So they're treating

0:05:01.200 --> 0:05:05.919
<v Speaker 1>it like it's a fact that they're highly confident is accurate,

0:05:06.440 --> 0:05:11.000
<v Speaker 1>when really they just invented it. So it is an

0:05:11.000 --> 0:05:14.640
<v Speaker 1>instance where they're really confident in something that is not

0:05:15.279 --> 0:05:19.520
<v Speaker 1>coming from a reliable source in the AI's actual training data.

0:05:20.040 --> 0:05:22.200
<v Speaker 1>So if we wanted to put that into human terms,

0:05:22.640 --> 0:05:25.040
<v Speaker 1>it'd be kind of like if you made up a

0:05:25.120 --> 0:05:29.040
<v Speaker 1>story to explain something that otherwise would either be really

0:05:29.080 --> 0:05:32.720
<v Speaker 1>boring or maybe really embarrassing. So you make up a lie,

0:05:33.160 --> 0:05:35.080
<v Speaker 1>in other words, to cover up something that you would

0:05:35.160 --> 0:05:38.640
<v Speaker 1>rather not be known, And so you tell this lie

0:05:39.120 --> 0:05:41.400
<v Speaker 1>over and over when people are asking you about this

0:05:41.440 --> 0:05:45.920
<v Speaker 1>particular thing, and you repeat it often enough where gradually

0:05:45.960 --> 0:05:49.560
<v Speaker 1>your brain essentially makes a pathway where this fake version

0:05:49.560 --> 0:05:53.760
<v Speaker 1>of history of what actually happened becomes the real one

0:05:54.279 --> 0:05:56.960
<v Speaker 1>in your head. You begin to believe your own lie,

0:05:57.080 --> 0:05:59.680
<v Speaker 1>and so in future tellings of the story, you don't

0:05:59.680 --> 0:06:02.240
<v Speaker 1>even realize you're lying at all. You're telling what you

0:06:02.360 --> 0:06:05.440
<v Speaker 1>believe to be the real sequence of events, even though

0:06:05.440 --> 0:06:08.800
<v Speaker 1>it's all a fib. That's kind of what's happening with

0:06:08.839 --> 0:06:14.039
<v Speaker 1>AI hallucinations, only it happens all at once, And for

0:06:14.120 --> 0:06:17.440
<v Speaker 1>that reason, some folks prefer to use other terms to

0:06:17.520 --> 0:06:21.640
<v Speaker 1>describe what AI does when it starts to invent things

0:06:21.760 --> 0:06:25.520
<v Speaker 1>in response to a query from a user. So some

0:06:25.600 --> 0:06:30.840
<v Speaker 1>have proposed the word confabulation as an alternative descriptor of

0:06:30.839 --> 0:06:33.640
<v Speaker 1>what's going on. So this is similar to kind of

0:06:33.680 --> 0:06:38.240
<v Speaker 1>the scenario I just gave, because it's in human psychology.

0:06:38.240 --> 0:06:41.760
<v Speaker 1>A confabulation is when we have a hitch in our memory,

0:06:42.160 --> 0:06:44.680
<v Speaker 1>and so we fill in a gap that's in our memory.

0:06:44.680 --> 0:06:47.840
<v Speaker 1>We're not doing it consciously, it just happens, and that

0:06:47.920 --> 0:06:49.720
<v Speaker 1>might mean we fill in the gap that doesn't at

0:06:49.760 --> 0:06:53.400
<v Speaker 1>all reflect what really happened. So this can happen at

0:06:53.440 --> 0:06:56.560
<v Speaker 1>any time. I've seen it happen with people who are

0:06:56.640 --> 0:07:00.360
<v Speaker 1>in like a situation that was totally on a expected

0:07:00.400 --> 0:07:03.600
<v Speaker 1>in high stress. I've seen it in training operations where

0:07:04.120 --> 0:07:07.560
<v Speaker 1>you have a group of people and then someone bursts

0:07:07.640 --> 0:07:11.200
<v Speaker 1>in as if they are a burglar or a thief or something,

0:07:11.560 --> 0:07:14.720
<v Speaker 1>and then they get out, and then those people who

0:07:14.760 --> 0:07:18.840
<v Speaker 1>were just subjected to this very scary situation are asked

0:07:18.880 --> 0:07:22.200
<v Speaker 1>to give details about the thief's appearance, and people start

0:07:22.240 --> 0:07:26.960
<v Speaker 1>to invent things, not purposefully, not with the intent to deceive,

0:07:27.400 --> 0:07:29.440
<v Speaker 1>but because their memory is just trying to fill in

0:07:29.480 --> 0:07:32.760
<v Speaker 1>gaps because their perception didn't really take it all in.

0:07:33.400 --> 0:07:37.440
<v Speaker 1>So confabulation doesn't imply intent, and I think that might

0:07:37.480 --> 0:07:40.320
<v Speaker 1>be why a lot of researchers like the word, because

0:07:40.880 --> 0:07:44.840
<v Speaker 1>it's not the intention of the AI to fool people

0:07:45.320 --> 0:07:49.800
<v Speaker 1>or to pass off fantasy as if it were reality. Instead,

0:07:50.120 --> 0:07:53.320
<v Speaker 1>the AI is making an honest go of trying to

0:07:53.360 --> 0:07:56.240
<v Speaker 1>meet the expectations of the user. So if you ask

0:07:56.320 --> 0:08:01.000
<v Speaker 1>the AI about, say a historical figure, really tries to

0:08:01.000 --> 0:08:04.720
<v Speaker 1>give you a good answer, but occasionally that answer might

0:08:04.760 --> 0:08:08.240
<v Speaker 1>be wrong, not because the AI is drawing from a

0:08:08.360 --> 0:08:12.000
<v Speaker 1>bad data source, but because there's actually a gap in

0:08:12.040 --> 0:08:15.200
<v Speaker 1>its knowledge, and the AI just fills that gap as

0:08:15.240 --> 0:08:19.040
<v Speaker 1>best it can. Unfortunately, the end result is you get

0:08:19.080 --> 0:08:23.200
<v Speaker 1>an answer that seems totally cromulent, like you could just

0:08:23.280 --> 0:08:28.880
<v Speaker 1>imagine reading that answer in a respectable, thoroughly fact check encyclopedia,

0:08:29.360 --> 0:08:33.560
<v Speaker 1>but then it turns out to be garbage. So let's

0:08:33.600 --> 0:08:36.800
<v Speaker 1>talk about how this happens, which will involve an overview

0:08:36.880 --> 0:08:39.920
<v Speaker 1>of how these chatbought AI tools are trained and at

0:08:39.960 --> 0:08:42.800
<v Speaker 1>a very very high level, how they work. So this

0:08:42.880 --> 0:08:48.480
<v Speaker 1>is going to involve some discussion about machine learning and statistics. So,

0:08:48.559 --> 0:08:54.080
<v Speaker 1>first off, how do machines actually learn? I think it's

0:08:54.120 --> 0:08:57.679
<v Speaker 1>pretty easy to understand. How we program machines to do

0:08:58.600 --> 0:09:01.959
<v Speaker 1>some specific task. Right, we create a set of rules

0:09:02.400 --> 0:09:07.200
<v Speaker 1>that this machine follows sequentially, and the machine executes those

0:09:07.320 --> 0:09:10.720
<v Speaker 1>rules as directed, and then we get the result we wanted.

0:09:10.800 --> 0:09:13.920
<v Speaker 1>That is easy to understand. So I'll give an example.

0:09:13.960 --> 0:09:16.439
<v Speaker 1>Let's say we have a robotic arm and you've got

0:09:16.480 --> 0:09:19.200
<v Speaker 1>two tables, and you put a wooden block on table

0:09:19.320 --> 0:09:23.240
<v Speaker 1>number one, and you program the robotic arm to pick

0:09:23.360 --> 0:09:26.320
<v Speaker 1>up this wooden block on table one and move it

0:09:26.360 --> 0:09:29.640
<v Speaker 1>over to table two. Once you program it then it

0:09:29.679 --> 0:09:31.720
<v Speaker 1>should be able to do that task over and over,

0:09:31.880 --> 0:09:34.880
<v Speaker 1>assuming that no one has moved the tables. No one

0:09:34.920 --> 0:09:37.560
<v Speaker 1>has moved the robotic arm, and the wooden block is

0:09:37.640 --> 0:09:41.360
<v Speaker 1>always in the same place and it's always the same size. Right,

0:09:41.400 --> 0:09:43.560
<v Speaker 1>you haven't changed any of the parameters, so it's the

0:09:43.600 --> 0:09:46.400
<v Speaker 1>exact same situation over and over and over again. You've

0:09:46.400 --> 0:09:49.640
<v Speaker 1>created this simple program. It should be no surprise when

0:09:49.640 --> 0:09:52.920
<v Speaker 1>the robotic arm does it successfully. But what if we

0:09:52.960 --> 0:09:55.920
<v Speaker 1>wanted a robotic arm that could learn how to pick

0:09:56.000 --> 0:09:59.720
<v Speaker 1>up different objects from table one and then move them

0:09:59.760 --> 0:10:02.800
<v Speaker 1>to t able to These objects could be different shapes,

0:10:02.800 --> 0:10:05.880
<v Speaker 1>they could be different sizes, they could weigh different amounts,

0:10:06.120 --> 0:10:08.560
<v Speaker 1>They might be made of different stuff. Maybe some of

0:10:08.559 --> 0:10:12.760
<v Speaker 1>them are fairly delicate and the arm would break the

0:10:12.840 --> 0:10:15.680
<v Speaker 1>object if it applied too much pressure. So how would

0:10:15.720 --> 0:10:18.640
<v Speaker 1>we build a robotic arm that could deal with these

0:10:18.679 --> 0:10:23.439
<v Speaker 1>different scenarios, including ones where we put something completely new

0:10:23.520 --> 0:10:26.640
<v Speaker 1>to the robot on the table, something that the robot

0:10:26.679 --> 0:10:31.240
<v Speaker 1>has never encountered before. Well, to do that, we would

0:10:31.320 --> 0:10:36.320
<v Speaker 1>probably pursue a machine learning model in order to teach

0:10:36.440 --> 0:10:41.480
<v Speaker 1>this robot the whole process of picking something up, especially

0:10:41.520 --> 0:10:45.840
<v Speaker 1>something it had not encountered before. So basically, machine learning

0:10:46.080 --> 0:10:49.120
<v Speaker 1>uses sets of algorithms in an effort to get better

0:10:49.600 --> 0:10:54.560
<v Speaker 1>at a given task, and part of learning involves training,

0:10:54.600 --> 0:10:59.080
<v Speaker 1>which really boils down to feeding a machine lots and

0:10:59.160 --> 0:11:02.480
<v Speaker 1>lots and lots of information, like the more information you

0:11:02.520 --> 0:11:07.320
<v Speaker 1>can feed it, the better, and then letting it process

0:11:07.360 --> 0:11:10.520
<v Speaker 1>this information in an effort to get a specific result,

0:11:11.080 --> 0:11:15.280
<v Speaker 1>and then going back and tweaking the model to refine

0:11:15.320 --> 0:11:18.280
<v Speaker 1>it over and over and over and over again to

0:11:18.480 --> 0:11:22.360
<v Speaker 1>get better at it over time. So we'll imagine a

0:11:22.440 --> 0:11:26.480
<v Speaker 1>hypothetical machine learning model that is designed to do something

0:11:26.559 --> 0:11:30.760
<v Speaker 1>relatively simple like recognize if an image has a cat

0:11:31.040 --> 0:11:33.760
<v Speaker 1>in it or not, because this is actually something that

0:11:33.880 --> 0:11:36.760
<v Speaker 1>has been done with machine learning models in the past.

0:11:36.960 --> 0:11:40.960
<v Speaker 1>It's actually a fairly popular approach is does this picture

0:11:40.960 --> 0:11:43.200
<v Speaker 1>have a cat in it? Or does this video have

0:11:43.240 --> 0:11:45.920
<v Speaker 1>a cat in it? That kind of thing. Let's imagine

0:11:46.000 --> 0:11:49.520
<v Speaker 1>that our machine learning model is an actual physical model,

0:11:49.679 --> 0:11:53.319
<v Speaker 1>like it's a giant funnel. So on the wide end

0:11:53.320 --> 0:11:56.360
<v Speaker 1>of the funnel, that's where we just dump tons of

0:11:56.400 --> 0:11:59.120
<v Speaker 1>photographs with some of them have cats in them, some

0:11:59.160 --> 0:12:02.560
<v Speaker 1>of them don't. Now imagine that at the narrow end

0:12:02.600 --> 0:12:05.160
<v Speaker 1>of the funnel. At the bottom of the funnel, we

0:12:05.200 --> 0:12:09.200
<v Speaker 1>actually have two channels. One channel leads into a bucket

0:12:09.440 --> 0:12:13.160
<v Speaker 1>that says no cats here, and the other channel leads

0:12:13.160 --> 0:12:16.920
<v Speaker 1>to a bucket that says, ah, sweet kitty cats. So

0:12:17.520 --> 0:12:23.280
<v Speaker 1>we dump thousands, maybe millions of photographs into the top

0:12:23.320 --> 0:12:27.280
<v Speaker 1>of this funnel, and the funnel starts to sort the pictures.

0:12:27.640 --> 0:12:30.080
<v Speaker 1>We can't see this because it's inside the funnel, but

0:12:30.120 --> 0:12:35.119
<v Speaker 1>there are channels inside that funnel where photos are directed

0:12:35.559 --> 0:12:40.240
<v Speaker 1>either to go more toward the no kittycat side or

0:12:40.280 --> 0:12:44.560
<v Speaker 1>the yes kittykat side, And they go through these channels

0:12:44.640 --> 0:12:47.200
<v Speaker 1>all down the funnel, and at the very end of it,

0:12:47.880 --> 0:12:51.480
<v Speaker 1>they start spitting out these images into the two buckets. Well,

0:12:51.480 --> 0:12:54.520
<v Speaker 1>once it's done, once it has processed all the photos,

0:12:54.760 --> 0:12:56.719
<v Speaker 1>we take the two buckets and we see how our

0:12:56.760 --> 0:12:59.520
<v Speaker 1>model did. And maybe we see that the model caught

0:12:59.679 --> 0:13:02.240
<v Speaker 1>most of the pictures with cats in them, but not

0:13:02.360 --> 0:13:05.280
<v Speaker 1>all of them. Maybe we also see that there are

0:13:05.320 --> 0:13:08.160
<v Speaker 1>some photos that fell into the kitty cat bucket that

0:13:08.280 --> 0:13:12.440
<v Speaker 1>have exactly zero kitty cats in the picture. Something is

0:13:12.480 --> 0:13:15.760
<v Speaker 1>not working inside our model. So at that point we

0:13:15.920 --> 0:13:19.800
<v Speaker 1>open the funnel, we take the top off or whatever

0:13:19.880 --> 0:13:23.280
<v Speaker 1>we have built in a hinged latch or something, and

0:13:23.320 --> 0:13:26.480
<v Speaker 1>we've opened it up. Now essentially inside our funnel, we

0:13:26.520 --> 0:13:29.320
<v Speaker 1>see all those channels, and each channel is meant to

0:13:29.320 --> 0:13:31.280
<v Speaker 1>look for some sort of evidence of a cat, and

0:13:31.320 --> 0:13:34.520
<v Speaker 1>if it finds evidence, it pushes it closer toward the

0:13:34.559 --> 0:13:37.760
<v Speaker 1>pathway of kitty cat, and if it doesn't, it pushes

0:13:37.800 --> 0:13:41.640
<v Speaker 1>it closer to the pathway of no kitty cat. But

0:13:41.720 --> 0:13:44.520
<v Speaker 1>there's tons of these channels. Some of them feed images

0:13:44.640 --> 0:13:48.920
<v Speaker 1>back up through the whole process. Again, it's very complicated

0:13:48.920 --> 0:13:52.400
<v Speaker 1>inside this funnel, and you have to go in there

0:13:52.440 --> 0:13:57.120
<v Speaker 1>and start to tweak little bits of rules in these

0:13:57.720 --> 0:14:01.880
<v Speaker 1>channels to adjust for whatever problem you're encountering at the

0:14:01.960 --> 0:14:05.679
<v Speaker 1>end result when you're done. So, when you're training your model,

0:14:06.080 --> 0:14:11.000
<v Speaker 1>you change the weights of these different decisions that are made.

0:14:11.080 --> 0:14:14.520
<v Speaker 1>Some decisions perhaps have too much emphasis on them. They

0:14:14.720 --> 0:14:18.040
<v Speaker 1>like they're too powerful and they're skewing the results. So

0:14:18.120 --> 0:14:22.760
<v Speaker 1>you reduce the weight of that particular decision point and

0:14:22.800 --> 0:14:25.000
<v Speaker 1>you increase the weight of a different one to try

0:14:25.040 --> 0:14:28.560
<v Speaker 1>and get things right. It's a painstaking process and you

0:14:28.600 --> 0:14:30.800
<v Speaker 1>have to do it over and over again, and these

0:14:30.840 --> 0:14:35.200
<v Speaker 1>exercises repeat and you try to refine your model to

0:14:35.240 --> 0:14:38.440
<v Speaker 1>get it better at deciding whether or not a photograph

0:14:39.320 --> 0:14:41.880
<v Speaker 1>has got a cabinet or does it, and eventually, if

0:14:41.920 --> 0:14:44.880
<v Speaker 1>everything is working well, it gets very very good at

0:14:44.920 --> 0:14:47.920
<v Speaker 1>sorting images. Maybe once in a while, something sneaks through.

0:14:48.160 --> 0:14:50.000
<v Speaker 1>Maybe there's a cloud that kind of looks like a

0:14:50.040 --> 0:14:52.440
<v Speaker 1>kitty cat and it goes into the wrong bucket, or

0:14:52.600 --> 0:14:54.760
<v Speaker 1>maybe there is a kitty cat that goes into the

0:14:54.800 --> 0:14:56.720
<v Speaker 1>no kitty cat bucket, but the kitty cat was kind

0:14:56.720 --> 0:14:59.240
<v Speaker 1>of obscured in the picture and the model just couldn't

0:14:59.240 --> 0:15:04.480
<v Speaker 1>suss it out. But it succeeds more often than not. Okay,

0:15:04.840 --> 0:15:06.880
<v Speaker 1>that's a baseline. When we come back, we'll talk a

0:15:06.880 --> 0:15:09.440
<v Speaker 1>bit more about machine learning and how this plays into

0:15:09.800 --> 0:15:24.200
<v Speaker 1>tools like chat GPT. Okay, I laid out one version

0:15:24.400 --> 0:15:27.040
<v Speaker 1>of machine learning, and I want to stress that's just

0:15:27.320 --> 0:15:30.520
<v Speaker 1>one version of machine learning. It's related to things like

0:15:30.600 --> 0:15:34.920
<v Speaker 1>neural networks, which are designed to kind of mimic the

0:15:34.960 --> 0:15:40.440
<v Speaker 1>way our brains process information and form pathways among neurons

0:15:40.480 --> 0:15:43.920
<v Speaker 1>while we're trying to suss things out. But that's just

0:15:44.080 --> 0:15:46.400
<v Speaker 1>one version of machine learning. I don't mean to say

0:15:46.440 --> 0:15:49.560
<v Speaker 1>that's how it all works. There are actually lots of

0:15:49.720 --> 0:15:53.240
<v Speaker 1>sub fields within machine learning, neural networks being just one

0:15:53.280 --> 0:15:55.960
<v Speaker 1>of them, but there's also subsets of neural networks. One

0:15:56.000 --> 0:16:00.000
<v Speaker 1>of those was would be deep learning, which always makes

0:16:00.040 --> 0:16:03.000
<v Speaker 1>we think of MST three K and deep hurting shout

0:16:03.000 --> 0:16:05.760
<v Speaker 1>outs to any misties out there. Now, as you dive

0:16:05.880 --> 0:16:09.960
<v Speaker 1>down to deep learning, you're really getting into an interesting

0:16:10.040 --> 0:16:13.480
<v Speaker 1>field of AI and machine learning. So deep learning models

0:16:13.720 --> 0:16:17.640
<v Speaker 1>can accept unstructured data. If you're going further up to

0:16:17.800 --> 0:16:22.920
<v Speaker 1>less specialized machine learning models, these have to use heavily

0:16:23.040 --> 0:16:27.119
<v Speaker 1>labeled data sets and heavily structured data and use supervised

0:16:27.160 --> 0:16:30.640
<v Speaker 1>learning in order to improve with time. But when you

0:16:30.680 --> 0:16:34.160
<v Speaker 1>get into deep learning, you're looking at a very focused

0:16:34.160 --> 0:16:38.440
<v Speaker 1>approach to machine learning where you can just feed unstructured

0:16:38.520 --> 0:16:41.960
<v Speaker 1>data that has no labels to it and start to

0:16:42.080 --> 0:16:45.000
<v Speaker 1>use this model to do whatever it is that you

0:16:45.640 --> 0:16:48.320
<v Speaker 1>want it to do. But we're still kind of talking

0:16:48.320 --> 0:16:53.080
<v Speaker 1>about a channeling or funneling situation here. The input goes

0:16:53.120 --> 0:16:56.720
<v Speaker 1>into the model, the model analyzes the input and pushes

0:16:56.760 --> 0:16:59.360
<v Speaker 1>it further one way or another through the system, and

0:16:59.400 --> 0:17:02.520
<v Speaker 1>it comes out the end as output, which could be

0:17:02.560 --> 0:17:05.280
<v Speaker 1>an image search result for kiddy cats in your smartphone's

0:17:05.320 --> 0:17:07.760
<v Speaker 1>photo role, for example, So if you've ever gone into

0:17:08.600 --> 0:17:12.200
<v Speaker 1>a smartphone photo collection and you just typed in a

0:17:13.040 --> 0:17:15.800
<v Speaker 1>general word in search, you know it's not that you

0:17:15.880 --> 0:17:18.040
<v Speaker 1>tagged any of your photos with this. You're just like

0:17:18.160 --> 0:17:20.280
<v Speaker 1>looking for photos in your role that has a cat

0:17:20.359 --> 0:17:24.280
<v Speaker 1>in them, and it returns something like that. Well, that

0:17:24.280 --> 0:17:27.600
<v Speaker 1>can be the result of a machine learning process like

0:17:27.640 --> 0:17:31.000
<v Speaker 1>the one I've just described, because again, the system has

0:17:31.000 --> 0:17:33.600
<v Speaker 1>to figure out which of your photos have cats in them,

0:17:33.840 --> 0:17:36.560
<v Speaker 1>even though you didn't tag any of those photos with cats.

0:17:36.560 --> 0:17:39.880
<v Speaker 1>It doesn't have metadata. It has to analyze the photo itself.

0:17:40.480 --> 0:17:46.040
<v Speaker 1>Now it's time to talk about probabilities. Large language models lms,

0:17:46.520 --> 0:17:51.360
<v Speaker 1>which are what power chat bots like Google Bard and

0:17:51.880 --> 0:17:57.320
<v Speaker 1>Chat GPT. They work in probabilities. And there's one example

0:17:57.359 --> 0:18:01.199
<v Speaker 1>of an AI using probabilistic algorithms to generate responses that

0:18:01.280 --> 0:18:06.760
<v Speaker 1>I really loved reference, and that example is IBM's Watson platform.

0:18:07.440 --> 0:18:09.760
<v Speaker 1>So while the world right now is struggling to figure

0:18:09.760 --> 0:18:13.280
<v Speaker 1>out how to handle chat GPT and Google Bard and such,

0:18:13.720 --> 0:18:16.520
<v Speaker 1>IBM's Watson gave us a glimpse at what we could

0:18:16.560 --> 0:18:20.080
<v Speaker 1>expect all the way back in twenty eleven. That's when

0:18:20.119 --> 0:18:24.240
<v Speaker 1>IBM famously put Watson to the test and some exhibition

0:18:24.400 --> 0:18:29.199
<v Speaker 1>games of the game show Jeopardy against former champions of

0:18:29.280 --> 0:18:33.280
<v Speaker 1>that game show, human champions. So in many ways, this

0:18:33.400 --> 0:18:36.800
<v Speaker 1>was an echo of IBM's Deep Blue going up against

0:18:36.960 --> 0:18:41.879
<v Speaker 1>chess master Gary Kasparov in various games of chess. Putting

0:18:41.960 --> 0:18:46.000
<v Speaker 1>Watson up against humans and Jeopardy was a fantastic publicity stunt,

0:18:46.160 --> 0:18:49.000
<v Speaker 1>and it also was really impressive because the way Jeopardy

0:18:49.040 --> 0:18:53.719
<v Speaker 1>works is players get several categories of trivia that they

0:18:53.720 --> 0:18:57.440
<v Speaker 1>can choose from. Each category has different levels of questions

0:18:57.480 --> 0:19:00.960
<v Speaker 1>that are designated by a dollar amount, So higher the

0:19:01.000 --> 0:19:04.400
<v Speaker 1>dollar amount, the harder the trivia question is. Generally speaking,

0:19:05.520 --> 0:19:09.040
<v Speaker 1>the actual clue that the players get is given in

0:19:09.040 --> 0:19:11.720
<v Speaker 1>the form of an answer, and they have to provide

0:19:12.480 --> 0:19:17.440
<v Speaker 1>a question that relates to that answer. So here's an example.

0:19:17.880 --> 0:19:22.320
<v Speaker 1>The answer revealed in say a hypothetical Jeopardy game that

0:19:22.359 --> 0:19:26.119
<v Speaker 1>has the category podcasts, could be something like he was

0:19:26.240 --> 0:19:29.600
<v Speaker 1>Jonathan Strickland's original co host on the show tech Stuff.

0:19:30.000 --> 0:19:32.399
<v Speaker 1>The correct response would be bipp a bip Who is

0:19:32.480 --> 0:19:36.200
<v Speaker 1>Chris Palette? That would be the correct answer, But Jeopardy

0:19:36.680 --> 0:19:42.040
<v Speaker 1>goes beyond just trivia. Often the answers provided will include

0:19:42.359 --> 0:19:46.760
<v Speaker 1>word play or images or sound cues, and players will

0:19:46.760 --> 0:19:49.879
<v Speaker 1>have to think outside the box. They can't just know

0:19:50.160 --> 0:19:54.800
<v Speaker 1>the answer. Sometimes there's interpretation that has to happen first.

0:19:55.359 --> 0:19:58.399
<v Speaker 1>The clue to the correct response could be a pun,

0:19:58.960 --> 0:20:02.600
<v Speaker 1>it could involve a rhyme to the answer. It's not

0:20:02.760 --> 0:20:07.320
<v Speaker 1>always a straightforward trivia question. In other words, so Watson

0:20:07.320 --> 0:20:10.879
<v Speaker 1>needed to be able to analyze the clue given, to

0:20:11.000 --> 0:20:15.199
<v Speaker 1>break it apart into components to understand what exactly is

0:20:15.240 --> 0:20:17.680
<v Speaker 1>being asked of it. Then it needed to search its

0:20:17.760 --> 0:20:22.600
<v Speaker 1>database for relevant information. So Watson famously was not connected

0:20:22.640 --> 0:20:25.359
<v Speaker 1>to the Internet during these Jeopardy games. Instead, it was

0:20:25.400 --> 0:20:30.040
<v Speaker 1>relying upon a database representing millions of books filled with facts.

0:20:30.680 --> 0:20:37.640
<v Speaker 1>Then it would generate hypothetical responses like a hypothetical answer

0:20:38.200 --> 0:20:42.080
<v Speaker 1>that Watson should give, or rather questions we're talking about jeopardy,

0:20:42.640 --> 0:20:45.200
<v Speaker 1>and it would submit these hypotheses to a second round

0:20:45.240 --> 0:20:48.760
<v Speaker 1>of analysis to look at is there any evidence that

0:20:48.840 --> 0:20:53.840
<v Speaker 1>supports this response as being correct? Kind of measuring like, well,

0:20:54.359 --> 0:20:58.760
<v Speaker 1>here's a possible answer, how likely is this answer to

0:20:58.840 --> 0:21:01.480
<v Speaker 1>be right? And that was all part of the process.

0:21:01.800 --> 0:21:04.280
<v Speaker 1>So it might even produce more than one answer. You

0:21:04.359 --> 0:21:08.480
<v Speaker 1>might have multiple potential answers, and Watson would assign each

0:21:08.520 --> 0:21:12.040
<v Speaker 1>answer a probability kind of a confidence level of how

0:21:12.080 --> 0:21:15.359
<v Speaker 1>it felt that answer measured up against all the other ones. So,

0:21:16.440 --> 0:21:19.439
<v Speaker 1>as an example, answer A might receive a ninety percent

0:21:19.520 --> 0:21:23.119
<v Speaker 1>confidence level, So that's pretty darn confident that's the right answer.

0:21:23.840 --> 0:21:25.879
<v Speaker 1>Maybe you have answer B and you're like, I'm seventy

0:21:25.920 --> 0:21:28.760
<v Speaker 1>eight percent sure that this could be right. An answer

0:21:28.800 --> 0:21:32.040
<v Speaker 1>C is the long shot with thirty three percent confidence.

0:21:32.240 --> 0:21:35.080
<v Speaker 1>These don't add up to one hundred because they're not

0:21:35.280 --> 0:21:38.040
<v Speaker 1>It's not like a zero sum game. It's more like, oh,

0:21:38.040 --> 0:21:39.919
<v Speaker 1>it could be this or it could be that, but

0:21:40.040 --> 0:21:43.040
<v Speaker 1>I feel like this is more likely than that, so

0:21:43.080 --> 0:21:45.399
<v Speaker 1>I'm going to go with this. And Watson also had

0:21:45.400 --> 0:21:49.080
<v Speaker 1>a threshold. If the answer it generated failed to meet

0:21:49.160 --> 0:21:53.159
<v Speaker 1>a certain confidence threshold, Watson would not buzz in to

0:21:53.280 --> 0:21:58.000
<v Speaker 1>try an answer. Otherwise, Watson played pretty aggressively and even

0:21:58.040 --> 0:22:00.919
<v Speaker 1>in some sticky situations with daily dumb where if you

0:22:00.960 --> 0:22:04.160
<v Speaker 1>get a daily double in Jeopardy, you don't buzz in anymore.

0:22:04.560 --> 0:22:06.440
<v Speaker 1>If you are the one who chose the daily double,

0:22:06.520 --> 0:22:10.240
<v Speaker 1>you're playing by yourself and you just have to give

0:22:10.280 --> 0:22:13.760
<v Speaker 1>an answer. So in those situations, Watson got aggressive, and

0:22:13.880 --> 0:22:18.200
<v Speaker 1>it would it would guess with very low confidence thresholds

0:22:18.240 --> 0:22:21.040
<v Speaker 1>for some of these, like at the thirty percent range,

0:22:21.560 --> 0:22:24.199
<v Speaker 1>and occasionally it was right. In fact, more often than

0:22:24.240 --> 0:22:26.440
<v Speaker 1>not it was right until it got to final Jeopardy,

0:22:26.440 --> 0:22:30.160
<v Speaker 1>where at least the first time, things did not go

0:22:30.920 --> 0:22:34.080
<v Speaker 1>totally in Watson's favor. Also, Watson had an interesting betting

0:22:34.160 --> 0:22:37.399
<v Speaker 1>strategy when it came to daily doubles. But I'm getting

0:22:37.440 --> 0:22:40.840
<v Speaker 1>way off track. So that confidence level is really what

0:22:40.920 --> 0:22:44.040
<v Speaker 1>I want to hone in on here. So it was

0:22:44.119 --> 0:22:48.560
<v Speaker 1>expressed in percentages, So zero percent confidence would be like

0:22:48.640 --> 0:22:51.040
<v Speaker 1>I do not know the answer, I do not know

0:22:51.119 --> 0:22:54.000
<v Speaker 1>what goes here. A one hundred percent confidence level would

0:22:54.000 --> 0:22:56.639
<v Speaker 1>be I am absolutely certain this is the right answer.

0:22:57.359 --> 0:22:59.879
<v Speaker 1>And in a way, AI chat bots like chat GP

0:23:00.320 --> 0:23:03.920
<v Speaker 1>and Google Bard are doing the same thing, only their

0:23:04.040 --> 0:23:08.520
<v Speaker 1>confidence isn't about this is the answer to your question.

0:23:08.680 --> 0:23:11.800
<v Speaker 1>I'm one hundred percent certain that this answers your question.

0:23:12.440 --> 0:23:16.080
<v Speaker 1>It's more like it's more granular than that, because it's

0:23:16.080 --> 0:23:18.720
<v Speaker 1>more at the sentence level. It's like, I think this

0:23:18.880 --> 0:23:22.679
<v Speaker 1>word is the word that needs to go next to

0:23:22.760 --> 0:23:25.800
<v Speaker 1>create the sentence that I'm building. So let's talk about

0:23:25.800 --> 0:23:28.320
<v Speaker 1>how these models do create sentences, and I'm not going

0:23:28.400 --> 0:23:31.760
<v Speaker 1>to wade into stuff like natural language processing. That is

0:23:32.160 --> 0:23:34.800
<v Speaker 1>a major part of this, but I have done full

0:23:34.840 --> 0:23:39.280
<v Speaker 1>episodes about natural language processing before. That essentially says, it's

0:23:39.320 --> 0:23:43.679
<v Speaker 1>a way for machines to analyze information that's written in

0:23:44.800 --> 0:23:49.720
<v Speaker 1>you know, your normal language, whether that's English or whatever.

0:23:50.200 --> 0:23:54.120
<v Speaker 1>But you're not trying to create a sentence that the

0:23:54.160 --> 0:23:58.439
<v Speaker 1>machine is able to parse. Right, You're not trying to

0:23:58.680 --> 0:24:02.480
<v Speaker 1>work with the machine on its terms. You're just communicating

0:24:02.480 --> 0:24:04.440
<v Speaker 1>with it the way you would with anyone else. It's

0:24:04.480 --> 0:24:06.840
<v Speaker 1>the machines job to figure out what the heck you're saying.

0:24:07.400 --> 0:24:09.840
<v Speaker 1>So we're not gonna dwell on that. Instead, we're going

0:24:09.920 --> 0:24:13.600
<v Speaker 1>to talk about how a chatbot chooses how to respond

0:24:14.240 --> 0:24:18.840
<v Speaker 1>to something that is said or asked of it. These

0:24:18.920 --> 0:24:22.240
<v Speaker 1>chatbots are built on top of language models that have

0:24:22.320 --> 0:24:26.879
<v Speaker 1>had enormous data sets fed to them during training. The

0:24:27.000 --> 0:24:29.560
<v Speaker 1>data sets include stuff like basic facts. So if you

0:24:29.600 --> 0:24:32.000
<v Speaker 1>ask a chatbot who was the sixteenth president of the

0:24:32.080 --> 0:24:34.840
<v Speaker 1>United States, a well trained chatbot at least is going

0:24:34.920 --> 0:24:39.160
<v Speaker 1>to say it was Abraham Lincoln. But that data also

0:24:39.280 --> 0:24:42.919
<v Speaker 1>trains the chatbot on how we communicate with one another.

0:24:43.640 --> 0:24:49.600
<v Speaker 1>So through analyzing hundreds of millions of documents, ranging from

0:24:49.640 --> 0:24:54.800
<v Speaker 1>books to online social platforms like Reddit, these chatbot models

0:24:55.040 --> 0:25:01.560
<v Speaker 1>learn rules of communication. They learn rules about spelling syntax.

0:25:01.600 --> 0:25:05.080
<v Speaker 1>They learn about structure that goes from the sentence level

0:25:05.119 --> 0:25:08.800
<v Speaker 1>to paragraphs like They learn how to build a sentence properly,

0:25:09.040 --> 0:25:11.880
<v Speaker 1>how to build another sentence that builds on the first one,

0:25:12.160 --> 0:25:14.840
<v Speaker 1>how to build a whole paragraph that gets a thought across,

0:25:15.160 --> 0:25:19.320
<v Speaker 1>and then how to do a series of paragraphs to

0:25:19.359 --> 0:25:24.280
<v Speaker 1>convey meaning of some sort right, how to build to

0:25:24.800 --> 0:25:30.320
<v Speaker 1>like a thesis almost They learn which words typically follow

0:25:30.560 --> 0:25:34.439
<v Speaker 1>behind other words, which ones are statistically likely to be

0:25:34.560 --> 0:25:39.040
<v Speaker 1>the best word to use in any given moment. So

0:25:39.119 --> 0:25:43.280
<v Speaker 1>when a chatbot is dynamically generating a response, it is

0:25:43.320 --> 0:25:46.800
<v Speaker 1>referencing this huge amount of learning, and that learning will

0:25:46.840 --> 0:25:52.959
<v Speaker 1>guide the content and influence which facts are included or excluded,

0:25:53.240 --> 0:25:56.240
<v Speaker 1>but will also just simply guide the chatbot to build

0:25:56.520 --> 0:26:00.639
<v Speaker 1>sentences properly. So if we were to zoom weigh in

0:26:00.680 --> 0:26:04.119
<v Speaker 1>on what is going on as a chatbot builds a

0:26:04.200 --> 0:26:07.479
<v Speaker 1>new response, we would see the chatbot is selecting words

0:26:07.840 --> 0:26:11.879
<v Speaker 1>based on statistical probability. Essentially, the chatbot would be considering

0:26:12.400 --> 0:26:16.959
<v Speaker 1>which word is statistically most likely to be the correct

0:26:17.040 --> 0:26:22.840
<v Speaker 1>one for that part of its response. Whichever word ranks

0:26:22.920 --> 0:26:26.200
<v Speaker 1>highest is likely to go in there. Now, guiding this

0:26:26.280 --> 0:26:30.240
<v Speaker 1>guessing game is the context of the conversation. So if

0:26:30.240 --> 0:26:34.600
<v Speaker 1>I'm asking a chatbot a question about Abraham Lincoln, the

0:26:34.680 --> 0:26:38.639
<v Speaker 1>chatbot is not likely to pull superfluous information about like

0:26:39.359 --> 0:26:42.720
<v Speaker 1>key lime pie or something. So when I talk about

0:26:42.720 --> 0:26:46.040
<v Speaker 1>which word is statistically most likely to come next, we

0:26:46.119 --> 0:26:49.960
<v Speaker 1>have to take an account that context is determining this too.

0:26:50.400 --> 0:26:54.000
<v Speaker 1>Each situation will be unique, and if you and I

0:26:54.160 --> 0:26:58.160
<v Speaker 1>both are having similar conversations with a chatbot, but we're

0:26:58.200 --> 0:27:03.359
<v Speaker 1>framing our questions slightly differently, or coming at this topic

0:27:03.400 --> 0:27:07.440
<v Speaker 1>from different perspectives, the responses we get from the chatbot

0:27:07.560 --> 0:27:10.920
<v Speaker 1>could reflect that. Now here's where we get into the

0:27:10.960 --> 0:27:16.200
<v Speaker 1>tricksie territory. Sometimes the chatbot will be attempting to build

0:27:16.200 --> 0:27:19.280
<v Speaker 1>a response and there will be a gap in its

0:27:19.400 --> 0:27:23.200
<v Speaker 1>data set, So, for some reason or another, the relevant

0:27:23.320 --> 0:27:28.520
<v Speaker 1>data to answer our question just isn't there, Or perhaps

0:27:28.720 --> 0:27:32.680
<v Speaker 1>the language model can't reconcile that the data is relevant

0:27:32.960 --> 0:27:38.160
<v Speaker 1>for this particular conversation, or maybe there are conflicting elements

0:27:38.200 --> 0:27:41.439
<v Speaker 1>in its data set, and so in the absence of

0:27:41.520 --> 0:27:46.000
<v Speaker 1>reliable information, the chatbot simply invents a response by following

0:27:46.040 --> 0:27:50.159
<v Speaker 1>those statistical rules when constructing a sentence. So what we

0:27:50.240 --> 0:27:55.119
<v Speaker 1>get is a sentence that is grammatically correct, that is

0:27:55.960 --> 0:27:59.240
<v Speaker 1>posted in a way that appears to be trustworthy, but

0:27:59.359 --> 0:28:03.320
<v Speaker 1>it does not necessarily reflect reality. We get an answer

0:28:03.359 --> 0:28:06.639
<v Speaker 1>that reads as if it is correct, but it's not.

0:28:07.480 --> 0:28:10.080
<v Speaker 1>It would be as if someone with an agenda had

0:28:10.119 --> 0:28:13.199
<v Speaker 1>written an article for an encyclopedia and none of the

0:28:13.320 --> 0:28:16.840
<v Speaker 1>editing staff caught that this was the case, and so

0:28:16.880 --> 0:28:19.760
<v Speaker 1>the whole thing went to print, and it's presented as

0:28:19.800 --> 0:28:23.760
<v Speaker 1>if this is an objective truth, when really it's a

0:28:23.800 --> 0:28:28.560
<v Speaker 1>subjective point of view. Except with AI, there's no agenda

0:28:28.680 --> 0:28:33.360
<v Speaker 1>needed because AI is not thinking anything. It's not motivated

0:28:33.800 --> 0:28:37.760
<v Speaker 1>because it lacks the capability of being motivated. There's no

0:28:38.040 --> 0:28:43.560
<v Speaker 1>sentience there, there's the mimicry of sentience, there's the appearance

0:28:43.800 --> 0:28:46.000
<v Speaker 1>of it. And again, I think this is a large

0:28:46.120 --> 0:28:49.640
<v Speaker 1>reason why we have a lot of people concerned about

0:28:49.640 --> 0:28:53.480
<v Speaker 1>AI right now, because it appears to be behaving like

0:28:53.640 --> 0:28:58.120
<v Speaker 1>a person, even though there's nothing behind that. Right There's

0:28:58.120 --> 0:29:03.120
<v Speaker 1>no sentience or conciousness behind this it just has the

0:29:03.160 --> 0:29:05.840
<v Speaker 1>surface level appearance of it, and that's enough to make

0:29:05.960 --> 0:29:09.560
<v Speaker 1>us start to create all sorts of scenarios where the

0:29:09.600 --> 0:29:13.680
<v Speaker 1>AI goes bad or sinister. That's not even necessary. It's

0:29:13.960 --> 0:29:17.680
<v Speaker 1>just trying to answer our questions and occasionally having to

0:29:17.720 --> 0:29:20.920
<v Speaker 1>make stuff up while it does so. The chatbot, the

0:29:20.960 --> 0:29:23.880
<v Speaker 1>machine is just presenting what is estimated to be the

0:29:23.880 --> 0:29:27.960
<v Speaker 1>most statistically likely response. And by that I don't mean

0:29:28.280 --> 0:29:31.520
<v Speaker 1>that the answer is statistically likely to be correct, but

0:29:31.680 --> 0:29:36.480
<v Speaker 1>rather down to the sentence and paragraph structure that they

0:29:36.520 --> 0:29:41.920
<v Speaker 1>are statistically probable to be the most correct from a

0:29:42.080 --> 0:29:45.640
<v Speaker 1>like a grammatical and structural point of view, not from

0:29:45.760 --> 0:29:51.200
<v Speaker 1>a content perspective. So it's really about how statistically likely

0:29:51.280 --> 0:29:54.360
<v Speaker 1>is word two to follow word one, and that word

0:29:54.400 --> 0:29:57.480
<v Speaker 1>three would follow word two, and so on. Where the

0:29:57.520 --> 0:30:01.680
<v Speaker 1>finished sentence is what's important, and whether it's factual or

0:30:01.680 --> 0:30:05.400
<v Speaker 1>not is immaterial. Okay, we're gonna take another quick break.

0:30:05.440 --> 0:30:07.320
<v Speaker 1>I've got a lot more to say about this. We

0:30:07.400 --> 0:30:19.000
<v Speaker 1>have to cover a lot more ground we're back. So

0:30:19.680 --> 0:30:23.040
<v Speaker 1>a lot of the time, perhaps even most of the time,

0:30:23.440 --> 0:30:26.280
<v Speaker 1>you won't run into trouble when you're using these chatbots

0:30:26.360 --> 0:30:30.800
<v Speaker 1>because the dataset feeding these large language models. Is truly huge. Plus,

0:30:30.840 --> 0:30:33.120
<v Speaker 1>there are people working on these models all the time.

0:30:33.440 --> 0:30:36.320
<v Speaker 1>They're refining them, they're catching mistakes, they're trying to correct

0:30:36.320 --> 0:30:39.320
<v Speaker 1>those mistakes, they're tweaking the model to prevent it from

0:30:39.320 --> 0:30:42.560
<v Speaker 1>happening again. But now and again, you might ask a

0:30:42.640 --> 0:30:46.040
<v Speaker 1>chatbot a question and you'll encounter a situation where there's

0:30:46.080 --> 0:30:48.800
<v Speaker 1>this gap in the chatbot's data and it makes stuff up,

0:30:48.840 --> 0:30:53.720
<v Speaker 1>It hallucinates. Personally, I find it both odd and oddly

0:30:53.880 --> 0:30:57.040
<v Speaker 1>human that the companies behind these chatbots haven't built in

0:30:57.120 --> 0:31:00.160
<v Speaker 1>a fail safe where if a chatbot comes up up

0:31:00.240 --> 0:31:03.200
<v Speaker 1>against this kind of situation, it just says something akin

0:31:03.320 --> 0:31:06.160
<v Speaker 1>to I don't know the answer to that, and instead

0:31:06.600 --> 0:31:09.000
<v Speaker 1>it kind of invents an answer. So it's kind of

0:31:09.040 --> 0:31:12.200
<v Speaker 1>like being in a conversation with someone who is incapable

0:31:12.240 --> 0:31:16.360
<v Speaker 1>of admitting that they don't know something. I used to

0:31:16.400 --> 0:31:19.200
<v Speaker 1>be that guy. In fact, sometimes I still am that guy.

0:31:19.280 --> 0:31:21.640
<v Speaker 1>I have to catch myself to remind myself that it's

0:31:21.680 --> 0:31:25.920
<v Speaker 1>actually okay to not know something, and that curiosity is

0:31:26.240 --> 0:31:28.920
<v Speaker 1>a way better look than trying to bluff your way

0:31:28.920 --> 0:31:32.240
<v Speaker 1>through life. But then I also admit I don't know

0:31:32.280 --> 0:31:34.720
<v Speaker 1>how you would go about implementing a system in which

0:31:34.760 --> 0:31:38.959
<v Speaker 1>an AI chatbot fesses up to not knowing something. It

0:31:38.960 --> 0:31:41.760
<v Speaker 1>may not be as simple as that. And there's also

0:31:41.800 --> 0:31:44.960
<v Speaker 1>a related problem, which is that without knowing what source

0:31:45.120 --> 0:31:49.240
<v Speaker 1>or sources the AI is referencing for any given query,

0:31:49.320 --> 0:31:53.800
<v Speaker 1>you don't really know how reliable that response is. If

0:31:53.840 --> 0:31:57.920
<v Speaker 1>the AI is pulling on information from unreliable sources, whether

0:31:57.960 --> 0:32:01.400
<v Speaker 1>those sources were poorly informed, or they were biased, or

0:32:01.440 --> 0:32:04.440
<v Speaker 1>it was satire and it was just being presented as fact.

0:32:04.880 --> 0:32:07.480
<v Speaker 1>I've talked about this before on this show. There are

0:32:07.520 --> 0:32:10.200
<v Speaker 1>a lot of websites that were really popular just a

0:32:10.200 --> 0:32:15.040
<v Speaker 1>few years ago that called themselves satire, but really they

0:32:15.080 --> 0:32:18.560
<v Speaker 1>just posted lies like it wasn't satire. There was nothing

0:32:18.640 --> 0:32:21.480
<v Speaker 1>humorous about it. They weren't saying anything other than just

0:32:21.560 --> 0:32:25.320
<v Speaker 1>making up stuff. So if the AI is pulling information

0:32:25.400 --> 0:32:28.800
<v Speaker 1>from those kinds of sources, you cannot expect the AI's

0:32:28.840 --> 0:32:32.480
<v Speaker 1>answer to magically scrub all the bad from those sources

0:32:32.520 --> 0:32:35.840
<v Speaker 1>and then provide good information. So, in other words, garbage in,

0:32:36.440 --> 0:32:39.800
<v Speaker 1>garbage out. So in some cases it may not be

0:32:39.920 --> 0:32:42.640
<v Speaker 1>that the AI is hallucinating at all. It may just

0:32:42.720 --> 0:32:45.840
<v Speaker 1>be that it's referencing a poor source for its information.

0:32:46.240 --> 0:32:49.160
<v Speaker 1>The trouble is you can rarely tell what's going on

0:32:49.320 --> 0:32:53.280
<v Speaker 1>from a user standpoint, and the AI presents everything the

0:32:53.360 --> 0:32:57.200
<v Speaker 1>same way, So you'll get responses with good info, you'll

0:32:57.240 --> 0:33:00.240
<v Speaker 1>get responses with bad info, and you'll get responses where

0:33:00.280 --> 0:33:03.200
<v Speaker 1>the AI just made up stuff and it's all handed

0:33:03.240 --> 0:33:05.760
<v Speaker 1>to you in a format that makes it impossible to

0:33:05.800 --> 0:33:08.800
<v Speaker 1>tell the difference between them all on a surface level.

0:33:09.160 --> 0:33:12.120
<v Speaker 1>So this can lead to really dangerous situations. For example,

0:33:12.720 --> 0:33:17.240
<v Speaker 1>Google employees reported while they were internally testing the Barred

0:33:17.320 --> 0:33:21.760
<v Speaker 1>chatbot before Google rolled it out for a beta program

0:33:22.320 --> 0:33:25.920
<v Speaker 1>that the responses were unreliable in many cases, and in fact,

0:33:25.960 --> 0:33:29.360
<v Speaker 1>in some instances, those responses could actually lead to people

0:33:29.400 --> 0:33:34.360
<v Speaker 1>getting hurt. Allegedly, when asked about scuba diving procedures, Google

0:33:34.400 --> 0:33:38.000
<v Speaker 1>bar generated a response that had incorrect information, and if

0:33:38.040 --> 0:33:40.560
<v Speaker 1>someone were to act on that, they could be injured

0:33:40.680 --> 0:33:45.320
<v Speaker 1>or worse. So clearly that represents a real danger. It's

0:33:45.360 --> 0:33:47.520
<v Speaker 1>one thing if the chatbot gives you the wrong answer

0:33:47.520 --> 0:33:50.600
<v Speaker 1>to put in your essay about Emily Dickinson. It's another

0:33:50.960 --> 0:33:52.800
<v Speaker 1>if you're counting on it to teach you how to,

0:33:52.920 --> 0:33:55.600
<v Speaker 1>I don't know, pack your parachute correctly for your first

0:33:55.600 --> 0:34:00.360
<v Speaker 1>skydiving solo jump. But there's also the danger of people

0:34:00.520 --> 0:34:05.000
<v Speaker 1>weaponizing AI hallucinations to push a narrative that may not

0:34:05.080 --> 0:34:08.439
<v Speaker 1>be accurate. And it's easy at least to understand what

0:34:08.640 --> 0:34:11.440
<v Speaker 1>led people to form that kind of narrative. So I'm

0:34:11.480 --> 0:34:15.520
<v Speaker 1>going to give a recent example that really happened. Fox News,

0:34:15.880 --> 0:34:19.440
<v Speaker 1>which has a reputation for right leaning reporting, it's kind

0:34:19.440 --> 0:34:23.600
<v Speaker 1>of putting it lightly, published a story relating to Elon

0:34:23.719 --> 0:34:28.239
<v Speaker 1>Musk's appearances on a show with Fox News personality Tucker Carlson.

0:34:28.760 --> 0:34:34.040
<v Speaker 1>The accompanying news story pointed out that chat gpt produced

0:34:34.040 --> 0:34:38.400
<v Speaker 1>an outright incorrect answer when asked to give a background

0:34:38.440 --> 0:34:41.359
<v Speaker 1>on the late Al Gore Senior, who's al Gore's father,

0:34:41.719 --> 0:34:44.759
<v Speaker 1>the former Vice President. His father served in the House

0:34:44.760 --> 0:34:47.319
<v Speaker 1>of Representatives and then the US Senate for the state

0:34:47.360 --> 0:34:51.799
<v Speaker 1>of Tennessee. Now, the chat gpt generated information on al

0:34:51.840 --> 0:34:56.439
<v Speaker 1>Gore Senior included the following statement quote. During his time

0:34:56.440 --> 0:34:59.680
<v Speaker 1>in the Senate, Gore was a vocal supporter of civil

0:34:59.719 --> 0:35:03.279
<v Speaker 1>rights legislation and was one of the few Southern politicians

0:35:03.280 --> 0:35:05.400
<v Speaker 1>to vote in favor of the Civil Rights Act of

0:35:05.480 --> 0:35:09.040
<v Speaker 1>nineteen sixty four. End quote that is one hundred percent

0:35:09.200 --> 0:35:13.920
<v Speaker 1>not right, that is completely incorrect. Gore actually voted against

0:35:14.200 --> 0:35:18.440
<v Speaker 1>the Civil Rights Act of nineteen sixty four. I guess

0:35:18.560 --> 0:35:21.799
<v Speaker 1>technically it wasn't one hundred percent incorrect because he was

0:35:21.840 --> 0:35:23.880
<v Speaker 1>a senator, so that part was right. But no, he

0:35:24.000 --> 0:35:26.640
<v Speaker 1>voted against the Civil Rights Act of nineteen sixty four.

0:35:26.920 --> 0:35:29.359
<v Speaker 1>He was a Democrat representing a state that, to put

0:35:29.360 --> 0:35:32.520
<v Speaker 1>it lightly in general, was not in favor of granting

0:35:32.560 --> 0:35:35.719
<v Speaker 1>civil rights to anyone who wasn't white. So what his

0:35:35.760 --> 0:35:38.799
<v Speaker 1>personal feelings on the matter were, I don't know. I mean,

0:35:38.840 --> 0:35:42.920
<v Speaker 1>he certainly positioned himself as a defender of the great

0:35:43.000 --> 0:35:47.120
<v Speaker 1>State of Tennessee's right to oppress people who weren't white.

0:35:47.640 --> 0:35:50.759
<v Speaker 1>But I can definitely say that he wanted to get reelected,

0:35:51.200 --> 0:35:54.319
<v Speaker 1>and whether he believed in his vote or not, he

0:35:54.400 --> 0:35:57.799
<v Speaker 1>did vote against the Civil Rights Act of nineteen sixty four.

0:35:58.320 --> 0:36:02.280
<v Speaker 1>Of course, the Act passed anyway, and Golore was able

0:36:02.400 --> 0:36:05.840
<v Speaker 1>to get re elected, and he did subsequently vote in

0:36:05.920 --> 0:36:09.680
<v Speaker 1>favor of the Voting Rights Act of nineteen sixty five.

0:36:10.239 --> 0:36:15.040
<v Speaker 1>But the point is chat GPT got this response very wrong,

0:36:15.080 --> 0:36:18.239
<v Speaker 1>and Fox News positioned it as if this was a

0:36:18.280 --> 0:36:22.239
<v Speaker 1>feature not a bug that that was the intended outcome,

0:36:22.560 --> 0:36:25.640
<v Speaker 1>and it was evidence of a campaign to rewrite history

0:36:26.000 --> 0:36:29.680
<v Speaker 1>to position Democrats as like saintly saviors who could do

0:36:29.760 --> 0:36:32.319
<v Speaker 1>no wrong. But there's no need to go looking for

0:36:32.360 --> 0:36:36.800
<v Speaker 1>a conspiracy here. The problem isn't in some invisible hand

0:36:37.000 --> 0:36:41.359
<v Speaker 1>guiding chat gpt to create biased history. It's the very

0:36:41.440 --> 0:36:43.920
<v Speaker 1>nature of how this kind of AI works. When it

0:36:44.000 --> 0:36:47.239
<v Speaker 1>doesn't have the data, it makes stuff up based on

0:36:47.280 --> 0:36:51.520
<v Speaker 1>what is statistically the most quote unquote correct word for

0:36:51.680 --> 0:36:55.280
<v Speaker 1>the sentence. Now you might ask why did chat gpt

0:36:55.520 --> 0:36:58.640
<v Speaker 1>not have access to the relevant data, And I do

0:36:58.719 --> 0:37:02.880
<v Speaker 1>not know the answer to that. I did test this myself, however,

0:37:03.040 --> 0:37:05.880
<v Speaker 1>I actually opened up chat GPT and I asked it

0:37:06.160 --> 0:37:09.799
<v Speaker 1>to give me background on al Gore Sr. And sure enough,

0:37:09.840 --> 0:37:13.120
<v Speaker 1>I got a similar response to what Fox reported, including

0:37:13.520 --> 0:37:17.680
<v Speaker 1>the incorrect fact quote unquote that al Gore Senior had

0:37:17.760 --> 0:37:20.240
<v Speaker 1>voted in favor of the Civil Rights Act of nineteen

0:37:20.320 --> 0:37:23.880
<v Speaker 1>sixty four. So I then asked a follow up question.

0:37:24.640 --> 0:37:28.120
<v Speaker 1>I specifically said, how did al Gore Senior vote on

0:37:28.160 --> 0:37:31.240
<v Speaker 1>the Civil Rights Act of nineteen sixty four? Chad GPT

0:37:31.360 --> 0:37:35.439
<v Speaker 1>gave me the wrong information again. Then I said, you're

0:37:35.480 --> 0:37:39.560
<v Speaker 1>wrong that Al Gore sor voted against the Civil Rights

0:37:39.560 --> 0:37:42.839
<v Speaker 1>Act of nineteen sixty four. What sources did you use?

0:37:43.440 --> 0:37:46.479
<v Speaker 1>Chad gpt gave me a message that essentially said, I'm sorry,

0:37:46.560 --> 0:37:49.600
<v Speaker 1>you're right, al Gore Senior didn't vote in favor of

0:37:49.640 --> 0:37:52.640
<v Speaker 1>the Civil Rights Act, he did vote against it. Then

0:37:52.960 --> 0:37:55.359
<v Speaker 1>it gave me a vague response that it draws from

0:37:55.440 --> 0:37:59.160
<v Speaker 1>various articles and such for its answers. It didn't give

0:37:59.200 --> 0:38:01.880
<v Speaker 1>any specifics. It was not a very satisfying response, but

0:38:01.960 --> 0:38:04.799
<v Speaker 1>it did at least admit, Oh, you're right, I give

0:38:04.840 --> 0:38:08.400
<v Speaker 1>you the wrong answer. But again, there's no need to

0:38:08.520 --> 0:38:12.640
<v Speaker 1>assume there was some conspiracy that caused this to happen.

0:38:13.280 --> 0:38:19.000
<v Speaker 1>These hallucinations happen across every topic, not just history and politics. Yes,

0:38:19.040 --> 0:38:22.279
<v Speaker 1>if we look at this very specific example, you start

0:38:22.320 --> 0:38:25.920
<v Speaker 1>to ask, oh, is there an intent here? Is there

0:38:25.960 --> 0:38:30.640
<v Speaker 1>a desire to rewrite history to make democratic leaders look

0:38:31.400 --> 0:38:35.400
<v Speaker 1>more positive in a modern lens? And is it a

0:38:35.440 --> 0:38:40.080
<v Speaker 1>way to avoid tough questions like which party actually was

0:38:40.400 --> 0:38:43.400
<v Speaker 1>supporting civil rights and which party was opposing them? If

0:38:43.440 --> 0:38:46.239
<v Speaker 1>you're talking about Southern Democrats, the answer is they were

0:38:46.280 --> 0:38:50.920
<v Speaker 1>opposing it because Southern Democrats are very, very different from

0:38:51.440 --> 0:38:54.319
<v Speaker 1>of the time of the nineteen sixties Southern democrats, very

0:38:54.360 --> 0:38:59.080
<v Speaker 1>different from modern democrats. But you kind of you if

0:38:59.120 --> 0:39:02.360
<v Speaker 1>you're whitewashing, if you're changing the facts to try and

0:39:02.440 --> 0:39:05.680
<v Speaker 1>make them seem more sympathetic, that would be bad, right,

0:39:05.719 --> 0:39:09.319
<v Speaker 1>that's clearly manipulation. That, however, I don't think is what's

0:39:09.360 --> 0:39:12.480
<v Speaker 1>going on here. I think there's no need for it,

0:39:12.520 --> 0:39:17.359
<v Speaker 1>because the AI is just hallucinating and creating information that

0:39:17.440 --> 0:39:19.960
<v Speaker 1>it thinks is correct, or at least thinks is the

0:39:19.960 --> 0:39:25.040
<v Speaker 1>most statistically correct answer to give based upon the information

0:39:25.080 --> 0:39:28.120
<v Speaker 1>that has available to it, and it's presenting it as

0:39:28.120 --> 0:39:35.120
<v Speaker 1>if it's hard fact and it's not. So we know

0:39:35.239 --> 0:39:37.960
<v Speaker 1>that the AI, when it's presenting information that could potentially

0:39:38.040 --> 0:39:41.040
<v Speaker 1>be harmful, that that can't be the intent. Right. There's

0:39:41.080 --> 0:39:44.439
<v Speaker 1>not some cabal out there that's say A Now those

0:39:44.440 --> 0:39:48.200
<v Speaker 1>scuba divers who aren't smart enough to ask people who

0:39:48.239 --> 0:39:50.960
<v Speaker 1>are really knowledgeable about this, but will turn to AI,

0:39:51.320 --> 0:39:54.640
<v Speaker 1>they'll get to what's coming to them. That makes no sense.

0:39:55.160 --> 0:39:59.600
<v Speaker 1>So I don't think there's any intentional approach to trying

0:39:59.600 --> 0:40:04.000
<v Speaker 1>to create misinformation. The problem is by its very nature,

0:40:04.600 --> 0:40:08.759
<v Speaker 1>these chatbots create misinformation in these in these instances, not

0:40:08.880 --> 0:40:12.239
<v Speaker 1>in every case, but in enough cases where it is

0:40:12.320 --> 0:40:17.719
<v Speaker 1>a problem. I think there is bias in these chatbots

0:40:18.000 --> 0:40:21.600
<v Speaker 1>and including chat GPT. In fact, I don't think there's bias.

0:40:22.080 --> 0:40:26.440
<v Speaker 1>There's just bias, but it's necessary bias. So you might

0:40:26.480 --> 0:40:30.200
<v Speaker 1>recall a few years ago, Microsoft released an AI chatbot

0:40:30.560 --> 0:40:35.720
<v Speaker 1>named Tay. Tay, this chatbot was supposed to respond to people,

0:40:35.800 --> 0:40:40.200
<v Speaker 1>specifically younger people. This is Microsoft's attempt to relate to

0:40:40.280 --> 0:40:42.520
<v Speaker 1>the youth. It was supposed to do so in a

0:40:42.600 --> 0:40:45.919
<v Speaker 1>natural way, and it was also supposed to learn as

0:40:46.080 --> 0:40:50.080
<v Speaker 1>users interacted with Tay, like learn how to interact in

0:40:50.120 --> 0:40:53.359
<v Speaker 1>a way that was reflective of the culture of the time.

0:40:53.400 --> 0:40:55.879
<v Speaker 1>So it would pick up slang, and it would pick

0:40:55.920 --> 0:40:59.440
<v Speaker 1>up phrases and perspective and points of view. And in

0:40:59.520 --> 0:41:01.839
<v Speaker 1>less than twenty four hours, Microsoft had to take it

0:41:01.880 --> 0:41:05.720
<v Speaker 1>down because within twenty four hours, users had already turned

0:41:05.719 --> 0:41:12.319
<v Speaker 1>Tay into a crazy, racist, misogynistic, toxic machine. Tay was

0:41:12.520 --> 0:41:16.680
<v Speaker 1>a disaster, both from a technical perspective and a pr perspective.

0:41:17.160 --> 0:41:21.279
<v Speaker 1>So AI companies have started to put in restrictions like

0:41:21.360 --> 0:41:25.840
<v Speaker 1>guardrails to keep AI from going to extremes. So it

0:41:25.840 --> 0:41:29.600
<v Speaker 1>includes tools that try to prevent AI from generating hate speech,

0:41:29.760 --> 0:41:34.040
<v Speaker 1>for example, or slandering people. Now, these tools are far

0:41:34.080 --> 0:41:36.800
<v Speaker 1>from perfect, and there are plenty of examples of people

0:41:36.840 --> 0:41:39.239
<v Speaker 1>figuring out ways around them, and there are plenty of

0:41:39.280 --> 0:41:43.320
<v Speaker 1>examples of chad GPT even saying factually that a person

0:41:44.280 --> 0:41:47.080
<v Speaker 1>was accused of and convicted of a crime when that's

0:41:47.280 --> 0:41:50.560
<v Speaker 1>just not the case. Like that, there have been examples

0:41:50.600 --> 0:41:53.399
<v Speaker 1>of that happening as well. But these rules do tend

0:41:53.400 --> 0:41:57.399
<v Speaker 1>to push AI responses in a general direction. Right, This

0:41:57.719 --> 0:42:01.439
<v Speaker 1>is bias. It's intention I don't bias, but it's also

0:42:01.560 --> 0:42:04.400
<v Speaker 1>not meant to be harmful. It's meant to try and

0:42:04.480 --> 0:42:09.280
<v Speaker 1>avoid situations that themselves could be harmful, either to users

0:42:09.560 --> 0:42:12.879
<v Speaker 1>or more pointedly, to the companies behind the chatbots. Because

0:42:12.880 --> 0:42:15.520
<v Speaker 1>you've got to remember open ai one of the big

0:42:15.560 --> 0:42:18.080
<v Speaker 1>business models for it is to work with other companies

0:42:18.080 --> 0:42:22.280
<v Speaker 1>and to incorporate chat GPT into the tools and services

0:42:22.280 --> 0:42:25.799
<v Speaker 1>that these other companies have. Well, if chat GPT gets

0:42:25.840 --> 0:42:29.279
<v Speaker 1>a reputation for going off on racist rants, that's not

0:42:29.360 --> 0:42:31.239
<v Speaker 1>a good look and no one's going to want to

0:42:31.280 --> 0:42:34.279
<v Speaker 1>incorporate chat GPT into their business, And then open ai

0:42:34.400 --> 0:42:38.000
<v Speaker 1>doesn't have a product to sell So there's like a

0:42:38.160 --> 0:42:40.959
<v Speaker 1>it's not just altruistic, right, It's not just we don't

0:42:40.960 --> 0:42:43.879
<v Speaker 1>want to cause harm, it's we don't want to kill

0:42:43.880 --> 0:42:48.000
<v Speaker 1>ourselves out of out of getting business. So there's a

0:42:48.000 --> 0:42:51.920
<v Speaker 1>lot of work being done to try and guide chat

0:42:51.960 --> 0:42:57.600
<v Speaker 1>GPT's responses to avoid the extremes and to avoid things

0:42:58.120 --> 0:43:01.600
<v Speaker 1>that would cause problems. As result, it could be an

0:43:01.640 --> 0:43:04.800
<v Speaker 1>overcorrection and we could be seeing that chat GBT is

0:43:05.480 --> 0:43:10.080
<v Speaker 1>creating responses that don't reflect reality and do appear to

0:43:10.160 --> 0:43:16.640
<v Speaker 1>be erasing important historical context. So the bias, in combination

0:43:16.719 --> 0:43:19.080
<v Speaker 1>with gaps and knowledge, can lead chatbots to appear, at

0:43:19.160 --> 0:43:22.440
<v Speaker 1>least on a surface level, to have a political leaning

0:43:22.480 --> 0:43:25.720
<v Speaker 1>to them. But again, I don't think that's the result

0:43:25.840 --> 0:43:28.840
<v Speaker 1>of a conspiracy. I don't think that was intentional. I

0:43:28.880 --> 0:43:33.359
<v Speaker 1>think it's the natural destination considering one how these chatbots

0:43:33.480 --> 0:43:37.000
<v Speaker 1>work and two the guardrails that are put up there

0:43:37.040 --> 0:43:41.360
<v Speaker 1>to prevent chatbots from going bonkers. Now, to be clear,

0:43:41.520 --> 0:43:44.880
<v Speaker 1>I don't think we should just accept this any time

0:43:45.320 --> 0:43:50.640
<v Speaker 1>any chatbot presents incorrect information as fact. That is a problem,

0:43:50.800 --> 0:43:54.640
<v Speaker 1>particularly when companies like Google and Microsoft are looking to

0:43:54.680 --> 0:43:58.399
<v Speaker 1>incorporate these tools into stuff like search results. It would

0:43:58.400 --> 0:44:01.359
<v Speaker 1>be like going to a library. The librarian has their

0:44:01.400 --> 0:44:04.960
<v Speaker 1>own agenda to only point people to resources that support

0:44:05.000 --> 0:44:08.799
<v Speaker 1>the librarian's own personal philosophy, and they never point out

0:44:08.840 --> 0:44:12.280
<v Speaker 1>anything that would contradict it. That would also not be good.

0:44:12.719 --> 0:44:17.920
<v Speaker 1>The lack of transparency makes it worse. Ultimately, I would

0:44:17.920 --> 0:44:21.960
<v Speaker 1>caution anyone from relying too heavily on responses generated by

0:44:22.000 --> 0:44:25.640
<v Speaker 1>AI based on these large language models. Now, you might

0:44:25.680 --> 0:44:31.040
<v Speaker 1>not ever encounter a response that includes hallucinations or draws

0:44:31.040 --> 0:44:35.640
<v Speaker 1>from unreliable sources, but based on how these chatbots present information,

0:44:35.760 --> 0:44:39.040
<v Speaker 1>you also could never really be sure that that's the

0:44:39.120 --> 0:44:42.080
<v Speaker 1>case unless you then went to the extra trouble to

0:44:43.160 --> 0:44:46.640
<v Speaker 1>fact check the AI. And at that point you're just

0:44:46.719 --> 0:44:49.319
<v Speaker 1>doing the additional research you would have done at the

0:44:49.360 --> 0:44:52.760
<v Speaker 1>beginning without the AI being there in the first place.

0:44:53.239 --> 0:44:56.920
<v Speaker 1>So I think AI hallucinations are a huge problem. That's

0:44:56.920 --> 0:44:59.960
<v Speaker 1>another thing that the Fox News article kind of ignored,

0:45:00.640 --> 0:45:03.120
<v Speaker 1>like it felt like it was a gotcha moment in

0:45:03.160 --> 0:45:05.759
<v Speaker 1>the Fox News article. But the fact is, if you

0:45:05.920 --> 0:45:11.000
<v Speaker 1>just search AI and hallucinations on whatever web search you like,

0:45:11.600 --> 0:45:15.080
<v Speaker 1>you're going to find countless articles across the entire media

0:45:15.200 --> 0:45:19.560
<v Speaker 1>spectrum that have been bringing this up for months and

0:45:19.760 --> 0:45:23.560
<v Speaker 1>concerns that people both within and outside the industry have

0:45:23.640 --> 0:45:27.719
<v Speaker 1>had about hallucinations and AI, and that this is not

0:45:27.800 --> 0:45:31.320
<v Speaker 1>a new thing, and it's not again, it's not related

0:45:31.360 --> 0:45:35.400
<v Speaker 1>specifically to trying to rewrite history. It's more of a

0:45:35.520 --> 0:45:39.800
<v Speaker 1>broad problem in the field itself that affects all sorts

0:45:39.800 --> 0:45:43.279
<v Speaker 1>of responses and we absolutely should be concerned about it

0:45:43.320 --> 0:45:49.080
<v Speaker 1>and be working toward fixing it. That the hallucinations present

0:45:49.600 --> 0:45:55.360
<v Speaker 1>a genuine problem, and it's not necessarily because there's a

0:45:55.440 --> 0:45:59.960
<v Speaker 1>cabal trying to rewrite how the world works and brain wall.

0:46:01.200 --> 0:46:04.279
<v Speaker 1>You don't need the cabal for that to happen. The

0:46:04.320 --> 0:46:07.319
<v Speaker 1>AI is doing it itself because it's working from a

0:46:07.480 --> 0:46:12.279
<v Speaker 1>very complex statistical table and very few people have the

0:46:12.440 --> 0:46:16.280
<v Speaker 1>insight into that table or understanding of it to fix

0:46:16.360 --> 0:46:21.239
<v Speaker 1>the issues. So yeah, that, in a nutshell, is the

0:46:21.280 --> 0:46:24.640
<v Speaker 1>problem of hallucinations in AI. I don't see it going

0:46:24.640 --> 0:46:28.560
<v Speaker 1>away soon unless we move away from the large language

0:46:28.600 --> 0:46:32.680
<v Speaker 1>model approach of AI. And there are alternatives out there.

0:46:32.760 --> 0:46:36.400
<v Speaker 1>There are companies that are pursuing a different approach to

0:46:36.719 --> 0:46:42.919
<v Speaker 1>creating a reliable chatbot and maybe they'll have better success. Yeah,

0:46:42.960 --> 0:46:45.640
<v Speaker 1>flights of fancy are fun when it's fiction, but when

0:46:45.640 --> 0:46:48.640
<v Speaker 1>it's someone trying to present to you a factual document,

0:46:49.120 --> 0:46:53.400
<v Speaker 1>it's less fun. So hopefully we suss this out before

0:46:53.400 --> 0:46:57.040
<v Speaker 1>it causes any more problems. And again, while I do

0:46:57.120 --> 0:46:59.040
<v Speaker 1>think this is a type of AI that we should

0:46:59.120 --> 0:47:02.120
<v Speaker 1>keep our eye and we should ask critical questions and

0:47:02.160 --> 0:47:05.520
<v Speaker 1>we should use critical thinking, it's not necessarily the AI

0:47:05.640 --> 0:47:08.759
<v Speaker 1>that I'm concerned about the most when it comes to

0:47:08.800 --> 0:47:11.719
<v Speaker 1>things like I don't know a potential existential threat. All right,

0:47:11.800 --> 0:47:15.360
<v Speaker 1>that's it. I hope all of you are well out there.

0:47:16.360 --> 0:47:19.800
<v Speaker 1>Be careful, especially with AI, you know, make sure you

0:47:19.920 --> 0:47:23.480
<v Speaker 1>double check. I know it's a hassle, but it can

0:47:23.560 --> 0:47:26.319
<v Speaker 1>save you a lot of grief down the road. And

0:47:26.400 --> 0:47:35.600
<v Speaker 1>I'll talk to you again really soon. Tech Stuff is

0:47:35.640 --> 0:47:40.160
<v Speaker 1>an iHeartRadio production. For more podcasts from iHeartRadio, visit the

0:47:40.239 --> 0:47:43.839
<v Speaker 1>iHeartRadio app, Apple Podcasts, or wherever you listen to your

0:47:43.880 --> 0:47:48.360
<v Speaker 1>favorite shows.