1 00:00:02,520 --> 00:00:07,400 Speaker 1: All Zone Media. Hello and welcome to Better Offline. I'm 2 00:00:07,400 --> 00:00:22,000 Speaker 1: your host ed zetron. Wikipedia is a free online encyclopedia 3 00:00:22,079 --> 00:00:24,520 Speaker 1: used by over a billion people a month, one that, 4 00:00:24,640 --> 00:00:27,680 Speaker 1: despite being maintained entirely by an army of free and 5 00:00:27,760 --> 00:00:31,320 Speaker 1: at times anonymous contributors, remains one of the most reliable 6 00:00:31,320 --> 00:00:35,320 Speaker 1: sources of information on the Internet. Wikipedia's sixty two million 7 00:00:35,400 --> 00:00:39,000 Speaker 1: articles are edited by over one hundred thousand contributors, and 8 00:00:39,040 --> 00:00:42,199 Speaker 1: these contributors have somehow done a better job maintaining the 9 00:00:42,280 --> 00:00:45,320 Speaker 1: quality and validity of the information than any of the 10 00:00:45,360 --> 00:00:50,200 Speaker 1: information provided by any of the major platforms today. Wikipedia 11 00:00:50,240 --> 00:00:54,480 Speaker 1: is funded entirely through the Wikimedia Foundation, a nonprofit organization 12 00:00:54,560 --> 00:00:57,120 Speaker 1: funded through public donations and grants, and has the staff 13 00:00:57,160 --> 00:01:00,200 Speaker 1: of over seven hundred people. Yeah, and I'll thank you. 14 00:01:00,240 --> 00:01:03,880 Speaker 1: Most people have absolutely no idea how intricate the webs 15 00:01:03,920 --> 00:01:07,480 Speaker 1: encyclopedia is. And today I'm joined by Gorilla Warfare, a 16 00:01:07,520 --> 00:01:10,319 Speaker 1: Wikipedia editor that's made over one hundred and thirty thousand 17 00:01:10,480 --> 00:01:13,600 Speaker 1: edits in the span of eighteen years across two accounts. 18 00:01:13,959 --> 00:01:16,920 Speaker 1: You'll likely know her better as researcher and critic Molly White, 19 00:01:17,040 --> 00:01:20,400 Speaker 1: the creator of the Citation Needed newsletter and web three 20 00:01:20,480 --> 00:01:27,640 Speaker 1: is going great. All right, Molly, thank you for joining me, 21 00:01:28,280 --> 00:01:31,800 Speaker 1: Thanks for having me. So, I don't know how I 22 00:01:31,920 --> 00:01:34,520 Speaker 1: was to ask this book, Why should people actually care 23 00:01:34,560 --> 00:01:36,000 Speaker 1: about Wikipedia today? 24 00:01:37,680 --> 00:01:40,200 Speaker 2: Well, I mean, I think Wikipedia is one of the 25 00:01:40,280 --> 00:01:44,319 Speaker 2: highest quality resources there is on the web today. I 26 00:01:44,319 --> 00:01:48,080 Speaker 2: think everyone uses it sometimes without even realizing it. Given 27 00:01:48,120 --> 00:01:53,600 Speaker 2: that Wikipedia data powers so much of you know, AI chat, 28 00:01:53,720 --> 00:01:57,920 Speaker 2: GPT responses or home assistants that are answering your questions 29 00:01:58,000 --> 00:02:02,080 Speaker 2: or sirie. So I think keeping the quality of information 30 00:02:02,400 --> 00:02:06,640 Speaker 2: on Wikipedia high, or even improving it even further, is 31 00:02:07,000 --> 00:02:08,720 Speaker 2: one of the most important things that we need to 32 00:02:08,760 --> 00:02:09,520 Speaker 2: be focusing on. 33 00:02:10,760 --> 00:02:13,160 Speaker 1: But one of the big critiques of it. And I know, 34 00:02:13,240 --> 00:02:15,480 Speaker 1: and this is very much a podcast for people who 35 00:02:15,520 --> 00:02:17,959 Speaker 1: might be techy, but people also who are just being 36 00:02:18,000 --> 00:02:20,960 Speaker 1: exposed to tech outside of the consumer realm. How is 37 00:02:20,960 --> 00:02:23,840 Speaker 1: it reliable? Because that is the big that's the big question. 38 00:02:24,080 --> 00:02:26,080 Speaker 1: How is this something you can trust? 39 00:02:27,000 --> 00:02:28,919 Speaker 2: Well, I think, like a lot of things, you need 40 00:02:28,960 --> 00:02:30,960 Speaker 2: to take it with a grain of salt and understand 41 00:02:31,000 --> 00:02:35,880 Speaker 2: that there is variation on Wikipedia between you know, some 42 00:02:36,000 --> 00:02:38,960 Speaker 2: very high quality articles that have been reviewed by hundreds 43 00:02:39,000 --> 00:02:42,360 Speaker 2: of people versus some that are fairly new and you know, 44 00:02:42,440 --> 00:02:46,440 Speaker 2: being created still. So I think it's important to realize 45 00:02:46,440 --> 00:02:49,040 Speaker 2: that there is the possibility that whatever you're reading is 46 00:02:49,120 --> 00:02:52,880 Speaker 2: not reliable, and there is some onus on the reader 47 00:02:53,040 --> 00:02:56,600 Speaker 2: to verify that what they're reading is correct. But I 48 00:02:56,639 --> 00:02:59,880 Speaker 2: think that generally speaking, the quality control and Wikipedia is 49 00:03:00,000 --> 00:03:03,519 Speaker 2: actually pretty good. There are a lot of editors who 50 00:03:03,560 --> 00:03:09,120 Speaker 2: are constantly maintaining the platform, making sure that the material 51 00:03:09,200 --> 00:03:13,200 Speaker 2: on there is you know, well sourced, it's coming from 52 00:03:13,360 --> 00:03:18,080 Speaker 2: high quality, reputable publications, and that you know, it is 53 00:03:18,639 --> 00:03:21,600 Speaker 2: meeting the criteria that Wikipedia puts in place. And I 54 00:03:21,600 --> 00:03:24,360 Speaker 2: think the result of that is that the content on 55 00:03:24,400 --> 00:03:26,760 Speaker 2: Wikipedia tends to be very high quality. 56 00:03:27,520 --> 00:03:31,040 Speaker 1: And how does the actual moderation work, So what goes 57 00:03:31,080 --> 00:03:33,200 Speaker 1: into a Wikipedia page? 58 00:03:33,400 --> 00:03:37,160 Speaker 2: Well, it's a little bit ad hoc. There's not really 59 00:03:37,320 --> 00:03:41,160 Speaker 2: a process in which every page has to go through 60 00:03:41,200 --> 00:03:45,960 Speaker 2: a set of criteria or anything like that. Generally speaking, 61 00:03:46,160 --> 00:03:49,920 Speaker 2: anyone can edit Wikipedia and you know, contribute to the 62 00:03:49,920 --> 00:03:54,200 Speaker 2: best of their ability, and when someone does something that 63 00:03:54,320 --> 00:03:58,600 Speaker 2: is not in line with Wikipedia's requirements, then hopefully, you know, 64 00:03:58,640 --> 00:04:01,000 Speaker 2: the idea is that hopefully someone will come along and 65 00:04:01,560 --> 00:04:04,800 Speaker 2: notice that, revert the change or improve it. So that 66 00:04:04,880 --> 00:04:08,000 Speaker 2: it does meet the criteria. And I realize that sounds 67 00:04:08,120 --> 00:04:11,400 Speaker 2: very like slap dash, and it is to some extent, 68 00:04:11,560 --> 00:04:15,960 Speaker 2: but because of the sort of processes that have developed 69 00:04:15,960 --> 00:04:19,320 Speaker 2: over the years, it actually works pretty well. Where most 70 00:04:19,400 --> 00:04:22,599 Speaker 2: pages do you get you know, a fairly strong set 71 00:04:22,640 --> 00:04:25,719 Speaker 2: of people who are you know, taking a look at 72 00:04:25,760 --> 00:04:28,320 Speaker 2: the changes that are coming in, making sure that they 73 00:04:28,440 --> 00:04:33,120 Speaker 2: are appropriate for the encyclopedia and you know, allowing them 74 00:04:33,160 --> 00:04:36,760 Speaker 2: to remain or you know, discussing them and reverting them 75 00:04:36,839 --> 00:04:37,599 Speaker 2: if necessary. 76 00:04:38,240 --> 00:04:40,200 Speaker 1: Talk to me a little bit about that process though. 77 00:04:40,279 --> 00:04:43,560 Speaker 2: What happens well someone you know, if someone goes and 78 00:04:43,560 --> 00:04:46,200 Speaker 2: makes an edit to a page. Generally speaking, there are 79 00:04:46,520 --> 00:04:49,600 Speaker 2: you know a fair number of people who are patrolling 80 00:04:50,279 --> 00:04:53,600 Speaker 2: recent changes to the encyclopedia, you know, not necessarily even 81 00:04:53,600 --> 00:04:56,320 Speaker 2: watching that page specifically, but just looking at new edits 82 00:04:56,320 --> 00:05:01,760 Speaker 2: that are coming in and you know, checking them for reliability. 83 00:05:01,839 --> 00:05:05,080 Speaker 2: You know, is there a source included, does the edit 84 00:05:05,400 --> 00:05:08,680 Speaker 2: past this, you know, the smell check. There's also some 85 00:05:08,720 --> 00:05:12,560 Speaker 2: automated processes involved that will try to filter out edits 86 00:05:12,560 --> 00:05:17,359 Speaker 2: that are abusive or you know, match certain patterns. And 87 00:05:17,400 --> 00:05:21,039 Speaker 2: then there's the you know, there's a lot of editors 88 00:05:21,080 --> 00:05:24,360 Speaker 2: who have you know, various pages that they're interested in 89 00:05:24,480 --> 00:05:28,159 Speaker 2: or they have expertise in on what's called their watch list, 90 00:05:28,320 --> 00:05:31,040 Speaker 2: and so they take a look at changes to those 91 00:05:31,080 --> 00:05:33,240 Speaker 2: pages every once in a while and see if you know, 92 00:05:33,320 --> 00:05:35,279 Speaker 2: everything looks all right or if there needs to be 93 00:05:35,320 --> 00:05:38,760 Speaker 2: improvements or edits to you know, what's been added to 94 00:05:38,920 --> 00:05:41,400 Speaker 2: bring them back into line with the quality that we expect. 95 00:05:42,440 --> 00:05:45,839 Speaker 1: So is there an organization? I know there's the Wikimedia Foundation, 96 00:05:45,920 --> 00:05:49,120 Speaker 1: we know that, but is there is there a moderate 97 00:05:49,200 --> 00:05:52,880 Speaker 1: to chat? Is there a place where people congregate or 98 00:05:52,960 --> 00:05:56,120 Speaker 1: is this just entirely I don't want to say decentralized, 99 00:05:56,200 --> 00:05:58,240 Speaker 1: but disorganized. 100 00:05:58,560 --> 00:06:02,840 Speaker 2: Well, it's certainly disorganized there. Yeah, so there are there 101 00:06:02,880 --> 00:06:06,719 Speaker 2: are actually kind of a bunch of places where people congregate, 102 00:06:07,920 --> 00:06:10,599 Speaker 2: and some people use some of them and some people don't. 103 00:06:11,040 --> 00:06:14,880 Speaker 2: But you know, there's there are various places on the 104 00:06:14,960 --> 00:06:18,919 Speaker 2: encyclopedia that are you know, project specific pages where people 105 00:06:19,000 --> 00:06:22,200 Speaker 2: discuss issues that are coming up or flag you know, 106 00:06:22,320 --> 00:06:26,000 Speaker 2: things that need more attention. You know, not everyone has 107 00:06:26,080 --> 00:06:29,000 Speaker 2: like the ability, for example, to block an editor who's 108 00:06:29,040 --> 00:06:31,960 Speaker 2: being disruptive, So editors who can't do that themselves will 109 00:06:32,000 --> 00:06:35,400 Speaker 2: report it for people who can and then there's you know, 110 00:06:35,839 --> 00:06:40,040 Speaker 2: more real time places like we still use IRC, believe 111 00:06:40,040 --> 00:06:42,440 Speaker 2: it or not, but there's also. 112 00:06:42,360 --> 00:06:45,640 Speaker 1: For that and for listeners. By the way, IRC is 113 00:06:45,839 --> 00:06:47,960 Speaker 1: what a thirty year old sharewab product. 114 00:06:48,400 --> 00:06:52,640 Speaker 2: Yeah, it's basically one of the first online chat protocols. 115 00:06:53,600 --> 00:06:57,320 Speaker 1: I was a polarist IRC guy. But for listeners as well, 116 00:06:57,440 --> 00:07:01,120 Speaker 1: discord is very very much I believe Discorde is actually 117 00:07:01,120 --> 00:07:03,560 Speaker 1: somehow built on IOCA. That's a different episode thought. 118 00:07:03,640 --> 00:07:05,000 Speaker 2: I'm not sure if it actually is or if it 119 00:07:05,120 --> 00:07:08,120 Speaker 2: just interoperates well with it. But there's also a Wikipedia 120 00:07:08,400 --> 00:07:11,040 Speaker 2: Discord now for people who don't want to go figure 121 00:07:11,040 --> 00:07:14,040 Speaker 2: out how IRC works. So you know, there's a bunch 122 00:07:14,040 --> 00:07:18,720 Speaker 2: of different places where people can chat and talk about 123 00:07:18,760 --> 00:07:22,360 Speaker 2: working on the project, different formats. 124 00:07:22,440 --> 00:07:24,560 Speaker 1: And there's no compensation of any kind. 125 00:07:25,200 --> 00:07:28,280 Speaker 2: Correct, not unless you are, you know, an employee of 126 00:07:28,280 --> 00:07:31,280 Speaker 2: the Wikimedia Foundation, who is you know, writing the actual 127 00:07:31,320 --> 00:07:34,800 Speaker 2: media wiki software or you know, performing one of those tasks. 128 00:07:34,800 --> 00:07:37,760 Speaker 2: But that's a very small organization in comparison to the 129 00:07:37,840 --> 00:07:41,000 Speaker 2: number of people who edit the encyclopedia, the vast majority 130 00:07:41,040 --> 00:07:45,600 Speaker 2: of whom do so for you know, for free. Technically, 131 00:07:45,640 --> 00:07:48,600 Speaker 2: you there are some people who edit Wikipedia articles for 132 00:07:48,760 --> 00:07:51,880 Speaker 2: pay as a part of like PR strategies and stuff. 133 00:07:51,920 --> 00:07:56,360 Speaker 2: But that's a whole can of worms and it's somewhat controversial. 134 00:07:56,920 --> 00:07:59,560 Speaker 1: Well, that's actually a good question. So I run a 135 00:07:59,560 --> 00:08:01,840 Speaker 1: p off and I know that that people will come 136 00:08:01,880 --> 00:08:03,560 Speaker 1: along on both sides and say, can you get me 137 00:08:03,600 --> 00:08:06,440 Speaker 1: a Wikipedia page? And then others will say for this 138 00:08:06,840 --> 00:08:09,600 Speaker 1: incredibly large sum of money, I can do this for you. 139 00:08:10,280 --> 00:08:14,040 Speaker 1: And it feels like those people are fighting a losing battle. 140 00:08:14,080 --> 00:08:18,480 Speaker 1: It's one of the rare cases where capitalism can't really win. 141 00:08:18,960 --> 00:08:21,280 Speaker 1: And how is it that Wikipedia is so resistant to 142 00:08:21,320 --> 00:08:22,160 Speaker 1: that kind of stuff? 143 00:08:23,600 --> 00:08:29,560 Speaker 2: Right, So there's a lot of resistance from the wikimedia 144 00:08:29,600 --> 00:08:34,480 Speaker 2: community towards people who are editing for promotional purposes, because, 145 00:08:34,559 --> 00:08:35,920 Speaker 2: you know, the whole point of the project is it 146 00:08:36,000 --> 00:08:39,840 Speaker 2: supposed to be an encyclopedia, not an advertising space, not 147 00:08:39,920 --> 00:08:42,480 Speaker 2: a resume, you know, not a place to promote your 148 00:08:42,480 --> 00:08:45,959 Speaker 2: business or your product. But of course people want information 149 00:08:46,040 --> 00:08:49,559 Speaker 2: on Wikipedia about them, and so there are some people 150 00:08:49,720 --> 00:08:52,679 Speaker 2: who are able to sort of tread that line where 151 00:08:53,240 --> 00:08:58,320 Speaker 2: they understand wikimedia policies very well. They understand, you know, 152 00:08:58,360 --> 00:09:01,040 Speaker 2: what is allowed as far as writing about, you know, 153 00:09:01,080 --> 00:09:04,800 Speaker 2: a company without being promotional they understand what that you know, 154 00:09:05,000 --> 00:09:07,960 Speaker 2: organization or company or person would need to accomplish in 155 00:09:08,040 --> 00:09:11,840 Speaker 2: order to you know, achieve those notability requirements, and then 156 00:09:11,880 --> 00:09:15,040 Speaker 2: they can write about them, and they do so very transparently. 157 00:09:15,120 --> 00:09:18,319 Speaker 2: You know, they disclose that they are doing so for compensation, 158 00:09:18,480 --> 00:09:21,520 Speaker 2: you know that they've been hired by that person or organization, 159 00:09:22,559 --> 00:09:25,280 Speaker 2: and they go through the process and it's very you know, 160 00:09:25,600 --> 00:09:29,839 Speaker 2: carefully orchestrated. But there are also a bunch of people 161 00:09:29,840 --> 00:09:31,760 Speaker 2: who do so sort of on the you know, on 162 00:09:31,840 --> 00:09:35,559 Speaker 2: the sly and they don't really disclose that they're doing 163 00:09:35,600 --> 00:09:38,600 Speaker 2: it for pay. Those are often the people who will 164 00:09:38,760 --> 00:09:40,880 Speaker 2: sort of cold call you, like if you've gotten emails, 165 00:09:40,880 --> 00:09:43,160 Speaker 2: they're like, hey, I can make you Wikipedia page. A 166 00:09:43,240 --> 00:09:46,760 Speaker 2: lot of those are sort of scammy, uh, not so 167 00:09:47,200 --> 00:09:52,040 Speaker 2: reputable organizations, and you know, generally speaking, they actually have 168 00:09:52,120 --> 00:09:54,360 Speaker 2: a pretty hard time doing what they claim that they 169 00:09:54,400 --> 00:09:57,559 Speaker 2: can do because they don't tend to understand the policy 170 00:09:57,600 --> 00:10:01,400 Speaker 2: as well. They try to shoe horn articles into Wikipedia 171 00:10:01,400 --> 00:10:05,400 Speaker 2: about subjects that are not notable or that are too promotional, 172 00:10:05,840 --> 00:10:08,760 Speaker 2: and so they tend to get taken down, much to 173 00:10:08,840 --> 00:10:10,640 Speaker 2: the dismay of the people who pay them. Quite a 174 00:10:10,640 --> 00:10:12,960 Speaker 2: lot of money. So if you're listening to this and 175 00:10:12,960 --> 00:10:14,440 Speaker 2: you've heard from one of those people, I would not 176 00:10:14,559 --> 00:10:16,079 Speaker 2: recommend hiring them. 177 00:10:16,440 --> 00:10:19,120 Speaker 1: Just become more important, which I guess is kind of 178 00:10:19,160 --> 00:10:24,160 Speaker 1: the question. How does Wikipedia actually judge notoriety? What is important? 179 00:10:24,800 --> 00:10:30,200 Speaker 2: Yeah, So, generally speaking, the notability requirements is, you know, 180 00:10:30,640 --> 00:10:34,960 Speaker 2: it basically goes on how much coverage a subject has 181 00:10:35,040 --> 00:10:39,480 Speaker 2: received in reliable sources that are independent of the subject. 182 00:10:40,040 --> 00:10:43,720 Speaker 2: So you know, if a big newspaper or several newspapers 183 00:10:43,720 --> 00:10:48,080 Speaker 2: write about you in detail, chances are you might be notable, 184 00:10:48,160 --> 00:10:51,760 Speaker 2: en not for Wikipedia article. On the flip side, if 185 00:10:51,800 --> 00:10:55,080 Speaker 2: you're just publishing press releases that's not independent of you, 186 00:10:55,559 --> 00:10:59,319 Speaker 2: that probably doesn't qualify. If there's just one brief mention 187 00:10:59,440 --> 00:11:02,600 Speaker 2: of you a reliable source someplays, that's probably not sufficient. 188 00:11:03,360 --> 00:11:07,160 Speaker 2: So that's sort of the general guideline. There are other 189 00:11:07,360 --> 00:11:10,760 Speaker 2: sort of more specific requirements for some sorts of like 190 00:11:11,480 --> 00:11:14,600 Speaker 2: specific types of topics, you know, like sports players and 191 00:11:14,640 --> 00:11:18,120 Speaker 2: things like that. But generally speaking, even if someone meets 192 00:11:18,120 --> 00:11:21,240 Speaker 2: one of those guidelines, they've already met that general guideline, 193 00:11:21,280 --> 00:11:24,200 Speaker 2: where you know, it's just about the amount of coverage 194 00:11:24,240 --> 00:11:27,439 Speaker 2: in reliable sources and that can be pressed, that can 195 00:11:27,480 --> 00:11:31,040 Speaker 2: be academia, can be books, you know, anything that sort 196 00:11:31,040 --> 00:11:33,080 Speaker 2: of qualifies as a reliable source. 197 00:11:46,120 --> 00:11:48,760 Speaker 1: Now recently seen it, one of the largest tech sites 198 00:11:48,760 --> 00:11:55,240 Speaker 1: actually found itself downgraded in reliability or were you aware 199 00:11:55,280 --> 00:11:56,920 Speaker 1: of that situation? Did you see that happen? 200 00:11:57,600 --> 00:12:00,000 Speaker 2: I saw the conversations about it. I wasn't a participant 201 00:12:00,080 --> 00:12:02,040 Speaker 2: in them, but I sort of watched it happen. 202 00:12:02,240 --> 00:12:06,679 Speaker 1: Yeah, So for the listeners seen it has hundreds of 203 00:12:06,720 --> 00:12:09,000 Speaker 1: that hundreds of millions of views a month. It's one 204 00:12:09,000 --> 00:12:13,320 Speaker 1: of the most notable websites, frankly, and has been for decades. 205 00:12:13,920 --> 00:12:19,800 Speaker 1: So what happened there? Why was this significant website considered 206 00:12:19,880 --> 00:12:21,640 Speaker 1: less reliable by Wikipedia? 207 00:12:22,559 --> 00:12:27,640 Speaker 2: Yeah? So Wikipedia has these discussions pretty frequently about different 208 00:12:27,679 --> 00:12:31,160 Speaker 2: sources and whether or not those sources are generally reliable 209 00:12:31,400 --> 00:12:35,960 Speaker 2: or sometimes reliable or usually not reliable. And in order 210 00:12:36,040 --> 00:12:38,400 Speaker 2: to sort of prevent people from having to have that 211 00:12:38,480 --> 00:12:41,240 Speaker 2: same conversation every single time they want to cite a 212 00:12:41,280 --> 00:12:44,120 Speaker 2: source that's heavily cited, you know, like CE or you know, 213 00:12:44,160 --> 00:12:46,800 Speaker 2: the New York Times or something like that, we have 214 00:12:46,920 --> 00:12:51,400 Speaker 2: this list of very commonly discussed sources where we just discuss, 215 00:12:51,480 --> 00:12:55,160 Speaker 2: you know, we outline the general reliability of that source. 216 00:12:55,600 --> 00:12:58,200 Speaker 2: And what you're referring to is a recent discussion where 217 00:12:58,400 --> 00:13:01,920 Speaker 2: some editors decided we really needed to revisit the general 218 00:13:01,960 --> 00:13:06,679 Speaker 2: reliability of c net, which was previously considered fairly reliable 219 00:13:06,760 --> 00:13:10,160 Speaker 2: for sourcing on Wikipedia, and the reason for that is 220 00:13:10,200 --> 00:13:12,480 Speaker 2: that there had been a pretty noticeable change in the 221 00:13:12,559 --> 00:13:16,920 Speaker 2: quality of the UH material that they were publishing, where it, 222 00:13:17,160 --> 00:13:20,760 Speaker 2: you know, it no longer had the level of accuracy 223 00:13:20,920 --> 00:13:24,600 Speaker 2: that it once did. You know, the articles that were 224 00:13:24,600 --> 00:13:27,480 Speaker 2: being published didn't seem to be being edited well or 225 00:13:27,520 --> 00:13:30,880 Speaker 2: fact checked well. And so we decided that, you know, 226 00:13:32,000 --> 00:13:34,160 Speaker 2: if you're just if you're planning to use c net 227 00:13:34,200 --> 00:13:38,520 Speaker 2: as a source, you should really consider this, and you know, 228 00:13:38,600 --> 00:13:42,200 Speaker 2: consider that it's probably not even as reliable as it 229 00:13:42,240 --> 00:13:45,040 Speaker 2: once was, and maybe use something else, because you know, 230 00:13:45,160 --> 00:13:49,880 Speaker 2: they've started using AI generated content and stuff like that 231 00:13:49,880 --> 00:13:53,200 Speaker 2: that has you know, noticeably affected the quality of the 232 00:13:53,360 --> 00:13:55,200 Speaker 2: articles that they publish. 233 00:13:55,240 --> 00:13:58,080 Speaker 1: And what is the what are the ramifications of that downgrading? 234 00:13:58,120 --> 00:13:59,520 Speaker 1: What does that mean? Practically? 235 00:14:00,640 --> 00:14:03,320 Speaker 2: It means that, for you know, if you're writing a 236 00:14:03,320 --> 00:14:06,320 Speaker 2: new article in Wikipedia and you want to use c 237 00:14:06,440 --> 00:14:10,320 Speaker 2: net as a source, you are somewhat discouraged from doing so. Now, 238 00:14:10,920 --> 00:14:14,079 Speaker 2: it means that where c net has been used as 239 00:14:14,080 --> 00:14:17,480 Speaker 2: a source already, editors are going to be looking to 240 00:14:18,320 --> 00:14:21,760 Speaker 2: you know, improve that sourcing, generally by swapping it out 241 00:14:21,800 --> 00:14:26,960 Speaker 2: with something that's more reliable. Although, you know, as notability 242 00:14:27,080 --> 00:14:31,600 Speaker 2: changes in these publications, sometimes older content that was written 243 00:14:32,360 --> 00:14:36,120 Speaker 2: or published prior to a change at the organization will 244 00:14:36,160 --> 00:14:40,640 Speaker 2: be considered reliable, whereas newer content that's being created now 245 00:14:40,680 --> 00:14:43,480 Speaker 2: and might be using you know, AI or other tools, 246 00:14:43,960 --> 00:14:47,200 Speaker 2: is not considered reliable anymore. So, for example, we have 247 00:14:47,240 --> 00:14:49,920 Speaker 2: a couple of sources where we'll say that, you know, 248 00:14:49,960 --> 00:14:53,560 Speaker 2: anything they published before twenty fifteen is fine, but anything 249 00:14:53,600 --> 00:14:55,480 Speaker 2: after that you should take with a grain of salt. 250 00:14:55,840 --> 00:14:58,000 Speaker 2: So I think that's partly the case with c net 251 00:14:58,120 --> 00:15:00,160 Speaker 2: now as well. 252 00:15:00,160 --> 00:15:02,800 Speaker 1: Why do you think that Wikipedia is so much better 253 00:15:03,320 --> 00:15:08,600 Speaker 1: at quality control than I don't know, Google, Well. 254 00:15:08,480 --> 00:15:12,200 Speaker 2: I think there's a very different set of incentives at play. 255 00:15:13,280 --> 00:15:17,960 Speaker 2: You know, Google is a profit driven corporation. They have 256 00:15:18,200 --> 00:15:22,440 Speaker 2: other motivations besides providing the person who is using their 257 00:15:22,440 --> 00:15:25,480 Speaker 2: search engine with the most relevant results. You know, that 258 00:15:25,600 --> 00:15:29,520 Speaker 2: is ostensibly what they do, but they also have motivations 259 00:15:29,560 --> 00:15:33,600 Speaker 2: around generating ad revenue and clicks for you know, different 260 00:15:33,760 --> 00:15:37,200 Speaker 2: groups that are using their service or advertising with them. 261 00:15:37,760 --> 00:15:42,960 Speaker 2: Wikipedia doesn't have those sometimes conflicting incentives. You know, the 262 00:15:43,280 --> 00:15:46,600 Speaker 2: goal of Wikipedia is to deliver the highest quality information 263 00:15:46,680 --> 00:15:51,840 Speaker 2: that we can, and we're not you know, there's no advertisements. 264 00:15:51,840 --> 00:15:54,880 Speaker 2: We're not trying to surface some specific set of content 265 00:15:54,960 --> 00:15:58,280 Speaker 2: for people based on who is paying, you know, or 266 00:15:58,600 --> 00:16:01,600 Speaker 2: or not paying. It's really just that one incentive, and 267 00:16:01,640 --> 00:16:04,280 Speaker 2: I think that simplifies things because we can all sort 268 00:16:04,280 --> 00:16:07,920 Speaker 2: of get behind that one goal and you know, we 269 00:16:07,960 --> 00:16:10,640 Speaker 2: aren't trying to juggle multiple things that are sometimes very 270 00:16:10,680 --> 00:16:12,880 Speaker 2: much in conflict with one another, as I think we've 271 00:16:12,880 --> 00:16:15,640 Speaker 2: seen with Google, where you know, that goal of providing 272 00:16:15,680 --> 00:16:19,120 Speaker 2: the highest quality information to the search user has been 273 00:16:19,760 --> 00:16:24,960 Speaker 2: very much challenged by the other incentives at play. 274 00:16:25,040 --> 00:16:28,880 Speaker 1: So you've edited Wikipedia for about eighteen years. I think 275 00:16:29,160 --> 00:16:31,840 Speaker 1: the simplest question is why do you still do it? 276 00:16:31,880 --> 00:16:35,600 Speaker 1: Why did you start? Why do you continue? Well? 277 00:16:35,640 --> 00:16:39,120 Speaker 2: I started because I was a curious kid who discovered 278 00:16:39,120 --> 00:16:42,840 Speaker 2: that I could and that was exciting for me, and 279 00:16:43,240 --> 00:16:44,480 Speaker 2: I have I don't know. I think there are just 280 00:16:44,520 --> 00:16:47,000 Speaker 2: some people out there who have like a very specific 281 00:16:47,040 --> 00:16:50,880 Speaker 2: type of brain where Wikipedia just like sort of tickles 282 00:16:50,920 --> 00:16:53,320 Speaker 2: it just right, you know, and like doing that kind 283 00:16:53,320 --> 00:16:58,520 Speaker 2: of editing and you know, curation just appeals to me. 284 00:16:58,600 --> 00:17:02,000 Speaker 2: I find it really enjoyable. But I also think that, 285 00:17:02,080 --> 00:17:08,360 Speaker 2: you know, the project is a really wonderful, uh creation, 286 00:17:08,680 --> 00:17:10,919 Speaker 2: you know, and I think it provides a really important 287 00:17:10,960 --> 00:17:13,800 Speaker 2: service to the people who have access to it, which 288 00:17:13,840 --> 00:17:16,840 Speaker 2: is almost anyone. I mean, it's anyone with an Internet connection, 289 00:17:17,000 --> 00:17:19,000 Speaker 2: and there are even ways to get access to Wikipedia 290 00:17:19,040 --> 00:17:22,040 Speaker 2: without an Internet connection. And so I think that, you know, 291 00:17:22,200 --> 00:17:27,600 Speaker 2: maintaining a quality source of high you know, reliability material 292 00:17:27,760 --> 00:17:31,520 Speaker 2: is incredibly important and arguably getting only more important as 293 00:17:31,720 --> 00:17:35,240 Speaker 2: other sources of that same type of content, you know, 294 00:17:35,280 --> 00:17:37,960 Speaker 2: are becoming degraded in the way that you just described 295 00:17:37,960 --> 00:17:40,760 Speaker 2: with Google for example. So you know, I'm very passionate 296 00:17:40,800 --> 00:17:44,400 Speaker 2: about it. I think providing high quality information to everyone 297 00:17:45,160 --> 00:17:47,600 Speaker 2: is one of the most important things that sort of 298 00:17:47,680 --> 00:17:51,560 Speaker 2: humanity needs to achieve, and so I find that very 299 00:17:51,800 --> 00:17:52,840 Speaker 2: strongly motivating. 300 00:17:53,920 --> 00:17:57,600 Speaker 1: And is there any pressure a toll from the Wikimedia Foundation? 301 00:17:57,720 --> 00:18:01,600 Speaker 1: Do they push people around a toll A not going 302 00:18:01,600 --> 00:18:04,000 Speaker 1: either way. I'm just wondering what influence they have on 303 00:18:04,040 --> 00:18:04,680 Speaker 1: the platform. 304 00:18:05,359 --> 00:18:08,280 Speaker 2: Yeah, it's a kind of an unusual relationship between the 305 00:18:08,280 --> 00:18:11,480 Speaker 2: Wi Community Foundation and the editing community. It's sometimes a 306 00:18:11,520 --> 00:18:16,000 Speaker 2: little bit adversarial, which is sometimes a little confusing to 307 00:18:16,040 --> 00:18:18,880 Speaker 2: people who are new to the project. But I would 308 00:18:18,920 --> 00:18:22,320 Speaker 2: say no, the Wi Commedia Foundation generally takes a pretty 309 00:18:22,359 --> 00:18:26,359 Speaker 2: hands off approach towards the projects that you know, it 310 00:18:26,760 --> 00:18:32,320 Speaker 2: kind of supports. The Wikimedia Foundation almost never comes in 311 00:18:32,400 --> 00:18:34,399 Speaker 2: and says, hey, you need to delete that, or you 312 00:18:34,440 --> 00:18:37,680 Speaker 2: need to you know, do something very specific with the content. 313 00:18:38,080 --> 00:18:42,600 Speaker 2: There are some very edge cases with like legal requests 314 00:18:42,600 --> 00:18:44,520 Speaker 2: and things like that where they can sometimes do that, 315 00:18:44,560 --> 00:18:47,760 Speaker 2: but they are very very conservative on when they will 316 00:18:47,800 --> 00:18:50,239 Speaker 2: do that, and for the most part they take that 317 00:18:50,440 --> 00:18:54,240 Speaker 2: very hands off approach. The Wi Community Foundation is mostly 318 00:18:54,359 --> 00:18:58,080 Speaker 2: concerned with, you know, paying the bills, keeping the site online, 319 00:18:58,640 --> 00:19:02,480 Speaker 2: developing the soft where that actually powers the encyclopedia and 320 00:19:02,520 --> 00:19:05,480 Speaker 2: the editing infrastructure and all that, and then you know, 321 00:19:05,640 --> 00:19:08,840 Speaker 2: trying to contribute and grow the editing community and that 322 00:19:08,880 --> 00:19:11,560 Speaker 2: type of thing. But they are pretty hands off when 323 00:19:11,560 --> 00:19:14,680 Speaker 2: it comes to editing, which I think is actually quite good. 324 00:19:15,160 --> 00:19:19,480 Speaker 2: It allows the community to you know, organically develop its 325 00:19:19,520 --> 00:19:24,600 Speaker 2: own policies and protocols and things like that, and you know, 326 00:19:24,640 --> 00:19:27,359 Speaker 2: without the Wikimedia Foundation sort of putting its thumb on 327 00:19:27,400 --> 00:19:28,840 Speaker 2: the scale. 328 00:19:29,440 --> 00:19:33,360 Speaker 1: So more practical question, as an editor, what do you do? 329 00:19:33,560 --> 00:19:36,399 Speaker 1: Is it just editing? The reason I ask this is 330 00:19:36,680 --> 00:19:40,400 Speaker 1: you've provided me with your Wikipedia editor page, and there's 331 00:19:40,440 --> 00:19:44,000 Speaker 1: things like on block reblog, blog thanks. What do you 332 00:19:44,280 --> 00:19:47,200 Speaker 1: do as an editor? What is what are these numbers about? 333 00:19:47,640 --> 00:19:49,399 Speaker 1: Because it feels like there's kind of a niche social 334 00:19:49,440 --> 00:19:50,720 Speaker 1: network and inside it. 335 00:19:51,720 --> 00:19:54,800 Speaker 2: Yeah, there's a lot that happens behind the scenes besides 336 00:19:54,960 --> 00:19:58,680 Speaker 2: just you know, opening up a page and writing something new. 337 00:19:59,280 --> 00:20:01,000 Speaker 2: I do do so of that, you know, I do 338 00:20:01,200 --> 00:20:03,440 Speaker 2: write pages from scratch once in a while, or I'll 339 00:20:03,560 --> 00:20:05,520 Speaker 2: go in and edit something if a page is missing 340 00:20:05,600 --> 00:20:08,200 Speaker 2: something or need something. But there's also a whole bunch 341 00:20:08,240 --> 00:20:12,120 Speaker 2: of other sort of administrative work I guess that goes 342 00:20:12,160 --> 00:20:16,040 Speaker 2: on behind the scenes, where I will sometimes you know, 343 00:20:16,400 --> 00:20:19,280 Speaker 2: patrol articles. That's sort of what I had described earlier, 344 00:20:19,320 --> 00:20:22,920 Speaker 2: where people will watch the recent changes to the project 345 00:20:22,920 --> 00:20:26,080 Speaker 2: and try to just filter out anything that looks abusive 346 00:20:26,320 --> 00:20:30,440 Speaker 2: or you know, disruptive or honestly just less than productive. 347 00:20:31,119 --> 00:20:33,879 Speaker 2: Sometimes I try to, you know, go around and welcome 348 00:20:33,960 --> 00:20:36,480 Speaker 2: new editors who are just joining the projects and hims 349 00:20:36,480 --> 00:20:38,359 Speaker 2: they need help, and so I'll try to sort of, 350 00:20:38,920 --> 00:20:41,520 Speaker 2: you know, show them the ropes a little bit. And 351 00:20:41,560 --> 00:20:45,200 Speaker 2: then there's you know, the blocking and deletion and things 352 00:20:45,240 --> 00:20:48,000 Speaker 2: like that, where people who are known as administrators on 353 00:20:48,040 --> 00:20:53,280 Speaker 2: the project can block an editor who is regularly being 354 00:20:53,320 --> 00:20:56,760 Speaker 2: disruptive or delete a page if it doesn't meet the 355 00:20:56,800 --> 00:20:59,840 Speaker 2: requirements that we have defined in our policies and things 356 00:20:59,880 --> 00:21:01,640 Speaker 2: like that, and so I do some of that as well. 357 00:21:02,400 --> 00:21:04,159 Speaker 1: So how does one become an administrator? 358 00:21:06,160 --> 00:21:09,960 Speaker 2: So there's a whole process called request for admanship where 359 00:21:10,200 --> 00:21:15,040 Speaker 2: you either put yourself up, you nominate yourself as a candidate, 360 00:21:15,240 --> 00:21:17,560 Speaker 2: or someone else who thinks that you would be a 361 00:21:17,560 --> 00:21:20,480 Speaker 2: good administrator will do so. And then there's a long 362 00:21:20,560 --> 00:21:23,280 Speaker 2: process where you answer some questions and then the wikimedia 363 00:21:23,280 --> 00:21:27,119 Speaker 2: community votes pretty much on whether or not you you know, 364 00:21:27,200 --> 00:21:32,280 Speaker 2: have the experience, the you know, the mindset basically, the 365 00:21:32,320 --> 00:21:37,440 Speaker 2: right attitude towards contributing, and then there's a vote pretty 366 00:21:37,480 --> 00:21:39,680 Speaker 2: much and if you pass the threshold, then you are 367 00:21:40,880 --> 00:21:44,399 Speaker 2: nominated or you know, you become an administrator at that point. 368 00:21:44,520 --> 00:21:48,040 Speaker 2: So it's something that happens, you know, fairly regularly. I 369 00:21:48,080 --> 00:21:52,320 Speaker 2: did it in twenty ten, I think but you know, 370 00:21:52,680 --> 00:21:54,800 Speaker 2: various people go through it pretty often. 371 00:22:06,080 --> 00:22:09,359 Speaker 1: Have you seen more people joining as editors or is 372 00:22:09,359 --> 00:22:14,840 Speaker 1: there more people or less people editing these days? 373 00:22:15,600 --> 00:22:19,040 Speaker 2: I would say there's been sort of a slow decline 374 00:22:19,240 --> 00:22:22,159 Speaker 2: in the number of people who are joining if you 375 00:22:22,240 --> 00:22:26,760 Speaker 2: look at you know, people who join as like a 376 00:22:26,880 --> 00:22:31,280 Speaker 2: user and then continue to edit somewhat frequently versus you know, not. 377 00:22:31,400 --> 00:22:33,159 Speaker 2: I'm not just talking about people who like correct a 378 00:22:33,200 --> 00:22:34,720 Speaker 2: typo once in a while, but the sort of like 379 00:22:34,800 --> 00:22:40,960 Speaker 2: regular contributors is sort of yeah, it's it's either flatlining 380 00:22:41,040 --> 00:22:44,040 Speaker 2: or declining to some extent. Uh, And it has been 381 00:22:44,080 --> 00:22:47,199 Speaker 2: for a really long time, and so it's sort of 382 00:22:47,240 --> 00:22:51,760 Speaker 2: this continual discussion that happens within Wikipedia about you know what, 383 00:22:51,760 --> 00:22:53,520 Speaker 2: what do we do about that? How do we encourage 384 00:22:53,520 --> 00:22:55,800 Speaker 2: more people to join, How do we make the project 385 00:22:55,800 --> 00:22:59,200 Speaker 2: more welcoming to new people who want to get involved? 386 00:22:59,720 --> 00:23:02,480 Speaker 2: You know, something that I try to focus on as well, 387 00:23:02,560 --> 00:23:04,680 Speaker 2: just because I think it's really important to not only 388 00:23:04,720 --> 00:23:07,040 Speaker 2: contribute to the encyclopedia, but make sure that other people 389 00:23:07,040 --> 00:23:10,080 Speaker 2: are aware that they can and you know, doing so 390 00:23:10,200 --> 00:23:11,919 Speaker 2: if they have the desire to do so. 391 00:23:12,760 --> 00:23:15,680 Speaker 1: Are they training materials? Is there an onboarding process? 392 00:23:16,920 --> 00:23:21,159 Speaker 2: There are. In fact, there are many of such things 393 00:23:21,200 --> 00:23:24,719 Speaker 2: that different people have created, and the Wi Community Foundation 394 00:23:24,760 --> 00:23:27,160 Speaker 2: themselves has gotten a little more involved in recent years 395 00:23:27,200 --> 00:23:30,120 Speaker 2: and trying to make that sign up process a little 396 00:23:30,160 --> 00:23:32,440 Speaker 2: bit more friendly, so you know, when you first create 397 00:23:32,440 --> 00:23:34,840 Speaker 2: an account, it doesn't just drop you into the editor 398 00:23:34,880 --> 00:23:36,320 Speaker 2: like good luck, you know, which is kind of what 399 00:23:36,320 --> 00:23:40,520 Speaker 2: it used to do. Now there's some little widgets and 400 00:23:40,560 --> 00:23:43,520 Speaker 2: things that will guide you through making your first edit 401 00:23:43,720 --> 00:23:46,480 Speaker 2: and you know, finding an article that maybe needs some 402 00:23:46,560 --> 00:23:49,320 Speaker 2: improvement so you can give it a shot. Those types 403 00:23:49,359 --> 00:23:52,199 Speaker 2: of things are being developed, and then there's you know, 404 00:23:52,200 --> 00:23:55,040 Speaker 2: community resources to try to encourage people to do the same, 405 00:23:56,119 --> 00:23:59,240 Speaker 2: which are just created by different editors for various purposes. 406 00:24:00,240 --> 00:24:05,080 Speaker 1: Is there any automation Yep, there's there's quite a lot. 407 00:24:05,560 --> 00:24:10,440 Speaker 2: There are Wikipedia editing bots that uh perform various tasks, 408 00:24:11,000 --> 00:24:15,679 Speaker 2: everything from you know, fixing vandalism to you know, introducing 409 00:24:15,840 --> 00:24:19,160 Speaker 2: archive links so that if a source link goes down, 410 00:24:19,760 --> 00:24:21,239 Speaker 2: you know you can still get a copy of that 411 00:24:21,280 --> 00:24:24,480 Speaker 2: source material, all kinds of things like that. There's anti 412 00:24:24,480 --> 00:24:27,240 Speaker 2: spam stuff, you know, where it looks for spammy links 413 00:24:27,280 --> 00:24:29,840 Speaker 2: and removes it. So yeah, there's there's quite a bit 414 00:24:29,880 --> 00:24:34,000 Speaker 2: of automation, both by community members. Well, so some of 415 00:24:34,040 --> 00:24:37,479 Speaker 2: it's yeah, some of it's community members who who you know, 416 00:24:37,600 --> 00:24:40,480 Speaker 2: create bots and maintain them. I would say it's probably 417 00:24:40,560 --> 00:24:43,520 Speaker 2: the majority community members, but there you know, there's also 418 00:24:43,560 --> 00:24:46,480 Speaker 2: the media Wiki software itself, which is maintained by the 419 00:24:46,520 --> 00:24:50,399 Speaker 2: Wikimedia Foundation, but they don't really do much of the 420 00:24:50,520 --> 00:24:52,480 Speaker 2: editing side of things, so most of the editing stuff 421 00:24:52,520 --> 00:24:53,439 Speaker 2: is community based. 422 00:24:54,200 --> 00:24:57,720 Speaker 1: Has there been any discussion of AI or integrating AI 423 00:24:57,840 --> 00:24:58,600 Speaker 1: or anything like that. 424 00:25:00,080 --> 00:25:02,320 Speaker 2: There has been quite a lot of discussion of AI, 425 00:25:02,800 --> 00:25:09,080 Speaker 2: and it's mostly been around two things. The sourcing situation. 426 00:25:09,359 --> 00:25:13,439 Speaker 2: So can you consider AI generated material to be a 427 00:25:13,480 --> 00:25:18,199 Speaker 2: reliable source? Generally speaking? The answer has been, know that 428 00:25:18,480 --> 00:25:21,080 Speaker 2: you have to be very cautious when you know, typing 429 00:25:21,119 --> 00:25:24,480 Speaker 2: something into chat gpt, because a lot of times you'll 430 00:25:24,480 --> 00:25:27,199 Speaker 2: get back something that is not accurate at all. And 431 00:25:27,200 --> 00:25:30,440 Speaker 2: then there's also been discussion around can you use AI 432 00:25:30,560 --> 00:25:34,280 Speaker 2: generated content, you know, as you know, can you tell 433 00:25:34,440 --> 00:25:36,280 Speaker 2: chat gpt or something like that to go make an 434 00:25:36,400 --> 00:25:39,040 Speaker 2: edit for you know, can you use content that chat 435 00:25:39,080 --> 00:25:42,600 Speaker 2: gpt has written, as you know, content that you put 436 00:25:42,600 --> 00:25:46,440 Speaker 2: into a Wikipedia article, And the answer there has largely 437 00:25:46,520 --> 00:25:50,600 Speaker 2: been yeah, you can, but you have to be really 438 00:25:50,640 --> 00:25:53,480 Speaker 2: careful about it, and you have to sort of take 439 00:25:53,560 --> 00:25:57,440 Speaker 2: full responsibility for anything that you have generated and put 440 00:25:57,480 --> 00:26:00,280 Speaker 2: into a Wikipedia article, because you know, it's still up 441 00:26:00,320 --> 00:26:02,600 Speaker 2: to you to fact check it and things like that. 442 00:26:03,200 --> 00:26:07,240 Speaker 2: As far as integrating AI into Wikipedia to try to 443 00:26:07,680 --> 00:26:10,800 Speaker 2: i don't know, generate article content or something more generally, 444 00:26:11,240 --> 00:26:15,480 Speaker 2: that is not something that the community has seemed particularly 445 00:26:15,480 --> 00:26:18,439 Speaker 2: interested in doing, and I suspect it would introduce a 446 00:26:18,440 --> 00:26:22,080 Speaker 2: lot of problems. There were some sort of past attempts 447 00:26:22,119 --> 00:26:26,560 Speaker 2: at doing things sort of like that in the translation space, 448 00:26:27,640 --> 00:26:31,399 Speaker 2: where the wikimedia software would try to encourage people to 449 00:26:31,480 --> 00:26:35,239 Speaker 2: translate articles from one version of Wikipedia to another, and 450 00:26:35,280 --> 00:26:39,520 Speaker 2: that was a largely unsuccessful experiment, I think, where too 451 00:26:39,560 --> 00:26:43,720 Speaker 2: many people were just pasting in machine translation or you know, 452 00:26:43,920 --> 00:26:47,520 Speaker 2: not doing the right kinds of quality control. So we 453 00:26:47,600 --> 00:26:49,520 Speaker 2: have to be really cautious around that type of thing. 454 00:26:50,880 --> 00:26:53,240 Speaker 1: So I think it was a year or two ago 455 00:26:53,520 --> 00:26:56,560 Speaker 1: Elon Musk said he wanted to buy Wikipedia for a 456 00:26:56,600 --> 00:27:00,359 Speaker 1: billion dollars. Just so well clear that is impossible. 457 00:27:00,080 --> 00:27:04,639 Speaker 2: Right, yeah, I mean Wikipedia is not for sale. I 458 00:27:04,680 --> 00:27:08,600 Speaker 2: don't think there's any you know, genuine interest on either 459 00:27:08,720 --> 00:27:12,520 Speaker 2: side for Wikipedia to be sold. I think Elon is 460 00:27:12,560 --> 00:27:15,000 Speaker 2: also particularly not interested in it. He just has a 461 00:27:15,000 --> 00:27:17,439 Speaker 2: long standing beef with Wikipedia. 462 00:27:17,880 --> 00:27:20,679 Speaker 1: Which understandably because they tell the truth, which is not 463 00:27:20,760 --> 00:27:24,560 Speaker 1: a big thing for mister Musk. Now, so you've done 464 00:27:24,600 --> 00:27:27,880 Speaker 1: a lot of advocacy recently about editing and saying why 465 00:27:27,920 --> 00:27:30,720 Speaker 1: people should become editors, But I kind of want to 466 00:27:30,720 --> 00:27:33,960 Speaker 1: hear your sales. Why should the average person edit Wikipedia? 467 00:27:35,760 --> 00:27:39,040 Speaker 2: Well, I mean, I think, honestly, there's probably one hundred 468 00:27:39,080 --> 00:27:41,760 Speaker 2: answers for that, and it depends very much on the person. 469 00:27:42,560 --> 00:27:43,960 Speaker 2: You know, I do it because I love it. I 470 00:27:44,000 --> 00:27:46,840 Speaker 2: find it really enjoyable. Some people do it because they 471 00:27:46,880 --> 00:27:51,800 Speaker 2: think that the resource of Wikipedia is incredibly important and 472 00:27:51,960 --> 00:27:55,240 Speaker 2: needs to be maintained. You know, that's also a factor 473 00:27:55,280 --> 00:27:57,800 Speaker 2: for me. But I think it's okay to also just 474 00:27:57,840 --> 00:27:59,439 Speaker 2: do it just for fun. You know, I think you 475 00:27:59,480 --> 00:28:02,440 Speaker 2: don't have to have some big driving motivation behind it. 476 00:28:02,960 --> 00:28:06,119 Speaker 2: But I do think that, you know, especially now as 477 00:28:06,480 --> 00:28:11,040 Speaker 2: the web is somewhat under threat from this sort of 478 00:28:11,119 --> 00:28:16,640 Speaker 2: AI generated junk that's being you know, becoming so prevalent 479 00:28:16,840 --> 00:28:21,280 Speaker 2: in search results in any particular website that you go visit, 480 00:28:21,800 --> 00:28:24,440 Speaker 2: it's really important to make sure that there is this 481 00:28:24,600 --> 00:28:28,880 Speaker 2: you know, human reviewed material out there that is as 482 00:28:29,040 --> 00:28:33,080 Speaker 2: high quality as possible and that really values you know, 483 00:28:33,119 --> 00:28:37,440 Speaker 2: the quality of information, accuracy, reliability, you know, neutral point 484 00:28:37,480 --> 00:28:40,719 Speaker 2: of view, those types of things above just you know, 485 00:28:40,800 --> 00:28:44,400 Speaker 2: spitting out as much content as possible with little regard 486 00:28:44,520 --> 00:28:48,760 Speaker 2: for its accuracy or reliability. And so I think that 487 00:28:48,880 --> 00:28:51,960 Speaker 2: you know, now more than ever, projects like Wikipedia are 488 00:28:51,960 --> 00:28:56,640 Speaker 2: incredibly important, and you know, maintaining them as other parts 489 00:28:56,640 --> 00:28:59,840 Speaker 2: of the web begin to degrade, is going to only 490 00:28:59,880 --> 00:29:02,360 Speaker 2: be become, you know, a more important thing to do. 491 00:29:03,360 --> 00:29:06,360 Speaker 1: It kind of feels like as this prevalence of AI 492 00:29:06,440 --> 00:29:12,000 Speaker 1: generated content grows, we kind of need user generated content well, 493 00:29:12,080 --> 00:29:13,680 Speaker 1: never as it's being killed off. 494 00:29:15,560 --> 00:29:18,840 Speaker 2: Yeah, it's kind of ironic. I think, you know that 495 00:29:19,480 --> 00:29:22,760 Speaker 2: we're starting to see this proliferation of this AI generated 496 00:29:22,760 --> 00:29:27,520 Speaker 2: content that's really just quantity over quality, and in doing so, 497 00:29:27,640 --> 00:29:30,640 Speaker 2: it's sort of killing the quality content that is often 498 00:29:30,680 --> 00:29:34,520 Speaker 2: the source material for the AI itself. You know, we're 499 00:29:34,560 --> 00:29:38,600 Speaker 2: seeing this decline in journalism, for example, where some outlets 500 00:29:38,640 --> 00:29:41,800 Speaker 2: are laying off their media teams, you know, instead of 501 00:29:42,000 --> 00:29:44,920 Speaker 2: instead hoping that they can use AI to just churn 502 00:29:45,000 --> 00:29:49,320 Speaker 2: out articles. But you know those same AI tools that 503 00:29:49,320 --> 00:29:52,520 Speaker 2: they're using were trained on journalism. They were trained on 504 00:29:52,520 --> 00:29:54,880 Speaker 2: the types of stuff that is no longer being created, 505 00:29:55,320 --> 00:29:57,840 Speaker 2: and so you end up with this potentially really circular 506 00:29:57,920 --> 00:30:02,800 Speaker 2: situation where AI could start ingesting AI generated content and 507 00:30:03,000 --> 00:30:05,760 Speaker 2: entering this really massive quality spiral. 508 00:30:05,840 --> 00:30:10,560 Speaker 1: I think what Jason Sadowski calls habsburg AI. Yeah, the 509 00:30:10,720 --> 00:30:12,920 Speaker 1: inbreeding about it, keep going, sorry. 510 00:30:12,840 --> 00:30:15,440 Speaker 2: Yeah, no, it's a great terms. He always has great 511 00:30:15,520 --> 00:30:19,000 Speaker 2: terms for things. But yeah, I mean I think that 512 00:30:19,240 --> 00:30:23,080 Speaker 2: you know, we really people are very excited about AI, 513 00:30:23,280 --> 00:30:25,560 Speaker 2: and they believe that AI will just continue to get 514 00:30:25,560 --> 00:30:28,640 Speaker 2: better and better and better without really thinking that critically 515 00:30:28,720 --> 00:30:32,240 Speaker 2: about what is required to create like a high quality, 516 00:30:33,560 --> 00:30:37,840 Speaker 2: you know, large language model, and something like Wikipedia is honestly, 517 00:30:37,880 --> 00:30:42,560 Speaker 2: like incredibly important to creating any quality large language model. 518 00:30:42,600 --> 00:30:44,719 Speaker 2: I mean, I think every large language model out there 519 00:30:44,760 --> 00:30:47,120 Speaker 2: pretty much uses Wikipedia as a source material. 520 00:30:47,320 --> 00:30:49,600 Speaker 1: Yeah, Chat GPT was trained on it, right. 521 00:30:49,680 --> 00:30:51,520 Speaker 2: Yeah, I mean I think basically all of them were. 522 00:30:52,840 --> 00:30:55,840 Speaker 2: And so you know, if you enter that very circular 523 00:30:56,240 --> 00:30:59,320 Speaker 2: loop where that training material gets worse and worse and worse, 524 00:30:59,760 --> 00:31:02,040 Speaker 2: I think AI will get worse and worse and worse. 525 00:31:02,200 --> 00:31:04,920 Speaker 2: And so making sure that there is you know, very 526 00:31:04,960 --> 00:31:10,240 Speaker 2: high quality information out there that is not being in shitified. 527 00:31:10,280 --> 00:31:16,120 Speaker 2: To use Corey Doctor's term, bye by this AI, you know, 528 00:31:16,520 --> 00:31:20,920 Speaker 2: junk that's being created is I think very important just 529 00:31:21,000 --> 00:31:24,160 Speaker 2: to you know, general knowledge, but also I think it 530 00:31:24,200 --> 00:31:26,360 Speaker 2: should be very important to those people who care so 531 00:31:26,440 --> 00:31:27,040 Speaker 2: much about AI. 532 00:31:28,120 --> 00:31:32,440 Speaker 1: Yeah. It's ironic as well because the sources of information 533 00:31:32,520 --> 00:31:35,400 Speaker 1: we've come to rely upon, like Google and Being to 534 00:31:35,440 --> 00:31:39,320 Speaker 1: some extent I guess, are so dependent on using generator comment. 535 00:31:39,400 --> 00:31:41,280 Speaker 1: But really it just they've choked it. They're choking it 536 00:31:41,320 --> 00:31:45,880 Speaker 1: as we speak. Is Wikipedia seeing less traffic or more traffic? 537 00:31:45,960 --> 00:31:47,560 Speaker 1: Is it? How is this affecting it? 538 00:31:49,360 --> 00:31:52,320 Speaker 2: I honestly don't know. I guess I could look at 539 00:31:52,320 --> 00:31:57,560 Speaker 2: Wikipedia traffic that's my dub. Yeah, But I mean I 540 00:31:57,600 --> 00:32:01,160 Speaker 2: think there is, you know, to some extent, with Google 541 00:32:01,480 --> 00:32:05,640 Speaker 2: and these various software projects that ingest Wikipedia data. You know, 542 00:32:05,720 --> 00:32:10,479 Speaker 2: sometimes they do draw that attention away from the source, 543 00:32:10,600 --> 00:32:13,160 Speaker 2: you know, where they'll just highlight the first paragraph of 544 00:32:13,160 --> 00:32:15,680 Speaker 2: Wikipedia and search results and people never click through to 545 00:32:15,720 --> 00:32:19,800 Speaker 2: the Wikipedia article. That's a pretty common phenomenon. You know. 546 00:32:19,920 --> 00:32:24,200 Speaker 2: News sources are sometimes upset about that also. But I 547 00:32:24,200 --> 00:32:26,920 Speaker 2: think one thing that's useful about Wikipedia is that Wikipedia 548 00:32:26,960 --> 00:32:30,840 Speaker 2: doesn't really have the same incentive to draw clicks as 549 00:32:30,920 --> 00:32:35,640 Speaker 2: you know, an ad supported news source might, and so. 550 00:32:35,760 --> 00:32:38,040 Speaker 1: To selected on search. 551 00:32:38,120 --> 00:32:40,120 Speaker 2: But it is, yeah, exactly, It's like to some extent, 552 00:32:40,160 --> 00:32:42,160 Speaker 2: as long as the information is getting out there, we 553 00:32:42,200 --> 00:32:46,600 Speaker 2: don't really care how and so you know, I guess 554 00:32:46,640 --> 00:32:49,480 Speaker 2: there's some question of like, is it actually bad for 555 00:32:49,600 --> 00:32:53,880 Speaker 2: traffic to be redirected away from Wikipedia? I could probably 556 00:32:53,920 --> 00:32:55,560 Speaker 2: make that argument in both directions. 557 00:32:57,000 --> 00:32:59,680 Speaker 1: Well, Molly, thank you so much for joining me today. 558 00:33:00,000 --> 00:33:02,280 Speaker 1: Where can people find you? 559 00:33:02,280 --> 00:33:04,680 Speaker 2: You can find me at Mollywaite dot net. I also 560 00:33:04,680 --> 00:33:08,440 Speaker 2: read a newsletter at Citation Needed dot news. 561 00:33:09,120 --> 00:33:10,520 Speaker 1: Well, thank you so much for joining me. 562 00:33:11,080 --> 00:33:11,840 Speaker 2: Thanks for having me. 563 00:33:13,600 --> 00:33:15,680 Speaker 1: Now, listeners, I'm going to do something a little bit 564 00:33:15,800 --> 00:33:18,440 Speaker 1: off based, if you will. Molly and I just talked 565 00:33:18,480 --> 00:33:21,400 Speaker 1: about Wikipedia, and I realize it feels weird to advocate 566 00:33:21,440 --> 00:33:25,520 Speaker 1: for something on a objective media platform, though I think 567 00:33:25,520 --> 00:33:27,920 Speaker 1: we all agree that I have my biases and my 568 00:33:28,000 --> 00:33:30,800 Speaker 1: things that I care about more and that I'm angry about, 569 00:33:31,000 --> 00:33:34,080 Speaker 1: and I think we all do. But I must state 570 00:33:34,160 --> 00:33:39,720 Speaker 1: how important keeping Wikipedia alive is. This platform, despite the 571 00:33:39,720 --> 00:33:43,240 Speaker 1: fact that's editable by everyone, is more reliable than Google 572 00:33:43,320 --> 00:33:45,520 Speaker 1: is at the moment, it's more reliable than a lot 573 00:33:45,560 --> 00:33:49,960 Speaker 1: of media publications. As we speak, the user generated Internet 574 00:33:50,040 --> 00:33:53,040 Speaker 1: is being destroyed, it is being sold off for parts, 575 00:33:53,320 --> 00:33:57,920 Speaker 1: it's being turned into a rot economic catastrophe. Something like 576 00:33:57,960 --> 00:34:01,560 Speaker 1: Wikipedia is truly important. Really. Again, it's weird to advocate 577 00:34:01,760 --> 00:34:05,000 Speaker 1: for any product or thing, but I encourage you, and 578 00:34:05,040 --> 00:34:07,080 Speaker 1: I know it's weird to say, but please go and 579 00:34:07,240 --> 00:34:10,200 Speaker 1: edit Wikipedia. Please sit down and edit whatever it is. 580 00:34:10,400 --> 00:34:13,640 Speaker 1: Learn the ropes. This is one of the few community 581 00:34:13,760 --> 00:34:16,279 Speaker 1: organized Internet things that actually exist that we can help with. 582 00:34:16,360 --> 00:34:18,960 Speaker 1: It's a thing you can do today to change the Internet, 583 00:34:19,160 --> 00:34:24,000 Speaker 1: to fight for what's right. And yeah, these publications are 584 00:34:24,040 --> 00:34:27,839 Speaker 1: funded in part by tech publications. The Wikimedia Foundation has 585 00:34:27,920 --> 00:34:31,000 Speaker 1: taken money from Google. But as you've heard from Molly, 586 00:34:31,080 --> 00:34:33,959 Speaker 1: who I trust deeply and you should do, they do 587 00:34:34,160 --> 00:34:37,840 Speaker 1: have a firewall between them. This is one way you 588 00:34:37,880 --> 00:34:40,879 Speaker 1: can fight back, and I implore you to do so, 589 00:34:41,520 --> 00:34:43,600 Speaker 1: even if it's one editor, even if it's one page. 590 00:34:43,600 --> 00:34:47,160 Speaker 1: You keep an eye on, follow the training documents, join 591 00:34:47,239 --> 00:34:52,799 Speaker 1: the party. Please try. I understand that it's impossible to 592 00:34:52,840 --> 00:34:55,640 Speaker 1: ask for money, and I should never do so, but 593 00:34:55,680 --> 00:35:00,279 Speaker 1: your time, your attention on Wikipedia is genuinely important. It 594 00:35:00,400 --> 00:35:02,680 Speaker 1: sounds silly. I know we've all kind of thought it's 595 00:35:02,719 --> 00:35:06,839 Speaker 1: just Wikipedia, anyone can edit it. But I'm worried. I'm 596 00:35:06,880 --> 00:35:10,600 Speaker 1: worried for the Internet as it stands. I'm worried the 597 00:35:10,600 --> 00:35:14,120 Speaker 1: Internet sources are going to become more centralized, more focused 598 00:35:14,160 --> 00:35:18,319 Speaker 1: on the big take platforms. Please protect what's left of 599 00:35:18,320 --> 00:35:22,800 Speaker 1: the good Internet. Edit Wikipedia today. I'm serious. At times, 600 00:35:22,800 --> 00:35:25,040 Speaker 1: it can feel a little hopeless out there. It can 601 00:35:25,080 --> 00:35:27,400 Speaker 1: feel like there's nothing we can do against these trillion 602 00:35:27,440 --> 00:35:30,960 Speaker 1: dollar enterprises, and to some extent, there's nothing we can do. 603 00:35:31,840 --> 00:35:35,000 Speaker 1: We really can't. We can't stop Sundar Pashai, we can't 604 00:35:35,040 --> 00:35:39,239 Speaker 1: stop Sam Ortman. What we can do is help reinforce 605 00:35:39,400 --> 00:35:43,279 Speaker 1: what made the Internet great. What we can do is 606 00:35:43,360 --> 00:35:47,319 Speaker 1: contribute to open source. What we can do is edit Wikipedia, 607 00:35:47,760 --> 00:35:51,799 Speaker 1: even a little, Even those little contributions matter. This is 608 00:35:51,840 --> 00:35:55,000 Speaker 1: what made the Internet what it is today, and we 609 00:35:55,040 --> 00:35:58,359 Speaker 1: can fight for it and we can protect it. Thank 610 00:35:58,400 --> 00:36:10,520 Speaker 1: you for listening, Thank you for listening to Better Offline. 611 00:36:10,640 --> 00:36:13,080 Speaker 1: The editor and composer of the Better Offline theme song 612 00:36:13,160 --> 00:36:15,799 Speaker 1: is Matasowski. You can check out more of his music 613 00:36:15,800 --> 00:36:19,479 Speaker 1: and audio projects at Matasowski dot com. M A T 614 00:36:19,480 --> 00:36:23,719 Speaker 1: T O S O W s KI dot com. You 615 00:36:23,760 --> 00:36:26,240 Speaker 1: can email me at easy at better offline dot com 616 00:36:26,320 --> 00:36:28,279 Speaker 1: or check out better offline dot com to find my 617 00:36:28,360 --> 00:36:30,960 Speaker 1: newsletter and more links to this podcast. Thank you so 618 00:36:31,040 --> 00:36:34,719 Speaker 1: much for listening. Better Offline is a production of cool 619 00:36:34,800 --> 00:36:37,600 Speaker 1: Zone Media. For more from cool Zone Media, visit our 620 00:36:37,600 --> 00:36:40,640 Speaker 1: website cool Zonemedia dot com, or check us out on 621 00:36:40,680 --> 00:36:41,520 Speaker 1: the iHeartRadio 622 00:36:41,520 --> 00:36:44,279 Speaker 2: App, Apple Podcasts, or wherever you get your podcasts.