WEBVTT - Monologue: OpenAI Is Getting Desperate 0:00:02.160 --> 0:00:08.119 Zone Media. Hello you, it's your better offline monologue and 0:00:08.160 --> 0:00:17.920 I'm your host ed ZiT trum. Now next week I'm 0:00:17.920 --> 0:00:19.400 going to have a two part of that digs into 0:00:19.400 --> 0:00:21.840 Microsoft Stata senter pull back an open ai is shaking 0:00:21.920 --> 0:00:25.200 you funding situation. But this week's monologue focuses an open 0:00:25.200 --> 0:00:28.880 AI's new model, GPT four point five. You may be 0:00:29.040 --> 0:00:31.880 wondering what it does differently to GPT four oh, or 0:00:31.920 --> 0:00:34.159 claudes on at three point seven, or any number of 0:00:34.200 --> 0:00:36.760 other large language models, And if I'm honest, I have 0:00:36.800 --> 0:00:40.800 absolutely no idea. Thankfully neither does Sam Ortman, CEO of 0:00:40.800 --> 0:00:43.480 open Ai, who said, and I quote, the GPT four 0:00:43.479 --> 0:00:47.120 point five was the first model that feels like talking 0:00:47.159 --> 0:00:51.560 to a thoughtful person to him, which makes me wonder 0:00:51.680 --> 0:00:55.000 what the other models have been like. So I went 0:00:55.040 --> 0:00:57.680 back and look. So compare this to the launch of 0:00:57.720 --> 0:01:01.520 GPT four oh, which Altman called open AI's best model ever, 0:01:01.640 --> 0:01:05.679 saying that it was fast, smart, natively multimodal, referring to 0:01:05.720 --> 0:01:07.880 the ability to accept text as well as audio and 0:01:07.959 --> 0:01:11.360 video and photos as well, and available to all chat 0:01:11.400 --> 0:01:14.520 GPT users, including on their free plan, adding that it 0:01:14.640 --> 0:01:18.400 was a very good model, especially at Coding. By contrast, 0:01:18.440 --> 0:01:22.160 Alltman summarized GPT four point five as a giant, expensive model, 0:01:22.480 --> 0:01:25.520 one that required hundreds of thousands of GPUs to launch 0:01:25.520 --> 0:01:28.760 beyond chat GPT pro. It started there. It's still not 0:01:28.880 --> 0:01:32.440 as I record this out to plus users or free users. Now. 0:01:32.520 --> 0:01:35.160 GPT pro of course is open AI is two hundred 0:01:35.160 --> 0:01:39.960 dollars a month subscription, and it's unclear when plus, which 0:01:40.000 --> 0:01:41.640 is the twenty buck a month one will get it, 0:01:41.680 --> 0:01:44.920 but apparently it's in the next few days. Aortman also 0:01:44.920 --> 0:01:47.880 added that GPT four point five isn't a reasoning model 0:01:47.960 --> 0:01:50.680 and won't crush benchmarks on account of it being a 0:01:50.720 --> 0:01:53.480 different kind of intelligence that has and all of these 0:01:53.520 --> 0:01:56.880 are quotes magic to it that Sam Mortman had not 0:01:57.000 --> 0:02:02.360 felt before. Yeah, just you know, shit's not doing well 0:02:02.800 --> 0:02:06.120 when you have to just be like it's magic, It's 0:02:06.160 --> 0:02:09.240 it's literally magic. I made magic. Now what does the 0:02:09.240 --> 0:02:13.519 magic do? I'm really not sure. In fact, it's pretty 0:02:13.520 --> 0:02:15.960 difficult to find exactly what it is that GPT four 0:02:16.000 --> 0:02:18.120 point five does differently, or what it's good at, or 0:02:18.160 --> 0:02:21.240 indeed really anything about him. BENJ. Edwards over At asked 0:02:21.240 --> 0:02:24.639 Tenneker had one developer called it a lemon. GPT four 0:02:24.680 --> 0:02:27.960 point five costs an incredible seventy five dollars per million 0:02:28.000 --> 0:02:30.880 input tokens prompts and data pushed into a model, and 0:02:30.919 --> 0:02:33.840 one hundred and fifty dollars per million output tokens is 0:02:33.840 --> 0:02:36.200 in the thing it creates. A token is like zero 0:02:36.240 --> 0:02:39.920 point seven. I think one token is maybe three words. 0:02:40.160 --> 0:02:42.600 Someone will get up my ass for this. Nevertheless, this 0:02:42.680 --> 0:02:45.240 seems like a lot. It isn't when you're running a company. 0:02:45.600 --> 0:02:48.080 And by the way, this is roughly three thousand percent 0:02:48.160 --> 0:02:51.080 more expensive for input tokens and fifteen hundred percent more 0:02:51.120 --> 0:02:53.680 expensive for output tokens than GPT four to zero for 0:02:53.800 --> 0:02:57.680 results that open ai co founder Andredge Carpathy described as 0:02:57.800 --> 0:03:00.960 a little bit better and awesome, but also not exactly 0:03:00.960 --> 0:03:04.200 in ways that are trivial to point to. That translates 0:03:04.200 --> 0:03:06.960 to it's a little bit better, but I can't really 0:03:07.000 --> 0:03:10.399 tell you why. And yes, you're gonna hear me say 0:03:10.400 --> 0:03:13.120 something similar in next week's episode, because the larger picture 0:03:13.160 --> 0:03:16.119 for open ai right now is pretty fucking dire considering 0:03:16.120 --> 0:03:18.440 their main backer, soft Bank, has to borrow billions of 0:03:18.440 --> 0:03:21.960 dollars to fund them. Nevertheless, back to four point five 0:03:22.360 --> 0:03:24.639 Since launch, which was for some reason, on the day 0:03:24.639 --> 0:03:27.760 that Sam Ortman's child was being born in the hospital, 0:03:29.040 --> 0:03:32.280 He's been posting some really weird shits since, though. A 0:03:32.280 --> 0:03:35.080 few days after launch, Aortman claimed that GPT four point 0:03:35.080 --> 0:03:38.080 five was the first time people had been emailing with 0:03:38.120 --> 0:03:41.440 such passion, asking the open AI promised never to stop 0:03:41.480 --> 0:03:44.320 offering a specific model or even replace it with an update, 0:03:44.600 --> 0:03:47.240 at which point I assume everybody in the room started 0:03:47.280 --> 0:03:50.760 clapping and they saluted Sam Mortman and said, thank you, 0:03:50.840 --> 0:03:53.280 sir for making this happen. And by the way, what 0:03:53.360 --> 0:03:55.360 I'm suggesting is that no one's ever done this, or 0:03:55.400 --> 0:03:58.560 like one freak did, or maybe Aortman emailed it to himself, 0:03:59.080 --> 0:04:04.440 who just shut up? Just your company burns five billion 0:04:04.480 --> 0:04:06.400 dollars a year and the best you've got is this 0:04:06.480 --> 0:04:10.360 warmed up dog shit about people marine todding you over 0:04:10.440 --> 0:04:13.400 your model and never taking it away. Has opening I 0:04:13.520 --> 0:04:17.960 ever even taken away a model? Jesus fucking these companies anyway. 0:04:18.520 --> 0:04:21.400 A few days later, Altman posted a conversation where he 0:04:21.480 --> 0:04:24.280 asked GPT four point five if it believed it was real, 0:04:24.600 --> 0:04:26.560 leading to a series of bullet points with things like 0:04:26.640 --> 0:04:29.400 what do we mean by real only for GPT four 0:04:29.400 --> 0:04:31.719 point five, saying that it believed that it was not 0:04:31.800 --> 0:04:35.159 an independent consciousness, but rather a structured experience happening within 0:04:35.480 --> 0:04:38.680 your consciousness, referring to Sam Altman, which is the kind 0:04:38.680 --> 0:04:40.919 of shit that's only impressive if you're an imbecile or 0:04:40.920 --> 0:04:43.279 so stoned you've texted date of your friends the question 0:04:43.520 --> 0:04:45.680 what if the joker was Batman? And by the way, 0:04:45.720 --> 0:04:47.800 the answer to that is called The Batman Who Laughs 0:04:47.839 --> 0:04:50.280 and it's one of the worst comics ever written. If 0:04:50.279 --> 0:04:52.200 you want to talk to me about DC Metal, please 0:04:52.240 --> 0:04:55.800 email me. It's easy. That's e z or z. If 0:04:55.800 --> 0:04:59.240 you're Canadian or British at Better Offline dot com I 0:04:59.320 --> 0:05:02.320 a really if you are working for DC Comics right 0:05:02.320 --> 0:05:04.640 now and you had anything to do with Death Metal 0:05:04.760 --> 0:05:06.920 or The Batman Who Laughs, you and I have a grievance. 0:05:07.240 --> 0:05:09.920 You and I need to talk. Sorry what this is 0:05:09.960 --> 0:05:13.680 a tech pop us right back to open AI. More worryingly, 0:05:13.960 --> 0:05:17.520 Samultman posted an idea for paid plans where your twenty 0:05:17.520 --> 0:05:20.919 dollars plus subscription converts the credits you can use across 0:05:20.920 --> 0:05:24.080 speech just like Deep Research O one. GPT four point five, Sora, 0:05:24.200 --> 0:05:26.800 and so on, with no fixed limits per feature, and 0:05:26.839 --> 0:05:29.080 you choose what you want. If you run out of credits, 0:05:29.360 --> 0:05:31.960 you can buy more. This, to be clear, is an 0:05:32.000 --> 0:05:34.840 attempt to raise prices without actually raising them by attempting 0:05:34.839 --> 0:05:38.240 to limit usage of open AI's more expensive models. Chat, 0:05:38.279 --> 0:05:41.040 GPT plus and other subscriptions give you a limit, for example, 0:05:41.080 --> 0:05:43.560 a limit of eighty messages every three hours on GPT 0:05:43.680 --> 0:05:46.320 four to oh, but using one doesn't limit your use 0:05:46.320 --> 0:05:49.679 of other products. Here, open ai is trying to create 0:05:49.680 --> 0:05:52.040 a rent seeking model where power users have to pay 0:05:52.200 --> 0:05:54.600 for more credits if they want to use, say open 0:05:54.640 --> 0:05:56.720 ai is more expensive models like Sora and O one, 0:05:56.880 --> 0:05:59.040 and I imagine any situation like this will be one 0:05:59.040 --> 0:06:01.479 where they hope that people simply won't use their credits 0:06:01.680 --> 0:06:03.520 or overuse them and have to pay for top ups. 0:06:04.040 --> 0:06:06.839 This is, of course all theoretical, but it heavily suggests 0:06:06.839 --> 0:06:09.920 that open ai is getting desperate. And now the information 0:06:10.040 --> 0:06:12.880 is reporting that open ai executives have told some investors 0:06:12.880 --> 0:06:16.200 that they will be charging two thousand dollars per month 0:06:16.400 --> 0:06:19.080 for their low end agent product. And yes that's a 0:06:19.160 --> 0:06:24.080 quote sold to And again I quote high income knowledge 0:06:24.080 --> 0:06:27.560 workers with supposed mid tier agents for software development costing 0:06:27.720 --> 0:06:31.400 possibly ten thousand dollars a month, with supposed PhD level 0:06:31.440 --> 0:06:34.440 research agents costing twenty thousand dollars a month, and I 0:06:34.440 --> 0:06:36.960 will tell you the PhDs I know would probably do 0:06:37.000 --> 0:06:39.560 it for half, and they'd even work for an annoying 0:06:39.560 --> 0:06:42.560 asshole like Sam Altman. Now you may wonder what any 0:06:42.600 --> 0:06:44.640 of these things do, and the answer is that neither 0:06:44.839 --> 0:06:47.200 I nor the Information know. As of right now. The 0:06:47.240 --> 0:06:50.680 only operational or agent, open ai has his operator, open 0:06:50.720 --> 0:06:53.800 AI's agent that sometimes successfully uses a web browser search 0:06:53.839 --> 0:06:57.240 for something in minutes, which would usually take you seconds. 0:06:57.920 --> 0:07:00.240 The Information attempted to suggest that the two thousand a 0:07:00.240 --> 0:07:03.159 month agent would be some sort of thing that could 0:07:03.240 --> 0:07:05.839 sort through and rank sales lead But I'm sorry, do 0:07:05.920 --> 0:07:08.040 I really have to read this shit with a straight face? 0:07:08.120 --> 0:07:10.560 Twenty thousand dollars for a PhD level agent? What the 0:07:10.600 --> 0:07:13.040 fuck does that mean? What would it do? Why do 0:07:13.120 --> 0:07:15.960 these companies? I get emails every week having to justify 0:07:16.080 --> 0:07:19.920 my fucking cynicism, But these shittheads, they're allowed to just 0:07:20.160 --> 0:07:22.080 make up stuff and it leak it to the information 0:07:22.160 --> 0:07:24.560 The Information publishes. It were all meant to be impressed. 0:07:24.600 --> 0:07:28.960 What the fucking what the fuck I'm allowed to rent 0:07:28.960 --> 0:07:31.000 on these? They're allowing me to rant on these. It's 0:07:31.120 --> 0:07:33.760 just it sickens me. I have had this week at 0:07:33.840 --> 0:07:37.240 least five people email me and be like, well ed, 0:07:37.280 --> 0:07:39.760 what would it take to change your mind about this stuff? 0:07:39.800 --> 0:07:42.320 Why do I have to fucking do it? Why do 0:07:42.400 --> 0:07:45.680 I The multi billion dollar companies do a dogshit job 0:07:45.720 --> 0:07:48.920 of actually explaining this stuff or selling it. They lose 0:07:48.960 --> 0:07:50.880 billions of dollars. But I'm the guy who has to 0:07:50.960 --> 0:07:54.200 justify myself. Oh well, I'll keep doing it. Nevertheless, NES's 0:07:54.240 --> 0:07:56.000 old At the bottom of this article's a far more 0:07:56.040 --> 0:07:59.560 obvious payale horse. Open ai is planning to charge twenty 0:07:59.600 --> 0:08:02.400 percent to thirty percent of pro customers the two hundred 0:08:02.400 --> 0:08:04.880 dollars a month subscription that loses the money every time, 0:08:05.200 --> 0:08:07.880 a higher price because of how many research queries they're 0:08:07.920 --> 0:08:11.160 doing with Altmann, according to the information, suggesting some sort 0:08:11.200 --> 0:08:14.000 of hey guessoir a la carte or pay as you 0:08:14.040 --> 0:08:18.520 go approach. I want to be clear about something. This 0:08:18.640 --> 0:08:21.040 is not a company that's cooking. This is not a 0:08:21.080 --> 0:08:25.120 company that's worked out anything. Open Aiye is unprofitable, unsustainable, 0:08:25.160 --> 0:08:27.920 and deeply, deeply lost. These are the actions of a 0:08:28.000 --> 0:08:32.080 desperate company run by a desperate man. You have only 0:08:32.120 --> 0:08:34.320 Sam Morman had a thoughtful friend to talk about all 0:08:34.360 --> 0:08:35.800 these problems too,