1 00:00:07,133 --> 00:00:10,453 Speaker 1: You're listening to the Saturday Morning with Jack Tame podcast 2 00:00:10,573 --> 00:00:11,693 Speaker 1: from News Talks at me. 3 00:00:12,893 --> 00:00:15,373 Speaker 2: Open Ai has just released a new tool. This is 4 00:00:15,373 --> 00:00:18,453 Speaker 2: the company, of course, behind chat GPT, and this tool 5 00:00:18,533 --> 00:00:22,333 Speaker 2: can go one step further our textbit Paul Steenhouse is 6 00:00:22,333 --> 00:00:24,293 Speaker 2: here with the details. Got to Paul, what can it do? 7 00:00:25,293 --> 00:00:28,173 Speaker 3: Yeah, yeah, I feel like we're getting one step closer 8 00:00:28,213 --> 00:00:31,853 Speaker 3: to Rosy the robot from the Jetsons. Although I guess 9 00:00:32,453 --> 00:00:35,173 Speaker 3: this thing can't move around. Maybe that's the next step 10 00:00:35,173 --> 00:00:38,453 Speaker 3: it'll take. Because this is Rosy the Robot effectively for 11 00:00:38,533 --> 00:00:42,693 Speaker 3: your computer and your digital version of that assistant. But yeah, 12 00:00:42,693 --> 00:00:45,813 Speaker 3: it's called Operator Jack and basically it's going to be 13 00:00:45,893 --> 00:00:49,653 Speaker 3: able to start doing things for you on the Internet. 14 00:00:49,933 --> 00:00:53,933 Speaker 3: So let's imagine you want to book a flight. You 15 00:00:53,973 --> 00:00:55,893 Speaker 3: can simply type in and this is the cool part. 16 00:00:55,933 --> 00:00:58,173 Speaker 3: You just use natural language. You say, I want to 17 00:00:58,173 --> 00:01:01,053 Speaker 3: book a flight to Hawaii, and I want to do 18 00:01:01,133 --> 00:01:03,533 Speaker 3: three nights, and I don't want it to be rainy season, 19 00:01:03,613 --> 00:01:05,533 Speaker 3: and I want to do this, this, and this activity. 20 00:01:06,413 --> 00:01:09,093 Speaker 3: It will then just go away and start browsing the 21 00:01:09,133 --> 00:01:13,373 Speaker 3: Internet just like you or I would, And it uses screenshots. 22 00:01:13,413 --> 00:01:15,933 Speaker 3: They call it computer vision, but really it's screenshots of 23 00:01:15,973 --> 00:01:19,653 Speaker 3: the web page, It analyzes those, figures out what fields 24 00:01:19,653 --> 00:01:21,853 Speaker 3: it needs to fill in, figures out what buttons it 25 00:01:21,893 --> 00:01:24,453 Speaker 3: needs to click with its virtual mouse, and what to 26 00:01:24,533 --> 00:01:28,053 Speaker 3: type with its virtual keyboard, and actually just starts doing 27 00:01:28,213 --> 00:01:31,933 Speaker 3: things for you. So what's really interesting about this though, 28 00:01:32,053 --> 00:01:34,493 Speaker 3: is we have kind of like assistance and things, and 29 00:01:34,533 --> 00:01:37,013 Speaker 3: they can do things for us, but typically they all 30 00:01:37,053 --> 00:01:39,293 Speaker 3: need to be pre programmed, right, because they need to 31 00:01:39,373 --> 00:01:44,453 Speaker 3: use what we call in the digital world APIs effectively 32 00:01:44,853 --> 00:01:47,933 Speaker 3: structured data messages that different services send each other, and 33 00:01:47,933 --> 00:01:49,653 Speaker 3: we will be able to be able to do things. 34 00:01:50,053 --> 00:01:52,253 Speaker 3: This is interesting though, because it actually just like a human. 35 00:01:52,653 --> 00:01:54,973 Speaker 3: It basically looks at the screen, figures it out, it 36 00:01:55,013 --> 00:01:57,613 Speaker 3: doesn't need any pre programming, and it can just start 37 00:01:57,653 --> 00:02:00,013 Speaker 3: doing things. So if a new restaurant popped up, it 38 00:02:00,093 --> 00:02:03,253 Speaker 3: could theoretically go to its website and make a booking 39 00:02:03,253 --> 00:02:06,573 Speaker 3: for you without ever having seen that website before. And 40 00:02:06,613 --> 00:02:10,013 Speaker 3: so it's a big step forward because we're actually going 41 00:02:10,053 --> 00:02:12,533 Speaker 3: to start getting towards some assistance that are smart and 42 00:02:12,613 --> 00:02:14,973 Speaker 3: might actually be able to do some things for us. 43 00:02:15,133 --> 00:02:16,973 Speaker 2: Yeah, so I mean how. 44 00:02:16,893 --> 00:02:20,773 Speaker 3: The price deck? Oh yeah, go on, Okay, So it's 45 00:02:20,813 --> 00:02:23,773 Speaker 3: part of the chet GPT's pro plan, which is just 46 00:02:23,773 --> 00:02:27,053 Speaker 3: two hundred dollars two hundred US dollars a month. They 47 00:02:27,053 --> 00:02:28,613 Speaker 3: do say that they're going to start rolling it out 48 00:02:28,653 --> 00:02:31,813 Speaker 3: to the other plans. Do you know Sam Oltman, the 49 00:02:31,933 --> 00:02:35,173 Speaker 3: CEO of chap of open Ai, he said that he 50 00:02:36,133 --> 00:02:38,733 Speaker 3: chose that price and he thought they'd make money off 51 00:02:38,813 --> 00:02:40,933 Speaker 3: that price. Turns out they're not making any money off 52 00:02:40,973 --> 00:02:43,813 Speaker 3: that price because people are using it so heavily this 53 00:02:44,013 --> 00:02:47,693 Speaker 3: chet GPT pro feature that it's it's making a loss. 54 00:02:47,853 --> 00:02:50,373 Speaker 3: Oh really, the next one's probably going to be Yeah. 55 00:02:50,373 --> 00:02:52,613 Speaker 2: So it's not that they've priced people out of the market. 56 00:02:52,653 --> 00:02:55,333 Speaker 2: It's just that it's actually being used so much and 57 00:02:55,333 --> 00:02:59,253 Speaker 2: it's using so much computing power that they're Yeah. So 58 00:02:59,533 --> 00:03:02,653 Speaker 2: my question though, is like, honestly, how usable is this? 59 00:03:03,053 --> 00:03:05,253 Speaker 2: Like how much stuff do you really need to be 60 00:03:05,373 --> 00:03:08,573 Speaker 2: done by? How many times are you booking a flight 61 00:03:08,613 --> 00:03:11,013 Speaker 2: to Hawaii and needing that kind of research downe you 62 00:03:11,053 --> 00:03:13,133 Speaker 2: know what I mean. It's kind of like with voice 63 00:03:13,133 --> 00:03:15,413 Speaker 2: assistance and Siri and stuff, it's like, well, how often 64 00:03:15,453 --> 00:03:17,373 Speaker 2: do you really need to time a set? How often 65 00:03:17,413 --> 00:03:19,013 Speaker 2: do you really need to know the temperature? 66 00:03:19,093 --> 00:03:19,293 Speaker 1: You know? 67 00:03:20,893 --> 00:03:23,813 Speaker 3: Well, this one's different because it can do basically anything. 68 00:03:23,973 --> 00:03:26,493 Speaker 3: Right now, it still is in its research beta phase, 69 00:03:26,533 --> 00:03:28,173 Speaker 3: so they're going to say there will be hiccups, but 70 00:03:28,413 --> 00:03:30,093 Speaker 3: we're well on our way to it being able to 71 00:03:30,413 --> 00:03:34,373 Speaker 3: craft a Facebook post for you or browsing Facebook and 72 00:03:34,413 --> 00:03:36,893 Speaker 3: telling you what might be interesting, or you know, you 73 00:03:36,933 --> 00:03:40,453 Speaker 3: could probably even send it if you were gaming your 74 00:03:40,493 --> 00:03:44,093 Speaker 3: bank details and tell it to pay your bills and 75 00:03:44,133 --> 00:03:46,253 Speaker 3: do things like. That's where it's headed, right, Like you 76 00:03:46,293 --> 00:03:49,413 Speaker 3: could actually say pay my contact energy bill and it 77 00:03:49,453 --> 00:03:51,133 Speaker 3: could go away and start to figure out how to 78 00:03:51,173 --> 00:03:54,413 Speaker 3: do that. So we're not there yet, but that's what 79 00:03:54,453 --> 00:03:55,493 Speaker 3: they wanted to be able to do. 80 00:03:55,573 --> 00:03:55,853 Speaker 1: Yeah. 81 00:03:55,933 --> 00:03:58,413 Speaker 2: Yeah, Oh it sounds amazing. Okay, thank you so much. Paul, 82 00:03:58,573 --> 00:04:02,853 Speaker 2: sounds amazing if expensive. Paul Stenhouse our texpert. 83 00:04:02,893 --> 00:04:06,733 Speaker 1: There for more from Saturday Morning with Jack Team. Listen 84 00:04:06,773 --> 00:04:09,613 Speaker 1: live to News Talks ed B from nine am Saturday, 85 00:04:09,813 --> 00:04:11,813 Speaker 1: or follow the podcast on iHeartRadio