1 00:00:00,080 --> 00:00:03,360 Speaker 1: Awsco Adam ze Lipski with me from their big event 2 00:00:03,400 --> 00:00:05,640 Speaker 1: of the year in Adam. Like three big pieces of 3 00:00:05,680 --> 00:00:09,719 Speaker 1: news right, A deeper relationship with Nvidia on Gracehopper super chip, 4 00:00:09,880 --> 00:00:13,480 Speaker 1: on DGX cloud, you have next generations of your own silicon, 5 00:00:13,840 --> 00:00:15,440 Speaker 1: and then you have a chatbot that will get to 6 00:00:15,840 --> 00:00:18,639 Speaker 1: at the end of the discussion. But let's start with 7 00:00:18,640 --> 00:00:20,520 Speaker 1: the strategy. I think a lot of questions that I 8 00:00:20,600 --> 00:00:24,119 Speaker 1: get is what is the differentiation in strategy with what 9 00:00:24,200 --> 00:00:28,240 Speaker 1: AWUS is doing here deepening its relationship and reliance on 10 00:00:28,320 --> 00:00:32,000 Speaker 1: DGX cloud and grace Hopper superchip versus what you're doing 11 00:00:32,479 --> 00:00:35,320 Speaker 1: with the cloud supported by your own silicon. 12 00:00:35,520 --> 00:00:37,639 Speaker 2: Well, I think our strategy has been very consistent. 13 00:00:37,800 --> 00:00:42,000 Speaker 3: It is to provide choice and powerful options and all 14 00:00:42,120 --> 00:00:45,800 Speaker 3: layers of the stack, so down at the infrastructure layer 15 00:00:46,000 --> 00:00:49,280 Speaker 3: of what you need to do generative AI really well, 16 00:00:49,800 --> 00:00:52,919 Speaker 3: as you mentioned, we both have been investing in our 17 00:00:52,960 --> 00:00:56,320 Speaker 3: own silicon, and then we have had a long standing, 18 00:00:56,600 --> 00:01:00,520 Speaker 3: wonderful relationship with Nvidia. We've been the first to pretty 19 00:01:00,560 --> 00:01:05,480 Speaker 3: much every significant Nvidia chip to the cloud, including their 20 00:01:05,560 --> 00:01:08,040 Speaker 3: latest H one hundred this past summer, and I was 21 00:01:08,080 --> 00:01:11,279 Speaker 3: really excited to have their CEO, Jensen Holong on stage 22 00:01:11,280 --> 00:01:14,560 Speaker 3: with me this morning talking about an expansion of the 23 00:01:14,600 --> 00:01:20,280 Speaker 3: partnership with AWS, bringing their DJX cloud to AWS, and 24 00:01:20,520 --> 00:01:23,360 Speaker 3: Nvidia itself is going to be standing up a huge 25 00:01:23,720 --> 00:01:26,920 Speaker 3: supercomputer of a cloud for their own internal R and 26 00:01:27,040 --> 00:01:31,240 Speaker 3: D on AWS. So it's a fantastic expansion of what's 27 00:01:31,319 --> 00:01:35,600 Speaker 3: really a great relationship that benefits our mutual customers, Adam. 28 00:01:35,360 --> 00:01:38,440 Speaker 1: And they still supply constraints around in Vidio GPUs. 29 00:01:38,720 --> 00:01:41,240 Speaker 3: Well, they're very popular, and I think it is still 30 00:01:41,280 --> 00:01:45,000 Speaker 3: remains true that there's probably more people who want to 31 00:01:45,000 --> 00:01:47,680 Speaker 3: get their hands on them than actually can, which is 32 00:01:47,680 --> 00:01:51,160 Speaker 3: one of the reasons why we also announced this concept 33 00:01:51,360 --> 00:01:56,960 Speaker 3: of clusters BREC two for chips and so that you 34 00:01:57,000 --> 00:02:00,000 Speaker 3: can actually go down reserve up to hundreds of GPS 35 00:02:00,120 --> 00:02:02,920 Speaker 3: use for your short term generative AI needs for things 36 00:02:02,920 --> 00:02:05,440 Speaker 3: like training models, which can tend to be apisodic, and 37 00:02:05,480 --> 00:02:08,280 Speaker 3: there's really nobody else out there offering anything like that 38 00:02:08,400 --> 00:02:09,080 Speaker 3: kind of service. 39 00:02:09,320 --> 00:02:12,400 Speaker 1: Given that's the case, if you had to shift any 40 00:02:12,440 --> 00:02:16,000 Speaker 1: customers to trainingum because of the limited supply of in 41 00:02:16,120 --> 00:02:17,639 Speaker 1: video GPUs. 42 00:02:17,880 --> 00:02:22,200 Speaker 3: Yeah, it's really not about shifting, it's really about customers 43 00:02:22,240 --> 00:02:25,480 Speaker 3: need different, different customers need different things. The same customer 44 00:02:25,680 --> 00:02:29,320 Speaker 3: needs different things for different use cases and awos for 45 00:02:29,400 --> 00:02:32,000 Speaker 3: this over seventeen years we've been doing. This has been 46 00:02:32,040 --> 00:02:35,639 Speaker 3: all about choice, all about democratization, all about putting tools 47 00:02:35,639 --> 00:02:37,359 Speaker 3: in the hands of our customers so that they can 48 00:02:37,360 --> 00:02:40,160 Speaker 3: make the choices about what we need. And so we're 49 00:02:40,160 --> 00:02:43,960 Speaker 3: also excited that we announced the second generation of our 50 00:02:44,480 --> 00:02:48,320 Speaker 3: training specific chip for training gener workloads, and that is 51 00:02:48,800 --> 00:02:52,120 Speaker 3: TRAININGUM too, and that's going to have up to four 52 00:02:52,240 --> 00:02:55,239 Speaker 3: times the compute performance of the first generation of TRAINUM. 53 00:02:55,440 --> 00:02:57,359 Speaker 3: We of course have been investing in our own custom 54 00:02:57,440 --> 00:03:01,800 Speaker 3: silicon for years. We also today that we're shipping the 55 00:03:01,960 --> 00:03:05,960 Speaker 3: preview of our fourth generation of general purpose chip, Graviton four. 56 00:03:06,160 --> 00:03:07,920 Speaker 3: I happen to have one with me right here. This 57 00:03:08,160 --> 00:03:11,000 Speaker 3: is a Graviton forward ship. We're not talking about it, 58 00:03:11,080 --> 00:03:13,959 Speaker 3: we're not showing slides on it. It's not future looking 59 00:03:14,040 --> 00:03:16,880 Speaker 3: like a lot of other cloud providers. It is shipping today. 60 00:03:17,280 --> 00:03:21,240 Speaker 3: And again our fourth generation, the power is incredible, the 61 00:03:21,360 --> 00:03:25,440 Speaker 3: price performance is going to be incredibly attractive, and our 62 00:03:25,520 --> 00:03:29,160 Speaker 3: chips also have incredible energy efficiency benefits, which is really 63 00:03:29,440 --> 00:03:31,840 Speaker 3: really important to our customers these days. 64 00:03:32,360 --> 00:03:34,520 Speaker 1: Adam, if the chips are as real as the one 65 00:03:34,560 --> 00:03:36,320 Speaker 1: as you held in your hand, and they're shipping to 66 00:03:36,360 --> 00:03:39,160 Speaker 1: the real world. Give us some numbers, what percentage of 67 00:03:39,200 --> 00:03:43,120 Speaker 1: the AW customer base is using Graviton and is anyone 68 00:03:43,240 --> 00:03:47,200 Speaker 1: using Trainium the existing or future generation besides Amazon. 69 00:03:47,600 --> 00:03:51,160 Speaker 3: Yeah, Graviton has become very popular. I don't think we've 70 00:03:51,160 --> 00:03:54,240 Speaker 3: broken out numbers, but all one hundred of the top 71 00:03:54,280 --> 00:03:57,440 Speaker 3: EC two it's our compute service, the top one hundred 72 00:03:57,520 --> 00:04:02,160 Speaker 3: EC two customers and fifty thousand customers overall are using 73 00:04:02,200 --> 00:04:07,200 Speaker 3: Graviton for so it's incredibly popular, incredibly powerful, and in 74 00:04:07,280 --> 00:04:10,240 Speaker 3: terms of TRAININGUM, we're seeing a lot of interest, a 75 00:04:10,280 --> 00:04:12,880 Speaker 3: lot of folks moving to adopt that as well as 76 00:04:13,440 --> 00:04:17,360 Speaker 3: of course GPU based capacity. And with training, we've got 77 00:04:17,400 --> 00:04:20,839 Speaker 3: folks like Data Bricks and Rico and many others that 78 00:04:20,920 --> 00:04:25,279 Speaker 3: are building and deploying on them. Anthropic that we're partnering 79 00:04:25,279 --> 00:04:28,040 Speaker 3: with very closely, and it's obviously one of the very 80 00:04:28,160 --> 00:04:34,960 Speaker 3: leading suppliers of foundation models and large language models, is 81 00:04:35,400 --> 00:04:38,640 Speaker 3: going to be training future generations of their models on 82 00:04:38,800 --> 00:04:41,479 Speaker 3: TRAININGUM on Anthropics. 83 00:04:41,520 --> 00:04:44,680 Speaker 1: Since last we spoke what's changed does Anthropics come out 84 00:04:44,720 --> 00:04:48,560 Speaker 1: and said it's also deepened its relationship with Google on 85 00:04:48,960 --> 00:04:55,400 Speaker 1: Google Cloud platform. Is Anthropic using any of your custom 86 00:04:55,480 --> 00:04:58,560 Speaker 1: silicon to train Clawed or do you have any sort 87 00:04:58,560 --> 00:05:00,920 Speaker 1: of visibility on a start day of when they'll start 88 00:05:00,960 --> 00:05:02,880 Speaker 1: training a future model on your silicon? 89 00:05:03,920 --> 00:05:05,039 Speaker 2: So really nothing's changed. 90 00:05:05,360 --> 00:05:08,960 Speaker 3: Anthropic has been running on AWS literally since they were 91 00:05:08,960 --> 00:05:12,440 Speaker 3: founded in twenty twenty one, which seems decades ago, but 92 00:05:12,640 --> 00:05:15,800 Speaker 3: it was only in twenty twenty one. And you know, 93 00:05:15,839 --> 00:05:18,119 Speaker 3: we obviously are providing choice of a lot of different 94 00:05:18,160 --> 00:05:22,600 Speaker 3: model providers to customers. You know, many startups, including Anthropic, 95 00:05:22,720 --> 00:05:24,880 Speaker 3: you know, as they as they grow or are going 96 00:05:24,920 --> 00:05:28,120 Speaker 3: to use multi providers providers for different things. But I 97 00:05:28,120 --> 00:05:32,360 Speaker 3: think we Anathropic have been very clear AWS is anthropics 98 00:05:32,640 --> 00:05:35,960 Speaker 3: primary cloud provider for their mission critical workloads and the 99 00:05:36,000 --> 00:05:39,200 Speaker 3: majority of overall workloads from Anthropic are going to run 100 00:05:39,240 --> 00:05:43,000 Speaker 3: on AWS, and Anthropic will be training future versions of 101 00:05:44,240 --> 00:05:48,080 Speaker 3: their Claud models on AWS. They will be collaborating with 102 00:05:48,160 --> 00:05:51,919 Speaker 3: us to help improve our trainum and our inferential chip technology, 103 00:05:52,240 --> 00:05:54,920 Speaker 3: and they're going to be bringing powerful features around fine 104 00:05:54,960 --> 00:05:58,200 Speaker 3: tuning and customization that for certain periods of time will 105 00:05:58,240 --> 00:06:01,920 Speaker 3: only only be available on AS and through Anthropic directly, 106 00:06:02,120 --> 00:06:04,839 Speaker 3: not on any other cloud, So the relationship is deep, 107 00:06:04,880 --> 00:06:06,240 Speaker 3: it's strong, and it's unchanged. 108 00:06:06,520 --> 00:06:10,279 Speaker 1: Adam, will you ever be able to get open AI 109 00:06:10,839 --> 00:06:12,080 Speaker 1: to be on AWS? 110 00:06:12,680 --> 00:06:14,760 Speaker 3: I think we are going to try and bring all 111 00:06:14,880 --> 00:06:17,599 Speaker 3: of the models that people care about on too AWS 112 00:06:17,880 --> 00:06:22,880 Speaker 3: O God, Anthropic, cohere Stability, AI AI twenty one, Meta 113 00:06:22,920 --> 00:06:25,680 Speaker 3: Islama II model. I'm sure there will be others over time, 114 00:06:25,720 --> 00:06:28,600 Speaker 3: and we're just going to be guided by where our 115 00:06:28,600 --> 00:06:30,280 Speaker 3: customers tell us they care about the most. 116 00:06:31,240 --> 00:06:33,400 Speaker 1: So the other big piece of news is an AI 117 00:06:33,520 --> 00:06:36,200 Speaker 1: chatbot finally from Amazon Q. 118 00:06:37,000 --> 00:06:39,279 Speaker 2: But here's my question. If I use. 119 00:06:39,200 --> 00:06:43,000 Speaker 1: Gmail or I use a Microsoft piece of software, I 120 00:06:43,120 --> 00:06:47,000 Speaker 1: can take those pieces of software and use the new 121 00:06:47,000 --> 00:06:51,200 Speaker 1: eight AI tools that those companies are bringing to market 122 00:06:51,760 --> 00:06:55,920 Speaker 1: in their existing suites. Q is on the AWS console. 123 00:06:56,240 --> 00:06:58,080 Speaker 1: So does that mean I have to train as a 124 00:06:58,120 --> 00:07:01,280 Speaker 1: developer and become a developer to get access to it. 125 00:07:02,760 --> 00:07:03,040 Speaker 3: No. Q. 126 00:07:03,560 --> 00:07:05,000 Speaker 2: We're incredibly excited about Q. 127 00:07:05,480 --> 00:07:08,400 Speaker 3: All the many customers that have been previewing Q and 128 00:07:08,440 --> 00:07:12,720 Speaker 3: Private are incredibly excited about it, and we think it's 129 00:07:12,760 --> 00:07:16,440 Speaker 3: going to be transformative in helping to reinvent how people work. 130 00:07:16,920 --> 00:07:21,320 Speaker 3: I mean the existing chatbots or generative AI assistants are 131 00:07:21,360 --> 00:07:23,520 Speaker 3: great for consumers, and so many of us of course 132 00:07:23,560 --> 00:07:26,680 Speaker 3: have used them experiment with them, but unfortunately, for the 133 00:07:26,720 --> 00:07:30,280 Speaker 3: most part, they don't really work at work, they don't 134 00:07:30,280 --> 00:07:32,480 Speaker 3: really know who you are, they don't really have access 135 00:07:32,520 --> 00:07:35,360 Speaker 3: to your positories, and for the most part, they've lacked 136 00:07:35,400 --> 00:07:41,360 Speaker 3: security and privacy characteristics that are pretty much requirements for 137 00:07:42,080 --> 00:07:46,640 Speaker 3: enterprises and many other companies. So Amazon Q is going 138 00:07:46,680 --> 00:07:50,760 Speaker 3: to be tailored and customized to your business. We've built 139 00:07:50,760 --> 00:07:55,000 Speaker 3: connectors to forty over forty major systems, including Microsoft three 140 00:07:55,120 --> 00:08:01,160 Speaker 3: sixty five, including Google, systems including Salesforce, Zendesk, service Now, 141 00:08:01,480 --> 00:08:03,960 Speaker 3: and many others, and Q is going to be able 142 00:08:03,960 --> 00:08:06,440 Speaker 3: to access those if you give Q access to them, 143 00:08:06,600 --> 00:08:09,160 Speaker 3: but it's going to respect your privacy and security. If 144 00:08:09,200 --> 00:08:11,680 Speaker 3: you don't have access to a database without Q, you're 145 00:08:11,720 --> 00:08:13,600 Speaker 3: not going to have access to it with Q. 146 00:08:14,000 --> 00:08:15,960 Speaker 2: But meanwhile, it's going to make it so much. 147 00:08:15,800 --> 00:08:19,360 Speaker 3: Easier and more convenient to find data, to generate answers, 148 00:08:19,360 --> 00:08:23,200 Speaker 3: to summarize things, to reach new conclusions, and to get 149 00:08:23,240 --> 00:08:26,320 Speaker 3: to actual insights and actions. So it's going to integrate 150 00:08:26,360 --> 00:08:29,000 Speaker 3: really well with the systems that people care about. It's 151 00:08:29,000 --> 00:08:32,680 Speaker 3: going to be enterprise grade, and again We're really excited 152 00:08:32,720 --> 00:08:34,719 Speaker 3: about it because our customers are excited about it. 153 00:08:35,160 --> 00:08:38,320 Speaker 1: Adam, I know that you're someone that believes fundamentally choice 154 00:08:38,360 --> 00:08:41,520 Speaker 1: is important when it comes to AI tools. What work 155 00:08:41,559 --> 00:08:44,040 Speaker 1: are you doing on the diversity and ethics side here? 156 00:08:44,080 --> 00:08:46,520 Speaker 1: We always talk a lot about silicon and product, but 157 00:08:46,600 --> 00:08:49,640 Speaker 1: what about some of the guardrails and safeguards needed in AI. 158 00:08:50,960 --> 00:08:52,120 Speaker 2: It's incredibly important. 159 00:08:52,240 --> 00:08:54,440 Speaker 3: I fund you should use the word guardrails because this 160 00:08:54,559 --> 00:08:57,280 Speaker 3: morning here a reinvent where we have fifty thousand people 161 00:08:57,320 --> 00:08:59,920 Speaker 3: live in Las Vegas and over three hundred other thousand 162 00:09:00,040 --> 00:09:04,280 Speaker 3: people registered around the world. We actually announced a guardrails 163 00:09:04,280 --> 00:09:07,560 Speaker 3: for Amazon Bedrock. Bedrock is our foundation model as a 164 00:09:07,600 --> 00:09:12,880 Speaker 3: service offering with all those models inside, and Guardrails is 165 00:09:12,920 --> 00:09:17,080 Speaker 3: a new capability which actually allows you to enforce your 166 00:09:17,120 --> 00:09:21,559 Speaker 3: own responsible AI priorities and policies. So if you're a 167 00:09:21,600 --> 00:09:24,880 Speaker 3: financial services company, you might have your chatbot not give 168 00:09:24,880 --> 00:09:28,880 Speaker 3: investment advice, or if you're a utility with customer service 169 00:09:29,440 --> 00:09:33,680 Speaker 3: chatbot or generative AI assistant, you might have it strip 170 00:09:33,760 --> 00:09:37,040 Speaker 3: out the PII or personally identifiable information from the call 171 00:09:37,080 --> 00:09:40,480 Speaker 3: summary to respect people's privacy. So we're taking really seriously 172 00:09:40,559 --> 00:09:43,400 Speaker 3: to the point of actually building in these leading edge 173 00:09:43,440 --> 00:09:47,440 Speaker 3: capabilities that really aren't available from other providers. And in addition, 174 00:09:47,480 --> 00:09:50,160 Speaker 3: it's really important that we're participating in the working groups 175 00:09:50,200 --> 00:09:52,320 Speaker 3: that are discussing these things. We're going to need a 176 00:09:52,360 --> 00:09:57,200 Speaker 3: multilateral approach of government, private sector, academics, and more. And 177 00:09:57,320 --> 00:09:59,520 Speaker 3: I was the White House with President Biden the summer 178 00:10:00,040 --> 00:10:03,920 Speaker 3: where we participated in these voluntary commitments around AI. Earlier 179 00:10:03,960 --> 00:10:06,640 Speaker 3: this month, I was with the UK Prime Minister Sunak 180 00:10:06,679 --> 00:10:09,640 Speaker 3: at a gathering where he announced at AI Safety Institute 181 00:10:09,679 --> 00:10:12,520 Speaker 3: in the UK, we're going to continue spending time participating 182 00:10:12,559 --> 00:10:13,360 Speaker 3: in these dialogues. 183 00:10:13,440 --> 00:10:16,880 Speaker 1: Adam Szlitzky, ws CEO, thank you for your time. Back 184 00:10:16,920 --> 00:10:17,400 Speaker 1: to you guys. 185 00:10:17,400 --> 00:10:18,000 Speaker 2: Thank you