LIVE NEWS
  • Global economy must stop pandering to ‘frivolous desires of ultra-rich’, says UN expert | Environment
  • Some Middle East Flights Resume but Confusion Reigns From Iran Strikes
  • Clinton Deposition Videos Released in Epstein Investigation
  • Elevance stock tumbles as CMS may halt Medicare enrollment
  • Wild spaces for butterflies to be created in Glasgow
  • You can now adjust how your caller card looks for calls on Android phones
  • TRON DAO expands TRON Academy initiative with Dartmouth, Princeton, Oxford, and Cambridge
  • Alex Mitchell: England scrum-half ruled out of Six Nations
Prime Reports
  • Home
  • Popular Now
  • Crypto
  • Cybersecurity
  • Economy
  • Geopolitics
  • Global Markets
  • Politics
  • See More
    • Artificial Intelligence
    • Climate Risks
    • Defense
    • Healthcare Innovation
    • Science
    • Technology
    • World
Prime Reports
  • Home
  • Popular Now
  • Crypto
  • Cybersecurity
  • Economy
  • Geopolitics
  • Global Markets
  • Politics
  • Artificial Intelligence
  • Climate Risks
  • Defense
  • Healthcare Innovation
  • Science
  • Technology
  • World
Home»Artificial Intelligence»AWS Graviton5 Strikes A Different Balance For Server CPUs
Artificial Intelligence

AWS Graviton5 Strikes A Different Balance For Server CPUs

primereportsBy primereportsDecember 5, 2025No Comments6 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
AWS Graviton5 Strikes A Different Balance For Server CPUs
Share
Facebook Twitter LinkedIn Pinterest Email


AWS Graviton5 Strikes A Different Balance For Server CPUs

We have been expecting a new Arm server CPU design out of the Annapurna Labs folks who create the CPUs, XPUs, DPUs, and scale up switches for Amazon Web Services since the launch of the Graviton4 processor for two-socket systems two years ago.

The Graviton4, based on the “Demeter” V2 core just like the Nvidia “Grace” CG100 processor, was the first server CPU created by AWS that had NUMA clustering, allowing for two CPUs to share memory and present a single memory space and compute complex to the operating system. But as Dave Brown, vice president of compute and machine learning services at AWS explained in the opening keynote at the re:Invent 2025 conference today, having two processors sharing memory across a set of NUMA links introduced a lot of latencies for applications as did not having enough L3 cache for the Graviton4 cores to use as a DRAM cache. And this, as well as other factors, made applications run slower than you might expect when they had a total of 192 Neoverse V2 cores to support them.

So with the Graviton5, which is now in technology previewed with selected AWS customers, the Annapurna Labs team seems to have scratched the NUMA and put 192 Arm cores on a single socket. And now the bottleneck shifts back to the memory and memory bandwidth balance reckoned against those 192 cores, because now there is half as much memory capacity and perhaps a little more than half as much memory bandwidth against what we presume are “Poseidon” Neoverse V3 cores inside that single Graviton5 socket. (See Arm Neoverse Roadmap Brings CPU Designs, But No Big Fat GPU for more on the Neoverse core and chip roadmaps from Arm.)

There is nothing, we think, that precludes AWS from creating a two-socket NUMA version of Graviton5, of course, and that may eventually happen if customers need such configurations. (And we think that some will.)

Brown did not give out much in the way of feeds and speeds for Graviton5. We know Graviton5 has 192 cores in a single socket, 2X the cores of the Graviton4 CPU, but only delivers about 25 percent more performance. We also know that Graviton5 has 2.67X the amount of L3 cache per core as Graviton4 and has 5.3X the L3 cache per chip as Graviton4. We think that Graviton5 is etched in the same 3 nanometer processes from Taiwan Semiconductor Manufacturing Co as the current Tranium3 XPU that is now shipping in volume inside UltraServer clusters.

Brown also briefly showed a block diagram of Graviton5, which we snapped quickly but which is still blurry given how far away the camera was from the backdrop screen on stage:

If you sort of squint at that, you can see 96 pairs of Arm cores in the center of the chip, with a mesh interconnect between them. There are four PCI-Express 6.0 controllers across the top of the chip and four more across the bottom, which should be 96 lanes in total and 2.84 TB/sec full duplex at twelve lanes per PCI-Express controller.

On the right and left edges of the chip you see six DDR5 memory controllers on each side, for a total of twelve DDR5 memory controllers for the whole Graviton5 socket. If AWS is using DDR5-6400 memory running at 6.4 GHz, a single Graviton5 chip would have 614.4 GB/sec of memory bandwidth, a 14.3 percent increase compared to Graviton4. When we were fantasizing about what AWS might do, we had hoped it would push up to 165 controllers on the socket and even boost the memory speed to 7.2 GHz with DDR5-7200 memory, which is rare out there in the world. (And hence, that is why AWS didn’t push it.)

Brown did not speak about the Graviton5 core at all, but it could be a Demeter V2 core or a Poseidon V3 core, and we think it is highly likely that it is a V3 core with all the Arm-V9.2-A enhancements that come with Poseidon, but with the core geared down a lot to save thermals. If we are right, then this is like an Intel E-core variant aimed at heavily threaded workloads where throughput needs to be pushed compared to more lightly threaded jobs where thread performance (and hence higher clock speeds) are important and, in the Intel parlance, “performance cores” or P-cores.

This is all admittedly a guess, but this is the only way we can think of to double the core count on a chip but only increase performance by 25 percent. So, Graviton4 had a 2.8 GHz clock speed, and to get that modest 25 percent performance boost on threaded workloads, the clock speed would have to drop to around 1.75 GHz.

The Poseidon V3 core allows 2 MB or 3 MB of L2 cache per core, and we opted for the fatter one in our table. But again, AWS has not confirmed the Graviton5 specs as yet. We think the L1 instruction and data caches will stay at 64 KB each inside each core.

Here is how the six different Graviton chips stack up on the feeds and speeds:

Click to enlarge

When we do our estimating, we think the Graviton5 complex has around 132 billion transistors and burns about 180 watts running at 1.75 GHz. We also think that AWS can absolutely run the clocks a lot higher, and we fully expect for the speeds to get back up to the 3 GHz range before too long for customers who want lots of cores that are also cranking fast. Such an overclocked Graviton5 might burn 650 watts to 700 watts, possibly more – and it might also have NVLink Fusion and UALink ports as well to directly link into GPU and XPU compute engines to share memory.

Brown said that M9g instances using Graviton5 and aimed at general purpose workloads are in preview now. C9g instances aimed at compute-intensive jobs and R9g instances aimed at memory intensive jobs are expected to be unveiled in 2026.

Sign up to our Newsletter

Featuring highlights, analysis, and stories from the week directly from us to your inbox with nothing in between.
Subscribe now

Related Articles

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleAnalysis: Why COP30’s ‘tripling adaptation finance’ target is less ambitious than it seems
Next Article Chinese Hackers Exploiting React2Shell Vulnerability
primereports
  • Website

Related Posts

Artificial Intelligence

The Greatest AI Show On Earth

February 25, 2026
Artificial Intelligence

Judge Dismisses Elon Musk’s XAI Trade Secret Lawsuit Against OpenAI

February 25, 2026
Artificial Intelligence

OpenClaw is being called a security “Dumpster fire,” but there is a way to stay safe

February 25, 2026
Add A Comment
Leave A Reply Cancel Reply

Top Posts

Global Resources Outlook 2024 | UNEP

December 6, 20255 Views

The D Brief: DHS shutdown likely; US troops leave al-Tanf; CNO’s plea to industry; Crowded robot-boat market; And a bit more.

February 14, 20264 Views

German Chancellor Merz faces difficult mission to Israel – DW – 12/06/2025

December 6, 20254 Views
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Latest Reviews

Subscribe to Updates

Get the latest tech news from FooBar about tech, design and biz.

PrimeReports.org
Independent global news, analysis & insights.

PrimeReports.org brings you in-depth coverage of geopolitics, markets, technology and risk – with context that helps you understand what really matters.

Editorially independent · Opinions are those of the authors and not investment advice.
Facebook X (Twitter) LinkedIn YouTube
Key Sections
  • World
  • Geopolitics
  • Popular Now
  • Artificial Intelligence
  • Cybersecurity
  • Crypto
All Categories
  • Artificial Intelligence
  • Climate Risks
  • Crypto
  • Cybersecurity
  • Defense
  • Economy
  • Geopolitics
  • Global Markets
  • Healthcare Innovation
  • Politics
  • Popular Now
  • Science
  • Technology
  • World
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms & Conditions
  • Disclaimer
  • Cookie Policy
  • DMCA / Copyright Notice
  • Editorial Policy

Sign up for Prime Reports Briefing – essential stories and analysis in your inbox.

By subscribing you agree to our Privacy Policy. You can opt out anytime.
Latest Stories
  • Global economy must stop pandering to ‘frivolous desires of ultra-rich’, says UN expert | Environment
  • Some Middle East Flights Resume but Confusion Reigns From Iran Strikes
  • Clinton Deposition Videos Released in Epstein Investigation
© 2026 PrimeReports.org. All rights reserved.
Privacy Terms Contact

Type above and press Enter to search. Press Esc to cancel.