LIVE NEWS
  • It has the highest levels of toxic Pfas in drinking water in Scotland. But how did this remote island become awash with forever chemicals? | Pfas
  • For veterans, a place where peace can take root : NPR
  • This common amino acid helped mice survive deadly inflammation
  • Apple Will Reportedly Add Bill-Splitting Feature to iOS 27
  • Opinion | Putin Has No Good Way Out of His War
  • Flowise’s MCP implementation can run ghost commands
  • DOE Restarts Home Efficiency Rebates, and Electrification Is the Biggest Loser
  • Albania prosecutors probe Jared Kushner-linked resort amid violent protests
Prime Reports
  • Home
  • Popular Now
  • Crypto
  • Cybersecurity
  • Economy
  • Geopolitics
  • Global Markets
  • Politics
  • See More
    • Artificial Intelligence
    • Climate Risks
    • Defense
    • Healthcare Innovation
    • Science
    • Technology
    • World
Prime Reports
  • Home
  • Popular Now
  • Crypto
  • Cybersecurity
  • Economy
  • Geopolitics
  • Global Markets
  • Politics
  • Artificial Intelligence
  • Climate Risks
  • Defense
  • Healthcare Innovation
  • Science
  • Technology
  • World
Home»Artificial Intelligence»Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.
Artificial Intelligence

Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.

primereportsBy primereportsApril 30, 2026No Comments3 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.
Share
Facebook Twitter LinkedIn Pinterest Email


For this episode of The New Stack Makers, I sat down with AWS developer advocate Morgan Willis to talk about Strands Agents, the company’s open source agentic framework, which has seen over 14 million downloads since it launched just under a year ago. Willis brought a hands-on demo built around a simple accounting API to show what building with Strands looks like in practice.

The demo walks through three iterations of the same task: looking up the latest invoice for a customer. First, Willis mapped each API endpoint directly to an agent tool, the way most developers would by default. The agent needed five chained API calls and burned roughly 52,000 tokens. Then she swapped in intent-based tools that are built around an outcome rather than a data operation. With the same query, getting an answer now took one tool call and only 2,000 tokens.

“It’s calling multiple API’s, but rolling them up into one intent-based tool for the agent that it’s going to have a better time using — and understanding when exactly to use it. […]

“The fewer tools that you expose to your agent, the less likely it is to call the wrong one.”

“Your agent is going to have a better time reasoning around what tool to use and when, because these tools are more aligned to a task and less aligned to data,” Willis tells The New Stack. “The fewer tools that you expose to your agent, the less likely it is to call the wrong one.”

The third iteration moved those tools to a remote MCP server via AWS Agent Core Gateway and enabled semantic search across the tool catalog, so the agent received only the tools relevant to each query, rather than the full set of 16. That cut token usage roughly in half again compared to loading everything.

Willis says the broader principle at work here is that narrowly scoped agents tend to outperform general-purpose ones. 

“I think agents that are more narrowly defined tend to perform better than general use case agents. If you’re looking for context efficiency, speed, and accuracy, I would also look at your agent design as well.” 

Having many agents, each doing a small number of things, lets you design tools precisely for each use case rather than building a more general agent that tries to do everything. As MCP servers proliferate and tool catalogs grow, the question of which tools an agent actually sees on a given run is going to matter as much as the tools themselves.


Group Created with Sketch.

Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.

Before joining The New Stack as its senior editor for AI, Frederic was the enterprise editor at TechCrunch, where he covered everything from the rise of the cloud and the earliest days of Kubernetes to the advent of quantum computing….

Read more from Frederic Lardinois



Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleTranscribing historical Canadian weather data
Next Article Marathon record breaker Sabastian Sawe returns to hero’s welcome in Kenya | Newsfeed
primereports
  • Website

Related Posts

Artificial Intelligence

Flowise’s MCP implementation can run ghost commands

June 2, 2026
Artificial Intelligence

Dell Makes The Profits Up In Volume For Booming AI Servers

June 2, 2026
Artificial Intelligence

Design Your AI Agents Around How They Fail, Not What They Can Do

June 1, 2026
Add A Comment
Leave A Reply Cancel Reply

Top Posts

Paxton’s win over Cornyn sets up high-stakes Texas clash with Talarico

May 28, 202616 Views

Global Resources Outlook 2024 | UNEP

December 6, 202510 Views

Texas Democrat Talarico claims voting laws are rigged ahead of Paxton race

May 28, 20269 Views
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Latest Reviews

Subscribe to Updates

Get the latest tech news from FooBar about tech, design and biz.

PrimeReports.org
Independent global news, analysis & insights.

PrimeReports.org brings you in-depth coverage of geopolitics, markets, technology and risk – with context that helps you understand what really matters.

Editorially independent · Opinions are those of the authors and not investment advice.
Facebook X (Twitter) LinkedIn YouTube
Key Sections
  • World
  • Geopolitics
  • Popular Now
  • Artificial Intelligence
  • Cybersecurity
  • Crypto
All Categories
  • Artificial Intelligence
  • Climate Risks
  • Crypto
  • Cybersecurity
  • Defense
  • Economy
  • Geopolitics
  • Global Markets
  • Healthcare Innovation
  • Politics
  • Popular Now
  • Science
  • Technology
  • World
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms & Conditions
  • Disclaimer
  • Cookie Policy
  • DMCA / Copyright Notice
  • Editorial Policy

Sign up for Prime Reports Briefing – essential stories and analysis in your inbox.

By subscribing you agree to our Privacy Policy. You can opt out anytime.
Latest Stories
  • It has the highest levels of toxic Pfas in drinking water in Scotland. But how did this remote island become awash with forever chemicals? | Pfas
  • For veterans, a place where peace can take root : NPR
  • This common amino acid helped mice survive deadly inflammation
© 2026 PrimeReports.org. All rights reserved.
Privacy Terms Contact

Type above and press Enter to search. Press Esc to cancel.