Landing in Las Vegas for AWS re:Invent this year felt a little like stepping into a time warp—or maybe a wormhole built out of GPUs and venture capital. Because whatever show you thought re:Invent was a few years ago—the cloud show, the infrastructure show, the “how many people can we fit into the Venetian ballrooms” show—yeah, forget all that. AWS isn’t just pivoting to AI; they’re consuming it, radiating it, and distributing it like oxygen to every corner of every conversation.
The message here is unmistakable: All AI. All the Time.
You can see it in the keynotes, the billboards, the vendor parties, even in the random hallway conversations fueled by cold-brew and adrenaline. And AWS’s announcements? Let’s just say they didn’t come to play small ball.
Welcome to the Age of Agents
AWS CEO Matt Garman did not mince words: He expects billions of AI agents running across IT environments. Not millions—billions. Across DevOps, security, coding, operations, customer service, and everything in between.
AWS rolled out:
- Long-running frontier agents that never go off duty
- Kiro AI coding agent, quietly optimizing pull requests and background tasks
- AWS Security Agent reviewing and testing code
- AWS DevOps Agent to help manage incidents (think of it as your “always-on-call” teammate who doesn’t complain about 3 a.m. alerts)
They also pushed forward on AgentCore, adding guardrails-as-code and continuous behavioral evaluation so your agent doesn’t go rogue and start rewriting your infrastructure to suit its mood.
Honestly? Five years ago we were still arguing about whether “AIOps” was a real thing or just fancy marketing. Today AWS is casually previewing an entire civilization of autonomous workers.
AI Factories and the Hardware Arms Race
AWS also leaned into what I think is the most significant tectonic shift in infrastructure since containerization: the rise of AI Factories.
These aren’t metaphors. They’re literal machine rooms—GPUs, Trainium accelerators, model pipelines, orchestration layers—all delivered as a managed service.
This is AWS saying:
“Don’t just build apps here. Build AI itself here.”
And the hardware to back it up is no joke:
- The newest NVIDIA GB300 NVL72 GPUs
- A planned Trainium 4 boasting 6× the performance of Trainium 3
- Up to 362 FP8 PFLOPs packed into an AWS UltraServer
If you’re counting FLOPs in the triple digits, congratulations: you’re playing the AI Major League.
Models Everywhere
AWS expanded Bedrock with:
- 18 new open-weight models (including from Mistral)
- A next-gen family of Nova 2 models—lite, pro, sonic, and omni
- Nova Omni, which can convert speech or text into video
- And the blockbuster: Nova Forge, letting you build your own foundational model from scratch
Let’s pause there.
Building your own foundation model used to be something only a handful of the richest organizations on earth could even contemplate. Now AWS is making it look like ordering room service at the Wynn.
Built Here vs Best in Breed
One thing I always look for at re:Invent is AWS’s identity—are they pushing customers toward their own stack, or embracing the broader ecosystem?
This year? It’s both.
AWS has its own models, accelerators, and agents—absolutely. But they’re also deeply intertwined with the best in the business: NVIDIA, Mistral, the open-weight movement, and more.
Are AWS’s homegrown tools going to rival specialized vendors?
Or will they become the classic 80% of the value at 20% of the cost solutions AWS is known for?
Time will tell. But the dual-track strategy is very… AWS. Pragmatic. Ubiquitous. Relentless.
Billions of Agents—Should We Be Worried?
Walking through the airport, I saw a Databricks ad boasting they “build agents that don’t suck.”
A bold claim.
Also, quite an admission: most agents today do suck.
So yes, billions of agents sounds scary.
But then again, so did billions of containers before Kubernetes, or millions of microservices before service meshes. The tech industry is good at taking chaos and industrializing it.
Shimmy’s Take
Look, let’s not nitpick. There’s enough AI here to wrap around the planet several times. Some of this will be evolutionary dead ends. Some will be revolutionary breakthroughs. But all of it signals a profound truth:
The AI-Native era isn’t coming. It’s here.
Hard to believe it’s been only three years since ChatGPT dropped and blew the doors off the industry. Now we’re talking about AI factories, multimodal video models, and fleets of agents managing IT operations.
Which brings me to…
A 2–3 Year Reflection: Where This All Goes
AWS is not preparing for incremental change. They’re preparing for a world where:
1. Dev teams become AI-native by default
You won’t start a project without an agent.
You won’t open a repo without automated analysis.
PRs, tests, deployments—AI handles it unless it escalates.
Developers shift from writing code to supervising and shaping code ecosystems.
2. IT operations will be transformed by long-running frontier agents
Your “on-call” will be:
- Autonomous
- Context-aware
- Equipped with full historical telemetry
- Continuously retraining itself
Humans will handle judgment, ethics, policy, and supervision—not debugging.
3. AWS becomes the world’s AI foundry
Not just cloud.
Not just infrastructure.
The manufacturing floor for AI itself.
Nova Forge is the tell.
Trainium 4 is the tell.
AgentCore is the tell.
In 2–3 years, AWS won’t just be powering AI apps.
It will be powering AI companies, AI workers, AI infrastructure, and AI factories that generate entire ecosystems of digital labor.
Final Thought
Standing in the middle of re:Invent today feels like standing inside the engine room of the tech industry’s next 20 years. And if AWS’s vision holds, we’re not heading into a world with more cloud.
We’re heading into a world where the cloud becomes the operating system for AI.

