Supermicro has unveiled a new generation of servers and rack-scale systems built to support the growing demands of agentic AI, a product line supported by Arm’s recently introduced AGI CPU.
While vast investment in recent years has poured into advanced GPUs for training LLMs, Supermicro and Arm are riding the growing focus on CPUs, which are well suited to today’s move toward AI inference and autonomous AI agents.
As agentic AI systems perform tasks like planning, reasoning, retrieval, and orchestration, they place heavy demands on memory bandwidth and I/O performance, creating new infrastructure requirements beyond raw GPU horsepower.
To support those requirements, Supermicro’s new systems use Arm’s AGI CPU, which features up to 136 Arm Neoverse V3 cores, 12 DDR5 memory channels operating at speeds up to 8800 MT/s, and PCIe Gen6 connectivity. The processor operates within a 300-watt power envelope and is geared to deliver high compute density while reducing power consumption.
Arm estimates that systems based on the AGI CPU can provide roughly twice the performance per rack compared with comparable x86-based platforms.
“Supermicro is really a jack of all trades when it comes to building systems, and it sees a partnership with Arm-based CPUs, and in this case Arm-supplied CPUs, as a way to offer an alternative for companies that want lower power and possibly lower cost than with AMD/Intel chips,” Jack Gold, chief analyst at J. Gold Associates, told Techstrong.ai.
“I don’t expect this to be a major disrupter in the market for x86 CPUs in the near term, but it does provide an optional capability for companies, especially as edge-based compute proliferates and inference-based AI dominates.”
Liquid-Cooled and Air-Cooled Systems
Supermicro’s lineup includes both liquid-cooled and air-cooled systems aimed at hyperscale cloud providers, enterprises and edge deployments.
For advanced AI environments, the company introduced a liquid-cooled Open Rack Wide platform capable of supporting up to 336 AGI CPUs in a fully populated rack. A separate Open Rack V3 configuration can accommodate up to 168 AGI CPUs per rack while maintaining compatibility with advanced data center architectures.
For the air-cooled alternatives, a single-socket edge server handles distributed inference workloads in locations with limited power and space. Another dual-socket 2U server is designed for general-purpose enterprise computing, including virtualization, databases, analytics and cloud infrastructure applications.
For AI inference deployments requiring accelerator support, Supermicro unveiled a 5U system capable of housing up to eight GPUs alongside dual AGI CPUs and high-density NVMe storage. The company said these systems are scheduled to enter production during 2027.
Data Center Building Block Solutions
Beyond individual servers, Supermicro is promoting its broader Data Center Building Block Solutions (DCBBS) strategy. The company introduced infrastructure blueprints based on NVIDIA’s upcoming Vera Rubin platforms, including designs that can scale from 5 megawatts to 1 gigawatt of AI capacity. A single modular deployment can support 1,152 GPUs while integrating compute, storage, networking, liquid cooling and power distribution into a unified architecture.
The announcement aligns with a larger industry trend favoring Arm-based architectures in data centers. Arm has gained traction among hyperscalers and cloud providers seeking better performance-per-watt as AI workloads drive energy consumption higher.
Bottom line: for Supermicro, the new systems represent a view that the next phase of AI infrastructure will require a more balanced approach that combines CPUs, accelerators, memory and networking.

