Supermicro unveils 72-GPU AMD Helios rack-scale platform at Computex

Supermicro and AMD unveiled the Helios platform at Computex - a 72-GPU rack-scale system running AMD Instinct MI455X GPUs and EPYC CPUs. It bundles compute, networking, and cooling into one unit aimed at large AI deployments.

Categorized in: AI News Operations
Published on: Jun 03, 2026
Supermicro unveils 72-GPU AMD Helios rack-scale platform at Computex

Supermicro and AMD Launch Helios Rack-Scale AI Platform for Large-Scale Deployments

Supermicro and AMD unveiled the next-generation Helios platform at Computex, a rack-scale system designed to simplify how organizations deploy and operate large-scale AI workloads. The 72-GPU double-width rack runs AMD Instinct MI455X GPUs paired with sixth-generation AMD EPYC CPUs and AMD Pensando networking technology.

For operations teams, Helios addresses a core challenge: scaling AI infrastructure without proportional increases in complexity. The platform uses a modular design that allows organizations to expand from single racks to full clusters without redesigning their deployment strategy.

What the Platform Delivers

Helios bundles compute, networking, and software into a single rack-scale unit. The system handles large-scale model training, inference, and fine-tuning workloads while managing power consumption through integrated cooling.

Key operational features include:

  • Modular scalability from individual racks to multi-rack clusters
  • Open networking for both scale-up and scale-out AI deployments
  • Integrated virtualization and software acceleration
  • Advanced security built into the architecture

Charles Liang, Supermicro's president and CEO, said the platform shifts infrastructure design from individual servers to complete rack-scale systems. "By combining Supermicro's DCBBS with AMD Instinct MI455X GPU architecture, we deliver unprecedented AI performance, improved power efficiency through advanced cooling, and scalable infrastructure for next-generation AI workloads."

Operational Impact

The unified architecture reduces deployment time. Organizations can move from procurement to production faster because components are pre-validated and integrated. This matters for teams managing timelines on large AI projects.

Resource utilization improves through the platform's open networking and software stack. Operations staff can monitor and allocate compute across workloads more efficiently than managing individual server deployments.

Ravi Pendekanti, AMD's corporate vice president for Data Center Solutions, said the next phase of AI infrastructure depends on deployment speed and efficiency. "The next era of AI will be defined not only by more compute, but by how efficiently that compute can be deployed, connected and scaled."

Who This Targets

The platform is designed for cloud service providers, hyperscalers, and enterprises running sovereign AI, LLM training, or high-throughput inference at scale. Organizations managing multiple AI projects can use the modular design to allocate resources to different teams without separate infrastructure builds.

Supermicro demonstrated the system at Computex in Taipei and offers an interactive demo through its A+ Superverse platform for those evaluating the technology.

AI Learning Path for Operations Managers covers how to evaluate and implement AI infrastructure decisions like this in your organization.


Get Daily AI News

Your membership also unlocks:

700+ AI Courses
700+ Certifications
Personalized AI Learning Plan
6500+ AI Tools (no Ads)
Daily AI News by job industry (no Ads)