Supermicro and AMD Launch Helios Rack-Scale AI Platform for Large-Scale Deployments
Supermicro and AMD unveiled the next-generation Helios platform at Computex, a rack-scale system designed to simplify how organizations deploy and operate large-scale AI workloads. The 72-GPU double-width rack runs AMD Instinct MI455X GPUs paired with sixth-generation AMD EPYC CPUs and AMD Pensando networking technology.
For operations teams, Helios addresses a core challenge: scaling AI infrastructure without proportional increases in complexity. The platform uses a modular design that allows organizations to expand from single racks to full clusters without redesigning their deployment strategy.
What the Platform Delivers
Helios bundles compute, networking, and software into a single rack-scale unit. The system handles large-scale model training, inference, and fine-tuning workloads while managing power consumption through integrated cooling.
Key operational features include:
- Modular scalability from individual racks to multi-rack clusters
- Open networking for both scale-up and scale-out AI deployments
- Integrated virtualization and software acceleration
- Advanced security built into the architecture
Charles Liang, Supermicro's president and CEO, said the platform shifts infrastructure design from individual servers to complete rack-scale systems. "By combining Supermicro's DCBBS with AMD Instinct MI455X GPU architecture, we deliver unprecedented AI performance, improved power efficiency through advanced cooling, and scalable infrastructure for next-generation AI workloads."
Operational Impact
The unified architecture reduces deployment time. Organizations can move from procurement to production faster because components are pre-validated and integrated. This matters for teams managing timelines on large AI projects.
Resource utilization improves through the platform's open networking and software stack. Operations staff can monitor and allocate compute across workloads more efficiently than managing individual server deployments.
Ravi Pendekanti, AMD's corporate vice president for Data Center Solutions, said the next phase of AI infrastructure depends on deployment speed and efficiency. "The next era of AI will be defined not only by more compute, but by how efficiently that compute can be deployed, connected and scaled."
Who This Targets
The platform is designed for cloud service providers, hyperscalers, and enterprises running sovereign AI, LLM training, or high-throughput inference at scale. Organizations managing multiple AI projects can use the modular design to allocate resources to different teams without separate infrastructure builds.
Supermicro demonstrated the system at Computex in Taipei and offers an interactive demo through its A+ Superverse platform for those evaluating the technology.
AI Learning Path for Operations Managers covers how to evaluate and implement AI infrastructure decisions like this in your organization.
Your membership also unlocks: