 |
Latest Benchmarks Show Supermicro Systems with the NVIDIA B200 Outperformed the Previous Generation of Systems with 3X the Token Generation Per Second
SAN JOSE, Calif., April 3, 2025 /PRNewswire/ -- Super Micro Computer, Inc. (SMCI), a Total IT Solution Provider for AI/ML, HPC, Cloud, Storage, and 5G/Edge, is announcing first-to-market industry leading performance on several MLPerf Inference v5.0 benchmarks, using the NVIDIA HGXâ„¢ B200 8-GPU. The 4U liquid-cooled and 10U air-cooled systems achieved the best performance in select benchmarks. Supermicro demonstrated more than 3 times the tokens per second (Token/s) generation for Llama2-70B and Llama3.1-405B benchmarks compared to H200 8-GPU systems.
"Supermicro remains a leader in the AI industry, as evidenced by the first new benchmarks released by MLCommons in 2025," said Charles Liang, president and CEO of Supermicro. "Our building block architecture enables us to be first-to-market with a diverse range of systems optimized for various workloads. We continue to collaborate closely with NVIDIA to fine-tune our systems and secure a leadership position in AI workloads."
Learn more about the new MLPerf v5.0 Inference benchmarks at: https://mlcommons.org/benchmarks/inference-datacenter/
Supermicro is the only system vendor publishing record MLPerf inference performance (on select benchmarks) for both the air-cooled and liquid-cooled NVIDIA HGXâ„¢ B200 8-GPU systems. Both air-cooled and liquid-cooled systems were operational before the MLCommons benchmark start date. Supermicro engineers optimized the systems and software to showcase the impressive performance. Within the operating margin, the Supermicro air-cooled B200 system exhibited the same level of performance as the liquid-cooled B200 system. Supermicro has been delivering these systems to customers while we conducted the benchmarks.
MLCommons emphasizes that all results be reproducible, that the products are available and that the results can be audited by other MLCommons members. Supermicro engineers optimized the systems and software, as allowed by the MLCommons rules.
The SYS-421GE-NBRT-LCC (8x NVIDIA B200-SXM-180GB) and SYS-A21GE-NBRT (8x NVIDIA B200-SXM-180GB) showed performance leadership running the Mixtral 8x7B Inference, Mixture of Experts benchmarks with 129,000 tokens/second. The Supermicro air-cooled and liquid-cooled NVIDIA B200 based system delivered over 1,000 tokens/second inference for the large Llama3.1-405b model, whereas the previous generations of GPU systems have much smaller results. For smaller inferencing tasks, using the LLAMA2-70b benchmark, a Supermicro system with the NVIDIA B200 SXM-180GB installed shows the highest performance from a Tier 1 system supplier.
Specifically:
- Stable Diffusion XL (Server)
SYS-A21GE-NBRT (8x B200-SXM-180GB)
#1 queries/s, 28.92
- llama2-70b-interactive-99 (Server)
SYS-A21GE-NBRT (8x B200-SXM-180GB)
#1 Tokens/s, 62,265.70
- Llama3.1-405b (offline)
SYS-421GE-NBRT-LCC (8xB200-SXM-180GB)
#1 Tokens/s 1521.74
- Llama3.1-405b (Server)
SYS-A21GE-NBRT (8x B200-SXNM-180GB)
#1 Tokens/s, 1080.31 (for an 8-GPU node)
- mixtral-8x7b (Server)
SYS-421GE-NBRT-LCC (8x B200-SXM-180GB)
#1 Tokens/s, 129,047.00
- mixtral-8x7b (Offline)
SYS-421GE-NBRT-LCC (8x B200-SXM-180GB)
#1 Tokens/s, 128,795.00
"MLCommons congratulates Supermicro on their submission to the MLPerf Inference v5.0 benchmark. We are pleased to see their results showcasing significant performance gains compared to earlier generations of systems," said David Kanter, Head of MLPerf at MLCommons. "Customers will be pleased by the performance improvements achieved which are validated by the neutral, representative and reproducible MLPerf results."
Supermicro offers a comprehensive AI portfolio with over 100 GPU-optimized systems, both air-cooled and liquid-cooled options, with a choice of CPUs, ranging from single-socket optimized systems to 8-way multiprocessor systems. Supermicro rack-scale systems include computing, storage, and network components, which reduce the time required to install them once they are delivered to a customer site.
Supermicro's NVIDIA HGX B200 8-GPU systems utilize next-generation liquid-cooling and air-cooling technology. The newly developed cold plates and the new 250kW coolant distribution unit (CDU) more than double the cooling capacity of the previous generation in the same 4U form factor. Available in 42U, 48U, or 52U configurations, the rack-scale design with the new vertical coolant distribution manifolds (CDM) no longer occupies valuable rack units. This enables eight systems, comprising 64 NVIDIA Blackwell GPUs in a 42U rack, and up to 12 systems with 96 NVIDIA Blackwell GPUs in a 52U rack.
The new air-cooled 10U NVIDIA HGX B200 system features a redesigned chassis with expanded thermal headroom to accommodate eight 1000W TDP Blackwell GPUs. Up to 4 of the new 10U air-cooled systems can be installed and fully integrated in a rack, the same density as the previous generation, while providing up to 15x inference and 3x training performance.
About Super Micro Computer, Inc.
Supermicro (NASDAQ: SMCI) is a global leader in Application-Optimized Total IT Solutions. Founded and operating in San Jose, California, Supermicro is committed to delivering first-to-market innovation for Enterprise, Cloud, AI, and 5G Telco/Edge IT Infrastructure. We are a Total IT Solutions provider with server, AI, storage, IoT, switch systems, software, and support services. Supermicro's motherboard, power, and chassis design expertise further enables our development and production, enabling next-generation innovation from cloud to edge for our global customers. Our products are designed and manufactured in-house (in the US, Asia, and the Netherlands), leveraging global operations for scale and efficiency and optimized to improve TCO and reduce environmental impact (Green Computing). The award-winning portfolio of Server Building Block Solutions® allows customers to optimize for their exact workload and application by selecting from a broad family of systems built from our flexible and reusable building blocks that support a comprehensive set of form factors, processors, memory, GPUs, storage, networking, power, and cooling solutions (air-conditioned, free air cooling or liquid cooling).
Supermicro, Server Building Block Solutions, and We Keep IT Green are trademarks and/or registered trademarks of Super Micro Computer, Inc.
All other brands, names, and trademarks are the property of their respective owners.
Latest Benchmarks Show Supermicro Systems with the NVIDIA B200 Outperformed the Previous Generation of Systems with 3X the Token Generation Per Second
SAN JOSE, Calif., April 3, 2025 /PRNewswire/ -- Super Micro Computer, Inc. (SMCI), a Total IT Solution Provider for AI/ML, HPC, Cloud, Storage, and 5G/Edge, is announcing first-to-market industry leading performance on several MLPerf Inference v5.0 benchmarks, using the NVIDIA HGXâ„¢ B200 8-GPU. The 4U liquid-cooled and 10U air-cooled systems achieved the best performance in select benchmarks. Supermicro demonstrated more than 3 times the tokens per second (Token/s) generation for Llama2-70B and Llama3.1-405B benchmarks compared to H200 8-GPU systems.
"Supermicro remains a leader in the AI industry, as evidenced by the first new benchmarks released by MLCommons in 2025," said Charles Liang, president and CEO of Supermicro. "Our building block architecture enables us to be first-to-market with a diverse range of systems optimized for various workloads. We continue to collaborate closely with NVIDIA to fine-tune our systems and secure a leadership position in AI workloads."
Learn more about the new MLPerf v5.0 Inference benchmarks at: https://mlcommons.org/benchmarks/inference-datacenter/
Supermicro is the only system vendor publishing record MLPerf inference performance (on select benchmarks) for both the air-cooled and liquid-cooled NVIDIA HGXâ„¢ B200 8-GPU systems. Both air-cooled and liquid-cooled systems were operational before the MLCommons benchmark start date. Supermicro engineers optimized the systems and software to showcase the impressive performance. Within the operating margin, the Supermicro air-cooled B200 system exhibited the same level of performance as the liquid-cooled B200 system. Supermicro has been delivering these systems to customers while we conducted the benchmarks.
MLCommons emphasizes that all results be reproducible, that the products are available and that the results can be audited by other MLCommons members. Supermicro engineers optimized the systems and software, as allowed by the MLCommons rules.
The SYS-421GE-NBRT-LCC (8x NVIDIA B200-SXM-180GB) and SYS-A21GE-NBRT (8x NVIDIA B200-SXM-180GB) showed performance leadership running the Mixtral 8x7B Inference, Mixture of Experts benchmarks with 129,000 tokens/second. The Supermicro air-cooled and liquid-cooled NVIDIA B200 based system delivered over 1,000 tokens/second inference for the large Llama3.1-405b model, whereas the previous generations of GPU systems have much smaller results. For smaller inferencing tasks, using the LLAMA2-70b benchmark, a Supermicro system with the NVIDIA B200 SXM-180GB installed shows the highest performance from a Tier 1 system supplier.
Specifically:
- Stable Diffusion XL (Server)
SYS-A21GE-NBRT (8x B200-SXM-180GB)
#1 queries/s, 28.92
- llama2-70b-interactive-99 (Server)
SYS-A21GE-NBRT (8x B200-SXM-180GB)
#1 Tokens/s, 62,265.70
- Llama3.1-405b (offline)
SYS-421GE-NBRT-LCC (8xB200-SXM-180GB)
#1 Tokens/s 1521.74
- Llama3.1-405b (Server)
SYS-A21GE-NBRT (8x B200-SXNM-180GB)
#1 Tokens/s, 1080.31 (for an 8-GPU node)
- mixtral-8x7b (Server)
SYS-421GE-NBRT-LCC (8x B200-SXM-180GB)
#1 Tokens/s, 129,047.00
- mixtral-8x7b (Offline)
SYS-421GE-NBRT-LCC (8x B200-SXM-180GB)
#1 Tokens/s, 128,795.00
"MLCommons congratulates Supermicro on their submission to the MLPerf Inference v5.0 benchmark. We are pleased to see their results showcasing significant performance gains compared to earlier generations of systems," said David Kanter, Head of MLPerf at MLCommons. "Customers will be pleased by the performance improvements achieved which are validated by the neutral, representative and reproducible MLPerf results."
Supermicro offers a comprehensive AI portfolio with over 100 GPU-optimized systems, both air-cooled and liquid-cooled options, with a choice of CPUs, ranging from single-socket optimized systems to 8-way multiprocessor systems. Supermicro rack-scale systems include computing, storage, and network components, which reduce the time required to install them once they are delivered to a customer site.
Supermicro's NVIDIA HGX B200 8-GPU systems utilize next-generation liquid-cooling and air-cooling technology. The newly developed cold plates and the new 250kW coolant distribution unit (CDU) more than double the cooling capacity of the previous generation in the same 4U form factor. Available in 42U, 48U, or 52U configurations, the rack-scale design with the new vertical coolant distribution manifolds (CDM) no longer occupies valuable rack units. This enables eight systems, comprising 64 NVIDIA Blackwell GPUs in a 42U rack, and up to 12 systems with 96 NVIDIA Blackwell GPUs in a 52U rack.
The new air-cooled 10U NVIDIA HGX B200 system features a redesigned chassis with expanded thermal headroom to accommodate eight 1000W TDP Blackwell GPUs. Up to 4 of the new 10U air-cooled systems can be installed and fully integrated in a rack, the same density as the previous generation, while providing up to 15x inference and 3x training performance.
About Super Micro Computer, Inc.
Supermicro (NASDAQ: SMCI) is a global leader in Application-Optimized Total IT Solutions. Founded and operating in San Jose, California, Supermicro is committed to delivering first-to-market innovation for Enterprise, Cloud, AI, and 5G Telco/Edge IT Infrastructure. We are a Total IT Solutions provider with server, AI, storage, IoT, switch systems, software, and support services. Supermicro's motherboard, power, and chassis design expertise further enables our development and production, enabling next-generation innovation from cloud to edge for our global customers. Our products are designed and manufactured in-house (in the US, Asia, and the Netherlands), leveraging global operations for scale and efficiency and optimized to improve TCO and reduce environmental impact (Green Computing). The award-winning portfolio of Server Building Block Solutions® allows customers to optimize for their exact workload and application by selecting from a broad family of systems built from our flexible and reusable building blocks that support a comprehensive set of form factors, processors, memory, GPUs, storage, networking, power, and cooling solutions (air-conditioned, free air cooling or liquid cooling).
Supermicro, Server Building Block Solutions, and We Keep IT Green are trademarks and/or registered trademarks of Super Micro Computer, Inc.
All other brands, names, and trademarks are the property of their respective owners.
** The press release content is from PR Newswire. Bastille Post is not involved in its creation. **
Industry's First-to-Market Supermicro NVIDIA HGX™ B200 Systems Demonstrate AI Performance Leadership on MLPerf® Inference v5.0 Results
The Initiative Aims to Address the Pain Points of Family Filmmaking
BEIJING, Dec. 13, 2025 /PRNewswire/ -- On December 6th, the 3rd FamilyLens International Film Festival opened in Beijing. SmallRig, a global specialized provider of imaging solutions, collaborating with FamilyLens, is empowering family filmmaking through various initiatives, including the Family Filmmaking Co-Creation Initiative, a dedicated family filmmaking kit, and a social impact screening program.
Deepening the "Co-Creation Initiatives"
The Co-Creation Initiatives are a series of global creative initiatives open to filmmakers and image creators worldwide. Through deep collaboration across multiple dimensions — including product co-creation, discovery and promotion, and content co-creation — SmallRig aims to expand the boundaries of mobile filmmaking and bring the spirit of Free Your Dream to life with creators everywhere.
Following the launch of the Mobile Filmmaking Co-Creation Initiative at the 14th International Smartphone Film Festival, SmallRig announced the launch of the Family Filmmaking Co-Creation Initiative on December 6th at the 3rd FamilyLens International Film Festival. The initiative invites global creators and every family to participate, focusing on three core directions: product co-creation, work promotion, and content co-creation, to explore the possibilities of family filmmaking for everyone.
Mr. Zhou Yang, Founder and CEO of SmallRig, shared the inspiration for the initiative: "The initiative is rooted in the immense universality and profound emotional depth inherent in family narratives, which serve as a common emotional bond connecting global audiences. We firmly believe that in this era, where everyone can be a content creator, every family can and should film their own story. The instinct to create is deeply embedded in the human spirit, and every home is the origin of countless narratives."
Gu Xue, Founder and Director of the FamilyLens International Film Festival, stated: "We hope that people and practitioners around the world who care about family movies can find suitable solutions and gain insights from the Co-Creation Initiatives. We look forward to more and more people exploring the field of 'Home' together through this initiative."
To address the pain points of family filmmaking, SmallRig officially released the SmallRig Family Filmmaking Kit at the opening ceremony. The kit includes a high-quality microphone, fill light, and a portable tripod, specifically designed to achieve "professional function democratization" and "complex feature simplification." During the FamilyLens Workshop, attendees experienced the convenience of the equipment firsthand. Many expressed that the kit truly solves many problems, enabling ordinary families to complete necessary filming without specialized photography knowledge.
Social Impact Screening Program
The 3rd FamilyLens International Film Festival is open to the public from December 6th to 14th. The festival opened with the screening of well-received documentary: K-Family Affairs. In addition to the competition section, the festival features several distinctive sections, including the Reframing Home Movies— An Italian Retrospective, Youth Film Program, Filmmaker in Focus, and the Social Impact Program. Four major awards, such as the Real-Life Portrait Award and the Artistic Exploration Award, will also be presented.
As a key component of the Co-Creation Initiatives, the SmallRig Image Development Fund partnered with the FamilyLens International Film Festival to curate the "Social Impact Screening Program." This unit focuses on elevating awareness of critical issues within the family unit, advocating for a new reflection: "Starting with Seeing, Concluding with Understanding."
Arum Nam, Director of K-Family Affairs, stated:
"Starting with the stories of your family, your friends, and yourself, I believe this personal narrative can connect directly to the bigger society."
SmallRig believes that truly meaningful social impact storytelling stems from awareness in proximity—achieved by using the lens to penetrate the daily surface and enabling a deep, empathetic "Seeing with Empathy."
The four featured works in this unit are:
- People of the Ascent
- Granny's Lost and Found
- Ruixi at Fourteen
- No Country For My Maternal Grandma
These films highlight four family issues that require "seeing": the yearning of left-behind children, the mental isolation of Alzheimer's patients, the elderly searching for subjectivity in their drifting lives, and the emotional volatility and struggle of Bipolar Disorder.
Strategic Outlook and Future Expansion
SmallRig will continue to support the Family Filmmaking Co-Creation Initiative's deep development through promotional campaigns and practical workshops focused on family movie scenarios.
The overall goal of SmallRig's Global Co-creation Initiative is to continuously explore and meet the growing, diverse needs of global creators across different vertical domains. SmallRig is committed to persistently expanding the co-creation model into more imaging sectors, collaborating with industry partners to push the boundaries of imaging and grant global creators broader creative freedom.
About SmallRig
Founded in 2013, SmallRig is an innovation-driven global company that designs and manufactures comprehensive support solutions and accessories for all content creation needs. Trusted by over four million creators globally, SmallRig pioneered the User Co-creation Design (UCD) philosophy and the DreamRig Program.
For more information, visit: www.smallrig.com.
** The press release content is from PR Newswire. Bastille Post is not involved in its creation. **
SmallRig and FamilyLens Launch Global Family Filmmaking Initiative at 3rd FamilyLens International Film Festival
SmallRig and FamilyLens Launch Global Family Filmmaking Initiative at 3rd FamilyLens International Film Festival
SmallRig and FamilyLens Launch Global Family Filmmaking Initiative at 3rd FamilyLens International Film Festival