Skip to Content Facebook Feature Image

AWS and Cerebras Collaboration Aims to Set a New Standard for AI Inference Speed and Performance in the Cloud

Business

AWS and Cerebras Collaboration Aims to Set a New Standard for AI Inference Speed and Performance in the Cloud
Business

Business

AWS and Cerebras Collaboration Aims to Set a New Standard for AI Inference Speed and Performance in the Cloud

2026-03-13 23:07 Last Updated At:03-14 12:51

SEATTLE & SUNNYVALE, Calif.--(BUSINESS WIRE)--Mar 13, 2026--

Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), and Cerebras Systems today announced a collaboration that will, in the coming months, deliver the fastest AI inference solutions available for generative AI applications and LLM workloads. The solution, to be deployed on Amazon Bedrock in AWS data centers, combines AWS Trainium-powered servers, Cerebras CS-3 systems, and Elastic Fabric Adapter (EFA) networking. Later this year, AWS will also offer leading open-source LLMs and Amazon Nova using Cerebras hardware.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260313406341/en/

Amazon is deploying Cerebras Wafer Scale Engines in AWS datacenters​. Ultra fast inference will be available through AWS Bedrock, bringing industry leading performance to the largest hyperscale cloud.​

“Inference is where AI delivers real value to customers, but speed remains a critical bottleneck for demanding workloads like real-time coding assistance and interactive applications,” said David Brown, Vice President, Compute & ML Services, AWS. “What we're building with Cerebras solves that: by splitting the inference workload across Trainium and CS-3, and connecting them with Amazon’s Elastic Fabric Adapter, each system does what it's best at. The result will be inference that's an order of magnitude faster and higher performance than what's available today."

“Partnering with AWS to build a disaggregated inference solution will bring the fastest inference to a global customer base,” said Andrew Feldman, Founder and CEO of Cerebras Systems. “Every enterprise around the world will be able to benefit from blisteringly fast inference within their existing AWS environment.”

How It Works: Inference Disaggregation

The Trainium + CS-3 solution enables “inference disaggregation,” a technique which separates AI inference into two stages: prompt processing, or “prefill,” and output generation, or “decode.” These two stages have profoundly different computational characteristics. Prefill is natively parallel, computationally intensive, and requires moderate memory bandwidth. Decode, on the other hand, is inherently serial, computationally light, and memory bandwidth intensive. Decode typically represents the majority of inference time in these scenarios because each output token must be generated sequentially.

Because each stage has a different computational challenge, they each benefit from different compute architectures and low-latency, high-bandwidth EFA networking between them. By strategically disaggregating the inference problem — with Trainium optimized for prefill and the Cerebras CS-3 optimized for decode — the two different computational challenges can be optimized in a specialized way.

Built on the AWS Nitro System — the foundation of AWS's secure, high-performance cloud infrastructure — the new solution will ensure that Cerebras CS-3 systems and Trainium-powered instances operate with the same security, isolation, and operational consistency customers expect from AWS.

AWS Trainium for Prefill and Cerebras CS-3 for Decode

Trainium is Amazon's purpose-built AI chip, designed to deliver scalable performance and cost efficiency for training and inference across a broad range of generative AI workloads. Two of the world's leading AI labs—Anthropic and OpenAI—are committed to Trainium. Anthropic has named AWS its primary training partner and is using Trainium to train and deploy its models, while OpenAI will consume 2 gigawatts of Trainium capacity through AWS infrastructure to support demand for Stateful Runtime Environment, frontier models, and other advanced workloads. Since its recent release, Trainium3 has seen strong customer adoption, with organizations across industries committing significant capacity.

Cerebras' CS-3 is the world's fastest AI inference system. It delivers thousands of times greater memory bandwidth than the fastest GPU. As reasoning models now represent a majority of inference to compute and generate more tokens per request as they “think” through problems, the need to accelerate this portion of the workflow has grown accordingly. OpenAI, Cognition, Mistral, and others use Cerebras to accelerate their most demanding workloads, especially agentic coding where developer productivity is constrained by inference speed.

In the disaggregated solution, CS-3 will be fully dedicated to decoding acceleration, enabling dramatically higher capacity for fast output tokens. With Trainium handling prefill, the CS-3 handling decode operations, and high-speed EFA networking connecting them, each processor will deliver maximum token capacity for its focused part of the workload.

About Amazon Web Services

Amazon Web Services (AWS) is guided by customer obsession, pace of innovation, commitment to operational excellence, and long-term thinking. By democratizing technology for nearly two decades and making cloud computing and generative AI accessible to organizations of every size and industry, AWS has built one of the fastest-growing enterprise technology businesses in history. Millions of customers trust AWS to accelerate innovation, transform their businesses, and shape the future. With the most comprehensive AI capabilities and global infrastructure footprint, AWS empowers builders to turn big ideas into reality. Learn more at aws.amazon.com and follow @AWSNewsroom.

About Cerebras Systems

Cerebras Systems builds the fastest AI infrastructure in the world. We are a team of pioneering computer architects, computer scientists, AI researchers, and engineers of all types. We have come together to make AI blisteringly fast through innovation and invention because we believe that when AI is fast it will change the world. Our flagship technology, the Wafer Scale Engine 3 (WSE-3) is the world’s largest and fastest AI processor. 56 times larger than the largest GPU, the WSE uses a fraction of the power per unit compute while delivering inference and training more than 20 times faster than the competition. Leading corporations, research institutes and governments on four continents chose Cerebras to run their AI workloads. Cerebras solutions are available on premise and in the cloud, for further information, visit cerebras.ai or follow us on LinkedIn, X and/or Threads.

This press release contains forward-looking statements, including statements regarding the expected benefits of our products and the transaction described herein. These statements are subject to risks and uncertainties that could cause actual results to differ materially. Neither we nor any other person assumes responsibility for the accuracy and completeness of forward-looking statements. The forward-looking statements included in this press release relate only to events and information as of the date hereof. Cerebras undertakes no obligation to update or revise any forward-looking statement as a result of new information, future events or otherwise, except as otherwise required by law.

Amazon is deploying Cerebras Wafer Scale Engines in AWS datacenters. Ultra fast inference will be available through AWS Bedrock, bringing industry leading performance to the largest hyperscale cloud.

Amazon is deploying Cerebras Wafer Scale Engines in AWS datacenters. Ultra fast inference will be available through AWS Bedrock, bringing industry leading performance to the largest hyperscale cloud.

WASHINGTON (AP) — The Trump administration on Thursday loosened federal rules that require grocery stores and air-conditioning companies to reduce greenhouse gases used in cooling equipment, a step President Donald Trump said would help lower grocery costs.

Trump, at a White House ceremony, said the action by the Environmental Protection Agency would “substantially lower costs for consumers” by delaying costly restrictions that limit the type of refrigerants U.S. businesses and families can use.

The move to relax the Biden-era rules on harmful pollutants known as HFCs emitted by refrigerators and other appliances was the latest attempt by the Trump administration to try to address rising voter concerns over the cost of living ahead of pivotal elections in November.

It is not clear how much or how quickly the loosening of the refrigerant rule might impact grocery prices. Industry groups said the move could even raise prices because manufacturers have already redesigned products, retooled factories and trained workers to build and service next-generation refrigerant equipment.

Inflation in the United States increased to 3.8% annually in April, amid price spikes caused by the Iran war and President Donald Trump’s sweeping tariffs. Inflation is now outpacing wage gains as the war has kept oil and gasoline prices high.

The Biden-era regulation was “unnecessary and costly and actually makes the machinery worse,” Trump said at a ceremony joined by top executives from Kroger, Piggly Wiggly and other grocery chains. The EPA action will protect hundreds of thousands of jobs and save Americans more than $2 billion a year, he said.

The Air-Conditioning, Heating and Refrigeration Institute, which represents more than 330 HVAC manufacturers and commercial refrigeration companies, said the change in approach would “inject uncertainty across the market” and could even raise prices.

“This rule works against basic supply and demand,” said Stephen Yurek, the group’s president and CEO. “By extending the compliance deadline” for phasing out hydrofluorocarbons, or HFCs, the administration “is maintaining and even increasing demand in the market for existing refrigerants while supply continues to fall.”

Manufacturers have already retooled product lines and certified models based on the existing timeline, Yurek said. Nearly 90% of residential and light commercial air conditioning systems use substitute refrigerants, rather than HFCs, he said.

The administration's action on refrigerants represents a reversal after Trump signed a law in his first term that aimed to reduce harmful, planet-warming pollutants emitted by refrigerators and air conditioners. That bipartisan measure brought environmentalists and major business groups into rare alignment on the contentious issue of climate change and won praise across the political spectrum.

The 2020 law reflected a broad bipartisan consensus on the need to quickly phase out domestic use of HFCs, greenhouse gases that are thousands of times more potent than carbon dioxide and are considered a major driver of global warming.

The EPA action highlights the second Trump administration’s drive to roll back regulations perceived as climate friendly. The plan is among a series of sweeping environmental changes that EPA Administrator Lee Zeldin has said will put a “dagger through the heart of climate change religion.”

Environmentalists criticized the administration’s actions, saying the new rule would exacerbate climate pollution while disrupting a yearslong industry transition to new coolants as an alternative to HFCs.

The 2020 law signed by Trump, known as the American Innovation and Manufacturing Act, phased out HFCs as part of an international agreement on ozone pollution. The law accelerated an industry shift to alternative refrigerants that use less harmful chemicals and are widely available.

The U.S. Chamber of Commerce and the American Chemistry Council, the top lobbying group for the chemical industry, were among numerous business groups that supported the law and an international deal on pollutants, known as the Kigali Amendment, as victories for jobs and the environment. U.S. companies such as Chemours and Honeywell developed and produce the alternative refrigerants sold in the United States and around the world.

The 2023 rule now being relaxed imposed steep restrictions on HFCs starting in 2026. Zeldin said the rule from the Democratic Biden administration did not give companies enough time to comply and that the rapid switch to other refrigerants caused shortages and price increases last year. Some in the industry dispute this.

The Food Industry Association, which represents grocery stores and suppliers, applauded the Trump EPA proposal last year, saying the earlier rule “imposed significant and unrealistic compliance timelines.”

Kevin McDaniel, Piggly Wiggly franchise owner, speaks during an event with President Donald Trump about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

Kevin McDaniel, Piggly Wiggly franchise owner, speaks during an event with President Donald Trump about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

Kroger CEO Greg Foran speaks speaks during an event with President Donald Trump about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

Kroger CEO Greg Foran speaks speaks during an event with President Donald Trump about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

Lee Zeldin, Environmental Protection Agency administrator, listens as President Donald Trump speaks during an event about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

Lee Zeldin, Environmental Protection Agency administrator, listens as President Donald Trump speaks during an event about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

President Donald Trump speaks during an event about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

President Donald Trump speaks during an event about loosening a federal refrigerant rule, in the Oval Office at the White House, Thursday, May 21, 2026, in Washington. (AP Photo/Jacquelyn Martin)

FILE - A shop owner reaches into a drink display refrigerator at his convenience store in Kent, Wash., Oct. 1, 2018. (AP Photo/Elaine Thompson, File)

FILE - A shop owner reaches into a drink display refrigerator at his convenience store in Kent, Wash., Oct. 1, 2018. (AP Photo/Elaine Thompson, File)

Recommended Articles