
Nota AI Has Two MoE Quantization Papers Accepted at ICML 2026 Workshop, Demonstrating Global Competitiveness in Large-Scale AI Optimization


2026-06-11 20:00 (last updated 20:15)

  • Two papers on MoE-specific quantization algorithms accepted at a workshop held in conjunction with ICML 2026
  • Recognition follows Nota AI's overall win at the NVIDIA Nemotron Hackathon
  • Strengthening core optimization technologies to make large-scale AI models smaller and more efficient to run

SEOUL, South Korea, June 11, 2026 /PRNewswire/ -- Nota AI, a company specializing in AI model compression and optimization, announced that two of its papers on MoE-specific quantization algorithms have been accepted to the Resource-Adaptive Foundation Model Inference (AdaptFM) Workshop at ICML 2026, one of the world's leading machine learning conferences.

ICML is one of the premier international conferences in machine learning and artificial intelligence, bringing together research from technology companies, universities, and research institutions. The AdaptFM Workshop focuses on technologies that let large-scale AI models run efficiently under limited computing resources. Its organizing committee includes researchers from companies and research institutions such as Amazon and Meta, and its program committee includes researchers from NVIDIA, Qualcomm AI Research, OpenAI, Apple, and Microsoft, among others.

The acceptance recognizes Nota AI's accumulated expertise in optimizing Mixture-of-Experts (MoE) models, an architecture increasingly regarded as a core structure for large language models (LLMs). MoE models improve both performance and efficiency by activating only a subset of their expert sub-networks for each input. However, this structure calls for a different approach to quantization, which makes a model smaller and more efficient by storing its values at lower numerical precision, than conventional model architectures require.
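To make the idea of activating only a subset of experts concrete, the following is a minimal sketch of top-k expert routing together with a basic round-to-nearest quantizer. It is an illustration only, not Nota AI's method: the function names, the choice of k, and the symmetric quantization scheme are all assumptions made for this example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_experts(router_logits, k=2):
    """Indices of the k highest-scoring experts (ties go to the lower index)."""
    order = sorted(range(len(router_logits)), key=lambda i: (-router_logits[i], i))
    return order[:k]


def moe_forward(x, router_weights, experts, k=2):
    """Hypothetical MoE layer: route x to k experts and mix their outputs."""
    # One router logit per expert: dot product of x with that expert's router row.
    logits = [sum(w * xi for w, xi in zip(row, x)) for row in router_weights]
    chosen = top_k_experts(logits, k)
    gates = softmax([logits[i] for i in chosen])
    out = [0.0] * len(x)
    # Only the chosen experts are evaluated; the rest are skipped entirely.
    for gate, idx in zip(gates, chosen):
        y = experts[idx](x)
        out = [o + gate * yi for o, yi in zip(out, y)]
    return out, chosen


def quantize_symmetric(values, bits=8):
    """Round-to-nearest symmetric quantization; returns the dequantized values."""
    qmax = 2 ** (bits - 1) - 1
    scale = max(abs(v) for v in values) / qmax or 1.0  # avoid a zero scale
    return [round(v / scale) * scale for v in values]
```

In this sketch, quantizing the router rows or the expert weights perturbs the logits slightly, which is why the set of chosen experts can differ from the full-precision model.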

Nota AI previously won both its track and the overall competition at the NVIDIA Nemotron Hackathon with a data-driven MoE quantization method. With the acceptance of these two papers, Nota AI will once again present research outcomes specifically designed for MoE architectures on a global research stage.

The first accepted paper, "DREAM-MoE," proposes a method to reduce the changes in a model's decision flow that can occur when a large-scale AI model is quantized across multiple segments. It builds on the observation that even a small error in an earlier segment can affect expert selection in later segments. DREAM-MoE helps the quantized model select experts in a way that stays closer to the original model.
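The effect described here, a small numerical error changing which expert is selected, can be illustrated with a short sketch. This is not the DREAM-MoE algorithm, whose details are not given in this release; the metric name and the example logits below are hypothetical and chosen only to show the phenomenon.

```python
def top_k(logits, k):
    """Set of indices of the k largest logits (ties go to the lower index)."""
    order = sorted(range(len(logits)), key=lambda i: (-logits[i], i))
    return set(order[:k])


def routing_agreement(ref_logits, quant_logits, k=1):
    """Fraction of tokens whose top-k expert set is unchanged after quantization."""
    same = sum(top_k(r, k) == top_k(q, k) for r, q in zip(ref_logits, quant_logits))
    return same / len(ref_logits)


# Hypothetical router logits for two tokens and three experts.
reference = [[2.00, 1.95, 0.10], [3.0, 0.5, 0.2]]
# The same logits after a small quantization-style perturbation.
quantized = [[1.96, 1.99, 0.12], [2.9, 0.6, 0.2]]

print(routing_agreement(reference, quantized, k=1))  # 0.5: the first token switched experts
print(routing_agreement(reference, quantized, k=2))  # 1.0: both top-2 sets are unchanged
```

For the first token, a shift of about 0.04 in two nearly tied logits is enough to change the top-1 expert, even though the perturbation itself is small.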

The second paper, "SRA-MoE," proposes a method that identifies and prioritizes important inputs that have a greater impact on the model's final output. Rather than treating all inputs equally, SRA-MoE is designed to prevent expert selection from being significantly disrupted for these key inputs, helping maintain model quality more effectively under limited resources.
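One way to picture prioritizing important inputs is to weight routing mismatches by a per-input importance score rather than counting every input equally. The sketch below does only that; it is not the SRA-MoE method, and both the importance scores and the function name are assumptions, since the release does not say how the paper defines or uses importance.

```python
def top_k(logits, k):
    """Set of indices of the k largest logits (ties go to the lower index)."""
    order = sorted(range(len(logits)), key=lambda i: (-logits[i], i))
    return set(order[:k])


def weighted_routing_mismatch(ref_logits, quant_logits, importance, k=1):
    """Importance-weighted share of inputs whose top-k expert set changed."""
    total = sum(importance)
    if total <= 0:
        raise ValueError("importance scores must sum to a positive value")
    changed = sum(
        w
        for r, q, w in zip(ref_logits, quant_logits, importance)
        if top_k(r, k) != top_k(q, k)
    )
    return changed / total


# Hypothetical router logits before and after quantization (two inputs, three experts).
reference = [[2.00, 1.95, 0.10], [3.0, 0.5, 0.2]]
quantized = [[1.96, 1.99, 0.12], [2.9, 0.6, 0.2]]

# Treating both inputs equally: half of the weight is mismatched.
print(weighted_routing_mismatch(reference, quantized, [1.0, 1.0]))  # 0.5
# If the first input matters three times as much, the same flip costs more.
print(weighted_routing_mismatch(reference, quantized, [3.0, 1.0]))  # 0.75
```

Under a measure like this, a quantization scheme would be judged more harshly for disturbing expert selection on high-importance inputs than on low-importance ones.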

Nota AI reports that both methods outperformed recent MoE-specific quantization methods, indicating that large-scale AI models can run with less memory and fewer computing resources while limiting quality degradation. As the cost, power consumption, and hardware burden of running large AI models continue to rise, MoE-specific quantization is becoming increasingly important.

Nota AI has focused its R&D on optimizing large AI models that require substantial memory and computing resources. The company is working on large-scale model optimization, including for Solar MoE, as part of the sovereign foundation model project led by the Upstage consortium. It is also extending its experience quantizing NVIDIA Nemotron 3 Nano to newer large models such as Nemotron Ultra, broadening the scope of its optimization technologies.

"This paper acceptance reflects Nota AI's continued advancement of MoE-specific quantization technologies," said Myungsu Chae, CEO of Nota AI. "Following our overall win at the NVIDIA Nemotron Hackathon, we are pleased to present our research at the ICML 2026 AdaptFM Workshop. We will continue developing optimization technologies that enable large-scale AI models to be used more efficiently and practically."

In addition, Nota AI will host "Nota AI - Korea Efficient Days" during ICML 2026 at COEX in Seoul. The event will bring together global researchers, engineers, and business leaders visiting Korea to share research trends and industrial applications of Efficient AI. Through the event, Nota AI plans to introduce its research achievements in large-scale AI model optimization and expand opportunities for technical collaboration and business engagement.

** This press release is distributed by PR Newswire through automated distribution system, for which the client assumes full responsibility. **


SHANGHAI, June 14, 2026 /PRNewswire/ -- The Gala Night of the 28th Shanghai International Film Festival (SIFF) was held in Shanghai today, bringing together filmmakers, industry professionals and audiences from around the world in celebration of cinema.

Running through June 20, this year's festival will present more than 420 films across over 1,600 screenings in Shanghai and cities throughout the Yangtze River Delta. As one of only 17 film festivals worldwide to hold official A-list classification from FIAPF, SIFF continues to serve as a major platform for international cinematic exchange.

The winners of the Golden Goblet Awards will be announced at the festival's awards ceremony on June 20.

A Night of Cinema Along the Huangpu River

The Gala Night transformed Shanghai into a vibrant stage for global cinema, welcoming filmmakers, actors and industry professionals from around the world. Golden Goblet Awards jurors, nominated filmmakers and representatives of major Chinese and international productions gathered on the red carpet, showcasing the festival's growing international profile and influence.

One of the evening's most significant moments was the presentation of the festival's honorary awards. Renowned actress and cultural ambassador Lisa Lu received the Lifetime Achievement Award, while acclaimed Chinese director Zhang Yimou was honored with the Outstanding Contribution to Chinese Cinema Award, recognizing their exceptional achievements and enduring contributions to the development of cinema.

The celebration once again highlighted Shanghai's unique role as a city where cinema, culture and international exchange converge.

Opening Film Afterpiece Makes World Premiere

This year's opening film, Afterpiece, made its world premiere as one of the festival's most anticipated events. Directed by Keane T.K. Wong and produced by Derek Yee, the film stars Stephen Fung, Chrissie Chau, Myolie Wu and Angela Yuen. The story follows a once-celebrated theatre director seeking creative redemption while navigating the increasingly blurred boundaries between performance and reality.

Developed under a Hong Kong film mentorship initiative pairing established filmmakers with emerging directors, Afterpiece reflects the continued transmission of creative talent across generations in Hong Kong cinema.

Members of the cast and creative team attended the Gala Night and participated in audience activities following the premiere screening.

International Jury Reflects Global Vision

Members of the Golden Goblet Awards jury met with the media in Shanghai ahead of this year's festival activities, representing filmmakers and industry professionals from around the world.

The seven-member Main Competition jury is chaired by acclaimed actor Tony Leung Chiu-wai and includes Chinese director Guan Hu, actress Xin Zhilei, Tunisian producer Dora Bouchoucha, Kyrgyz director Aktan Arym Kubat, Georgian filmmaker Déa Kulumbegashvili and Mexican director Fernanda Valadez.

"It is a great honor to serve as jury president of the 28th Shanghai International Film Festival. Cinema is the art of dreaming, and Shanghai is the very vessel on which the Chinese film dream set sail. I hold a few more tickets for this voyage—would you like to join me?" said Tony Leung Chiu-wai.

The jury's diverse composition reflects SIFF's longstanding commitment to fostering dialogue among filmmakers from different cultures and creative traditions.

A Truly Global Stage for Cinema

This year, SIFF received approximately 4,100 submissions from 125 countries and regions, setting a new record in the festival's history. Among nearly 3,000 eligible entries, 82 percent are world or international premieres, underscoring the festival's growing global influence and appeal.

Audience enthusiasm remained strong, with online ticket sales launching on June 5 and 250,000 tickets sold within the first 15 minutes.

Beyond screenings, SIFF continues to promote international collaboration through initiatives including SIFF PROJECT, the International Film & TV Market, industry forums and masterclasses. The Belt and Road Film Festival Alliance, initiated by SIFF, now includes 57 member institutions from 50 countries, further strengthening cooperation among film communities worldwide.

Chen Guo, Managing Director of Shanghai International Film & TV Events Center, said: "Our goal is to use the festival platform to help films transcend geographic, linguistic and cultural boundaries, encouraging deeper cultural dialogue and creative exchange among filmmakers and audiences from different parts of the world."

The International Film & TV Market will be held during the festival period, bringing together exhibitors, industry organizations and professionals from around the world to explore new opportunities in content creation, technological innovation and international cooperation.

For more information and the full festival schedule, please visit the official SIFF website at https://www.siff.com/english/

** This press release is distributed by PR Newswire through automated distribution system, for which the client assumes full responsibility. **

Gala Night of the 28th Shanghai International Film Festival Held in Shanghai
