Skip to Content Facebook Feature Image

ShengShu Technology Unveils World Action Model "Motubrain": One Brain, Infinite Possibilities for Robotic Intelligence

Business

ShengShu Technology Unveils World Action Model "Motubrain": One Brain, Infinite Possibilities for Robotic Intelligence
Business

Business

ShengShu Technology Unveils World Action Model "Motubrain": One Brain, Infinite Possibilities for Robotic Intelligence

2026-04-29 21:30 Last Updated At:21:45

From understanding and generating the world to taking action, Motubrain tops two global benchmarks and redefines the embodied AI landscape

SINGAPORE, April 29, 2026 /PRNewswire/ -- ShengShu Technology today announces Motubrain, a World Action Model that replaces multiple task-specific systems with a single, unified model that functions as a robotic brain for the physical world. Ranking highly on both WorldArena and RoboTwin 2.0, two of the field's most rigorous benchmarks in embodied world models, Motubrain marks a decisive shift in an industry where robotic systems are typically built from task-specific or specialized systems.

Best known for its leading video model Vidu, ShengShu Technology and its advancements in generative video for robotics earmarks an industry first. Generative video has laid the foundation for simulating robots in real-world environments at scale. Motubrain builds on this by turning those simulations into action, by enabling robots to learn from diverse, large-scale pre-training data while reducing reliance on traditional physical data collection.

"A true world model must be able to build a unified representation of the real world and predict how it evolves," said Jun Zhu, Founder of ShengShu Technology. "Video is a critical foundation of that intelligence because it naturally captures time, space, motion, causality, and physical dynamics at scale. We believe general world models should not be built as stitched-together modules, but as a unified architecture that brings together perception, reasoning, prediction, generation, and action in a single system. That is what can ultimately bridge the digital world and the physical world."

Global Rankings: Among the Top Performers in Embodied AI

Motubrain has delivered top-tier performance on leading embodied AI benchmarks. Ranked among the industry's best models for robotic perception, anticipation, and planning in the physical world, Motubrain achieved a 63.77 EWM Score on WorldArena. It has also been recognized as one of the strongest performers on RoboTwin 2.0, scoring an average of 96.0 across 50 predetermined tasks, and remains the only model to exceed 95.0 in randomized environments.

The Architecture Behind the Breakthrough

Motubrain's core breakthrough is unifying the "seen world" and the "actions to take" within a single model, and it is built on four core principles that together redefine what an embodied AI model for training robots can be:

  • One Brain, Many Skills: A unified model that can handle a wide range of tasks and gets smarter and stronger as task variety increases. Training each skill one by one is no longer required, and unlike conventional models, the wider the range of complex tasks it handles at once, Motubrain's success rate and reliability with multi-tasking increases.
  • One Brain, Universal Across Robots: Motubrain isn't built for a single robot model. It's designed to be a universal brain that can power many kinds of robots. This breaks the old "one robot, one model" pattern. And as more robot types, real‑world scenarios, and data join the ecosystem, Motubrain keeps getting smarter, which in turn helps every robot in the network perform better.
  • One Brain, End-to-End: Motubrain learns entire task sequences directly. It can handle complex, multi‑step tasks involving up to 10 atomic actions, also known as the smallest unit of movement in robotics, far beyond the typical 2–3 atomic actions. So the robot no longer sees isolated actions; it sees a complete, meaningful task from start to finish.
  • One Brain, Able to Anticipate: Predicts the world while driving action. Environmental change, task progression, and execution are processed together inside one model, not assembled from separate subsystems.

To deliver this, Motubrain is built on a Unified Multimodal Model that treats video and action as two continuous modalities to be learned together. A single training run gives it five capabilities at once: vision-language-action control (VLA), world modelling, video generation, inverse dynamics modelling (IDM), and joint video-action prediction. A three-stream Mixture-of-Transformers (MoT) then brings video, action, and language together by drawing on the strengths of existing pretrained models, enabling Motubrain to understand environments, follow language instructions, predict what happens next, and generate actions all at the same time. Unlike systems that chain together separate perception, planning, and control modules, Motbrain processes the full loop.

Motbrain learns from a far broader range of data than conventional AI models that train robots, including unlabelled video, task recordings without language annotations, and data from different robot embodiments. A proprietary latent action framework extracts physical motion directly from large-scale video, including human footage, simulation data, and multi-robot task trajectories, without requiring the data to be labelled or tagged to indicate specific actions. This broader learning paradigm translates into strong scaling behavior. In task-scaling evaluations, Motubrain's average success rate continued to rise as the number of training tasks increased, reaching approximately 92% at 50 tasks, while Pi-0.5 declined to roughly 68% over the same range. In data-scaling evaluations, Motubrain also maintained a clear advantage as the number of training episodes increased, achieving about 92% average success at 27,500 episodes, compared with roughly 85% for Motus and 68% for Pi-0.5. A three-stage pipeline built on a six-layer data pyramid lets Motubrain generalise skills across environments and robot types while remaining precise enough for fine-grained deployment scenarios.

Motubrain understands what is happening around it, anticipates what may happen next, and responds in real time. In real-world tests, robots trained with Motubrain have carried out complete, multi-step tasks with a level of adaptability beyond most conventional robotic systems. For example, they can insert flowers into a vase under changing conditions and use both arms independently for different goals. Most notably, Motubrain-trained robots demonstrate a remarkable ability to understand and predict outcomes during execution: when a ladle comes up empty while scooping, they can recognise that nothing has been collected and automatically attempt the scooping action again, despite never being trained on retry data. This marks the shift from robots that merely execute tasks to robots that truly complete them.

Training the Next Generation of Robots

Motubrain is not a research model awaiting commercialisation; it is operational. Several leading robotics companies are already using MotuBrain in active robot training programs, deploying its cross-embodiment, multi-skill capabilities on real hardware across industrial, commercial, and home environments.

To further enhance real-world performance, ShengShu has partnered with Astribot, SimpleAI, and Anyverse Dynamics to advance a general-purpose embodied AI brain, focusing on foundation model evolution, multimodal data integration, robust data infrastructure, and full-stack hardware–software optimisation.

Connecting the Dots: Alibaba's Investment and Motubrain

Motubrain is ShengShu's next strategic pillar, alongside Vidu, the company's flagship generative video platform, which its recent Vidu Q3 ranked No.1 in the first global Reference-to-Video leaderboard released by SuperCLUE. The two products are distinct in application but continuous at the foundation: the same world model technology that makes Vidu one of the world's leading video generation systems gives Motubrain its capacity to predict and act in the physical world. Where Vidu generates the world, Motubrain acts in it.

Backed by a $293 million Series B led by Alibaba Cloud and with investors including the China Internet Investment Fund, TAL Education Group, Baidu Ventures, and Luminous Ventures, ShengShu enters the Physical AI era as a leader, achieving successful live deployments and boasting the highest benchmarks for its unique ability to both deeply understand and effectively act upon its tasks.

To learn more about Motubrain, visit the official website: https://www.shengshu.com/en/motubrain

To access the visual assets, click here

About ShengShu Technology

Founded in March 2023, ShengShu Technology is a world-leading artificial intelligence company, specializing in the development of Multimodal Large Language Models. Driven by innovation, the company delivers cutting-edge MaaS and SaaS products that revolutionize creative production by enabling smarter, faster, and more scalable content creation. With its flagship video generation platform Vidu, ShengShu Technology's solutions have reached more than 200 countries and regions around the world, spanning fields including interactive entertainment, advertising, film, animation, cultural tourism, and more.

** This press release is distributed by PR Newswire through automated distribution system, for which the client assumes full responsibility. **

ShengShu Technology Unveils World Action Model "Motubrain": One Brain, Infinite Possibilities for Robotic Intelligence

ShengShu Technology Unveils World Action Model "Motubrain": One Brain, Infinite Possibilities for Robotic Intelligence

Presented at Brazil's AgriShow, New Report Highlights Maturing Industry with over 600,000 DJI Agricultural Drones Now Deployed in 100+ Countries and Regions.

SHENZHEN, China, April 29, 2026 /PRNewswire/ -- DJI Agriculture, the global leader in innovative agricultural drone technology, today unveiled its fifth annual Agricultural Drone Industry Insight Report (2025/2026) at Agrishow 2026 in Ribeirão Preto, Brazil. The report highlights how global policies are trending toward liberalization, standardization, and strategic integration. Meanwhile, DJI Agriculture strengthened its network of 3,500 service and repair centers worldwide while advocating for standardized drone operations. By the end of 2025, over 600,000 DJI agricultural drones were already in use globally by more than 600,000 trained operators. The adoption of this technology has saved approximately 410 million tons of water—equivalent to the annual drinking water consumption of 740 million people—and cut carbon emissions by 51 million tons, equal to the annual carbon absorption capacity of 240 million trees.

"Agricultural drones are no longer a novelty – they are essential farm equipment worldwide. In Brazil, DJI Drones are now widely applied on the country's major crops, including coffee, soybeans, corn, sugarcane, and forage grass." said from Yuan Zhang, Head of Global Sales at DJI Agriculture, "As the global adoption continues to grow, DJI Agriculture will continue to strengthen our support network for operators while offering training through our global network of over 7,000 certified instructors. These investments underscore the company's commitment to helping farmers improve efficiency and sustainably increase their yields through innovative drone technology."

Drone Seeding and Spraying Elevates Pasture Production and Sustainability in Brazil
The report presents several case studies on the use cases of agricultural drones for various crops in different countries. In Brazil, farmers have deployed DJI Agras T25P, T70P, and T100 agricultural drones to cover full-cycle precision operations in forage management, lifting forage renewal efficiency and pasture productivity. For example, using drones to more precisely spot-spray weed patches can reduce herbicide use by up to 35%. Simultaneously, full-process drone spraying and seeding also offer environmental protection benefits by eliminating soil compaction, reducing chemical drift near sensitive ecosystems, and lowering the carbon footprint of livestock farming.

Improved Field Trials and Academic Studies Driving More Compliant Field Operations
The report also documents several new field trials and academic studies offering credible, evidence-based validation of the advantages of agricultural spraying drones in precision application, operational efficiency, economic benefits, and sustainability. Simultaneously, organizations such as UAPASTF developed guidelines for the safe and effective application of pesticides, informed by updated drone field-drift studies. Better drift testing enables more precise, safer, and more compliant field operations, making it a key enabler of precision agriculture and environmentally responsible crop protection.

These studies have helped drive more evidence-based policies and market developments worldwide, which is further fueling the rapid global expansion of the agricultural drone industry. For example, ANAC (the National Civil Aviation Agency of Brazil) updated its drone regulations to establish "standard scenarios" for recurring agricultural operations. In Canada, Transport Canada's regulatory amendments to the Canadian Aviation Regulations have simplified operational rules for agricultural drones, directly supporting spraying, mapping, monitoring, and precision farming.

As the industry continues to mature worldwide, DJI Agriculture aims to provide farmers and policymakers with a clearer view of how agricultural drones deliver measurable environmental value and open new pathways for global agricultural modernization.

Read the full 2025/2026 Agricultural Drone Industry Insight Report here.

About DJI Agriculture
DJI Agriculture was established by DJI in 2015 with the mission to bring innovative drone technology to farming, making it more sustainable, efficient, and safer. DJI began investing in research and development for the advancement of spray drones in 2012, before it created a dedicated business unit for agriculture drones. As the global leader of the drone industry, DJI is building a better world by continuously promoting human advancement through products that add value to lives around the world in more profound ways than ever before. Today, an estimated 600,000 agricultural drones are in use worldwide to treat more than 300 types of crops in more than 100 countries and regions.

** This press release is distributed by PR Newswire through automated distribution system, for which the client assumes full responsibility. **

DJI Agriculture Reveals Global Adoption of Agricultural Drones Cuts 51Mt in Carbon Emissions and Saves 410Mts of Water for Farmers Globally

DJI Agriculture Reveals Global Adoption of Agricultural Drones Cuts 51Mt in Carbon Emissions and Saves 410Mts of Water for Farmers Globally

Recommended Articles