The World's First Open-Source Medical Video LLM Released, Calling the Global Developer Community to Push It Further

2026-04-24 21:19 (last updated at 21:35)

SHANGHAI, April 24, 2026 /PRNewswire/ -- United Imaging Intelligence (UII) has unveiled uAI NEXUS MedVLM, a pioneering Medical Video Large Language Model that delivers unprecedented spatial and temporal precision in clinical environments.

UII is fully open-sourcing the model and introducing a new comprehensive benchmark for industry-wide evaluation. The research has been accepted by CVPR 2026, a leading computer vision conference, underscoring its recognition by the global computer vision community.

uAI NEXUS MedVLM is built on a monumental dataset comprising 531,850 video-instruction pairs across 8 clinical scenarios, including robotic surgery, laparoscopic procedures, endoscopy, open surgery, and nursing care.

With only 4B or 7B parameters, uAI NEXUS MedVLM significantly outperforms leading general-purpose foundation models, including GPT-5.4 and Gemini 3.1, across key medical video tasks. It achieves 89.4% accuracy in surgical safety assessment, compared with 1.8% for GPT-5.4 and 10.1% for Gemini 3.1. In spatio-temporal action localization, it delivers up to 14× higher mIoU (mean intersection over union) than GPT-5.4 and 4× higher than Gemini 3.1. For video report generation, it scores 4.2 out of 5, substantially surpassing GPT-5.4 (2.5) and Gemini 3.1 (2.4).

(Source: The above performance statistics are from the research paper: https://arxiv.org/abs/2512.06581)
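For readers unfamiliar with the localization metric, the following is a minimal sketch of how a mean intersection over union (mIoU) can be computed for temporal segments. It is an illustration only, not the evaluation code from the paper: it covers the temporal dimension alone, and the names `temporal_iou`, `mean_iou`, and `Segment` are hypothetical.

```python
from typing import Sequence, Tuple

# Hypothetical representation: a segment is (start_seconds, end_seconds).
Segment = Tuple[float, float]


def temporal_iou(pred: Segment, truth: Segment) -> float:
    """Intersection over union of two time intervals, in [0, 1]."""
    # Overlap is zero when the intervals do not intersect.
    inter = max(0.0, min(pred[1], truth[1]) - max(pred[0], truth[0]))
    union = (pred[1] - pred[0]) + (truth[1] - truth[0]) - inter
    return inter / union if union > 0 else 0.0


def mean_iou(preds: Sequence[Segment], truths: Sequence[Segment]) -> float:
    """Average IoU over paired predictions and ground-truth segments."""
    if len(preds) != len(truths):
        raise ValueError("preds and truths must have the same length")
    if not preds:
        return 0.0
    return sum(temporal_iou(p, t) for p, t in zip(preds, truths)) / len(preds)


if __name__ == "__main__":
    # Made-up segments, purely for illustration.
    predictions = [(0.0, 10.0), (0.0, 4.0)]
    ground_truth = [(5.0, 15.0), (0.0, 4.0)]
    print(f"mIoU = {mean_iou(predictions, ground_truth):.3f}")
```

Under this definition, a model whose predicted segments overlap the annotated ones more closely scores nearer to 1. The benchmark's actual metric may also account for spatial bounding boxes, which this sketch omits.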

Launching a Global Open Challenge to Accelerate Collaborative Innovation

To advance medical video LLM development, UII has launched a phased rollout of its MedVidBench dataset, beginning with the open-source release of 6,245 rigorous benchmark test samples. The test set spans eight diverse surgical datasets, and UII describes the initiative as a global first in both scale and clinical precision.

Developers can evaluate their models on a unified leaderboard, where submissions are automatically assessed against private ground truth. Results are reflected in a continuously updated global ranking, enabling transparent and comparable performance evaluation across models.

UII invites AI researchers, developers, and healthcare institutions worldwide to participate in this open challenge and help advance medical video intelligence through collaborative innovation.

Project Page: https://uii-ai.github.io/MedGRPO/ 

Advancing Intelligence Across the Full Spectrum of Medical Video Tasks

Medical video understanding has long been one of the hardest problems in artificial intelligence: it demands fine-grained spatial awareness, complex temporal reasoning, and uncompromising clinical accuracy. Historically, progress has been held back by the severe scarcity of clinical data and the prohibitive cost of expert annotation.

UII has addressed this bottleneck. By engineering a large-scale, frame-by-frame annotation framework across diverse clinical videos, the company has rigorously mapped critical attributes: instrument trajectories, spatial positioning, precise surgical actions, and crucial risk indicators. This data foundation equips uAI NEXUS MedVLM with a complete, robust clinical intelligence stack.

Built on this foundation, the model seamlessly integrates perception, reasoning, and decision-making. It delivers highly accurate spatio-temporal localization of instruments and automated procedural recognition, applying advanced reasoning to transform complex video sequences into structured clinical reports, regional descriptions, and rapid workflow summaries. Moving beyond passive observation, it elevates these insights into active decision-making that supports next-step prediction, surgical skill assessment, and comprehensive safety risk evaluation.

Translating AI Innovation into Real-World Clinical Impact

Built for clinical deployment, uAI NEXUS MedVLM enables more informed decision-making and data-driven quality control across surgical workflows, while reducing the learning curve for clinicians and improving training efficiency and consistency.

Looking ahead, uAI NEXUS MedVLM can serve as the core perceptual and cognitive engine for embodied AI operating in the physical world. Together, the model and embodied AI systems would form a closed loop of visual perception, cognitive reasoning, and physical execution, advancing toward a more automated, standardized, and intelligent healthcare ecosystem.

 

** This press release is distributed by PR Newswire through an automated distribution system, for which the client assumes full responsibility. **


WUHU, China, April 24, 2026 /PRNewswire/ -- Beijing Auto Show 2026 opened on April 24, bringing together global automotive brands and advanced technologies. As a key barometer of the industry, the show serves as a platform for innovation and exchange. Chery Group appeared with its full brand lineup, including CHERY, EXEED, iCAUR, OMODA & JAECOO, and LEPAS. A total of 15 models made their global debut. The event welcomed over 4,000 guests from more than 100 countries, setting a new record for Chery in both scale and international reach.

At the show, Chery presented new technologies and a forward-looking strategy. Drawing on its global brand ecosystem, core technology development, and new energy innovation, the company demonstrated capabilities that it says are reshaping industry competition and setting new benchmarks for value creation.

Deepening Global Layout: A Multi-Brand Portfolio Reshaping the Industry Landscape

With a five-brand portfolio developed through years of overseas experience, Chery has formed a multi-scenario ecosystem. Each brand targets specific segments, shifting from broad coverage to precise positioning. Key debuts included Tiggo V, ES GT, EX8, OMODA 4, right-hand-drive V27, and L6 BEV, covering family, premium, and youth mobility needs. The brands operate in synergy, sharing resources and technologies to drive Chery's shift from scale expansion to value upgrading.

Advanced Architecture Redefines the Upper Limit of Experience: Digital Chassis Breaks the Boundaries of Smart Mobility

Chery showcased next-generation architecture innovations. The Feiyu Digital Intelligent Chassis i integrates steer-by-wire and brake-by-wire, enabling a 90 km/h moose test speed and zero-radius tank turns. The GAIA All-Domain System combines amphibious mobility with satellite communication, redefining the vehicle as an all-scenario mobility terminal. The industry-first full-domain 48V system improves efficiency by 15% and meets ASIL D, the highest automotive safety integrity level defined in ISO 26262.

Through continuous breakthroughs, Chery is evolving from a technology follower into a standards setter in global smart mobility.

Full-Chain Energy Efficiency Management Enables Technology Accessibility: System Innovation Addressing Global Mobility Challenges

Chery introduced a full life-cycle energy efficiency system. The KunPeng High Efficiency Engine achieves 48.57% thermal efficiency and 4.0 kWh/L conversion. The DHT360 hybrid system delivers up to 360 kW, enabling full-speed electric driving. The Rhino Battery offers high safety and fast charging, reaching 80% in 12 minutes. The system supports a more efficient and sustainable mobility future.

Website: https://www.cheryinternational.com

 


Driving the Future of Mobility with Technology: Chery at Beijing Auto Show 2026
