-

Robbyant Open-Sources LingBot-World, a World Model for Millisecond-Level Real-Time Interaction

SHANGHAI--(BUSINESS WIRE)--Robbyant, an embodied AI company within Ant Group, today announced the open-source release of LingBot-World, a world model that achieves industry-leading performance in video quality, dynamic fidelity, long-term consistency, and interactivity. Designed for embodied intelligence, autonomous driving, and game development, LingBot-World offers a high-fidelity, highly dynamic, and real-time controllable “digital sandbox” for simulation and training.

Addressing the common challenge in video generation known as “long-term drift”, where prolonged generation often leads to object deformation, detail collapse, subject disappearance, or scene structure breakdown, LingBot-World leverages multi-stage training and parallelized acceleration to achieve up to nearly 10 minutes of continuous, stable, and lossless video generation. This capability supports complex, multi-step tasks requiring extended temporal coherence.

In terms of interactivity, LingBot-World delivers a generation throughput of approximately 16 FPS and maintains end-to-end interaction latency under one second. Users can control characters and camera perspectives in real time via keyboard or mouse, with immediate visual feedback to their inputs. Additionally, users can trigger environmental changes and world events through text commands—for example, adjusting weather conditions, altering visual styles, or initiating specific scenarios—all while preserving consistent spatial relationships within the scene.

LingBot-World also demonstrates strong zero-shot generalization. With just a single real-world image (e.g., an urban street view) or a game screenshot as input, LingBot-World can generate an interactive video stream without requiring additional scene-specific training or data collection, significantly lowering deployment and operational costs across diverse environments.

To address the scarcity of high-quality interactive data for world model training, LingBot-World adopts a hybrid data acquisition strategy. It combines large-scale, carefully curated web videos covering diverse real-world scenes, with game-engine synthetic data, including Unreal Engine (UE) pipelines. By extracting clean, UI-free frames directly from the rendering layer while simultaneously logging precise action commands and camera poses, the model receives accurately aligned training signals that capture how actions drive environmental changes.

LingBot-World excels in long-sequence consistency, real-time responsiveness, and modeling the causal relationship between actions and environmental dynamics. This enables it to “imagine” the physical world in a digital space, providing AI agents with a cost-effective, high-fidelity environment for trial-and-error learning. Its support for diverse scene variations, such as lighting conditions or object placements, further boosts the real-world generalization of embodied AI algorithms.

Zhu Xing, CEO of Robbyant, said, “The release of LingBot-World is the third AI model in the LingBot series dedicated to embodied intelligence. This is an important extension of Ant Group’s artificial general intelligence (AGI) strategy from the digital realm to physical perception, and underscores our full-stack roadmap spanning foundational models, general-purpose applications, and physical-world interaction.”

During Robbyant’s “Evolution of Embodied AI Week” initiative, the company has already unveiled LingBot-Depth, a high-precision spatial perception model, and LingBot-VLA, a vision-language-action model designed to serve as a “universal brain” for real-world robotics.

To learn more about LingBot-World, please visit:

About Robbyant

Robbyant is an embodied intelligence company within Ant Group, dedicated to advancing embodied intelligence through cutting-edge software and hardware technologies. Robbyant independently develops foundational large models for embodied AI and actively explores next-generation intelligent devices, aiming to create robotic companions and caregivers that truly understand and enhance people’s everyday lives and deliver reliable intelligent services across key use cases, such as elderly care, medical assistance, and household tasks.

To learn more about Robbyant, please visit: www.robbyant.com

Contacts

Media Inquiries
Vick Li Wei
Ant Group
vick.lw@antgroup.com

Ant Group


Release Versions

Contacts

Media Inquiries
Vick Li Wei
Ant Group
vick.lw@antgroup.com

Social Media Profiles
More News From Ant Group

Singapore Tourism Board and Ant International Deepen Partnership to Accelerate Tourism Growth Through Travel Innovation

SINGAPORE--(BUSINESS WIRE)--The Singapore Tourism Board (STB) and Ant International, renewed their multi-year strategic partnership to deepen tourism-led economic impact by strengthening Singapore’s position as a world-class destination and delivering seamless digital experiences for global travellers through Alipay+, Ant International’s unified wallet gateway. Building on the partnership which began in 2018, STB and Ant International will: Amplify Singapore’s destination appeal amongst key mar...

Ant Group’s Alipay AI Pay and AI Health App AQ Each Surpass 100 Million Users During CNY as AI Adoption Accelerates in China

HANGZHOU, China--(BUSINESS WIRE)--As AI adoption gained momentum during the 2026 Chinese New Year, Ant Group announced today that both Alipay AI Pay and its AI health app AQ have each surpassed the 100 million user milestone. AI Payment Adoption Accelerates amid CNY AI Shopping Boom From ordering bubble tea and coffee to buying movie tickets, Chinese consumers embraced AI-powered services in everyday scenarios during this year’s holiday, driving a surge in Alipay AI Pay usage. Alipay AI Pay has...

Ant Group Releases Ling-2.5-1T and Ring-2.5-1T, Evolving Its Open-Source AI Model Family

HANGZHOU, China--(BUSINESS WIRE)--Ant Group today announced the release of Ling-2.5-1T, its newest trillion-parameter large language model, and Ring-2.5-1T, the world’s first hybrid linear-architecture thinking model. Both models represent the latest evolution of the Ling 2.0 series unveiled in October 2025, and are now available under open licenses on Hugging Face and ModelScope. Ling-2.5-1T is the latest flagship in Ant Group’s Ling model series. It is designed to deliver higher reasoning eff...
Back to Newsroom