Ant Group's Robbyant Unveils LingBot-Map: A Streaming 3D Reconstruction Model for Real-Time Spatial Understanding

LingBot-Map comprehensively outperforms existing methods across multiple major international benchmarks

SHANGHAI--(BUSINESS WIRE)--Robbyant, the embodied AI company within Ant Group, today announced the open-sourcing of LingBot-Map, a new streaming 3D reconstruction model. This innovative technology empowers robots, autonomous vehicles, and AR devices to perceive and understand their three-dimensional surroundings in real-time using only a standard RGB camera.

Unlike traditional 3D reconstruction methods that process a complete set of images offline, LingBot-Map operates on a "see-as-you-go" principle. It continuously estimates the camera's position and reconstructs the scene's 3D structure frame-by-frame as video is captured.

LingBot-Map sets a new benchmark for accuracy in the field. On the Oxford Spires dataset, known for its large scale and challenging lighting conditions, the model achieved an Absolute Trajectory Error (ATE) of just 6.42 meters. This represents a remarkable near 2.8x improvement in trajectory accuracy over the previous best streaming method and significantly outperforms offline methods like DA3 (12.87 meters) and VIPE (10.52 meters).

The model's superiority extends to other major benchmarks, including ETH3D, 7-Scenes, and Tanks and Temples, where it leads in both pose estimation and 3D reconstruction quality. On the ETH3D benchmark, LingBot-Map achieved a reconstruction F1 score of 98.98, more than 21 percentage points higher than the second-place method.

Beyond precision, LingBot-Map also achieves both real-time performance and long-term stability. The model achieves an inference speed of approximately 20 FPS and supports continuous inference on long video sequences exceeding 10,000 frames with almost unchanged accuracy. This capability is fundamental for applications requiring continuous, online spatial awareness, such as robot navigation, obstacle avoidance, and complex object manipulation.

The core challenge in streaming 3D reconstruction lies in balancing geometric accuracy, temporal consistency, and computational efficiency. LingBot-Map addresses this through a novel pure auto-regressive modeling approach built on a Geometric Context Transformer.

The model's key innovation is its Geometric Context Attention (GCA) mechanism, which efficiently organizes and utilizes geometric information across frames, allowing the model to retain crucial historical context while minimizing redundant computation. Inspired by the hierarchical information management of classic SLAM systems, LingBot-Map's architecture effectively leverages a unified model to handle tasks that traditionally require complex, hand-crafted design and optimization.

The launch of LingBot-Map marks a new step in Robbyant's mission to build a comprehensive intelligent foundation for embodied AI. It follows the recent open-sourcing of several other major models:

LingBot-Depth: A high-precision spatial perception model.
LingBot-VLA: A general-purpose Vision-Language-Action model.
LingBot-World: A world model for environmental simulation.
LingBot-VA: An auto-regressive video-action model for robot control.

With LingBot-Map, Robbyant has further strengthened its technology stack, providing a robust solution for real-time spatial understanding and online 3D mapping.

To learn more about LingBot-Map, please visit:

Code and demo: https://github.com/Robbyant/lingbot-map
Tech report: https://arxiv.org/abs/2604.14141
Hugging Face: https://huggingface.co/robbyant/lingbot-map

About Robbyant

Robbyant is an embodied intelligence company within Ant Group, dedicated to advancing embodied intelligence through cutting-edge software and hardware technologies. Robbyant independently develops foundational large models for embodied AI and actively explores next-generation intelligent devices, aiming to create robotic companions and caregivers that truly understand and enhance people’s everyday lives and deliver reliable intelligent services across key use cases, such as elderly care, medical assistance, and household tasks.

To learn more about Robbyant, please visit: www.robbyant.com

Contacts

Media Inquiries
Vick Li Wei
Ant Group
vick.lw@antgroup.com

Industry:

More News From Ant Group

Ant Group Open-Sources SingGuard-NSFA to Establish New Security Paradigms for Autonomous AI Agents

HANGZHOU, China--(BUSINESS WIRE)--Ant Group’s AI Security Lab today announced the open-source release of SingGuard-NSFA, a specialized security guardrail framework designed specifically for autonomous AI agents. The framework secures agentic AI systems against operational threats like prompt injection, addressing critical vulnerabilities as AI transitions from passive content generation to active, autonomous execution. As AI agents rapidly move from research labs to business scenarios, the secu...

Robbyant Launches LingBot-VA 2.0 Built Natively for Embodied AI and Physical World Control

SHANGHAI--(BUSINESS WIRE)--Robbyant, an embodied AI company within Ant Group, today announced the release of LingBot-VA 2.0, the industry’s first embodied-native video-action world model. This release marks a key transition in robotics foundation models, shifting from repurposing digital world models to designing them natively for the physical world. Instead of relying on fine-tuned digital content generation models, LingBot-VA 2.0 is built from scratch to meet the original demands of dynamic m...

Robbyant Unveils LingBot-World 2.0: Pioneering Hour-Long Real-Time Generation in World Models

SHANGHAI--(BUSINESS WIRE)--Robbyant, an embodied AI company within Ant Group, today announced the open-source release of LingBot-World 2.0 (Infinity). This interactive world model significantly upgraded its world prediction and interactivity capabilities, supporting hour-long continuous generation, 720p/60fps high-definition real-time output, and richer interactive actions. LingBot-World 2.0 also integrates a native agent mechanism, evolving generated worlds from merely watchable and controllab...

Back to Newsroom

Services & Solutions

Services

Solutions For

Resources

Education

Why Business Wire

Ant Group's Robbyant Unveils LingBot-Map: A Streaming 3D Reconstruction Model for Real-Time Spatial Understanding

Contacts

Ant Group

Contacts

Ant Group Open-Sources SingGuard-NSFA to Establish New Security Paradigms for Autonomous AI Agents

Robbyant Launches LingBot-VA 2.0 Built Natively for Embodied AI and Physical World Control

Robbyant Unveils LingBot-World 2.0: Pioneering Hour-Long Real-Time Generation in World Models

Ant Group

Contacts