Ant Group's Robbyant Unveils LingBot-Map: A Streaming 3D Reconstruction Model for Real-Time Spatial Understanding

SHANGHAI--(BUSINESS WIRE)--Robbyant, the embodied AI company within Ant Group, today announced the open-sourcing of LingBot-Map, a new streaming 3D reconstruction model. The model lets robots, autonomous vehicles, and AR devices perceive and understand their three-dimensional surroundings in real time using only a standard RGB camera.

Unlike traditional 3D reconstruction methods that process a complete set of images offline, LingBot-Map operates on a "see-as-you-go" principle. It continuously estimates the camera's position and reconstructs the scene's 3D structure frame-by-frame as video is captured.

LingBot-Map sets a new benchmark for accuracy in the field. On the Oxford Spires dataset, known for its large scale and challenging lighting conditions, the model achieved an Absolute Trajectory Error (ATE, where lower is better) of 6.42 meters. That is a nearly 2.8x improvement in trajectory accuracy over the previous best streaming method, and it also beats offline methods such as DA3 (12.87 meters) and VIPE (10.52 meters).
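For readers unfamiliar with the metric, ATE is commonly reported as the root-mean-square distance between estimated and ground-truth camera positions. The sketch below is illustrative only and is not taken from the LingBot-Map code: the function name `ate_rmse` is hypothetical, and it assumes the two trajectories are already time-synchronized and aligned to a common frame (published evaluations typically run a rigid or similarity alignment first, which is omitted here).

```python
import math

def ate_rmse(estimated, ground_truth):
    """RMSE of per-frame position error, in the trajectories' own units.

    Assumption: both trajectories are already aligned and have one
    (x, y, z) position per frame, in the same order.
    """
    if not estimated or len(estimated) != len(ground_truth):
        raise ValueError("trajectories must be non-empty and equally long")
    squared_errors = [
        sum((e - g) ** 2 for e, g in zip(est_pos, gt_pos))
        for est_pos, gt_pos in zip(estimated, ground_truth)
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

# Toy example: the second pose is off by 2 m along z.
estimated = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
ground_truth = [(0.0, 0.0, 0.0), (1.0, 0.0, 2.0)]
print(ate_rmse(estimated, ground_truth))
```

Because the errors are squared before averaging, a few badly drifting frames raise the score more than many small deviations do.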

The model also leads on other major benchmarks, including ETH3D, 7-Scenes, and Tanks and Temples, in both pose estimation and 3D reconstruction quality. On the ETH3D benchmark, LingBot-Map achieved a reconstruction F1 score of 98.98, more than 21 percentage points higher than the second-place method.
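A reconstruction F1 score of this kind is usually the harmonic mean of precision (the share of reconstructed points that lie close to the reference surface) and recall (the share of reference points that the reconstruction covers). The following is a minimal sketch under stated assumptions: the function name `reconstruction_f1` and the distance threshold `tau` are hypothetical, the release does not specify the evaluation protocol used on ETH3D, and the brute-force nearest-neighbor search is only suitable for tiny point sets.

```python
import math

def reconstruction_f1(reconstructed, reference, tau):
    """F1 between two point sets given a distance threshold tau.

    Assumption: points are (x, y, z) tuples in the same coordinate frame.
    """
    def fraction_within(source, target):
        # Share of source points whose nearest target point is within tau.
        hits = sum(
            1 for p in source
            if min(math.dist(p, q) for q in target) <= tau
        )
        return hits / len(source)

    precision = fraction_within(reconstructed, reference)
    recall = fraction_within(reference, reconstructed)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Toy example: one of three reconstructed points is an outlier.
reconstructed = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (5.0, 5.0, 5.0)]
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
print(100 * reconstruction_f1(reconstructed, reference, tau=0.1))
```

In the toy example precision is 2/3 and recall is 1, so a single outlier is enough to pull the score well below 100.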

Beyond precision, LingBot-Map delivers both real-time performance and long-term stability. The model runs inference at approximately 20 FPS and supports continuous inference on long video sequences exceeding 10,000 frames with almost no loss of accuracy. This capability is fundamental for applications requiring continuous, online spatial awareness, such as robot navigation, obstacle avoidance, and complex object manipulation.
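To put these figures in perspective, the short calculation below converts them into a per-frame time budget and a sequence duration. It is a back-of-the-envelope sketch: the helper names are hypothetical, and it assumes frames arrive at the same rate at which they are processed.

```python
def frame_budget_ms(fps):
    """Time available to process one frame, in milliseconds."""
    return 1000.0 / fps

def sequence_duration_s(num_frames, fps):
    """Wall-clock length of a sequence processed at a steady rate."""
    return num_frames / fps

print(frame_budget_ms(20))              # 50.0 ms per frame
print(sequence_duration_s(10_000, 20))  # 500.0 s, a little over 8 minutes
```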

The core challenge in streaming 3D reconstruction lies in balancing geometric accuracy, temporal consistency, and computational efficiency. LingBot-Map addresses this through a novel pure auto-regressive modeling approach built on a Geometric Context Transformer.

The model's key innovation is its Geometric Context Attention (GCA) mechanism, which organizes and reuses geometric information across frames so that the model retains crucial historical context while minimizing redundant computation. Inspired by the hierarchical information management of classic SLAM systems, LingBot-Map's architecture uses a single unified model to handle tasks that traditionally require complex, hand-crafted design and optimization.
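The release does not describe how GCA selects or stores past frames, so the following is not the GCA mechanism. It is a generic, hypothetical illustration of the underlying trade-off: a streaming model can keep its per-frame cost flat by attending to a bounded set of past frames, here a fixed anchor plus a sliding window of recent frames. The class name `BoundedContext` and its parameters are invented for this sketch.

```python
from collections import deque

class BoundedContext:
    """Hypothetical frame memory: a few anchor frames plus a recent window."""

    def __init__(self, num_anchors=1, window=4):
        self.num_anchors = num_anchors
        self.anchors = []                   # earliest frames, kept permanently
        self.recent = deque(maxlen=window)  # older entries drop off automatically

    def add(self, frame_id):
        if len(self.anchors) < self.num_anchors:
            self.anchors.append(frame_id)
        else:
            self.recent.append(frame_id)

    def context(self):
        """Frames a new frame would be allowed to attend to."""
        return self.anchors + list(self.recent)

ctx = BoundedContext(num_anchors=1, window=4)
for frame_id in range(10_000):
    ctx.add(frame_id)

# The context stays at five entries no matter how long the video runs.
print(ctx.context())
```

With a bounded context, memory and compute per frame do not grow with sequence length, which is one common way to make inference over sequences of 10,000 or more frames practical.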

The launch of LingBot-Map marks a new step in Robbyant's mission to build a comprehensive intelligent foundation for embodied AI. It follows the recent open-sourcing of several other major models:

  • LingBot-Depth: A high-precision spatial perception model.
  • LingBot-VLA: A general-purpose Vision-Language-Action model.
  • LingBot-World: A world model for environmental simulation.
  • LingBot-VA: An auto-regressive video-action model for robot control.

With LingBot-Map, Robbyant has further strengthened its technology stack, providing a robust solution for real-time spatial understanding and online 3D mapping.

To learn more about LingBot-Map, please visit:

Code and demo: https://github.com/Robbyant/lingbot-map
Tech report: https://arxiv.org/abs/2604.14141
Hugging Face: https://huggingface.co/robbyant/lingbot-map

About Robbyant

Robbyant is an embodied intelligence company within Ant Group, dedicated to advancing embodied intelligence through cutting-edge software and hardware technologies. Robbyant independently develops foundational large models for embodied AI and actively explores next-generation intelligent devices. It aims to create robotic companions and caregivers that truly understand and enhance people’s everyday lives and that deliver reliable intelligent services across key use cases such as elderly care, medical assistance, and household tasks.

To learn more about Robbyant, please visit: www.robbyant.com

Contacts

Media Inquiries
Vick Li Wei
Ant Group
vick.lw@antgroup.com
