LoGeR: DeepMind’s 3D Reconstruction That Scales to 10,000 Frames with Hybrid Memory

Google DeepMind’s LoGeR reconstructs 3D geometry from video over 10,000+ frames and kilometer-scale distances, reducing trajectory error by 74% on KITTI with no post-processing.
artificial-intelligence
Author

Kabui, Charles

Published

2026-03-19

Keywords

3d-reconstruction, geometric-reconstruction, google-deepmind, test-time-training, long-context-vision