Title: Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats

Oct 16, 2024

Chen Ziwen, Hao Tan, Kai Zhang, Sai Bi, Fujun Luan, Yicong Hong, Li Fuxin, Zexiang Xu

Abstract: We propose Long-LRM, a generalizable 3D Gaussian reconstruction model capable of reconstructing a large scene from a long sequence of input images. Specifically, our model can process 32 source images at 960x540 resolution in only 1.3 seconds on a single A100 80G GPU. Our architecture combines recent Mamba2 blocks with classical transformer blocks, which allows many more tokens to be processed than in prior work, and adds efficient token merging and Gaussian pruning steps that balance quality and efficiency. Unlike previous feed-forward models, which are limited to 1 to 4 input images and can reconstruct only a small portion of a large scene, Long-LRM reconstructs the entire scene in a single feed-forward step. On large-scale scene datasets such as DL3DV-140 and Tanks and Temples, our method achieves performance comparable to optimization-based approaches while being two orders of magnitude more efficient. Project page: https://arthurhero.github.io/projects/llrm
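
To give a feel for the scale the abstract describes, here is a minimal back-of-the-envelope sketch in Python. It is not the authors' code. The patch size of 8, the merge factor of 4, the keep ratio, and all function names are assumptions chosen for illustration; only the 32 images and the 960x540 resolution come from the abstract. The pruning helper ranks Gaussians by opacity, which is one plausible criterion and not necessarily the one the paper uses.

```python
import math


def token_count(num_images, width, height, patch_size):
    """Patch tokens for a batch of images.

    Assumes each image is padded up to a multiple of patch_size
    (hypothetical choice; the abstract does not state the patch size).
    """
    per_image = math.ceil(width / patch_size) * math.ceil(height / patch_size)
    return num_images * per_image


def merged_token_count(tokens, merge_factor):
    """Tokens left after merging every `merge_factor` tokens into one."""
    return math.ceil(tokens / merge_factor)


def prune_by_opacity(opacities, keep_ratio):
    """Indices of the highest-opacity Gaussians, returned in original order.

    Opacity ranking is an illustrative criterion, not the paper's stated rule.
    """
    keep = max(1, int(len(opacities) * keep_ratio))
    ranked = sorted(range(len(opacities)), key=lambda i: opacities[i], reverse=True)
    return sorted(ranked[:keep])


if __name__ == "__main__":
    tokens = token_count(num_images=32, width=960, height=540, patch_size=8)
    print("tokens before merging:", tokens)
    print("tokens after 4x merging:", merged_token_count(tokens, 4))
    print("pixels across all inputs:", 32 * 960 * 540)
    print("kept Gaussians:", prune_by_opacity([0.9, 0.1, 0.5, 0.7], keep_ratio=0.5))
```

Under these assumed settings the sequence is a few hundred thousand tokens long before merging, which illustrates why sub-quadratic sequence blocks and token merging matter at this input size.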
