Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yunchi Zhang

FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

Sep 19, 2025

Han Ye, Haofu Wang, Yunchi Zhang, Jiangjian Xiao, Yuqiang Jin, Jinyuan Liu, Wen-An Zhang, Uladzislau Sychou, Alexander Tuzikov, Vladislav Sobolevskii(+3 more)

Figure 1 for FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

Figure 2 for FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

Figure 3 for FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

Figure 4 for FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

Abstract:Reconstructing building floor plans from point cloud data is key for indoor navigation, BIM, and precise measurements. Traditional methods like geometric algorithms and Mask R-CNN-based deep learning often face issues with noise, limited generalization, and loss of geometric details. We propose FloorSAM, a framework that integrates point cloud density maps with the Segment Anything Model (SAM) for accurate floor plan reconstruction from LiDAR data. Using grid-based filtering, adaptive resolution projection, and image enhancement, we create robust top-down density maps. FloorSAM uses SAM's zero-shot learning for precise room segmentation, improving reconstruction across diverse layouts. Room masks are generated via adaptive prompt points and multistage filtering, followed by joint mask and point cloud analysis for contour extraction and regularization. This produces accurate floor plans and recovers room topological relationships. Tests on Giblayout and ISPRS datasets show better accuracy, recall, and robustness than traditional methods, especially in noisy and complex settings. Code and materials: github.com/Silentbarber/FloorSAM.

* 12 pages, 15 figures,

Via

Access Paper or Ask Questions

MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction

Aug 10, 2023

Bencheng Liao, Shaoyu Chen, Yunchi Zhang, Bo Jiang, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang

Figure 1 for MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction

Figure 2 for MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction

Figure 3 for MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction

Figure 4 for MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction

Abstract:High-definition (HD) map provides abundant and precise static environmental information of the driving scene, serving as a fundamental and indispensable component for planning in autonomous driving system. In this paper, we present \textbf{Map} \textbf{TR}ansformer, an end-to-end framework for online vectorized HD map construction. We propose a unified permutation-equivalent modeling approach, \ie, modeling map element as a point set with a group of equivalent permutations, which accurately describes the shape of map element and stabilizes the learning process. We design a hierarchical query embedding scheme to flexibly encode structured map information and perform hierarchical bipartite matching for map element learning. To speed up convergence, we further introduce auxiliary one-to-many matching and dense supervision. The proposed method well copes with various map elements with arbitrary shapes. It runs at real-time inference speed and achieves state-of-the-art performance on both nuScenes and Argoverse2 datasets. Abundant qualitative results show stable and robust map construction quality in complex and various driving scenes. Code and more demos are available at \url{https://github.com/hustvl/MapTR} for facilitating further studies and applications.

* Code available at https://github.com/hustvl/MapTR . arXiv admin note: substantial text overlap with arXiv:2208.14437

Via

Access Paper or Ask Questions

VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene

Apr 19, 2023

Shaoyu Chen, Yunchi Zhang, Bencheng Liao, Jiafeng Xie, Tianheng Cheng, Wei Sui, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang

Figure 1 for VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene

Figure 2 for VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene

Figure 3 for VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene

Figure 4 for VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene

Abstract:High-definition (HD) map serves as the essential infrastructure of autonomous driving. In this work, we build up a systematic vectorized map annotation framework (termed VMA) for efficiently generating HD map of large-scale driving scene. We design a divide-and-conquer annotation scheme to solve the spatial extensibility problem of HD map generation, and abstract map elements with a variety of geometric patterns as unified point sequence representation, which can be extended to most map elements in the driving scene. VMA is highly efficient and extensible, requiring negligible human effort, and flexible in terms of spatial scale and element type. We quantitatively and qualitatively validate the annotation performance on real-world urban and highway scenes, as well as NYC Planimetric Database. VMA can significantly improve map generation efficiency and require little human effort. On average VMA takes 160min for annotating a scene with a range of hundreds of meters, and reduces 52.3% of the human cost, showing great application value.

Via

Access Paper or Ask Questions