Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jiangye Yuan

Building Extraction at Scale using Convolutional Neural Network: Mapping of the United States

May 23, 2018

Hsiuhan Lexie Yang, Jiangye Yuan, Dalton Lunga, Melanie Laverdiere, Amy Rose, Budhendra Bhaduri

Figure 1 for Building Extraction at Scale using Convolutional Neural Network: Mapping of the United States

Figure 2 for Building Extraction at Scale using Convolutional Neural Network: Mapping of the United States

Figure 3 for Building Extraction at Scale using Convolutional Neural Network: Mapping of the United States

Figure 4 for Building Extraction at Scale using Convolutional Neural Network: Mapping of the United States

Abstract:Establishing up-to-date large scale building maps is essential to understand urban dynamics, such as estimating population, urban planning and many other applications. Although many computer vision tasks has been successfully carried out with deep convolutional neural networks, there is a growing need to understand their large scale impact on building mapping with remote sensing imagery. Taking advantage of the scalability of CNNs and using only few areas with the abundance of building footprints, for the first time we conduct a comparative analysis of four state-of-the-art CNNs for extracting building footprints across the entire continental United States. The four CNN architectures namely: branch-out CNN, fully convolutional neural network (FCN), conditional random field as recurrent neural network (CRFasRNN), and SegNet, support semantic pixel-wise labeling and focus on capturing textural information at multi-scale. We use 1-meter resolution aerial images from National Agriculture Imagery Program (NAIP) as the test-bed, and compare the extraction results across the four methods. In addition, we propose to combine signed-distance labels with SegNet, the preferred CNN architecture identified by our extensive evaluations, to advance building extraction results to instance level. We further demonstrate the usefulness of fusing additional near IR information into the building extraction framework. Large scale experimental evaluations are conducted and reported using metrics that include: precision, recall rate, intersection over union, and the number of buildings extracted. With the improved CNN model and no requirement of further post-processing, we have generated building maps for the United States. The quality of extracted buildings and processing time demonstrated the proposed CNN-based framework fits the need of building extraction at scale.

* Accepted by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

Via

Access Paper or Ask Questions

Combining Maps and Street Level Images for Building Height and Facade Estimation

Dec 13, 2016

Jiangye Yuan, Anil M. Cheriyadat

Figure 1 for Combining Maps and Street Level Images for Building Height and Facade Estimation

Figure 2 for Combining Maps and Street Level Images for Building Height and Facade Estimation

Figure 3 for Combining Maps and Street Level Images for Building Height and Facade Estimation

Figure 4 for Combining Maps and Street Level Images for Building Height and Facade Estimation

Abstract:We propose a method that integrates two widely available data sources, building footprints from 2D maps and street level images, to derive valuable information that is generally difficult to acquire -- building heights and building facade masks in images. Building footprints are elevated in world coordinates and projected onto images. Building heights are estimated by scoring projected footprints based on their alignment with building features in images. Building footprints with estimated heights can be converted to simple 3D building models, which are projected back to images to identify buildings. In this procedure, accurate camera projections are critical. However, camera position errors inherited from external sensors commonly exist, which adversely affect results. We derive a solution to precisely locate cameras on maps using correspondence between image features and building footprints. Experiments on real-world datasets show the promise of our method.

* UrbanGIS '16 Proceedings of the 2nd ACM SIGSPATIAL Workshop on Smart Cities and Urban Analytics

Via

Access Paper or Ask Questions

Automatic Building Extraction in Aerial Scenes Using Convolutional Networks

Feb 21, 2016

Jiangye Yuan

Figure 1 for Automatic Building Extraction in Aerial Scenes Using Convolutional Networks

Figure 2 for Automatic Building Extraction in Aerial Scenes Using Convolutional Networks

Figure 3 for Automatic Building Extraction in Aerial Scenes Using Convolutional Networks

Figure 4 for Automatic Building Extraction in Aerial Scenes Using Convolutional Networks

Abstract:Automatic building extraction from aerial and satellite imagery is highly challenging due to extremely large variations of building appearances. To attack this problem, we design a convolutional network with a final stage that integrates activations from multiple preceding stages for pixel-wise prediction, and introduce the signed distance function of building boundaries as the output representation, which has an enhanced representation power. We leverage abundant building footprint data available from geographic information systems (GIS) to compile training data. The trained network achieves superior performance on datasets that are significantly larger and more complex than those used in prior work, demonstrating that the proposed method provides a promising and scalable solution for automating this labor-intensive task.

Via

Access Paper or Ask Questions