Abstract:This work addresses the task of overhead image segmentation when auxiliary ground-level images are available. Recent work has shown that performing joint inference over these two modalities, often called near/remote sensing, can yield significant accuracy improvements. Extending this line of work, we introduce the concept of geospatial attention, a geometry-aware attention mechanism that explicitly considers the geospatial relationship between the pixels in a ground-level image and a geographic location. We propose an approach for computing geospatial attention that incorporates geometric features and the appearance of the overhead and ground-level imagery. We introduce a novel architecture for near/remote sensing that is based on geospatial attention and demonstrate its use for five segmentation tasks. The results demonstrate that our method significantly outperforms the previous state-of-the-art methods.
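A rough illustration of the geospatial attention idea described above, as a minimal sketch: a small module scores each ground-level pixel from its appearance features together with geometric features relating it to a query geographic location, then returns an attention-weighted feature. All module names, feature dimensions, and the score network are illustrative assumptions, not the paper's exact formulation.

```python
# Hypothetical geometry-aware attention sketch (assumed shapes and layers).
import torch
import torch.nn as nn

class GeospatialAttention(nn.Module):
    def __init__(self, appearance_dim=64, geometry_dim=4, hidden_dim=64):
        super().__init__()
        # scores each pixel from appearance + geometric features
        self.score = nn.Sequential(
            nn.Linear(appearance_dim + geometry_dim, hidden_dim),
            nn.ReLU(inplace=True),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, ground_feats, geom_feats):
        # ground_feats: [B, N, C] per-pixel appearance features (flattened H*W)
        # geom_feats:   [B, N, G] per-pixel geometry w.r.t. a query geographic location
        scores = self.score(torch.cat([ground_feats, geom_feats], dim=-1))  # [B, N, 1]
        weights = torch.softmax(scores, dim=1)                              # attention over pixels
        return (weights * ground_feats).sum(dim=1)                          # [B, C] pooled feature

# Toy usage: one query location per example, 32x16 ground-level feature map.
attn = GeospatialAttention()
pooled = attn(torch.randn(2, 32 * 16, 64), torch.randn(2, 32 * 16, 4))
print(pooled.shape)  # torch.Size([2, 64])
```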
Abstract:Modern cameras are equipped with a wide array of sensors that enable recording the geospatial context of an image. Taking advantage of this, we explore depth estimation under the assumption that the camera is geocalibrated, a problem we refer to as geo-enabled depth estimation. Our key insight is that if the capture location is known, the corresponding overhead viewpoint offers a valuable resource for understanding the scale of the scene. We propose an end-to-end architecture for depth estimation that uses geospatial context to infer a synthetic ground-level depth map from a co-located overhead image, then fuses it within an encoder/decoder-style segmentation network. To support evaluation of our methods, we extend a recently released dataset with overhead imagery and corresponding height maps. Results demonstrate that integrating geospatial context significantly reduces error compared to baselines, both at close ranges and when evaluating at much larger distances than existing benchmarks consider.
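One simple way to picture the fusion step above, as a hedged sketch: the synthetic ground-level depth map predicted from the overhead image is concatenated with the RGB input and passed through an encoder/decoder network. The layer sizes and the concatenation point are assumptions for illustration, not the paper's exact architecture.

```python
# Illustrative fusion of a synthetic depth prior into an encoder/decoder network
# (hypothetical architecture; actual fusion location/design may differ).
import torch
import torch.nn as nn

class GeoDepthNet(nn.Module):
    def __init__(self):
        super().__init__()
        # encoder over the ground-level RGB image plus the synthetic depth prior
        self.encoder = nn.Sequential(
            nn.Conv2d(3 + 1, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        # decoder back to a per-pixel depth estimate
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1),
        )

    def forward(self, image, synthetic_depth):
        x = torch.cat([image, synthetic_depth], dim=1)  # fuse the geospatial prior
        return self.decoder(self.encoder(x))

net = GeoDepthNet()
rgb = torch.randn(1, 3, 128, 128)
prior = torch.randn(1, 1, 128, 128)   # depth rendered from the co-located overhead view
print(net(rgb, prior).shape)          # torch.Size([1, 1, 128, 128])
```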
Abstract:Estimating camera pose from a single image is a fundamental problem in computer vision. Existing methods for solving this task fall into two distinct categories, which we refer to as direct and indirect. Direct methods, such as PoseNet, regress pose from the image as a fixed function, for example using a feed-forward convolutional network. Such methods are desirable because they are deterministic and run in constant time. Indirect methods for pose regression are often non-deterministic, with various external dependencies such as image retrieval and hypothesis sampling. We propose a direct method that takes inspiration from structure-based approaches to incorporate explicit 3D constraints into the network. Our approach maintains the desirable qualities of other direct methods while achieving much lower error in general.
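To make the contrast concrete, here is a hedged sketch of a direct pose regressor with an explicit 3D constraint: a feed-forward network regresses translation and a quaternion in a single pass, and a loss compares known 3D scene points transformed by the predicted pose against the same points under the ground-truth pose. The backbone, pose parameterization, and loss form are assumptions, not the paper's exact method.

```python
# Hypothetical direct pose regression with a 3D point-alignment constraint.
import torch
import torch.nn as nn

class DirectPoseNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.fc_t = nn.Linear(32, 3)   # camera translation
        self.fc_q = nn.Linear(32, 4)   # rotation as a quaternion (normalized below)

    def forward(self, image):
        feat = self.backbone(image)
        return self.fc_t(feat), self.fc_q(feat)

def quat_to_rotmat(q):
    q = q / q.norm(dim=-1, keepdim=True)
    w, x, y, z = q.unbind(-1)
    return torch.stack([
        1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y),
        2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x),
        2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y),
    ], dim=-1).reshape(*q.shape[:-1], 3, 3)

def transform(points, q, t):
    return points @ quat_to_rotmat(q).transpose(-1, -2) + t.unsqueeze(1)

net = DirectPoseNet()
t_pred, q_pred = net(torch.randn(2, 3, 64, 64))
points = torch.randn(2, 100, 3)                     # known 3D scene structure
q_gt = torch.tensor([[1.0, 0, 0, 0]]).expand(2, 4)  # identity rotation, zero translation
loss = (transform(points, q_pred, t_pred) -
        transform(points, q_gt, torch.zeros(2, 3))).norm(dim=-1).mean()
print(loss.item())
```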
Abstract:We propose applying a 2D CNN architecture to Alzheimer's disease classification from 3D MRI volumes. Training a 3D convolutional neural network (CNN) is time-consuming and computationally expensive. We make use of approximate rank pooling to transform the 3D MRI volume into a 2D image that serves as input to a 2D CNN. We show our proposed CNN model achieves $9.5\%$ better Alzheimer's disease classification accuracy than the baseline 3D models. We also show that our method allows for efficient training, requiring only 20% of the training time compared to 3D CNN models. The code is available online: https://github.com/UkyVision/alzheimer-project.
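For readers unfamiliar with the pooling step above, the following sketch collapses an ordered stack of MRI slices into a single 2D image using the common closed-form approximation of rank pooling with weights alpha_t = 2t - T - 1 (as in dynamic-image work); the exact variant and normalization used by the paper may differ.

```python
# Hedged sketch of approximate rank pooling over MRI slices.
import numpy as np

def approximate_rank_pooling(volume):
    """volume: array of shape [T, H, W]; returns a single [H, W] image."""
    T = volume.shape[0]
    t = np.arange(1, T + 1, dtype=np.float32)
    alpha = 2.0 * t - T - 1.0                                      # per-slice weights
    pooled = np.tensordot(alpha, volume.astype(np.float32), axes=(0, 0))
    # rescale to [0, 255] so the result can be fed to an ImageNet-style 2D CNN
    pooled -= pooled.min()
    if pooled.max() > 0:
        pooled *= 255.0 / pooled.max()
    return pooled

mri = np.random.rand(96, 128, 128)   # toy 3D MRI volume: 96 axial slices
image_2d = approximate_rank_pooling(mri)
print(image_2d.shape)                # (128, 128)
```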
Abstract:Artifacts in imagery captured by remote sensing, such as clouds, snow, and shadows, present challenges for various tasks, including semantic segmentation and object detection. A primary challenge in developing algorithms for identifying such artifacts is the cost of collecting annotated training data. In this work, we explore how recent advances in multi-image fusion can be leveraged to bootstrap single image cloud detection. We demonstrate that a network optimized to estimate image quality also implicitly learns to detect clouds. To support the training and evaluation of our approach, we collect a large dataset of Sentinel-2 images along with a per-pixel semantic labelling for land cover. Through various experiments, we demonstrate that our method reduces the need for annotated training data and improves cloud detection performance.
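A minimal sketch of the intuition above, under stated assumptions: a small network predicts a per-pixel quality map that would weight images during multi-image fusion, and at test time low predicted quality is read out as a cloud mask for a single image. The layer sizes, the sigmoid quality map, and the 0.5 threshold are illustrative assumptions, not the paper's exact design.

```python
# Hypothetical per-pixel quality network reused for single-image cloud detection.
import torch
import torch.nn as nn

quality_net = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(inplace=True),
    nn.Conv2d(16, 1, 3, padding=1), nn.Sigmoid(),   # per-pixel quality in [0, 1]
)

image = torch.randn(1, 3, 64, 64)        # single Sentinel-2-like RGB tile
quality = quality_net(image)             # high where the surface is clearly visible
cloud_mask = (quality < 0.5).float()     # low predicted quality interpreted as cloud
print(quality.shape, cloud_mask.mean().item())
```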
Abstract:Roadway free-flow speed captures the typical vehicle speed in low traffic conditions. Modeling free-flow speed is an important problem in transportation engineering with applications to a variety of design, operation, planning, and policy decisions of highway systems. Unfortunately, collecting large-scale historical traffic speed data is expensive and time-consuming. Traditional approaches for estimating free-flow speed use geometric properties of the underlying road segment, such as grade, curvature, lane width, lateral clearance, and access point density, but for many roads such features are unavailable. We propose a fully automated approach, RasterNet, for estimating free-flow speed without the need for explicit geometric features. RasterNet is a neural network that fuses large-scale overhead imagery and aerial LiDAR point clouds using a geospatially consistent raster structure. To support training and evaluation, we introduce a novel dataset combining free-flow speeds of road segments, overhead imagery, and LiDAR point clouds across the state of Kentucky. Our method achieves state-of-the-art results on a benchmark dataset.
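As a rough sketch of the geospatially consistent fusion described above (hypothetical grid size, binning rule, and layer sizes): LiDAR points are binned into a height raster aligned with the overhead image grid, concatenated as an extra channel, and a CNN regresses free-flow speed.

```python
# Illustrative raster fusion of overhead imagery and LiDAR (assumed design).
import numpy as np
import torch
import torch.nn as nn

def rasterize_lidar(points, grid_size=64, extent=100.0):
    """points: [N, 3] array of (x, y, z) in meters; returns a max-height raster."""
    raster = np.zeros((grid_size, grid_size), dtype=np.float32)
    ij = ((points[:, :2] / extent + 0.5) * grid_size).astype(int)
    keep = (ij >= 0).all(1) & (ij < grid_size).all(1)
    for (i, j), z in zip(ij[keep], points[keep, 2]):
        raster[i, j] = max(raster[i, j], z)
    return raster

cnn = nn.Sequential(
    nn.Conv2d(3 + 1, 16, 3, stride=2, padding=1), nn.ReLU(inplace=True),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 1),   # free-flow speed estimate
)

overhead = torch.randn(1, 3, 64, 64)
lidar = rasterize_lidar(np.random.rand(5000, 3) * [100, 100, 30] - [50, 50, 0])
speed = cnn(torch.cat([overhead, torch.from_numpy(lidar)[None, None]], dim=1))
print(speed.shape)  # torch.Size([1, 1])
```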
Abstract:Breast cancer is the malignant tumor that causes the highest number of cancer deaths in females. Digital mammograms (DM or 2D mammogram) and digital breast tomosynthesis (DBT or 3D mammogram) are the two types of mammography imagery used in clinical practice for breast cancer detection and diagnosis. Radiologists usually read both imaging modalities in combination; however, existing computer-aided diagnosis tools are designed using only one imaging modality. Inspired by clinical practice, we propose an innovative convolutional neural network (CNN) architecture for breast cancer classification, which uses both 2D and 3D mammograms simultaneously. Our experiments show that the proposed method significantly improves the performance of breast cancer classification. By ensembling three CNN classifiers, the proposed model achieves 0.97 AUC, which is 34.72% higher than the methods using only one imaging modality.
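A hedged sketch of the multi-modality idea above: one branch scores the 2D digital mammogram, another scores the DBT volume (collapsed here with a toy slice average), and the per-classifier probabilities are averaged. The backbones, the slice aggregation, and the two-branch setup are illustrative assumptions rather than the paper's exact ensemble.

```python
# Hypothetical two-branch ensemble over 2D and 3D mammograms.
import torch
import torch.nn as nn

def make_branch(in_channels):
    return nn.Sequential(
        nn.Conv2d(in_channels, 16, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 1),
    )

dm_branch = make_branch(1)          # 2D digital mammogram branch
dbt_branch = make_branch(1)         # DBT branch (volume reduced to one channel below)

dm = torch.randn(1, 1, 256, 256)
dbt = torch.randn(1, 40, 256, 256)  # 40 reconstructed DBT slices
dbt_2d = dbt.mean(dim=1, keepdim=True)  # toy slice aggregation for illustration

prob = torch.sigmoid(torch.stack([dm_branch(dm), dbt_branch(dbt_2d)])).mean()
print(float(prob))                  # ensembled malignancy probability
```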
Abstract:Automated methods for breast cancer detection have focused on 2D mammography and have largely ignored 3D digital breast tomosynthesis (DBT), which is frequently used in clinical practice. The two key challenges in developing automated methods for DBT classification are handling the variable number of slices and retaining slice-to-slice changes. We propose a novel deep 2D convolutional neural network (CNN) architecture for DBT classification that simultaneously overcomes both challenges. Our approach operates on the full volume, regardless of the number of slices, and allows the use of pre-trained 2D CNNs for feature extraction, which is important given the limited amount of annotated training data. In an extensive evaluation on a real-world clinical dataset, our approach achieves 0.854 auROC, which is 28.80% higher than approaches based on 3D CNNs. We also find that these improvements are stable across a range of model configurations.
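One way to picture how both challenges can be met, as a sketch under stated assumptions (not necessarily the paper's architecture): a shared 2D CNN extracts features per slice so pre-trained weights can be reused, and a recurrent layer over the slice axis accepts any number of slices while modeling slice-to-slice changes.

```python
# Illustrative variable-length DBT classifier built from a pre-trained 2D CNN.
import torch
import torch.nn as nn
from torchvision.models import resnet18

class DBTClassifier(nn.Module):
    def __init__(self):
        super().__init__()
        backbone = resnet18(weights=None)   # weights="IMAGENET1K_V1" in practice
        backbone.fc = nn.Identity()
        self.backbone = backbone            # 512-d feature per slice
        self.gru = nn.GRU(512, 128, batch_first=True)
        self.fc = nn.Linear(128, 1)

    def forward(self, volume):
        # volume: [B, T, 3, H, W] with a variable number of slices T
        b, t = volume.shape[:2]
        feats = self.backbone(volume.flatten(0, 1)).view(b, t, -1)
        _, h = self.gru(feats)              # captures slice-to-slice changes
        return self.fc(h[-1])               # malignancy logit

model = DBTClassifier()
print(model(torch.randn(1, 25, 3, 224, 224)).shape)  # works for any slice count
```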
Abstract:Looking at the world from above, it is possible to estimate many properties of a given location, including the type of land cover and the expected land use. Historically, such tasks have relied on relatively coarse-grained categories due to the difficulty of obtaining fine-grained annotations. In this work, we propose an easily extensible approach that makes it possible to estimate fine-grained properties from overhead imagery. In particular, we propose a cross-modal distillation strategy to learn to predict the distribution of fine-grained properties from overhead imagery, without requiring any manual annotation of overhead imagery. We show that our learned models can be used directly for applications in mapping and image localization.
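A minimal sketch of the cross-modal distillation strategy described above, with hypothetical models and shapes: a ground-level "teacher" predicts a distribution over fine-grained properties for a location, and an overhead "student" is trained to match that distribution with a KL-divergence loss, so no manual overhead annotation is needed.

```python
# Hedged cross-modal distillation sketch: ground-level teacher -> overhead student.
import torch
import torch.nn as nn
import torch.nn.functional as F

def make_cnn(num_classes=50):
    return nn.Sequential(
        nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, num_classes),
    )

teacher = make_cnn().eval()   # assumed pre-trained on labeled ground-level images
student = make_cnn()          # learns from overhead imagery only

ground_img = torch.randn(4, 3, 128, 128)    # co-located ground-level images
overhead_img = torch.randn(4, 3, 128, 128)  # corresponding overhead patches

with torch.no_grad():
    target = F.softmax(teacher(ground_img), dim=1)          # teacher's label distribution
loss = F.kl_div(F.log_softmax(student(overhead_img), dim=1),
                target, reduction="batchmean")
loss.backward()               # gradients flow only into the overhead student
```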
Abstract:We propose an automated method to estimate a road segment's free-flow speed from overhead imagery and road metadata. The free-flow speed of a road segment is the average observed vehicle speed in ideal conditions, without congestion or adverse weather. Standard practice for estimating free-flow speeds depends on several road attributes, including grade, curvature, and right-of-way width. Unfortunately, many of these fine-grained labels are not always readily available and are costly to manually annotate. To compensate, our model uses a small, easy-to-obtain subset of road features along with aerial imagery to directly estimate free-flow speed with a deep convolutional neural network (CNN). We evaluate our approach on a large dataset and demonstrate that using imagery alone performs nearly as well as using the road features, and that the combination of imagery with road features leads to the highest accuracy.
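As a sketch of the fusion described above (layer sizes and the regression head are assumptions): features from an overhead-image CNN are concatenated with a small vector of easy-to-obtain road attributes, and a final layer estimates free-flow speed.

```python
# Hypothetical imagery + road-metadata fusion for free-flow speed estimation.
import torch
import torch.nn as nn

class SpeedNet(nn.Module):
    def __init__(self, num_road_features=5):
        super().__init__()
        self.image_branch = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.head = nn.Sequential(
            nn.Linear(16 + num_road_features, 32), nn.ReLU(inplace=True),
            nn.Linear(32, 1),   # free-flow speed estimate
        )

    def forward(self, image, road_features):
        fused = torch.cat([self.image_branch(image), road_features], dim=1)
        return self.head(fused)

net = SpeedNet()
print(net(torch.randn(2, 3, 128, 128), torch.randn(2, 5)).shape)  # torch.Size([2, 1])
```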