Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stephen Phillips

EVORA: Deep Evidential Traversability Learning for Risk-Aware Off-Road Autonomy

Nov 10, 2023

Xiaoyi Cai, Siddharth Ancha, Lakshay Sharma, Philip R. Osteen, Bernadette Bucher, Stephen Phillips, Jiuguang Wang, Michael Everett, Nicholas Roy, Jonathan P. How

Figure 1 for EVORA: Deep Evidential Traversability Learning for Risk-Aware Off-Road Autonomy

Figure 2 for EVORA: Deep Evidential Traversability Learning for Risk-Aware Off-Road Autonomy

Figure 3 for EVORA: Deep Evidential Traversability Learning for Risk-Aware Off-Road Autonomy

Figure 4 for EVORA: Deep Evidential Traversability Learning for Risk-Aware Off-Road Autonomy

Abstract:Traversing terrain with good traction is crucial for achieving fast off-road navigation. Instead of manually designing costs based on terrain features, existing methods learn terrain properties directly from data via self-supervision, but challenges remain to properly quantify and mitigate risks due to uncertainties in learned models. This work efficiently quantifies both aleatoric and epistemic uncertainties by learning discrete traction distributions and probability densities of the traction predictor's latent features. Leveraging evidential deep learning, we parameterize Dirichlet distributions with the network outputs and propose a novel uncertainty-aware squared Earth Mover's distance loss with a closed-form expression that improves learning accuracy and navigation performance. The proposed risk-aware planner simulates state trajectories with the worst-case expected traction to handle aleatoric uncertainty, and penalizes trajectories moving through terrain with high epistemic uncertainty. Our approach is extensively validated in simulation and on wheeled and quadruped robots, showing improved navigation performance compared to methods that assume no slip, assume the expected traction, or optimize for the worst-case expected cost.

* Under review. Journal extension for arXiv:2210.00153. Project website: https://xiaoyi-cai.github.io/evora/

Via

Access Paper or Ask Questions

Learning Portrait Style Representations

Dec 08, 2020

Sadat Shaik, Bernadette Bucher, Nephele Agrafiotis, Stephen Phillips, Kostas Daniilidis, William Schmenner

Figure 1 for Learning Portrait Style Representations

Figure 2 for Learning Portrait Style Representations

Figure 3 for Learning Portrait Style Representations

Figure 4 for Learning Portrait Style Representations

Abstract:Style analysis of artwork in computer vision predominantly focuses on achieving results in target image generation through optimizing understanding of low level style characteristics such as brush strokes. However, fundamentally different techniques are required to computationally understand and control qualities of art which incorporate higher level style characteristics. We study style representations learned by neural network architectures incorporating these higher level characteristics. We find variation in learned style features from incorporating triplets annotated by art historians as supervision for style similarity. Networks leveraging statistical priors or pretrained on photo collections such as ImageNet can also derive useful visual representations of artwork. We align the impact of these expert human knowledge, statistical, and photo realism priors on style representations with art historical research and use these representations to perform zero-shot classification of artists. To facilitate this work, we also present the first large-scale dataset of portraits prepared for computational analysis.

* Sadat Shaik and Bernadette Bucher contributed equally

Via

Access Paper or Ask Questions

All Graphs Lead to Rome: Learning Geometric and Cycle-Consistent Representations with Graph Convolutional Networks

Jan 07, 2019

Stephen Phillips, Kostas Daniilidis

Figure 1 for All Graphs Lead to Rome: Learning Geometric and Cycle-Consistent Representations with Graph Convolutional Networks

Figure 2 for All Graphs Lead to Rome: Learning Geometric and Cycle-Consistent Representations with Graph Convolutional Networks

Abstract:Image feature matching is a fundamental part of many geometric computer vision applications, and using multiple images can improve performance. In this work, we formulate multi-image matching as a graph embedding problem then use a Graph Convolutional Network to learn an appropriate embedding function for aligning image features. We use cycle consistency to train our network in an unsupervised fashion, since ground truth correspondence is difficult or expensive to aquire. In addition, geometric consistency losses can be added at training time, even if the information is not available in the test set, unlike previous approaches that optimize cycle consistency directly. To the best of our knowledge, no other works have used learning for multi-image feature matching. Our experiments show that our method is competitive with other optimization based approaches.

* 9 pages, 7 figures, 2 tables, 2 supplemental figures

Via

Access Paper or Ask Questions

Understanding image motion with group representations

Feb 26, 2018

Andrew Jaegle, Stephen Phillips, Daphne Ippolito, Kostas Daniilidis

Figure 1 for Understanding image motion with group representations

Figure 2 for Understanding image motion with group representations

Figure 3 for Understanding image motion with group representations

Figure 4 for Understanding image motion with group representations

Abstract:Motion is an important signal for agents in dynamic environments, but learning to represent motion from unlabeled video is a difficult and underconstrained problem. We propose a model of motion based on elementary group properties of transformations and use it to train a representation of image motion. While most methods of estimating motion are based on pixel-level constraints, we use these group properties to constrain the abstract representation of motion itself. We demonstrate that a deep neural network trained using this method captures motion in both synthetic 2D sequences and real-world sequences of vehicle motion, without requiring any labels. Networks trained to respect these constraints implicitly identify the image characteristic of motion in different sequence types. In the context of vehicle motion, this method extracts information useful for localization, tracking, and odometry. Our results demonstrate that this representation is useful for learning motion in the general setting where explicit labels are difficult to obtain.

* Published as a conference paper at ICLR 2018; 14 pages, including references and supplement

Via

Access Paper or Ask Questions

Fast, Robust, Continuous Monocular Egomotion Computation

Feb 16, 2016

Andrew Jaegle, Stephen Phillips, Kostas Daniilidis

Figure 1 for Fast, Robust, Continuous Monocular Egomotion Computation

Figure 2 for Fast, Robust, Continuous Monocular Egomotion Computation

Figure 3 for Fast, Robust, Continuous Monocular Egomotion Computation

Figure 4 for Fast, Robust, Continuous Monocular Egomotion Computation

Abstract:We propose robust methods for estimating camera egomotion in noisy, real-world monocular image sequences in the general case of unknown observer rotation and translation with two views and a small baseline. This is a difficult problem because of the nonconvex cost function of the perspective camera motion equation and because of non-Gaussian noise arising from noisy optical flow estimates and scene non-rigidity. To address this problem, we introduce the expected residual likelihood method (ERL), which estimates confidence weights for noisy optical flow data using likelihood distributions of the residuals of the flow field under a range of counterfactual model parameters. We show that ERL is effective at identifying outliers and recovering appropriate confidence weights in many settings. We compare ERL to a novel formulation of the perspective camera motion equation using a lifted kernel, a recently proposed optimization framework for joint parameter and confidence weight estimation with good empirical properties. We incorporate these strategies into a motion estimation pipeline that avoids falling into local minima. We find that ERL outperforms the lifted kernel method and baseline monocular egomotion estimation strategies on the challenging KITTI dataset, while adding almost no runtime cost over baseline egomotion methods.

* Accepted as a conference paper at ICRA 2016. Main paper: 8 pages, 7 figures. Supplement: 4 pages, 2 figures

Via

Access Paper or Ask Questions