Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Darshan Hegde

Multi-View Fusion of Sensor Data for Improved Perception and Prediction in Autonomous Driving

Aug 27, 2020

Sudeep Fadadu, Shreyash Pandey, Darshan Hegde, Yi Shi, Fang-Chieh Chou, Nemanja Djuric, Carlos Vallespi-Gonzalez

Figure 1 for Multi-View Fusion of Sensor Data for Improved Perception and Prediction in Autonomous Driving

Figure 2 for Multi-View Fusion of Sensor Data for Improved Perception and Prediction in Autonomous Driving

Figure 3 for Multi-View Fusion of Sensor Data for Improved Perception and Prediction in Autonomous Driving

Figure 4 for Multi-View Fusion of Sensor Data for Improved Perception and Prediction in Autonomous Driving

Abstract:We present an end-to-end method for object detection and trajectory prediction utilizing multi-view representations of LiDAR returns. Our method builds on a state-of-the-art Bird's-Eye View (BEV) network that fuses voxelized features from a sequence of historical LiDAR data as well as rasterized high-definition map to perform detection and prediction tasks. We extend the BEV network with additional LiDAR Range-View (RV) features that use the raw LiDAR information in its native, non-quantized representation. The RV feature map is projected into BEV and fused with the BEV features computed from LiDAR and high-definition map. The fused features are then further processed to output the final detections and trajectories, within a single end-to-end trainable network. In addition, using this framework the RV fusion of LiDAR and camera is performed in a straightforward and computational efficient manner. The proposed approach improves the state-of-the-art on proprietary large-scale real-world data collected by a fleet of self-driving vehicles, as well as on the public nuScenes data set.

Via

Access Paper or Ask Questions

Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation

Apr 25, 2019

Gregory P. Meyer, Jake Charland, Darshan Hegde, Ankit Laddha, Carlos Vallespi-Gonzalez

Figure 1 for Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation

Figure 2 for Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation

Figure 3 for Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation

Figure 4 for Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation

Abstract:In this paper, we present an extension to LaserNet, an efficient and state-of-the-art LiDAR based 3D object detector. We propose a method for fusing image data with the LiDAR data and show that this sensor fusion method improves the detection performance of the model especially at long ranges. The addition of image data is straightforward and does not require image labels. Furthermore, we expand the capabilities of the model to perform 3D semantic segmentation in addition to 3D object detection. On a large benchmark dataset, we demonstrate our approach achieves state-of-the-art performance on both object detection and semantic segmentation while maintaining a low runtime.

* Accepted for publication at CVPR Workshop on Autonomous Driving 2019

Via

Access Paper or Ask Questions

Temporal Analysis of Language through Neural Language Models

May 14, 2014

Yoon Kim, Yi-I Chiu, Kentaro Hanaki, Darshan Hegde, Slav Petrov

Figure 1 for Temporal Analysis of Language through Neural Language Models

Figure 2 for Temporal Analysis of Language through Neural Language Models

Figure 3 for Temporal Analysis of Language through Neural Language Models

Figure 4 for Temporal Analysis of Language through Neural Language Models

Abstract:We provide a method for automatically detecting change in language across time through a chronologically trained neural language model. We train the model on the Google Books Ngram corpus to obtain word vector representations specific to each year, and identify words that have changed significantly from 1900 to 2009. The model identifies words such as "cell" and "gay" as having changed during that time period. The model simultaneously identifies the specific years during which such words underwent change.

* Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science. June, 2014. 61--65

Via

Access Paper or Ask Questions