Binnan Zhuang

ProspectNet: Weighted Conditional Attention for Future Interaction Modeling in Behavior Prediction

Aug 29, 2022

Yutian Pang, Zehua Guo, Binnan Zhuang

Abstract: Behavior prediction plays an important role in integrated autonomous driving software solutions. Within behavior prediction research, interactive behavior prediction is less explored than single-agent behavior prediction. Predicting the motion of interactive agents requires new mechanisms to capture the joint behaviors of the interactive pairs. In this work, we formulate the end-to-end joint prediction problem as a sequential learning process of marginal learning and joint learning of vehicle behaviors. We propose ProspectNet, a joint learning block that adopts the weighted attention score to model the mutual influence between interactive agent pairs. The joint learning block first weighs the multi-modal predicted candidate trajectories, then updates the ego-agent's embedding via cross attention. Furthermore, we broadcast the individual future predictions for each interactive agent into a pair-wise scoring module to select the top $K$ prediction pairs. We show that ProspectNet outperforms the Cartesian product of two marginal predictions, and achieves comparable performance on the Waymo Interactive Motion Prediction benchmarks.
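The baseline mentioned in the abstract, the Cartesian product of two marginal predictions, can be illustrated with a small sketch. This is not the authors' code: the function name `top_k_joint_pairs`, the example probabilities, and the choice to score a pair by the product of its marginal probabilities are assumptions made for illustration. According to the abstract, ProspectNet replaces this kind of independent scoring with a learned pair-wise scoring module.

```python
import heapq
from itertools import product


def top_k_joint_pairs(probs_a, probs_b, k):
    """Rank all (i, j) mode pairs by the product of marginal probabilities.

    probs_a[i] is the (assumed) confidence of agent A's i-th candidate
    trajectory; probs_b[j] is the same for agent B. Treating the agents as
    independent, the joint score of a pair is probs_a[i] * probs_b[j].
    """
    scored = [
        (pa * pb, i, j)
        for (i, pa), (j, pb) in product(enumerate(probs_a), enumerate(probs_b))
    ]
    # nlargest returns the k highest-scoring tuples in descending order.
    return [(i, j, score) for score, i, j in heapq.nlargest(k, scored)]


# Hypothetical marginal confidences for two interacting agents.
probs_a = [0.5, 0.3, 0.2]
probs_b = [0.6, 0.4]
pairs = top_k_joint_pairs(probs_a, probs_b, k=3)
for i, j, score in pairs:
    print(f"A mode {i}, B mode {j}: joint score {score:.2f}")
```

Because the product treats the two agents as independent, it cannot penalize pairs of trajectories that are individually likely but jointly implausible (for example, two vehicles entering the same gap), which is the kind of mutual influence the abstract says the joint learning block is meant to capture.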

Radar Camera Fusion via Representation Learning in Autonomous Driving

Mar 14, 2021

Xu Dong, Binnan Zhuang, Yunxiang Mao, Langechuan Liu

Abstract: Radars and cameras are mature, cost-effective, and robust sensors and have been widely used in the perception stack of mass-produced autonomous driving systems. Due to their complementary properties, outputs from radar detection (radar pins) and camera perception (2D bounding boxes) are usually fused to generate the best perception results. The key to successful radar-camera fusion is accurate data association. The challenges in radar-camera association can be attributed to the complexity of driving scenes, the noisy and sparse nature of radar measurements, and the depth ambiguity from 2D bounding boxes. Traditional rule-based association methods are susceptible to performance degradation in challenging scenarios and failure in corner cases. In this study, we propose to address radar-camera association via deep representation learning, to explore feature-level interaction and global reasoning. Concretely, we design a loss sampling mechanism and an innovative ordinal loss to overcome the difficulty of imperfect labeling and to enforce critical human reasoning. Despite being trained with noisy labels generated by a rule-based algorithm, our proposed method achieves an F1 score of 92.2%, which is 11.6% higher than the rule-based teacher. Moreover, this data-driven method also lends itself to continuous improvement via corner case mining.
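To make the idea of association via learned representations concrete, here is a minimal sketch. It assumes each radar pin and each 2D bounding box has already been mapped to an embedding vector, and that a pin is associated with the box whose embedding is most similar, provided the similarity clears a threshold. The function names, the cosine similarity measure, the threshold value, and the example vectors are all hypothetical; the abstract does not specify these details.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def associate(pin_embeddings, box_embeddings, threshold=0.5):
    """Assign each radar pin to its most similar box, or None if no box fits.

    Returns a list with one entry per pin: the index of the matched box,
    or None when the best similarity is below `threshold` (an assumed
    cut-off for rejecting clutter or unmatched detections).
    """
    matches = []
    for pin in pin_embeddings:
        scores = [cosine(pin, box) for box in box_embeddings]
        if not scores:
            matches.append(None)
            continue
        best = max(range(len(scores)), key=scores.__getitem__)
        matches.append(best if scores[best] >= threshold else None)
    return matches


# Hypothetical 2-D embeddings: two pins near one box each, one outlier pin.
pins = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
boxes = [[0.9, 0.1], [0.1, 0.9]]
result = associate(pins, boxes)
print(result)
```

In this toy setup the third pin points away from both boxes and is left unassociated, which loosely mirrors how a noisy radar return with no matching camera detection would be rejected.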
