Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ahmed El-Sallab

Spatio-Temporal Multi-Task Learning Transformer for Joint Moving Object Detection and Segmentation

Jun 21, 2021

Eslam Mohamed, Ahmed El-Sallab

Figure 1 for Spatio-Temporal Multi-Task Learning Transformer for Joint Moving Object Detection and Segmentation

Figure 2 for Spatio-Temporal Multi-Task Learning Transformer for Joint Moving Object Detection and Segmentation

Figure 3 for Spatio-Temporal Multi-Task Learning Transformer for Joint Moving Object Detection and Segmentation

Figure 4 for Spatio-Temporal Multi-Task Learning Transformer for Joint Moving Object Detection and Segmentation

Abstract:Moving objects have special importance for Autonomous Driving tasks. Detecting moving objects can be posed as Moving Object Segmentation, by segmenting the object pixels, or Moving Object Detection, by generating a bounding box for the moving targets. In this paper, we present a Multi-Task Learning architecture, based on Transformers, to jointly perform both tasks through one network. Due to the importance of the motion features to the task, the whole setup is based on a Spatio-Temporal aggregation. We evaluate the performance of the individual tasks architecture versus the MTL setup, both with early shared encoders, and late shared encoder-decoder transformers. For the latter, we present a novel joint tasks query decoder transformer, that enables us to have tasks dedicated heads out of the shared model. To evaluate our approach, we use the KITTI MOD [29] data set. Results show1.5% mAP improvement for Moving Object Detection, and 2%IoU improvement for Moving Object Segmentation, over the individual tasks networks.

Via

Access Paper or Ask Questions