Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

May 19, 2022

Dipan Mandal, Abhilash Jain, Sreenivas Subramoney

Figure 1 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

Figure 2 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

Figure 3 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

Figure 4 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

Share this with someone who'll enjoy it:

Abstract:We propose DFPNet -- an unsupervised, joint learning system for monocular Depth, Optical Flow and egomotion (Camera Pose) estimation from monocular image sequences. Due to the nature of 3D scene geometry these three components are coupled. We leverage this fact to jointly train all the three components in an end-to-end manner. A single composite loss function -- which involves image reconstruction-based loss for depth & optical flow, bidirectional consistency checks and smoothness loss components -- is used to train the network. Using hyperparameter tuning, we are able to reduce the model size to less than 5% (8.4M parameters) of state-of-the-art DFP models. Evaluation on KITTI and Cityscapes driving datasets reveals that our model achieves results comparable to state-of-the-art in all of the three tasks, even with the significantly smaller model size.

* 8 pages, 2 figures. arXiv admin note: text overlap with arXiv:1803.02276 by other authors

View paper on

Share this with someone who'll enjoy it:

Title:Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

Paper and Code