Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vlad Licaret

Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Oct 02, 2020

Alina Marcu, Vlad Licaret, Dragos Costea, Marius Leordeanu

Figure 1 for Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Figure 2 for Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Figure 3 for Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Figure 4 for Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

Abstract:Semantic segmentation is a crucial task for robot navigation and safety. However, current supervised methods require a large amount of pixelwise annotations to yield accurate results. Labeling is a tedious and time consuming process that has hampered progress in low altitude UAV applications. This paper makes an important step towards automatic annotation by introducing SegProp, a novel iterative flow-based method, with a direct connection to spectral clustering in space and time, to propagate the semantic labels to frames that lack human annotations. The labels are further used in semi-supervised learning scenarios. Motivated by the lack of a large video aerial dataset, we also introduce Ruralscapes, a new dataset with high resolution (4K) images and manually-annotated dense labels every 50 frames - the largest of its kind, to the best of our knowledge. Our novel SegProp automatically annotates the remaining unlabeled 98% of frames with an accuracy exceeding 90% (F-measure), significantly outperforming other state-of-the-art label propagation methods. Moreover, when integrating other methods as modules inside SegProp's iterative label propagation loop, we achieve a significant boost over the baseline labels. Finally, we test SegProp in a full semi-supervised setting: we train several state-of-the-art deep neural networks on the SegProp-automatically-labeled training frames and test them on completely novel videos. We convincingly demonstrate, every time, a significant improvement over the supervised scenario.

* Accepted as oral presentation at Asian Conference on Computer Vision (ACCV), 2020. arXiv admin note: text overlap with arXiv:1910.10026

Via

Access Paper or Ask Questions

Towards Automatic Annotation for Semantic Segmentation in Drone Videos

Oct 22, 2019

Alina Marcu, Dragos Costea, Vlad Licaret, Marius Leordeanu

Figure 1 for Towards Automatic Annotation for Semantic Segmentation in Drone Videos

Figure 2 for Towards Automatic Annotation for Semantic Segmentation in Drone Videos

Figure 3 for Towards Automatic Annotation for Semantic Segmentation in Drone Videos

Figure 4 for Towards Automatic Annotation for Semantic Segmentation in Drone Videos

Abstract:Semantic segmentation is a crucial task for robot navigation and safety. However, it requires huge amounts of pixelwise annotations to yield accurate results. While recent progress in computer vision algorithms has been heavily boosted by large ground-level datasets, the labeling time has hampered progress in low altitude UAV applications, mostly due to the difficulty imposed by large object scales and pose variations. Motivated by the lack of a large video aerial dataset, we introduce a new one, with high resolution (4K) images and manually-annotated dense labels every 50 frames. To help the video labeling process, we make an important step towards automatic annotation and propose SegProp, an iterative flow-based method with geometric constrains to propagate the semantic labels to frames that lack human annotations. This results in a dataset with more than 50k annotated frames - the largest of its kind, to the best of our knowledge. Our experiments show that SegProp surpasses current state-of-the-art label propagation methods by a significant margin. Furthermore, when training a semantic segmentation deep neural net using the automatically annotated frames, we obtain a compelling overall performance boost at test time of 16.8% mean F-measure over a baseline trained only with manually-labeled frames. Our Ruralscapes dataset, the label propagation code and a fast segmentation tool are available at our website: https://sites.google.com/site/aerialimageunderstanding/

* 7 pages, 6 figures, submitted at the International Conference on Robotics and Automation (ICRA) 2020

Via

Access Paper or Ask Questions