Xiyu Yan

WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations

Aug 04, 2021

Peidong Liu, Zibin He, Xiyu Yan, Yong Jiang, Shutao Xia, Feng Zheng, Maowei Hu

Abstract: Compared with tedious per-pixel mask annotation, annotating data with clicks is much easier and costs only several seconds per image. However, using clicks to learn a video semantic segmentation model has not been explored before. In this work, we propose an effective weakly-supervised video semantic segmentation pipeline with click annotations, called WeClick, which saves laborious annotation effort by segmenting an instance of a semantic class with only a single click. Since clicks do not capture detailed semantic information, directly training with click labels leads to poor segmentation predictions. To mitigate this problem, we design a novel memory flow knowledge distillation strategy to exploit temporal information (named memory flow) in abundant unlabeled video frames, by distilling the neighboring predictions to the target frame via estimated motion. Moreover, we adopt vanilla knowledge distillation for model compression. In this way, WeClick learns compact video semantic segmentation models from low-cost click annotations during training, yielding models that are both real-time and accurate at inference. Experimental results on Cityscapes and CamVid show that WeClick outperforms state-of-the-art methods, improves on the baseline by 10.24% mIoU, and achieves real-time execution.
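
The idea of distilling a neighboring frame's prediction to the target frame via estimated motion can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the backward-flow convention, the nearest-neighbor sampling, and the use of a per-pixel KL divergence as the distillation loss are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def warp_by_flow(pred, flow):
    """Warp a (C, H, W) map from a neighboring frame onto the target frame.

    Assumption: flow[0] and flow[1] hold the x and y displacement from each
    target pixel to its source location in the neighbor (backward flow).
    Nearest-neighbor sampling with border clamping keeps the sketch short.
    """
    _, h, w = pred.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_x = np.clip(np.rint(xs + flow[0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[1]).astype(int), 0, h - 1)
    return pred[:, src_y, src_x]


def softmax(logits, axis=0):
    """Numerically stable softmax over the class axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def flow_distillation_loss(target_logits, neighbor_logits, flow, eps=1e-8):
    """Mean per-pixel KL(warped neighbor || target).

    Hypothetical loss: the warped neighbor prediction acts as the teacher
    signal for the target-frame prediction.
    """
    teacher = softmax(warp_by_flow(neighbor_logits, flow))
    student = softmax(target_logits)
    kl = (teacher * (np.log(teacher + eps) - np.log(student + eps))).sum(axis=0)
    return float(kl.mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    target = rng.normal(size=(3, 4, 5))    # 3 classes, 4x5 frame
    neighbor = rng.normal(size=(3, 4, 5))
    flow = np.zeros((2, 4, 5))             # no motion between the frames
    print(flow_distillation_loss(target, neighbor, flow))
```

In a training setup the flow would come from a motion estimator and the loss would be minimized with respect to the target-frame prediction; both parts are outside this sketch.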

* Accepted by ACM MM2021 Oral

Deep Flow Collaborative Network for Online Visual Tracking

Nov 05, 2019

Peidong Liu, Xiyu Yan, Yong Jiang, Shu-Tao Xia

Abstract: Deep learning-based visual tracking algorithms such as MDNet achieve high performance by leveraging the feature extraction ability of deep neural networks. However, these trackers are inefficient because feature extraction is slow for each frame of a video. In this paper, we propose an effective tracking algorithm to reduce this time cost. Specifically, we design a deep flow collaborative network, which executes the expensive feature network only on sparse keyframes and transfers the feature maps to other frames via optical flow. Moreover, we propose an effective adaptive keyframe scheduling mechanism to select the most appropriate keyframe. We evaluate the proposed approach on large-scale datasets: OTB2013 and OTB2015. Experimental results show that our algorithm achieves considerable speedup as well as high precision.
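
The keyframe idea can be sketched as a loop that runs the expensive feature extractor only on keyframes and warps the cached keyframe features to the other frames. This is an illustrative sketch, not the paper's method: `extract_features` and `estimate_flow` are hypothetical callables supplied by the caller, and the `max_motion` threshold on mean flow magnitude is only a stand-in for the adaptive keyframe scheduling mechanism, which the abstract does not describe.

```python
import numpy as np


def warp_features(feat, flow):
    """Warp (C, H, W) keyframe features to the current frame.

    Assumption: flow[0] and flow[1] are x and y displacements from each
    current-frame pixel to its source location in the keyframe.
    """
    _, h, w = feat.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_x = np.clip(np.rint(xs + flow[0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[1]).astype(int), 0, h - 1)
    return feat[:, src_y, src_x]


def track_features(frames, extract_features, estimate_flow, max_motion=2.0):
    """Yield (is_keyframe, features) for every frame.

    extract_features(frame) -> (C, H, W) array; the expensive network.
    estimate_flow(frame, key_frame) -> (2, H, W) array; the cheap flow model.
    A new keyframe is chosen when the mean flow magnitude exceeds max_motion
    (a hypothetical rule used only to make the sketch complete).
    """
    key_frame, key_feat = None, None
    for frame in frames:
        if key_frame is not None:
            flow = estimate_flow(frame, key_frame)
            if np.abs(flow).mean() <= max_motion:
                # Cheap path: reuse the keyframe features via warping.
                yield False, warp_features(key_feat, flow)
                continue
        # Expensive path: run the feature network and cache the result.
        key_frame, key_feat = frame, extract_features(frame)
        yield True, key_feat
```

With a static scene and a zero flow estimate, only the first frame triggers the expensive extractor; every later frame reuses the cached features.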
