Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yuan-Hao Lee

A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic Segmentation

Nov 02, 2021

Yuan-Hao Lee, Fu-En Yang, Yu-Chiang Frank Wang

Figure 1 for A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic Segmentation

Figure 2 for A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic Segmentation

Figure 3 for A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic Segmentation

Figure 4 for A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic Segmentation

Abstract:Few-shot semantic segmentation addresses the learning task in which only few images with ground truth pixel-level labels are available for the novel classes of interest. One is typically required to collect a large mount of data (i.e., base classes) with such ground truth information, followed by meta-learning strategies to address the above learning task. When only image-level semantic labels can be observed during both training and testing, it is considered as an even more challenging task of weakly supervised few-shot semantic segmentation. To address this problem, we propose a novel meta-learning framework, which predicts pseudo pixel-level segmentation masks from a limited amount of data and their semantic labels. More importantly, our learning scheme further exploits the produced pixel-level information for query image inputs with segmentation guarantees. Thus, our proposed learning model can be viewed as a pixel-level meta-learner. Through extensive experiments on benchmark datasets, we show that our model achieves satisfactory performances under fully supervised settings, yet performs favorably against state-of-the-art methods under weakly supervised settings.

* Accepted to WACV 2022

Via

Access Paper or Ask Questions

Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-to-Video Synthesis

Feb 26, 2021

Fu-En Yang, Jing-Cheng Chang, Yuan-Hao Lee, Yu-Chiang Frank Wang

Figure 1 for Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-to-Video Synthesis

Figure 2 for Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-to-Video Synthesis

Figure 3 for Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-to-Video Synthesis

Figure 4 for Dual-MTGAN: Stochastic and Deterministic Motion Transfer for Image-to-Video Synthesis

Abstract:Generating videos with content and motion variations is a challenging task in computer vision. While the recent development of GAN allows video generation from latent representations, it is not easy to produce videos with particular content of motion patterns of interest. In this paper, we propose Dual Motion Transfer GAN (Dual-MTGAN), which takes image and video data as inputs while learning disentangled content and motion representations. Our Dual-MTGAN is able to perform deterministic motion transfer and stochastic motion generation. Based on a given image, the former preserves the input content and transfers motion patterns observed from another video sequence, and the latter directly produces videos with plausible yet diverse motion patterns based on the input image. The proposed model is trained in an end-to-end manner, without the need to utilize pre-defined motion features like pose or facial landmarks. Our quantitative and qualitative results would confirm the effectiveness and robustness of our model in addressing such conditioned image-to-video tasks.

* Accepted to ICPR 2020

Via

Access Paper or Ask Questions