Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mustafa Gkhan Uzunbas

DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation

Apr 16, 2018

Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gkhan Uzunbas, Tom Goldstein, Ser Nam Lim, Larry S. Davis

Figure 1 for DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation

Figure 2 for DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation

Figure 3 for DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation

Figure 4 for DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation

Abstract:Harvesting dense pixel-level annotations to train deep neural networks for semantic segmentation is extremely expensive and unwieldy at scale. While learning from synthetic data where labels are readily available sounds promising, performance degrades significantly when testing on novel realistic data due to domain discrepancies. We present Dual Channel-wise Alignment Networks (DCAN), a simple yet effective approach to reduce domain shift at both pixel-level and feature-level. Exploring statistics in each channel of CNN feature maps, our framework performs channel-wise feature alignment, which preserves spatial structures and semantic information, in both an image generator and a segmentation network. In particular, given an image from the source domain and unlabeled samples from the target domain, the generator synthesizes new images on-the-fly to resemble samples from the target domain in appearance and the segmentation network further refines high-level features before predicting semantic maps, both of which leverage feature statistics of sampled images from the target domain. Unlike much recent and concurrent work relying on adversarial training, our framework is lightweight and easy to train. Extensive experiments on adapting models trained on synthetic segmentation benchmarks to real urban scenes demonstrate the effectiveness of the proposed framework.

Via

Access Paper or Ask Questions