Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Nov 19, 2020

Xing Shen, Jirui Yang, Chunbo Wei, Bing Deng, Jianqiang Huang, Xiansheng Hua, Xiaoliang Cheng, Kewei Liang

Figure 1 for DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Figure 2 for DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Figure 3 for DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Figure 4 for DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Share this with someone who'll enjoy it:

Abstract:Binary grid mask representation is broadly used in instance segmentation. A representative instantiation is Mask R-CNN which predicts masks on a $28\times 28$ binary grid. Generally, a low-resolution grid is not sufficient to capture the details, while a high-resolution grid dramatically increases the training complexity. In this paper, we propose a new mask representation by applying the discrete cosine transform(DCT) to encode the high-resolution binary grid mask into a compact vector. Our method, termed DCT-Mask, could be easily integrated into most pixel-based instance segmentation methods. Without any bells and whistles, DCT-Mask yields significant gains on different frameworks, backbones, datasets, and training schedules. It does not require any pre-processing or pre-training, and almost no harm to the running speed. Especially, for higher-quality annotations and more complex backbones, our method has a greater improvement. Moreover, we analyze the performance of our method from the perspective of the quality of mask representation. The main reason why DCT-Mask works well is that it obtains a high-quality mask representation with low complexity. Code will be made available.

View paper on

Share this with someone who'll enjoy it:

Title:DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Paper and Code