Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Dynamic Graph Message Passing Networks for Visual Recognition

Sep 20, 2022

Li Zhang, Mohan Chen, Anurag Arnab, Xiangyang Xue, Philip H. S. Torr

Figure 1 for Dynamic Graph Message Passing Networks for Visual Recognition

Figure 2 for Dynamic Graph Message Passing Networks for Visual Recognition

Figure 3 for Dynamic Graph Message Passing Networks for Visual Recognition

Figure 4 for Dynamic Graph Message Passing Networks for Visual Recognition

Share this with someone who'll enjoy it:

Abstract:Modelling long-range dependencies is critical for scene understanding tasks in computer vision. Although convolution neural networks (CNNs) have excelled in many vision tasks, they are still limited in capturing long-range structured relationships as they typically consist of layers of local kernels. A fully-connected graph, such as the self-attention operation in Transformers, is beneficial for such modelling, however, its computational overhead is prohibitive. In this paper, we propose a dynamic graph message passing network, that significantly reduces the computational complexity compared to related works modelling a fully-connected graph. This is achieved by adaptively sampling nodes in the graph, conditioned on the input, for message passing. Based on the sampled nodes, we dynamically predict node-dependent filter weights and the affinity matrix for propagating information between them. This formulation allows us to design a self-attention module, and more importantly a new Transformer-based backbone network, that we use for both image classification pretraining, and for addressing various downstream tasks (object detection, instance and semantic segmentation). Using this model, we show significant improvements with respect to strong, state-of-the-art baselines on four different tasks. Our approach also outperforms fully-connected graphs while using substantially fewer floating-point operations and parameters. Code and models will be made publicly available at https://github.com/fudan-zvg/DGMN2

* PAMI extension of CVPR 2020 oral work arXiv:1908.06955

View paper on

Share this with someone who'll enjoy it:

Title:Dynamic Graph Message Passing Networks for Visual Recognition

Paper and Code