The computer vision community has extensively explored vision transformers over the past two years. Drawing inspiration from traditional schemes, many works focus on introducing vision-specific inductive biases. However, the implicit permutation invariance and fully connected interactions among individual tokens disrupt regional context and spatial topology, further hindering higher-order modeling. This deviates from the principle of perceptual organization, which emphasizes the local groupings and overall topology of visual elements. We therefore introduce the concept of the hypergraph for perceptual exploration. Specifically, we propose a topology-aware vision transformer called the HyperGraph Transformer (HGFormer). First, we present a Center Sampling K-Nearest Neighbors (CS-KNN) algorithm that provides semantic guidance during hypergraph construction. Second, we present a topology-aware HyperGraph Attention (HGA) mechanism that integrates hypergraph topology as perceptual cues to guide the aggregation of global and unbiased information during hypergraph message passing. Using HGFormer as a visual backbone, we develop an effective and unified representation that achieves distinct and detailed scene depictions. Empirical experiments show that HGFormer achieves competitive performance compared with recent state-of-the-art counterparts on various visual benchmarks. Extensive ablation and visualization studies provide comprehensive explanations of our ideas and contributions.
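To make the CS-KNN step concrete, below is a minimal PyTorch sketch of center-sampled KNN hypergraph construction over a set of tokens. The center-scoring heuristic (similarity to a mean-pooled global feature), the cosine-similarity metric, and the names `cs_knn_hyperedges`, `num_centers`, and `k` are assumptions of this sketch, not the paper's exact formulation.

```python
import torch

def cs_knn_hyperedges(tokens, num_centers, k):
    """Sketch of center-sampling KNN (CS-KNN) hypergraph construction.

    tokens:      (N, C) token features
    num_centers: number of hyperedge centers to sample (E)
    k:           tokens per hyperedge

    Returns H: (N, E) binary incidence matrix with H[i, e] = 1
    iff token i belongs to hyperedge e.
    """
    # Score tokens by similarity to a mean-pooled global feature and
    # keep the top-scoring ones as centers. The paper derives this
    # guidance from semantics; the pooled proxy is an assumption here.
    global_feat = tokens.mean(dim=0, keepdim=True)                # (1, C)
    scores = torch.cosine_similarity(tokens, global_feat, dim=-1) # (N,)
    center_idx = scores.topk(num_centers).indices                 # (E,)

    # Each hyperedge groups a center with its k most similar tokens;
    # a center is its own top-1 neighbor, so it joins its hyperedge.
    sim = torch.cosine_similarity(
        tokens[center_idx].unsqueeze(1), tokens.unsqueeze(0), dim=-1
    )                                                             # (E, N)
    knn_idx = sim.topk(k, dim=-1).indices                         # (E, k)

    # Scatter memberships into the incidence matrix: column e marks
    # the k tokens of hyperedge e.
    H = torch.zeros(tokens.size(0), num_centers)
    H.scatter_(0, knn_idx.T, 1.0)
    return H

# Example: 196 tokens (a 14x14 patch grid), 16 hyperedges of 9 tokens.
H = cs_knn_hyperedges(torch.randn(196, 64), num_centers=16, k=9)
print(H.shape, H.sum(dim=0))  # torch.Size([196, 16]); 9 tokens per edge
```

Under this reading, the resulting incidence matrix is what a downstream hypergraph attention layer would consume to restrict aggregation to topologically grouped tokens.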