Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:End-to-End Object Detection with Adaptive Clustering Transformer

Nov 18, 2020

Minghang Zheng, Peng Gao, Xiaogang Wang, Hongsheng Li, Hao Dong

Figure 1 for End-to-End Object Detection with Adaptive Clustering Transformer

Figure 2 for End-to-End Object Detection with Adaptive Clustering Transformer

Figure 3 for End-to-End Object Detection with Adaptive Clustering Transformer

Figure 4 for End-to-End Object Detection with Adaptive Clustering Transformer

Share this with someone who'll enjoy it:

Abstract:End-to-end Object Detection with Transformer (DETR)proposes to perform object detection with Transformer and achieve comparable performance with two-stage object detection like Faster-RCNN. However, DETR needs huge computational resources for training and inference due to the high-resolution spatial input. In this paper, a novel variant of transformer named Adaptive Clustering Transformer(ACT) has been proposed to reduce the computation cost for high-resolution input. ACT cluster the query features adaptively using Locality Sensitive Hashing (LSH) and ap-proximate the query-key interaction using the prototype-key interaction. ACT can reduce the quadratic O(N2) complexity inside self-attention into O(NK) where K is the number of prototypes in each layer. ACT can be a drop-in module replacing the original self-attention module without any training. ACT achieves a good balance between accuracy and computation cost (FLOPs). The code is available as supplementary for the ease of experiment replication and verification.

* technique report

View paper on

Share this with someone who'll enjoy it:

Title:End-to-End Object Detection with Adaptive Clustering Transformer

Paper and Code