Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching

May 18, 2022

Jaeyoung Yoo, Hojun Lee, Seunghyeon Seo, Inseop Chung, Nojun Kwak

Figure 1 for Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching

Figure 2 for Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching

Figure 3 for Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching

Figure 4 for Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching

Share this with someone who'll enjoy it:

Abstract:Recent end-to-end multi-object detectors simplify the inference pipeline by removing the hand-crafted process such as the duplicate bounding box removal using non-maximum suppression (NMS). However, in the training, they require bipartite matching to calculate the loss from the output of the detector. Contrary to the directivity of the end-to-end method, the bipartite matching makes the training of the end-to-end detector complex, heuristic, and reliant. In this paper, we aim to propose a method to train the end-to-end multi-object detector without bipartite matching. To this end, we approach end-to-end multi-object detection as a density estimation using a mixture model. Our proposed detector, called Sparse Mixture Density Object Detector (Sparse MDOD) estimates the distribution of bounding boxes using a mixture model. Sparse MDOD is trained by minimizing the negative log-likelihood and our proposed regularization term, maximum component maximization (MCM) loss that prevents duplicated predictions. During training, no additional procedure such as bipartite matching is needed, and the loss is directly computed from the network outputs. Moreover, our Sparse MDOD outperforms the existing detectors on MS-COCO, a renowned multi-object detection benchmark.

* 8 figures

View paper on

Share this with someone who'll enjoy it:

Title:Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching

Paper and Code