Kebing Yan

Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning

Aug 18, 2023

Xiang Yuan, Gong Cheng, Kebing Yan, Qinghua Zeng, Junwei Han

Abstract: The past few years have witnessed the immense success of object detection, yet even strong detectors still struggle with size-limited instances. Concretely, the well-known challenge of low overlap between the priors and object regions leads to a constrained sample pool for optimization, and the paucity of discriminative information further hampers recognition. To alleviate these issues, we propose CFINet, a two-stage framework tailored for small object detection based on a Coarse-to-fine pipeline and Feature Imitation learning. First, we introduce a Coarse-to-fine RPN (CRPN) to ensure sufficient and high-quality proposals for small objects through a dynamic anchor selection strategy and cascade regression. Then, we equip the conventional detection head with a Feature Imitation (FI) branch that, in an imitation manner, improves the region representations of size-limited instances that perplex the model. Moreover, an auxiliary imitation loss following the supervised contrastive learning paradigm is devised to optimize this branch. When integrated with Faster RCNN, CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks SODA-D and SODA-A, underscoring its superiority over the baseline detector and other mainstream detection approaches.

* Camera-ready version for ICCV2023. Our code will be available at https://github.com/shaunyuan22/CFINet
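
The "low overlap between the priors and object regions" mentioned in the abstract can be illustrated with a small intersection-over-union (IoU) calculation. The sketch below is not taken from CFINet or its code; the helper names `iou` and `shifted_iou`, the box sizes, and the 0.5 threshold are all assumptions chosen for illustration. It shows that the same 4-pixel misalignment costs a small box far more overlap than a large one, which is one reason few priors qualify as positive samples for small objects.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp at zero so disjoint boxes yield an empty intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)


def shifted_iou(size, shift):
    """IoU between a square object and an equally sized prior offset diagonally.

    Hypothetical helper for illustration only: `size` is the side length in
    pixels and `shift` is the offset applied along both axes.
    """
    ground_truth = (0.0, 0.0, float(size), float(size))
    prior = (float(shift), float(shift), float(size + shift), float(size + shift))
    return iou(ground_truth, prior)


if __name__ == "__main__":
    threshold = 0.5  # assumed positive-sample threshold, not a value from the paper
    for size in (16, 32, 128):
        overlap = shifted_iou(size, shift=4)
        label = "positive" if overlap >= threshold else "not positive"
        print(f"{size:>3}px object, 4px shift: IoU = {overlap:.3f} ({label})")
```

Under these assumed numbers, the 16-pixel object falls below the threshold while the 128-pixel object stays well above it, even though the misalignment is identical.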

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Jul 31, 2022

Gong Cheng, Xiang Yuan, Xiwen Yao, Kebing Yan, Qinghua Zeng, Junwei Han

Abstract: With the rise of deep convolutional neural networks, object detection has achieved prominent advances in recent years. However, such prosperity cannot conceal the unsatisfactory state of Small Object Detection (SOD), one of the notoriously challenging tasks in computer vision, owing to the poor visual appearance and noisy representation caused by the intrinsic structure of small targets. In addition, the lack of large-scale datasets for benchmarking small object detection methods remains a bottleneck. In this paper, we first conduct a thorough review of small object detection. Then, to catalyze the development of SOD, we construct two large-scale Small Object Detection dAtasets (SODA), SODA-D and SODA-A, which focus on the Driving and Aerial scenarios respectively. SODA-D includes 24704 high-quality traffic images and 277596 instances of 9 categories. For SODA-A, we harvest 2510 high-resolution aerial images and annotate 800203 instances over 9 classes. The proposed datasets are, to our knowledge, the first attempt at large-scale benchmarks with a vast collection of exhaustively annotated instances tailored for multi-category SOD. Finally, we evaluate the performance of mainstream methods on SODA. We expect the released benchmarks to facilitate the development of SOD and spawn more breakthroughs in this field. Datasets and codes will be available soon at: https://shaunyuan22.github.io/SODA.

* 20 pages, 12 figures
