Abstract:Oriented bounding box regression is crucial for oriented object detection. However, regression-based methods often suffer from boundary problems and the inconsistency between loss and evaluation metrics. In this paper, a modulated Kalman IoU loss of approximate SkewIoU is proposed, named MKIoU. To avoid boundary problems, we convert the oriented bounding box to Gaussian distribution, then use the Kalman filter to approximate the intersection area. However, there exists significant difference between the calculated and actual intersection areas. Thus, we propose a modulation factor to adjust the sensitivity of angle deviation and width-height offset to loss variation, making the loss more consistent with the evaluation metric. Furthermore, the Gaussian modeling method avoids the boundary problem but causes the angle confusion of square objects simultaneously. Thus, the Gaussian Angle Loss (GA Loss) is presented to solve this problem by adding a corrected loss for square targets. The proposed GA Loss can be easily extended to other Gaussian-based methods. Experiments on three publicly available aerial image datasets, DOTA, UCAS-AOD, and HRSC2016, show the effectiveness of the proposed method.
Abstract:Rotated object detection is a challenging task in aerial images since the objects in aerial images are displayed in arbitrary directions and are frequently densely packed. Although considerable progress has been made, there are still challenges that existing regression-based rotation detectors suffer from the representation ambiguity. In this paper, we propose a simple, practical framework to optimize the bounding box regression for rotating objects. Rather than directly regressing the five parameters (coordinates of the central point, width, height, and rotation angle) or the four vertices, we employ the area ratio of the parallelogram (ARP) to describe a multi-oriented object accurately. Specifically, ARP regresses coordinates of the center point, height, and width of the oriented object's minimum circumscribed rectangle and three area ratios. It may facilitate learning offset and avoid the issue of angular periodicity or label points sequence for oriented objects. To further remedy the confusion issue of nearly horizontal objects, the area ratio between the object and its minimal circumscribed rectangle has been used to guide the selection of horizontal or oriented detection for each object. The rotation efficient IOU loss (R-EIOU) connects the flat bounding box with the three area ratios and improves the accuracy of the rotating bounding. Experimental results on remote sensing datasets, including HRSC2016, DOTA, and UCAS-AOD, show that our method achieves superior detection performance than many state-of-the-art approaches. The code and model will be coming with the paper published.