Pre-locate net for object detection in high-resolution images |
| |
Affiliation: | Key Laboratory of Precision Opto-mechatronics Technology,Ministry of Education,and the School of Instrumentation and Optoelectronic Engineering,Beihang University,Beijing 100083,China |
| |
Abstract: | Small-object detection has long been a challenge. High-megapixel cameras are used to solve this problem in industries. However, current detectors are inefficient for high-resolution images. In this work, we propose a new module called Pre-Locate Net, which is a plug-and-play structure that can be combined with most popular detectors. We inspire the use of classification ideas to obtain candidate regions in images, greatly reducing the amount of calculation, and thus achieving rapid detection in high-resolution images. Pre-Locate Net mainly includes two parts, candidate region classification and behavior classification. Candidate region classification is used to obtain a candidate region, and behavior classification is used to estimate the scale of an object. Different follow-up processing is adopted according to different scales to balance the variance of the network input. Different from the popular candidate region generation method, we abandon the idea of regression of a bounding box and adopt the concept of classification, so as to realize the prediction of a candidate region in the shallow network. We build a high-resolution dataset of aircraft and landing gears covering complex scenes to verify the effectiveness of our method. Compared to state-of-the-art detectors (e.g., Guided Anchoring, Libra-RCNN, and FASF), our method achieves the best mAP of 94.5 on 1920 × 1080 images at 16.7 FPS. |
| |
Keywords: | Aircraft and landing gear detection Candidate region Convolutional neural network High resolution images Small object |
本文献已被 万方数据 ScienceDirect 等数据库收录! |
|