Ro-Yolo for Optimized Small Object Detection in Remote Sensing

14 Pages Posted: 23 May 2025

See all articles by Xingyu Mu

Xingyu Mu

Shandong University

Chao Chang

Shandong University

Zihan Guo

Shandong University

Quanmin Kan

Shandong University

Weijie Cheng

Shandong University

Xincheng Tian

Shandong University

Lelai Zhou

Shandong University

Abstract

Remote Sensing Target Detection has significant applications in fields such as traffic monitoring and urban planning. However, the detection of small targets in UAV aerial imagery presents challenges, such as their high proportions, dense distribution, and significant scale variations. To address these issues, this paper proposes an efficient remote sensing small object detector network—RO-YOLO. First, the RFAConv module is introduced to replace traditional convolution, and the Fast Cross Stage Feature (FCSF) module is designed to improve detection accuracy while effectively controlling the model’s parameter size. Second, to enhance the network's capability for fine-grained feature extraction of small objects, a plug-and-play Backbone-to-Neck Feature Bridge (BNFB) module is developed. This module uses a multi-branch structure to capture different receptive fields, achieving multi-scale feature enhancement. Additionally, to facilitate sufficient fusion of detailed information from shallow feature maps and semantic information from deep feature maps, the Refinement and Alternation Feature Network (RAFN) structure is proposed. This design significantly reduces the model's parameter size while improving detection performance. Finally, the ACmix attention mechanism is integrated to further enhance the model's target focus capability. Experimental results on the challenging VisDrone, TinyPerson, and DOTA benchmark datasets demonstrate that RO-YOLO outperforms the original YOLOv8s, achieving mAP@50 improvements of 9.4%, 5.4%, and 2.3%, respectively, with a 71.1% reduction in parameter size. These results validate the effectiveness and generalization ability of RO-YOLO in small object detection tasks.

Keywords: Small object detection, Feature Enhancement, Multiscale feature extraction, Unmanned aerial vehicle (UAV) image, Attention Mechanism

Suggested Citation

Mu, Xingyu and Chang, Chao and Guo, Zihan and Kan, Quanmin and Cheng, Weijie and Tian, Xincheng and Zhou, Lelai, Ro-Yolo for Optimized Small Object Detection in Remote Sensing. Available at SSRN: https://ssrn.com/abstract=5266679 or http://dx.doi.org/10.2139/ssrn.5266679

Xingyu Mu

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Chao Chang

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Zihan Guo

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Quanmin Kan

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Weijie Cheng

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Xincheng Tian (Contact Author)

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Lelai Zhou

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
12
Abstract Views
94
PlumX Metrics