高级检索

基于改进的YOLOv11n的无人机航拍目标检测算法

Drone Aerial Image Target Detection Algorithm Based on an Improved YOLOv11n

  • 摘要: 无人机航拍图像中的目标通常尺度小、分布密集,且易受复杂背景、运动模糊与遮挡、端侧算力受限等因素的影响,使得目标检测算法难以兼顾精度与效率。针对上述问题,本文在 YOLO11 算法模型基础上提出了改进模型 GC-YOLO,首先,该模型引入采样模块 DCAD,在降采样阶段融合局部与全局信息以增强细粒度表征;其次,构建 GC-PAFPN 特征金字塔,通过增加跨尺度融合路径和 CSP-Omni-Kernel 神经网络模型以提升多尺度特征交互;最后,设计形状与语义对齐的解耦检测头 SADH,在降低参数与计算开销的同时提升目标/特征分类与定位一致性。实验结果表明,在 VisDrone2021 数据集上,本文 GC-YOLO 算法取得 mAP50/%=46.4%、mAP50–95/%=28.7%,相较 YOLO11n(38.5%/23.1%)分别提升 7.9 和 5.6 个百分点,模型参数量为 4.02M、计算量为 18.7 GFLOPs,本文算法在保持轻量化的前提下有效提升了航拍小目标检测性能,可为无人机端侧目标检测提供一种可行方案。

     

    Abstract: Targets in drone aerial images are typically small in scale, densely distributed, and easily affected by complex backgrounds, motion blur, occlusion, and limited edge-device computing resources, making it difficult for detection algorithms to balance accuracy and efficiency. To address these challenges, this paper proposes an improved model named GC-YOLO based on the YOLO11 framework. First, a sampling module, DCAD, is introduced to fuse local and global information during the downsampling stage, enhancing fine-grained feature representation. Second, a GC-PAFPN feature pyramid is constructed by adding cross-scale fusion paths and incorporating the CSP-Omni-Kernel neural network model to improve multi-scale feature interaction. Finally, a Shape- and Semantic-Aligned Decoupled Head (SADH) is designed to reduce parameters and computational cost while improving the consistency between target classification and localization. Experimental results on the VisDrone2021 dataset show that the proposed GC-YOLO achieves mAP50 of 46.4% and mAP50–95 of 28.7%, representing improvements of 7.9 and 5.6 percentage points over YOLO11n (38.5% and 23.1%), respectively. The model has 4.02M parameters and 18.7 GFLOPs, demonstrating that it effectively improves aerial small-object detection performance while maintaining a lightweight design, providing a feasible solution for onboard drone target detection

     

/

返回文章
返回