This paper proposes BPMFPN Det, a novel one-stage object detector based on a bidirectional parallel multi-branch feature pyramid network (BPMFPN), for real-time detection of ground multi-scale targets by swarm unmanned aerial vehicles (UAVs). First, bidirectional parallel multi-branch convolution modules are used to construct the feature pyramid, enhancing the feature expression ability of feature layers at different scales. Next, the feature pyramid is integrated into a single-stage object detection framework to ensure real-time performance. To validate the effectiveness of the proposed algorithm, experiments are conducted on four datasets. On the PASCAL VOC dataset, the proposed algorithm achieves a mean average precision (mAP) of 85.4 on the VOC 2007 test set. On the detection in optical remote sensing (DIOR) dataset, it achieves 73.9 mAP. On the vehicle detection in aerial imagery (VEDAI) dataset, the detection accuracy for small land vehicle (slv) targets reaches 97.4 mAP. On the unmanned aerial vehicle detection and tracking (UAVDT) dataset, BPMFPN Det achieves an mAP of 48.75. These results are competitive with previous state-of-the-art methods. The experimental results demonstrate that the proposed algorithm can effectively address real-time detection of ground multi-scale targets in aerial images from swarm UAVs.
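To address the low detection accuracy for multi-scale defect targets on transmission lines against complex backgrounds, this paper proposes a multi-class transmission line defect detection model based on an improved YOLOv7 (You Only Look Once version 7). To counter the low detection accuracy of defect targets caused by complex backgrounds, an improved Swin Transformer module is introduced into the Backbone; its multi-head attention mechanism improves the extraction of global features and thereby raises the model's detection accuracy. To handle the multi-scale nature of the targets, an adaptive feature fusion module is added on top of the feature pyramid, improving the ability of the feature fusion network in the Neck to detect multiple classes of defects at different scales. The SIoU (Structured Intersection over Union) loss function improves the regression accuracy of the predicted boxes while accelerating model convergence. Experimental results show that, compared with the YOLOv5, YOLOv7, and Faster R-CNN (Faster Region with Convolutional Neural Network) models, the improved YOLOv7 model achieves higher detection accuracy, with a mean detection accuracy of 96.4% and a detection speed of 29.6 frames per second, and can serve as a reference for multi-class defect detection on transmission lines.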