一种新颖的单目视觉深度学习算法:H_SFPN

Novel Deep Learning Algorithm for Monocular Vision:H_SFPN

在线阅读下载PDF

导出

摘要针对单目视觉目标检测,提出了一种基于single-stage深度学习的H_SFPN算法。该算法与现有的YOLOv3和CenterNet算法相比,在保证实时性能的条件下,可有效提高小目标检测的准确度。首先设计了一种新的网络架构(backbone),这种架构通过改进的沙漏(Hourglass)网络模型来提取特征图,以便充分利用底层特征的高分辨率以及高层特征的高语义信息。然后在特征图融合阶段提出了基于SFPN的特征图加权融合方法。最后,H_SFPN算法对目标位置和大小的损失函数进行了改进,可有效降低训练误差,并加快收敛速度。由MSCOCO数据集上的实验结果可知,所提H_SFPN算法明显优于Faster-RCNN,YOLOv3以及EfficientDet等现有的主流深度学习目标检测算法,其中对小目标的检测指标AP s最高,达到了32.7。 This paper proposes a single-stage deep learning based H_SFPN algorithm for monocular visual object detection.Compared with the existing YOLOv3 and CenterNet algorithms,the proposed algorithm can effectively improve the accuracy of small object detection without sacrificing the real-time performance.This paper designs a new network architecture(backbone),which uses an improved Hourglass network model to extract feature maps in order to make full use of the high resolution of the underlying features and the high semantic information of the high-level features.Then in the feature map fusion stage,a method SFPN based on the weighted fusion of feature maps is proposed.Finally,the proposed H_SFPN algorithm improves the loss function of the object position and size,which can effectively reduce the training error and accelerate the convergence speed.According to the experimental results on the MSCOCO data set,the proposed H_SFPN algorithm is significantly better than the existing mainstream deep learning object detection algorithms such as Faster-RCNN,YOLOv3 and EfficientDet.Among them,the small object detection index AP s of this algorithm is the highest,reaching 32.7.

作者石先让宋廷伦唐得志戴振泳 SHI Xian-rang;SONG Ting-lun;TANG De-zhi;DAI Zhen-yong(College of Energy and Power Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 210001,China;Chery Advanced Engineering&Technology Center,Wuhu,Anhui 241006,China)

机构地区南京航空航天大学能源与动力学院奇瑞前瞻与预研技术中心

出处《计算机科学》 CSCD 北大核心 2021年第4期130-137,共8页 Computer Science

基金安徽省发改委重大研发项目。

关键词深度卷积神经网络目标检测加权融合网络架构损失函数 Deep convolutional neural network Object detection Weighted fusion Backbone Loss function

分类号 TP391.41 [自动化与计算机技术—计算机应用技术] TN219 [电子电信—物理电子学]

作者简介 SHI Xian-rang,born in 1996,postgra-duate.His main research interests include autonomous driving,object detection and pattern recognition.nuaasxr@163.com;通信作者:SONG Ting-lun,born in 1965,Ph.D,professor,Ph.D supervisor.His main research interests include simulation driven vehicle architecture design and development,autonomous driving vehicles,and data driven energy management strategies for new energy vehicles.songtinglun@nuaa.edu.cn。

引文网络
相关文献

1马一凡,赵凡宇,王鑫,金仲和.基于改进指针网络的卫星对地观测任务规划方法[J].浙江大学学报（工学版）,2021,55(2):395-401. 被引量：4
2毕号旗,向新,李娜,郑万泽,鞠明.SC-FDE系统中基于压缩感知的慢衰落航空稀疏信道估计[J].空军工程大学学报（自然科学版）,2020,21(5):82-88. 被引量：2
3Guoying Liu,Shuanghao Chen,Jing Xiong,Qingju Jiao.An Oracle Bone Inscription Detector Based on Multi-Scale Gaussian Kernels[J].Applied Mathematics,2021,12(3):224-239. 被引量：1
4夏烨,雷晓鸣,王鹏,孙利民.针对网级评估的区域桥梁退化建模与演绎应用[J].中南大学学报（自然科学版）,2021,52(3):828-838. 被引量：5
5雷艳惠,赵芳霞.基于神经网络的铝合金机械圆盘冲锻工艺优化[J].热加工工艺,2021,50(5):102-105. 被引量：1
6赵刚,王梦灵.基于模糊分析的LSTM交通流量预测[J].计算机工程与设计,2021,42(4):1103-1108. 被引量：12

计算机科学

2021年第4期

浏览历史

内容加载中请稍等...

一种新颖的单目视觉深度学习算法:H_SFPN

相关作者

相关机构

相关主题

浏览历史