Research on Human Posture Recognition Technology for Low-Quality Laser Through-Window Images

Human Posture Recognition Algorithm for Low-Quality Laser Through-Window Images


Abstract: Existing algorithms for human posture recognition in low-quality laser through-window images suffer from low recognition accuracy and severe missed and false detections. To address this, this study proposes an efficient human posture recognition algorithm, YOLO-TCpose. A new convolution module is designed that improves local feature extraction while preserving global information, addressing the inherent shortcomings of standard convolution. The feature fusion network is reconstructed, using cross-layer cascading and model pruning to fuse shallow and deep information interactively and thereby improve the recognition of small-target postures. An integrated enhancement and detection network is built that incorporates an improved ADNet image enhancement and denoising algorithm to reduce the influence of noise on posture recognition, improving the model's detection accuracy. Finally, a human posture recognition algorithm is written and the model is deployed on the Jetson NX mobile development platform, yielding a complete airborne laser through-window imaging human posture recognition system. Experimental results show that YOLO-TCpose has strong robustness and generalization ability and high practical application value.

Objective: Laser through-window imaging is an advanced detection technique that can penetrate window glass and image indoor targets behind the window, giving it broad application prospects. In anti-terrorism and stability maintenance scenarios, a through-window scope can capture accurate information on the number and posture of terrorists behind the window. In traffic monitoring, the technology allows a driver's status to be assessed without requiring the driver to exit the vehicle, improving traffic management efficiency. However, practical application faces several challenges. Image quality and the accurate capture of target information behind windows are significantly affected by natural illumination, object occlusion, and strong reflections from the window glass, so accurately detecting human targets and identifying their poses in complex environments is highly challenging. Conventional image processing techniques often cannot achieve accurate and efficient detection under disturbances such as illumination changes or occlusion. Addressing these challenges requires more robust object detection and posture recognition algorithms that can run on edge computing platforms and meet real-time requirements. This work has the potential to substantially benefit anti-terrorism, security operations, military reconnaissance, and traffic management.

Methods: Laser through-window imaging data are currently not publicly accessible. Therefore, a new dataset was constructed using a laser range-gated imaging system, covering two types of scenes: natural and man-made (artificial). The natural scenes include various simulated human postures, whereas the man-made scenes incorporate diverse types of glass, through-window distances, lighting conditions, and occlusions to enhance data diversity. Existing algorithms for human posture recognition in low-quality laser through-window images typically exhibit suboptimal accuracy, with significant missed and false detections. This study therefore used YOLOv8n-Pose as the base model and optimized it specifically for these problems. A novel convolution module was developed to improve feature extraction in low-quality laser through-window images, and cross-layer connections and model pruning were used to reconstruct the feature fusion network, reducing the model size and improving the recognition of small-target human poses. Additionally, an integrated enhancement and detection network combining image denoising and posture recognition enables end-to-end joint training, further improving detection performance. Finally, a human posture recognition algorithm was implemented and the model was deployed on the Jetson NX mobile development platform, creating a fully functional airborne laser through-window imaging human posture recognition system.

Results and Discussions: This study compared the performance of the Faster R-CNN, AlphaPose, OpenPose, HigherHRNet, YOLOv5s6-pose, and YOLOv8n-Pose algorithms for human pose recognition (Table 2). The results indicate that YOLOv8n-Pose outperforms Faster R-CNN, OpenPose, and HigherHRNet. AlphaPose and YOLOv5s6-pose score slightly better than YOLOv8n-Pose on the accuracy indicators but lag significantly behind it in inference speed and model size. The proposed YOLO-TCpose algorithm performs well across the performance indicators. To assess its effectiveness further, additional experiments were conducted with OpenPose, AlphaPose, and YOLOv8n-Pose in artificial and natural scenes. In artificial scenes (Fig. 6), comparative experiments involving single and multiple people with occlusion show that YOLO-TCpose outperforms OpenPose and AlphaPose, positioning key points accurately and significantly reducing missed detections in multi-person pose recognition. Its advantage is most pronounced in scenarios involving multi-person occlusion. In natural scenes (Fig. 7), for posture recognition tasks such as crawling during the day, standing at night, or squatting on a rainy day, YOLO-TCpose accurately detects human targets along with their corresponding key points, outperforming the other algorithms by a significant margin. Overall, YOLO-TCpose exhibits better detection accuracy, stability, and adaptability across environments than current mainstream algorithms.

Conclusions: This study introduces YOLO-TCpose, an efficient and lightweight human posture recognition algorithm designed for detecting human poses in low-quality laser through-window images. To address the limitations of traditional convolution, a novel convolution module was developed to improve feature extraction. The feature fusion network was restructured by removing the large-target detection layers and adding small-target detection layers, which allows shallow and deep information to be fused effectively through cross-layer connections and improves recognition performance for small targets. By incorporating an improved ADNet denoising algorithm, an integrated network for image enhancement and pose recognition was developed, which significantly improves detection accuracy. The experimental results demonstrate that YOLO-TCpose achieves improvements of 19.3 and 26.6 percentage points in the precision and recall rate, respectively, for object detection. The mean average precision (mAP) at 0.5 and mAP at 0.5:0.95 for keypoint detection are enhanced by 16.0 and 10.1 percentage points, respectively. In addition, the inference speed is increased by 5.1 ms, and the model size is reduced by 1.69 MB. Furthermore, algorithms for recognizing three postures (standing, squatting, and crawling) were developed, and the model was successfully deployed on the Jetson NX mobile development platform, establishing a fully functional airborne laser through-window imaging human posture recognition system.
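The abstract states that algorithms for recognizing standing, squatting, and crawling were built on top of the detected key points, but it does not describe them. As an illustration only, the following is a minimal rule-based sketch of how such a classification could work. It assumes the 17-point COCO keypoint layout (the paper's keypoint set is not given here), and the function names and both angle thresholds are hypothetical choices, not the authors' method.

```python
import math

# Assumed COCO-17 keypoint indices; the paper's actual layout is not stated.
L_SHOULDER, R_SHOULDER = 5, 6
L_HIP, R_HIP = 11, 12
L_KNEE, R_KNEE = 13, 14
L_ANKLE, R_ANKLE = 15, 16


def midpoint(a, b):
    """Midpoint of two (x, y) points."""
    return ((a[0] + b[0]) / 2.0, (a[1] + b[1]) / 2.0)


def joint_angle(a, b, c):
    """Angle in degrees at vertex b formed by the segments b-a and b-c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    n1, n2 = math.hypot(*v1), math.hypot(*v2)
    if n1 == 0 or n2 == 0:
        return 180.0  # degenerate input: treat the limb as straight
    cos = (v1[0] * v2[0] + v1[1] * v2[1]) / (n1 * n2)
    return math.degrees(math.acos(max(-1.0, min(1.0, cos))))


def torso_tilt(kpts):
    """Angle in degrees between the torso and the vertical image axis."""
    shoulders = midpoint(kpts[L_SHOULDER], kpts[R_SHOULDER])
    hips = midpoint(kpts[L_HIP], kpts[R_HIP])
    dx, dy = shoulders[0] - hips[0], shoulders[1] - hips[1]
    return math.degrees(math.atan2(abs(dx), abs(dy)))


def classify_posture(kpts, tilt_thresh=50.0, knee_thresh=120.0):
    """Hypothetical rules: near-horizontal torso -> crawling,
    strongly bent knees -> squatting, otherwise standing."""
    if torso_tilt(kpts) > tilt_thresh:
        return "crawling"
    left = joint_angle(kpts[L_HIP], kpts[L_KNEE], kpts[L_ANKLE])
    right = joint_angle(kpts[R_HIP], kpts[R_KNEE], kpts[R_ANKLE])
    if (left + right) / 2.0 < knee_thresh:
        return "squatting"
    return "standing"
```

Here `kpts` is a list of 17 `(x, y)` pixel coordinates for one detected person. A practical version would also need to handle low-confidence or occluded key points, which this sketch ignores.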

Authors: Wu Zhihua; Cheng Jianghua; Liu Tong; Cai Yahui; Pan Lehao (College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, Hunan, China)

Affiliation: College of Electronic Science and Technology, National University of Defense Technology

Source: Chinese Journal of Lasers (中国激光; PKU Core Journal), 2025, No. 6, pp. 260-271 (12 pages)

Funding: National University of Defense Technology Independent Innovation Science Fund (24-ZZCX-JDZ-11).

Keywords: laser through-window imaging; convolution operation; small object detection; posture recognition; image denoising

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

Corresponding author: Wu Zhihua, 18627599098@163.com.

