基于强化学习的多能源动态滑翔航迹优化方法

Multi energy dynamic soaring trajectory optimization method based on reinforcement learning

在线阅读下载PDF

导出

摘要针对无人机动态滑翔问题,提出了一种基于深度强化学习的航迹优化方法。该方法综合利用梯度风能和太阳能,引入了障碍物约束以模拟复杂障碍环境。使用神经网络近似逼近高斯伪谱方法求解航迹的策略,在训练得到的策略基础上利用双延迟深度确定性策略梯度算法进行策略改进,在大幅度提升推理实时性的同时解决了传统最优控制算法在动态滑翔领域难以应对变化风场的问题。实验针对动态滑翔2种经典模式进行仿真验证,之后在考虑多种能量源的情况下进行蒙特卡洛仿真。结果表明,基于深度强化学习的动态滑翔航迹优化方法在单个滑翔周期内获能与最优结果相近,而实时推理决策时间减少了91%。在变化风场环境下,文中方法相较于传统方法具有更强的适应性。 In addressing the issue of dynamic soaring in unmanned aerial vehicles,a trajectory optimization approach based on deep reinforcement learning is proposed.This method synergistically utilizes gradient wind energy and solar energy and incorporates obstacle constraints to simulate complex barrier environments.It employs neural networks to approximate the Gaussian pseudospectral method for solving trajectory policies.On the foundation of the trained policies,the method utilizes the twin delayed deep deterministic policy gradient algorithm for policy enhancement.This significantly boosts the real-time inference capabilities while addressing the challenges traditional optimal control algorithms face in dynamic soaring due to varying wind fields.The experiments initially validate the approach through simulation of two classic modes of dynamic soaring,followed by Monte Carlo simulations considering multiple energy sources.The results indicate that the dynamic soaring trajectory optimization method based on deep reinforcement learning achieves energy acquisition comparable to optimal outcomes within a single soaring cycle,with a 91%reduction in real-time inference decision time.Moreover,in changing wind field environments,this method demonstrates superior adaptability compared to traditional approaches.

作者张云飞王宏伦张梦华巩轶男 ZHANG Yunfei;WANG Honglun;ZHANG Menghua;GONG Yinan(School of Automation Science and Electrical Engineering,Beihang University,Beijing 100191,China;The Science and Technology on Aircraft Control Laboratory,Beihang University,Beijing 100191,China;Hiwing Aviation General Equipment Co.,Ltd.,Beijing 100074,China)

机构地区北京航空航天大学自动化科学与电气工程学院北京航空航天大学飞行器控制一体化技术国防科技重点实验室海鹰航空通用装备有限责任公司

出处《西北工业大学学报》北大核心 2025年第1期128-139,共12页 Journal of Northwestern Polytechnical University

关键词动态滑翔强化学习高斯伪谱航迹优化 dynamic soaring reinforcement learning Gaussian pseudospectral method trajectory optimization

分类号 V249.1 [航空宇航科学与技术—飞行器设计]

作者简介张云飞(1999-),硕士研究生;通信作者:王宏伦(1970-),教授,e-mail:wang-hl-12@126.com。

引文网络
相关文献

参考文献4

1朱熠,李继广,郝向宇.梯度风场中无人机动态滑翔飞行轨迹优化[J].西安航空学院学报,2023,41(5):8-16. 被引量：2
2刘思奇,白俊强.结合动态滑翔技术的小型太阳能无人机飞行能量变化分析[J].西北工业大学学报,2020,38(1):48-57. 被引量：3
3刘思奇,白俊强.基于六自由度模型的高空动态滑翔探究[J].西北工业大学学报,2021,39(4):703-711. 被引量：4
4马东立,包文卓,乔宇航.基于重力储能的太阳能飞机飞行轨迹研究[J].航空学报,2014,35(2):408-416. 被引量：24

二级参考文献22

1钱翼稷.空气动力学[M].北京:北京航空航天大学出版社,2008.
2Noll T E, Brown J M, Perez-Davis M E, et al. Investiga- tion of the Helios prototype aircraft mishap, Volume I Mishap Report[R]. Washington, D. C. : NASA Langley Research Center, 2004.
3Hannes R. Fly around the world with a solar powered air- plane, AIAA-2008-8954[R]. Reston: AIAA, 2008.
4Hall D W, Fortenbach C D, Dimiceli E V, et al. A pre- liminary study of solar powered aircraft and associated power trains, NASA CR-3699[R]. Washington, D. C. : NASA, 1983.
5Hall D W, Hall S A. Structural sizing of a solar powered aircraft, NASA CR-172313[R]. Washington, D. C.: NASA, 1984.
6Klesh A T, Kabamba P T. Energy-optimal path planning for solar-powered aircraft in level flight, AIAA-2007-6655 [R]. Reston: AIAA, 2007.
7Spangelo S C, Gilberty E G, Kleshz A T, et al. Periodic energy-optimal path planning for solar-powered aircraft,AIAA-2009 6016[R]. Reston: AIAA, 2009.
8Brandt S A, Gilliamt F T. Design analysis methodology for solar-powered aircraft :J:. Journal of Aircraft, 1995, 32(4): 703- 709.
9Sachsl G, Lenz J, Holz:pfel F. Unlimited endurance per- formance of solar uavs with minimal or zero electric energy storage, AIAA-2009-6013[R]. Reston: AIAA, 2009.
10Sinsay J D, Tracey B, Alonso J J, et al. Air vehicle de sign and technology considerations for an electric vtol met- ro-regional public transportation, AIAA-2012-5404 [R].Reston: AIAA, 2012.

共引文献28

1李锋,叶川,李广佳,郑安波,付义伟.临近空间太阳能飞行器横航向稳定性[J].航空学报,2016,37(4):1148-1158. 被引量：9
2段卓毅,王伟,耿建中,张健,李军府.高空长航时太阳能无人机高效气动力设计新挑战[J].空气动力学学报,2017,35(2):156-171. 被引量：8
3李赛,周伟,罗建军,谢飞.小型长航时太阳能无人机总体设计优化方法[J].空军工程大学学报（自然科学版）,2018,19(1):1-8. 被引量：7
4周伟,李赛,王学仁,谢飞.基于FQFD的太阳能无人机设计指标排序方法[J].航空学报,2018,39(2):135-145. 被引量：5
5王少奇,马东立,杨穆清,张良.高空太阳能无人机三维航迹优化[J].北京航空航天大学学报,2019,45(5):936-943. 被引量：14
6张晓辉,刘莉,戴月领.燃料电池无人机能源管理与飞行状态耦合[J].航空学报,2019,40(7):87-103. 被引量：11
7刘思奇,白俊强.结合动态滑翔技术的小型太阳能无人机飞行能量变化分析[J].西北工业大学学报,2020,38(1):48-57. 被引量：3
8王春阳,周洲,王睿.基于最长航时的太阳能无人机操纵策略研究[J].西北工业大学学报,2020,38(1):75-83. 被引量：3
9刘莉,曹潇,张晓辉,贺云涛.轻小型太阳能/氢能无人机发展综述[J].航空学报,2020,41(3):1-28. 被引量：33
10马东立,张良,杨穆清,夏兴禄,王少奇.超长航时太阳能无人机关键技术综述[J].航空学报,2020,41(3):29-58. 被引量：62

1刘发江,李开艳,撒靓瑶.四种常用概率分布间的极限关系及近似逼近探究[J].昭通学院学报,2024,46(5):35-38.
2杜飞,陈凯麒,刘晓波,王世岩,黄爱平,董飞,刘畅,杜彦良,阳星,孙龙.大型浅水湖泊高时空分辨率风场特征数值模拟研究:以巢湖为例[J].水利水电技术（中英文）,2024,55(2):39-49.
3谭亚.回归常识,回到附近[J].商界,2025(1):2-2.
4王远卓,张冉,李惠峰.基于协态估计的火箭动力下降邻近最优制导[J].宇航学报,2024,45(5):741-752.
5祁鸣东,盛守照,曹植,黄天宇,田佳,仇是.一种基于高次样条的直升机四维飞行航迹规划方法[J].导航定位学报,2025,13(1):155-161.
6谢颖(编译).资讯•名刊[J].浙商,2025(1):13-13.
7梁雨欣,申培萍,尹建菲.求解一类线性多乘积规划问题的自适应分支定界算法[J].应用数学,2025,38(1):217-223.
8赵晓蕾.基于时间序列的广东电信市场信息需求量预测研究[J].中文科技期刊数据库(全文版)自然科学,2020(6):00121-00123.
9刘光军,王宇涛,马黎阳,吴铁洲.基于凸二次规划的电池均衡系统研究[J].电源技术,2025,49(3):569-576.
10王鹏,张洪霄,林旭东,李明.望远镜指向抖动与波前畸变耦合的仿真分析(特邀)[J].光子学报,2025,54(2):50-57.

西北工业大学学报

2025年第1期

浏览历史

内容加载中请稍等...

基于强化学习的多能源动态滑翔航迹优化方法

参考文献4

二级参考文献22

共引文献28

相关作者

相关机构

相关主题

浏览历史