检索结果-维普期刊中文期刊服务平台

期刊文献⁺

任意字段

题名或关键词

题名

关键词

文摘

作者

第一作者

机构

刊名

分类号

参考文献

作者简介

基金资助

栏目信息

共找到6篇文章

< 1 >

每页显示 20 50 100

已选择0条

导出题录引用分析

统计分析

显示方式：

文摘详细列表

相关度排序被引量排序时效性排序

A novel trajectories optimizing method for dynamic soaring based on deep reinforcement learning: 1; 作者 Wanyong Zou Ni Li +2 位作者 Fengcheng An Kaibo Wang Changyin Dong 《Defence Technology(防务技术)》 2025年第4期99-108,共10页; Dynamic soaring,inspired by the wind-riding flight of birds such as albatrosses,is a biomimetic technique which leverages wind fields to enhance the endurance of unmanned aerial vehicles(UAVs).Achieving a precise soar... 展开更多; 关键词 Dynamic soaring Differential flatness Trajectory optimization Proximal policy optimization; 在线阅读下载PDF 职称材料

Optimal policy for controlling two-server queueing systems with jockeying: 2; 作者 LIN Bing LIN Yuchen BHATNAGAR Rohit 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2022年第1期144-155,共12页; This paper studies the optimal policy for joint control of admission, routing, service, and jockeying in a queueing system consisting of two exponential servers in parallel.Jobs arrive according to a Poisson process.U... 展开更多; 关键词 queueing system jockeying optimal policy Markov decision process(MDP) dynamic programming; 在线阅读下载PDF 职称材料

A generalized geometric process based repairable system model with bivariate policy: 3; 作者 MA Ning YE Jimin WANG Junyuan 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2021年第3期631-641,共11页; The maintenance model of simple repairable system is studied.We assume that there are two types of failure,namely type Ⅰ failure(repairable failure)and type Ⅱ failure(irrepairable failure).As long as the type Ⅰ fai... 展开更多; 关键词 renewal reward theorem generalized geometric process(GGP) average cost rate optimal policy replacement; 在线阅读下载PDF 职称材料

基于多智能体深度强化学习的无人机路径规划被引量：10: 4; 作者司鹏搏吴兵 +2 位作者杨睿哲李萌孙艳华《北京工业大学学报》 CAS CSCD 北大核心 2023年第4期449-458,共10页; 为解决多无人机(unmanned aerial vehicle, UAV)在复杂环境下的路径规划问题,提出一个多智能体深度强化学习UAV路径规划框架.该框架首先将路径规划问题建模为部分可观测马尔可夫过程,采用近端策略优化算法将其扩展至多智能体,通过设计UA... 展开更多; 关键词无人机(unmanned aerial vehicle UAV) 复杂环境路径规划马尔可夫决策过程多智能体近端策略优化算法(multi-agent proximal policy optimization MAPPO) 网络剪枝(network pruning NP); 在线阅读下载PDF 职称材料

Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning 被引量：4: 5; 作者 Jia-yi Liu Gang Wang +2 位作者 Qiang Fu Shao-hua Yue Si-yuan Wang 《Defence Technology（防务技术）》 SCIE EI CAS CSCD 2023年第1期210-219,共10页; The scale of ground-to-air confrontation task assignments is large and needs to deal with many concurrent task assignments and random events.Aiming at the problems where existing task assignment methods are applied to... 展开更多; 关键词 Ground-to-air confrontation Task assignment General and narrow agents Deep reinforcement learning Proximal policy optimization(PPO); 在线阅读下载PDF 职称材料

Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning 被引量：2: 6; 作者 Jiawei Xia Yasong Luo +3 位作者 Zhikun Liu Yalun Zhang Haoran Shi Zhong Liu 《Defence Technology（防务技术）》 SCIE EI CAS CSCD 2023年第11期80-94,共15页; To solve the problem of multi-target hunting by an unmanned surface vehicle(USV)fleet,a hunting algorithm based on multi-agent reinforcement learning is proposed.Firstly,the hunting environment and kinematic model wit... 展开更多; 关键词 Unmanned surface vehicles Multi-agent deep reinforcement learning Cooperative hunting Feature embedding Proximal policy optimization; 在线阅读下载PDF 职称材料

已选择0条

导出题录引用分析

统计分析

上一页 1 下一页到第页

使用帮助返回顶部