Research on virtual entity decision model for LVC tactical confrontation of army units 被引量：4

在线阅读下载PDF

导出

摘要 According to the requirements of the live-virtual-constructive(LVC)tactical confrontation(TC)on the virtual entity(VE)decision model of graded combat capability,diversified actions,real-time decision-making,and generalization for the enemy,the confrontation process is modeled as a zero-sum stochastic game(ZSG).By introducing the theory of dynamic relative power potential field,the problem of reward sparsity in the model can be solved.By reward shaping,the problem of credit assignment between agents can be solved.Based on the idea of meta-learning,an extensible multi-agent deep reinforcement learning(EMADRL)framework and solving method is proposed to improve the effectiveness and efficiency of model solving.Experiments show that the model meets the requirements well and the algorithm learning efficiency is high.

作者 GAO Ang GUO Qisheng DONG Zhiming TANG Zaijiang ZHANG Ziwei FENG Qiqi

机构地区 Military Exercise and Training Center

出处《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2022年第5期1249-1267,共19页 系统工程与电子技术（英文版）

基金 supported by the Military Scentific Research Project(41405030302,41401020301).

关键词 live-virtual-constructive(LVC) army unit tactical confrontation(TC) intelligent decision model multi-agent deep reinforcement learning

分类号 TP391.9 [自动化与计算机技术—计算机应用技术] E91 [军事]

作者简介 GAO Ang was born in 1988.He received his Ph.D.degree in science of military equipemnt from Army Academy of Armored Forces.He is a Ph.D.candidate in Army Academy of Armored Forces.His research interest is intelligent decision of computer generated force based on multi-agent deep reinforcement learning.E-mail:15689783388@163.com;GUO Qisheng was born in 1962.He received his Ph.D.degree in science of military equipemnt from Tsinghua University.His research interests are equipment requirement demonstration and equipment test.E-mail:236211566@qq.com;Corresponding author:DONG Zhiming was born in 1977.He received his Ph.D.degree in science of military equipemnt from Army Academy of Armored Forces.His research interests are equipment requirement demonstration and equipment test.E-mail:dong_zhiming@163.com;TANG Zaijiang was born in 1976.He received his Ph.D.degree in science of military equipemnt from Army Academy of Armored Forces.His research interest is battle simulation.E-mail:tangzaijiang@sina.com;ZHANG Ziwei was born in 1986.He received his Ph.D.degree in science of military equipemnt from Army Academy of Armored Forces.He is a Ph.D.candidate in Army Academy of Armored Forces.His research interest is equipment test evaluation.E-mail:gaoang370829@sohu.com;FENG Qiqi was born in 1992.She received her M.S.degree in science of military equipemnt form Army Academy of Armored Forces.She is pursuing her Ph.D.degree in Army Academy of Armored Forces.Her research interest is real-time research of live virtual constructive.E-mail:594472717@qq.com。

引文网络
相关文献

参考文献10

1陈斌,王江,王阳.战斗机嵌入式训练系统中的智能虚拟陪练[J].航空学报,2020,41(6):359-373. 被引量：15
2欧峤,贺筱媛,陶九阳.协同目标分配问题研究综述[J].系统仿真学报,2019,31(11):2216-2227. 被引量：11
3ZHANG Jie,WANG Gang,YUE Shaohua,SONG Yafei,LIU Jiayi,YAO Xiaoqiang.Multi-agent system application in accordance with game theory in bi-directional coordination network model[J].Journal of Systems Engineering and Electronics,2020,31(2):279-289. 被引量：3
4刘波,张选平,王瑞,覃征.基于组合拍卖的协同多目标攻击空战决策算法[J].航空学报,2010,31(7):1433-1444. 被引量：26
5徐安,于雷,寇英信,徐保伟,李战武.基于MDP框架的飞行器隐蔽接敌策略[J].系统工程与电子技术,2011,33(5):1063-1068. 被引量：11
6XU Ximeng,YANG Rennong,FU Ying.Situation assessment for air combat based on novel semi-supervised naive Bayes[J].Journal of Systems Engineering and Electronics,2018,29(4):768-779. 被引量：15
7Chengwei Ruan,Zhongliang Zhou,Hongqiang Liu,Haiyan Yang.Task assignment under constraint of timing sequential for cooperative air combat[J].Journal of Systems Engineering and Electronics,2016,27(4):836-844. 被引量：6
8Xiao Xu,Mei Yang,Ge Li.ADAPTIVE CGF COMMANDER BEHAVIOR MODELING THROUGH HTN GUIDED MONTE CARLO TREE SEARCH[J].Journal of Systems Science and Systems Engineering,2018,27(2):231-249. 被引量：7
9宣贺君,向勇,和晓强,刘道华.联合火力打击中武器目标分配问题的多目标优化模型及算法[J].信阳师范学院学报（自然科学版）,2019,32(4):664-669. 被引量：14
10孔德鹏,常天庆,郝娜,张雷,郭理彬.基于对抗的突击武器与支援武器协同火力打击决策方法[J].兵工学报,2019,40(3):629-640. 被引量：13

二级参考文献138

1王振宇,马亚平,李柯.联合火力打击火力分配方案优化方法研究[J].军事运筹与系统工程,2005,19(2):12-17. 被引量：14
2程红斌,张凤鸣,张晓丰.多机协同空战目标分配算法[J].空军工程大学学报（自然科学版）,2005,6(2):7-10. 被引量：12
3罗德林,杨忠,段海滨,吴在桂,沈春林.HEURISTIC PARTICLE SWARM OPTIMIZATION ALGORITHM FOR AIR COMBAT DECISION-MAKING ON CMTA[J].Transactions of Nanjing University of Aeronautics and Astronautics,2006,23(1):20-26. 被引量：18
4史建国,高晓光,李相民.基于离散模糊动态贝叶斯网络的空战态势评估及仿真[J].系统仿真学报,2006,18(5):1093-1096. 被引量：29
5欧爱辉,朱自谦.基于多属性决策和态势估计结果的空战威胁评估方法[J].火控雷达技术,2006,35(2):64-67. 被引量：14
6霍霄华,陈岩,朱华勇,沈林成.多UCAV协同控制中的任务分配模型及算法[J].国防科技大学学报,2006,28(3):83-88. 被引量：48
7蔡怀平,刘靖旭,陈英武.动态武器目标分配问题的马尔可夫性[J].国防科技大学学报,2006,28(3):124-127. 被引量：22
8叶媛媛,闵春平,沈林成.多UCAV任务分配的混合遗传算法与约束处理[J].控制与决策,2006,21(7):781-786. 被引量：22
9龙涛,朱华勇,沈林成.多UCAV协同中基于协商的分布式任务分配研究[J].宇航学报,2006,27(3):457-462. 被引量：32
10王小艺,侯朝桢,原菊梅,郭飞,郝伟.防空火力分配建模及优化方法研究[J].控制与决策,2006,21(8):913-917. 被引量：24

共引文献109

1马金毅,王灿,薛涛,艾剑良,董一群.空战格斗飞行机动数据库建立及应用[J].航空学报,2023,44(S01):39-47. 被引量：2
2魏明英,崔正达,李运迁.多弹协同拦截综述与展望[J].航空学报,2020(S01):29-36. 被引量：34
3高永,李本威.超视距空战效能评估模型[J].海军航空工程学院学报,2012,27(1):66-70. 被引量：2
4陈侠,唐婷.不确定环境下多无人机动态任务分配方法[J].火力与指挥控制,2013,38(1):45-49. 被引量：11
5费爱国,张陆游,胡晓静.基于改进拍卖算法的空空导弹制导权移交技术研究[J].上海航天,2013,30(1):18-23. 被引量：3
6费爱国,张陆游,刘刚,王远.基于粒子群拍卖混合算法的空空导弹制导权移交技术[J].宇航学报,2013,34(3):340-346. 被引量：11
7付昭旺,于雷,刘霞,曲大鹏.网络信息支持下目标“虚拟跟踪”方法研究[J].电光与控制,2013,20(4):1-6. 被引量：2
8张涛,于雷,魏贤智,周中良.改进遗传算法的超视距协同多目标攻击决策[J].火力与指挥控制,2013,38(5):137-140. 被引量：7
9付昭旺,于雷,李战武,李飞.战斗机隐蔽接敌轨迹优化方法[J].国防科技大学学报,2013,35(5):52-58. 被引量：7
10万路军,姚佩阳,周翔翔,孙鹏.有人/无人作战智能体分布式协同目标分配方法[J].系统工程与电子技术,2014,36(2):278-287. 被引量：17

同被引文献51

1姚益平,朱峰,唐文杰,范波,陈凯.智能装备仿真实验技术初探[J].系统仿真技术,2023,19(2):97-106. 被引量：2
2王志强,王姿旖,王卓越.面向网络空间安全的创新人才培养机制探索与实践[J].北京电子科技学院学报,2022,30(1):144-150. 被引量：9
3王辉青.作战实验若干基本理论问题探讨[J].军事运筹与系统工程,2008,22(1):3-8. 被引量：21
4卜先锦,叶雄兵,季明.作战实验点设计研究[J].军事运筹与系统工程,2010,24(2):61-66. 被引量：6
5周玉芳,余云智,翟永翠.LVC仿真技术综述[J].指挥控制与仿真,2010,32(4):1-7. 被引量：40
6文伟平.北大信息安全方向硕士研究生教学模式探索[J].信息安全与通信保密,2014,12(5):45-47. 被引量：1
7李进,吉宁,刘小荷,李韦翰.美军新一代支持联合训练的JLVC2020框架研究[J].计算机仿真,2015,32(1):463-467. 被引量：24
8管文辉,樊长虹.基于CAN总线的智能组合靶标[J].电子世界,2015(15):27-29. 被引量：1
9周敏.网络攻防实战教学系统的设计与实现[J].实验技术与管理,2016,33(6):154-156. 被引量：7
10罗军舟,杨明,凌振,吴文甲,顾晓丹.网络空间安全体系与关键技术[J].中国科学：信息科学,2016,46(8):939-968. 被引量：59

引证文献4

1张畅,张玉臣,冀会芳,张恒巍.网络空间安全实战化教学训练的思考[J].网络安全技术与应用,2024(2):165-169. 被引量：2
2孙晴,王步云,孙翼.基于LVC的作战实验方法研究[J].舰船电子工程,2024,44(12):110-114.
3郭斐然,李庆坤,蔡敬坤,晁鲁静.面向跨域协同智能体系的多粒度LVC仿真系统设计[J].指挥控制与仿真,2025,47(2):141-148. 被引量：1
4张琪,黄鹤松,蔡亚,焦鹏.一种基于靶标代理的虚实对抗训练系统交互设计方法[J].指挥控制与仿真,2025,47(3):126-134.

二级引证文献3

1苏晶,任一支,姚晔,许艳萍,程振伟.网络强国建设背景下高校实战化网络安全人才培养模式和质量监控评价探素[J].评价与管理,2024,22(2):24-29.
2辛岳霖.轮胎企业财务风险评估系统的SVM仿真模型构建与应用[J].中国轮胎资源综合利用,2025(3):67-69.
3封富君,李海龙,沈燮阳,邢宇航.网络安全课程的教学改革探索[J].网络安全技术与应用,2025(6):94-97.

1程学龙,朱大奇,孙兵,陈云赛.深海载人潜水器推进器系统故障诊断的新型主元分析算法[J].控制理论与应用,2018,35(12):1796-1804. 被引量：4
2Hu Kefei,Xie Ying.Trash Collectors:THE GARBAGE ARMY[J].China Weekly,2021(2):32-35.
3高昂,郭齐胜,董志明,杨绍卿.基于EAS+MADRL的多无人车体系效能评估方法研究[J].系统工程与电子技术,2021,43(12):3643-3651. 被引量：3
4Xie Renchao,Liu Xu,Duan Xuefei,Tang Qinqin,Yu Fei Richard,Huang Tao.Dynamic computation offloading in time-varying environment for ultra-dense networks:a stochastic game approach[J].The Journal of China Universities of Posts and Telecommunications,2021,28(2):24-37.
5LIU Peng,LI Jichao,XIA Boyuan,ZHAO Danling,TAN Yuejin.Weapons equipment portfolios selection based on equipment system contribution rates[J].Journal of Systems Engineering and Electronics,2021,32(3):584-595. 被引量：9
6巴西陶鲁斯公司G3战术手枪[J].轻兵器,2022(10).
7Haifa Ahmed Al MAASHI.From Security Governance to Geopolitical Rivalry:Iran-GCC Confrontation in the Red Sea and the Indian Ocean[J].Asian Journal of Middle Eastern and Islamic Studies,2017,11(4):46-63. 被引量：1
8Syng-Yup Ohn,Sung-Do Chi,Chan Heo.Identification of breast cancer by classification of proteome patterns[J].International Journal of Modeling, Simulation, and Scientific Computing,2016,7(4):36-44.
9孙布勒,杨昂,孙鹏,姜大洁.基于AI的信道估计的泛化性能提升方法[J].无线电通信技术,2022,48(4):652-657.
10Luguang Wang,Fei Song,Gui Fang,Zhibin Feng,Wen Li,Yifan Xu,Chen Pan,Xiaojing Chu.A Multi-Agent Reinforcement Learning-Based Collaborative Jamming System: Algorithm Design and Software-Defined Radio Implementation[J].China Communications,2022,19(10):38-54. 被引量：2

Journal of Systems Engineering and Electronics

2022年第5期

浏览历史

内容加载中请稍等...