23 articles found
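A Method for Constructing a Vector Space Model from Proximity Data Alone (cited by: 1)
1
Authors: 徐硕, 乔晓东, 朱礼军, 郭怀恩. 《情报学报》, CSSCI, PKU Core. 2011, No. 11, pp. 1163-1170 (8 pages)
In practical applications, many objects of study are abstract and hard to represent as feature vectors, which prevents many mature data mining and machine learning methods from being applied. Such objects can usually be converted into a proximity data matrix, however, whose elements express some "comparison" relation between two objects. To address this problem, this paper proposes using multidimensional scaling (MDS) to obtain vector representations of the objects from the proximity data matrix alone, thereby constructing a vector space model. Finally, cluster analysis was performed on terms in the Chinese scientific and technical vocabulary system (汉语科技词系统). The results show that clustering after constructing the vector space model clearly outperforms clustering directly on the proximity data, which confirms the feasibility and effectiveness of the method.
Keywords: multidimensional scaling; proximity data; vector space model; Chinese scientific and technical vocabulary system; cluster analysis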
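The MDS step described in this abstract can be illustrated with classical (Torgerson) MDS. The following is only a minimal sketch, not the authors' implementation: it assumes the proximity data are symmetric dissimilarities, and the names `classical_mds` and `euclidean`, as well as the use of power iteration as the eigen-solver, are illustrative choices.

```python
import math

def classical_mds(dist, dims=2, iters=500):
    """Embed n objects in `dims` dimensions from a symmetric n x n
    dissimilarity matrix (hypothetical helper, classical MDS)."""
    n = len(dist)
    # Double-center the squared dissimilarities: B = -1/2 * J * D^2 * J.
    sq = [[dist[i][j] ** 2 for j in range(n)] for i in range(n)]
    row_mean = [sum(row) / n for row in sq]
    total_mean = sum(row_mean) / n
    b = [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + total_mean)
          for j in range(n)] for i in range(n)]

    coords = [[0.0] * dims for _ in range(n)]
    for d in range(dims):
        # Power iteration for the leading eigenpair of the (deflated) B.
        v = [1.0 + 0.1 * i for i in range(n)]  # deterministic start vector
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        for _ in range(iters):
            w = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break  # nothing left to extract
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue estimate for v.
        lam = sum(v[i] * b[i][j] * v[j] for i in range(n) for j in range(n))
        if lam <= 1e-12:
            break  # remaining eigenvalues are zero or negative
        for i in range(n):
            coords[i][d] = math.sqrt(lam) * v[i]
        # Deflate B so the next pass finds the next eigenpair.
        for i in range(n):
            for j in range(n):
                b[i][j] -= lam * v[i] * v[j]
    return coords

def euclidean(p, q):
    """Euclidean distance between two coordinate lists."""
    return math.sqrt(sum((a - c) ** 2 for a, c in zip(p, q)))
```

Once each object has coordinates, any feature-vector clustering method (k-means, for instance) can be applied to them, which is the workflow the abstract compares against clustering the proximity matrix directly.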
Research on proximity effect of electromagnetic railgun (cited by: 8)
2
Authors: Yu-tao LOU, Hai-yuan LI, Bao-ming LI. 《Defence Technology(防务技术)》, SCIE, EI, CAS, CSCD. 2016, No. 3, pp. 223-226 (4 pages)
The rails of an electromagnetic railgun can be ablated by the temperature rise caused by current concentration. The current distributions on the rails and armature are affected not only by the skin effect but also by the proximity effect, which is rarely mentioned. This paper illustrates the difference between the skin effect and the proximity effect and investigates the factors that influence the proximity effect. Results show that the current is concentrated on the surface of the rails due to the skin effect, and that the proximity effect further increases the current density on the inner surfaces of the rails. A decrease in the distance between the rails enhances the proximity effect but does not change the skin effect; it also increases the rail resistance, resulting in temperature rise. This can explain why ablation is often detected in small-caliber railguns. The results can support the design and optimization of electromagnetic railguns.
Keywords: electromagnetic railgun; proximity effect; skin effect; ablation
Adaptive distributed formation maintenance for multiple UAVs: Exploiting proximity behavior observations (cited by: 6)
3
Authors: LIU Wei-heng, ZHENG Xin, DENG Zhi-hong. 《Journal of Central South University》, SCIE, EI, CAS, CSCD. 2021, No. 3, pp. 784-795 (12 pages)
The formation maintenance of multiple unmanned aerial vehicles (UAVs) based on proximity behavior is explored in this study. Individual decision-making is conducted according to the expected UAV formation structure and the position, velocity, and attitude information of other UAVs in the azimuth area. This resolves the problems that nodes must be strongly connected and communication must be strictly consistent under the traditional distributed formation control method. An adaptive distributed formation flight strategy is established for multiple UAVs by exploiting proximity behavior observations, which remedies the poor flexibility of distributed formation. This technique ensures consistent position and attitude among UAVs. In the proposed method, the azimuth area relative to the UAV itself is established to capture the state information of proximal UAVs. A dependency degree factor is introduced into the state update equation based on proximity behavior. Finally, the formation position, speed, and attitude errors are used to form an adaptive dynamic adjustment strategy. Simulations demonstrate the effectiveness and robustness of the proposed method.
Keywords: unmanned aerial vehicle; formation maintenance; proximity behavior; adaptive distributed control; formation flight control
Three-dimensional coordinates test method with uncertain projectile proximity explosion position based on dynamic seven photoelectric detection screen (cited by: 2)
4
Authors: Han-shan Li, Xiao-qian Zhang. 《Defence Technology(防务技术)》, SCIE, EI, CAS, CSCD. 2022, No. 9, pp. 1643-1652 (10 pages)
To objectively obtain the three-dimensional coordinates of the projectile fuze proximity explosion when a projectile intersects the head of a missile target, we propose a dynamic seven photoelectric detection screen test method, which is made up of six plane detection screens and a flash photoelectric dynamic detection screen. A calculation model for the three-dimensional coordinates of the projectile proximity explosion position, based on seven plane detection screens with dynamic characteristics, is established. From the relation between the dynamic seven photoelectric detection screen planes and the time values, the analytical function of the projectile proximity explosion position parameters under non-linear motion is derived. A projectile signal filtering method based on the discrete wavelet transform is explored, and a projectile signal recognition algorithm using an improved particle swarm is proposed. Based on the characteristics of the time duration and the signal peak error for a projectile passing through the detection screens, the attribution of signals from the same projectile passing through the six detection screens is analyzed to obtain precise time values. On the basis of the projectile fuze proximity explosion test, the linear motion model and the proposed non-linear motion model are used to calculate and compare the proximity explosion position parameters of the same group of projectiles. The comparison of test results verifies that the proposed test method and calculation model accurately obtain the actual projectile proximity explosion position parameters.
Keywords: dynamic multi-screen array plane; flash photoelectric detection target; projectile signal processing; particle swarm; proximity explosion fuze; three-dimensional coordinate
Ethical Leadership and Moral Imagination: A Moderated Mediation Model of Proximity and Organizational Commitment
5
Authors: Jiang Xiaochuan, Yang Jianfeng. 《学术界》, CSSCI, PKU Core. 2019, No. 5, pp. 191-202 (12 pages)
Moral imagination is the ability that helps individuals overcome the constraints of organizational mental models, develop fresh frameworks, and make ethical decisions on the basis of those frameworks. This study explored the moderated mediating role of organizational commitment between ethical leadership and moral imagination. Data from 281 employees were collected. The results showed that when the victim of an ethical issue is the employees' own company, organizational commitment fully mediated the effect of ethical leadership on moral imagination; when the victim is another company, however, ethical leadership and organizational commitment had no effect on moral imagination. These results indicate that the process by which ethical leadership influences moral imagination is not a social learning process but a social exchange process.
Keywords: ethical leadership; moral imagination; proximity; ethical decision making
A high performance waveform and a new ranging method for the proximity detector
6
Authors: Qi-le Chen, Xin-hong Hao, Xiao-peng Yan, Ping Li. 《Defence Technology(防务技术)》, SCIE, EI, CAS, CSCD. 2020, No. 4, pp. 834-845 (12 pages)
Signal modulation is an essential design factor for proximity detectors and directly affects the system's potential performance. To obtain the advantages of chaotic codes bi-phase modulation (CCBPM) and linear frequency modulation (LFM) simultaneously, this paper designs a waveform for proximity detectors that combines the two (CCBPM-LFM). The CCBPM-LFM waveform is analyzed in terms of time delay resolution (TDR) and Doppler tolerance (DT) based on the ambiguity function (AF). A ranging method, called instant correlation harmonic demodulation (ICHD), is then presented for detectors using the CCBPM-LFM waveform. By combining time-domain instant correlation with harmonic demodulation, ICHD solves the problem caused by combination modulation and makes the most of the LFM harmonics and the correlation of chaotic codes. Finally, a prototype was implemented and ranging experiments were carried out. The theoretical analysis and experimental results show that a proximity detector using the CCBPM-LFM waveform has outstanding detection performance.
Keywords: proximity detectors; detection performance; combination modulation; chaotic codes bi-phase modulation; linear frequency modulation
Air target recognition method against ISRJ for radio frequency proximity sensors using chaotic stream encryption
7
Authors: Jian-feng Li, Jian Dai, Xin-hong Hao, Xiao-peng Yan, Xin-wei Wang. 《Defence Technology(防务技术)》, SCIE, EI, CAS, CSCD. 2023, No. 10, pp. 267-279 (13 pages)
Interrupted-sampling repeater jamming (ISRJ) can create false targets for radio-frequency proximity sensors (RFPSs), seriously degrading their target detection capability. This article proposes a recognition method that lets RFPSs identify the false targets caused by ISRJ. The method assigns a unique identity (ID) to each RFPS, and each ID is periodically and chaotically encrypted in every pulse period. Processing of the received signal is divided into ranging and ID decryption. In the ranging part, a high-resolution range profile (HRRP) is obtained by performing pulse compression with the binary chaotic sequences. To suppress noise, singular value decomposition (SVD) is applied in preprocessing. In ID decryption, targets and ISRJ are distinguished through the encryption and decryption processes, which are controlled by random keys. An adaptability analysis in terms of the peak-to-side lobe ratio (PSLR) and bit error rate (BER) indicates that the proposed method performs well within a 70-kHz Doppler shift. Simulation and experimental results show that the proposed method achieves extremely stable target and ISRJ recognition accuracies at different signal-to-noise ratios (SNRs) and jamming-to-signal ratios (JSRs).
Keywords: interrupted-sampling repeater jamming (ISRJ); radio frequency proximity sensors (RFPS); chaotic stream encryption; air target recognition; identity (ID) decryption
A novel trajectories optimizing method for dynamic soaring based on deep reinforcement learning
8
Authors: Wanyong Zou, Ni Li, Fengcheng An, Kaibo Wang, Changyin Dong. 《Defence Technology(防务技术)》. 2025, No. 4, pp. 99-108 (10 pages)
Dynamic soaring, inspired by the wind-riding flight of birds such as albatrosses, is a biomimetic technique that leverages wind fields to enhance the endurance of unmanned aerial vehicles (UAVs). Achieving a precise soaring trajectory is crucial for maximizing energy efficiency during flight. Existing nonlinear programming methods depend heavily on the choice of initial values, which are hard to determine. This paper therefore introduces a deep reinforcement learning method based on a differentially flat model for dynamic soaring trajectory planning and optimization. First, the gliding trajectory is parameterized using Fourier basis functions, giving a flexible trajectory representation with a minimal number of hyperparameters. The trajectory optimization problem is then formulated as a dynamic interactive Markov decision process. The hyperparameters of the trajectory are optimized using the Proximal Policy Optimization (PPO2) algorithm from deep reinforcement learning (DRL), which reduces the strong reliance on initial value settings in the optimization process. Finally, a comparison with the nonlinear programming method shows that the trajectory generated by the proposed approach is smoother while meeting the same performance requirements. Specifically, the proposed method achieves a 34% reduction in maximum thrust, a 39.4% decrease in maximum thrust difference, and a 33% reduction in maximum airspeed difference.
Keywords: dynamic soaring; differential flatness; trajectory optimization; proximal policy optimization
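Several entries in this list rely on Proximal Policy Optimization. As a rough illustration of the idea, the standard clipped surrogate objective of PPO can be written in a few lines. This is a generic sketch, not code from any of the listed papers; the function name `ppo_clip_objective` and the default `eps=0.2` are assumptions made for the example.

```python
import math

def ppo_clip_objective(new_logp, old_logp, advantages, eps=0.2):
    """Mean clipped surrogate objective (to be maximized).

    new_logp / old_logp: log-probabilities of the taken actions under
    the current and the data-collecting policy; advantages: estimated
    advantages for the same samples.
    """
    total = 0.0
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(nl - ol)                       # pi_new / pi_old
        clipped = max(1.0 - eps, min(1.0 + eps, ratio)) # keep ratio near 1
        # Taking the minimum removes any incentive to move the ratio
        # outside [1 - eps, 1 + eps] in the direction that helps.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

The clipping is what makes the update "proximal": the new policy gains nothing from drifting far from the policy that collected the data.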
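A Model Pruning Algorithm Based on Sparse Optimization and the Nesterov Momentum Strategy (cited by: 1)
9
Authors: 周强, 陈军, 鲍蕾, 陶卿. 《数据采集与处理》, CSCD, PKU Core. 2024, No. 3, pp. 659-667 (9 pages)
With the rapid development of deep learning, the parameter counts and computational complexity of models have grown explosively, which makes deployment on mobile devices challenging; model pruning has become key to putting deep learning models into practical use. Current regularization-based pruning methods typically use L2 regularization together with a magnitude-based importance criterion. This is an empirical approach that lacks theoretical grounding, and its accuracy is hard to guarantee. Inspired by proximal gradient methods for sparse optimization problems, this paper proposes Prox-NAG, an optimization method that directly produces sparse solutions on deep neural networks, and designs a matching iterative pruning algorithm. The method is based on L1 regularization and uses Nesterov momentum to solve the optimization problem. It removes the dependence of earlier regularization-based pruning methods on L2 regularization and magnitude criteria, and is a natural extension of sparse optimization from traditional machine learning to deep learning. Pruning experiments on ResNet-family models on the CIFAR10 dataset show that the Prox-NAG pruning algorithm improves on the performance of existing pruning algorithms.
Keywords: sparsity; optimization; pruning algorithm; proximal gradient method; Nesterov accelerated gradient (NAG)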
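To make the combination of L1 regularization, a proximal step, and Nesterov momentum concrete, here is a generic accelerated proximal-gradient sketch. It is not the Prox-NAG algorithm of the paper, whose details the abstract does not give; the function names, the constant momentum coefficient, and the toy objective are all assumptions made for illustration.

```python
def soft_threshold(x, t):
    """Proximal operator of t * |x|: shrink x toward zero by t."""
    if x > t:
        return x - t
    if x < -t:
        return x + t
    return 0.0

def prox_nesterov(grad, w0, lr=0.1, lam=0.5, momentum=0.5, steps=300):
    """Minimize f(w) + lam * ||w||_1, where `grad` returns the gradient of f.

    Each step extrapolates with momentum, takes a gradient step on the
    smooth part f, then applies soft-thresholding for the L1 part.
    """
    w = list(w0)
    prev = list(w0)
    for _ in range(steps):
        # Nesterov look-ahead point.
        y = [wi + momentum * (wi - pi) for wi, pi in zip(w, prev)]
        g = grad(y)
        prev = w
        w = [soft_threshold(yi - lr * gi, lr * lam) for yi, gi in zip(y, g)]
    return w

# Toy problem (assumed): f(w) = 0.5 * ||w - target||^2.
target = [3.0, 0.2, -1.0]
weights = prox_nesterov(lambda y: [yi - ti for yi, ti in zip(y, target)],
                        [0.0, 0.0, 0.0])
```

In this toy run the small middle coordinate is driven exactly to zero by the soft-thresholding step. That exact sparsity is what distinguishes an L1 proximal update from L2 weight decay, which only shrinks weights and needs a separate magnitude threshold before pruning.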
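A Brief Analysis of the Wakelock Mechanism and Android Power Management (cited by: 3)
10
Authors: 邵俊骏. 《计算机应用与软件》, CSCD, PKU Core. 2013, No. 4, pp. 293-295 (3 pages)
Starting from the design of the Android power management module, this paper studies power management on Android-based platforms such as Android mobile phones and Android tablets. It briefly introduces Android power management and analyzes the Wakelock mechanism through the Android source code and a Proximity Sensor Wakelock application example, in order to explore the implementation philosophy of Android power management and, on that basis, to seek better power management methods.
Keywords: Android; power management; proximity sensor; sleep; Wakelock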
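A Lagrange-Multiplier-Like Method for Multiobjective Constrained Vector Optimization Problems (cited by: 3)
11
Authors: 李润鑫, 黄辉, 尚振宏, 曹宇, 王红斌, 张晶. 《数学物理学报(A辑)》, CSCD, PKU Core. 2018, No. 6, pp. 1076-1094 (19 pages)
Reference [21] gave a Lagrange multiplier rule for Pareto solutions of vector optimization problems with a single constraint in real Hilbert spaces. This paper extends the main results of [21] to multiobjective vector optimization problems with an arbitrary number m of constraints, and gives a Lagrange-multiplier-like rule for multiobjective constrained vector optimization problems in real Hilbert spaces, characterized by the proximal normal cone and the coderivative of the objective function.
Keywords: vector optimization; proximal normal cone; coderivative; weak ε-Pareto solution; multiobjective constrained vector optimization problem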
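Research Progress on Proximal Junctional Kyphosis after Spinal Deformity Correction Surgery (cited by: 6)
12
Authors: 王天昊, 赵永飞, 王岩. 《中国脊柱脊髓杂志》, CAS, CSCD, PKU Core. 2016, No. 1, pp. 77-81 (5 pages)
Spinal deformity correction surgery rebuilds trunk balance by correcting deformity in the sagittal and coronal planes and performing long-segment fixation and fusion. Proximal junctional kyphosis (PJK) is a specific radiographic finding that occurs after corrective surgery for scoliosis or kyphosis, and is usually caused by altered stress at the junctional region proximal to the instrumentation [1].
Keywords: proximal end; spinal deformity correction surgery; kyphosis; junctional; revision surgery; internal fixation; scoliosis; proximal; overcorrection; junctional region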
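Optimality Conditions for Proximal Proper Efficient Solutions of Multiobjective Optimization Problems (cited by: 5)
13
Authors: 李小燕, 高英. 《应用数学和力学》, CSCD, PKU Core. 2015, No. 6, pp. 668-676 (9 pages)
Under generalized convexity assumptions, a linear scalarization of the proximal proper efficient points of a set is given, and on this basis their equivalence with Benson proper efficient points and Borwein proper efficient points is proved. Applying these results to multiobjective optimization problems yields optimality conditions for proximal proper efficient solutions. Finally, fuzzy optimality conditions for proximal proper efficient solutions are obtained using the proximal subdifferential.
Keywords: proximal normal cone; multiobjective optimization; proximal proper efficient solution; optimality conditions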
Improved pruning algorithm for Gaussian mixture probability hypothesis density filter (cited by: 8)
14
Authors: NIE Yongfang, ZHANG Tao. 《Journal of Systems Engineering and Electronics》, SCIE, EI, CSCD. 2018, No. 2, pp. 229-235 (7 pages)
As the number of Gaussian components grows, the computation cost of the Gaussian mixture probability hypothesis density (GM-PHD) filter increases. Based on the theory of Chen et al., we propose an improved pruning algorithm for the GM-PHD filter that uses not only the means and covariances of the Gaussian components but also their weights as a new criterion, improving the estimation accuracy of the conventional pruning algorithm when tracking targets in very close proximity. Moreover, it solves the endless while-loop problem without the need for a second merging step. Simulation results show that the improved algorithm is easier to implement and more robust than the former ones.
Keywords: Gaussian mixture probability hypothesis density (GM-PHD) filter; pruning algorithm; proximity targets; clutter rate
High-resolution forward-looking imaging algorithm for missile-borne detectors (cited by: 2)
15
Authors: CHENG Cheng, GAO Min, ZHOU Xiaodong. 《Journal of Systems Engineering and Electronics》, SCIE, EI, CSCD. 2019, No. 3, pp. 456-466 (11 pages)
For a novel missile-borne detector in the optional burst height proximity fuze, a self-adaptive high-resolution forward-looking imaging algorithm (SAHRFL-IA) is presented. The echo data are captured by the missile-borne detector in the target regions, so that the azimuth angulation accuracy at the same distance dimension is improved dynamically. Azimuth information of the targets in the detection area can thus be obtained accurately. The proposed imaging algorithm breaks with the conventional practice of using only azimuth discrimination curves obtained under ideal conditions during monopulse angulation. The real-time echo data from the target region are used to correct errors in this discrimination curve, so that the accuracy of the azimuth angulation reaches its optimum at the same distance dimension. A series of experiments demonstrates the validity, reliability, and high performance of the proposed imaging algorithm. Azimuth angulation accuracy may reach ten times that of the detection beam width. Meanwhile, the running time of the algorithm satisfies the requirements of missile-borne platforms.
Keywords: forward-looking imaging; high-resolution; missile-borne detector; self-adaptive; radio proximity fuze
Comparison of three formulations for eddy-current problems in a spiral coil electromagnetic acoustic transducer
16
Authors: 石文泽, 吴运新, 龚海, 赵志然, 范吉志, 谭良辰. 《Journal of Central South University》, SCIE, EI, CAS, CSCD. 2016, No. 4, pp. 817-824 (8 pages)
Three differential equations based on different definitions of current density are compared. Formulation I is based on an incomplete equation for total current density (TCD). Formulations II and III are based on incomplete and complete equations for source current density (SCD), respectively. Using the weak form of the finite element method (FEM), the three formulations were applied to a spiral coil electromagnetic acoustic transducer (EMAT) example to solve for the magnetic vector potential (MVP). The input impedances calculated by Formulation III are in excellent agreement with the experimental measurements. Results show that the errors of Formulations I and II vary with coil diameter, coil spacing, lift-off distance, and external excitation frequency, owing to eddy-current, skin, and proximity effects. The current distribution across the coil conductor follows the same trend. When the coil diameter is less than twice the skin depth, it is better to choose Formulation I instead of Formulation III to solve for the MVP, because Formulation I is a low-cost, high-efficiency calculation method.
Keywords: electromagnetic acoustic transducer (EMAT); eddy current; finite element method (FEM); skin and proximity effects; spiral coil
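The rule of thumb in this abstract compares the coil conductor diameter with twice the skin depth. The standard skin-depth formula for a good conductor, δ = sqrt(ρ / (π f μ)), makes that check easy to evaluate. The sketch below is illustrative only: the function names are hypothetical, and the copper resistivity used in the example (about 1.68e-8 Ω·m) is an assumed textbook value, not a figure from the paper.

```python
import math

MU_0 = 4.0e-7 * math.pi  # vacuum permeability in H/m

def skin_depth(resistivity, frequency, mu_r=1.0):
    """Skin depth in metres: sqrt(rho / (pi * f * mu_0 * mu_r))."""
    return math.sqrt(resistivity / (math.pi * frequency * MU_0 * mu_r))

def is_thin_conductor(diameter, resistivity, frequency, mu_r=1.0):
    """True when the conductor diameter is below twice the skin depth,
    the regime in which the abstract recommends Formulation I."""
    return diameter < 2.0 * skin_depth(resistivity, frequency, mu_r)

# Example with assumed values: copper wire at 1 MHz.
copper_delta = skin_depth(1.68e-8, 1.0e6)
```

For copper at 1 MHz this gives a skin depth of roughly 65 µm, so a 0.1 mm wire falls in the thin-conductor regime while a 0.2 mm wire does not.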
Task assignment in ground-to-air confrontation based on multiagent deep reinforcement learning (cited by: 4)
17
Authors: Jia-yi Liu, Gang Wang, Qiang Fu, Shao-hua Yue, Si-yuan Wang. 《Defence Technology(防务技术)》, SCIE, EI, CAS, CSCD. 2023, No. 1, pp. 210-219 (10 pages)
Task assignment in ground-to-air confrontation is large in scale and must handle many concurrent assignments and random events. When existing task assignment methods are applied to ground-to-air confrontation, they handle complex tasks inefficiently and suffer from interaction conflicts in multiagent systems. This study proposes a multiagent architecture based on one general agent with multiple narrow agents (OGMN) to reduce task assignment conflicts. Considering the slow speed of traditional dynamic task assignment algorithms, this paper proposes the proximal policy optimization for task assignment of general and narrow agents (PPO-TAGNA) algorithm. Based on the idea of the optimal assignment strategy algorithm and combined with the training framework of deep reinforcement learning (DRL), the algorithm adds a multihead attention mechanism and a stage reward mechanism to the bilateral band clipping PPO algorithm to solve the problem of low training efficiency. Finally, simulation experiments are carried out in a digital battlefield. The OGMN-based multiagent architecture combined with the PPO-TAGNA algorithm obtains higher rewards faster and has a higher win ratio. Analysis of agent behavior verifies the efficiency, superiority, and rational resource utilization of the method.
Keywords: ground-to-air confrontation; task assignment; general and narrow agents; deep reinforcement learning; proximal policy optimization (PPO)
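UAV Path Planning Based on Multi-Agent Deep Reinforcement Learning (cited by: 10)
18
Authors: 司鹏搏, 吴兵, 杨睿哲, 李萌, 孙艳华. 《北京工业大学学报》, CAS, CSCD, PKU Core. 2023, No. 4, pp. 449-458 (10 pages)
To solve the path planning problem of multiple unmanned aerial vehicles (UAVs) in complex environments, a multi-agent deep reinforcement learning framework for UAV path planning is proposed. The framework first models the path planning problem as a partially observable Markov process and extends the proximal policy optimization algorithm to the multi-agent setting; obstacle-free path planning for multiple UAVs is achieved by designing the UAVs' state observation space, action space, and reward function. Second, to suit the limited computing resources carried by UAVs, a network pruning-based multi-agent proximal policy optimization (NP-MAPPO) algorithm is further proposed, which improves training efficiency. Simulation results verify the effectiveness of the proposed multi-UAV path planning framework under various parameter configurations and the superiority of the NP-MAPPO algorithm in training time.
Keywords: unmanned aerial vehicle (UAV); complex environment; path planning; Markov decision process; multi-agent proximal policy optimization (MAPPO); network pruning (NP)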
LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle (cited by: 3)
19
Authors: XIA Jiawei, ZHU Xufang, LIU Zhong, XIA Qingtao. 《Journal of Systems Engineering and Electronics》, SCIE, EI, CSCD. 2023, No. 5, pp. 1343-1358 (16 pages)
To solve the path following control problem for unmanned surface vehicles (USVs), a control method based on deep reinforcement learning (DRL) with long short-term memory (LSTM) networks is proposed. A distributed proximal policy optimization (DPPO) algorithm, which is a modified actor-critic type of reinforcement learning algorithm, is adapted to improve the controller performance over repeated trials. The LSTM network structure is introduced to handle the strong temporal correlation of the USV control problem. In addition, a specially designed path dataset, including straight and curved paths, is established to simulate various sailing scenarios so that the reinforcement learning controller can gain as much handling experience as possible. Extensive numerical simulation results demonstrate that the proposed method achieves better control performance in missions involving complex maneuvers than a controller trained with limited scenarios, and that it can potentially be applied in practice.
Keywords: unmanned surface vehicle (USV); deep reinforcement learning (DRL); path following; path dataset; proximal policy optimization; long short-term memory (LSTM)
Cooperative multi-target hunting by unmanned surface vehicles based on multi-agent reinforcement learning (cited by: 2)
20
Authors: Jiawei Xia, Yasong Luo, Zhikun Liu, Yalun Zhang, Haoran Shi, Zhong Liu. 《Defence Technology(防务技术)》, SCIE, EI, CAS, CSCD. 2023, No. 11, pp. 80-94 (15 pages)
To solve the problem of multi-target hunting by an unmanned surface vehicle (USV) fleet, a hunting algorithm based on multi-agent reinforcement learning is proposed. First, the hunting environment and a kinematic model without boundary constraints are built, and the criteria for successful target capture are given. Then, the cooperative hunting problem of a USV fleet is modeled as a decentralized partially observable Markov decision process (Dec-POMDP), and a distributed partially observable multitarget hunting Proximal Policy Optimization (DPOMH-PPO) algorithm applicable to USVs is proposed. In addition, an observation model, a reward function, and an action space suited to multi-target hunting tasks are designed. To deal with the dynamically changing dimension of the observational features input by partially observable systems, a feature embedding block is proposed. Observational feature encoding is established by combining two feature compression methods, column-wise max pooling (CMP) and column-wise average pooling (CAP). Finally, the centralized training and decentralized execution framework is adopted to train the hunting strategy. Each USV in the fleet shares the same policy and performs actions independently. Simulation experiments verify the effectiveness of the DPOMH-PPO algorithm in test scenarios with different numbers of USVs. The advantages of the proposed model are analyzed in terms of algorithm performance, migration effect across task scenarios, and self-organization capability after damage, and the potential for deploying DPOMH-PPO in a real environment is verified.
Keywords: unmanned surface vehicles; multi-agent deep reinforcement learning; cooperative hunting; feature embedding; proximal policy optimization
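The feature embedding idea in this abstract, compressing a varying number of observed entities into a fixed-length input, can be sketched with column-wise max pooling (CMP) and column-wise average pooling (CAP). The code below is an assumption-laden illustration: the function name `encode_observation` is hypothetical, and concatenating the two pooled vectors is only one plausible way of "combining" them, since the abstract does not say how the paper does it.

```python
def encode_observation(entity_features):
    """Compress a variable-length list of per-entity feature rows into
    one fixed-length vector: column-wise max followed by column-wise mean.

    entity_features: list of rows, each a list of d floats.
    Returns a list of length 2 * d regardless of the number of rows.
    """
    if not entity_features:
        raise ValueError("need at least one observed entity")
    columns = list(zip(*entity_features))                 # transpose rows
    cmp_part = [max(col) for col in columns]              # CMP
    cap_part = [sum(col) / len(col) for col in columns]   # CAP
    return cmp_part + cap_part
```

Because both pooling operations reduce over the entity axis, the output size depends only on the per-entity feature dimension, so the same policy network can be used whether a USV currently observes two neighbors or ten.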