期刊文献+

Robust reinforcement learning with UUB guarantee for safe motion control of autonomous robots 被引量:1

原文传递
导出
摘要 This paper addresses the issue of safety in reinforcement learning(RL)with disturbances and its application in the safety-constrained motion control of autonomous robots.To tackle this problem,a robust Lyapunov value function(rLVF)is proposed.The rLVF is obtained by introducing a data-based LVF under the worst-case disturbance of the observed state.Using the rLVF,a uniformly ultimate boundedness criterion is established.This criterion is desired to ensure that the cost function,which serves as a safety criterion,ultimately converges to a range via the policy to be designed.Moreover,to mitigate the drastic variation of the rLVF caused by differences in states,a smoothing regularization of the rLVF is introduced.To train policies with safety guarantees under the worst disturbances of the observed states,an off-policy robust RL algorithm is proposed.The proposed algorithm is applied to motion control tasks of an autonomous vehicle and a cartpole,which involve external disturbances and variations of the model parameters,respectively.The experimental results demonstrate the effectiveness of the theoretical findings and the advantages of the proposed algorithm in terms of robustness and safety.
出处 《Science China(Technological Sciences)》 SCIE EI CAS CSCD 2024年第1期172-182,共11页 中国科学(技术科学英文版)
基金 supported by the National Natural Science Foundation of China(Grant Nos.62225305 and 12072088) the Fundamental Research Funds for the Central Universities,China(Grant Nos.HIT.BRET.2022004,HIT.OCEF.2022047,and HIT.DZIJ.2023049) the Grant JCKY2022603C016,State Key Laboratory of Robotics and System(HIT) the Heilongjiang Touyan Team。
作者简介 Corresponding author:ZHANG LiXian,email:lixianzhang@hit.edu.cn。
  • 相关文献

参考文献5

二级参考文献21

共引文献20

同被引文献9

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部