Recent Advances on Motion Control Policy Learning for Humanoid Characters (虚拟人运动控制策略学习方法的研究进展与展望)

Abstract: Motion synthesis for humanoid characters, which aims to generate realistic and natural motion sequences that respond to user input, is a key problem in virtual reality and character animation. Motion control policies compute joint torques from user input constraints and rely on existing physics engines to update the character state, so the synthesized motion sequences satisfy the user constraints while remaining physically realistic. In recent years, deep reinforcement learning has attracted considerable attention for its strong performance in sequential decision-making and interactive tasks, and it offers a new approach to learning physics-based control policies for humanoid characters. This paper reviews motion control policy learning methods for humanoid characters, covering related research from both theoretical foundations and application design. On the application design side, existing work is first organized around the basic elements of deep reinforcement learning, from four perspectives: state representation, reward function design, control policy design, and physical simulation. Next, existing general technical frameworks are analyzed and directions for extending their control policies are identified, and concrete applications of motion control policies are discussed using practical problems as examples. Finally, the paper summarizes the current state of research, identifies leveraging abundant motion capture data to improve the depth and breadth of motion control policies as the main direction for future research, and outlines prospects for applications in multimodal perception and control, world model learning, and embodied intelligence.

Authors: Ye Yongjing (叶永竞); Xu Yiwen (许逸文); Zhang Zihao (张子豪); Hu Lei (胡磊); Xia Shihong (夏时洪) (Prospective Research Laboratory, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; University of Chinese Academy of Sciences, Beijing 100049)

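Affiliations: Prospective Research Laboratory, Institute of Computing Technology, Chinese Academy of Sciences; University of Chinese Academy of Sciences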

Source: Journal of Computer-Aided Design & Computer Graphics (《计算机辅助设计与图形学学报》), Peking University Core Journal, 2025, Issue 2, pp. 185-206 (22 pages)

Funding: Key Special Project on Industrial Software (2022YFB3303202); National Natural Science Foundation of China (62302481).

Keywords: humanoid character; motion control policy; reinforcement learning; deep learning; character animation

Classification number: TP391.41 [Automation and Computer Technology: Computer Application Technology]

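Author biographies: Ye Yongjing (b. 1995), male, Ph.D. candidate; research interests: computer graphics and virtual reality. Xu Yiwen (b. 1997), male, Ph.D. candidate; research interests: computer graphics and virtual reality. Zhang Zihao (b. 1993), male, Ph.D., assistant researcher; research interests: computer graphics and computer vision. Hu Lei (b. 1997), male, Ph.D. candidate; research interests: computer graphics and 3D human motion modeling and generation. Corresponding author: Xia Shihong (b. 1974), male, Ph.D., researcher, doctoral supervisor, CCF senior member; research interests: computer graphics, virtual reality, and artificial intelligence. Email: xsh@ict.ac.cn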
