Research on action recognition based on two-stream convolution and double center loss (cited by: 3)
Abstract: To address the low recognition accuracy caused by large intra-class variation and small inter-class variation among similar actions in action videos, an action recognition method based on a two-stream convolutional network and a double-center loss is proposed. The method first builds a two-stream convolutional network that uses the C3Dnet model as the base model of both streams, extracting apparent short-term motion information from multi-scale RGB video frames and long-term motion information from stacked optical flow maps, respectively. The deep features extracted by the two streams are then parsed by a long short-term memory (LSTM) network and fused. Finally, a 2C-softmax objective function based on the double-center loss is used to maximize the inter-class distance and minimize the intra-class distance, so that similar actions can be classified and recognized. Experimental results on the KTH dataset show that the method accurately recognizes similar actions, with a recognition accuracy of up to 98.2%.
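The abstract does not give the formula of the 2C-softmax objective, so the following is only a minimal sketch of the general idea it describes: a softmax cross-entropy term combined with one term that pulls features toward their class center and one that pushes class centers apart. The intra-class term follows the commonly used center loss form; the inter-class hinge term, the function names, the weights `lam_intra` and `lam_inter`, and `margin` are all assumptions made for illustration and are not taken from the paper.

```python
import math

def softmax_cross_entropy(logits, label):
    """Cross-entropy of one sample, computed from raw class scores."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def intra_class_loss(features, labels, centers):
    """Mean of 0.5 * ||x_i - c_{y_i}||^2 (standard center loss form)."""
    total = sum(0.5 * sq_dist(x, centers[y]) for x, y in zip(features, labels))
    return total / len(features)

def inter_class_loss(centers, margin):
    """Hypothetical hinge penalty on pairs of centers whose squared
    distance is below `margin`; an assumption, not the paper's term."""
    pairs = [(i, j) for i in range(len(centers))
             for j in range(i + 1, len(centers))]
    total = sum(max(0.0, margin - sq_dist(centers[i], centers[j]))
                for i, j in pairs)
    return total / len(pairs)

def total_loss(logits_batch, features, labels, centers,
               lam_intra=0.1, lam_inter=0.1, margin=1.0):
    """Cross-entropy plus weighted intra-class and inter-class terms."""
    ce = sum(softmax_cross_entropy(l, y)
             for l, y in zip(logits_batch, labels)) / len(labels)
    return (ce
            + lam_intra * intra_class_loss(features, labels, centers)
            + lam_inter * inter_class_loss(centers, margin))
```

In this sketch the class centers are passed in as fixed values; in training they would normally be learned or updated alongside the network weights, and how the paper handles that is not stated in the abstract.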
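Authors: MAO Zhi-qiang (毛志强); MA Cui-hong (马翠红); CUI Jin-long (崔金龙); WANG Yi (王毅) (College of Electrical Engineering, North China University of Science and Technology, Tangshan 063210, China; Beijing Jiaotong University Haibin College, Cangzhou 061100, China)
Source: Microelectronics & Computer (《微电子学与计算机》), Peking University Core Journal, 2019, No. 3, pp. 96-100 (5 pages)
Funding: National Natural Science Foundation of China (61171058)
Keywords: two-stream convolution network; center loss; long short-term memory (LSTM); optical flow map
About the authors: MAO Zhi-qiang, male, born 1991, master's student; research interests: computer vision, object detection, and human action analysis. Corresponding author: MA Cui-hong, female, born 1960, professor; research interests: modeling and control of complex industrial systems, image recognition, and video analysis. E-mail: 864404484@qq.com. CUI Jin-long, male, born 1989, master's degree, teaching assistant; research interest: steel composition measurement. WANG Yi, male, born 1994, master's student; research interests: computer vision, object detection, and video analysis.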