Abstract
Through collecting and analyzing speech signals and electrocardiogram (ECG) signals, the corresponding emotion features and fusion algorithms are studied. First, annoyance is induced by noise stimulation and happiness is induced by comedy movie clips, and the corresponding speech and ECG signals are recorded. Then, prosodic and voice quality features are adopted as speech emotion features, and heart rate variability (HRV) features are used as ECG emotion features. Finally, decision-level fusion and feature-level fusion are accomplished by the weighted fusion method and the feature-space transformation method, respectively, and the performances of the two fusion methods in speech and ECG emotion recognition are compared. The experimental results show that on the same testing set, the average recognition rates of the single-modal classifiers based on ECG signals and on speech signals reach 71% and 80%, respectively, while the multimodal classifier with feature-level fusion of the speech and ECG signals achieves above 90%. The average recognition rate of the feature-level fusion algorithm is higher than that of the decision-level fusion algorithm. Therefore, emotion features from different signal channels, such as speech and ECG signals, can be combined to build a reliable emotion recognition system.
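The abstract contrasts two fusion strategies: weighted fusion of unimodal classifier outputs at the decision level, and combining feature vectors before classification at the feature level. The following is a minimal sketch of both ideas; the function names, weights, and toy scores are illustrative assumptions, not the paper's actual implementation (the paper applies a feature-space transformation at the feature level, which simple concatenation here only approximates).

```python
def decision_level_fusion(speech_probs, ecg_probs, w_speech=0.6, w_ecg=0.4):
    """Weighted fusion of per-class scores from two unimodal classifiers.

    The weights are hypothetical; in practice they would be tuned on
    validation data.
    """
    return [w_speech * s + w_ecg * e for s, e in zip(speech_probs, ecg_probs)]

def feature_level_fusion(speech_features, ecg_features):
    """Combine feature vectors before classification.

    Plain concatenation is shown here as the simplest case; the paper
    uses a feature-space transformation instead.
    """
    return list(speech_features) + list(ecg_features)

# Toy example with class order [annoyance, happiness]:
speech_probs = [0.3, 0.7]  # hypothetical speech classifier output
ecg_probs = [0.6, 0.4]     # hypothetical ECG classifier output
fused = decision_level_fusion(speech_probs, ecg_probs)
label = max(range(len(fused)), key=lambda i: fused[i])  # index of the winning class
```

With these toy numbers the fused scores are [0.42, 0.58], so the decision-level system predicts the second class even though the two unimodal classifiers disagree.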
Source
Journal of Southeast University (Natural Science Edition) (《东南大学学报(自然科学版)》)
Indexed in: EI, CAS, CSCD, Peking University Core Journals (北大核心)
2010, No. 5, pp. 895-900 (6 pages)
Funding
National Natural Science Foundation of China (60472058, 60975017)
Natural Science Foundation of Jiangsu Province (BK2008291)
Keywords
emotion recognition; multimodal; decision level fusion; feature level fusion
About the Authors
Huang Chengwei (b. 1984), male, PhD candidate.
Zhao Li (corresponding author), male, PhD, professor, doctoral supervisor, zhaoli@seu.edu.cn.