
Research on a Multi-Feature-Fusion Automatic Scoring System for Spoken English Tests (cited by: 11)

Research for Automatic Short Answer Scoring in Spoken English Test Based on Multiple Features
Abstract: This paper addresses automatic scoring of the question-and-answer item type in large-scale spoken English tests using a multi-feature fusion approach. Working on text produced by Automatic Speech Recognition (ASR), three kinds of features are extracted for scoring: similarity features, syntactic (parser) features, and speech features. Nine features in total characterize the relationship between candidates' answers and expert scores from different aspects. Among the similarity features, the Manhattan distance is converted into a similarity measure to improve scoring performance. In addition, a keyword coverage rate based on edit distance is proposed; it accounts for the word-variation phenomena present in recognized text and provides a basis for giving candidates an objective and fair score. All extracted features are fused with a multiple linear regression model to produce the machine score. Experimental results show that the extracted features are highly effective for machine scoring, and that the system's scoring performance at the speaker level reaches 98.4% of that of the human raters.
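The three feature ideas named in the abstract can be sketched as follows. The exact similarity transform, the edit-distance threshold, and the function names are assumptions for illustration, since the paper's formulas are not reproduced on this record page:

```python
import numpy as np

def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via dynamic programming (rolling row)."""
    n = len(b)
    dp = list(range(n + 1))
    for i in range(1, len(a) + 1):
        prev, dp[0] = dp[0], i
        for j in range(1, n + 1):
            cur = dp[j]
            dp[j] = min(dp[j] + 1,                          # deletion
                        dp[j - 1] + 1,                      # insertion
                        prev + (a[i - 1] != b[j - 1]))      # substitution
            prev = cur
    return dp[n]

def manhattan_similarity(u, v):
    """Turn a Manhattan (L1) distance into a similarity in (0, 1].
    The 1/(1+d) transform is an assumption, not the paper's formula."""
    d = sum(abs(x - y) for x, y in zip(u, v))
    return 1.0 / (1.0 + d)

def keyword_coverage(answer_words, keywords, max_dist=1):
    """Fraction of keywords matched by some recognized word within
    edit distance max_dist, tolerating ASR word variation."""
    hits = sum(
        any(edit_distance(w, kw) <= max_dist for w in answer_words)
        for kw in keywords
    )
    return hits / len(keywords)

def fit_score_fuser(X, y):
    """Fuse feature values into a score via multiple linear regression
    (ordinary least squares with an intercept column)."""
    Xb = np.hstack([np.ones((len(X), 1)), np.asarray(X, dtype=float)])
    w, *_ = np.linalg.lstsq(Xb, np.asarray(y, dtype=float), rcond=None)
    return w  # w[0] is the intercept, w[1:] the feature weights
```

For example, `keyword_coverage(["cats", "sat"], ["cat", "dog"])` credits "cat" because "cats" is within one edit, giving a coverage of 0.5; the fuser would then be fit on such feature vectors against expert scores.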
Source: Journal of Electronics & Information Technology (《电子与信息学报》; indexed in EI and CSCD, Peking University core list), 2012, Issue 9: 2097-2102 (6 pages).
Funding: Supported by the National Natural Science Foundation of China (10925419, 90920302, 10874203, 60875014, 61072124, 11074275, 11161140319).
Keywords: Automatic Speech Recognition (ASR); automatic scoring; feature selection; similarity measure; parser tree
About the authors — Corresponding author: Li Yanling, liyanling@hccl.ioa.ac.cn. Li Yanling: female, born 1978, lecturer and Ph.D. candidate; research interests include signal processing and natural language processing. Yan Yonghong: male, born 1967, professor; research interests include speech recognition, language identification, and speech signal processing.
Related Literature

References (14)

1. Mohler M, Mihalcea R. Learning to grade short answer questions using semantic similarity measures and dependency graph alignments[C]. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, USA, 2011: 752-762.
2. Ding Keyu, Li Zhaoyuan, Liu Fei, et al. Research on automatic grammar scoring for large-scale spoken English tests[C]. Proceedings of the 12th China Conference on Machine Learning, Jinan, 2010: 1-7. (in Chinese)
3. Chen M, Zechner K. Computing and evaluating syntactic complexity features for automated scoring of spontaneous non-native speech[C]. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL), Portland, USA, 2011: 722-731.
4. Butcher P G, Jordan S E. A comparison of human and computer marking of short free-text student responses[J]. Computers & Education, 2010, 55(2): 489-499.
5. Leacock C, Chodorow M. C-rater: automated scoring of short-answer questions[J]. Computers and the Humanities, 2003, 37(4): 389-405.
6. Valenti S, Neri F, Cucchiarelli A. An overview of current research on automated essay grading[J]. Journal of Information Technology Education, 2003, 2: 319-330.
7. Siddiqi R, Harrison C J. On the automated assessment of short free-text responses[C]. International Association for Educational Assessment (IAEA), Cambridge, UK, 2008: 1-11.
8. Sukkarieh J Z, Pulman S G, Raikes N. Auto-marking: using computational linguistics to score short, free-text responses[C]. Proceedings of the 29th Annual Conference of the International Association for Educational Assessment, Manchester, UK, 2003: 1-15.
9. Achananuparp P, Hu Xiaohua, Shen Xiajiong. The evaluation of sentence similarity measures[C]. Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery, Berlin, Germany, 2008, 5182: 305-316.
10. Richard K, Angelo K, Mayya T. Automated assessment of short free-text responses in computer science using latent semantic analysis[C]. Proceedings of the 16th Annual Joint Conference on Innovation and Technology in Computer Science Education (ITiCSE), Darmstadt, Germany, 2011: 158-162.

Co-cited references: 78

Citing articles: 11

Second-level citing articles: 52
