检索结果-维普期刊中文期刊服务平台

基于多时间尺度特征的语音识别模型: 1; 作者韩疆尹宝林《北京航空航天大学学报》 EI CAS CSCD 北大核心 2000年第2期201-205,共5页; 提出了基于多时间尺度特征的语音识别模型 .该模型采用描述谱参数轨迹的段特征 ,在段尺度上实现了对语音信号帧间相关性的显式建模 ;采用段特征依赖的非平稳时间序列产生模型 ,实现了不同尺度特征间的相关性建模 ,并在帧尺度上通过参数... 展开更多; 关键词语音识别模型帧间相关笥多时间尺度段特征; 在线阅读下载PDF 职称材料

几种小训练样本集的数字语音识别模型的比较性研究被引量：1: 2; 作者贺苏宁虞厥邦《计算机科学》 CSCD 北大核心 2005年第9期170-175,共6页; 本文通过对小训练样本集的基于DTW结构的数字语音识别模型的比较性分析,指出其存在的三个一般性问题:(1)DTW逐帧匹配模式割裂了观测向量序列的内在联系;(2)压扩观测向量序列造成局部信息使用的不均匀;(3)计算复杂度高,识别率低。为了解... 展开更多; 关键词训练样本集数字语音识别模型置信度评估自适应反馈学习 DTW 匹配模式; 在线阅读下载PDF 职称材料

双模型语音识别中的听视觉合成和模型同步异步性实验研究被引量：3: 3; 作者谢磊蒋冬梅 +4 位作者 Ilse Ravyse 赵荣椿 Hichem Sahli Werner Verhelst Jan Cornelis 《西北工业大学学报》 EI CAS CSCD 北大核心 2004年第2期171-175,共5页; 研究了双模型语音识别系统中前合成和后合成两种听觉视觉合成方法 ;同时在后合成方法中引入了考虑听觉和视觉同步异步特点的复合模型。仿真实验证明了在声学噪音环境下 ,后合成方法能够带来比较理想的识别效果 ;考虑听觉和视觉同步异步... 展开更多; 关键词语音识别双模型语音识别听觉视觉合成模型同步异步性; 在线阅读下载PDF 职称材料

Improved hidden Markov model for speech recognition and POS tagging 被引量：4: 4; 作者袁里驰《Journal of Central South University》 SCIE EI CAS 2012年第2期511-516,共6页; In order to overcome defects of the classical hidden Markov model (HMM), Markov family model (MFM), a new statistical model was proposed. Markov family model was applied to speech recognition and natural language proc... 展开更多; 关键词 hidden Markov model Markov family model speech recognition part-of-speech tagging; 在线阅读下载PDF 职称材料

A new formant feature and its application in Mandarin vowel pronunciation quality assessment: 5; 作者卢小春潘复平 +1 位作者尹俊勋胡维平《Journal of Central South University》 SCIE EI CAS 2013年第12期3573-3581,共9页; In order to improve the Mandarin vowel pronunciation quality assessment, a nox/el formant feature was proposed and applied to formant classification for Chinese Mandarin vowel pronunciation quality evaluation. Formant... 展开更多; 关键词 computer assisted language learning speech recognition Gaussian mixture model FORMANT Gabor feature NEURALNETWORK; 在线阅读下载PDF 职称材料

题名基于多时间尺度特征的语音识别模型: 1; 作者韩疆尹宝林; 机构北京航空航天大学计算机科学与工程系; 出处《北京航空航天大学学报》 EI CAS CSCD 北大核心 2000年第2期201-205,共5页; 文摘提出了基于多时间尺度特征的语音识别模型 .该模型采用描述谱参数轨迹的段特征 ,在段尺度上实现了对语音信号帧间相关性的显式建模 ;采用段特征依赖的非平稳时间序列产生模型 ,实现了不同尺度特征间的相关性建模 ,并在帧尺度上通过参数化的均值轨迹函数 ,实现了对语音信号帧间相关性的隐式建模 .给出了基于多时间尺度特征联合统计距离优化的分段算法及基于最大似然准则的模型参数估计算法 .识别实验表明 ,该模型的识别性能优于标准HMM及趋势HMM .; 关键词语音识别模型帧间相关笥多时间尺度段特征; Keywords speech recognition feature extraction correlations multiple time scale non stationary time series segmental feature; 分类号 TN912.34 [电子电信—通信与信息系统]; 在线阅读下载PDF 职称材料

题名几种小训练样本集的数字语音识别模型的比较性研究被引量：1: 2; 作者贺苏宁虞厥邦; 机构电子科技大学电子工程学院; 出处《计算机科学》 CSCD 北大核心 2005年第9期170-175,共6页; 文摘本文通过对小训练样本集的基于DTW结构的数字语音识别模型的比较性分析,指出其存在的三个一般性问题:(1)DTW逐帧匹配模式割裂了观测向量序列的内在联系;(2)压扩观测向量序列造成局部信息使用的不均匀;(3)计算复杂度高,识别率低。为了解决这些问题,我们提出了基于数字语音时频信息整体结构的单特征向量识别模型。这种模型完整地利用了观测向量序列的全部信息,结合置信度评估和自适应反馈学习之后可及时地吸收测试向量携带的新的环境特征信息,调整识别模型结构。该模型的错识率较之最好的基于DTW结构的混合城模型的错识率降低50％以上,计算复杂度则是固定帧长模型的 13.12％。; 关键词训练样本集数字语音识别模型置信度评估自适应反馈学习 DTW 匹配模式; Keywords HMM, DTW, MFCC, Observation vector, Confidence measure, Self- adaptive feedback learning; 分类号 TP391.4 [自动化与计算机技术—计算机应用技术]; 在线阅读下载PDF 职称材料

题名双模型语音识别中的听视觉合成和模型同步异步性实验研究被引量：3: 3; 作者谢磊蒋冬梅 Ilse Ravyse 赵荣椿 Hichem Sahli Werner Verhelst Jan Cornelis; 机构西北工业大学计算机科学与工程系布鲁塞尔自由大学电子与信息处理系; 出处《西北工业大学学报》 EI CAS CSCD 北大核心 2004年第2期171-175,共5页; 基金中国科技部与比利时弗拉芒大区科技合作项目 (国科外字 19990 2 0 9); 文摘研究了双模型语音识别系统中前合成和后合成两种听觉视觉合成方法 ;同时在后合成方法中引入了考虑听觉和视觉同步异步特点的复合模型。仿真实验证明了在声学噪音环境下 ,后合成方法能够带来比较理想的识别效果 ;考虑听觉和视觉同步异步性的模型可以有效地提高识别率。; 关键词语音识别双模型语音识别听觉视觉合成模型同步异步性; Keywords speech recognition, audio visual fusion, model asynchrony; 分类号 TN912.3 [电子电信—通信与信息系统]; 在线阅读下载PDF 职称材料

题名Improved hidden Markov model for speech recognition and POS tagging 被引量：4: 4; 作者袁里驰; 机构 School of Information Technology School of Information Science and Engineering; 出处《Journal of Central South University》 SCIE EI CAS 2012年第2期511-516,共6页; 基金 Project(60763001)supported by the National Natural Science Foundation of China Projects(2009GZS0027,2010GZS0072)supported by the Natural Science Foundation of Jiangxi Province,China; 文摘 In order to overcome defects of the classical hidden Markov model (HMM), Markov family model (MFM), a new statistical model was proposed. Markov family model was applied to speech recognition and natural language processing. The speaker independently continuous speech recognition experiments and the part-of-speech tagging experiments show that Markov family model has higher performance than hidden Markov model. The precision is enhanced from 94.642% to 96.214% in the part-of-speech tagging experiments, and the work rate is reduced by 11.9% in the speech recognition experiments with respect to HMM baseline system.; 关键词 hidden Markov model Markov family model speech recognition part-of-speech tagging; Keywords 隐马尔可夫模型连续语音识别词性标注自然语言处理统计模型基线系统 HMM 实验; 分类号 TN912.34 [电子电信—通信与信息系统] TP391 [自动化与计算机技术—计算机应用技术]; 在线阅读下载PDF 职称材料

题名A new formant feature and its application in Mandarin vowel pronunciation quality assessment: 5; 作者卢小春潘复平尹俊勋胡维平; 机构 School of Electronic and Information Engineering College of Computer and Information Technology ThinkIT Laboratory College of Electronic Engineering; 出处《Journal of Central South University》 SCIE EI CAS 2013年第12期3573-3581,共9页; 基金 Project(61062011)supported by the National Natural Science Foundation of China Project(2010GXNSFA013128)supported by the Natural Science Foundation of Guangxi Province,China; 文摘 In order to improve the Mandarin vowel pronunciation quality assessment, a nox/el formant feature was proposed and applied to formant classification for Chinese Mandarin vowel pronunciation quality evaluation. Formant candidates of each frame were plotted on the time-frequency plane to form a bitmap, and its Gabor feature was extracted to represent the formant trajectory. The feature was then classified by using GMM model and the classification posterior probability was mapped to pronunciation quality grade. The experiments of comparing the Gabor transformation based formant trajectory feature with several other kinds of traditionally used features show that with this method, a human-machine scoring correlation coefficient （CC） of 0.842 can be achieved, which is better than the result of 0.832 by traditional speech recognition techniques. At the same time, considering that the long-term information of formant classification and the short-term information of speech recognition technique are complementary to each other, it is investigated to combine their results with linear or nonlinear methods to further improve the evaluation performance. As a result, experiments on PSK show that the best CC of 0.913, which is very close to the correlation of inter-human rating of 0.94, is gotten by using neural network.; 关键词 computer assisted language learning speech recognition Gaussian mixture model FORMANT Gabor feature NEURALNETWORK; Keywords 质量评价共振峰发音元音应用 Gabor变换语音识别技术模型分类; 分类号 TN912.3 [电子电信—通信与信息系统]; 在线阅读下载PDF 职称材料

	题名	作者	出处	发文年	被引量	操作
1	基于多时间尺度特征的语音识别模型	韩疆尹宝林	《北京航空航天大学学报》 EI CAS CSCD 北大核心	2000	0	在线阅读下载PDF 职称材料
2	几种小训练样本集的数字语音识别模型的比较性研究	贺苏宁虞厥邦	《计算机科学》 CSCD 北大核心	2005	1	在线阅读下载PDF 职称材料
3	双模型语音识别中的听视觉合成和模型同步异步性实验研究	谢磊蒋冬梅 Ilse Ravyse 赵荣椿 Hichem Sahli Werner Verhelst Jan Cornelis	《西北工业大学学报》 EI CAS CSCD 北大核心	2004	3	在线阅读下载PDF 职称材料
4	Improved hidden Markov model for speech recognition and POS tagging	袁里驰	《Journal of Central South University》 SCIE EI CAS	2012	4	在线阅读下载PDF 职称材料
5	A new formant feature and its application in Mandarin vowel pronunciation quality assessment	卢小春潘复平尹俊勋胡维平	《Journal of Central South University》 SCIE EI CAS	2013	0	在线阅读下载PDF 职称材料