To improve the assessment of Mandarin vowel pronunciation quality, a novel formant feature was proposed and applied to formant classification for Chinese Mandarin vowel pronunciation quality evaluation. Formant candidates of each frame were plotted on the time-frequency plane to form a bitmap, and its Gabor feature was extracted to represent the formant trajectory. The feature was then classified with a Gaussian mixture model (GMM), and the classification posterior probability was mapped to a pronunciation quality grade. Experiments comparing the Gabor-transform-based formant trajectory feature with several traditionally used features show that this method achieves a human-machine scoring correlation coefficient (CC) of 0.842, better than the 0.832 obtained with traditional speech recognition techniques. Because the long-term information of formant classification and the short-term information of speech recognition are complementary, combining their results with linear or nonlinear methods was investigated to further improve evaluation performance. Experiments on PSK show that the best CC of 0.913, which is very close to the inter-human rating correlation of 0.94, is obtained by using a neural network.
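The bitmap and Gabor steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the 64-bin frequency axis, the 5000 Hz ceiling, and the Gabor parameters are all assumptions chosen for illustration, and the pooled statistics (mean magnitude and standard deviation per orientation) are a stand-in for whatever Gabor feature the paper actually uses.

```python
import numpy as np

def formant_bitmap(candidates, n_frames, max_hz=5000.0, n_bins=64):
    """Rasterize per-frame formant candidates (in Hz) onto a
    time-frequency bitmap: rows are frequency bins, columns are frames."""
    bitmap = np.zeros((n_bins, n_frames), dtype=float)
    for t, freqs in enumerate(candidates):
        for f in freqs:
            if 0.0 <= f < max_hz:
                bitmap[int(f / max_hz * n_bins), t] = 1.0
    return bitmap

def gabor_kernel(size, wavelength, theta, sigma):
    """Real-valued 2-D Gabor kernel: Gaussian envelope times a cosine
    carrier oriented at angle theta (illustrative parameterization)."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    xr = x * np.cos(theta) + y * np.sin(theta)
    yr = -x * np.sin(theta) + y * np.cos(theta)
    envelope = np.exp(-(xr ** 2 + yr ** 2) / (2.0 * sigma ** 2))
    return envelope * np.cos(2.0 * np.pi * xr / wavelength)

def gabor_features(bitmap, thetas=(0.0, np.pi / 4, np.pi / 2, 3 * np.pi / 4),
                   size=9, wavelength=6.0, sigma=3.0):
    """Filter the bitmap at several orientations and pool each response
    into two statistics, giving a fixed-length feature vector."""
    feats = []
    spectrum = np.fft.rfft2(bitmap)
    for theta in thetas:
        k = gabor_kernel(size, wavelength, theta, sigma)
        # FFT-based circular convolution; the kernel is zero-padded to the bitmap shape.
        resp = np.fft.irfft2(spectrum * np.fft.rfft2(k, s=bitmap.shape),
                             s=bitmap.shape)
        feats.extend([np.abs(resp).mean(), resp.std()])
    return np.array(feats)

# Hypothetical input: 20 frames, each with three formant candidates in Hz.
candidates = [[700.0, 1200.0, 2500.0]] * 20
bitmap = formant_bitmap(candidates, n_frames=20)
features = gabor_features(bitmap)
```

A fixed-length vector like `features` is the kind of input a GMM classifier could consume; the classification and grade-mapping stages are not sketched here because the abstract gives no details about them.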
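To address the problem that gait recognition is easily affected by factors such as camera viewpoint and changes in appearance, a gait recognition algorithm that fuses a point-cloud gait model with deep learning is proposed. The algorithm extracts image features with a lightweight feature descriptor (LFD) and registers them; it extracts human key-point information with a geometry-matching-kernel preprocessing-enhanced recognition technique (gait model-key point recognition and extraction, GM-KPRE), and introduces a radial basis function kernel into the support vector machine algorithm for gait classification and recognition. Experiments on the public datasets CASIA-B and Market-1501-v15.09.15 show that the algorithm effectively improves the accuracy and efficiency of gait recognition.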
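The radial basis function kernel mentioned for the support vector machine step can be written out as a short sketch. It shows only the standard kernel K(x, y) = exp(-gamma * ||x - y||^2) evaluated on feature vectors; the function name, the `gamma` value, and the toy key-point vectors are assumptions, and the abstract does not say how the kernel is parameterized or which SVM implementation is used.

```python
import numpy as np

def rbf_kernel(X, Y, gamma=0.5):
    """Gram matrix of the RBF kernel between the rows of X and the rows of Y."""
    # Squared Euclidean distances via ||x||^2 + ||y||^2 - 2 x.y
    sq_dist = (X ** 2).sum(axis=1)[:, None] + (Y ** 2).sum(axis=1)[None, :] - 2.0 * X @ Y.T
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))

# Hypothetical key-point feature vectors for three gait samples.
samples = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]])
gram = rbf_kernel(samples, samples)
```

Each entry of `gram` approaches 1 for nearly identical samples and decays toward 0 as they move apart, which is what lets a kernel SVM draw nonlinear decision boundaries between gait classes.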
Funding: Project (61062011) supported by the National Natural Science Foundation of China; Project (2010GXNSFA013128) supported by the Natural Science Foundation of Guangxi Province, China