A Voice Conversion Scheme Based on Sinusoidal Plus Noise Model (基于正弦加噪声模型的说话人转换方法)

Cited by: 1

Abstract: This paper introduces a voice conversion approach based on a sinusoidal plus noise model and focuses on a parametric conversion algorithm that modifies the acoustic parameters within phoneme segments. By modifying both the pitch (fundamental frequency) and the formant structure, the synthesized speech effectively reproduces the target speaker's characteristics. Listening tests show that the similarity between the converted speech and the target speaker's speech reaches 78.8%. A comparison with the classical LPC approach confirms the advantage of this method in synthesized speech quality.

Authors: Xia Jing (夏菁), Yin Junxun (尹俊勋), Huang Jiancheng (黄建成), Huang Feng (黄锋)

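Affiliations: School of Electronic and Information Engineering, South China University of Technology; Motorola China Research Center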

Source: Audio Engineering (《电声技术》), 2005, No. 2, pp. 49-52 (4 pages)

Keywords: voice conversion; sinusoidal plus noise model; phoneme; pitch; formant

Classification: TN912.33 [Electronics and Telecommunications: Communication and Information Systems]
