网站首页  |  期刊介绍  |  编委会  |  投稿指南  |  在线订阅  |  联系我们English
李强,张玲,朱兰,明艳.一种甚低码率声码器的设计[J].重庆邮电大学学报(自然科学版),2018,30(6):776-782. 本文二维码信息
二维码(扫一下试试看!)
一种甚低码率声码器的设计
Design of an ultra-low bit rate vocoder
投稿时间:2017-07-12  修订日期:2018-03-03
DOI: 10.3979/j.issn.1673-825X.2018.06.007
中文关键词:  混合激励线性预测(MELP)  多帧联合量化  矢量量化器  性能测试
English Keywords:mixed excitation linear prediction(MELP)  multi-frame joint quantization  vector quantization  performance test
基金项目:国家高技术研究发展计划(“863”计划)(2012AA01A508)
作者单位E-mail
李强 重庆邮电大学 信号与信息处理重庆市重点实验室,重庆 400065 liqiang@cqupt.edu.cn 
张玲 重庆邮电大学 信号与信息处理重庆市重点实验室,重庆 400065 994282280@qq.com 
朱兰 重庆邮电大学 信号与信息处理重庆市重点实验室,重庆 400065 879239099@qq.com 
明艳 重庆邮电大学 信号与信息处理重庆市重点实验室,重庆 400065 mingyan@cqupt.edu.cn 
摘要点击次数: 148
全文下载次数: 81
中文摘要:
      在混合激励线性预测 (mixed excitation linear prediction, MELP) 模型的基础上,以超帧为单位,采用多帧联合编码技术,分模式对子帧的语音特征参数进行联合量化,实现了一种码率为600 bit/s的声码器。为了进一步减小量化误差,设计出了一种基于高斯混合模型的预测分类分裂矢量量化器(predictive switched split vector quantization based on Gauss mixture model, GMM-PSSVQ),该量化器对超帧中某些子帧的线谱频率进行量化,并利用帧间预测和线性插值等方法提高编码效率。采用谱失真对设计的矢量量化器进行性能评估,并分别与多级矢量量化和预测分裂矢量量化算法进行性能比较;通过客观感知语音质量评估和主观判断韵字测试对实现的声码器进行性能测试。测试结果表明,设计的矢量量化器平均谱失真最低,实现的声码器合成语音具有较高的清晰度和可懂度。
English Summary:
      Based on the mixed excitation linear prediction (MELP) model, this paper designs a vocoder with a bit rate of 600 bit/s. It adopts a multi-frame joint coding technique with the super frame, and then through the divided model to realize joint quantification for the speech feature parameters of sub frames in the super frame. To deal with the problem that the performance of the existing vector quantization is non-optimal, a predictive switched split vector quantization based on Gauss mixture model (GMM-PSSVQ) is adopted. It quantizes the line spectrum frequency of some sub frames and uses the inter prediction and linear interpolation method to improve the coding efficiency. The performance of the designed vector quantization is evaluated by spectral distortion and it is compared with the multistage vector quantization and predictive splitting vector quantization. The performance of the vocoder is tested by the perceptual evaluation of speech quality and Diagnostic Rhymer Test. Experimental results show that the proposed algorithm has the lowest average spectral distortion, and the speech synthesized by the vocoder proposed in this thesis has high clarity and intelligibility.
HTML    PDF浏览   查看/发表评论  下载PDF阅读器
版权所有 © 2009 重庆邮电大学期刊社  
地址:重庆市 南岸区 重庆邮电大学 期刊社 邮编:400065
电话:023-62461032 E-mail : journal@cqupt.edu.cn
meinv 海贼王论坛