中文版 | English
题名

Comparing the Contributions of Amplitude and Phase to Speech Intelligibility in a Vocoder-based Speech Synthesis Model

作者
通讯作者Chen, Fei
DOI
发表日期
2016
ISSN
19909772
会议录名称
卷号
08-12-September-2016
页码
1355-1358
会议地点
San Francisco, CA, United states
出版地
C/O EMMANUELLE FOXONET, 4 RUE DES FAUVETTES, LIEU DIT LOUS TOURILS, BAIXAS, F-66390, FRANCE
出版者
摘要
Vocoder-based speech synthesis model has been long used to assess the contribution of acoustic cue for speech recognition. This study compared the perceptual contributions of amplitude and phase by using two types of stimuli, i.e., amplitude- and phase-based vocoded stimuli. The amplitude-based vocoded stimuli were synthesized by preserving amplitude fluctuation cue but discarding phase cue (i.e., setting phase to zero), while the phase-based vocoded stimuli were synthesized by preserving phase cue and discarding amplitude cue (i.e., setting amplitude to unit). Listening experiments with normal hearing participants showed consistent findings with earlier studies that the intelligibility scores of both amplitude- and phase-based vocoded stimuli increased when using a large number of channels in vocoder-based speech synthesis. In addition, at all tested conditions, the intelligibility scores of amplitude-based vocoded stimuli were significantly larger than those of phase-based vocoded stimuli, suggesting that amplitude might carry more perceptual contribution than phase. This intelligibility advantage of amplitude over phase may be attributed to the difference in the amount of envelope information contained in the two types of vocoded stimuli.
关键词
学校署名
第一 ; 通讯
语种
英语
相关链接[来源记录]
收录类别
资助项目
National Natural Science Foundation of China[61571213]
WOS研究方向
Acoustics ; Computer Science ; Engineering ; Linguistics
WOS类目
Acoustics ; Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic ; Linguistics
WOS记录号
WOS:000409394400283
EI入藏号
20164603003709
EI主题词
Speech communication ; Speech processing ; Speech recognition ; Speech synthesis ; Vocoders
EI分类号
Speech:751.5 ; Sound Recording:752.2
来源库
Web of Science
引用统计
被引频次[WOS]:0
成果类型会议论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/24947
专题工学院_电子与电气工程系
作者单位
1.Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen, Peoples R China
2.Univ Hong Kong, Div Speech & Hearing Sci, Hong Kong, Hong Kong, Peoples R China
第一作者单位电子与电气工程系
通讯作者单位电子与电气工程系
第一作者的第一单位电子与电气工程系
推荐引用方式
GB/T 7714
Chen, Fei,Chiao, Benson C. L.,Int Speech Commun Assoc. Comparing the Contributions of Amplitude and Phase to Speech Intelligibility in a Vocoder-based Speech Synthesis Model[C]. C/O EMMANUELLE FOXONET, 4 RUE DES FAUVETTES, LIEU DIT LOUS TOURILS, BAIXAS, F-66390, FRANCE:ISCA-INT SPEECH COMMUNICATION ASSOC,2016:1355-1358.
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Chen, Fei]的文章
[Chiao, Benson C. L.]的文章
[Int Speech Commun Assoc]的文章
百度学术
百度学术中相似的文章
[Chen, Fei]的文章
[Chiao, Benson C. L.]的文章
[Int Speech Commun Assoc]的文章
必应学术
必应学术中相似的文章
[Chen, Fei]的文章
[Chiao, Benson C. L.]的文章
[Int Speech Commun Assoc]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。