中文版 | English
题名

Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus

作者
通讯作者Chen, Fei
发表日期
2016-07
DOI
发表期刊
ISSN
0167-6393
EISSN
1872-7182
卷号81页码:120-128
摘要
Temporal envelope and fine structure are two prominent acoustic cues for speech perception. Most existing speech-transmission-index based metrics make use of the temporal envelope information and discard the temporal fine structure (TFS) cue to predict speech intelligibility. Recent studies have shown that the TFS stimulus synthesized with multiband TFS waveforms contains rich intelligibility information, which is reflected as the recovered envelope from the TFS stimulus. The present study first assessed the performance of using the recovered envelope from the synthesized TFS stimulus to predict the intelligibility of noise-distorted and noise-suppressed speech. The TFS stimulus was synthesized and fed as an input into the conventional normalized covariance measure (NCM) module. The results showed that the recovered envelope from the TFS stimulus predicted the intelligibility as well as the original envelope extracted from the wideband speech signal did. In addition, an additive intelligibility model was designed to combine the envelope from wideband speech and the recovered envelope from the TFS stimulus to predict speech intelligibility. The prediction power was significantly improved when these two envelope waveforms were integrated. The present study suggests that the recovered envelope from the TFS stimulus may be alternative acoustic information for modeling speech intelligibility and improving the prediction power of the conventional NCM-based intelligibility index. (C) 2016 Elsevier B.V. All rights reserved.
关键词
相关链接[来源记录]
收录类别
SCI ; EI
语种
英语
学校署名
第一 ; 通讯
资助项目
Ministry of Science and Technology of Taiwan[MOST 104-2221-E-001-026-MY2]
WOS研究方向
Acoustics ; Computer Science
WOS类目
Acoustics ; Computer Science, Interdisciplinary Applications
WOS记录号
WOS:000378440500008
出版者
EI入藏号
20161002068569
EI主题词
Acoustic noise ; Atomic physics ; Forecasting ; Recovery ; Speech communication ; Speech transmission
EI分类号
Acoustic Noise:751.4 ; Speech:751.5 ; Atomic and Molecular Physics:931.3
ESI学科分类
COMPUTER SCIENCE
来源库
Web of Science
引用统计
被引频次[WOS]:2
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/29567
专题工学院_电子与电气工程系
作者单位
1.Southern Univ Sci & Technol, Dept Elect & Elect Engn, Xueyuan Rd 1088, Shenzhen, Peoples R China
2.Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan
3.Yuan Ze Univ, Dept Elect Engn, Chungli, Taiwan
第一作者单位电子与电气工程系
通讯作者单位电子与电气工程系
第一作者的第一单位电子与电气工程系
推荐引用方式
GB/T 7714
Chen, Fei,Tsao, Yu,Lai, Ying-Hui. Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus[J]. SPEECH COMMUNICATION,2016,81:120-128.
APA
Chen, Fei,Tsao, Yu,&Lai, Ying-Hui.(2016).Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus.SPEECH COMMUNICATION,81,120-128.
MLA
Chen, Fei,et al."Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus".SPEECH COMMUNICATION 81(2016):120-128.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可 操作
Chen-2016-Modeling s(821KB)----限制开放--
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Chen, Fei]的文章
[Tsao, Yu]的文章
[Lai, Ying-Hui]的文章
百度学术
百度学术中相似的文章
[Chen, Fei]的文章
[Tsao, Yu]的文章
[Lai, Ying-Hui]的文章
必应学术
必应学术中相似的文章
[Chen, Fei]的文章
[Tsao, Yu]的文章
[Lai, Ying-Hui]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。