Title | Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus |
Authors | Chen, Fei; Tsao, Yu; Lai, Ying-Hui |
Corresponding author | Chen, Fei |
Publication date | 2016-07 |
DOI | |
Journal | SPEECH COMMUNICATION |
ISSN | 0167-6393 |
EISSN | 1872-7182 |
Volume | 81, pages 120-128 |
Abstract | Temporal envelope and fine structure are two prominent acoustic cues for speech perception. Most existing speech-transmission-index based metrics make use of the temporal envelope information and discard the temporal fine structure (TFS) cue to predict speech intelligibility. Recent studies have shown that the TFS stimulus synthesized with multiband TFS waveforms contains rich intelligibility information, which is reflected as the recovered envelope from the TFS stimulus. The present study first assessed the performance of using the recovered envelope from the synthesized TFS stimulus to predict the intelligibility of noise-distorted and noise-suppressed speech. The TFS stimulus was synthesized and fed as an input into the conventional normalized covariance measure (NCM) module. The results showed that the recovered envelope from the TFS stimulus predicted the intelligibility as well as the original envelope extracted from the wideband speech signal did. In addition, an additive intelligibility model was designed to combine the envelope from wideband speech and the recovered envelope from the TFS stimulus to predict speech intelligibility. The prediction power was significantly improved when these two envelope waveforms were integrated. The present study suggests that the recovered envelope from the TFS stimulus may be alternative acoustic information for modeling speech intelligibility and improving the prediction power of the conventional NCM-based intelligibility index. (C) 2016 Elsevier B.V. All rights reserved. |
Keywords | |
Related links | [Source record] |
Indexed by | |
Language | English |
Institutional authorship | First; Corresponding |
Funding | Ministry of Science and Technology of Taiwan[MOST 104-2221-E-001-026-MY2] |
WOS research areas | Acoustics; Computer Science |
WOS categories | Acoustics; Computer Science, Interdisciplinary Applications |
WOS accession number | WOS:000378440500008 |
Publisher | |
EI accession number | 20161002068569 |
EI controlled terms | Acoustic noise; Atomic physics; Forecasting; Recovery; Speech communication; Speech transmission |
EI classification codes | Acoustic Noise:751.4; Speech:751.5; Atomic and Molecular Physics:931.3 |
ESI subject category | COMPUTER SCIENCE |
Source database | Web of Science |
Citation statistics | Times cited [WOS]: 2 |
Output type | Journal article |
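Item identifier | http://sustech.caswiz.com/handle/2SGJ60CL/29567 |
Collection | College of Engineering_Department of Electrical and Electronic Engineering |
Author affiliations | 1.Southern Univ Sci & Technol, Dept Elect & Elect Engn, Xueyuan Rd 1088, Shenzhen, Peoples R China 2.Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan 3.Yuan Ze Univ, Dept Elect Engn, Chungli, Taiwan |
First author affiliation | Department of Electrical and Electronic Engineering |
Corresponding author affiliation | Department of Electrical and Electronic Engineering |
First affiliation of the first author | Department of Electrical and Electronic Engineering |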
Recommended citation GB/T 7714 |
Chen, Fei,Tsao, Yu,Lai, Ying-Hui. Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus[J]. SPEECH COMMUNICATION,2016,81:120-128.
|
APA |
Chen, Fei,Tsao, Yu,&Lai, Ying-Hui.(2016).Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus.SPEECH COMMUNICATION,81,120-128.
|
MLA |
Chen, Fei,et al."Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus".SPEECH COMMUNICATION 81(2016):120-128.
|
Files in this item | ||||||
File name/size | Document type | Version type | Access type | License | Actions | |
Chen-2016-Modeling s(821KB) | -- | -- | Restricted access | -- |
|