南方科技大学知识苑(SUSTech KC): Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus

题名	Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus
作者	Chen, Fei1 ; Tsao, Yu 2; Lai, Ying-Hui 2,3
通讯作者	Chen, Fei
发表日期	2016-07
DOI	10.1016/j.specom.2016.01.006
发表期刊	SPEECH COMMUNICATION 影响因子和分区
ISSN	0167-6393
EISSN	1872-7182
卷号	81页码:120-128
摘要	Temporal envelope and fine structure are two prominent acoustic cues for speech perception. Most existing speech-transmission-index based metrics make use of the temporal envelope information and discard the temporal fine structure (TFS) cue to predict speech intelligibility. Recent studies have shown that the TFS stimulus synthesized with multiband TFS waveforms contains rich intelligibility information, which is reflected as the recovered envelope from the TFS stimulus. The present study first assessed the performance of using the recovered envelope from the synthesized TFS stimulus to predict the intelligibility of noise-distorted and noise-suppressed speech. The TFS stimulus was synthesized and fed as an input into the conventional normalized covariance measure (NCM) module. The results showed that the recovered envelope from the TFS stimulus predicted the intelligibility as well as the original envelope extracted from the wideband speech signal did. In addition, an additive intelligibility model was designed to combine the envelope from wideband speech and the recovered envelope from the TFS stimulus to predict speech intelligibility. The prediction power was significantly improved when these two envelope waveforms were integrated. The present study suggests that the recovered envelope from the TFS stimulus may be alternative acoustic information for modeling speech intelligibility and improving the prediction power of the conventional NCM-based intelligibility index. (C) 2016 Elsevier B.V. All rights reserved.
关键词	Speech intelligibility Temporal fine structure Recovered envelope Normalized covariance measure
相关链接	[来源记录]
收录类别	SCI ; EI
语种	英语
学校署名	第一 ; 通讯
资助项目	Ministry of Science and Technology of Taiwan[MOST 104-2221-E-001-026-MY2]
WOS研究方向	Acoustics ; Computer Science
WOS类目	Acoustics ; Computer Science, Interdisciplinary Applications
WOS记录号	WOS:000378440500008
出版者	ELSEVIER SCIENCE BV
EI入藏号	20161002068569
EI主题词	Acoustic noise ; Atomic physics ; Forecasting ; Recovery ; Speech communication ; Speech transmission
EI分类号	Acoustic Noise:751.4 ; Speech:751.5 ; Atomic and Molecular Physics:931.3
ESI学科分类	COMPUTER SCIENCE
来源库	Web of Science
引用统计	被引频次[WOS]：2
成果类型	期刊论文
条目标识符	http://sustech.caswiz.com/handle/2SGJ60CL/29567
专题	工学院_电子与电气工程系
作者单位	1.Southern Univ Sci & Technol, Dept Elect & Elect Engn, Xueyuan Rd 1088, Shenzhen, Peoples R China 2.Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan 3.Yuan Ze Univ, Dept Elect Engn, Chungli, Taiwan
第一作者单位	电子与电气工程系
通讯作者单位	电子与电气工程系
第一作者的第一单位	电子与电气工程系
推荐引用方式 GB/T 7714	Chen, Fei,Tsao, Yu,Lai, Ying-Hui. Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus[J]. SPEECH COMMUNICATION,2016,81:120-128.
APA	Chen, Fei,Tsao, Yu,&Lai, Ying-Hui.(2016).Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus.SPEECH COMMUNICATION,81,120-128.
MLA	Chen, Fei,et al."Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus".SPEECH COMMUNICATION 81(2016):120-128.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可	操作
Chen-2016-Modeling s（821KB）	--	--	限制开放	--