Title | Representing the intelligibility advantage of ideal binary masking with the most energetic channels |
Authors | |
Corresponding author | Chen, Fei |
Publication date | 2016-12 |
DOI | |
Journal | |
ISSN | 0001-4966 |
EISSN | 1520-8524 |
Volume | 140 |
Issue | 6 |
Pages | 4161-4169 |
Abstract | This study investigates how the intelligibility advantage of ideal binary mask (IBM) processing in synthesizing speech is affected by the use of a small number of the most energetic channels. In experiment 1, IBM-processed Mandarin speech that had been corrupted by speech spectrum-shaped noise or two-talker babble was synthesized by using as few as four of the most energetic target-dominated channels at each frame. This approach provided intelligibility comparable to that of speech synthesized with all of the target-dominated channels. Experiments 2, 3, and 4 examined how the intelligibility advantage of IBM processing from experiment 1 was affected by the local SNR threshold, low-frequency region (LFR) cut-off frequency, and vowel-based segmentation, respectively. Experiments 2 and 3 showed that a threshold of 0 dB for local SNR and a cutoff of 3000 Hz for LFR were optimal choices for improving the intelligibility of IBM processing based on the most energetic channels. Experiment 4 found that the intelligibility advantage of IBM processing with the most energetic channels was preserved at the segmental level of vowel-only IBM-processed speech. Taken together, the results suggest that compared to IBM-processed speech synthesized with all of the target-dominated channels, Mandarin speech synthesized by selecting a small number of the most energetic target-dominated channels can achieve similar levels of intelligibility. © 2016 Acoustical Society of America. |
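Related links | [Source record] |
Indexed by | |
Language | English |
University attribution | First; Corresponding |
Funding | Basic Research Foundation of Shenzhen[JCYJ20160429191402782] |
WOS research area | Acoustics; Audiology & Speech-Language Pathology |
WOS category | Acoustics; Audiology & Speech-Language Pathology |
WOS accession number | WOS:000390347900024 |
Publisher | |
EI accession number | 20165003123658 |
EI main headings | Linguistics; Signal to noise ratio |
EI classification codes | Information Theory and Signal Processing: 716.1; Speech: 751.5 |
ESI research field | PHYSICS |
Source database | Web of Science |
Citation statistics | Times cited [WOS]: 2 |
Output type | Journal article |
Item identifier | http://sustech.caswiz.com/handle/2SGJ60CL/29330 |
Collection | College of Engineering_Department of Electrical and Electronic Engineering |
Author affiliation | Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen, Peoples R China |
First author affiliation | Department of Electrical and Electronic Engineering |
Corresponding author affiliation | Department of Electrical and Electronic Engineering |
First author's primary affiliation | Department of Electrical and Electronic Engineering |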
Recommended citation, GB/T 7714 | Chen, Fei. Representing the intelligibility advantage of ideal binary masking with the most energetic channels[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140(6): 4161-4169. |
APA | Chen, Fei. (2016). Representing the intelligibility advantage of ideal binary masking with the most energetic channels. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 140(6), 4161-4169. |
MLA | Chen, Fei. "Representing the intelligibility advantage of ideal binary masking with the most energetic channels". JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 140.6 (2016): 4161-4169. |
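
The channel-selection idea summarized in the abstract (keep only a few of the most energetic target-dominated channels in each frame) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `top_n_ibm`, the array layout, and the choice to rank channels by target energy are all assumptions made for illustration, and the filterbank analysis and resynthesis stages are omitted. The defaults of four channels and a 0 dB local SNR threshold follow the values mentioned in the abstract.

```python
import numpy as np

def top_n_ibm(target_energy, masker_energy, n_channels=4, lc_db=0.0):
    """Binary mask keeping the n most energetic target-dominated channels per frame.

    target_energy, masker_energy: arrays of shape (channels, frames) holding the
    per-unit energies of the premixed target and masker (hypothetical layout).
    """
    eps = np.finfo(float).tiny  # avoids division by zero and log of zero
    local_snr_db = 10.0 * np.log10((target_energy + eps) / (masker_energy + eps))
    ibm = local_snr_db > lc_db  # True where the unit is target-dominated

    # Assumption: rank channels by target energy; masker-dominated units rank last.
    ranked = np.where(ibm, target_energy, -np.inf)
    order = np.argsort(-ranked, axis=0)  # channel indices, descending per frame

    keep = np.zeros_like(ibm)  # boolean array, all False
    np.put_along_axis(keep, order[:n_channels], True, axis=0)

    # Frames with fewer than n target-dominated channels keep only those channels.
    return keep & ibm
```

Calling `top_n_ibm(target_energy, masker_energy, n_channels=4)` returns a boolean array of the same shape as its inputs. Multiplying the mixture's per-channel envelopes by this mask before resynthesis would be one way to apply it.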
Files in this item | ||||||
File name/size | Document type | Version type | Access type | License | Action | |
Chen-2016-Representi(1118KB) | -- | -- | Restricted access | -- |