中文版 | English
题名

Representing the intelligibility advantage of ideal binary masking with the most energetic channels

作者
通讯作者Chen, Fei
发表日期
2016-12
DOI
发表期刊
ISSN
0001-4966
EISSN
1520-8524
卷号140期号:6页码:4161-4169
摘要
This study investigates how the intelligibility advantage of ideal binary mask (IBM) processing in synthesizing speech is affected by the use of a small number of the most energetic channels. In experiment 1, IBM-processed Mandarin speech that had been corrupted by speech spectrum-shaped noise or two-talker babble was synthesized by using as few as four of the most energetic target-dominated channels at each frame. This approach provided intelligibility comparable to that of speech synthesized with all of the target-dominated channels. Experiments 2, 3, and 4 examined how the intelligibility advantage of IBM processing from experiment 1 was affected by the local SNR threshold, low-frequency region (LFR) cut-off frequency, and vowel-based segmentation, respectively. Experiments 2 and 3 showed that a threshold of 0 dB for local SNR and a cutoff of 3000 Hz for LFR were optimal choices for improving the intelligibility of IBM processing based on the most energetic channels. Experiment 4 found that the intelligibility advantage of IBM processing with the most energetic channels was preserved at the segmental level of vowel-only IBM-processed speech. Taken together, the results suggest that compared to IBM-processed speech synthesized with all of the target-dominated channels, Mandarin speech synthesized by selecting a small number of the most energetic target-dominated channels can achieve similar levels of intelligibility. (C) 2016 Acoustical Society of America.
相关链接[来源记录]
收录类别
SCI ; EI
语种
英语
学校署名
第一 ; 通讯
资助项目
Basic Research Foundation of Shenzhen[JCYJ20160429191402782]
WOS研究方向
Acoustics ; Audiology & Speech-Language Pathology
WOS类目
Acoustics ; Audiology & Speech-Language Pathology
WOS记录号
WOS:000390347900024
出版者
EI入藏号
20165003123658
EI主题词
Linguistics ; Signal to noise ratio
EI分类号
Information Theory and Signal Processing:716.1 ; Speech:751.5
ESI学科分类
PHYSICS
来源库
Web of Science
引用统计
被引频次[WOS]:2
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/29330
专题工学院_电子与电气工程系
作者单位
Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen, Peoples R China
第一作者单位电子与电气工程系
通讯作者单位电子与电气工程系
第一作者的第一单位电子与电气工程系
推荐引用方式
GB/T 7714
Chen, Fei. Representing the intelligibility advantage of ideal binary masking with the most energetic channels[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA,2016,140(6):4161-4169.
APA
Chen, Fei.(2016).Representing the intelligibility advantage of ideal binary masking with the most energetic channels.JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA,140(6),4161-4169.
MLA
Chen, Fei."Representing the intelligibility advantage of ideal binary masking with the most energetic channels".JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 140.6(2016):4161-4169.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可 操作
Chen-2016-Representi(1118KB)----限制开放--
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Chen, Fei]的文章
百度学术
百度学术中相似的文章
[Chen, Fei]的文章
必应学术
必应学术中相似的文章
[Chen, Fei]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。