中文版 | English
题名

A Novel Data Stream Learning Approach to Tackle One-Sided Label Noise From Verification Latency

作者
DOI
发表日期
2022
会议名称
IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) / IEEE World Congress on Computational Intelligence (IEEE WCCI) / International Joint Conference on Neural Networks (IJCNN) / IEEE Congress on Evolutionary Computation (IEEE CEC)
ISSN
2161-4393
ISBN
978-1-6654-9526-4
会议录名称
页码
1-8
会议日期
18-23 July 2022
会议地点
Padua, Italy
出版地
345 E 47TH ST, NEW YORK, NY 10017 USA
出版者
摘要
Many real-world data stream applications suffer from verification latency, where the labels of the training examples arrive with a delay. In binary classification problems, the labeling process frequently involves waiting for a pre-determined period of time to observe an event that assigns the example to a given class. Once this time passes, if such labeling event does not occur, the example is labeled as belonging to the other class. For example, in software defect prediction, one may wait to see if a defect is associated to a software change implemented by a developer, producing a defect-inducing training example. If no defect is found during the waiting time, the training example is labeled as clean. Such verification latency inherently causes label noise associated to insufficient waiting time. For example, a defect may be observed only after the pre-defined waiting time has passed, resulting in a noisy example of the clean class. Due to the nature of the waiting time, such noise is frequently one-sided, meaning that it only occurs to examples of one of the classes. However, no existing work tackles label noise associated to verification latency. This paper proposes a novel data stream learning approach that estimates the confidence in the labels assigned to the training examples and uses this to improve predictive performance in problems with one-sided label noise. Our experiments with 14 real-world datasets from the domain of software defect prediction demonstrate the effectiveness of the proposed approach compared to existing ones.
关键词
学校署名
第一
语种
英语
相关链接[IEEE记录]
收录类别
资助项目
National Natural Science Foundation of China[62002148]
WOS研究方向
Computer Science ; Engineering ; Neurosciences & Neurology
WOS类目
Computer Science, Artificial Intelligence ; Computer Science, Hardware & Architecture ; Engineering, Electrical & Electronic ; Neurosciences
WOS记录号
WOS:000867070900043
来源库
IEEE
全文链接https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9891911
引用统计
被引频次[WOS]:3
成果类型会议论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/406470
专题工学院_计算机科学与工程系
作者单位
1.Department of Computer Science and Engineering, Guangdong Provincial Key Laboratory of Brain-inspired Intelligent Computation, Southern University of Science and Technology, China
2.School of Computer Science, University of Birmingham, UK
第一作者单位计算机科学与工程系
第一作者的第一单位计算机科学与工程系
推荐引用方式
GB/T 7714
Liyan Song,Shuxian Li,Leandro L. Minku,et al. A Novel Data Stream Learning Approach to Tackle One-Sided Label Noise From Verification Latency[C]. 345 E 47TH ST, NEW YORK, NY 10017 USA:IEEE,2022:1-8.
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Liyan Song]的文章
[Shuxian Li]的文章
[Leandro L. Minku]的文章
百度学术
百度学术中相似的文章
[Liyan Song]的文章
[Shuxian Li]的文章
[Leandro L. Minku]的文章
必应学术
必应学术中相似的文章
[Liyan Song]的文章
[Shuxian Li]的文章
[Leandro L. Minku]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。