中文版 | English
题名

A Practical Human Labeling Method for Online Just-in-Time Software Defect Prediction

作者
通讯作者Yao, Xin
DOI
发表日期
2023
会议名称
31st ACM Joint Meeting of the European Software Engineering Conference / Symposium on the Foundations-of-Software-Engineering (ESEC/FSE)
会议录名称
会议日期
DEC 03-09, 2023
会议地点
null,San Francisco,CA
出版地
1601 Broadway, 10th Floor, NEW YORK, NY, UNITED STATES
出版者
摘要
Just-in-Time Software Defect Prediction (JIT-SDP) can be seen as an online learning problem where additional software changes produced over time may be labeled and used to create training examples. These training examples form a data stream that can be used to update JIT-SDP models in an attempt to avoid models becoming obsolete and poorly performing. However, labeling procedures adopted in existing online JIT-SDP studies implicitly assume that practitioners would not inspect software changes upon a defect-inducing prediction, delaying the production of training examples. This is inconsistent with a real-world scenario where practitioners would adopt JIT-SDP models and inspect certain software changes predicted as defect-inducing to check whether they really induce defects. Such inspection means that some software changes would be labeled much earlier than assumed in existing work, potentially leading to different JIT-SDP models and performance results. This paper aims at formulating a more practical human labeling procedure that takes into account the adoption of JIT-SDP models during the software development process. It then analyses whether and to what extent it would impact the predictive performance of JIT-SDP models. We also propose a new method to target the labeling of software changes with the aim of saving human inspection effort. Experiments based on 14 GitHub projects revealed that adopting a more realistic labeling procedure led to significantly higher predictive performance than when delaying the labeling process, meaning that existing work may have been underestimating the performance of JIT-SDP. In addition, our proposed method to target the labeling process was able to reduce human effort while maintaining predictive performance by recommending practitioners to inspect software changes that are more likely to induce defects. We encourage the adoption of more realistic human labeling methods in research studies to obtain an evaluation of JIT-SDP predictive performance that is closer to reality.
关键词
学校署名
第一 ; 通讯
语种
英语
相关链接[来源记录]
收录类别
资助项目
National Natural Science Foundation of China (NSFC)["62002148","62250710682"]
WOS研究方向
Computer Science
WOS类目
Computer Science, Software Engineering ; Computer Science, Theory & Methods
WOS记录号
WOS:001148157800050
来源库
Web of Science
引用统计
被引频次[WOS]:2
成果类型会议论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/706640
专题工学院_斯发基斯可信自主研究院
工学院_计算机科学与工程系
作者单位
1.Research Institute of Trustworthy Autonomous Systems, Southern University of Science and Technology, Shenzhen, China
2.Guangdong Provincial Key Laboratory of Braininspired Intelligent Computation, Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
3.School of Computer Science, The University of Birmingham, Birmingham, Edgbaston, United Kingdom
第一作者单位斯发基斯可信自主系统研究院;  计算机科学与工程系
通讯作者单位斯发基斯可信自主系统研究院;  计算机科学与工程系
第一作者的第一单位斯发基斯可信自主系统研究院
推荐引用方式
GB/T 7714
Song, Liyan,Minku, Leandro,Teng, Cong,et al. A Practical Human Labeling Method for Online Just-in-Time Software Defect Prediction[C]. 1601 Broadway, 10th Floor, NEW YORK, NY, UNITED STATES:ASSOC COMPUTING MACHINERY,2023.
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Song, Liyan]的文章
[Minku, Leandro]的文章
[Teng, Cong]的文章
百度学术
百度学术中相似的文章
[Song, Liyan]的文章
[Minku, Leandro]的文章
[Teng, Cong]的文章
必应学术
必应学术中相似的文章
[Song, Liyan]的文章
[Minku, Leandro]的文章
[Teng, Cong]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。