题名 | In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer |
作者 | |
通讯作者 | Lei Feng |
DOI | |
发表日期 | 2023-12-10
|
会议名称 | 37th Conference on Neural Information Processing Systems (NeurIPS 2023).
|
会议日期 | 2023-12-10
|
会议地点 | the New Orleans Ernest N. Morial Convention Center
|
摘要 | Enabling machine learning classifiers to defer their decision to a downstream expert when the expert is more accurate will ensure improved safety and performance. This objective can be achieved with the learning-to-defer framework which aims to jointly learn how to classify and how to defer to the expert. In recent studies, it has been theoretically shown that popular estimators for learning to defer parameterized with softmax provide unbounded estimates for the likelihood of deferring which makes them uncalibrated. However, it remains unknown whether this is due to the widely used softmax parameterization and if we can find a softmax-based estimator that is both statistically consistent and possesses a valid probability estimator. In this work, we first show that the cause of the miscalibrated and unbounded estimator in prior literature is due to the symmetric nature of the surrogate losses used and not due to softmax. We then propose a novel statistically consistent asymmetric softmax-based surrogate loss that can produce valid estimates without the issue of unboundedness. We further analyze the non-asymptotic properties of our method and empirically validate its performance and calibration on benchmark datasets. |
学校署名 | 其他
|
语种 | 英语
|
来源库 | 人工提交
|
引用统计 | |
成果类型 | 会议论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/646936 |
专题 | 理学院_统计与数据科学系 |
作者单位 | 1.School of Computer Science and Engineering, Nanyang Technological University 2.CSAIL and IDSS, Massachusetts Institute of Technology 3.Department of Statistics and Data Science, Southern University of Science and Technology |
推荐引用方式 GB/T 7714 |
Yuzhou Cao,Hussein Mozannar,Lei Feng,et al. In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer[C],2023.
|
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | 操作 | |
8331_in_defense_of_s(679KB) | -- | -- | 限制开放 | -- |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论