当前位置: X-MOL 学术Nat. Mach. Intell. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
The curious case of the test set AUROC
Nature Machine Intelligence ( IF 23.8 ) Pub Date : 2024-04-04 , DOI: 10.1038/s42256-024-00817-7
Michael Roberts , Alon Hazan , Sören Dittmer , James H. F. Rudd , Carola-Bibiane Schönlieb

The area under the receiver operating characteristic curve (AUROC) of the test set is used throughout machine learning (ML) for assessing a model’s performance. However, when concordance is not the only ambition, this gives only a partial insight into performance, masking distribution shifts of model outputs and model instability.

中文翻译:

测试集 AUROC 的奇怪案例

测试集的受试者工作特征曲线 (AUROC) 下的面积在整个机器学习 (ML) 中用于评估模型的性能。然而,当一致性不是唯一的目标时,这只能提供对性能的部分了解,掩盖了模型输出的分布变化和模型的不稳定性。
更新日期:2024-04-04
down
wechat
bug