Calibration of probability predictions from machine-learning and statistical models

DataONE2020-03-02 更新2025-06-28 收录

下载链接：

https://search.dataone.org/view/sha256:e1ea9d3adce440bba3a8df7599d861dcd676e0bb9c1a551e5eda8dab23724955

下载链接

链接失效反馈

官方服务：

资源简介：

This data set describes the occurrence (yes/no) of a bird, the Southern Whiteface (Aphelocephala leucopsis) in Australia. A suite of environmental variables is provided, which are used in the paper to illustrate a statistical problem. The data are meant to allow reproduction of the analysis in this paper. They are not intended for actual ecological analysis. The data come as .Rdata-file, i.e. as an R-dataset (described technically here: https://www.loc.gov/preservation/digital/formats/fdd/fdd000470.shtml). Here is the paper's abstract: Aim: Predictions from statistical models may be uncalibrated, meaning that the predicted values do not have the nominal coverage probability. This is easiest seen with probability predictions in machine-learning classification, including the common species occurrence probabilities. Here, a predicted probability of, say, 0.7 should indicate that out of 100 cases with these environmental conditions, and hence the same predicted probability, the specie...

创建时间：

2025-06-21

5,000+

优质数据集

54 个

任务类型

进入经典数据集