five

Replication Data for: A multi-analyte machine learning model to detect wrong blood in tube errors

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://doi.org/10.7910/DVN/XCYHPX
下载链接
链接失效反馈
官方服务:
资源简介:
Misidentification of blood specimens is an important pre-analytical risk that can lead to patient harm. We developed several machine learning models to detect this problem using Complete Blood Count (CBC) data in a large pediatric inpatient population. We achieved accuracy of >97% using CBC with differential cell counts. We then utilized a validation set designed to mimic real world prevalence, achieving a positive predictive value of 20%. Datasets are tabular data at the test level containing CBC with Diff and CBC no Diff analyte deltas (absolute deltas: current value - previous value and percent deltas: current value / previous value) for patients at the Children's Hospital of Philadelphia (CHOP) meeting certain inclusion criteria. Also included are patient sex, age and hours between CBCs. The analysis was conducted for CBC with Diff and CBC no Diff tests in parallel but separately. Therefore, each test has its own notebook and corresponding train/validation/test datasets. 118,314 total tests: 8,253 Complete Blood Count (CBC) no Differential (Diff) 110,061 CBC with Diff tests The raw clinical data consisting of Complete Blood Count (CBC) test results was extracted from the Children's Hospital of Philadelphia (CHOP) Clinical Data Warehouse (CDW). The data was analyzed with a novel machine learning model.
创建时间:
2024-09-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作