
An HTS-derived AI evaluation dataset with realistic virtual screening space and non-trivial class separation

DataCite Commons; updated 2026-05-05; indexed 2026-05-07
Download link:
https://zenodo.org/doi/10.5281/zenodo.20030795
Description:
An HTS-derived evaluation benchmark that uses UMAP-based sampling and clustering to represent the virtual screening (VS) space realistically while enforcing non-trivial class separability. The dataset is designed primarily for classification tasks; continuous inhibition activity values (%), with their associated standard error and standard deviation, are also provided for potential regression applications. These activity measurements are inherently noisy: they come from primary high-throughput screening conditions and are based on a fluorescence readout of helicase activity. This indirect measurement is sensitive to experimental variability, including plate effects, signal plateauing, and non-enzymatic contributions (mitigated through trap DNA). Despite assay optimization (e.g., buffer conditions and reaction timing) and high overall assay quality (Z' = 0.86), these factors can produce noisy continuous labels, including potential false positives and false negatives.
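For readers unfamiliar with the Z' statistic quoted in the description, the sketch below computes a Z'-factor under its usual definition, Z' = 1 - 3(σ_pos + σ_neg) / |μ_pos - μ_neg|, from positive- and negative-control readouts. The function name `z_prime` and the control values are hypothetical and chosen only for illustration; they are not taken from the dataset, and the description does not say exactly how the reported value was computed.

```python
import statistics


def z_prime(pos_controls, neg_controls):
    """Z'-factor: 1 - 3 * (sd_pos + sd_neg) / |mean_pos - mean_neg|."""
    mu_p = statistics.mean(pos_controls)
    mu_n = statistics.mean(neg_controls)
    # Sample standard deviations of each control group.
    sd_p = statistics.stdev(pos_controls)
    sd_n = statistics.stdev(neg_controls)
    separation = abs(mu_p - mu_n)
    if separation == 0:
        raise ValueError("control means are identical; Z' is undefined")
    return 1.0 - 3.0 * (sd_p + sd_n) / separation


# Hypothetical control-well readouts (illustrative only, not dataset values).
pos = [98.0, 100.0, 102.0]
neg = [0.0, 2.0, 4.0]
print(round(z_prime(pos, neg), 3))
```

Values close to 1 indicate well-separated controls with little spread. Even with a high Z', individual compound measurements from a primary screen can still be noisy, which is the caveat the description raises for the continuous labels.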
Provider:
Zenodo
Created:
2026-05-04