Label distribution of different data sources.

NIAID Data Ecosystem2026-03-14 收录

下载链接：

https://figshare.com/articles/dataset/Label_distribution_of_different_data_sources_/22334147

下载链接

链接失效反馈

官方服务：

资源简介：

The use of imaging systems in protein crystallisation means that the experimental setups no longer require manual inspection to determine the outcome of the trials. However, it leads to the problem of how best to find images which contain useful information about the crystallisation experiments. The adoption of a deeplearning approach in 2018 enabled a four-class machine classification system of the images to exceed human accuracy for the first time. Underpinning this was the creation of a labelled training set which came from a consortium of several different laboratories. The MARCO classification model does not have the same accuracy on local data as it does on images from the original test set; this can be somewhat mitigated by retraining the ML model and including local images. We have characterized the image data used in the original MARCO model, and performed extensive experiments to identify training settings most likely to enhance the local performance of a MARCO-dataset based ML classification model.

创建时间：

2023-03-24

5,000+

优质数据集

54 个

任务类型

进入经典数据集