five

MIL(multi-instance learning)

收藏
OpenXLab2026-04-18 收录
下载链接:
https://openxlab.org.cn/datasets/OpenDataLab/MIL
下载链接
链接失效反馈
官方服务:
资源简介:
MIL算法在71 mil基准数据集上进行了测试。这是最大的实验MIL存储库,用于算法比较。数据集的应用领域是分子活性预测,图像注释,文本分类,网页分类和音频记录分类 (n miproblems.org提供数据集的mat文件)。 每个数据集文件都是一个逗号分隔值 (CSV) 格式的文件,它具有实例数许多行和特征数许多列以及两个附加列。第一个附加列对应于传播到实例的bag类标签。第二列是bag ID列,其中每个实例接收其所有者bag的bag id号。其余列分别存储实例的特征值。

MIL algorithms were tested on the 71 MIL benchmark datasets. This is the largest experimental MIL repository for algorithm comparison. The application fields of these datasets include molecular activity prediction, image annotation, text classification, web page classification, and audio recording classification. The MAT format files of the datasets are provided via miproblems.org. Each dataset file is in Comma-Separated Values (CSV) format, containing multiple rows corresponding to the number of instances, multiple columns corresponding to the number of features, and two additional columns. The first additional column corresponds to the bag-level class label of the bag that the instance belongs to. The second column is the bag ID column, where each instance is assigned the bag ID number of its parent bag. The remaining columns respectively store the feature values of the instance.
提供机构:
OpenDataLab
创建时间:
2022-10-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作