m-sized Training and Evaluation Data for Publication "Using Supervised Learning to Classify Metadata of Research Data by Field of Study"

Source: NIAID Data Ecosystem (indexed 2026-03-11)
Download link:
https://zenodo.org/record/3490457
Description:
Automated classification of research data metadata by discipline(s) of research can be used in scientometric research, by repository service providers, and in research data aggregation services. Openly available metadata from the DataCite index for research data were used to compile a large training and evaluation set comprising 609,524 records. This is the cleaned and vectorized version with a medium-sized feature selection.

Created:
2020-04-20