five

Training dataset for AI-supported subject indexing of research data with DFG classification

收藏
DataCite Commons2025-11-13 更新2026-05-07 收录
下载链接:
https://darus.uni-stuttgart.de/citation?persistentId=doi:10.18419/DARUS-5500
下载链接
链接失效反馈
官方服务:
资源简介:
<p>This is a structured JSON dataset for training and evaluating AI models for automated subject indexing of research data and linking of research data and publications. It was created for the project <a href="https://wiki.bsz-bw.de/spaces/DAFDM/pages/138118716/DA-FDM">DA-FDM</a> in order to train a vector based model to give automated suggestions for the topic classification of research datasets in DaRUS. In the context of this project, the DFG-classification was integrated as a controlled vocabulary in DaRUS for the Topic Classification field. DFG classes were added manually to datasets from DaRUS that were uploaded prior to the integration.</p> <p>This dataset includes classification tags (<a href="https://w3id.org/dfgfo/2020">DFG</a>, <a href="https://swb.bsz-bw.de/DB=2.104/SET=1/TTL=1/START_WELCOME">GND</a>, <a href="https://www.wikidata.org">Wikidata</a>), publication links, respective open-access information, and, if the publication is open-access, the respective full texts for datasets from DaRUS as well as <a href="https://tudatalib.ulb.tu-darmstadt.de/home">TUdatalib</a>.</p> <p> Example object for the dataset: <pre> { "name": "doi:10.18419/darus-1234", "tags": [ { "name": "dfg-fs$102-04", "url": "https://w3id.org/dfgfo/2020/102-04" }, { "name": "ResearchDataSet" } ], "links": [ { "name": "doi:10.12345/abc5678", "type": "publication", "is_open_access": true, "open_access_url": "https://www.asdfg.com/10.12345/abc5678", "text": "Extracted publication full text ..." } ] } </pre> </p>
提供机构:
DaRUS
创建时间:
2025-10-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作