Training dataset for AI-supported subject indexing of research data with DFG classification
收藏DataCite Commons2025-11-13 更新2026-05-07 收录
下载链接:
https://darus.uni-stuttgart.de/citation?persistentId=doi:10.18419/DARUS-5500
下载链接
链接失效反馈官方服务:
资源简介:
<p>This is a structured JSON dataset for training and evaluating AI models for automated subject indexing of research data and linking of research data and publications. It was created for the project <a href="https://wiki.bsz-bw.de/spaces/DAFDM/pages/138118716/DA-FDM">DA-FDM</a> in order to train a vector based model to give automated suggestions for the topic classification of research datasets in DaRUS. In the context of this project, the DFG-classification was integrated as a controlled vocabulary in DaRUS for the Topic Classification field. DFG classes were added manually to datasets from DaRUS that were uploaded prior to the integration.</p>
<p>This dataset includes classification tags (<a href="https://w3id.org/dfgfo/2020">DFG</a>, <a href="https://swb.bsz-bw.de/DB=2.104/SET=1/TTL=1/START_WELCOME">GND</a>, <a href="https://www.wikidata.org">Wikidata</a>), publication links, respective open-access information, and, if the publication is open-access, the respective full texts for datasets from DaRUS as well as <a href="https://tudatalib.ulb.tu-darmstadt.de/home">TUdatalib</a>.</p>
<p>
Example object for the dataset:
<pre>
{
"name": "doi:10.18419/darus-1234",
"tags": [
{
"name": "dfg-fs$102-04",
"url": "https://w3id.org/dfgfo/2020/102-04"
},
{
"name": "ResearchDataSet"
}
],
"links": [
{
"name": "doi:10.12345/abc5678",
"type": "publication",
"is_open_access": true,
"open_access_url": "https://www.asdfg.com/10.12345/abc5678",
"text": "Extracted publication full text ..."
}
]
}
</pre>
</p>
提供机构:
DaRUS
创建时间:
2025-10-22



