five

STEM-ECR-v1.0

收藏
DataCite Commons2022-01-17 更新2025-04-15 收录
下载链接:
https://data.uni-hannover.de/dataset/0a4d8d4e-f881-48f5-82e8-20c2c635e8bd
下载链接
链接失效反馈
官方服务:
资源简介:
##Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources The STEM ECR v1.0 dataset has been developed to provide a benchmark for the evaluation of scientific entity extraction, classification, and resolution tasks in a domain-independent fashion. It comprises annotations for scientific entities in scientific Abstracts drawn from 10 disciplines in Science, Technology, Engineering, and Medicine. The annotated entities are further grounded to Wikipedia and Wiktionary, respectively. ###What this repository contains? The dataset is organized in the following folders: * Scientific Entity Annotations: Contains annotations for Process, Material, Method, and Data scientific entities in the STEM dataset. * Scientific Entity Resolution: Annotations for the STEM dataset scientific entities with Entity Linking (EL) annotations to Wikipedia and Word Sense Disambiguation (WSD) annotations to Wiktionary. ###Annotation Guidelines The annotation guidelines that supported the creation of this corpus can be found [here](https://gitlab.com/TIBHannover/orkg/orkg-nlp/-/blob/master/STEM-ECR-v1.0/annotation-guidelines-for-scientific-linked-data.pdf "Guidelines"). ###Supporting Publication D'Souza, J., Hoppe, A., Brack, A., Jaradeh, M., Auer, S., & Ewerth, R. (2020). [The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources](https://www.aclweb.org/anthology/2020.lrec-1.268). In Proceedings of The 12th Language Resources and Evaluation Conference (pp. 2192–2203). European Language Resources Association. ###Useful Links * https://github.com/elsevierlabs/OA-STM-Corpus/ * https://brat.nlplab.org/ * https://dumps.wikimedia.org/enwiki/20190920/ * https://dumps.wikimedia.org/enwiktionary/20190920/ * https://dkpro.github.io/dkpro-jwpl/ * https://dkpro.github.io/dkpro-jwktl/ * https://orkg.org/orkg
提供机构:
LUIS
创建时间:
2020-02-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作