
Manual Dependency Annotation of Three German Text Extracts from the Project hermA (Gold Standard Data)

Source: NIAID Data Ecosystem, indexed 2026-03-11
Download link:
https://zenodo.org/record/1324079
Description:
This dataset was created in the digital humanities project hermA (www.herma.uni-hamburg.de) and comprises annotated extracts of the following three texts:

Modern literature (Lit2009): the novel Corpus Delicti: Ein Prozess by German author Juli Zeh, published in Frankfurt/Main in 2009.

Non-contemporary literature (Lit1850): Eine Frauenfahrt um die Welt ('A woman’s journey around the world') by Austrian author Ida Pfeiffer (1850). Full text available at Deutsches Textarchiv: http://www.deutschestextarchiv.de/pfeiffer_frauenfahrt01_1850/6.

Modern academic writing (Aca2009): Stand, Möglichkeiten und Grenzen der Telemedizin in Deutschland ('Telemedicine in Germany: status, chances and limits') by Rüdiger Klar and Ernst Pelikan, published in Bundesgesundheitsblatt ('Federal Health Gazette') in 2009. DOI 10.1007/s00103-009-0787-7.

The texts are annotated for part-of-speech and dependency syntax and are made available in CoNLL file format. The annotation process and the inter-annotator agreement are described in:

Adelmann, Benedikt, Melanie Andresen, Wolfgang Menzel & Heike Zinsmeister. 2018. Evaluation of Out-of-Domain Dependency Parsing for its Application in a Digital Humanities Project. Proceedings of the 14th Conference on Natural Language Processing (KONVENS 2018). Vienna, Austria.
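Since the annotations are distributed in CoNLL file format, a reader may want a quick way to load them. The following is a minimal Python sketch, not the project's own tooling. It assumes tab-separated columns in a CoNLL-X-style layout (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, ...) with blank lines between sentences; the description does not state which CoNLL variant is used, so the column positions in `COLUMNS` should be checked against the actual files. The function name `read_conll` and the sample sentence with its tags and labels are invented for illustration and are not taken from the dataset.

```python
from typing import Dict, Iterable, Iterator, List

# ASSUMPTION: CoNLL-X-style column positions (0-based). Verify against the real files.
COLUMNS = {"id": 0, "form": 1, "pos": 4, "head": 6, "deprel": 7}


def read_conll(
    lines: Iterable[str], columns: Dict[str, int] = COLUMNS
) -> Iterator[List[Dict[str, str]]]:
    """Yield one sentence at a time as a list of token dictionaries."""
    sentence: List[Dict[str, str]] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        fields = line.split("\t")
        sentence.append({name: fields[i] for name, i in columns.items()})
    # Emit the last sentence if the input does not end with a blank line.
    if sentence:
        yield sentence


# Invented sample data, for illustration only (not from the hermA files).
sample = (
    "1\tDas\tder\tART\tART\t_\t2\tDET\t_\t_\n"
    "2\tHaus\tHaus\tN\tNN\t_\t3\tSUBJ\t_\t_\n"
    "3\tsteht\tstehen\tV\tVVFIN\t_\t0\tROOT\t_\t_\n"
    "\n"
)
sentences = list(read_conll(sample.splitlines()))
for token in sentences[0]:
    print(token["form"], token["pos"], token["head"], token["deprel"])
```

To read a downloaded file instead of the inline sample, pass an open file handle, for example `read_conll(open(path, encoding="utf-8"))`, where `path` is a placeholder for the local file name.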
Created:
2020-01-24