Das Deutsche Referenzkorpus
收藏re3data.org2024-05-31 收录
下载链接:
https://www.re3data.org/repository/r3d100010264
下载链接
链接失效反馈官方服务:
资源简介:
The project is set up in order to improve the infrastructure for text-based linguistic research and development by building a huge, automatically annotated German text corpus and the corresponding tools for corpus annotation and exploitation. DeReKo constitutes the largest linguistically motivated collection of contemporary German texts, contains fictional, scientific and newspaper texts, as well as several other text types, contains only licenced texts, is encoded with rich meta-textual information, is fully annotated morphosyntactically (three concurrent annotations), is continually expanded, with a focus on size and stratification of data, may be analyzed free of charge via the query system COSMAS II, serves as a 'primordial sample' from which users may draw specialized sub-samples (socalled 'virtual corpora') to represent the language domain they wish to investigate. Info: Access to data of Das Deutsche Referenzkorpus is also provided by: IDS Repository https://www.re3data.org/repository/r3d100010382
本项目的设立旨在通过构建一个庞大的、自动标注的德语文本语料库及其相应的语料库标注与利用工具,以提升基于文本的语言研究与发展基础设施。DeReKo 构成了当代德语文本中语言学动机驱动的最大集合,囊括了小说、科学论文及报纸文章等多种文本类型,仅包含授权文本,编码时融入了丰富的元文本信息,在形态句法层面上进行全面标注(三种并行标注),持续扩充,注重数据规模与分层,用户可通过查询系统 COSMAS II 免费进行分析,作为‘原初样本’供用户从中提取特定子样本(所谓‘虚拟语料库’),以表征他们所希望研究的语言领域。信息:德意志参考语料库的数据亦可通过 IDS 仓库获取:https://www.re3data.org/repository/r3d100010382
提供机构:
DeReKo



