DECM Machine Ready Corpus
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://figshare.com/articles/dataset/DECM_Machine_Ready_Corpus/12048729
下载链接
链接失效反馈官方服务:
资源简介:
The DECM Corpus is a digital corpus of the
texts of Relaciones Geográficas de Nueva España (the Geographic Reports of New Spain) with different versions, including a machine ready version, a gold standard annotated dataset, and an automatically annotated version ready for text mining and machine learning experiments.
This is the DECM Machine Ready Corpus. This
version includes text only files (.txt) containing each of the 10 volumes originally
edited by Rene Acuña, the 2 volumes edited by Mercedes de la Garza, the Suma
de Visita edited by Del Paso y Troncoso, a file with the original text of
the Crown mandate (Instrucción), and metadata for this collection. This
version contains only the original text of each of the RGs as transcribed by
the scholars, excluding any editorial note, commentary, or historical work.
This can be therefore used directly for corpus linguistics analyses, visualisations,
etc.
创建时间:
2020-05-25



