MedLatinEpi, MedLatinLit
收藏arXiv2021-09-12 更新2024-06-21 收录
下载链接:
https://doi.org/10.5281/zenodo.4298503
下载链接
链接失效反馈官方服务:
资源简介:
MedLatinEpi和MedLatinLit是两个专为中世纪拉丁文计算作者分析研究设计的数据集,由意大利国家研究委员会创建。MedLatinEpi包含294封书信,主要来自13至14世纪,而MedLatinLit则包含30篇文学评论和论文,平均长度约为40,000字。这两个数据集的创建旨在解决中世纪文献中作者身份的争议问题,为文学和历史学者提供重要的研究工具。数据集中的文本经过精心预处理,确保其适合用于作者分析任务,如作者归属、作者验证等。
MedLatinEpi and MedLatinLit are two datasets specifically designed for the computational analysis of medieval Latin authors, created by the National Research Council of Italy. MedLatinEpi contains 294 letters, primarily dating from the 13th to 14th centuries, while MedLatinLit includes 30 literary critiques and treatises, with an average length of approximately 40,000 words. These datasets were developed to resolve authorial disputes in medieval literary works, providing an important research tool for literary and historical scholars. The texts in the datasets have undergone rigorous preprocessing to ensure their suitability for authorship analysis tasks such as author attribution and author verification.
提供机构:
意大利国家研究委员会
创建时间:
2020-06-22



