A corpus of 42 books in European languages spanning four language families.
Source: DataCite Commons. Updated: 2020-09-01. Indexed: 2024-07-25.
Download link:
https://figshare.com/articles/dataset/The_corpus/5432128/2
Description:
A corpus of 42 books, three for each of 14 European languages, taken from Project Gutenberg (www.gutenberg.org). Titles and author names are given in the romanized form used on that site. The texts were chosen for no reason other than to be representative of each language while avoiding, as far as possible, repetitive texts such as poetry.
Provider:
figshare
Created:
2017-09-22



