A corpus of 42 books from European languages embracing four families.
收藏Figshare2017-09-22 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/The_corpus/5432128/2
下载链接
链接失效反馈官方服务:
资源简介:
A corpus of 42 books, three for each of 14 different European languages taken from the page www.gutenberg.org. The titles of the oeuvres and authors are written in the romanized way given in the page. The texts were chosen by no other reason that to be representative of each language and avoiding, as much as possible, the repetitive texts like poetry. <br>
创建时间:
2017-09-22



