
Colonia Corpus of Historical Portuguese

Source: SSH Open MarketPlace. Updated 2024-09-30. Indexed 2024-10-05.
Download link:
https://marketplace.sshopencloud.eu/dataset/F5Fy7w
Description:
Portuguese is a Romance language and the native language of over 215 million speakers worldwide. Like Spanish, English and French, it was the language of both its country of origin and that country's colonial possessions. This corpus contains complete historical Portuguese manuscripts published between 1500 and 1936 in Portugal and Brazil, divided into five sub-corpora, one per century (summarized in the table below). Words in the corpus were tagged for part of speech (POS) using TreeTagger. More information is available on the Colonia homepage.
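As a rough illustration of working with POS-tagged text, the sketch below counts tags in TreeTagger-style output, where each token sits on its own line as word, tag and lemma separated by tabs. It assumes the corpus files follow that three-column layout and that markup lines start with "<"; neither is confirmed by this listing. The names Token, parse_tagged_lines and pos_counts, and the sample tags, are hypothetical.

```python
from collections import Counter
from typing import Iterable, Iterator, NamedTuple


class Token(NamedTuple):
    word: str
    pos: str
    lemma: str


def parse_tagged_lines(lines: Iterable[str]) -> Iterator[Token]:
    """Yield tokens from lines shaped as word<TAB>POS<TAB>lemma."""
    for line in lines:
        line = line.rstrip("\n")
        # Assumption: blank lines and markup lines (starting with "<") carry no token.
        if not line.strip() or line.startswith("<"):
            continue
        parts = line.split("\t")
        if len(parts) != 3:
            continue  # malformed line; skipped in this sketch
        yield Token(*parts)


def pos_counts(lines: Iterable[str]) -> Counter:
    """Count how often each POS tag occurs."""
    return Counter(tok.pos for tok in parse_tagged_lines(lines))


# Made-up sample; the tag names are placeholders, not the corpus tagset.
sample = [
    "<s>\n",
    "O\tDET\to\n",
    "rei\tNOM\trei\n",
    "chegou\tV\tchegar\n",
    "</s>\n",
]
counts = pos_counts(sample)
```

To run this over a real file, pass an open file handle instead of the sample list, for example pos_counts(open(path, encoding="utf-8")), after checking the file's actual column layout and encoding.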
Created:
2024-09-30