Annotated Corpus for Occitan
收藏NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/1182948
下载链接
链接失效反馈官方服务:
资源简介:
This corpus contains a collection of texts in Occitan which were manually annotated with parts-of-speech, lemmas.
The corpus was produced in the context of the RESTAURE project, funded by the French ANR. The current version of the corpus contains 28 documents and 12,425 tokens. The annotation process is detailed in the following article: http://hal.archives-ouvertes.fr/hal-01704806
The annotated versions are provided in a TSV CoNLL-U format.
创建时间:
2020-01-24



