five

Classical Tibetan corpus annotated for verb-argument dependency relations

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4727107
下载链接
链接失效反馈
官方服务:
资源简介:
This is a small hand-annotated partial treebank of Tibetan, primarily in CoNLL-U format. It builds upon the following corpus: Hill, Nathan W., & Garrett, Edward. (2017). A part-of-speech (POS) tagged corpus of Classical Tibetan [Data set]. Zenodo. http://doi.org/10.5281/zenodo.574878 This corpus differs from the above in three ways: The tagset has been converted from the SOAS tag system to the Universal Dependency part-of-speech tagset. We have added dependency relations between verbs and their argument. For some of the texts, English translations were available in digital form. These translations were manually aligned to the Tibetan texts and included in the CoNLL-U files. It was created as part of the AHRC-funded project Lexicography in Motion (PI Ulrich Pagel, 2017-2021).
创建时间:
2021-04-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作