Classical Tibetan corpus annotated for verb-argument dependency relations
收藏NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4727107
下载链接
链接失效反馈官方服务:
资源简介:
This is a small hand-annotated partial treebank of Tibetan, primarily in CoNLL-U format. It builds upon the following corpus:
Hill, Nathan W., & Garrett, Edward. (2017). A part-of-speech (POS) tagged corpus of Classical Tibetan [Data set]. Zenodo. http://doi.org/10.5281/zenodo.574878
This corpus differs from the above in three ways:
The tagset has been converted from the SOAS tag system to the Universal Dependency part-of-speech tagset.
We have added dependency relations between verbs and their argument.
For some of the texts, English translations were available in digital form. These translations were manually aligned to the Tibetan texts and included in the CoNLL-U files.
It was created as part of the AHRC-funded project Lexicography in Motion (PI Ulrich Pagel, 2017-2021).
创建时间:
2021-04-30



