Universal Treebank
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/ryanmcd/uni-dep-tb
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含12个标签的多语言词性标注数据集,覆盖了10种不同的语言。该数据集在处理过程中未使用任何外部资源,且报告的结果是基于mBERT上下文表示的。其所涉及的任务是词性标注。
This dataset is a multilingual part-of-speech tagging dataset with 12 tags, covering 10 distinct languages. No external resources were utilized during its processing, and the reported results are based on the contextual representations of mBERT. The task involved in this dataset is part-of-speech tagging.



