uacorpus/Rada_Trees
Source: Hugging Face. Updated 2025-07-08. Indexed 2025-10-25.
Download link:
https://hf-mirror.com/datasets/uacorpus/Rada_Trees
Description:
Rada_Trees is a syntactically annotated corpus of transcripts of the Ukrainian parliament (Verkhovna Rada) covering 1990 to 2024 and containing approximately 88 million tokens. The corpus is built from official plenary session transcripts and is available in three formats: plain text, Universal Dependencies (UD) annotation, and nlp_uk annotation based on the VESUM morphological dictionary. It is suitable for research on Ukrainian morphology and syntax, linguistic analysis of parliamentary discourse, and studies in sociolinguistics and political communication.
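UD annotation is commonly exchanged as CoNLL-U: one token per line with ten tab-separated columns, and a blank line between sentences. The sketch below assumes the UD files in this corpus follow that layout, which this listing does not confirm. The sample sentence, the `FIELDS` tuple, and the `parse_conllu` helper are hypothetical illustrations and are not taken from the corpus or its tooling.

```python
from collections import Counter

# Hypothetical two-token sample in CoNLL-U layout; NOT taken from Rada_Trees.
SAMPLE = (
    "# text = Шановні колеги\n"
    "1\tШановні\tшановний\tADJ\t_\t_\t2\tamod\t_\t_\n"
    "2\tколеги\tколега\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "\n"
)

# The ten standard CoNLL-U columns, in order.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


def parse_conllu(text):
    """Return a list of sentences, each a list of token dicts."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment such as "# text = ..."
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            continue  # skip malformed lines
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        current.append(dict(zip(FIELDS, cols)))
    if current:
        sentences.append(current)
    return sentences


sentences = parse_conllu(SAMPLE)
# Count part-of-speech tags across all parsed tokens.
upos_counts = Counter(tok["upos"] for sent in sentences for tok in sent)
```

For a real file, the same function could be applied to the file's text read with UTF-8 encoding; for a corpus of this size, processing one file at a time would avoid holding everything in memory.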
Provider:
uacorpus



