Annotated Uzbek Sentences Dataset for Binary Sentiment Classification
收藏Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/p58597ygp5
下载链接
链接失效反馈官方服务:
资源简介:
As part of this study we compiled the first open corpus of short Uzbek sentences annotated for binary sentiment classification.
Corpus size and composition
- 4 676 sentences in total: 3 042 — Positive, 1 634 — Negative.
- All texts are written in the modern Uzbek Latin alphabet and consist mainly of short everyday utterances (average length ≈ 6.0 tokens).
Raw class files: UZ_positive.txt and UZ_negative.txt (one sentence per line).
Upd (Version 2): Now it is available to download .csv format file, which has both negative and positive sentences and phrases.
创建时间:
2025-06-16



