five

Dataset of Vocabulary in Uzbek Primary Education

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/12699329
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset compiles words from two main sources: the "Explanatory Vocabulary of the Uzbek Language" (EDUL) and textbooks used across grades 1-4 in Uzbek primary schools (UPSC). The EDUL.txt file contains 29,190 words meticulously compiled by Urgench State University between 2019 and 2023. Additionally, the UPSC dataset includes 208,204 words extracted from primary school textbooks, sorted into separate files for each grade level. The dataset also identifies specific vocabulary words for each grade, supporting the enhancement of Uzbek language education and facilitating the development of natural language processing tools. Grade 1 lemma vocabulary: 3,188 words (all new words) Grade 2 lemma vocabulary: 4,630 words (including 1,997 new words) Grade 3 lemma vocabulary: 5,700 words (including 1,578 new words) Grade 4 lemma vocabulary: 6,397 words (including 1,356 new words) All files are conveniently packaged into a single ZIP archive for easy access and distribution.
创建时间:
2024-08-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作