Dataset for Uzbek Morphological Analyzer

Indexed from NIAID Data Ecosystem on 2026-05-01

Download link:

https://zenodo.org/record/8220536

Description:

The "Dataset for Uzbek Morphological Analyzer" is a linguistic resource for morphological analysis of the Uzbek language. It is a curated collection of inflectional endings drawn from words found in books and news platforms, chosen to cover a range of vocabulary and contexts. Each entry is annotated with the characteristics of its inflectional morphemes, such as tense, gender, number (singular or plural), case, and person. The division of each inflectional ending from its word was verified manually. The dataset's primary purpose is to support natural language processing tasks, including morphological analysis, part-of-speech tagging, lemmatization, and language modeling. Researchers and developers can use it to develop algorithms, improve linguistic applications, and study the morphological structure of Uzbek. The dataset is publicly available with documentation so that other researchers can replicate and build upon the work. It is anticipated that the dataset will contribute to advances in Uzbek language processing and to further research on morphological analysis.
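To make the idea of dividing inflectional endings from their words concrete, here is a minimal Python sketch. It is an illustration only and rests on several assumptions: the suffix table is a small hand-picked set of common Uzbek noun endings and is not taken from the dataset, the function name `split_endings` and the `min_stem` guard are hypothetical, and the greedy right-to-left stripping shown here is a simplification that will mis-segment stems that happen to end in a suffix-like string. The dataset's actual file format and annotation scheme are not described above, so none of this should be read as the dataset's own method.

```python
from typing import List, Tuple

# Hypothetical, hand-picked suffix table for illustration only.
# It is NOT the dataset's schema or contents.
SUFFIXES = {
    "lar": "plural",
    "ning": "genitive case",
    "ni": "accusative case",
    "ga": "dative case",
    "da": "locative case",
    "dan": "ablative case",
}


def split_endings(word: str, min_stem: int = 3) -> Tuple[str, List[Tuple[str, str]]]:
    """Strip known endings from the right; return (stem, endings in surface order)."""
    stem = word.lower()
    found: List[Tuple[str, str]] = []
    stripped = True
    while stripped:
        stripped = False
        # Try longer suffixes first so a longer ending wins over a shorter one.
        for suffix in sorted(SUFFIXES, key=len, reverse=True):
            # Keep at least `min_stem` characters so short words are not over-stripped.
            if stem.endswith(suffix) and len(stem) - len(suffix) >= min_stem:
                found.append((suffix, SUFFIXES[suffix]))
                stem = stem[: -len(suffix)]
                stripped = True
                break
    # Endings were collected from the outermost inward; restore left-to-right order.
    found.reverse()
    return stem, found


if __name__ == "__main__":
    for w in ["kitoblarni", "maktabda", "kitoblardan"]:
        print(w, "->", split_endings(w))
```

For example, under these assumptions `split_endings("kitoblarni")` separates the stem `kitob` from the plural ending `lar` and the accusative ending `ni`. A manually verified resource such as the one described above is what would let a real analyzer avoid the false splits this naive approach produces.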

Created:

2023-11-19
