erickfmm/wiktionary-spanish-all-forms-and-its-original-form
Source: Hugging Face. Updated 2025-10-14. Indexed 2025-10-25.
Download link:
https://hf-mirror.com/datasets/erickfmm/wiktionary-spanish-all-forms-and-its-original-form
Resource description:
The Spanish Word Forms and Lemmas Dataset contains pairs of inflected Spanish word forms and their corresponding lemmas (base forms), extracted from Wiktionary. It is suited to lemmatization, morphological analysis, text preprocessing, and language model training. The dataset holds approximately 760,000 form-lemma pairs in CSV format, with two columns: the inflected form of the word and its lemma.
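As a minimal sketch of how a two-column form/lemma CSV like this could be used for lemmatization, the Python snippet below builds a lookup table from inflected form to lemma. Several details are assumptions rather than facts from the listing: the column order (form first, lemma second), the presence of a header row, and the helper name `build_lemma_lookup`, which is hypothetical. The sample rows are hand-written for illustration and are not copied from the dataset.

```python
import csv
import io
from collections import defaultdict


def build_lemma_lookup(csv_file, has_header=True):
    """Map each inflected form to the set of lemmas listed for it.

    Assumes column 0 is the inflected form and column 1 is the lemma;
    check the real file before relying on this order.
    """
    reader = csv.reader(csv_file)
    if has_header:
        next(reader, None)  # skip the header row (assumed to be present)
    lookup = defaultdict(set)
    for row in reader:
        if len(row) < 2:
            continue  # ignore blank or malformed rows
        form, lemma = row[0].strip(), row[1].strip()
        lookup[form].add(lemma)
    return dict(lookup)


# Hand-written sample standing in for the real CSV file.
sample = io.StringIO("form,lemma\ncantamos,cantar\ncasas,casa\ncasas,casar\n")
lookup = build_lemma_lookup(sample)
print(sorted(lookup["casas"]))
```

A set is used per form because a single surface form can belong to more than one lemma ("casas" is both the plural of "casa" and a form of the verb "casar"), so a plain one-to-one dictionary would silently drop candidates. To run this on the actual file, pass an open file handle, for example `open(path, newline="", encoding="utf-8")`, in place of `sample`.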
Provider:
erickfmm



