CoBaLD/enhanced-cobald
收藏Hugging Face2025-05-31 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/CoBaLD/enhanced-cobald
下载链接
链接失效反馈官方服务:
资源简介:
增强型CoBaLD数据集是一个统一了Hugging Face Datasets API的CoBaLD数据集的伞形仓库。它提供了四种语言的单语种数据集,包括英语、俄语、匈牙利语和塞尔维亚语。数据集包含文本的单词、词形、词性、依存关系等特征,并分为训练集和验证集。该数据集适用于词汇分类任务,其标注由专家生成,遵循GPL-3.0许可。
The Enhanced CoBaLD Dataset is an umbrella repository for CoBaLD datasets that provides a unified Hugging Face Datasets API. It includes monolingual datasets for four languages: English, Russian, Hungarian, and Serbian. The dataset contains features such as words, lemmas, parts of speech, dependency relations, etc., and is split into training and validation sets. It is suitable for token classification tasks, with annotations generated by experts and licensed under GPL-3.0.
提供机构:
CoBaLD



