FrancophonIA/diacritics_restoration_systems
Source: Hugging Face. Updated 2025-03-30. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/FrancophonIA/diacritics_restoration_systems
Description:
A corpus of texts in 12 languages for training and evaluating diacritics restoration systems. Each language has training, development, and test sets drawn from Wikipedia articles, plus an additional, substantially larger training set drawn from general Web texts.
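As an illustration of how such a corpus is typically used, the sketch below derives an undiacritized input from a diacritized sentence and scores a prediction by word-level accuracy. It is not taken from the dataset or its authors: the function names `strip_diacritics`, `make_pair`, and `word_accuracy` are hypothetical, and the metric is only one common choice. It assumes diacritics are encoded as Unicode combining marks after NFD decomposition, so letters such as "ł" or "ø", which do not decompose, pass through unchanged.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Remove combining marks after NFD decomposition (assumed preprocessing)."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", stripped)


def make_pair(sentence: str) -> tuple[str, str]:
    """Return (input without diacritics, target with diacritics)."""
    return strip_diacritics(sentence), sentence


def word_accuracy(predicted: str, reference: str) -> float:
    """Fraction of whitespace-separated words that match exactly."""
    pred_words, ref_words = predicted.split(), reference.split()
    if len(pred_words) != len(ref_words):
        raise ValueError("predicted and reference must have the same word count")
    if not ref_words:
        return 1.0
    matches = sum(p == r for p, r in zip(pred_words, ref_words))
    return matches / len(ref_words)


if __name__ == "__main__":
    source, target = make_pair("Příliš žluťoučký kůň")
    print(source)
    print(word_accuracy(source, target))
```

A restoration system would be trained to map the first element of each pair back to the second; the stripped input itself serves as a trivial baseline for `word_accuracy`.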
Provider:
FrancophonIA



