CAMeL-Lab/BAREC-Shared-Task-2025-sent
收藏Hugging Face2025-06-11 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/CAMeL-Lab/BAREC-Shared-Task-2025-sent
下载链接
链接失效反馈官方服务:
资源简介:
BAREC 2025是一个专注于细粒度阿拉伯语可读性评估的大型数据集,包括超过一百万个单词,跨越19个可读性等级。该数据集在句子级别进行注释,同时提供文档级别的可读性评分。数据集分为训练集、验证集和测试集,并且在可读性等级、领域和文本类别上进行了平衡。
BAREC 2025 is a large-scale dataset focused on fine-grained Arabic readability assessment, including over 1 million words across 19 readability levels. The dataset is annotated at the sentence level and provides document-level readability scores. It is split into training, development, and test sets, balanced across readability levels, domains, and text classes.
提供机构:
CAMeL-Lab



