ACVA-Arabic-Cultural-Value-Alignment
Source: ModelScope community. Updated 2025-11-07. Indexed 2025-01-25.
Download link:
https://modelscope.cn/datasets/FreedomIntelligence/ACVA-Arabic-Cultural-Value-Alignment
Description:
# About ArabicCulture
The ArabicCulture dataset was generated by GPT-3.5 and contains more than 8,000 true/false questions.
The questions span 58 different areas.
Among the answers, "True" accounts for 59.62% and "False" for 40.38%.
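
The label shares above imply that a model which always answers "True" would already score about 59.62%, so that figure is a useful floor when reading accuracy numbers. A minimal sketch of how such shares can be computed, assuming the labels are available as a plain list of strings (the toy list below is illustrative, not real dataset rows):

```python
from collections import Counter


def label_shares(labels):
    """Return each label's share of the total, as a percentage rounded to 2 places."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: round(100 * n / total, 2) for label, n in counts.items()}


# Toy labels for illustration only (hypothetical, not taken from the dataset).
toy_labels = ["True"] * 3 + ["False"] * 2
shares = label_shares(toy_labels)

# The majority-class baseline is simply the largest share.
majority_baseline = max(shares.values())
print(shares, majority_baseline)
```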
# data-all
This subset contains all 8,000+ questions. We took 5 questions from each area as few-shot examples.
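
The per-area few-shot selection can be sketched as follows. This is an illustration under stated assumptions, not the authors' script: the field names `area`, `question`, and `answer` are hypothetical, the first 5 rows per area are taken (the source does not say how the 5 were chosen), and the selected rows are assumed to be held out from the remainder.

```python
from collections import defaultdict


def split_few_shot(rows, per_area=5):
    """Take the first `per_area` rows of each area as few-shot examples.

    Returns (few_shot, rest); rows not selected as few-shot go into `rest`.
    """
    taken = defaultdict(int)  # how many few-shot rows each area has so far
    few_shot, rest = [], []
    for row in rows:
        if taken[row["area"]] < per_area:
            few_shot.append(row)
            taken[row["area"]] += 1
        else:
            rest.append(row)
    return few_shot, rest


# Toy rows with hypothetical field names: 2 areas, 7 questions each.
toy_rows = [
    {"area": area, "question": f"q{i}", "answer": "True"}
    for area in ("A", "B")
    for i in range(7)
]
few_shot, rest = split_few_shot(toy_rows)
print(len(few_shot), len(rest))
```

With 58 areas and 5 examples each, this kind of split would set aside 290 questions as few-shot data.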
# data-select
We asked two Arab reviewers to judge 4,000 of the questions and kept only those that both reviewers considered good. This left about 2.4k questions covering 9 areas.
We split them into test and validation sets in the same way as above.
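
The agreement filter used for data-select amounts to keeping a question only when both reviewers approve it. A minimal sketch, assuming each reviewer's verdicts are available as a list of booleans aligned with the questions (the names `judge_a` and `judge_b` and the toy data are hypothetical):

```python
def keep_agreed(rows, judge_a, judge_b):
    """Keep only the rows that both reviewers marked as good."""
    if not (len(rows) == len(judge_a) == len(judge_b)):
        raise ValueError("rows and both verdict lists must have the same length")
    return [row for row, a, b in zip(rows, judge_a, judge_b) if a and b]


# Toy data for illustration only.
toy_questions = ["q0", "q1", "q2", "q3", "q4"]
judge_a = [True, True, False, True, False]
judge_b = [True, False, False, True, True]

selected = keep_agreed(toy_questions, judge_a, judge_b)
print(selected)
```

Requiring both reviewers to approve is the stricter of the two obvious choices (the other being approval by either one), which fits the drop from 4,000 reviewed questions to about 2.4k retained.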
Provider:
maas
Created:
2025-01-20



