ariaattarml/verified-reasoning-cot-gpqa-mmlu-pro
Hugging Face · Updated 2024-12-08 · Indexed 2024-12-14
Download link:
https://hf-mirror.com/datasets/ariaattarml/verified-reasoning-cot-gpqa-mmlu-pro
Resource overview:
This dataset contains reasoning traces drawn from two sources, GPQA Diamond and MMLU Pro, labeled with preference information based on correctness verification. Each example is a reasoning problem with its solution, and every example was evaluated for correctness using Claude 3.5 Sonnet and given a preference label accordingly. The data fields are the original question, the assistant's detailed reasoning (assistant response), the correct answer, the model answer, the source, and the preferred label. The dataset has a single training split containing 264 examples.
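As a rough illustration of the record layout described above, the sketch below models one example as a Python dataclass and tallies preferred examples per source. The field names, the source label strings, and the sample records are assumptions made for illustration. They are not confirmed column names or real rows from the dataset.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class ReasoningExample:
    # Hypothetical field names modeled on the listed data fields;
    # the actual column names in the dataset may differ.
    original_question: str
    assistant_response: str
    correct_answer: str
    model_answer: str
    source: str
    preferred: bool


def count_preferred_by_source(examples):
    """Return a dict mapping each source to its number of preferred examples."""
    counts = Counter()
    for ex in examples:
        if ex.preferred:
            counts[ex.source] += 1
    return dict(counts)


# Made-up records for illustration only (not taken from the dataset).
examples = [
    ReasoningExample("Q1", "step-by-step reasoning", "A", "A", "gpqa_diamond", True),
    ReasoningExample("Q2", "step-by-step reasoning", "C", "B", "gpqa_diamond", False),
    ReasoningExample("Q3", "step-by-step reasoning", "D", "D", "mmlu_pro", True),
]

preferred_counts = count_preferred_by_source(examples)
print(preferred_counts)
```

The same tally could be run over the real training split once the actual column names have been checked against the dataset page.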
Provided by:
ariaattarml



