ariaattarml/verified-reasoning-o1-gpqa-mmlu-pro
Source: Hugging Face | Updated: 2024-12-08 | Indexed: 2024-12-21
Download link:
https://hf-mirror.com/datasets/ariaattarml/verified-reasoning-o1-gpqa-mmlu-pro
Description:
This dataset contains reasoning traces drawn from two sources, GPQA Diamond and MMLU Pro, labeled with preference information based on correctness verification. It consists of reasoning problems and their solutions; each example was evaluated for correctness using Claude 3.5 Sonnet and labeled with a preference score accordingly. The data fields are: original question, assistant response, correct answer, model answer, source, and preferred.
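As a rough illustration of how the fields described above might be used, the sketch below computes the share of preferred responses per source. The key names (`source`, `preferred`), the assumption that `preferred` is boolean-like, and the toy records are all hypothetical; the actual column names and types should be checked against the dataset itself.

```python
from collections import defaultdict


def preferred_rate_by_source(records):
    """Return {source: fraction of records marked preferred}.

    Assumes each record is a dict with hypothetical keys
    'source' and 'preferred' (truthy when the response was verified correct).
    """
    totals = defaultdict(int)
    preferred = defaultdict(int)
    for rec in records:
        totals[rec["source"]] += 1
        if rec["preferred"]:
            preferred[rec["source"]] += 1
    return {src: preferred[src] / totals[src] for src in totals}


# Toy records invented for illustration only; not taken from the dataset.
toy_records = [
    {"source": "gpqa_diamond", "preferred": True},
    {"source": "gpqa_diamond", "preferred": False},
    {"source": "mmlu_pro", "preferred": True},
]

rates = preferred_rate_by_source(toy_records)
print(rates)
```

The same function would work on rows loaded from the real dataset, provided the key names are adjusted to match its schema.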
Provider:
ariaattarml



