EpistemeAI/gpqa-diamond-augmented-oss20b
收藏Hugging Face2025-09-11 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/EpistemeAI/gpqa-diamond-augmented-oss20b
下载链接
链接失效反馈官方服务:
资源简介:
GPQA-Diamond推理轨迹数据集(gpt-oss-20b)是一个包含从GPQA Diamond基准派生的细粒度推理轨迹的数据集,这些轨迹经过了开放权重gpt-oss-20b自回归变换器的增强。每个示例都明确保存了完整的思维链(CoT)输出,包括中间推理标记、决策路径和最终预测。该数据集旨在促进对大规模语言模型中的系统性调试、错误归因和解释性研究。
The GPQA-Diamond Reasoning Traces (gpt-oss-20b) dataset is a collection of fine-grained reasoning traces derived from the GPQA Diamond benchmark, augmented with responses generated by the open-weight gpt-oss-20b autoregressive transformer. Each example explicitly preserves the full chain-of-thought (CoT) output, including intermediate reasoning tokens, decision paths, and the final prediction. The dataset is designed to facilitate systematic debugging, error attribution, and interpretability research in large-scale language models.
提供机构:
EpistemeAI



