nvidia/AceReason-1.1-SFT
收藏Hugging Face2025-06-18 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/nvidia/AceReason-1.1-SFT
下载链接
链接失效反馈官方服务:
资源简介:
AceReason-1.1-SFT是一个多样化的、高质量的关注于数理和代码推理的监督微调(SFT)数据集。该数据集包含2,668,741个数学样本和1,301,591个代码样本,数据来源于OpenMathReasoning、NuminaMath-CoT、OpenCodeReasoning、MagicoderEvolInstruct、opc-sft-stage2、leetcode、TACO和apps等多个数据集。数据集经过去污染处理,并过滤掉了与测试样本有9-gram重叠的样本。
AceReason-1.1-SFT is a diverse and high-quality supervised fine-tuning (SFT) dataset focused on math and code reasoning. It contains 2,668,741 math samples and 1,301,591 code samples from sources such as OpenMathReasoning, NuminaMath-CoT, OpenCodeReasoning, MagicoderEvolInstruct, opc-sft-stage2, leetcode, TACO, and apps. The dataset has undergone decontamination and filtering to ensure sample quality.
提供机构:
nvidia



