collinear-ai/R1-Distill-SFT-Curated
收藏Hugging Face2025-03-19 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/collinear-ai/R1-Distill-SFT-Curated
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了从ServiceNow-AI/R1-Distill-SFT数据集中的NuminaMath样本,具有问题、解决方案、来源、R1-Distill-Qwen-32B、R1-Distill-Qwen-32B消息和正确性等特征。问题、解决方案和来源特征来自NuminaMath数据集,分别对应问题陈述、真实解决方案和问题来源。R1-Distill-Qwen-32B特征包含Qwen 32B模型对问题的响应,R1-Distill-Qwen-32B消息特征是带有聊天格式的响应。正确性特征包含Collinear审稿人C1和C2对生成响应正确性的判断。
This dataset consists of NuminaMath samples from the ServiceNow-AI/R1-Distill-SFT dataset and includes features such as problem, solution, source, R1-Distill-Qwen-32B, R1-Distill-Qwen-32B-messages, and correctness. The problem, solution, and source features are from the NuminaMath dataset, corresponding to the problem statement, ground truth solution, and problem source, respectively. The R1-Distill-Qwen-32B feature contains the responses generated by the Qwen 32B model to the problem, and the R1-Distill-Qwen-32B-messages feature is the response formatted with chat. The correctness feature includes judgments on the correctness of the generated response by Collinear curators C1 and C2 as a json object, where the values corresponding to the keys C1 and C2 are the curator predictions regarding the correctness of the response.
提供机构:
collinear-ai



