
marin-community/open-thoughts-4-6k-math-qwen3-235b-a22b-disagreed-answers

Source: Hugging Face. Updated 2026-01-22. Indexed 2026-02-07.
Download link:
https://hf-mirror.com/datasets/marin-community/open-thoughts-4-6k-math-qwen3-235b-a22b-disagreed-answers
Description:
This dataset contains 5,982 samples from the OpenThoughts4 math dataset where both Qwen3-32B and Qwen3-235B-A22B produced valid answers that differ. This dataset contains the Qwen3-235B-A22B reasoning traces. It is one of 10 child datasets derived from two parent datasets, each containing 29,963 samples. The `ms_id` values are unique across all child datasets and match between paired 32B/235B datasets. The dataset is useful for error analysis, ablation studies, and difficulty analysis.
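
Because `ms_id` matches between the paired 32B/235B datasets, rows from the two can be aligned for error analysis. The following is a minimal sketch of that join, not the dataset authors' tooling. It uses toy in-memory rows in place of the real data, and the `answer` column and the `pair_by_ms_id` helper are hypothetical names chosen for illustration; only `ms_id` comes from the description above.

```python
# Toy rows standing in for the paired child datasets. Only `ms_id` is named in
# the dataset description; `answer` is a hypothetical column for illustration.
rows_32b = [
    {"ms_id": 101, "answer": "12"},
    {"ms_id": 102, "answer": "7/3"},
]
rows_235b = [
    {"ms_id": 102, "answer": "2.5"},
    {"ms_id": 101, "answer": "14"},
]


def pair_by_ms_id(left, right):
    """Join two lists of row dicts on `ms_id`, keeping ids present in both."""
    # Index the right-hand rows once so each lookup is a dictionary access.
    right_index = {row["ms_id"]: row for row in right}
    pairs = []
    for row in left:
        match = right_index.get(row["ms_id"])
        if match is not None:
            pairs.append((row["ms_id"], row["answer"], match["answer"]))
    return pairs


pairs = pair_by_ms_id(rows_32b, rows_235b)
for ms_id, answer_32b, answer_235b in pairs:
    print(ms_id, answer_32b, answer_235b)
```

With the real data, the two row lists would come from this dataset and its 32B counterpart, and the actual column names should be checked against the dataset viewer before relying on them.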
Provider:
marin-community