mmqm/m196k-dedup-decon-filter_easy-r1-filter_wrong-decon_eval-domain_1k
Source: Hugging Face. Updated: 2025-03-28. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/mmqm/m196k-dedup-decon-filter_easy-r1-filter_wrong-decon_eval-domain_1k
Description:
This dataset is intended for training machine learning models. Each record contains an answer index, source text, metadata, prompt text, the answer character, and the answer string, together with the answer strings extracted from several models (such as qwen_7b, qwen_32b, and r1) and a correctness flag for each. Records also include the reasoning process and domain information. The dataset has a single training split of 1,000 examples.
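As a minimal sketch of how the per-model correctness flags might be used, the snippet below computes the share of records each model answered correctly. The listing names the models but not the exact column keys, so the `<model>_correct` naming, the `accuracy_by_model` helper, and the toy rows are all assumptions made for illustration.

```python
from collections import defaultdict

# Assumed column naming: the listing mentions qwen_7b, qwen_32b and r1,
# but does not give the exact keys, so "<model>_correct" is hypothetical.
MODELS = ("qwen_7b", "qwen_32b", "r1")


def accuracy_by_model(records, models=MODELS):
    """Return the fraction of records each model answered correctly."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for row in records:
        for model in models:
            key = f"{model}_correct"
            if key in row:  # skip rows that lack this model's flag
                total[model] += 1
                correct[model] += bool(row[key])
    # Only report models that had at least one flagged row.
    return {m: correct[m] / total[m] for m in models if total[m]}


# Toy rows invented for illustration only; they are not taken from the dataset.
toy_rows = [
    {"qwen_7b_correct": False, "qwen_32b_correct": True, "r1_correct": True},
    {"qwen_7b_correct": False, "qwen_32b_correct": False, "r1_correct": True},
]
print(accuracy_by_model(toy_rows))
```

Real rows could presumably be obtained with `datasets.load_dataset(...)` using the dataset name above and `split="train"`, assuming the repository is publicly accessible; the column names should be checked against the actual schema before relying on this helper.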
Provider:
mmqm



