marin-community/open-thoughts-4-8530-math-kimi-k2pt5-annotated-32768-tokens

Name: marin-community/open-thoughts-4-8530-math-kimi-k2pt5-annotated-32768-tokens
Creator: marin-community
Published: 2026-04-17 23:33:02
License: 暂无描述

Hugging Face2026-04-17 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/marin-community/open-thoughts-4-8530-math-kimi-k2pt5-annotated-32768-tokens

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: split: train num_examples: 68240 license: apache-2.0 --- # open-thoughts-4-8530-math-kimi-k2pt5-annotated-32768-tokens This dataset contains 68240 rows (8530 math prompts x 8 responses each) generated by **Kimi K2.5** with a max token length of 32768. ## Columns | Column | Description | |--------|-------------| | `row_id` | Original row identifier | | `instruction_seed` | The math prompt | | `_source` | Source dataset for the prompt | | `gpt41_mini_response` | GPT-4.1-mini reference response | | `__original_row_idx` | Original row index in source | | `length` | Prompt length | | `ms_id` | Source ID | | `kimi_k2pt5_generated_text` | Response generated by Kimi K2.5 (the primary response column) | | `final_answer` | Last `\boxed{...}` answer extracted from `kimi_k2pt5_generated_text` after `</think>` | | `complete_responses_count` | Number of the 8 responses per prompt that are "complete" (have `</think>` followed by `\boxed{...}`) | ## Stats - **Complete responses**: 66368 rows have at least one complete sibling - **Prompts with all 8 complete**: 4810 / 8530 - **Prompts with 0 complete**: 234 / 8530 ## Purpose This dataset shares the same 8530 prompts as [marin-community/open-thoughts-4-8530-math-qwen3-30b-a3B-thinking-2507-annotated-32768-tokens](https://huggingface.co/datasets/marin-community/open-thoughts-4-8530-math-qwen3-30b-a3B-thinking-2507-annotated-32768-tokens), enabling direct comparison of Kimi K2.5 vs Qwen3-30B-A3B responses on the same math problems.

--- 数据集信息：拆分集：训练集（train）样本总数：68240 许可证：Apache-2.0 --- # open-thoughts-4-8530-math-kimi-k2pt5-annotated-32768-tokens 本数据集共包含68240条数据（即8530道数学提示词，每道提示词对应8条生成回复），由**Kimi K2.5**生成，单条文本最大Token长度为32768。 ## 字段说明 | 字段名 | 描述 | |--------|-------------| | `row_id` | 原始行标识符 | | `instruction_seed` | 数学提示词（prompt） | | `_source` | 该提示词的源数据集 | | `gpt41_mini_response` | GPT-4.1-mini 参考回复 | | `__original_row_idx` | 源数据集中的原始行索引 | | `length` | 提示词长度 | | `ms_id` | 源数据标识 | | `kimi_k2pt5_generated_text` | Kimi K2.5 生成的回复（核心回复字段） | | `final_answer` | 从`kimi_k2pt5_generated_text`的`</think>`标记后提取的最后一个`oxed{...}`格式答案 | | `complete_responses_count` | 单提示词的8条回复中「完整回复」的数量，完整回复指包含`</think>`后接`oxed{...}`格式内容的回复 | ## 统计信息 - **完整回复覆盖**：66368条数据至少拥有1条完整的同提示词回复 - **8条回复均完整的提示词**：4810 / 8530 - **无完整回复的提示词**：234 / 8530 ## 数据集用途本数据集与[marin-community/open-thoughts-4-8530-math-qwen3-30b-a3B-thinking-2507-annotated-32768-tokens](https://huggingface.co/datasets/marin-community/open-thoughts-4-8530-math-qwen3-30b-a3B-thinking-2507-annotated-32768-tokens)使用完全相同的8530道数学提示词，可直接对比Kimi K2.5与Qwen3-30B-A3B在同一数学题目上的生成回复效果。

提供机构：

marin-community

5,000+

优质数据集

54 个

任务类型

进入经典数据集