marin-community/open-thoughts-4-128-math-qwen3pt5-397b-annotated-32768-tokens
收藏Hugging Face2026-03-21 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/marin-community/open-thoughts-4-128-math-qwen3pt5-397b-annotated-32768-tokens
下载链接
链接失效反馈官方服务:
资源简介:
# open-thoughts-4-128-math-qwen3pt5-397b-annotated-32768-tokens
Math reasoning responses generated by **Qwen3.5-397B-A17B** (Qwen/Qwen3.5-397B-A17B) via the Together AI serverless API.
## Overview
- **Total rows:** 1,024
- **Unique prompts:** 128 (each with 8 response annotations)
- **Source prompts:** [marin-community/open-thoughts-4-128-math-qwen3-32b-annotated-32768-tokens-n8-reformatted](https://huggingface.co/datasets/marin-community/open-thoughts-4-128-math-qwen3-32b-annotated-32768-tokens-n8-reformatted)
- **Generation model:** [Qwen/Qwen3.5-397B-A17B](https://huggingface.co/Qwen/Qwen3.5-397B-A17B)
- **Max tokens:** 32,768
- **Temperature:** 0.8
- **Tokenizer used for stats:** Qwen/Qwen3.5-397B-A17B
## Statistics
| Metric | Value |
|--------|-------|
| Avg tokens per response | 24,834 |
| Median tokens per response | 26,270 |
| Responses with `<think>` tag | 100.0% |
| Complete responses (has `</think>` + `\boxed{...}`) | 686/1024 (67.0%) |
| Truncated responses | 338/1024 (33.0%) |
| Empty responses | 0 |
## Columns
| Column | Description |
|--------|-------------|
| `row_id` | Sequential identifier (0-1023) |
| `instruction_seed` | The math problem prompt |
| `qwen3pt5_397b_generated_text` | Qwen3.5-397B-A17B generated response (with `<think>...</think>` reasoning trace) |
| `ms_id` | Math seed ID -- groups all 8 responses for the same prompt |
| `_source` | Source dataset identifier |
| `gpt41_mini_response` | GPT-4.1 mini reference response |
| `length` | Response length |
## Response Format
Each response in the `qwen3pt5_397b_generated_text` column follows this format:
```
<think>
[model's reasoning trace]
</think>
[final answer, typically containing \boxed{...}]
```
Responses that are truncated (hit the 32,768 token limit) may be missing the closing `</think>` tag and/or the `\boxed{...}` answer.
## Construction
Generated by sending each of the 128 math prompts to Qwen3.5-397B-A17B 8 times (n=8) via the Together AI serverless endpoint, with `max_tokens=32768` and `temperature=0.8`. The model's reasoning trace (from the `message.reasoning` API field) is wrapped in `<think>...</think>` tags.
提供机构:
marin-community



