reasoning-degeneration-dev/TIW-Qwen3-4B-Thinking-2507-cd5arg-iterative
收藏Hugging Face2025-12-19 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/reasoning-degeneration-dev/TIW-Qwen3-4B-Thinking-2507-cd5arg-iterative
下载链接
链接失效反馈官方服务:
资源简介:
这是一个关于倒计时推理任务的数据集,使用迭代策略生成最多4096个token的响应。数据集包含100行和13列,每列都有详细的描述,如问题陈述、元数据、任务来源、格式化提示、模型响应等。生成参数部分详细说明了使用的模型、超参数和实验设置。
This is a dataset for countdown reasoning tasks, using an iterative strategy to generate responses up to 4096 tokens. The dataset contains 100 rows and 13 columns, each with detailed descriptions such as problem statements, metadata, task sources, formatted prompts, model responses, etc. The generation parameters section details the model used, hyperparameters, and experimental setup.
提供机构:
reasoning-degeneration-dev



