reasoning-degeneration-dev/TIW-qwen3-4B-Instruct-2507-cd5arg-iterative
收藏Hugging Face2025-12-19 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/reasoning-degeneration-dev/TIW-qwen3-4B-Instruct-2507-cd5arg-iterative
下载链接
链接失效反馈官方服务:
资源简介:
使用迭代策略进行倒计时推理,目标token数为4096。数据集包含100行和13列,各列的具体描述包括问题陈述、元数据、任务来源、格式化提示、模型响应、token计数、轮次计数、继续提示、继续文本、策略类型、答案、评估和评估元数据等。
Countdown reasoning with iterative strategy to 4096 tokens. The dataset contains 100 rows and 13 columns, with detailed descriptions for each column including problem statement, metadata, task source, formatted prompt, model responses, token counts, round counts, continuation prompts, continuation text, strategy type, answers, evaluations, and evaluation metadata.
提供机构:
reasoning-degeneration-dev



