reasoning-degeneration-dev/TIW-Qwen3-4B-Thinking-2507-cd5arg-random_noise-FAILED
Source: Hugging Face. Updated 2025-12-19; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/reasoning-degeneration-dev/TIW-Qwen3-4B-Thinking-2507-cd5arg-random_noise-FAILED
Description:
This dataset contains 100 rows and 10 columns for a countdown reasoning task, with responses generated under the random_noise strategy to 16384 tokens. Its columns hold the problem statement, metadata (JSON containing the target number, the available numbers, and the solution), a task type identifier, the initial prompt messages, the full model response, the total token count per sample, the number of continuation rounds per sample, the continuation prompt texts, the continuation texts, and the prompting strategy type (iterative, budget_forcing, or random_noise).
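To make the metadata column concrete, here is a minimal Python sketch of how one such JSON record might be parsed and checked. The key names `target`, `numbers`, and `solution`, the sample record, and the assumption that the solution is stored as an arithmetic expression string are all hypothetical; the listing only says that the JSON holds a target number, the available numbers, and a solution, so the real field names and format may differ.

```python
import ast
import json
import operator
from collections import Counter

# Hypothetical record: key names and values are illustrative assumptions,
# not taken from the actual dataset.
SAMPLE_METADATA = '{"target": 24, "numbers": [2, 3, 4, 6], "solution": "6 * 4 * (3 - 2)"}'

# Only the four basic arithmetic operators are accepted.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def _evaluate(node, used):
    """Evaluate a restricted arithmetic AST, recording each literal in `used`."""
    if (
        isinstance(node, ast.Constant)
        and isinstance(node.value, (int, float))
        and not isinstance(node.value, bool)
    ):
        used.append(node.value)
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
        left = _evaluate(node.left, used)
        right = _evaluate(node.right, used)
        return _OPS[type(node.op)](left, right)
    raise ValueError("unsupported expression element")


def check_countdown(metadata_json):
    """Return True if the stored solution reaches the target using each
    available number at most once."""
    meta = json.loads(metadata_json)
    used = []
    tree = ast.parse(meta["solution"], mode="eval")
    value = _evaluate(tree.body, used)
    # Counter subtraction keeps only numbers used more often than available.
    within_budget = not (Counter(used) - Counter(meta["numbers"]))
    return within_budget and abs(value - meta["target"]) < 1e-9
```

Calling `check_countdown(SAMPLE_METADATA)` validates the hypothetical record above. The expression is walked through `ast` rather than passed to `eval`, so only numeric literals and the four basic operators are accepted.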
Provider:
reasoning-degeneration-dev



