reasoning-degeneration-dev/TIW-qwen3-4B-Instruct-2507-cd5arg-random_noise
收藏Hugging Face2025-12-19 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/reasoning-degeneration-dev/TIW-qwen3-4B-Instruct-2507-cd5arg-random_noise
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个用于倒计时推理任务的数据集,采用random_noise策略,目标token数为4096。数据集包含100行,13列,列包括问题描述、元数据、任务来源、格式化提示、模型响应、token计数、轮次计数、继续提示、继续文本、策略类型、答案、评估和评估元数据等。数据集的生成参数包括模型名称、超参数、输入数据集、实验描述等。
This dataset is for countdown reasoning tasks using the random_noise strategy with a target of 4096 tokens. It contains 100 rows and 13 columns, including question description, metadata, task source, formatted prompt, model responses, token counts, round counts, continuation prompts, continuation text, strategy type, answers, evaluations, and evaluation metadata. The generation parameters include model name, hyperparameters, input datasets, experiment description, etc.
提供机构:
reasoning-degeneration-dev



