himanshunakrani9/qwen-reasoning-samples-20260421_221240
收藏Hugging Face2026-04-21 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/himanshunakrani9/qwen-reasoning-samples-20260421_221240
下载链接
链接失效反馈官方服务:
资源简介:
10个合成推理示例,通过vLLM使用Qwen/Qwen3.6-35B-A3B模型生成,采用结构化提示设计以引发前沿级(Opus 4.7类)多阶段推理。每个示例都通过质量过滤器,要求推理跟踪遵循明确的6阶段结构(理解→分解→探索→执行→验证→反思),并包括独立的验证步骤。生成细节包括模型、服务器、生成日期、温度、top-p、最大令牌数、主题多样性和难度等级。数据集模式包括主题、难度、问题、推理、验证、答案和常见错误。
10 synthetic reasoning examples generated with Qwen/Qwen3.6-35B-A3B via vLLM, using a structured prompt designed to elicit frontier-level (Opus 4.7 class) multi-phase reasoning. Each example is gated through a quality filter that requires the reasoning trace to follow an explicit 6-phase structure (Understand → Decompose → Explore → Execute → Verify → Reflect) and to include an independent verification step. Generation details include model, server, generation date, temperature, top-p, max tokens, topic diversity, and difficulty level. Dataset schema includes topic, difficulty, question, reasoning, verification, answer, and pitfalls.
提供机构:
himanshunakrani9



