
aimosprite/brian-rollouts-311-specforge-conversations

Source: Hugging Face. Updated: 2026-03-26. Indexed: 2026-03-29.
Download link:
https://hf-mirror.com/datasets/aimosprite/brian-rollouts-311-specforge-conversations
Resource description:
---
pretty_name: brian-rollouts-311-specforge-conversations
license: mit
tags:
- specforge
- gpt-oss
- harmony
- conversations
- reasoning
- rollouts
- math
task_categories:
- text-generation
size_categories:
- 10K<n<100K
---

# brian-rollouts-311-specforge-conversations

Full-rollout `conversations` SpecForge training export derived from `aimosprite/brian-rollouts-311-compact-turns`.

## Contents

- `brian-rollouts-311-specforge-conversations.jsonl`
- `brian-rollouts-311-specforge-conversations.jsonl.manifest.json`

Each JSONL row is one reconstructed rollout attempt in SpecForge `conversations` format. Harmony messages are reconstructed from the compact turn snapshots and merged so that repeated assistant text is not duplicated:

```json
{
  "id": "amobench::amo-bench-1::attempt0",
  "conversations": [
    { "role": "system", "content": "..." },
    { "role": "user", "content": "..." },
    { "role": "assistant_analysis", "content": "..." },
    { "role": "assistant_final", "content": "..." }
  ],
  "dataset": "amobench",
  "problem_id": "amo-bench-1",
  "attempt_id": 0,
  "turn_count": 18,
  "rollout_date": "2026-03-11",
  "source_dataset_id": "aimosprite/brian-rollouts-311-compact-turns"
}
```

## Intended usage

This export is meant for current SpecForge training flows that consume `conversations` JSONL with the GPT-OSS chat template:

```bash
torchrun --standalone --nproc_per_node 8 scripts/train_eagle3.py \
  --train-data-path ./brian-rollouts-311-specforge-conversations.jsonl \
  --chat-template gpt-oss
```

This export follows SpecForge's actual Harmony loading path: `safe_conversations_generator -> build_eagle3_dataset -> HarmonyParser`. Consecutive `assistant_analysis` and `assistant_final` messages are preserved as distinct Harmony messages, and assistant loss is applied by SpecForge across assistant spans until the next `user` message.

## Source

- Source dataset: `aimosprite/brian-rollouts-311-compact-turns`
- Conversion: token-id decoding with `openai/gpt-oss-120b`
- Rows: 2,096
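
As a quick sanity check before training, the row format described above can be inspected with a few lines of Python. This is a minimal sketch, not part of SpecForge: the helper names (`iter_rows`, `role_counts`, `assistant_flags`) and the `REQUIRED_KEYS` set are hypothetical and chosen for illustration. The role-prefix check only approximates the described behavior of applying loss to assistant spans; SpecForge's own parser may decide this differently.

```python
import json
from collections import Counter
from typing import Iterable, Iterator, List

# Assumption: these keys are taken from the example row in the card;
# the card does not state which fields are strictly required.
REQUIRED_KEYS = {"id", "conversations", "dataset", "problem_id", "attempt_id"}


def iter_rows(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed row per non-empty JSONL line."""
    for line_number, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines, e.g. a trailing newline
        row = json.loads(line)
        missing = REQUIRED_KEYS - row.keys()
        if missing:
            raise ValueError(f"line {line_number}: missing keys {sorted(missing)}")
        yield row


def role_counts(row: dict) -> Counter:
    """Count how many messages of each role a rollout contains."""
    return Counter(message["role"] for message in row["conversations"])


def assistant_flags(row: dict) -> List[bool]:
    """Mark messages whose role starts with 'assistant'.

    Illustration only: this mirrors the card's description of assistant
    spans, not SpecForge's actual loss-mask code.
    """
    return [m["role"].startswith("assistant") for m in row["conversations"]]


# In-memory sample shaped like the example row in the card.
sample_line = json.dumps({
    "id": "amobench::amo-bench-1::attempt0",
    "conversations": [
        {"role": "system", "content": "..."},
        {"role": "user", "content": "..."},
        {"role": "assistant_analysis", "content": "..."},
        {"role": "assistant_final", "content": "..."},
    ],
    "dataset": "amobench",
    "problem_id": "amo-bench-1",
    "attempt_id": 0,
})

rows = list(iter_rows([sample_line, ""]))
for row in rows:
    print(row["id"], dict(role_counts(row)), assistant_flags(row))
```

To run the same check on the real export, pass an open file handle instead of the sample list, for example `iter_rows(open("brian-rollouts-311-specforge-conversations.jsonl", encoding="utf-8"))`.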
Provider:
aimosprite