five

aimosprite/brian-rollouts-311-specforge-turn-conversations

收藏
Hugging Face2026-03-26 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/aimosprite/brian-rollouts-311-specforge-turn-conversations
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: brian-rollouts-311-specforge-turn-conversations license: mit tags: - specforge - gpt-oss - harmony - conversations - reasoning - rollouts - math task_categories: - text-generation size_categories: - 10K<n<100K --- # brian-rollouts-311-specforge-turn-conversations Turn-level `conversations` SpecForge training export derived from `aimosprite/brian-rollouts-311-compact-turns`. ## Contents - `brian-rollouts-311-specforge-turn-conversations.jsonl` - `brian-rollouts-311-specforge-turn-conversations.jsonl.manifest.json` Each JSONL row is a single assistant call rendered as a full SpecForge `conversations` sample. The visible prefix context is intentionally repeated across rows so that every assistant turn is independently trainable: ```json { "id": "amobench::amo-bench-1::attempt0::turn0", "conversations": [ { "role": "system", "content": "..." }, { "role": "user", "content": "..." }, { "role": "assistant_analysis", "content": "..." } ], "dataset": "amobench", "problem_id": "amo-bench-1", "attempt_id": 0, "turn_index": 0, "turn_number": 1, "turn_count": 18, "rollout_date": "2026-03-11", "source_dataset_id": "aimosprite/brian-rollouts-311-compact-turns" } ``` ## Intended usage This export is meant for current SpecForge training flows that consume `conversations` JSONL with the GPT-OSS chat template: ```bash torchrun --standalone --nproc_per_node 8 scripts/train_eagle3.py \ --train-data-path ./brian-rollouts-311-specforge-turn-conversations.jsonl \ --chat-template gpt-oss ``` This turn-level export is preferable to one-row-per-rollout data for compact GPT-OSS traces because later prompts do not retain earlier hidden `analysis` messages. Exploding to one row per assistant call preserves the current turn's reasoning/final output. ## Source - Source dataset: `aimosprite/brian-rollouts-311-compact-turns` - Conversion: token-id decoding with `openai/gpt-oss-120b` - Rows: 69,373
提供机构:
aimosprite
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作