LumiVore/lumivore-stage2-training-data

Name: LumiVore/lumivore-stage2-training-data
Creator: LumiVore
Published: 2026-03-22 13:21:49
License: 暂无描述

Hugging Face2026-03-22 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/LumiVore/lumivore-stage2-training-data

下载链接

链接失效反馈

官方服务：

资源简介：

# Lumivore Stage 2 Training Dataset **Version:** 4.5-B **Created:** March 2026 **Purpose:** LoRA fine-tuning on OpenClaw-specific agent behavior **Total Examples:** ~11,880 **Format:** Alpaca (instruction, input, output) --- ## Overview This dataset was used for Stage 2 of the Lumivore-1.2B training pipeline. It focuses specifically on OpenClaw agent behavior — tool use, structured reasoning, and agentic task execution patterns. Stage 2 was a LoRA fine-tuning (rank=64) on top of the Stage 1 full fine-tuning, allowing the model to specialize while preserving general capabilities. --- ## Data Sources | Source | Description | Proportion | |--------|-------------|------------| | **OpenClaw v3.0** | Session logs with tool reasoning chains | ~50% | | **Agent Tool Use** | OpenClaw docs-derived examples | ~25% | | **ClawHub Skills** | Skill documentation and usage patterns | ~25% | --- ## Dataset Characteristics - **Task types:** Tool calls, agent loops, reasoning chains, structured JSON outputs - **Style:** Agentic, action-oriented, OpenClaw-specific - **Quality:** Filtered for correctness, PII sanitized - **Augmentation:** 5x linguistic variations --- ## Training Configuration Used with the following hyperparameters: ```python # Stage 2 Training (LoRA) - Base model: Stage 1 output (/workspace/checkpoints/stage1/final) - LoRA rank: 64, alpha: 128 - LoRA targets: q_proj, v_proj, gate - Batch size: 1 (micro), gradient_accumulation: 16 - Effective batch: 16 - Max sequence length: 1024 - Learning rate: 5e-6 (10x lower than Stage 1) - Optimizer: 8-bit AdamW - Epochs: 3 - Steps: 2,115 - Duration: ~5 hours on AMD RX 7600 XT - Peak VRAM: ~3.3/16.0 GB ``` --- ## Files - `train.jsonl` — Training examples (~11,288 after split) - `validation.jsonl` — Validation examples (~5% split) - `metadata.json` — Dataset metadata and statistics - `README.md` — This documentation --- ## Usage ```python from datasets import load_dataset dataset = load_dataset("LumiVore/lumivore-stage2-training-data") train_data = dataset["train"] val_data = dataset["validation"] ``` --- ## Related - **Stage 1 Dataset:** `LumiVore/lumivore-stage1-training-data` — General agentic fine-tuning - **Stage 3 Dataset:** `LumiVore/lumivore-stage3-identity-dataset` — Identity and conversational training - **Model:** `LumiVore/lumivore-1.2b` (when published) --- ## Citation If you use this dataset, please cite: ```bibtex @dataset{lumivore2026stage2, title={Lumivore Stage 2 Training Dataset}, author={LumiVore AI}, year={2026}, url={https://huggingface.co/datasets/LumiVore/lumivore-stage2-training-data} } ``` --- *Created for the Lumivore-1.2B training pipeline*

# Lumivore 第二阶段训练数据集 **版本：** 4.5-B **创建时间：** 2026年3月 **用途：** 针对OpenClaw特定智能体行为的低秩适配（Low-Rank Adaptation, LoRA）微调 **总样本数：** 约11880条 **格式：** Alpaca格式，字段包含instruction（指令）、input（输入）、output（输出） --- ## 概述本数据集用于Lumivore-1.2B训练流水线的第二阶段，专门聚焦OpenClaw智能体行为——涵盖工具调用、结构化推理以及智能体任务执行模式。第二阶段是在第一阶段全量微调的基础上开展低秩适配（LoRA）微调（秩=64），可在保留模型通用能力的同时实现专业化适配。 --- ## 数据源 | 数据源 | 描述 | 占比 | |--------|-------------|------------| | **OpenClaw v3.0** | 包含工具推理链的会话日志 | 约50% | | **Agent Tool Use** | 源自OpenClaw官方文档的示例数据 | 约25% | | **ClawHub Skills** | 技能文档与实际使用模式 | 约25% | --- ## 数据集特征 - **任务类型：** 工具调用、智能体循环、推理链、结构化JSON输出 - **风格：** 智能体导向、行动取向、专为OpenClaw定制 - **质量：** 经过正确性过滤，已完成个人可识别信息（Personally Identifiable Information, PII）脱敏处理 - **数据增强：** 进行了5倍的语言变体扩充 --- ## 训练配置使用以下超参数进行训练： python # 第二阶段LoRA训练 - 基础模型：第一阶段最终输出（/workspace/checkpoints/stage1/final） - LoRA秩：64，alpha值：128 - LoRA目标模块：q_proj、v_proj、gate - 微批次大小：1，梯度累积步数：16 - 有效批次大小：16 - 最大序列长度：1024 - 学习率：5e-6（比第一阶段低10倍） - 优化器：8-bit AdamW - 训练轮次：3 - 训练步数：2115 - 训练时长：在AMD RX 7600 XT显卡上约5小时 - 峰值显存占用：约3.3/16.0 GB --- ## 数据集文件 - `train.jsonl` — 训练样本集（拆分后约11288条） - `validation.jsonl` — 验证样本集（占拆分集的约5%） - `metadata.json` — 数据集元数据与统计信息 - `README.md` — 本文档说明 --- ## 使用方法 python from datasets import load_dataset dataset = load_dataset("LumiVore/lumivore-stage2-training-data") train_data = dataset["train"] val_data = dataset["validation"] --- ## 相关资源 - **第一阶段数据集：** `LumiVore/lumivore-stage1-training-data` — 通用智能体微调数据集 - **第三阶段数据集：** `LumiVore/lumivore-stage3-identity-dataset` — 身份与对话训练数据集 - **配套模型：** `LumiVore/lumivore-1.2b`（待正式发布） --- ## 引用规范若使用本数据集，请引用如下文献： bibtex @dataset{lumivore2026stage2, title={Lumivore Stage 2 Training Dataset}, author={LumiVore AI}, year={2026}, url={https://huggingface.co/datasets/LumiVore/lumivore-stage2-training-data} } --- *专为Lumivore-1.2B训练流水线打造*

提供机构：

LumiVore

5,000+

优质数据集

54 个

任务类型

进入经典数据集