ToastyPigeon/personality-assistant-9b-data

Name: ToastyPigeon/personality-assistant-9b-data
Creator: ToastyPigeon
Published: 2026-03-23 20:47:57
License: 暂无描述

Hugging Face2026-03-23 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/ToastyPigeon/personality-assistant-9b-data

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 task_categories: - text-generation tags: - personality - assistant - tool-calling - qwen - sft --- # Personality Assistant 9B Training Data Training data for a mixed personality-based assistant with tool calling capabilities, used to fine-tune Qwen3.5-9B. ## Dataset Composition **2,997 samples total** (~5.8M tokens): | Component | Samples | Description | |-----------|---------|-------------| | Personality/Persona | 1,925 | Character-based assistant conversations with diverse personas | | Tool Calling | 1,072 | Function calling / tool use examples formatted for Qwen3.5 | ## Files - `combined.jsonl` — Pre-shuffled combined dataset ready for training (2,997 samples) - `sources/personality_combined_thinking.jsonl` — Personality data with thinking traces (1,925 samples) - `sources/toucan_cireilclaw_toolcall_qwen35.jsonl` — Tool calling data formatted for Qwen3.5 (1,072 samples) ## Format Each sample is a JSON object with a `messages` array (ChatML format: system/user/assistant turns). Personality samples include a `character` field with persona metadata. Tool calling samples include a `tools` field with function definitions. ## Training Originally trained as Qwen3.5-9B Stage 2 SFT with: - QLoRA r=32, alpha=16, RSLoRA - Assistant-only loss - 3% eval split

--- 许可证：Apache-2.0 任务类别： - 文本生成标签： - 人格 - 助手 - 工具调用 - Qwen - 监督微调（SFT） --- # 人格助手9B训练数据集本数据集为具备工具调用能力的混合人格型助手训练数据，用于对Qwen3.5-9B模型进行微调。 ## 数据集构成 **总计2997条样本（约580万Token）**： | 组件类型 | 样本数量 | 描述 | |---------|---------|-----| | 人格/人设 | 1925 | 包含多样化人设的基于角色的助手对话数据 | | 工具调用 | 1072 | 适配Qwen3.5格式的函数调用/工具使用示例数据 | ## 数据集文件 - `combined.jsonl`：已完成预洗牌的合并训练数据集，共2997条样本 - `sources/personality_combined_thinking.jsonl`：附带思维链痕迹的人格类对话数据，共1925条样本 - `sources/toucan_cireilclaw_toolcall_qwen35.jsonl`：适配Qwen3.5格式的工具调用数据，共1072条样本 ## 数据格式每条样本为JSON对象，包含`messages`数组（采用ChatML格式，包含系统、用户、助手三轮对话）。人格类样本包含`character`字段，用于存储人设元数据。工具调用类样本包含`tools`字段，用于存储函数定义信息。 ## 训练配置本数据集最初用于Qwen3.5-9B的第二阶段监督微调（SFT），训练配置如下： - QLoRA参数：秩r=32，alpha=16，采用RSLoRA技术 - 仅计算助手轮次的损失值 - 划分3%的数据作为评估集

提供机构：

ToastyPigeon

5,000+

优质数据集

54 个

任务类型

进入经典数据集