Phora68/dr-sage-dataset
Source: Hugging Face · Updated: 2026-03-19 · Indexed: 2026-03-29
Download link:
https://hf-mirror.com/datasets/Phora68/dr-sage-dataset
Description:
---
language:
- en
license: apache-2.0
tags:
- mental-health
- therapy
- counseling
- conversational
- alpaca
- sharegpt
- bisarx
task_categories:
- conversational
- text-generation
size_categories:
- 1K<n<10K
---
# Dr. Sage — Therapeutic Conversation Dataset
**5,287 training records** for fine-tuning conversational psychiatric AI.
Created: **2026-03-19**
---
## Dataset description
This dataset trains models to follow the **BISARX therapeutic interviewing method**:
- Reflect what the patient said
- Name observed patterns honestly but without shame
- Never lecture — always end with ONE focused question
- Do not condone harmful habits — name them directly
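
The "always end with ONE focused question" rule above lends itself to a rough automated check. The sketch below is a hypothetical heuristic, not part of the dataset tooling: it assumes a compliant reply contains exactly one question mark and that the question mark is the final character. The sample replies are invented for illustration.

```python
def ends_with_single_question(response: str) -> bool:
    """Heuristic: the reply contains exactly one '?' and it is the last character."""
    text = response.strip()
    return text.endswith("?") and text.count("?") == 1


# Invented sample replies, for illustration only.
focused = "That sounds exhausting. What part of the day feels heaviest?"
scattered = "Why do you think that is? Have you tried sleeping more?"
lecture = "You should set firmer boundaries with your family."

for reply in (focused, scattered, lecture):
    print(ends_with_single_question(reply), "|", reply)
```

A check like this only catches surface violations (no question, or several questions); it says nothing about whether the question is actually focused.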
---
## Files
| File | Format | Records | Use for |
|------|--------|---------|---------|
| `dr_sage_alpaca_4k.json` | Alpaca JSON | 5,287 | General SFT |
| `dr_sage_sharegpt_4k.jsonl` | ShareGPT JSONL | 5,287 | Chat template training (Unsloth) |
| `dr_sage_text_4k.jsonl` | Text JSONL | 5,287 | SFTTrainer `dataset_text_field` |
| `train_4k.json` | Alpaca JSON | 4,758 | Training split (90%) |
| `eval_4k.json` | Alpaca JSON | 528 | Eval split (10%) |
| `category_map.json` | JSON | — | Category → ID mapping |
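
The files above come in two container formats, `.json` and `.jsonl`. A minimal standard-library loader might look like the sketch below. It assumes that each `.json` file holds a single top-level array of records (the usual Alpaca layout) and that each `.jsonl` file holds one JSON object per line; the card does not state either layout explicitly, and `load_records` is a hypothetical helper name.

```python
import json
from pathlib import Path


def load_records(path):
    """Load records from a .jsonl file (one object per line) or a .json file (one array)."""
    path = Path(path)
    text = path.read_text(encoding="utf-8")
    if path.suffix == ".jsonl":
        # Skip blank lines so a trailing newline does not break parsing.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    # Assumption: the .json files contain a top-level JSON array.
    return json.loads(text)


# Example usage once the files are downloaded locally:
# train = load_records("train_4k.json")
# chat = load_records("dr_sage_sharegpt_4k.jsonl")
```

The listed split sizes (4,758 and 528) sum to 5,286, one fewer than the 5,287 records in the full files, so it may be worth checking `len()` on the loaded splits before training.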
---
## Schema
**Alpaca format:**
```json
{
  "instruction": "I've felt empty for months and I don't know why.",
  "input": "",
  "output": "Months of emptiness without a clear cause — that's its own kind of disorienting. When you say empty, do you mean you feel nothing, or that you feel something you can't name?",
  "category": "depression",
  "source": "synthetic"
}
```
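
Before training on the Alpaca files, a quick structural check on each record can catch malformed rows. The sketch below assumes every record carries the five string fields shown in the example above; `alpaca_problems` is a hypothetical helper, and the card does not guarantee that all records follow this shape.

```python
ALPACA_FIELDS = ("instruction", "input", "output", "category", "source")


def alpaca_problems(record: dict) -> list[str]:
    """Return human-readable problems; an empty list means the record looks fine."""
    problems = []
    for field in ALPACA_FIELDS:
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], str):
            problems.append(f"field is not a string: {field}")
    # "input" may legitimately be empty, but a prompt and a reply are required.
    for field in ("instruction", "output"):
        if isinstance(record.get(field), str) and not record[field].strip():
            problems.append(f"empty field: {field}")
    return problems


record = {
    "instruction": "I've felt empty for months and I don't know why.",
    "input": "",
    "output": "When you say empty, do you mean you feel nothing?",
    "category": "depression",
    "source": "synthetic",
}
print(alpaca_problems(record))
```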
**ShareGPT format:**
```json
{
  "conversations": [
    {"from": "system", "value": "You are Dr. Sage..."},
    {"from": "human", "value": "I've felt empty for months..."},
    {"from": "gpt", "value": "Months of emptiness without a clear cause..."}
  ],
  "category": "depression",
  "source": "synthetic"
}
```
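
Many chat-template tools expect `role`/`content` messages rather than ShareGPT's `from`/`value` turns. The sketch below converts one record using the common convention `system` → `system`, `human` → `user`, `gpt` → `assistant`. The mapping and the helper name `sharegpt_to_messages` are assumptions for illustration; the card does not specify how the author's training pipeline performs this step.

```python
ROLE_BY_SPEAKER = {"system": "system", "human": "user", "gpt": "assistant"}


def sharegpt_to_messages(record: dict) -> list[dict]:
    """Convert a ShareGPT-style record into a list of role/content messages."""
    return [
        {"role": ROLE_BY_SPEAKER[turn["from"]], "content": turn["value"]}
        for turn in record["conversations"]
    ]


record = {
    "conversations": [
        {"from": "system", "value": "You are Dr. Sage..."},
        {"from": "human", "value": "I've felt empty for months..."},
        {"from": "gpt", "value": "Months of emptiness without a clear cause..."},
    ],
    "category": "depression",
    "source": "synthetic",
}
messages = sharegpt_to_messages(record)
print([m["role"] for m in messages])
```

An unknown speaker label raises `KeyError`, which is deliberate here: silently dropping turns would corrupt the conversation.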
---
## Category map (17 categories)
| ID | Category |
|----|----------|
| 0 | anger |
| 1 | anxiety |
| 2 | boundaries_pleasing |
| 3 | crisis_safety |
| 4 | depression |
| 5 | family_dynamics |
| 6 | general_therapeutic |
| 7 | grief_loss |
| 8 | identity_transition |
| 9 | loneliness |
| 10 | physical_somatic |
| 11 | relationships |
| 12 | self_esteem |
| 13 | shame_esteem |
| 14 | substance_avoidance |
| 15 | trauma |
| 16 | work_burnout |
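
The IDs in the table coincide with the alphabetical order of the category names, so the mapping can be rebuilt without `category_map.json`. This is a sketch under that observation only; the actual structure of `category_map.json` is not shown in this card, and the file should be treated as authoritative if the two ever disagree.

```python
CATEGORIES = [
    "relationships", "general_therapeutic", "depression", "anxiety",
    "family_dynamics", "trauma", "work_burnout", "anger", "loneliness",
    "substance_avoidance", "shame_esteem", "grief_loss",
    "identity_transition", "self_esteem", "boundaries_pleasing",
    "crisis_safety", "physical_somatic",
]

# Alphabetical order reproduces the IDs listed in the table above.
category_to_id = {name: idx for idx, name in enumerate(sorted(CATEGORIES))}
id_to_category = {idx: name for name, idx in category_to_id.items()}

print(category_to_id["depression"], id_to_category[16])
```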
---
## Category breakdown
| Category | Samples |
|----------|---------|
| relationships | 675 |
| general_therapeutic | 596 |
| depression | 558 |
| anxiety | 544 |
| family_dynamics | 414 |
| trauma | 333 |
| work_burnout | 251 |
| anger | 250 |
| loneliness | 243 |
| substance_avoidance | 242 |
| shame_esteem | 232 |
| grief_loss | 230 |
| identity_transition | 229 |
| self_esteem | 225 |
| boundaries_pleasing | 225 |
| crisis_safety | 30 |
| physical_somatic | 10 |
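
The counts above sum to the 5,287 records stated at the top of the card, but the distribution is uneven. The sketch below computes each category's share and flags categories below a 1% threshold; the threshold is an arbitrary assumption chosen for illustration.

```python
CATEGORY_COUNTS = {
    "relationships": 675, "general_therapeutic": 596, "depression": 558,
    "anxiety": 544, "family_dynamics": 414, "trauma": 333,
    "work_burnout": 251, "anger": 250, "loneliness": 243,
    "substance_avoidance": 242, "shame_esteem": 232, "grief_loss": 230,
    "identity_transition": 229, "self_esteem": 225,
    "boundaries_pleasing": 225, "crisis_safety": 30, "physical_somatic": 10,
}

total = sum(CATEGORY_COUNTS.values())
shares = {name: count / total for name, count in CATEGORY_COUNTS.items()}

# Hypothetical cutoff: categories making up less than 1% of the data.
RARE_THRESHOLD = 0.01
rare = [name for name, share in shares.items() if share < RARE_THRESHOLD]

print(total, rare)
```

With only 30 and 10 samples respectively, `crisis_safety` and `physical_somatic` are likely too small to evaluate reliably on a 10% split, so per-category metrics for them should be read with caution.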
---
## Trained model
See [Phora68/dr-sage-qwen2.5-3b](https://huggingface.co/Phora68/dr-sage-qwen2.5-3b) for the fine-tuned model.
## Disclaimer
For research and educational purposes only.
Not a substitute for professional mental health care.
Provider:
Phora68



