five

Phora68/dr-sage-dataset

收藏
Hugging Face2026-03-19 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Phora68/dr-sage-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: apache-2.0 tags: - mental-health - therapy - counseling - conversational - alpaca - sharegpt - bisarx task_categories: - conversational - text-generation size_categories: - 1K<n<10K --- # Dr. Sage — Therapeutic Conversation Dataset **5,287 training records** for fine-tuning conversational psychiatric AI. Created: **2026-03-19** --- ## Dataset description This dataset trains models to follow the **BISARX therapeutic interviewing method**: - Reflect what the patient said - Name observed patterns honestly but without shame - Never lecture — always end with ONE focused question - Do not condone harmful habits — name them directly --- ## Files | File | Format | Records | Use for | |------|--------|---------|---------| | `dr_sage_alpaca_4k.json` | Alpaca JSON | 5,287 | General SFT | | `dr_sage_sharegpt_4k.jsonl` | ShareGPT JSONL | 5,287 | Chat template training (Unsloth) | | `dr_sage_text_4k.jsonl` | Text JSONL | 5,287 | SFTTrainer `dataset_text_field` | | `train_4k.json` | Alpaca JSON | 4,758 | Training split (90%) | | `eval_4k.json` | Alpaca JSON | 528 | Eval split (10%) | | `category_map.json` | JSON | — | Category → ID mapping | --- ## Schema **Alpaca format:** ```json { "instruction": "I've felt empty for months and I don't know why.", "input": "", "output": "Months of emptiness without a clear cause — that's its own kind of disorienting. When you say empty, do you mean you feel nothing, or that you feel something you can't name?", "category": "depression", "source": "synthetic" } ``` **ShareGPT format:** ```json { "conversations": [ {"from": "system", "value": "You are Dr. Sage..."}, {"from": "human", "value": "I've felt empty for months..."}, {"from": "gpt", "value": "Months of emptiness without a clear cause..."} ], "category": "depression", "source": "synthetic" } ``` --- ## Category map (17 categories) | ID | Category | |----|----------| | 0 | anger | | 1 | anxiety | | 2 | boundaries_pleasing | | 3 | crisis_safety | | 4 | depression | | 5 | family_dynamics | | 6 | general_therapeutic | | 7 | grief_loss | | 8 | identity_transition | | 9 | loneliness | | 10 | physical_somatic | | 11 | relationships | | 12 | self_esteem | | 13 | shame_esteem | | 14 | substance_avoidance | | 15 | trauma | | 16 | work_burnout | --- ## Category breakdown | Category | Samples | |----------|---------| | relationships | 675 | | general_therapeutic | 596 | | depression | 558 | | anxiety | 544 | | family_dynamics | 414 | | trauma | 333 | | work_burnout | 251 | | anger | 250 | | loneliness | 243 | | substance_avoidance | 242 | | shame_esteem | 232 | | grief_loss | 230 | | identity_transition | 229 | | self_esteem | 225 | | boundaries_pleasing | 225 | | crisis_safety | 30 | | physical_somatic | 10 | --- ## Trained model See [Phora68/dr-sage-qwen2.5-3b](https://huggingface.co/Phora68/dr-sage-qwen2.5-3b) for the fine-tuned model. ## Disclaimer For research and educational purposes only. Not a substitute for professional mental health care.
提供机构:
Phora68
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作