ourafla/Mental-Health_Text-Classification_Dataset
收藏Hugging Face2025-12-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/ourafla/Mental-Health_Text-Classification_Dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含用户生成的短文本,标注为4类心理健康分类:自杀、抑郁、焦虑和正常。它是一个衍生数据集,通过合并和清理三个公开的心理健康语料库创建,然后重新标记为统一的4类方案,并导出适合经典机器学习和现代NLP模型的CSV文件。存储库包括:一个不平衡的主要训练语料库(真实的类别倾斜)、一个严格平衡的测试分割用于公平评估,以及一个包含基本文本统计(长度、URL、表情符号、标点符号等)的特征工程文件。重要提示:此数据集仅用于研究和教育目的,不是临床工具,不得用于现实世界的诊断、分诊或危机干预。
This dataset contains short, user‑generated texts labeled for 4‑class mental health classification: Suicidal, Depression, Anxiety, and Normal. It is a derived dataset created by combining and cleaning three public mental‑health corpora, then re‑labeling them into a unified 4‑class scheme and exporting CSV files suitable for both classical ML and modern NLP models. The repository includes: An unbalanced main training corpus (realistic class skew), A strictly balanced test split for fair evaluation, and A feature‑engineered file with basic text statistics (length, URLs, emojis, punctuation, etc.). Important: This dataset is intended for research and education only. It is not a clinical tool and must not be used for real‑world diagnosis, triage, or crisis intervention.
提供机构:
ourafla



