five

ourafla/Mental-Health_Text-Classification_Dataset

收藏
Hugging Face2025-12-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/ourafla/Mental-Health_Text-Classification_Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含用户生成的短文本,标注为4类心理健康分类:自杀、抑郁、焦虑和正常。它是一个衍生数据集,通过合并和清理三个公开的心理健康语料库创建,然后重新标记为统一的4类方案,并导出适合经典机器学习和现代NLP模型的CSV文件。存储库包括:一个不平衡的主要训练语料库(真实的类别倾斜)、一个严格平衡的测试分割用于公平评估,以及一个包含基本文本统计(长度、URL、表情符号、标点符号等)的特征工程文件。重要提示:此数据集仅用于研究和教育目的,不是临床工具,不得用于现实世界的诊断、分诊或危机干预。

This dataset contains short, user‑generated texts labeled for 4‑class mental health classification: Suicidal, Depression, Anxiety, and Normal. It is a derived dataset created by combining and cleaning three public mental‑health corpora, then re‑labeling them into a unified 4‑class scheme and exporting CSV files suitable for both classical ML and modern NLP models. The repository includes: An unbalanced main training corpus (realistic class skew), A strictly balanced test split for fair evaluation, and A feature‑engineered file with basic text statistics (length, URLs, emojis, punctuation, etc.). Important: This dataset is intended for research and education only. It is not a clinical tool and must not be used for real‑world diagnosis, triage, or crisis intervention.
提供机构:
ourafla
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作