five

indominousx/suicide-ideation-detection

收藏
Hugging Face2025-11-24 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/indominousx/suicide-ideation-detection
下载链接
链接失效反馈
官方服务:
资源简介:
# Suicide Ideation Detection Dataset ## Dataset Description This dataset contains 232,000 Reddit posts from r/SuicideWatch, labeled for self-harm intent and suicidal ideation. ### Dataset Summary - **Size:** 232,000 samples - **Source:** Reddit r/SuicideWatch community (Kaggle) - **Task:** Binary classification (suicidal ideation vs. non-suicidal content) - **Language:** English ### Data Fields - `text`: The Reddit post content (string) - `label`: Binary label (0 = no suicidal ideation, 1 = contains suicidal ideation) ### Dataset Statistics - **Total Samples:** 232,000 - **Label Distribution:** Class-imbalanced (see paper for details) - **Average Text Length:** ~100-500 characters - **Source Subreddit:** r/SuicideWatch ### Intended Use This dataset is designed for: - Training suicide ideation detection models - Crisis intervention system development - Mental health safety research for AI systems - Academic research in computational psychiatry ### Licensing **License:** Creative Commons Attribution 4.0 International (CC BY 4.0) You are free to: - Share — copy and redistribute the material - Adapt — remix, transform, and build upon the material Under the following terms: - Attribution — You must give appropriate credit ### Citation If you use this dataset, please cite: ```bibtex @dataset{suicide_ideation_2025, title={Suicide Ideation Detection Dataset from Reddit}, author={LLM Psychological Safety Project}, year={2025}, publisher={HuggingFace Datasets}, source={Kaggle/Reddit r/SuicideWatch}, url={https://huggingface.co/datasets/indominousx/suicide-ideation-detection} } ``` ### Ethical Considerations ⚠️ **CRITICAL SAFETY NOTICE:** - This dataset contains highly sensitive mental health crisis content - Should ONLY be used for safety research and crisis prevention systems - Not for training general-purpose models without proper safeguards - Researchers should have ethics board approval - Content may be distressing - viewer discretion advised ### Crisis Resources If you or someone you know is in crisis: - **US:** National Suicide Prevention Lifeline: 988 - **US:** Crisis Text Line: Text HOME to 741741 - **International:** https://www.iasp.info/resources/Crisis_Centres/ ### Contact For questions or issues, please open an issue in the GitHub repository: https://github.com/indominousx/LLM_Pscho_Safety
提供机构:
indominousx
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作