Bangla Mental Health Dataset V2
收藏DataCite Commons2026-04-21 更新2026-05-04 收录
下载链接:
https://data.mendeley.com/datasets/23tcfxkgc2/2
下载链接
链接失效反馈官方服务:
资源简介:
This dataset is a synthetically generated Bangla-language mental health dataset consisting of 10,000 structured conversational samples. It is designed to support research in natural language processing (NLP), particularly for low-resource languages such as Bangla, with applications in large language model (LLM) fine-tuning, mental health text classification, and dialogue system development.
Each sample follows an instruction-based format (input–instruction–output), making the dataset directly suitable for supervised fine-tuning (SFT), Alpaca-style training, and parameter-efficient methods such as LoRA and QLoRA. The dataset captures a diverse range of approximately 40 mental health-related conditions, including stress, anxiety, overthinking, lack of emotional support, and self-confidence issues, expressed in natural Bangla conversational patterns.
The dataset is fully synthetic and was generated using controlled text generation pipelines informed by mental health literature, psychological reports, media discussions, and publicly available educational content. No real user data or personally identifiable information (PII) is included.
This dataset is intended strictly for research and educational purposes. It is not suitable for clinical use, diagnosis, or real-world mental health decision-making. The resource aims to facilitate safe and reproducible experimentation in Bangla NLP and conversational AI.
提供机构:
Mendeley Data
创建时间:
2026-04-21



