five

Farsi Mental State and Emotional Needs Dataset for Psychological Text Mining

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/ksmcbg69rr
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset consists of 993 validated text samples annotated manually by a human expert in Persian. Each sample contains free-form user-generated text expressing emotional distress, mental health concerns, or existential reflection. The samples were collected and labeled between June and July 2025 from public online Persian-language mental health forums and social discussion platforms. The raw texts were anonymized and selected based on depth of psychological content and relevance to emotion-focused analysis. Texts involving offensive or highly clinical content were excluded. Each entry in the dataset has the following structure: { "id": 75, "text": "دوست دارم یکی توی این دنیا باشه التماسش کنم... چرا اینجوری شد؟...", "mental_state": ["hopelessness", "regret"], "emotional_need": ["closure", "forgiveness"] } id: a unique numeric identifier text: the original Persian user-generated sample mental_state: a list of inferred emotional or psychological states (e.g., hopelessness, numbness, anxiety) emotional_need: a list of underlying emotional or psychological needs (e.g., support, validation, escape) The labels were assigned by a native Farsi-speaking expert with a background in psychology and AI. The annotation schema was designed to reflect both observable emotional states and implied psychological needs. The labels were selected from a predefined taxonomy derived from psychological literature, but expanded inductively to include emergent patterns. The dataset has been cleaned and formatted as a UTF-8 encoded JSON file. All records have been reviewed to remove duplicates and structural inconsistencies. No personally identifiable information (PII) is present. Applications This dataset is intended for use in Natural Language Processing (NLP) applications involving emotion detection, psychological profiling, affective computing, and Farsi language modeling. It is particularly valuable for research in AI for mental health, empathetic chatbots, and emotion-aware dialogue systems. Potential users include: Researchers in psychology and AI NLP practitioners working on low-resource languages Developers of mental health screening tools or therapeutic AI
创建时间:
2025-07-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作