
Dataset of suicidal ideation texts in Brazilian Portuguese - Boamente System

Indexed by NIAID Data Ecosystem on 2026-05-01
Download link:
https://zenodo.org/record/10065214
Resource description:
We obtained non-clinical texts from tweets (user posts on the online social network Twitter). To find suicide-related tweets, we used the Twitter API to download tweets with customized queries based on search terms associated with suicide. After several experiments to retrieve relevant texts, 5699 tweets were collected in May 2021. Each downloaded tweet carried user-specific information (for example, user ID, timestamp, language, location, and number of likes), but we kept only the post content (the suicide-related text) and discarded the additional data. All texts are therefore anonymized.

After data collection, three psychologists were invited to annotate the data, each labeling every tweet individually. To avoid bias in the annotation process, we selected psychologists with different psychological approaches, namely cognitive behavioral theory, psychoanalytic theory, and humanistic theory. The professionals classified each tweet as negative for suicidal ideation (annotated as 0) or positive for suicidal ideation (annotated as 1).

All tweets with at least one divergence between psychologists (n = 1513) were excluded, resulting in a dataset with 4186 instances. A further 398 duplicate tweets were then excluded, leaving 3788 instances. The final dataset consists of 2691 instances labeled negative and 1097 labeled positive.
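The filtering steps described above (keep only tweets on which all three annotators agree, then drop duplicates) can be sketched as follows. This is a minimal illustration, not the authors' code: the `Annotated` record, the `build_dataset` helper, and the toy rows are hypothetical, and it assumes duplicates are exact text matches with the first occurrence kept, which the description does not specify. The last lines only check that the reported counts are arithmetically consistent.

```python
from typing import List, NamedTuple, Tuple


class Annotated(NamedTuple):
    """Hypothetical record: one tweet text plus the three annotators' labels."""
    text: str
    labels: Tuple[int, int, int]  # 0 = negative, 1 = positive for suicidal ideation


def build_dataset(rows: List[Annotated]) -> List[Tuple[str, int]]:
    """Keep unanimously labeled tweets, then drop exact-duplicate texts."""
    seen = set()
    dataset = []
    for row in rows:
        # Exclude any tweet with at least one divergence between annotators.
        if len(set(row.labels)) != 1:
            continue
        # Assumption: a duplicate is an exact text match; keep the first one.
        if row.text in seen:
            continue
        seen.add(row.text)
        dataset.append((row.text, row.labels[0]))
    return dataset


# Toy data (invented placeholders, not taken from the dataset).
toy_rows = [
    Annotated("text a", (1, 1, 1)),
    Annotated("text b", (0, 1, 0)),  # divergence, excluded
    Annotated("text c", (0, 0, 0)),
    Annotated("text a", (1, 1, 1)),  # duplicate, excluded
]
toy_dataset = build_dataset(toy_rows)

# Consistency check on the counts reported in the description.
collected, divergent, duplicates = 5699, 1513, 398
unanimous = collected - divergent        # 4186
final_size = unanimous - duplicates      # 3788
assert unanimous == 4186
assert final_size == 2691 + 1097
```

On the toy rows, `toy_dataset` keeps two entries, one positive and one negative.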
Created:
2023-11-04