five

Toxic Sentence Classification Dataset with labels of categories such as religion, mental health, race, sex, body image, disability, physical abuse, and politics

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14196418
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset has a collection of various toxic sentences belonging to different categories. It was collected from various sources. It indicates which category each sentence belongs to. The values of the category columns are binary 1 or 0 indicating whether the sentence belongs to that particular category or not. Each sentence belongs to only 1 category.    Columns:1.comment_text: Contains toxic sentences that are insensitive and offensive, focusing on various categories.2.mental_health: Binary value 1 indicates that the sentence focuses on mental health.3.Race:Binary value 1 indicates that the sentence is racist.4.sex:Binary value 1 indicates that the sentence focuses on sexuality.5.body_image:Binary value 1 indicates that the sentence focuses on body image.6.disability:Binary value 1 indicates that the sentence focuses on physical disability and related issues.7.religion:Binary value 1 indicates that the sentence can be triggering to people who are extremely religious.8.physical_abuse:Binary value 1 indicates that the sentence focuses on physical abuse issues.9.politics:Binary value 1 indicates that the sentence focuses on political issues.
创建时间:
2024-11-21
二维码
社区交流群
二维码
科研交流群
商业服务