irlab-udc/metahate-sample
收藏Hugging Face2024-01-04 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/irlab-udc/metahate-sample
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- text-classification
language:
- en
pretty_name: metahate-sample
size_categories:
- n<1K
---
# MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection (SAMPLE)
This is a 100-entry sample of a meta-collection of 36 hate speech datasets from social media comments.
## Dataset Structure
The original dataset contains 1,226,202 social media posts in a TSV file. This is a sample of 100 entries. Each element contains the following fields:
| Field Name | Type | Possible Values | Description |
|------------|------|-----------------|----------------------------------------------------------------------|
| text | str | any | Social media post. Each post is unique. |
| label | int | 0, 1 | Label of the post. 0 for non-hate speech posts, 1 for hate speech. |
提供机构:
irlab-udc
原始信息汇总
MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection (SAMPLE)
数据集概述
这是一个包含100条数据的样本集,源自一个包含36个仇恨言论数据集的元集合,原始数据集包含1,226,202条社交媒体帖子。
数据集结构
数据集以TSV文件格式存储,本样本包含100条数据。每条数据包含以下字段:
| 字段名 | 类型 | 可能值 | 描述 |
|---|---|---|---|
| text | str | 任意 | 社交媒体帖子,每条帖子都是唯一的。 |
| label | int | 0, 1 | 帖子的标签,0表示非仇恨言论,1表示仇恨言论。 |



