asas-ai/Arabic_Offensive_Comment_Detection
收藏Hugging Face2024-05-08 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/asas-ai/Arabic_Offensive_Comment_Detection
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
dataset_info:
features:
- name: Id
dtype: int64
- name: Platform
dtype: string
- name: Comment
dtype: string
- name: Majority_Label
dtype: string
- name: Agreement
dtype: float64
- name: NumOfJudgementUsed
dtype: int64
- name: Total_Judgement
dtype: int64
- name: Vulgar:V/HateSpeech:HS/None:-
dtype: string
splits:
- name: train
num_bytes: 1184762
num_examples: 4000
download_size: 561173
dataset_size: 1184762
license: apache-2.0
task_categories:
- text-classification
language:
- ar
size_categories:
- 1K<n<10K
tags:
- Offensive Language Detection
---
# Dataset Card for "Arabic_Offensive_Comment_Detection"
## Paper:
Shammur Absar Chowdhury, Hamdy Mubarak, Ahmed Abdelali, Soon-gyo Jung, Bernard J. Jansen, and Joni Salminen. 2020. A Multi-Platform Arabic News Comment Dataset for Offensive Language Detection. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 6203–6212, Marseille, France. European Language Resources Association.
提供机构:
asas-ai
原始信息汇总
数据集概述
数据集名称
Arabic_Offensive_Comment_Detection
数据集配置
- 配置名称: default
- 数据文件:
- 分割: train
- 路径: data/train-*
数据集特征
- Id: int64
- Platform: string
- Comment: string
- Majority_Label: string
- Agreement: float64
- NumOfJudgementUsed: int64
- Total_Judgement: int64
- Vulgar:V/HateSpeech:HS/None:-: string
数据集分割
- 名称: train
- 字节数: 1184762
- 示例数: 4000
数据集大小
- 下载大小: 561173
- 数据集大小: 1184762
许可证
apache-2.0
任务类别
- text-classification
语言
- ar
大小类别
- 1K<n<10K
标签
- Offensive Language Detection



