ctoraman/gender-hate-speech-turkish
收藏Hugging Face2023-11-30 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ctoraman/gender-hate-speech-turkish
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-sa-4.0
task_categories:
- text-classification
language:
- tr
tags:
- hate speech
- hate speech detection
- hate-speech
- tweets
- social media
- topic
- hate-speech-detection
- gender identity
- gender
---
The "gender identity" subset of the large-scale dataset published in the LREC 2022 paper "Large-Scale Hate Speech Detection with Cross-Domain Transfer".
This subset is also used in the experiments of "Şahinuç, F., Yilmaz, E. H., Toraman, C., & Koç, A. (2023). The effect of gender bias on hate speech detection. Signal, Image and Video Processing, 17(4), 1591-1597."
The "gender identity" subset includes 20,000 tweets in Turkish.
The published data split is the first fold of 10-fold cross-validation, used in the experiments mentioned above.
Train split has 18,000 tweets. Test split has 2,000 tweets.
HateLabel:
- 0 Normal
- 1 Offensive
- 2 Hate
# GitHub Repo:
https://github.com/avaapm/hatespeech
# If you use this dataset, please cite the following papers:
- Toraman, C., Şahinuç, F., & Yilmaz, E. (2022, June). Large-Scale Hate Speech Detection with Cross-Domain Transfer. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp. 2215-2225).
- Şahinuç, F., Yilmaz, E. H., Toraman, C., & Koç, A. (2023). The effect of gender bias on hate speech detection. Signal, Image and Video Processing, 17(4), 1591-1597.
提供机构:
ctoraman
原始信息汇总
数据集概述
基本信息
- 许可证: cc-by-nc-sa-4.0
- 任务类别: text-classification
- 语言: tr
- 标签:
- hate speech
- hate speech detection
- hate-speech
- tweets
- social media
- topic
- hate-speech-detection
- gender identity
- gender
数据集详情
- 数据集名称: "gender identity" subset
- 来源: 来自LREC 2022论文 "Large-Scale Hate Speech Detection with Cross-Domain Transfer"
- 使用情况: 用于Şahinuç, F., Yilmaz, E. H., Toraman, C., & Koç, A. (2023) 的实验 "The effect of gender bias on hate speech detection"
- 数据量: 包含20,000条土耳其语推文
- 数据划分:
- 训练集: 18,000条推文
- 测试集: 2,000条推文
- 标签类别:
- 0: Normal
- 1: Offensive
- 2: Hate
引用
- Toraman, C., Şahinuç, F., & Yilmaz, E. (2022, June). Large-Scale Hate Speech Detection with Cross-Domain Transfer. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp. 2215-2225).
- Şahinuç, F., Yilmaz, E. H., Toraman, C., & Koç, A. (2023). The effect of gender bias on hate speech detection. Signal, Image and Video Processing, 17(4), 1591-1597.



