Bengali Hate Speech Dataset
Source: arXiv. Updated: 2020-12-17. Indexed: 2024-08-06.
Download link:
http://arxiv.org/abs/2012.09686v1
Resource description:
This study introduces the Bengali Hate Speech Dataset, created at Shahjalal University of Science and Technology. The dataset contains 30,000 Bengali-language comments collected from YouTube and Facebook, of which 10,000 are labeled as hate speech. The comments span 7 categories; labels were assigned through crowdsourcing and then verified by experts. The dataset is intended to support research on hate speech detection in Bengali social media, where existing resources are scarce.
Provider:
Shahjalal University of Science and Technology
Created:
2020-12-17



