five

Bengali Hate Speech Dataset

收藏
arXiv2020-12-17 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2012.09686v1
下载链接
链接失效反馈
官方服务:
资源简介:
本研究介绍了名为‘Bengali Hate Speech Dataset’的数据集,由Shahjalal University of Science and Technology创建。该数据集包含30,000条来自YouTube和Facebook的Bengali语评论,其中10,000条标记为仇恨言论。数据集涵盖7个类别,通过众包方式标记并由专家验证。此数据集旨在支持Bengali语社交媒体中的仇恨言论检测研究,解决现有资源不足的问题。

This study introduces the dataset named 'Bengali Hate Speech Dataset', which was created by Shahjalal University of Science and Technology. The dataset contains 30,000 Bengali comments sourced from YouTube and Facebook, among which 10,000 are annotated as hate speech. It covers 7 categories, with annotations completed via crowdsourcing and validated by experts. This dataset is designed to support research on hate speech detection in Bengali social media, addressing the shortage of existing relevant resources.
提供机构:
Shahjalal University of Science and Technology
创建时间:
2020-12-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作