
BengaliSent140 - A Bengali Hate Speech Fusion Dataset

Source: IEEE, indexed 2026-04-17
Download link:
https://ieee-dataport.org/documents/bengalisent140-bengali-hate-speech-fusion-dataset
Description:
Hate speech is a serious problem in online communication. This dataset targets hate speech in Bengali: each sample is labeled as hateful or not hateful. Many methods exist for analyzing online text, but most focus on languages such as English and leave Bengali out. Yet hate speech in Bengali is serious and common, especially on platforms such as Facebook and YouTube, and even comments on TV shows sometimes contain content that is not suitable for everyone. Detecting and curbing Bengali hate speech is difficult because good tools for it do not yet exist, which is why more research is needed in this area. One major obstacle was the scarcity of Bengali hate speech datasets before this one. To address it, we built a dataset of around 140,000 samples, of which about 68,000 are hateful and about 71,000 are not hateful, making it one of the largest Bengali hate speech datasets available online. We constructed it by combining several existing datasets and remapping their labels to a binary scheme that indicates whether a sample contains hate or not. More data of this kind helps researchers and machine learning models find better ways to detect and stop hate speech online, an important step toward a safer and kinder internet for everyone.
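The fusion step described above (combine several datasets, then remap their labels to hate / not hate) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' actual pipeline: the source label names in `HATE_LABELS`, the `(text, label)` pair layout, and the removal of exact duplicate texts are all hypothetical, since the listing does not name the source datasets or their label schemes.

```python
# Hypothetical set of source labels treated as "hate"; the real source
# datasets and their label names are not given in the listing.
HATE_LABELS = {"hate", "offensive", "abusive"}


def to_binary(label: str) -> int:
    """Map a source-specific label to 1 (hate) or 0 (not hate)."""
    return 1 if label.strip().lower() in HATE_LABELS else 0


def fuse(*sources):
    """Concatenate (text, label) pairs from several sources and relabel them.

    Dropping exact duplicate texts is an assumption made for this sketch;
    the first occurrence of a text wins.
    """
    seen = set()
    fused = []
    for source in sources:
        for text, label in source:
            key = text.strip()
            if not key or key in seen:
                continue
            seen.add(key)
            fused.append((key, to_binary(label)))
    return fused


# Toy placeholder rows, not real samples from the dataset.
source_a = [("sample text 1", "hate"), ("sample text 2", "neutral")]
source_b = [("sample text 3", "Abusive"), ("sample text 1", "hate")]
fused = fuse(source_a, source_b)
```

Which source labels count as hate is the key design decision in such a merge; here it is reduced to a single set so the mapping is easy to audit and adjust.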
Provided by:
Islam, Akif; Kumar Roy, Sujan