BengaliSent140 - A Bengali Hate Speech Fusion Dataset
收藏IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/bengalisent140-bengali-hate-speech-fusion-dataset
下载链接
链接失效反馈官方服务:
资源简介:
In today's world of online communication, hate speech is a big problem. This dataset focuses on hate speech in Bengali, analyzing speeches to see if they contain hate or not. While there are many ways to analyze text online, most of them focus on languages like English, leaving out Bengali. But hate speech in Bengali is serious and common, especially on platforms like Facebook and YouTube. Sometimes, even TV shows have comments that are not nice for everyone to see. Finding and stopping hate speech in Bengali is hard because there aren't good tools for it yet. That's why we need more research in this area. One big problem is that there weren't many Bengali hate speech datasets available before ours. So, we made one with around 140,000 speeches, including 68,000 hateful ones and 71,000 that are not hateful. This dataset is one of the biggest for Bengali hate speech online. We made this dataset by combining different datasets and changing their labels to show if they contain hate or not. Having more data like this helps researchers and computers learn better ways to find and stop hate speech online. It's an important step in making the internet a safer and kinder place for everyone.
提供机构:
Islam, Akif; Kumar Roy, Sujan



