ViHSD
Source: arXiv | Updated: 2021-07-20 | Indexed: 2024-08-06
Download link:
http://arxiv.org/abs/2103.11528v4
Description:
ViHSD is a large-scale dataset created by Ho Chi Minh City University of Information Technology for detecting hate speech on Vietnamese social media. It contains more than 33,400 comments, each annotated with exactly one of three labels: CLEAN, OFFENSIVE, or HATE. The comments were collected from Vietnamese Facebook pages and YouTube videos and labeled through a strict annotation process. The dataset is intended to support machine learning and text classification approaches to hate speech on social media.
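The three-label scheme can be handled as in the minimal sketch below, which maps label names to integer ids and computes the label distribution. The integer encoding, the function name `label_distribution`, and the toy labels are assumptions made for illustration; the description above specifies only the three label names, not a file format or column layout.

```python
from collections import Counter

# The three labels named in the description. The integer ids are an
# assumed encoding for illustration, not confirmed by this page.
LABELS = ("CLEAN", "OFFENSIVE", "HATE")
LABEL_TO_ID = {name: i for i, name in enumerate(LABELS)}


def label_distribution(labels):
    """Return the share of each label in an iterable of label names."""
    counts = Counter(labels)
    unknown = set(counts) - set(LABELS)
    if unknown:
        raise ValueError(f"unexpected labels: {sorted(unknown)}")
    total = sum(counts.values())
    return {name: (counts[name] / total if total else 0.0) for name in LABELS}


# Hypothetical toy labels, not taken from the dataset.
toy_labels = ["CLEAN", "CLEAN", "HATE", "OFFENSIVE"]
distribution = label_distribution(toy_labels)
```

Checking the distribution before training is a cheap way to spot class imbalance, which often matters for hate speech classifiers.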
Provider:
Ho Chi Minh City University of Information Technology, Vietnam
Created:
2021-03-22



