MetaHate
收藏arXiv2024-01-12 更新2024-06-21 收录
下载链接:
https://irlab.org/metahate.html
下载链接
链接失效反馈官方服务:
资源简介:
MetaHate是由IRLab, CITIC研究中心在科鲁尼亚大学创建的一个大型数据集,旨在统一仇恨言论检测的研究努力。该数据集整合了超过60个相关数据集,最终包含1,226,202条来自社交媒体的非重复评论。数据集主要关注英语内容,通过严格的筛选标准,确保数据集的质量和相关性。MetaHate的应用领域广泛,主要用于训练和测试仇恨言论检测模型,以应对数字领域中动态和复杂的仇恨言论问题。
MetaHate is a large-scale dataset created by IRLab, CITIC Research Center at the University of A Coruña, aiming to unify research efforts in hate speech detection. This dataset integrates over 60 relevant datasets, ultimately containing 1,226,202 non-duplicate comments sourced from social media. It primarily focuses on English-language content, and adopts strict filtering criteria to ensure the quality and relevance of the dataset. MetaHate has a wide range of application scenarios, and is mainly used for training and testing hate speech detection models to address the dynamic and complex hate speech issues in the digital domain.
提供机构:
IRLab, CITIC研究中心, 科鲁尼亚大学
创建时间:
2024-01-12



