hateday
Source: huggingface.co (indexed 2025-03-23)
Download link:
https://huggingface.co/datasets/manueltonneau/hateday
Description:
HateDay
This dataset consists of twelve representative sets of tweets annotated for hate speech detection, covering eight languages and four countries.
Each representative set corresponds to a language or country and consists of 20,000 tweets randomly sampled from all tweets posted on September 21, 2022 in that language or country, for a total of 240K annotated tweets.
We cover eight languages (Arabic, English, French, German, Indonesian, Portuguese, Spanish and Turkish) and four… See the full description on the dataset page: https://huggingface.co/datasets/manueltonneau/hateday.
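The sampling design described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the function name `draw_representative_set`, the fixed seed, and the toy pool of integer tweet IDs are all hypothetical, and the four countries are left unnamed because the description elides them.

```python
import random

SET_SIZE = 20_000  # tweets per representative set, as stated in the description
LANGUAGES = [
    "Arabic", "English", "French", "German",
    "Indonesian", "Portuguese", "Spanish", "Turkish",
]
N_COUNTRIES = 4  # the country names are elided in the description


def draw_representative_set(tweet_ids, size=SET_SIZE, seed=0):
    """Hypothetical helper: uniform random sample without replacement."""
    rng = random.Random(seed)
    return rng.sample(tweet_ids, size)


# Twelve sets of 20,000 tweets each give the stated total of 240K.
n_sets = len(LANGUAGES) + N_COUNTRIES
total = n_sets * SET_SIZE

# Toy stand-in for "all tweets posted on one day in one language or country".
pool = list(range(100_000))
sample = draw_representative_set(pool)
```

Sampling without replacement guarantees that no tweet appears twice within a set, which is the usual reading of "randomly sampled from all tweets".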
Provider:
huggingface.co



