five

CONAN-SP

收藏
SSH Open MarketPlace2026-03-26 更新2026-03-28 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/8mY6ti
下载链接
链接失效反馈
官方服务:
资源简介:
CONAN-SP is a dataset for automatic counter-narrative generation in Spanish, developed by researchers at the Universidad de Jaén (SINAI research group). It provides pairs of hate speech comments (HS) and their corresponding counter-narratives (CN), covering five hate targets: islamophobia, misogyny, antisemitism, racism, and homophobia.The dataset is built upon CONAN-KN (Chung et al., 2021), an English HS-CN dataset of 195 pairs. The construction pipeline involved automatic translation into Spanish via DeepL, followed by counter-narrative generation using GPT-3.5 under three different prompting strategies — a general prompt with five examples, five target-specific prompts, and a general prompt without task definition. After removing duplicates and annotation agreement examples, the final dataset contains 238 HS-CN pairs distributed across three experiments (84, 70, and 84 instances respectively). All pairs were labelled by human annotators according to three quality metrics: Offensiveness, Stance, and Informativeness.The dataset is distributed as Excel files, one per experiment, and is intended for use in training and evaluating NLP systems focused on counter-speech, hate speech mitigation, and generative language modelling in Spanish.
创建时间:
2026-03-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作