five

Hate-Speech Spanish Lexicons

收藏
SSH Open MarketPlace2026-03-25 更新2026-03-28 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/JlSEyc
下载链接
链接失效反馈
官方服务:
资源简介:
This repository provides curated lexical resources for hate speech detection in Spanish, developed by researchers at the Universidad de Jaén. It includes four domain-specific lexicons compiled to support natural language processing (NLP) tasks targeting harmful content in Spanish-language social media, particularly Twitter. The collection comprises four files: Xenophobia lexicon — 44 hateful terms directed at immigrants Immigrant lexicon — 250 words referring to immigrant nationalities Misogyny lexicon — 183 terms expressing hatred toward women Insults lexicon — 279 general-purpose offensive terms These resources were created in the context of the paper "Detecting Misogyny and Xenophobia in Spanish Tweets Using Language Technologies" (Plaza-Del-Arco et al., 2020, ACM Transactions on Internet Technology), and are intended as feature extraction tools for machine learning classifiers or rule-based hate speech detection systems.
创建时间:
2026-03-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作