HaSpeeDe 2018 dataset
收藏Mendeley Data2024-01-31 更新2024-06-30 收录
下载链接:
https://live.european-language-grid.eu/catalogue/corpus/7497
下载链接
链接失效反馈官方服务:
资源简介:
The HaSpeeDe dataset collects 8000 Facebook comments and tweets annotated for the presence of hate speech. The dataset has been used in the context of the HaSpeeDe task, organized as part of the EVALITA 2018 evaluation campaign (http://www.evalita.it/2018). In order to meet the GDPR requirements, texts have been pseudonymized replacing all original IDs in both datasets with newly-generated ones. Mentions, emails, person names (excluded public person names), and phone numbers have been masked with, respectively, the labels MENTION, EMAIL, PERSON, PHONE, followed by a number to distinguish between different entities of the same kind within the same text.
创建时间:
2024-01-31



