Indo-HateSpeech
收藏DataCite Commons2025-05-01 更新2025-05-17 收录
下载链接:
https://data.mendeley.com/datasets/snc7mxpj6t
下载链接
链接失效反馈官方服务:
资源简介:
The Indo-HateSpeech dataset is a Hindi-English code-mixed dataset specifically designed for identifying hate speech in social media platforms. Given the multilingual nature of Indian social media users, code-mixing (the blending of two or more languages within a conversation) is prevalent, especially between Hindi and English.
印地语-英语混合仇恨言论数据集(Indo-HateSpeech dataset)是专为识别社交媒体平台仇恨言论而构建的语码混合数据集。鉴于印度社交媒体用户具有多语言使用的普遍特性,语码混合(即在一段对话中混合使用两种或多种语言)现象在当地十分盛行,尤以印地语与英语之间的语码混合为典型。
提供机构:
Mendeley Data
创建时间:
2024-12-02



