five

Multilingual Detection of Cyberbullying in Mixed Urdu, Roman Urdu, and English Social Media Conversations

收藏
DataCite Commons2024-02-16 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/multilingual-detection-cyberbullying-mixed-urdu-roman-urdu-and-english-social-media
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset crafted for this study is intentionally designed to encapsulate instances of cyberbullying across three distinct languages: Urdu, Roman Urdu, and English. This strategic selection aims to mirror the linguistic variations that are prevalent in social media dialogues among Urdu-speaking communities globally. Further, it undergoes meticulous annotation to encapsulate the diverse linguistic nuances characteristic of these languages. This process includes integrating critical aspects of cyberbullying, such as aggression, repetition, and intent to harm. Such a comprehensive approach is pivotal in ensuring that the dataset not only captures the complex dynamics of cyberbullying but also addresses it in a multilingual context with the depth and breadth required for effective analysis and detection.
提供机构:
IEEE DataPort
创建时间:
2024-02-16
二维码
社区交流群
二维码
科研交流群
商业服务