five

GermEval 2021 Training Dataset

收藏
arXiv2025-09-30 收录
下载链接:
https://sites.google.com/view/germeval-2021/shared-task
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了经过标注的Facebook评论,这些评论根据其毒性、互动性和陈述事实的特点进行了分类。评论由经过培训的标注员进行标注,测试数据来源于不同主题的讨论。数据集的规模包括3,244条训练评论和944条测试评论,其任务是对评论进行二分类,将其归入毒性、吸引互动和陈述事实的类别中。

This dataset contains annotated Facebook comments, which are categorized based on three key characteristics: toxicity, interactivity, and fact-stating nature. All comments were annotated by trained human annotators, and the test data is sourced from discussions covering a wide range of topics. The dataset comprises 3,244 training comments and 944 test comments. The primary task associated with this dataset is to perform binary classification on the comments, assigning them to the categories of toxic, interaction-attracting, and fact-stating.
提供机构:
GermEval 2021 shared task organizers
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作