One Million Posts Corpus
Source: arXiv. Indexed: 2025-09-30
Download link:
https://ofai.github.io/million-post-corpus/
Description:
This dataset contains 1 million user comments from the Austrian newspaper *Der Standard* and is intended for evaluating content moderation models. The comments are accompanied by contextual information, which makes it possible to evaluate context-aware models. The associated task is binary classification for automated content moderation.
Provider:
Austrian newspaper Der Standard



