C4Censor: A Lightweight Benchmark Dataset for Inappropriate Content Detection

Name: C4Censor: A Lightweight Benchmark Dataset for Inappropriate Content Detection
Creator: figshare
Published: 2025-12-24 11:13:36
License: 暂无描述

DataCite Commons2025-12-24 更新2026-02-09 收录

下载链接：

https://figshare.com/articles/dataset/C4Censor_A_Lightweight_Benchmark_Dataset_for_Inappropriate_Content_Detection/30946031/1

下载链接

链接失效反馈

官方服务：

资源简介：

C4Censor is a multi-class image benchmark for fine-grained content moderation across four categories: Blood & Gore, Pornography, Terrorism, and Neutral. Each category contains three challenging subclasses (e.g., Hentai vs. Anime, Counter-Terrorism vs. War-Crimes), with 500 images per subclass totaling 6,000 manually annotated images.This dataset addresses limitations of existing binary classification benchmarks by providing a unified, balanced challenge for real-world moderation tasks. Benchmark evaluations with state-of-the-art models (ViT, Xception, CAiT) achieved maximum accuracy of 62.13%, highlighting the dataset's complexity."This dataset contains graphic, violent, and sexually explicit imagery collected for academic research purposes only. Users must agree to ethical usage terms before access."Manuscript: Submitted to Journal of Computational Social ScienceSupplementary Materials: Includes visual examples and misclassification instances from the dataset (not included in the manuscript for JCSO ethical compliance)

提供机构：

figshare

创建时间：

2025-12-24

5,000+

优质数据集

54 个

任务类型

进入经典数据集