five

C4Censor: A Lightweight Benchmark Dataset for Inappropriate Content Detection

收藏
DataCite Commons2025-12-24 更新2026-02-09 收录
下载链接:
https://figshare.com/articles/dataset/C4Censor_A_Lightweight_Benchmark_Dataset_for_Inappropriate_Content_Detection/30946031/1
下载链接
链接失效反馈
官方服务:
资源简介:
<b>C4Censor</b> is a multi-class image benchmark for fine-grained content moderation across four categories: Blood &amp; Gore, Pornography, Terrorism, and Neutral. Each category contains three challenging subclasses (e.g., Hentai vs. Anime, Counter-Terrorism vs. War-Crimes), with 500 images per subclass totaling 6,000 manually annotated images.This dataset addresses limitations of existing binary classification benchmarks by providing a unified, balanced challenge for real-world moderation tasks. Benchmark evaluations with state-of-the-art models (ViT, Xception, CAiT) achieved maximum accuracy of 62.13%, highlighting the dataset's complexity.<b><i>"This dataset contains graphic, violent, and sexually explicit imagery collected for academic research purposes only. Users must agree to ethical usage terms before access."</i></b><b>Manuscript</b>: Submitted to Journal of Computational Social Science<b>Supplementary Materials</b>: Includes visual examples and misclassification instances from the dataset (not included in the manuscript for JCSO ethical compliance)
提供机构:
figshare
创建时间:
2025-12-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作