ARForensics

Name: ARForensics
Creator: maas
Published: 2025-12-29 15:48:22
License: 暂无描述

魔搭社区2025-12-29 更新2025-10-11 收录

下载链接：

https://modelscope.cn/datasets/YanranZhang/ARForensics

下载链接

链接失效反馈

官方服务：

资源简介：

# [ICCV 2025] D³QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection <div align='center' style='margin-bottom:20px'> <a href='http://arxiv.org/abs/2510.05891'><img src='https://img.shields.io/badge/ArXiv-red?logo=arxiv'></a>   <a href='https://ivg-yanranzhang.github.io/D3QE/'><img src='https://img.shields.io/badge/Visualization-green?logo=github'></a>   <a href="https://github.com/Zhangyr2022/D3QE"><img src="https://img.shields.io/badge/Code-9E95B7?logo=github"></a> </div> `ARForensics` dataset from [ICCV 2025] D³QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection <div align=center> <img src='dataset.png'> </div> ## Introduction The `ARForensics` dataset is the first benchmark for **visual autoregressive model detection**. It comprises 304,000 images (152,000 real from ImageNet, 152,000 synthetic). The dataset features 7 diverse autoregressive models: LlamaGen<small>[![Star](https://img.shields.io/github/stars/FoundationVision/LlamaGen.svg?style=social&label=Star)](https://github.com/FoundationVision/LlamaGen)</small>, VAR<small>[![Star](https://img.shields.io/github/stars/FoundationVision/VAR.svg?style=social&label=Star)](https://github.com/FoundationVision/VAR)</small>, Infinity<small>[![Star](https://img.shields.io/github/stars/FoundationVision/Infinity.svg?style=social&label=Star)](https://github.com/FoundationVision/Infinity)</small>, Janus-Pro<small>[![Star](https://img.shields.io/github/stars/FoundationVision/Infinity.svg?style=social&label=Star)](https://github.com/deepseek-ai/Janus)</small>, RAR<small>[![Star](https://img.shields.io/github/stars/bytedance/1d-tokenizer.svg?style=social&label=Star)](https://github.com/bytedance/1d-tokenizer/tree/main)</small>, Switti<small>[![Star](https://img.shields.io/github/stars/yandex-research/switti.svg?style=social&label=Star)](https://github.com/yandex-research/switti)</small>, and Open-MAGVIT2<small>[![Star](https://img.shields.io/github/stars/TencentARC/SEED-Voken.svg?style=social&label=Star)](https://github.com/TencentARC/SEED-Voken)</small>, which encompasses both token-based and scale-based architectures. It is structured into training (100k LlamaGen), validation (10k), and a comprehensive test set (6k samples from each of the 7 models), ensuring a balanced and technically varied evaluation benchmark for AI-generated image detection. ## Usage After downloading all the files in this directory, you can run the following command to merge them into a single zip file: ```bash cat ARForensics_part.* > ARForensics_part.zip ``` **Download:** The dataset `ARForensics` is uploaded and available at: [🤗 HuggingFace](https://huggingface.co/datasets/Yanran21/ARForensics) | [🤖 ModelScope](https://www.modelscope.cn/datasets/YanranZhang/ARForensics). **Folder structure (expected):** ``` ARForensics/ ├─ train/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ val/ │ ├─ 0_real/ │ └─ 1_fake/ └─ test/ ├─ Infinity/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ Janus_Pro/ │ ├─ .. ├─ RAR/ ├─ Switti/ ├─ VAR/ ├─ LlamaGen/ └─ Open_MAGVIT2/ ``` ## Cite If you find this repository useful for your research, please consider citing this bibtex.

# [ICCV 2025] D³QE：面向自回归生成图像检测的离散分布差异感知量化误差学习 <div align='center' style='margin-bottom:20px'> <a href='http://arxiv.org/abs/2510.05891'><img src='https://img.shields.io/badge/ArXiv-red?logo=arxiv'></a>   <a href='https://ivg-yanranzhang.github.io/D3QE/'><img src='https://img.shields.io/badge/Visualization-green?logo=github'></a>   <a href='https://github.com/Zhangyr2022/D3QE'><img src='https://img.shields.io/badge/Code-9E95B7?logo=github'></a> </div> `ARForensics` 数据集源自[ICCV 2025]论文《D³QE：面向自回归生成图像检测的离散分布差异感知量化误差学习》 <div align=center> <img src='dataset.png'> </div> ## 引言 `ARForensics` 数据集是首个面向**视觉自回归模型检测**的基准数据集。该数据集包含304,000张图像（其中152,000张来自ImageNet的真实图像，152,000张为合成图像）。数据集涵盖7种不同的自回归模型：LlamaGen<small>[![Star](https://img.shields.io/github/stars/FoundationVision/LlamaGen.svg?style=social&label=Star)](https://github.com/FoundationVision/LlamaGen)</small>, VAR<small>[![Star](https://img.shields.io/github/stars/FoundationVision/VAR.svg?style=social&label=Star)](https://github.com/FoundationVision/VAR)</small>, Infinity<small>[![Star](https://img.shields.io/github/stars/FoundationVision/Infinity.svg?style=social&label=Star)](https://github.com/FoundationVision/Infinity)</small>, Janus-Pro<small>[![Star](https://img.shields.io/github/stars/deepseek-ai/Janus.svg?style=social&label=Star)](https://github.com/deepseek-ai/Janus)</small>, RAR<small>[![Star](https://img.shields.io/github/stars/bytedance/1d-tokenizer.svg?style=social&label=Star)](https://github.com/bytedance/1d-tokenizer/tree/main)</small>, Switti<small>[![Star](https://img.shields.io/github/stars/yandex-research/switti.svg?style=social&label=Star)](https://github.com/yandex-research/switti)</small>以及Open-MAGVIT2<small>[![Star](https://img.shields.io/github/stars/TencentARC/SEED-Voken.svg?style=social&label=Star)](https://github.com/TencentARC/SEED-Voken)</small>，覆盖了基于Token（Token）和基于尺度的两类架构。数据集划分为训练集（10万张LlamaGen生成样本）、验证集（1万张样本）以及全面的测试集（7种模型各6000张样本），可为AI生成图像检测任务提供均衡且技术多样的评估基准。 ## 使用方法下载该目录下的所有文件后，可执行以下命令将其合并为单个压缩包： bash cat ARForensics_part.* > ARForensics_part.zip **下载链接**：`ARForensics` 数据集已上传并可通过以下渠道获取：[🤗 HuggingFace](https://huggingface.co/datasets/Yanran21/ARForensics) | [🤖 ModelScope](https://www.modelscope.cn/datasets/YanranZhang/ARForensics)。 **预期文件夹结构**： ARForensics/ ├─ train/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ val/ │ ├─ 0_real/ │ └─ 1_fake/ └─ test/ ├─ Infinity/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ Janus_Pro/ │ ├─ .. ├─ RAR/ ├─ Switti/ ├─ VAR/ ├─ LlamaGen/ └─ Open_MAGVIT2/ ## 引用若本仓库对你的研究有所帮助，请考虑引用以下BibTeX条目。

提供机构：

maas

创建时间：

2025-10-03

5,000+

优质数据集

54 个

任务类型

进入经典数据集