five

ARForensics

收藏
魔搭社区2025-12-29 更新2025-10-11 收录
下载链接:
https://modelscope.cn/datasets/YanranZhang/ARForensics
下载链接
链接失效反馈
官方服务:
资源简介:
# [ICCV 2025] D³QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection <div align='center' style='margin-bottom:20px'> <a href='http://arxiv.org/abs/2510.05891'><img src='https://img.shields.io/badge/ArXiv-red?logo=arxiv'></a> &nbsp; <a href='https://ivg-yanranzhang.github.io/D3QE/'><img src='https://img.shields.io/badge/Visualization-green?logo=github'></a> &nbsp; <a href="https://github.com/Zhangyr2022/D3QE"><img src="https://img.shields.io/badge/Code-9E95B7?logo=github"></a> </div> `ARForensics` dataset from [ICCV 2025] D³QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection <div align=center> <img src='dataset.png'> </div> ## Introduction The `ARForensics` dataset is the first benchmark for **visual autoregressive model detection**. It comprises 304,000 images (152,000 real from ImageNet, 152,000 synthetic). The dataset features 7 diverse autoregressive models: LlamaGen<small>[![Star](https://img.shields.io/github/stars/FoundationVision/LlamaGen.svg?style=social&label=Star)](https://github.com/FoundationVision/LlamaGen)</small>, VAR<small>[![Star](https://img.shields.io/github/stars/FoundationVision/VAR.svg?style=social&label=Star)](https://github.com/FoundationVision/VAR)</small>, Infinity<small>[![Star](https://img.shields.io/github/stars/FoundationVision/Infinity.svg?style=social&label=Star)](https://github.com/FoundationVision/Infinity)</small>, Janus-Pro<small>[![Star](https://img.shields.io/github/stars/FoundationVision/Infinity.svg?style=social&label=Star)](https://github.com/deepseek-ai/Janus)</small>, RAR<small>[![Star](https://img.shields.io/github/stars/bytedance/1d-tokenizer.svg?style=social&label=Star)](https://github.com/bytedance/1d-tokenizer/tree/main)</small>, Switti<small>[![Star](https://img.shields.io/github/stars/yandex-research/switti.svg?style=social&label=Star)](https://github.com/yandex-research/switti)</small>, and Open-MAGVIT2<small>[![Star](https://img.shields.io/github/stars/TencentARC/SEED-Voken.svg?style=social&label=Star)](https://github.com/TencentARC/SEED-Voken)</small>, which encompasses both token-based and scale-based architectures. It is structured into training (100k LlamaGen), validation (10k), and a comprehensive test set (6k samples from each of the 7 models), ensuring a balanced and technically varied evaluation benchmark for AI-generated image detection. ## Usage After downloading all the files in this directory, you can run the following command to merge them into a single zip file: ```bash cat ARForensics_part.* > ARForensics_part.zip ``` **Download:** The dataset `ARForensics` is uploaded and available at: [🤗 HuggingFace](https://huggingface.co/datasets/Yanran21/ARForensics) | [🤖 ModelScope](https://www.modelscope.cn/datasets/YanranZhang/ARForensics). **Folder structure (expected):** ``` ARForensics/ ├─ train/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ val/ │ ├─ 0_real/ │ └─ 1_fake/ └─ test/ ├─ Infinity/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ Janus_Pro/ │ ├─ .. ├─ RAR/ ├─ Switti/ ├─ VAR/ ├─ LlamaGen/ └─ Open_MAGVIT2/ ``` ## Cite If you find this repository useful for your research, please consider citing this bibtex.

# [ICCV 2025] D³QE:面向自回归生成图像检测的离散分布差异感知量化误差学习 <div align='center' style='margin-bottom:20px'> <a href='http://arxiv.org/abs/2510.05891'><img src='https://img.shields.io/badge/ArXiv-red?logo=arxiv'></a> &nbsp; <a href='https://ivg-yanranzhang.github.io/D3QE/'><img src='https://img.shields.io/badge/Visualization-green?logo=github'></a> &nbsp; <a href='https://github.com/Zhangyr2022/D3QE'><img src='https://img.shields.io/badge/Code-9E95B7?logo=github'></a> </div> `ARForensics` 数据集源自[ICCV 2025]论文《D³QE:面向自回归生成图像检测的离散分布差异感知量化误差学习》 <div align=center> <img src='dataset.png'> </div> ## 引言 `ARForensics` 数据集是首个面向**视觉自回归模型检测**的基准数据集。该数据集包含304,000张图像(其中152,000张来自ImageNet的真实图像,152,000张为合成图像)。数据集涵盖7种不同的自回归模型:LlamaGen<small>[![Star](https://img.shields.io/github/stars/FoundationVision/LlamaGen.svg?style=social&label=Star)](https://github.com/FoundationVision/LlamaGen)</small>, VAR<small>[![Star](https://img.shields.io/github/stars/FoundationVision/VAR.svg?style=social&label=Star)](https://github.com/FoundationVision/VAR)</small>, Infinity<small>[![Star](https://img.shields.io/github/stars/FoundationVision/Infinity.svg?style=social&label=Star)](https://github.com/FoundationVision/Infinity)</small>, Janus-Pro<small>[![Star](https://img.shields.io/github/stars/deepseek-ai/Janus.svg?style=social&label=Star)](https://github.com/deepseek-ai/Janus)</small>, RAR<small>[![Star](https://img.shields.io/github/stars/bytedance/1d-tokenizer.svg?style=social&label=Star)](https://github.com/bytedance/1d-tokenizer/tree/main)</small>, Switti<small>[![Star](https://img.shields.io/github/stars/yandex-research/switti.svg?style=social&label=Star)](https://github.com/yandex-research/switti)</small>以及Open-MAGVIT2<small>[![Star](https://img.shields.io/github/stars/TencentARC/SEED-Voken.svg?style=social&label=Star)](https://github.com/TencentARC/SEED-Voken)</small>,覆盖了基于Token(Token)和基于尺度的两类架构。数据集划分为训练集(10万张LlamaGen生成样本)、验证集(1万张样本)以及全面的测试集(7种模型各6000张样本),可为AI生成图像检测任务提供均衡且技术多样的评估基准。 ## 使用方法 下载该目录下的所有文件后,可执行以下命令将其合并为单个压缩包: bash cat ARForensics_part.* > ARForensics_part.zip **下载链接**:`ARForensics` 数据集已上传并可通过以下渠道获取:[🤗 HuggingFace](https://huggingface.co/datasets/Yanran21/ARForensics) | [🤖 ModelScope](https://www.modelscope.cn/datasets/YanranZhang/ARForensics)。 **预期文件夹结构**: ARForensics/ ├─ train/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ val/ │ ├─ 0_real/ │ └─ 1_fake/ └─ test/ ├─ Infinity/ │ ├─ 0_real/ │ └─ 1_fake/ ├─ Janus_Pro/ │ ├─ .. ├─ RAR/ ├─ Switti/ ├─ VAR/ ├─ LlamaGen/ └─ Open_MAGVIT2/ ## 引用 若本仓库对你的研究有所帮助,请考虑引用以下BibTeX条目。
提供机构:
maas
创建时间:
2025-10-03
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作