Julien0123/Ivy-Fake
收藏Hugging Face2026-03-27 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Julien0123/Ivy-Fake
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
language:
- en
tags:
- AIGC
size_categories:
- 100K<n<1M
---
# IVY-FAKE: Unified Explainable Benchmark and Detector for AIGC Content
[](https://openreview.net/attachment?id=RIBj1KPAWM&name=pdf)
[](https://huggingface.co/datasets/AI-Safeguard/Ivy-Fake)
[](https://github.com/Pi3AI/IvyFake) [](http://creativecommons.org/licenses/by-sa/4.0/)

This repository provides the official implementation of **IVY-FAKE** and **IVY-xDETECTOR**, a unified explainable framework and benchmark for detecting AI-generated content (AIGC) across **both images and videos**.
---
## 🔍 Overview
**IVY-FAKE** is the **first large-scale dataset** designed for **multimodal explainable AIGC detection**. It contains:
- **150K+** training samples (images + videos)
- **18.7K** evaluation samples
- **Fine-grained annotations** including:
- Spatial and temporal artifact analysis
- Natural language reasoning (<think>...</think>)
- Binary labels with explanations (<conclusion>real/fake</conclusion>)
**IVY-xDETECTOR** is a vision-language detection model trained to:
- Identify synthetic artifacts in images and videos
- Generate **step-by-step reasoning**
- Achieve **SOTA performance** across multiple benchmarks
---
## 📦 Evaluation
```bash
conda create -n ivy-detect python=3.10
conda activate ivy-detect
# Install dependencies
pip install -r requirements.txt
```
---
🚀 Evaluation Script
We provide an evaluation script to test large language model (LLM) performance on reasoning-based AIGC detection.
🔑 Environment Variables
Before running, export the following environment variables:
```bash
export OPENAI_API_KEY="your-api-key"
export OPENAI_BASE_URL="https://api.openai.com/v1" # or OpenAI's default base URL
```
▶️ Run Evaluation
```bash
python eva_scripts.py \
--eva_model_name gpt-4o-mini \
--res_json_path ./error_item.json
```
This script compares model predictions (<conclusion>real/fake</conclusion>) to the ground truth and logs mismatches to error_item.json.
---
🧪 Input Format
The evaluation script `res_json_path` accepts a JSON array (Dict in List) where each item has:
```json
{
"rel_path": "relative/path/to/file.mp4",
"label": "real or fake",
"raw_ground_truth": "<think>...</think><conclusion>fake</conclusion>",
"infer_result": "<think>...</think><conclusion>real</conclusion>"
}
```
- label: ground truth
- raw_ground_truth: reasoning by gemini2.5 pro
- infer_result: model reasoning and prediction
Example file: `./evaluate_scripts/error_item.json`
---
提供机构:
Julien0123



