ImgEdit_recap_mask
收藏魔搭社区2025-12-05 更新2025-06-07 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/ImgEdit_recap_mask
下载链接
链接失效反馈官方服务:
资源简介:
[ImgEdit: A Unified Image Editing Dataset and Benchmark](https://huggingface.co/papers/2505.20275)
# 🌍 Introduction
**ImgEdit** is a large-scale, high-quality image-editing dataset comprising 1.2 million carefully curated edit pairs, which contain both novel and complex single-turn edits, as well as challenging multi-turn tasks.
To ensure the data quality, we employ a multi-stage pipeline that integrates a cutting-edge vision-language model, a detection model, a segmentation model, alongside task-specific in-painting procedures and strict post-processing. ImgEdit surpasses existing datasets in both task novelty and data quality.
Using ImgEdit, we train **ImgEdit-E1**, an editing model using Vision Language Model to process the reference image and editing prompt, which outperforms existing open-source models on multiple tasks, highlighting the value of ImgEdit and model design.
For comprehensive evaluation, we introduce **ImgEdit-Bench**, a benchmark designed to evaluate image editing performance in terms of instruction adherence, editing quality, and detail preservation.
It includes a basic testsuite, a challenging single-turn suite, and a dedicated multi-turn suite.
We evaluate both open-source and proprietary models, as well as ImgEdit-E1.
# 📜 Citation
If you find our paper and code useful in your research, please consider giving a star ⭐ and citation 📝.
```bibtex
@article{ye2025imgedit,
title={ImgEdit: A Unified Image Editing Dataset and Benchmark},
author={Ye, Yang and He, Xianyi and Li, Zongjian and Lin, Bin and Yuan, Shenghai and Yan, Zhiyuan and Hou, Bohan and Yuan, Li},
journal={arXiv preprint arXiv:2505.20275},
year={2025}
}
```
[ImgEdit:统一图像编辑数据集与基准测试集](https://huggingface.co/papers/2505.20275)
# 🌍 简介
**ImgEdit** 是一款大规模、高质量的图像编辑数据集,包含120万条经过精心筛选的编辑样本对,涵盖新颖且复杂的单轮编辑任务,以及具有挑战性的多轮编辑任务。
为保障数据质量,我们采用多阶段流水线架构,整合前沿视觉语言模型(Vision-Language Model)、检测模型、分割模型,结合针对特定任务的图像修复流程与严格的后处理步骤。ImgEdit在任务新颖性与数据质量两个维度上均优于现有公开数据集。
依托ImgEdit数据集,我们训练得到**ImgEdit-E1**——一款基于视觉语言模型的图像编辑模型,可处理参考图像与编辑指令,其在多项任务上的表现优于现有开源模型,充分彰显了ImgEdit数据集的价值与模型设计的合理性。
为实现全面的性能评估,我们推出**ImgEdit-Bench**,这一基准测试集旨在从指令遵循度、编辑质量与细节保留度三个核心维度评估图像编辑模型的性能。该测试集包含基础测试套件、高难度单轮测试套件,以及专门的多轮测试套件。我们对开源模型、闭源模型以及ImgEdit-E1均开展了评测。
# 📜 引用说明
若您的研究中用到了本文与代码,欢迎点亮⭐Star并引用该论文📝。
bibtex
@article{ye2025imgedit,
title={ImgEdit: A Unified Image Editing Dataset and Benchmark},
author={Ye, Yang and He, Xianyi and Li, Zongjian and Lin, Bin and Yuan, Shenghai and Yan, Zhiyuan and Hou, Bohan and Yuan, Li},
journal={arXiv preprint arXiv:2505.20275},
year={2025}
}
提供机构:
maas
创建时间:
2025-06-04



