ImgEdit_recap_mask

Name: ImgEdit_recap_mask
Creator: maas
Published: 2025-12-05 16:37:23
License: 暂无描述

魔搭社区2025-12-05 更新2025-06-07 收录

下载链接：

https://modelscope.cn/datasets/AI-ModelScope/ImgEdit_recap_mask

下载链接

链接失效反馈

官方服务：

资源简介：

[ImgEdit: A Unified Image Editing Dataset and Benchmark](https://huggingface.co/papers/2505.20275) # 🌍 Introduction **ImgEdit** is a large-scale, high-quality image-editing dataset comprising 1.2 million carefully curated edit pairs, which contain both novel and complex single-turn edits, as well as challenging multi-turn tasks. To ensure the data quality, we employ a multi-stage pipeline that integrates a cutting-edge vision-language model, a detection model, a segmentation model, alongside task-specific in-painting procedures and strict post-processing. ImgEdit surpasses existing datasets in both task novelty and data quality. Using ImgEdit, we train **ImgEdit-E1**, an editing model using Vision Language Model to process the reference image and editing prompt, which outperforms existing open-source models on multiple tasks, highlighting the value of ImgEdit and model design. For comprehensive evaluation, we introduce **ImgEdit-Bench**, a benchmark designed to evaluate image editing performance in terms of instruction adherence, editing quality, and detail preservation. It includes a basic testsuite, a challenging single-turn suite, and a dedicated multi-turn suite. We evaluate both open-source and proprietary models, as well as ImgEdit-E1. # 📜 Citation If you find our paper and code useful in your research, please consider giving a star ⭐ and citation 📝. ```bibtex @article{ye2025imgedit, title={ImgEdit: A Unified Image Editing Dataset and Benchmark}, author={Ye, Yang and He, Xianyi and Li, Zongjian and Lin, Bin and Yuan, Shenghai and Yan, Zhiyuan and Hou, Bohan and Yuan, Li}, journal={arXiv preprint arXiv:2505.20275}, year={2025} } ```

[ImgEdit：统一图像编辑数据集与基准测试集](https://huggingface.co/papers/2505.20275) # 🌍 简介 **ImgEdit** 是一款大规模、高质量的图像编辑数据集，包含120万条经过精心筛选的编辑样本对，涵盖新颖且复杂的单轮编辑任务，以及具有挑战性的多轮编辑任务。为保障数据质量，我们采用多阶段流水线架构，整合前沿视觉语言模型（Vision-Language Model）、检测模型、分割模型，结合针对特定任务的图像修复流程与严格的后处理步骤。ImgEdit在任务新颖性与数据质量两个维度上均优于现有公开数据集。依托ImgEdit数据集，我们训练得到**ImgEdit-E1**——一款基于视觉语言模型的图像编辑模型，可处理参考图像与编辑指令，其在多项任务上的表现优于现有开源模型，充分彰显了ImgEdit数据集的价值与模型设计的合理性。为实现全面的性能评估，我们推出**ImgEdit-Bench**，这一基准测试集旨在从指令遵循度、编辑质量与细节保留度三个核心维度评估图像编辑模型的性能。该测试集包含基础测试套件、高难度单轮测试套件，以及专门的多轮测试套件。我们对开源模型、闭源模型以及ImgEdit-E1均开展了评测。 # 📜 引用说明若您的研究中用到了本文与代码，欢迎点亮⭐Star并引用该论文📝。 bibtex @article{ye2025imgedit, title={ImgEdit: A Unified Image Editing Dataset and Benchmark}, author={Ye, Yang and He, Xianyi and Li, Zongjian and Lin, Bin and Yuan, Shenghai and Yan, Zhiyuan and Hou, Bohan and Yuan, Li}, journal={arXiv preprint arXiv:2505.20275}, year={2025} }

提供机构：

maas

创建时间：

2025-06-04

5,000+

优质数据集

54 个

任务类型

进入经典数据集