five

ImgEdit_recap_mask

收藏
魔搭社区2025-12-05 更新2025-06-07 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/ImgEdit_recap_mask
下载链接
链接失效反馈
官方服务:
资源简介:
[ImgEdit: A Unified Image Editing Dataset and Benchmark](https://huggingface.co/papers/2505.20275) # 🌍 Introduction **ImgEdit** is a large-scale, high-quality image-editing dataset comprising 1.2 million carefully curated edit pairs, which contain both novel and complex single-turn edits, as well as challenging multi-turn tasks. To ensure the data quality, we employ a multi-stage pipeline that integrates a cutting-edge vision-language model, a detection model, a segmentation model, alongside task-specific in-painting procedures and strict post-processing. ImgEdit surpasses existing datasets in both task novelty and data quality. Using ImgEdit, we train **ImgEdit-E1**, an editing model using Vision Language Model to process the reference image and editing prompt, which outperforms existing open-source models on multiple tasks, highlighting the value of ImgEdit and model design. For comprehensive evaluation, we introduce **ImgEdit-Bench**, a benchmark designed to evaluate image editing performance in terms of instruction adherence, editing quality, and detail preservation. It includes a basic testsuite, a challenging single-turn suite, and a dedicated multi-turn suite. We evaluate both open-source and proprietary models, as well as ImgEdit-E1. # 📜 Citation If you find our paper and code useful in your research, please consider giving a star ⭐ and citation 📝. ```bibtex @article{ye2025imgedit, title={ImgEdit: A Unified Image Editing Dataset and Benchmark}, author={Ye, Yang and He, Xianyi and Li, Zongjian and Lin, Bin and Yuan, Shenghai and Yan, Zhiyuan and Hou, Bohan and Yuan, Li}, journal={arXiv preprint arXiv:2505.20275}, year={2025} } ```

[ImgEdit:统一图像编辑数据集与基准测试集](https://huggingface.co/papers/2505.20275) # 🌍 简介 **ImgEdit** 是一款大规模、高质量的图像编辑数据集,包含120万条经过精心筛选的编辑样本对,涵盖新颖且复杂的单轮编辑任务,以及具有挑战性的多轮编辑任务。 为保障数据质量,我们采用多阶段流水线架构,整合前沿视觉语言模型(Vision-Language Model)、检测模型、分割模型,结合针对特定任务的图像修复流程与严格的后处理步骤。ImgEdit在任务新颖性与数据质量两个维度上均优于现有公开数据集。 依托ImgEdit数据集,我们训练得到**ImgEdit-E1**——一款基于视觉语言模型的图像编辑模型,可处理参考图像与编辑指令,其在多项任务上的表现优于现有开源模型,充分彰显了ImgEdit数据集的价值与模型设计的合理性。 为实现全面的性能评估,我们推出**ImgEdit-Bench**,这一基准测试集旨在从指令遵循度、编辑质量与细节保留度三个核心维度评估图像编辑模型的性能。该测试集包含基础测试套件、高难度单轮测试套件,以及专门的多轮测试套件。我们对开源模型、闭源模型以及ImgEdit-E1均开展了评测。 # 📜 引用说明 若您的研究中用到了本文与代码,欢迎点亮⭐Star并引用该论文📝。 bibtex @article{ye2025imgedit, title={ImgEdit: A Unified Image Editing Dataset and Benchmark}, author={Ye, Yang and He, Xianyi and Li, Zongjian and Lin, Bin and Yuan, Shenghai and Yan, Zhiyuan and Hou, Bohan and Yuan, Li}, journal={arXiv preprint arXiv:2505.20275}, year={2025} }
提供机构:
maas
创建时间:
2025-06-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作