five

Abhishek-03113/MMArt-PPR10k

收藏
Hugging Face2026-04-16 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Abhishek-03113/MMArt-PPR10k
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - image-text-to-text - image-to-image - image-to-text - text-to-image language: - en tags: - Lightroom - cot - lua - xmp size_categories: - 1K<n<10K --- ### MMArt-PPR10k Dataset The `MMArt-PPR10k` is a multimodal dataset specifically created for research into the instruction-driven agentic image retouching task. It is built upon the original PPR10k dataset and offers rich, paired image data, user instructions, and information on the Lua/XMP tools used in Lightroom. ### Dataset Structure The dataset is organized into a hierarchical folder structure. Each data sample corresponds to a specific image pair and its related files, located within a unique directory. Here's a breakdown of the key components: * **Unique Sample Folders**: Inside the language folder, each subdirectory (e.g., `1000_1`) represents a single data sample. * **User Instructions**: Within each sample folder, you will find subdirectories for user instructions of varying lengths: * `user_want_long` * `user_want_middle` * `user_want_short` * **Image and Configuration Files**: Each sample folder contains the following core files: * `before.jpg`: The original, unedited image. * `processed.jpg`: The edited image, manipulated based on the user instructions. * `config.lua`: The Lua configuration file used in Lightroom. * `config.xmp`: The Xmp file, which stores metadata and editing presets for Lightroom. ### Citation If you find MMArt useful in your research, please consider citing: ```bash @article{jarvisart2025, title={JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent}, author={Yunlong Lin and Zixu Lin and Kunjie Lin and Jinbin Bai and Panwang Pan and Chenxin Li and Haoyu Chen and Zhongdao Wang and Xinghao Ding and Wenbo Li and Shuicheng Yan}, year={2025}, journal={arXiv preprint arXiv:2506.17612} } ```

许可证:Apache-2.0 任务类别: - 图像-文本到文本(image-text-to-text) - 图像到图像(image-to-image) - 图像到文本(image-to-text) - 文本到图像(text-to-image) 语言:英语(en) 标签: - Lightroom - 思维链(Chain of Thought,cot) - Lua(lua) - XMP(xmp) 样本量区间:1000至10000个样本(1K<n<10K) ### MMArt-PPR10k 数据集 `MMArt-PPR10k` 是专为指令驱动型AI智能体图像修图任务研究打造的多模态数据集。该数据集基于原始PPR10k数据集构建,包含丰富的配对图像数据、用户指令,以及Lightroom中所使用的Lua/XMP工具相关信息。 ### 数据集结构 数据集采用层级文件夹结构进行组织。每个数据样本对应一组特定的图像对及其关联文件,存储于唯一的目录中。以下是核心组成部分的详细说明: * **唯一样本文件夹**:在语言文件夹内,每个子目录(例如`1000_1`)代表一个独立的数据样本。 * **用户指令**:每个样本文件夹中包含不同长度的用户指令子目录: * `user_want_long`(长文本用户指令) * `user_want_middle`(中文本用户指令) * `user_want_short`(短文本用户指令) * **图像与配置文件**:每个样本文件夹包含以下核心文件: * `before.jpg`:未经编辑的原始图像。 * `processed.jpg`:基于用户指令完成修图后的成品图像。 * `config.lua`:Lightroom中使用的Lua配置文件。 * `config.xmp`:用于存储Lightroom元数据与编辑预设的XMP文件。 ### 引用说明 若您的研究中用到MMArt数据集,请考虑引用如下文献: bash @article{jarvisart2025, title={JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent}, author={Yunlong Lin and Zixu Lin and Kunjie Lin and Jinbin Bai and Panwang Pan and Chenxin Li and Haoyu Chen and Zhongdao Wang and Xinghao Ding and Wenbo Li and Shuicheng Yan}, year={2025}, journal={arXiv preprint arXiv:2506.17612} }
提供机构:
Abhishek-03113
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作