shunk031/DrawBench

Name: shunk031/DrawBench
Creator: shunk031
Published: 2023-09-27 13:13:31
License: 暂无描述

Hugging Face2023-09-27 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/shunk031/DrawBench

下载链接

链接失效反馈

官方服务：

资源简介：

--- annotations_creators: - crowdsourced language: - en language_creators: [] license: - unknown multilinguality: - monolingual pretty_name: DrawBench size_categories: - n<1K source_datasets: - original tags: [] task_categories: - text-to-image task_ids: [] --- # Dataset Card for DrawBench ## Table of Contents - [Dataset Card Creation Guide](#dataset-card-creation-guide) - [Table of Contents](#table-of-contents) - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks and Leaderboards](#supported-tasks-and-leaderboards) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Dataset Creation](#dataset-creation) - [Curation Rationale](#curation-rationale) - [Source Data](#source-data) - [Initial Data Collection and Normalization](#initial-data-collection-and-normalization) - [Who are the source language producers?](#who-are-the-source-language-producers) - [Annotations](#annotations) - [Annotation process](#annotation-process) - [Who are the annotators?](#who-are-the-annotators) - [Personal and Sensitive Information](#personal-and-sensitive-information) - [Considerations for Using the Data](#considerations-for-using-the-data) - [Social Impact of Dataset](#social-impact-of-dataset) - [Discussion of Biases](#discussion-of-biases) - [Other Known Limitations](#other-known-limitations) - [Additional Information](#additional-information) - [Dataset Curators](#dataset-curators) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) - [Contributions](#contributions) ## Dataset Description - **Homepage:** https://imagen.research.google/ - **Repository:** https://github.com/shunk031/huggingface-datasets_DrawBench - **Paper:** https://arxiv.org/abs/2205.11487 ### Dataset Summary [More Information Needed] ### Supported Tasks and Leaderboards [More Information Needed] ### Languages The language data in DrawBench is in English (BCP-47 en-US). ## Dataset Structure ### Data Instances [More Information Needed] ### Data Fields [More Information Needed] ### Data Splits [More Information Needed] ## Dataset Creation ### Curation Rationale [More Information Needed] ### Source Data [More Information Needed] #### Initial Data Collection and Normalization [More Information Needed] #### Who are the source language producers? [More Information Needed] ### Annotations [More Information Needed] #### Annotation process [More Information Needed] #### Who are the annotators? [More Information Needed] ### Personal and Sensitive Information [More Information Needed] ## Considerations for Using the Data ### Social Impact of Dataset [More Information Needed] ### Discussion of Biases [More Information Needed] ### Other Known Limitations [More Information Needed] ## Additional Information ### Dataset Curators [More Information Needed] ### Licensing Information [More Information Needed] ### Citation Information ```bibtex @article{saharia2022photorealistic, title={Photorealistic text-to-image diffusion models with deep language understanding}, author={Saharia, Chitwan and Chan, William and Saxena, Saurabh and Li, Lala and Whang, Jay and Denton, Emily L and Ghasemipour, Kamyar and Gontijo Lopes, Raphael and Karagol Ayan, Burcu and Salimans, Tim and others}, journal={Advances in Neural Information Processing Systems}, volume={35}, pages={36479--36494}, year={2022} } ``` ### Contributions Thanks to Google Research, Brain Team for creating this dataset.

annotations_creators: - 众包 language: - 英语 language_creators: - 无 license: - 未知 multilinguality: - 单语言 pretty_name: DrawBench size_categories: - 样本量少于1000 source_datasets: - 原始数据集 tags: - 无 task_categories: - 文本到图像生成（text-to-image） task_ids: - 无 # DrawBench 数据集卡片 ## 目录 - [数据集卡片创建指南](#dataset-card-creation-guide) - [目录](#table-of-contents) - [数据集描述](#dataset-description) - [数据集概述](#dataset-summary) - [支持任务与评测榜单](#supported-tasks-and-leaderboards) - [语言说明](#languages) - [数据集结构](#dataset-structure) - [数据实例](#data-instances) - [数据字段](#data-fields) - [数据划分](#data-splits) - [数据集构建](#dataset-creation) - [构建依据](#curation-rationale) - [源数据](#source-data) - [初始数据收集与标准化](#initial-data-collection-and-normalization) - [源语言生成者为何人？](#who-are-the-source-language-producers) - [注释](#annotations) - [注释流程](#annotation-process) - [注释人员为何人？](#who-are-the-annotators) - [个人与敏感信息](#personal-and-sensitive-information) - [数据集使用注意事项](#considerations-for-using-the-data) - [数据集的社会影响](#social-impact-of-dataset) - [偏见讨论](#discussion-of-biases) - [其他已知局限性](#other-known-limitations) - [附加信息](#additional-information) - [数据集整理者](#dataset-curators) - [授权信息](#licensing-information) - [引用信息](#citation-information) - [贡献致谢](#contributions) ## 数据集描述 - **主页**：https://imagen.research.google/ - **代码仓库**：https://github.com/shunk031/huggingface-datasets_DrawBench - **相关论文**：https://arxiv.org/abs/2205.11487 ### 数据集概述 [更多信息待补充] ### 支持任务与评测榜单 [更多信息待补充] ### 语言说明 DrawBench 所使用的语言数据为英语（BCP-47 语言编码：en-US）。 ## 数据集结构 ### 数据实例 [更多信息待补充] ### 数据字段 [更多信息待补充] ### 数据划分 [更多信息待补充] ## 数据集构建 ### 构建依据 [更多信息待补充] ### 源数据 [更多信息待补充] #### 初始数据收集与标准化 [更多信息待补充] #### 源语言生成者为何人？ [更多信息待补充] ### 注释 [更多信息待补充] #### 注释流程 [更多信息待补充] #### 注释人员为何人？ [更多信息待补充] ### 个人与敏感信息 [更多信息待补充] ## 数据集使用注意事项 ### 数据集的社会影响 [更多信息待补充] ### 偏见讨论 [更多信息待补充] ### 其他已知局限性 [更多信息待补充] ## 附加信息 ### 数据集整理者 [更多信息待补充] ### 授权信息 [更多信息待补充] ### 引用信息 bibtex @article{saharia2022photorealistic, title={具备深度语言理解能力的写实文本到图像生成扩散模型（Photorealistic text-to-image diffusion models with deep language understanding）}, author={Saharia, Chitwan and Chan, William and Saxena, Saurabh and Li, Lala and Whang, Jay and Denton, Emily L and Ghasemipour, Kamyar and Gontijo Lopes, Raphael and Karagol Ayan, Burcu and Salimans, Tim and others}, journal={Advances in Neural Information Processing Systems}, volume={35}, pages={36479--36494}, year={2022} } ### 贡献致谢感谢Google Research（谷歌研究院）脑团队创建本数据集。

提供机构：

shunk031

原始信息汇总

数据集卡片 for DrawBench

数据集描述

数据集摘要

[更多信息需要]

支持的任务和排行榜

[更多信息需要]

语言

DrawBench 中的语言数据是英语（BCP-47 en-US）。

数据集结构

数据实例

[更多信息需要]

数据字段

[更多信息需要]

数据分割

[更多信息需要]

数据集创建

策划理由

[更多信息需要]

源数据

[更多信息需要]

初始数据收集和规范化

[更多信息需要]

源语言生产者是谁？

[更多信息需要]

注释

[更多信息需要]

注释过程

[更多信息需要]

注释者是谁？

[更多信息需要]

个人和敏感信息

[更多信息需要]

使用数据的注意事项

数据集的社会影响

[更多信息需要]

偏见的讨论

[更多信息需要]

其他已知限制

[更多信息需要]

附加信息

数据集策展人

[更多信息需要]

许可信息

[更多信息需要]

引用信息

bibtex @article{saharia2022photorealistic, title={Photorealistic text-to-image diffusion models with deep language understanding}, author={Saharia, Chitwan and Chan, William and Saxena, Saurabh and Li, Lala and Whang, Jay and Denton, Emily L and Ghasemipour, Kamyar and Gontijo Lopes, Raphael and Karagol Ayan, Burcu and Salimans, Tim and others}, journal={Advances in Neural Information Processing Systems}, volume={35}, pages={36479--36494}, year={2022} }

贡献

感谢 Google Research, Brain Team 创建此数据集。

搜集汇总

数据集介绍

构建方式

在文本到图像生成领域，DrawBench数据集作为一项基准测试工具，其构建过程体现了严谨的学术考量。该数据集源自Google Research的Imagen项目，通过众包方式收集英文文本提示，旨在系统评估扩散模型对复杂语义的理解与生成能力。数据集的构建遵循了严格的规范化流程，确保每个文本描述都经过精心筛选与标注，为模型性能的量化比较提供了可靠基础。

使用方法

该数据集主要应用于文本到图像生成模型的基准测试与性能评估。研究人员可通过加载数据集中的文本提示，输入至待测生成模型，并对比生成图像与预期语义的吻合度。使用时应参考其关联论文中提出的评估框架，结合人工或自动化指标进行量化分析。数据集支持对模型在语言理解、视觉保真度及创造性表达等方面的综合考察，为相关领域的算法优化提供实证依据。

背景与挑战

背景概述

DrawBench数据集由Google Research的Brain Team于2022年创建，作为其文本到图像生成模型Imagen的评估基准。该数据集旨在解决文本到图像生成领域中的核心研究问题，即如何系统评估生成模型在理解复杂文本描述、处理视觉细节和语义一致性方面的能力。通过提供一系列精心设计的文本提示，DrawBench为研究人员提供了一个标准化的测试平台，推动了生成模型在真实性和语言理解方面的进步，对计算机视觉和自然语言处理的交叉领域产生了深远影响。

当前挑战

DrawBench数据集面临的挑战主要体现在两个方面：首先，在领域问题层面，文本到图像生成任务要求模型不仅准确捕捉文本语义，还需处理视觉细节、空间关系和抽象概念的具象化，这带来了生成图像的真实性、多样性和语义对齐的评估难题。其次，在构建过程中，数据集的创建需依赖众包标注，确保提示的多样性和复杂性，同时避免偏见和歧义，这涉及标注质量控制和数据平衡的挑战，以保障评估的全面性和公正性。

常用场景

经典使用场景

在文本到图像生成领域，DrawBench数据集作为一项精心设计的评估基准，其经典使用场景聚焦于系统性地评测扩散模型在理解复杂文本描述并生成相应图像方面的能力。该数据集通过涵盖多样化、具有挑战性的提示词，如组合性、空间关系、文本渲染和罕见概念等，为研究者提供了一个标准化的测试平台，用以量化模型在语义对齐、视觉保真度和创造性表达上的表现。

解决学术问题

DrawBench的构建旨在解决文本到图像生成研究中长期存在的评估难题，即如何超越主观的定性分析，建立客观、全面且可复现的量化评估体系。它通过引入细粒度的、覆盖多维度能力的提示集合，帮助学术界精准诊断模型的失败模式与能力边界，例如在理解抽象概念、处理逻辑组合或遵循详细指令方面的不足，从而推动了模型架构与训练方法的针对性改进。

实际应用

在实际应用层面，DrawBench为开发高性能文本到图像生成系统提供了关键的迭代反馈工具。技术团队可以依据其评测结果，优化模型对用户意图的理解精度，提升生成图像的质量与可靠性。这直接赋能于内容创作、艺术设计、教育可视化以及辅助创意产业等多个领域，确保最终部署的模型能够稳定地满足复杂、具体的图像生成需求，增强产品的实用价值与用户体验。

数据集最近研究