yzeng58/CoBSAT

Name: yzeng58/CoBSAT
Creator: yzeng58
Published: 2024-02-29 21:35:55
License: 暂无描述

Hugging Face2024-02-29 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/yzeng58/CoBSAT

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: mit task_categories: - text-to-image language: - en tags: - MLLM - in-context learning - text-to-image generation - T2I-ICL - ICL - NLP - natural language processing pretty_name: CoBSAT size_categories: - 1K<n<10K --- **Dataset**: The CoBSAT benchmark evaluates the ability of MLLMs to perform T2I-ICL. It covers five themes: color, background, style, action, and texture, each with two different emphases: object-inference and attribute-inference. Here, we visualize the images and their corresponding labels and captions collected for our dataset. We further integrate the images and their labels for constructing the prompts for text-to-image in-context learning using the processing code provided in https://github.com/UW-Madison-Lee-Lab/CoBSAT. **Paper Link**: https://arxiv.org/abs/2402.01293 ```tex @article{zeng2024can, title={Can MLLMs Perform Text-to-Image In-Context Learning?}, author={Zeng, Yuchen and Kang, Wonjun and Chen, Yicong and Koo, Hyung Il and Lee, Kangwook}, journal={arXiv preprint arXiv:2402.01293}, year={2024} } ```

提供机构：

yzeng58

原始信息汇总

数据集概述

基本信息

许可证: MIT
任务类别: 文本到图像
语言: 英语
标签: MLLM, in-context learning, text-to-image generation, T2I-ICL, ICL, NLP, natural language processing
易读名称: CoBSAT
大小类别: 1K<n<10K

详细描述

数据集: CoBSAT 基准测试评估 MLLMs 执行 T2I-ICL 的能力。它涵盖五个主题：颜色、背景、风格、动作和纹理，每个主题有两个不同的重点：对象推理和属性推理。数据集包括图像及其相应的标签和说明，用于构建文本到图像的上下文学习提示。

yzeng58/CoBSAT

数据集概述

基本信息

详细描述

相关文献