UICaption
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/microsoft/uicaption
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列用户界面图像(包括截图和图标),并配以功能性的标题描述。具体来说,数据集中包含了113,971张独特的用户界面图像以及133,817组图像与标题的配对。这些数据涵盖了各种应用场景,文本密度、像素密度和质量各不相同,能够捕捉到用户界面元素的功能性。此外,该数据集可应用于多个任务,包括用户界面动作蕴含、基于指令的用户界面图像检索、参照表达式定位以及用户界面实体识别。
This dataset comprises a collection of user interface (UI) images, including screenshots and icons, paired with functional title descriptions. Specifically, the dataset contains 113,971 unique UI images and 133,817 image-title pairs. It covers diverse application scenarios with varying text density, pixel density and image quality, and captures the functionality of UI elements. Furthermore, this dataset can be applied to multiple downstream tasks, including UI action entailment, instruction-based UI image retrieval, referring expression grounding, and UI entity recognition.
提供机构:
Curated by the authors of the paper.



