2,858,306组通用场景图像描述数据_详细描述
收藏魔搭社区2026-05-15 更新2026-05-03 收录
下载链接:
https://modelscope.cn/datasets/DatatangBeijing/2858306Pairs_Image_Caption_Data_Of_General_Scenes
下载链接
链接失效反馈官方服务:
资源简介:
2,858,306组图像及描述,图片类型涵盖风景、动物、花卉树木、人物、汽车、运动、工业以及建筑等多种类别及一个美学子集,描述了图像的整体场景,场景中的细节及图像所表达的情感,描述语言为英语,中文两种语言。
This dataset comprises 2,858,306 image-caption pairs. The included images cover diverse categories such as landscapes, animals, flowers and trees, human figures, automobiles, sports, industrial scenes, architecture, as well as an aesthetic subset. Each caption elaborates on the overall scene of the corresponding image, the fine-grained details within the scene, and the emotional connotations conveyed by the image, and is provided in both English and Chinese.
提供机构:
maas
创建时间:
2026-01-26
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集包含285.8万组通用场景图像描述数据,覆盖风景、动物、人物等多种类别,提供中英文描述,内容涵盖整体场景、细节元素和情感表达。图像分辨率较高,描述准确率不低于95%。
以上内容由遇见数据集搜集并总结生成



