axxkaya/UVT-Explanatory-based-Vision-Tasks
收藏Hugging Face2025-02-12 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/axxkaya/UVT-Explanatory-based-Vision-Tasks
下载链接
链接失效反馈官方服务:
资源简介:
UVT解释性视觉任务数据集,包含1200万个图像输入、解释性指令和输出的三元组,用于训练自回归视觉语言模型,实现指令级别的零样本任务泛化能力。
UVT Explanatory Vision Tasks dataset, containing 12 million image input → explanatory instruction → output triplets for training auto-regressive vision-language models to achieve instruction-level zero-shot task generalization capability.
提供机构:
axxkaya



