OpenFace-CQUPT/HumanCaption-HQ-311K
收藏Hugging Face2025-06-09 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/OpenFace-CQUPT/HumanCaption-HQ-311K
下载链接
链接失效反馈官方服务:
资源简介:
HumanCaption-HQ-311K数据集包含约311,000张与人类相关的图片及其对应的自然语言描述。与HumanCaption-10M相比,该数据集不仅包含面部语言描述,还筛选了更高分辨率的图片,并利用GPT-4V的强大视觉理解能力生成更详细和准确的文本描述。该数据集用于训练HumanVLM模型,增强其在标题生成和视觉理解方面的能力。
HumanCaption-HQ-311K: Approximately 311,000 human-related images and their corresponding natural language descriptions. Compared to HumanCaption-10M, this dataset not only includes associated facial language descriptions but also filters out images with higher resolution and employs the powerful visual understanding capabilities of GPT-4V to generate more detailed and accurate text descriptions. This dataset is used for the second phase of training HumanVLM, enhancing the models capabilities in caption generation and visual understanding.
提供机构:
OpenFace-CQUPT



