VLM-Perception/RVL-CDIP
收藏Hugging Face2025-05-17 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/VLM-Perception/RVL-CDIP
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个部分:认知视觉问答(cognition_vqa)、文档分类(document_classification)和推理视觉问答(reasoning_vqa)。每个部分都包括图像、文本消息、答案和相关元信息(如ID和问题类型)。认知视觉问答和推理视觉问答各有400个训练示例,而文档分类也是400个训练示例。数据集支持训练集的下载和实际使用。
The dataset consists of three parts: Cognition Visual Question Answering (cognition_vqa), Document Classification (document_classification), and Reasoning Visual Question Answering (reasoning_vqa). Each part includes images, text messages, answers, and related metadata (such as ID and question type). Both Cognition Visual Question Answering and Reasoning Visual Question Answering have 400 training examples each, and Document Classification also has 400 training examples. The dataset supports the download and actual use of the training set.
提供机构:
VLM-Perception



