OpenGVLab/Doc-750K
Source: Hugging Face. Updated: 2025-07-22. Indexed: 2025-08-09.
Download link:
https://hf-mirror.com/datasets/OpenGVLab/Doc-750K
Description:
This dataset supports improving multimodal models for document-level understanding, with question answering as the target task. The README does not describe the dataset's contents directly. Judging from the associated paper, "Docopilot: Improving Multimodal Models for Document-Level Understanding", it likely consists of documents paired with related question-answer pairs. The dataset may also be very large and contain many image files, so follow the extraction instructions carefully when unpacking it.
Provider:
OpenGVLab



