Voxel51/document-haystack-10pages
Source: Hugging Face | Updated: 2025-10-14 | Indexed: 2025-10-25
Download link:
https://hf-mirror.com/datasets/Voxel51/document-haystack-10pages
Description:
The document-haystack-10pages dataset is a FiftyOne dataset with 250 samples. It is the 10-page subset of the full Document Haystack dataset. It includes 25 real-world base documents of 10 pages each, with 10 needles per page, and is designed to evaluate the performance of Vision Language Models on long, visually complex documents.
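The counts in the description can be checked with a short sketch. It assumes one sample per page, which the listing does not state outright but which matches 25 documents x 10 pages = 250 samples. The helper names `expected_sample_count` and `load_subset` are hypothetical, introduced only for illustration. The loader relies on FiftyOne's Hugging Face integration (`load_from_hub`) and needs the `fiftyone` package plus network access, so it is defined here but not called.

```python
# Counts taken from the dataset description.
NUM_BASE_DOCUMENTS = 25
PAGES_PER_DOCUMENT = 10


def expected_sample_count(num_documents: int = NUM_BASE_DOCUMENTS,
                          pages_per_document: int = PAGES_PER_DOCUMENT) -> int:
    """Assumption: one sample per page, so samples = documents * pages."""
    return num_documents * pages_per_document


def load_subset(repo_id: str = "Voxel51/document-haystack-10pages"):
    """Download the dataset from the Hugging Face Hub.

    Requires the fiftyone package and network access.
    """
    # Imported lazily so the arithmetic above works without fiftyone installed.
    from fiftyone.utils.huggingface import load_from_hub

    return load_from_hub(repo_id)


if __name__ == "__main__":
    print(expected_sample_count())
```

After calling `load_subset()`, comparing `len(dataset)` against `expected_sample_count()` is a quick way to confirm the download matches the listing.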
Provided by:
Voxel51



