JohnVitz/NLP_Final_Project_ArXiv_Parsed_Images
收藏Hugging Face2025-04-07 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/JohnVitz/NLP_Final_Project_ArXiv_Parsed_Images
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含图像和元数据的训练数据集,图像数据以图片文件形式存在,元数据包含每个图像的唯一标识符、图像的PDF文件路径以及图像在PDF中的页码。数据集分为训练集,共有6212个样本。
This is a training dataset containing images and metadata, with the image data in the form of image files and the metadata including a unique identifier for each image, the path to the images PDF file, and the page number of the image in the PDF. The dataset is split into a training set with a total of 6212 samples.
提供机构:
JohnVitz



