SCI-3000: A Novel Dataset for the Task of Figure, Table and Caption Extraction from Scientific PDFs
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/6564970
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains bounding boxes of figures, tables, captions in 34,791 pages extracted from 3000 open-access scientific publications from the fields of medicine, chemistry, physics, computer science, and technology. The underlying publications are also included in PDF form.
For more details, refer to the README file.
创建时间:
2023-09-19



