asoria/pdf-papers-docling
Hugging Face | Updated 2024-12-11 | Indexed 2024-12-14
Download link:
https://hf-mirror.com/datasets/asoria/pdf-papers-docling
Summary:
The dataset contains the fields filename, content, schema name, version, name, furniture, body, groups, texts, pictures, tables, key-value items, and pages. Each field is documented with its subfields, including their data types and structure. The dataset also documents its train split, giving the split's size and number of examples.
Provided by:
asoria
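
As a minimal sketch of how the field list in the summary might be used, the helper below checks a single record (a plain mapping) for the fields named there. The snake_case column names in `EXPECTED_FIELDS` and the helper `missing_fields` are assumptions made for illustration. This listing does not give the exact column names, nor does it say whether the document fields are top-level columns or nested under `content`, so adjust both to the actual schema.

```python
from typing import Any, List, Mapping, Sequence

# Assumed column names, inferred from the summary text only.
# The real dataset may spell or nest these differently.
EXPECTED_FIELDS = (
    "filename", "content", "schema_name", "version", "name",
    "furniture", "body", "groups", "texts", "pictures",
    "tables", "key_value_items", "pages",
)


def missing_fields(
    record: Mapping[str, Any],
    expected: Sequence[str] = EXPECTED_FIELDS,
) -> List[str]:
    """Return the expected field names that are absent from a record."""
    return [field for field in expected if field not in record]


if __name__ == "__main__":
    # A hand-made record for illustration, not real data from the dataset.
    sample = {"filename": "paper.pdf", "texts": [], "pages": {}}
    print(missing_fields(sample))
```

With the Hugging Face `datasets` library installed, a real record could be obtained with `load_dataset("asoria/pdf-papers-docling", split="train")[0]` and passed to the helper. If the document fields turn out to be nested, pass `record["content"]` instead.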



