Tevatron/pixmo-docs-corpus
收藏Hugging Face2025-02-10 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Tevatron/pixmo-docs-corpus
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文档标识符(docid)、图片(image)、文本(text)和来源(source)四个字段。其中,文本和来源字段为字符串类型,图片字段为图片类型。数据集划分为训练集,共有251165个示例,大小约为54079亿字节。数据集的下载大小为53614兆字节。
The dataset includes four fields: document identifier (docid), image, text, and source. The text and source fields are of string type, and the image field is of image type. The dataset is split into a training set with a total of 251165 examples, totaling approximately 54,079 billion bytes. The download size of the dataset is 53,614 megabytes.
提供机构:
Tevatron



