mmembed/docmatix
收藏Hugging Face2025-03-26 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mmembed/docmatix
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本和图像数据,适用于需要处理这两种类型数据的任务。训练集包含超过125万个示例,数据集总大小约为582GB,下载大小约为535GB。数据集以训练集的形式提供,并可通过指定的路径访问。
The dataset includes both text and image data, suitable for tasks that require processing these two types of data. The training set contains more than 1.25 million examples, with the total dataset size being approximately 582GB and the download size being about 535GB. The dataset is provided in the form of a training set and can be accessed through specified paths.
提供机构:
mmembed



