mmembed/MAmmoTH-VL-Instruct-12M
收藏Hugging Face2025-04-10 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mmembed/MAmmoTH-VL-Instruct-12M
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含文本和图像数据的多模态数据集,用于训练模型。它包括一个训练集,包含超过一千万个示例,数据集总大小超过800TB。文本数据以字符串形式存储,而图像数据以二进制序列存储。
This dataset is a multimodal dataset containing text and image data for model training. It includes a training set with over ten million examples, and the total size of the dataset is over 800TB. Text data is stored as strings, while image data is stored as binary sequences.
提供机构:
mmembed



