DataComp-12M

Name: DataComp-12M
Creator: maas
Published: 2025-12-05 11:39:00
License: 暂无描述

魔搭社区2025-12-05 更新2025-07-05 收录

下载链接：

https://modelscope.cn/datasets/mlfoundations/DataComp-12M

下载链接

链接失效反馈

官方服务：

资源简介：

# Dataset Card for DataComp-12M  This dataset contains a 12M subset of [DataComp-1B-BestPool](https://huggingface.co/datasets/mlfoundations/datacomp_1b). We distribute the image url-text samples and metadata under a standard Creative Common CC-BY-4.0 license. The individual images are under their own copyrights. Image-text models trained on DataComp-12M are significantly better than on CC-12M/YFCC-15M as well as DataComp-Small/Medium. DataComp-12M was introduced in [MobileCLIP paper](https://arxiv.org/abs/2311.17049) and along with the reinforced dataset [DataCompDR-12M](https://huggingface.co/datasets/apple/DataCompDR-12M). The UIDs per shards match between [mlfoundations/DataComp-12M](https://huggingface.co/datasets/mlfoundations/DataComp-12M) and [apple/DataCompDR-12M](https://huggingface.co/datasets/apple/DataCompDR-12M). ## Terms and Conditions We have terms of service that are similar to those adopted by HuggingFace (https://huggingface.co/terms-of-service), which covers their dataset library. Specifically, any content you download, access or use from our index, is at your own risk and subject to the terms of service or copyright limitations accompanying such content. The image url-text index, which is a research artifact, is provided as is. By using said index, you assume all risks, including but not limited to, liabilities related to image downloading and storage. ## Citation **[DataComp: In search of the next generation of multimodal datasets](https://arxiv.org/abs/2304.14108). (NeurIPS 2024)** Gadre, Samir Yitzhak, et al. ``` @article{gadre2024datacomp, title={Datacomp: In search of the next generation of multimodal datasets}, author={Gadre, Samir Yitzhak and Ilharco, Gabriel and Fang, Alex and Hayase, Jonathan and Smyrnis, Georgios and Nguyen, Thao and Marten, Ryan and Wortsman, Mitchell and Ghosh, Dhruba and Zhang, Jieyu and others}, journal={Advances in Neural Information Processing Systems}, volume={36}, year={2024} } ``` **[MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training](https://arxiv.org/pdf/2311.17049.pdf). (CVPR 2024)** *Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel.* ```bibtex @InProceedings{mobileclip2024, author = {Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel}, title = {MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2024}, } ```

# DataComp-12M 数据集卡片  本数据集为[DataComp-1B-BestPool](https://huggingface.co/datasets/mlfoundations/datacomp_1b)的1200万样本子集。我们按照标准知识共享CC-BY-4.0许可协议发布图像URL-文本样本与元数据，单张图像的版权归其各自权利人所有。在DataComp-12M上训练的图像-文本模型性能显著优于在CC-12M、YFCC-15M以及DataComp-Small/Medium上训练的模型。 DataComp-12M首次在[MobileCLIP论文](https://arxiv.org/abs/2311.17049)中提出，同步发布的还有增强版数据集[DataCompDR-12M](https://huggingface.co/datasets/apple/DataCompDR-12M)。[mlfoundations/DataComp-12M](https://huggingface.co/datasets/mlfoundations/DataComp-12M)与[apple/DataCompDR-12M](https://huggingface.co/datasets/apple/DataCompDR-12M)各数据分片的唯一标识符（Unique Identifier，UID）完全一致。 ## 条款与条件我们采用与HuggingFace（https://huggingface.co/terms-of-service）数据集库相似的服务条款。具体而言，您从本索引下载、访问或使用的任何内容均由您自行承担风险，并受该内容附带的服务条款或版权限制约束。本图像URL-文本索引作为一项研究成果，按"现状"提供。使用本索引即意味着您将承担全部风险，包括但不限于与图像下载和存储相关的法律责任。 ## 引用文献 **[DataComp：探寻下一代多模态数据集](https://arxiv.org/abs/2304.14108)（NeurIPS 2024）** Gadre, Samir Yitzhak 等。 @article{gadre2024datacomp, title={Datacomp: In search of the next generation of multimodal datasets}, author={Gadre, Samir Yitzhak and Ilharco, Gabriel and Fang, Alex and Hayase, Jonathan and Smyrnis, Georgios and Nguyen, Thao and Marten, Ryan and Wortsman, Mitchell and Ghosh, Dhruba and Zhang, Jieyu and others}, journal={Advances in Neural Information Processing Systems}, volume={36}, year={2024} } **[MobileCLIP：通过多模态增强训练实现快速图像-文本模型](https://arxiv.org/pdf/2311.17049.pdf)（CVPR 2024）** *Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel.* bibtex @InProceedings{mobileclip2024, author = {Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel}, title = {MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year={2024}, }

提供机构：

maas

创建时间：

2025-10-03

5,000+

优质数据集

54 个

任务类型

进入经典数据集