Mxode/Noah-Wukong-100M
收藏Hugging Face2025-05-01 更新2025-08-30 收录
下载链接:
https://hf-mirror.com/datasets/Mxode/Noah-Wukong-100M
下载链接
链接失效反馈官方服务:
资源简介:
Noah-Wukong数据集是一个大规模的多模态中文数据集,包含1亿个<图像,文本>对。图像经过尺寸和长宽比的过滤,文本经过语言、长度、频率的过滤,并考虑了隐私和敏感词。
The Noah-Wukong dataset is a large-scale multi-modality Chinese dataset containing 100 million `<image, text>` pairs. Images in the dataset are filtered by size and aspect ratio, and text is filtered by language, length, frequency, and privacy/sensitive words.
提供机构:
Mxode



