MMInstruction/M3IT-80
收藏数据集概述
数据集名称: M3IT-80
数据集描述: M3IT-80 是 M3IT 数据集的80种语言翻译版本,涵盖了多种视觉-语言任务,包括标题生成、视觉问答(VQA)、视觉条件生成、推理和分类。
语言: 数据集包含80种语言,具体语言代码列表如下: python _LAN_CODES = [ "af", "am", "ar", "as", "ast", "be", "bg", "bn", "bs", "ca", "ceb", "cs", "cy", "da", "de", "el", "es", "et", "fi", "fr", "fuv", "gl", "gu", "ha", "he", "hi", "hr", "hu", "hy", "id", "ig", "is", "it", "ja", "jv", "ka", "kk", "km", "kn", "ko", "ky", "lb", "lg", "lij", "li", "ln", "lo", "lt", "lv", "mi", "mk", "ml", "mr", "mt", "my", "nl", "ny", "oc", "pa", "pl", "pt", "ro", "ru", "sd", "sk", "sn", "so", "sr", "sv", "ta", "te", "tg", "th", "tl", "tr", "uk", "ur", "vi", "wo", "zh", ]
数据集统计: 数据集提供了每种语言的训练/验证/测试集数量,具体统计如下:
| Task | Dataset | #Train | #Val | #Test |
|---|---|---|---|---|
| Classification | imagenet |
500 | 500 | 0 |
| Visual Question Answering | vqa-v2 |
500 | 500 | 0 |
| Knowledgeable Visual QA | okvqa |
500 | 500 | 0 |
| Reasoning | winoground |
0 | 0 | 800 |
| Generation | vist |
500 | 500 | 500 |
| Video | msrvtt |
500 | 500 | 0 |
msrvtt-qa |
500 | 500 | 0 |
源数据: 源语言为英语,使用阿里巴巴翻译服务进行翻译。
数据集结构: 数据集支持通过HuggingFace加载,具体加载方式如下: python from datasets import load_dataset
ds_name = "okvqa-zh" # 更改数据集名称 dataset = load_dataset("MMInstruction/M3IT-80", ds_name)
数据字段: 数据集包含以下字段: python features = datasets.Features( { "instruction": datasets.Value("string"), "inputs": datasets.Value("string"), "image_base64_str": [datasets.Value("string")], "outputs": datasets.Value("string"), } )
许可证信息: 原始数据集遵循其原始许可证。注释指令数据根据CC BY 4.0许可。
引用信息: bibtex @article{li2023m3it, title={M$^3$IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning}, author={Lei Li and Yuwei Yin and Shicheng Li and Liang Chen and Peiyu Wang and Shuhuai Ren and Mukai Li and Yazheng Yang and Jingjing Xu and Xu Sun and Lingpeng Kong and Qi Liu}, journal={arXiv preprint arXiv:2306.04387}, year={2023} }
贡献: M3IT-80 是一个开源的大型多模态多语言指令调优数据集,旨在促进通用多模态代理的开发。



