AfriMM/AFRICaption
收藏Hugging Face2024-09-27 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/AfriMM/AFRICaption
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: id
dtype: int64
- name: image_id
dtype: string
- name: eng
dtype: string
- name: afr
dtype: string
- name: amh
dtype: string
- name: bem
dtype: string
- name: cjk
dtype: string
- name: dik
dtype: string
- name: dyu
dtype: string
- name: ewe
dtype: string
- name: fuv
dtype: string
- name: hau
dtype: string
- name: ibo
dtype: string
- name: kik
dtype: string
- name: kab
dtype: string
- name: kam
dtype: string
- name: kon
dtype: string
- name: kmb
dtype: string
- name: lua
dtype: string
- name: lug
dtype: string
- name: lin
dtype: string
- name: kin
dtype: string
- name: yor
dtype: string
splits:
- name: train
num_bytes: 12340971
num_examples: 8091
download_size: 5936673
dataset_size: 12340971
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
license: apache-2.0
task_categories:
- translation
---
## AfriMMD - African Multilingual Multimodal Dataset (POC)
AfriMMD is a multilingual dataset created to enhance linguistic diversity in AI,
focusing on African languages. This is a proof-of-concept experiment on the use
of multimodal datasets to represent African languages in AI. The dataset contains
translations of the captions in the widely-used Flickr8k dataset into 20 African
languages. The goal is to address the underrepresentation of African languages
in AI and foster more inclusive AI technologies. The image-text pairs have been
carefully translated into multiple African languages, providing an avenue
for advanced and inclusive AI development, particularly in multimodal tasks that
involve both text and images.
Images associated with the dataset can manually be downloaded from [Github](https://github.com/jbrownlee/Datasets/releases/tag/Flickr8k)
or [Kaggle](https://www.kaggle.com/datasets/adityajn105/flickr8k?select=Images)
## Supported Languages
Amharic (amh), Bemba (bem), Chokwe (cjk), Rek (dik), Dyula (dyu), Ewe (ewe),
Fulfulde (fuv), Hausa (hau), Igbo (ibo), Kikuyu (kik), Kabyle (kab),
Kamba (kam), Kikongo (kon), Kimbundu (kmb), LubaKasai (lua), Ganda (lug),
Lingala (lin), Kinyarwanda (kin), Yoruba (yor)
## Load Dataset
```python
from datasets import load_dataset
dataset = load_dataset('AfriMM/AfriMMD')
```
## Applications
- Multilingual multimodal tasks (eg: image captioning in African languages, pre-trained vision-language models, etc.)
- Translation and language learning for supported African languages.
- Research on cross-cultural understanding and representation in AI.
## Citation
```bibtex
@dataset{afrimm2024,
author = {AfriMM - ML Collective},
title = {AfriMMD: Multimodal Dataset for African Languages},
year = 2024,
url = {https://huggingface.co/datasets/AfriMM/AfriMMD}
}
```
数据集信息:
特征:
- 名称:id
数据类型:int64
- 名称:image_id
数据类型:string
- 名称:eng
数据类型:string
- 名称:afr
数据类型:string
- 名称:amh
数据类型:string
- 名称:bem
数据类型:string
- 名称:cjk
数据类型:string
- 名称:dik
数据类型:string
- 名称:dyu
数据类型:string
- 名称:ewe
数据类型:string
- 名称:fuv
数据类型:string
- 名称:hau
数据类型:string
- 名称:ibo
数据类型:string
- 名称:kik
数据类型:string
- 名称:kab
数据类型:string
- 名称:kam
数据类型:string
- 名称:kon
数据类型:string
- 名称:kmb
数据类型:string
- 名称:lua
数据类型:string
- 名称:lug
数据类型:string
- 名称:lin
数据类型:string
- 名称:kin
数据类型:string
- 名称:yor
数据类型:string
## AfriMMD - 非洲多语言多模态数据集(概念验证,POC)
AfriMMD是一个多语言数据集,旨在增强人工智能(AI)中的语言多样性,专注于非洲语言。这是一项关于使用多模态数据集在AI中表征非洲语言的概念验证(Proof of Concept, POC)实验。该数据集包含将广泛使用的Flickr8k数据集的标题翻译成20种非洲语言的内容。其目标是解决非洲语言在AI中的代表性不足问题,推动更具包容性的AI技术发展。图像-文本对已被精心翻译成多种非洲语言,为高级且包容性的AI开发提供了途径,尤其适用于涉及文本与图像的多模态任务。
数据集相关图像可手动从[Github](https://github.com/jbrownlee/Datasets/releases/tag/Flickr8k)或[Kaggle](https://www.kaggle.com/datasets/adityajn105/flickr8k?select=Images)下载。
## 支持的语言
阿姆哈拉语(Amharic,amh)、本巴语(Bemba,bem)、乔克韦语(Chokwe,cjk)、雷克语(Rek,dik)、迪尤拉语(Dyula,dyu)、埃维语(Ewe,ewe)、富尔富尔德语(Fulfulde,fuv)、豪萨语(Hausa,hau)、伊博语(Igbo,ibo)、基库尤语(Kikuyu,kik)、卡比尔语(Kabyle,kab)、坎巴语(Kamba,kam)、基孔戈语(Kikongo,kon)、金本杜语(Kimbundu,kmb)、卢巴-卡萨伊语(LubaKasai,lua)、干达语(Ganda,lug)、林加拉语(Lingala,lin)、基尼亚卢旺达语(Kinyarwanda,kin)、约鲁巴语(Yoruba,yor)
## 加载数据集
python
from datasets import load_dataset
dataset = load_dataset('AfriMM/AfriMMD')
## 应用
- 多语言多模态任务(例如:非洲语言的图像描述生成、预训练视觉-语言模型等)
- 支持的非洲语言的翻译与语言学习
- AI中的跨文化理解与表征研究
## 引用
bibtex
@dataset{afrimm2024,
author = {AfriMM - ML Collective},
title = {AfriMMD: Multimodal Dataset for African Languages},
year = 2024,
url = {https://huggingface.co/datasets/AfriMM/AfriMMD}
}
提供机构:
AfriMM



