BanglaLekhaImageCaptions

NIAID Data Ecosystem2026-03-11 收录

下载链接：

https://data.mendeley.com/datasets/rxxch9vw59

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset consists of images and annotations in Bengali. The images are human annotated in Bengali by two adult native Bengali speakers. All popular image captioning datasets have a predominant western cultural bias with the annotations done in English. Using such datasets to train an image captioning system assumes that a good English to target language translation system exists and that the original dataset had elements of the target culture. Both these assumptions are false, leading to the need of a culturally relevant dataset in Bengali, to generate appropriate image captions of images relevant to the Bangladeshi and wider subcontinental context. The dataset presented consists of 9,154 images.

创建时间：

2019-07-28

5,000+

优质数据集

54 个

任务类型

进入经典数据集