five

BanglaView: A Bangla Image Captioning Dataset

收藏
Mendeley Data2024-03-27 更新2024-06-26 收录
下载链接:
https://data.mendeley.com/datasets/rrv8pbxrxv
下载链接
链接失效反馈
官方服务:
资源简介:
Image description is a more crucial part of the artificial intelligence world because it covers two important sectors, which are computer vision and natural language processing. As the proficiency of computers in interpreting visual data and converting it into textual forms has advanced, there has been a notable surge in research enthusiasm for endeavors such as automated image description in recent years. Although the majority of research efforts are concentrated on the English language within monolingual contexts, less attention has been directed towards resource-constrained languages like Bangla. This oversight primarily stems from the absence of standardized datasets for such languages. To overcome the limited availability of Bangla image captioning data, we propose BanglaView, a novel dataset inspired by Flickr30k. Here, English captions are translated into Bangla using Google Translate and refined by expert annotators. The BanglaView dataset contains a total of 31,783 images with 1,58,915 sentences. The total words, unique words, sentence length mean, and sentence length variance of the BanglaView dataset are 16,68,322, 26,348, 10.49, and 20.82. Each image in these datasets comes with five accompanying descriptions. Native Bangla speakers who are fluent in both Bangla and English painstakingly created the annotations. To maintain high standards, a team of professionals performed review and post-processing to ensure the dataset's quality.
创建时间:
2024-01-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作