Multilingual Image-Text Dataset for Cross-Lingual NLP and Sentiment Analysis

Source: NIAID Data Ecosystem. Indexed 2026-05-10.

Download link:

https://data.mendeley.com/datasets/r6z3xydbzz

Description:

The Multilingual Image-Text Dataset for Cross-Lingual NLP and Sentiment Analysis contains 2,860 images with text in Banglish (Romanised Bangla), Romanised Hindi and English. Each record combines visual and textual information, which makes the dataset well suited to multimodal research. Intended applications include cross-lingual NLP, sentiment analysis, humor and sarcasm detection, political content analysis and social media research. The dataset supports text-only, image-only and multimodal learning, in both binary and multi-class settings. Images are provided in their original form, with no preprocessing or annotations, so researchers are free to extract features and build models as they see fit. The dataset is ethically compliant and contains no personally identifiable information. It is a flexible resource for studying multimodal understanding, sentiment prediction and cross-lingual AI models.

Created:

2026-03-12
