Multilingual Image-Text Dataset for Cross-Lingual NLP and Sentiment Analysis
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/r6z3xydbzz
下载链接
链接失效反馈官方服务:
资源简介:
Multilingual Image-Text Dataset for Cross-lingual NLP and Sentiment Analysis has 2,860 images with text in Banglish (Romanised Bangla), Romanised Hindi and English. The records have a combination of visual and textual information, which is best suited to multimodal research.Apply to cross-lingual NLP, sentiment analysis, humor and sarcasm detection, political content and social media research. Text only, image only and multimodal learning Supports binary and multi-class binary and multi-class learning.Images are presented in their original form with no preprocessing or annotations and leave a researcher with absolute freedom to extract features and model them in the way they want. The data set is ethical and there is no personal identifiable information.
It is a highly flexible data set that can be used to learn about multimodal understanding, sentiment prediction, cross-lingual AI models, etc.
创建时间:
2026-03-12



