Beyond Labels: Explainable Offensive Meme Detection with LLM-VLM Synergy

Name: Beyond Labels: Explainable Offensive Meme Detection with LLM-VLM Synergy
Creator: Singh, Abhishek
Published: 2025-04-02 00:00:00
License: 暂无描述

Figshare2025-04-02 更新2026-04-08 收录

下载链接：

https://figshare.com/articles/dataset/Beyond_Labels_Explainable_Offensive_Meme_Detection_with_LLM-VLM_Synergy/28715894/1

下载链接

链接失效反馈

官方服务：

资源简介：

The widespread use of memes on social media platforms has significantly increased the propagation of offensive content, necessitatingautomated methods for their detection and interpretation. This study introduces a novel multimodal approach leveraging advancedVision-Language Models (VLMs) and Large Language Models (LLMs) to identify offensive memes, particularly focusing on Hindilanguage content. Our methodology integrates visual embeddings from ImageBind and nomic-embed vision models with textualembeddings generated using multilingual e5-large, mBART50, and mT5 architectures. This unified multimodal framework effectivelyaddresses two hierarchical tasks: detecting offensive memes (Level 1) and distinguishing implicit from explicit offensiveness (Level2). Additionally, we propose a hierarchical zero-shot reasoning architecture that classifies memes and provides explicit reasoningexplaining the nature and context of offensiveness. Comprehensive experiments conducted on a publicly available Hindi meme datasetdemonstrate our model’s superior performance compared to existing state-of-the-art systems in terms of accuracy, interpretability,and robustness. This dual-task approach of detection and nuanced classification, coupled with interpretability, paves the way for moretransparent, accountable, and reliable automated moderation systems on digital platforms.

提供机构：

Singh, Abhishek

创建时间：

2025-04-02

5,000+

优质数据集

54 个

任务类型

进入经典数据集