Typical Concept-Driven Modality-missing Deep Cross-Modal Retrieval
收藏科学数据银行2025-04-24 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=e1a61fdb89ee400fb7e13e9fdb58189d
下载链接
链接失效反馈官方服务:
资源简介:
Cross-modal retrieval takes one modality data as a query and retrieves semantically relevant data in another modality. Most existing cross-modal retrieval methods are designed for scenarios with complete modality data. However, in real-world applications, incomplete modality data often exists, which these methods struggle to handle effectively. In this paper, we propose a typical concept-driven modality-missing deep cross-modal retrieval model. Specifically, we first propose a multi-modal Transformer integrated with multi-modal pretraining networks, which can fully capture the multi-modal fine-grained semantic interaction in the incomplete modality data, extract multi-modal fusion semantics and construct cross-modal subspace, and at the same time supervise the learning process to generate typical concepts. In addition, the typical concepts are used as the cross-attention key and value to drive the training of the modal mapping network, so that it can adaptively preserve the implicit multi-modal semantic concepts of the query modality data, generate cross-modal retrieval features, and fully preserve the pre-extracted multi-modal fusion semantics. More information about the source code: https://gitee.com/MrSummer123/CPCMR
提供机构:
军事认知与脑科学研究所; Nie Xiushan; 山东师范大学信息科学与工程学院
创建时间:
2025-04-24



