Data for: HNC: Leveraging Hard Negative Captions towards Models with Fine-Gra...
Source: B2FIND. Indexed 2026-03-19.
Download link:
https://b2find.eudat.eu/dataset/1a57a769-dce6-5f60-b70c-bdaf82b9746a
Description:
Image-Text-Matching (ITM) is one of the de facto methods of learning generalized representations from a large corpus in Vision and Language (VL). However, due to the weak...



