Data for: HNC: Leveraging Hard Negative Captions towards Models with Fine-Gra...
Source: B2FIND. Indexed: 2026-04-29
Download link:
https://b2find.eudat.eu/dataset/ca088247-9233-5fdd-a552-6ed358953800
Description:
Image-Text-Matching (ITM) is one of the de facto methods for learning generalized representations from a large corpus in Vision and Language (VL). However, due to the weak...



