MSVD
Source: DataCite Commons. Updated: 2026-01-07. Indexed: 2026-05-05.
Download link:
https://service.tib.eu/ldmservice/dataset/7fd5efac-2142-422c-93c7-8f117c3c3d6b
Description:
Text-Video Retrieval (TVR) aims to align relevant video content with natural language queries. To date, most state-of-the-art TVR methods rely on image-to-video transfer learning built on large-scale pre-trained vision-language models (e.g., CLIP). However, fully fine-tuning these pre-trained models for TVR incurs prohibitively high computational costs.
Provider:
TIB
Created:
2024-12-02



