ActivityNet-QA
收藏DataCite Commons2026-01-07 更新2025-04-16 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/425fddd6-021d-4b53-8fd4-9a0a09ff80b2
下载链接
链接失效反馈官方服务:
资源简介:
Video question answering (VideoQA) is an essential task in vision-language understanding, which has attracted numerous research attention recently. Nevertheless, existing works mostly achieve promising performances on short videos of duration within 15 seconds. For VideoQA on minute-level long-term videos, those methods are likely to fail because of lacking the ability to deal with noise and redundancy caused by scene changes and multiple actions in the video.
提供机构:
TIB
创建时间:
2025-01-03



