AVSD dataset
收藏DataCite Commons2026-01-07 更新2025-04-16 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/925a5da4-f1d6-4ec4-9376-d5c2409f9aeb
下载链接
链接失效反馈官方服务:
资源简介:
The AVSD dataset is a benchmark for audio-visual scene-aware dialog. It consists of 7659 training, 734 prototype validation, and 733 prototype testing dialog, where the Questioner has access to the first, middle, and last static frames of the video, while the Answerer has access to the entire video, including the audio stream and the original input descriptions.
提供机构:
TIB
创建时间:
2025-01-03



