jianzongwu/VGGSound-T2AV
收藏Hugging Face2025-12-03 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/jianzongwu/VGGSound-T2AV
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- text-to-audio
- text-to-video
language:
- en
size_categories:
- 100K<n<1M
---
This is the VGGSound dataset (annotated with video and audio prompts) for paper "Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation"
This repo only contains the annotated train and evaluation metadata, please download the video files from [Loie/VGGSound](https://huggingface.co/datasets/Loie/VGGSound).
arXiv: https://arxiv.org/abs/2512.02457
Project: https://jianzongwu.github.io/projects/does-hearing-help-seeing/
Code: https://github.com/jianzongwu/Does-Hearing-Help-Seeing
提供机构:
jianzongwu



