AVA

Indexed from arXiv on 2025-09-30
Download link:
http://research.google.com/ava/
Resource overview:
AVA is a dataset of video clips for active speaker detection (ASD). It includes dubbed movies, which are problematic for both training and evaluation: they contain segments that are annotated as speaking but whose speech is not actually synchronized, so they can produce misleading training and evaluation results.