AVA-ActiveSpeaker

Name: AVA-ActiveSpeaker
Creator: OpenDataLab
Published: 2026-05-24 05:30:07
License: 暂无描述

OpenDataLab2026-05-24 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/AVA-ActiveSpeaker

下载链接

链接失效反馈

官方服务：

资源简介：

包含视频中时间标记的人脸轨迹，其中每个人脸实例都被标记为说话与否，以及语音是否可听。该数据集包含大约 365 万个人类标记帧或大约 38.5 小时的面部轨迹，以及相应的音频。

This dataset contains time-stamped facial trajectories from videos. Each facial instance is annotated as either speaking or non-speaking, and whether the corresponding speech is audible. The dataset includes approximately 3.65 million human-annotated frames, or equivalently roughly 38.5 hours of facial trajectories, along with the corresponding audio.

提供机构：

OpenDataLab

创建时间：

2022-05-23

搜集汇总

数据集介绍