AVSpeech

Name: AVSpeech
Creator: AVSpeech
Published: 2025-09-30T13:34:04+08:00

arXiv2025-09-30 收录

面部识别

机器学习

数据链接：

https://looking-to-listen.github.io/avspeech/数据链接链接失效反馈

官方服务：

资源简介：

该数据集名为AVSpeech，包含了来自不同来源的数千小时视频片段，精心挑选以展现丰富多样的面部表情和头部姿势。该数据集被用于训练和测试MorphGAN模型，同时也用于评估面部识别网络对于姿势和表情变化的敏感性。其规模包括用于训练的13,000个视频子集，以及用于测试的150个视频。这项任务旨在进行面部识别及其鲁棒性评估。

This dataset, named AVSpeech, consists of thousands of hours of video clips collected from diverse sources, carefully curated to showcase a wide range of facial expressions and head poses. It has been used for training and testing the MorphGAN model, as well as for evaluating the sensitivity of facial recognition networks to variations in pose and expression. In terms of scale, it includes a 13,000-video training subset and 150 videos for testing. The task centered on this dataset focuses on facial recognition and the evaluation of its robustness.

提供机构：

AVSpeech

搜集汇总

数据集介绍