UniTalk

arXiv, indexed 2025-09-30
Download link:
https://huggingface.co/datasets/plnguyen2908/UniTalk-ASD
Resource description:
UniTalk is a dataset designed for active speaker detection (ASD), with a particular focus on challenging real-world scenarios such as underrepresented languages, noisy backgrounds, and crowded scenes. It contains over 44.5 hours of video, with frame-level active speaker annotations covering 48,693 speaker identities. UniTalk sets a new benchmark for active speaker detection and serves as a resource for developing and evaluating models under real-world conditions.
Provider:
Hugging Face