hustep-lab/VoxVietnam-Dataset

Name: hustep-lab/VoxVietnam-Dataset
Creator: hustep-lab
Published: 2025-03-31 12:51:06
License: not specified

Source: Hugging Face. Updated 2025-03-31; indexed 2025-04-12.

Download link:

https://hf-mirror.com/datasets/hustep-lab/VoxVietnam-Dataset

Description:

VoxVietnam is a large-scale multi-genre dataset for Vietnamese speaker recognition. It consists of three subsets: train (the official training set, with 1,256 speakers and 161,457 samples), train_small (sampled from train to match the size of Vietnam-Celeb, with 879 speakers and 83,000 samples), and test (the test set). In addition, VoxVietnam-E and VoxVietnam-H were labeled by volunteers without visual information, while VoxVietnam-O is an independent test set that our team verified audio-visually. The latest update releases VoxVietnam-O, and researchers are encouraged to use it for evaluation.
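As a rough illustration of the subset sizes quoted above, the sketch below computes the average number of samples per speaker for train and train_small, and shows one possible way to fetch a subset with the Hugging Face `datasets` library. The helper names (`SUBSET_STATS`, `samples_per_speaker`, `load_subset`) are hypothetical. It is also an assumption that the subsets are exposed as splits named `train`, `train_small`, and `test`; the listing does not say how the repository is organized, so check the dataset card before relying on this.

```python
# Subset sizes exactly as stated in the description; the test subset's
# size is not given there, so it is left out.
SUBSET_STATS = {
    "train": {"speakers": 1256, "samples": 161457},
    "train_small": {"speakers": 879, "samples": 83000},
}

REPO_ID = "hustep-lab/VoxVietnam-Dataset"


def samples_per_speaker(subset: str) -> float:
    """Average number of samples per speaker for a documented subset."""
    stats = SUBSET_STATS[subset]
    return stats["samples"] / stats["speakers"]


def load_subset(split: str):
    """Load one subset from the Hugging Face Hub.

    ASSUMPTION: the subsets are published as splits with these names.
    The listing does not confirm the repository layout, and access may
    require accepting terms on the dataset page.
    """
    # Imported lazily so the size arithmetic above works without the
    # `datasets` package installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)


if __name__ == "__main__":
    for name in SUBSET_STATS:
        avg = samples_per_speaker(name)
        print(f"{name}: {avg:.1f} samples per speaker on average")
```

Keeping the size arithmetic separate from the download step means the figures can be checked offline; `load_subset("train_small")` would only be called once the actual split or configuration names have been confirmed.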

Provider: hustep-lab
