five

nnenufar/speakerVerification_PTBR

收藏
Hugging Face2024-06-29 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/nnenufar/speakerVerification_PTBR
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含约80,000个巴西葡萄牙语的语音音频样本,样本长度从1秒到4秒不等,采样率为16kHz。元数据文件包括每个样本的说话者标签和相应的标签,适用于说话者识别和说话者验证任务。音频样本来自三个更大的语料库:C-ORAL Brasil、NURC Recife和NURC SP。录音主要来自独白,但有时会有研究人员的短暂打断。录音涵盖了巴西三个不同州的口音:贝洛奥里藏特、圣保罗和累西腓。数据集的结构使其可以通过HF Audiofolder加载,但建议将数据集克隆到本地机器上,然后指定本地数据目录进行加载。

This dataset includes ~80k samples of speech audio in Brazilian Portuguese. Samples have variable length ranging from 1 to 4 seconds, with a sampling rate of 16kHz. The metadata file includes speaker tags and corresponding labels for each sample, making it appropriate for speaker identification and speaker verification tasks. Audio samples are taken from three bigger corpora: C-ORAL Brasil, NURC Recife and NURC SP. The recordings comprise accents from three different states of Brazil: Belo Horizonte, São Paulo and Recife. The dataset is structured in a way that makes it possible to load with HF Audiofolder.
提供机构:
nnenufar
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作