CN-Celeb 一个室外收集的大规模说话人识别数据集

Name: CN-Celeb 一个室外收集的大规模说话人识别数据集
Creator: 帕依提提
License: 暂无描述

帕依提提2024-03-04 收录

下载链接：

https://www.payititi.com/opendatasets/show-47.html

下载链接

链接失效反馈

官方服务：

资源简介：

This is a large-scale speaker recognition dataset collected 'in the wild'. The dataset consists of two subsets, CN-Celeb1 and CN-Celeb2. All the audio files are coded as single channel and sampled at 16kHz with 16-bit precision. For CN-Celeb1, it contains more than 130,000 utterances from 1,000 Chinese celebrities, and covers 11 different genres in real world. For CN-Celeb2, it contains more than 520,000 utterances from 2,000 Chinese celebrities, and covers 11 different genres in real world. The data collection process was organized by the Center for Speech and Language Technologies, Tsinghua University. It was also funded by the National Natural Science Foundation of China No. 61633013, and the Postdoctoral Science Foundation of China No. 2018M640133. You can cite the data using the following BibTeX entry: Dong Wang, Yue Fan, Hao Cui, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai Address: ROOM 1-303, BLDG FIT, CSLT, Tsinghua University Homepage: http://cslt.org or http://cslt.riit.tsinghua.edu.cn

本数据集为大规模野外采集的说话人识别数据集，包含CN-Celeb1与CN-Celeb2两个子集。所有音频文件均采用单声道编码，采样率为16kHz，位深度为16比特。其中CN-Celeb1子集包含来自1000位中国名人的13万余条语音片段，涵盖真实场景下的11种不同类别；CN-Celeb2子集则包含来自2000位中国名人的52万余条语音片段，同样涵盖真实场景下的11种不同类别。本数据集由清华大学语音与语言技术中心（Center for Speech and Language Technologies, Tsinghua University）组织采集，同时获得国家自然科学基金（项目编号：61633013）与中国博士后科学基金（项目编号：2018M640133）资助。您可通过以下BibTeX条目引用本数据集：作者：王东、范悦、崔昊、康佳文、李蓝天、李凯诚、陈浩林、程思童、张鹏远、周子雅、蔡云祺；地址：清华大学CSLT FIT大楼1-303室；主页：http://cslt.org 或 http://cslt.riit.tsinghua.edu.cn

提供机构：

帕依提提

搜集汇总

数据集介绍

背景与挑战

背景概述

CN-Celeb是一个大规模的中文说话人识别数据集，包含两个子集CN-Celeb1和CN-Celeb2，分别涵盖1,000和2,000位中国名人的语音样本，覆盖11种不同场景。所有音频为16kHz单声道，适用于说话人识别研究。

以上内容由遇见数据集搜集并总结生成