amuvarma/eliasfiz-audio_pretrain_10m-facodec-1dups
收藏Hugging Face2024-12-09 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/amuvarma/eliasfiz-audio_pretrain_10m-facodec-1dups
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频转录文本和对应的音频编码特征。具体特征包括转录文本(transcript)、六个音频编码序列(facodec_0到facodec_5)以及说话人嵌入(spk_embs)。数据集分为一个训练集(train),包含3243931个样本,总大小为147579503014字节。
This dataset contains audio transcriptions and corresponding audio encoding features. Specific features include transcription text (transcript), six audio encoding sequences (facodec_0 to facodec_5), and speaker embeddings (spk_embs). The dataset is divided into a training set (train) containing 3,243,931 samples, with a total size of 147,579,503,014 bytes.
提供机构:
amuvarma



