Matthijs/cmu-arctic-xvectors
收藏Hugging Face2023-02-07 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Matthijs/cmu-arctic-xvectors
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: CMU ARCTIC X-Vectors
task_categories:
- text-to-speech
- audio-to-audio
license: mit
---
# Speaker embeddings extracted from CMU ARCTIC
There is one `.npy` file for each utterance in the dataset, 7931 files in total. The speaker embeddings are 512-element X-vectors.
The [CMU ARCTIC](http://www.festvox.org/cmu_arctic/) dataset divides the utterances among the following speakers:
- bdl (US male)
- slt (US female)
- jmk (Canadian male)
- awb (Scottish male)
- rms (US male)
- clb (US female)
- ksp (Indian male)
The X-vectors were extracted using [this script](https://huggingface.co/mechanicalsea/speecht5-vc/blob/main/manifest/utils/prep_cmu_arctic_spkemb.py), which uses the `speechbrain/spkrec-xvect-voxceleb` model.
Usage:
```python
from datasets import load_dataset
embeddings_dataset = load_dataset("Matthijs/cmu-arctic-xvectors", split="validation")
speaker_embeddings = embeddings_dataset[7306]["xvector"]
speaker_embeddings = torch.tensor(speaker_embeddings).unsqueeze(0)
```
提供机构:
Matthijs
原始信息汇总
数据集概述
数据集名称
- 名称: CMU ARCTIC X-Vectors
任务类别
- 类别:
- text-to-speech
- audio-to-audio
许可证
- 许可证: MIT
数据集内容
- 文件格式:
.npy - 文件数量: 7931
- 特征描述: 每个文件包含一个512元素的X-vector,代表一个语音片段的说话人嵌入。
说话人信息
- 说话人:
- bdl (US male)
- slt (US female)
- jmk (Canadian male)
- awb (Scottish male)
- rms (US male)
- clb (US female)
- ksp (Indian male)
数据提取方法
- 提取方法: 使用
speechbrain/spkrec-xvect-voxceleb模型,通过特定脚本提取X-vectors。



