VoxSim
收藏VoxSim Trainer 数据集概述
数据集描述
- 名称: VoxSim
- 类型: 语音相似度数据集
- 来源: 论文《VoxSim: A perceptual voice similarity dataset》
依赖项
-
安装依赖项:
pip install -r requirements.txt
数据准备
- 数据集准备:
- 参考 voxceleb_trainer 项目准备 VoxCeleb 数据集。
训练示例
-
ECAPA-TDNN with voxsim raw scores:
python ./trainSpeakerNet.py --config ./configs/ECAPA_TDNN.yaml --train_list data/voxsim_train_list_raw.txt
-
WavLM-ECAPA with voxsim mean scores:
python ./trainSpeakerNet.py --config ./configs/WavLM_ECAPA.yaml --train_list data/voxsim_train_list_mean.txt
-
WavLM-ECAPA pre-trained on VoxCeleb with voxsim mean scores:
python ./trainSpeakerNet.py --config ./configs/WavLM_ECAPA_sv.yaml --train_list data/voxsim_train_list_mean.txt
预训练模型
-
预训练模型下载:
- 预训练模型可从 这里 下载。
- 验证脚本返回结果:
Pearson 0.83695 ...
python ./trainSpeakerNet.py --eval --model wavlm_large --save_path test/wavlm_ecapa --test_list data/voxsim_test_list.txt --eval_frames 400 --initial_model wavlm_ecapa.model
引用
-
引用信息:
@inproceedings{ahn2024voxsim, title={VoxSim: A perceptual voice similarity dataset}, author={Ahn, Junseok and Kim, Youkyum and Choi, Yeunju and Kwak, Doyeop and Kim, Ji-Hoon and Mun, Seongkyu and Chung, Joon Son}, booktitle={Proc. Interspeech}, year={2024} }




