DSD-Corpus
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13788454
下载链接
链接失效反馈官方服务:
资源简介:
Diverse Synthesizer for Deepfake Voice Detection - DSD-Corpus
Data Structure
The dataset is composed of different public and self-collected datasets, including LibriSpeech, AIHUB, VCTK. Additionally, fake samples are all generated by the authors, utilizing open-source synthesizers and Elevenlabs (commercial site).
We provide the metadata file at DSD_corpus_v1.csv, and the audio files in wavs.zip. Please note that you might use 7Zip to merge all the zip files. After merging, please find the wavs.zip inside a newly created folder (default name wavs_2).
If you couldn't find the DSD_corpus_v1.csv after unzip, please use this link: https://drive.google.com/file/d/1l2iywJJBYh1RI5mW_KQ5xkB_9jLsC_IC/view?usp=sharing
License
Our dataset is published under CC BY-NC 4.0, which requires you to give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. You may not use the material for commercial purposes.
The real samples belonging to LibriSpeech are licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0).
The real samples belonging to the VTCK are licensed under the Open Data Commons Attribution License (ODC-By) v1.0.
The AIHUB is provided by the Korea National Information Agency have freely to used for both non-commercial and commercial purposes, but required to give appropriate credit.
Contact
Please contact aisrc1@ssu.ac.kr or phucdt@soongsil.ac.kr for any inquiries.
Cite our work
@inproceedings{doan2024_ccs,
title={Trident of Poseidon: A Generalized Approach for Detecting Deepfake Voices},
author={Doan, Thien-Phuc and Dinh-Xuan, Hung and Ryu, Taewon and Kim, Inho and Lee, Woongjae and Hong, Kihun and Jung, Souhwan},
booktitle={Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security (CCS '24)},
pages={},
year={2024},
doi = {10.1145/3658644.3690311}
}
Public Dataset:
LibriSpeech
@INPROCEEDINGS{7178964,
author={Panayotov, Vassil and Chen, Guoguo and Povey, Daniel and Khudanpur, Sanjeev},
booktitle={2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
title={Librispeech: An ASR corpus based on public domain audio books},
year={2015},
volume={},
number={},
pages={5206-5210},
keywords={Resource description framework;Genomics;Bioinformatics;Blogs;Information services;Electronic publishing;Speech Recognition;Corpus;LibriVox},
doi={10.1109/ICASSP.2015.7178964}}
VCTK
@misc{anudc:4896
author = {Yamagishi, Junichi and Veaux, Christophe and MacDonald, Kirsten},
title = {{CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit},
doi = {10.7488/ds/2645},
}
AIHUB
@misc{aihub
author = {AIHUB, Korea National Information Society Agency},
title = {{AIHUB speech corpus},
url= {aihub.or.kr},
}
创建时间:
2024-12-12



