Fake Audio Dataset (ElevenLabs & Respeecher)

Source: NIAID Data Ecosystem (indexed 2026-05-10)
Download link:
https://data.mendeley.com/datasets/79g59sp69z
Description:
This dataset was created during the first semester of 2025. It consists of 600 synthetic audio clips generated with the ElevenLabs and Respeecher tools, covering both TTS (text-to-speech) and V2V (voice-to-voice) generation. ElevenLabs accounts for 335 clips (282 V2V, 53 TTS) and Respeecher for 265 clips (210 V2V, 55 TTS), giving 492 V2V clips and 108 TTS clips in total. By gender, 49% of the clips are male voices and 51% are female voices. Clips last 8 to 10 seconds and are sampled at 22,050 Hz. Most of the voices are adult (538 out of 600). The dataset can be used to: (1) train synthetic audio classification models, (2) externally validate such models, and (3) apply attacks to the audio and verify the robustness of such models.
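The breakdown above can be cross-checked with a few lines of Python. This is only a sketch that re-adds the figures quoted in the description. The variable and function names (`counts`, `totals_by`, `adult_share`) are illustrative and do not come from the dataset itself, and nothing here reads the actual audio files.

```python
# Clip counts per (tool, generation method), copied from the description.
counts = {
    ("ElevenLabs", "V2V"): 282,
    ("ElevenLabs", "TTS"): 53,
    ("Respeecher", "V2V"): 210,
    ("Respeecher", "TTS"): 55,
}


def totals_by(index):
    """Sum clip counts grouped by one element of the (tool, method) key."""
    out = {}
    for key, n in counts.items():
        out[key[index]] = out.get(key[index], 0) + n
    return out


by_tool = totals_by(0)      # grouped by generation tool
by_method = totals_by(1)    # grouped by V2V / TTS
total = sum(counts.values())

# Share of adult voices, using the 538-of-600 figure from the description.
adult_share = 538 / total

print(by_tool, by_method, total, f"{adult_share:.1%}")
```

The per-tool and per-method sums agree with the stated totals (335 and 265, 492 and 108, 600 overall), and the adult share works out to roughly 89.7%.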
Created:
2025-09-26