five

Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR)

收藏
DataCite Commons2025-01-28 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/extended-length-audio-dataset-synthetic-voice-detection-and-speaker-recognition-elad-svdsr
下载链接
链接失效反馈
官方服务:
资源简介:
Introduced here is the Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR), a resource designed to advance research in synthetic voice (DeepFake) detection and automatic speaker recognition (ASR). It features around 45-minute audio recordings from 36 participants, each of whom read aloud different newspaper articles during controlled sessions, captured with five different high-quality microphones. Synthetic voices generated from 20 subjects of this dataset using open-source and commercial software are also included. Supporting text-dependent  analysis, the dataset may enable diverse ASR modeling. This extended-duration audio may allow for the detection of nuanced artifacts and the generation of higher-quality synthetic samples, including those like Tortoise TTS and ElevenLabs, which already excel in shorter segments. Comprehensive metadata on speaker demographics and recording conditions are expected to provide deeper insights into voice characteristics and model efficacy.  Publicly accessible, while all personal data has been anonymized to ensure privacy, ELAD-SVDSR is expected to drive significant advancements in biometric security, audio forensics, and voice authentication systems.
提供机构:
IEEE DataPort
创建时间:
2025-01-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作