Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR)

Name: Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR)
Creator: IEEE DataPort
Published: 2025-01-28 22:50:31
License: 暂无描述

DataCite Commons2025-01-28 更新2025-04-16 收录

下载链接：

https://ieee-dataport.org/documents/extended-length-audio-dataset-synthetic-voice-detection-and-speaker-recognition-elad-svdsr

下载链接

链接失效反馈

官方服务：

资源简介：

Introduced here is the Extended-Length Audio Dataset for Synthetic Voice Detection and Speaker Recognition (ELAD-SVDSR), a resource designed to advance research in synthetic voice (DeepFake) detection and automatic speaker recognition (ASR). It features around 45-minute audio recordings from 36 participants, each of whom read aloud different newspaper articles during controlled sessions, captured with five different high-quality microphones. Synthetic voices generated from 20 subjects of this dataset using open-source and commercial software are also included. Supporting text-dependent  analysis, the dataset may enable diverse ASR modeling. This extended-duration audio may allow for the detection of nuanced artifacts and the generation of higher-quality synthetic samples, including those like Tortoise TTS and ElevenLabs, which already excel in shorter segments. Comprehensive metadata on speaker demographics and recording conditions are expected to provide deeper insights into voice characteristics and model efficacy.  Publicly accessible, while all personal data has been anonymized to ensure privacy, ELAD-SVDSR is expected to drive significant advancements in biometric security, audio forensics, and voice authentication systems.

提供机构：

IEEE DataPort

创建时间：

2025-01-28

5,000+

优质数据集

54 个

任务类型

进入经典数据集