Re-recorded Dataset for Speech Tampering Detection and Classification

Name: Re-recorded Dataset for Speech Tampering Detection and Classification
Creator: Science Data Bank
Published: 2026-04-28 02:07:29
License: 暂无描述

DataCite Commons2026-04-28 更新2026-05-05 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=ddb6907ba0794858906c22fa846a2206

下载链接

链接失效反馈

官方服务：

资源简介：

We construct a replay dataset for multi-type tampering detection based on the Librispeech and Common Voice English speech datasets, covering three common audio tampering operations: deletion, same-source splicing, and cross-source splicing. Each sample in the dataset contains the corresponding Mel spectrogram and a ground-truth mask that marks the tampered regions. Because the time axis of the spectrogram maps directly to audio duration, the mask can indicate the temporal positions of tampering.Collecting large-scale replay data in real environments involves practical difficulties such as venue arrangement, device coordination, and high labor costs. To address this, we use acoustic simulation based on the Image Source Method (ISM) to mimic the indoor replay process. This method computes the Room Impulse Response (RIR) to reproduce the propagation paths, reflection delays, and energy attenuation of sound waves in enclosed spaces, allowing us to generate samples with realistic replay characteristics under controllable conditions. Observing the spectrograms, the simulated replay produces blurred boundaries and horizontal artifacts highly similar to those caused by real replay, indicating that the simulation effectively restores the interference introduced by actual replay, and is even more severe in certain aspects.The dataset contains 14,000 replayed audio samples in total, with 3,500 samples each for deletion tampering, same-source splicing, cross-source splicing, and untampered audio.

提供机构：

Science Data Bank

创建时间：

2026-04-28

5,000+

优质数据集

54 个

任务类型

进入经典数据集