five

Re-recorded Dataset for Speech Tampering Detection and Classification

收藏
DataCite Commons2026-04-28 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=ddb6907ba0794858906c22fa846a2206
下载链接
链接失效反馈
官方服务:
资源简介:
We construct a replay dataset for multi-type tampering detection based on the Librispeech and Common Voice English speech datasets, covering three common audio tampering operations: deletion, same-source splicing, and cross-source splicing. Each sample in the dataset contains the corresponding Mel spectrogram and a ground-truth mask that marks the tampered regions. Because the time axis of the spectrogram maps directly to audio duration, the mask can indicate the temporal positions of tampering.Collecting large-scale replay data in real environments involves practical difficulties such as venue arrangement, device coordination, and high labor costs. To address this, we use acoustic simulation based on the Image Source Method (ISM) to mimic the indoor replay process. This method computes the Room Impulse Response (RIR) to reproduce the propagation paths, reflection delays, and energy attenuation of sound waves in enclosed spaces, allowing us to generate samples with realistic replay characteristics under controllable conditions. Observing the spectrograms, the simulated replay produces blurred boundaries and horizontal artifacts highly similar to those caused by real replay, indicating that the simulation effectively restores the interference introduced by actual replay, and is even more severe in certain aspects.The dataset contains 14,000 replayed audio samples in total, with 3,500 samples each for deletion tampering, same-source splicing, cross-source splicing, and untampered audio.
提供机构:
Science Data Bank
创建时间:
2026-04-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作