A2IR: Audio-to-Image Representation

Mendeley Data2024-03-27 更新2024-06-26 收录

下载链接：

https://data.mendeley.com/datasets/wdng2cjhmy

下载链接

链接失效反馈

官方服务：

资源简介：

A2IR is a dataset for synthetic audio detection using deep learning. It includes five audio-to-image representations for natural and synthetic audio: spectrograms, histograms, scatter plots, bispectrum phase plots and bispectrum magnitude plots. Each category is divided into 3 subsets: training 56.72% (11,400 images), validation 33.83% (6,800 images), and test 9.47% (1,900 images). In each subset, the images are separated into two folders, natural and synthetic, with a balanced classification (i.e. each class has the same number of images as the other or very similar).

创建时间：

2024-01-23

5,000+

优质数据集

54 个

任务类型

进入经典数据集