five

A2IR: Audio-to-Image Representation

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://data.mendeley.com/datasets/wdng2cjhmy
下载链接
链接失效反馈
官方服务:
资源简介:
A2IR is a dataset for synthetic audio detection using deep learning. It includes five audio-to-image representations for natural and synthetic audio: spectrograms, histograms, scatter plots, bispectrum phase plots and bispectrum magnitude plots. Each category is divided into 3 subsets: training 56.72% (11,400 images), validation 33.83% (6,800 images), and test 9.47% (1,900 images). In each subset, the images are separated into two folders, natural and synthetic, with a balanced classification (i.e. each class has the same number of images as the other or very similar).
创建时间:
2021-09-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作