five

Filming the sound: Anomaly Detection on Audio Tape Recordings using Computer Vision Algorithms

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14028922
下载链接
链接失效反馈
官方服务:
资源简介:
This repository makes available the dataset related to the paper: Zafer Çınar, Alessandro Russo, Matteo Spanio, Niccolò Pretto, and Sergio Canazza, Filming the Sound: Anomaly Detection on Audio Tape Recordings using Computer Vision Algorithms, IAI4CH, Bozen, 2024. The dataset and the experiment are described in the publication above. This repository contains two main directories (bold indicates directory names): video samples: the actual videos used in the paper's experiment. This folder contains four subdirectories - 3.75 ips, 7.5 ips, 15 ips, and 30 ips - each representing a different playback speed (in inches per second). Within each subdirectory are several MP4 files, recorded on an A810 Studer open reel recorder, documenting the playback of magnetic audio tapes. The files follow the naming convention “Xips (Y).mp4,” where X represents the tape playback speed and Y is a serial number identifier for each video. irregularities: the metadata for each video with timestamp and type of irregularity. The folder includes four CSV files - 3.75.csv, 7.5.csv, 15.csv, and 30.csv - corresponding to the playback speeds of the video samples. Each CSV file provides handmade annotations for its respective videos, with three columns: video_id: name of the video file in the format “Xips (Y).mp4,” where X is the tape speed and Y is the ID number. time_label: timestamp indicating the irregularity, formatted as HH:MM:SS.mls. irregularity_type: category of the detected anomaly, which may be one of the following: “splice,” “shadow,” “end-of-tape,” or “annotation.”
创建时间:
2024-11-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作