Supplementary Material for "Navigable Semantic Sound Maps for Auditory Displays"

Name: Supplementary Material for "Navigable Semantic Sound Maps for Auditory Displays"
Creator: Bielefeld University
Published: 2025-05-26 14:20:02
License: 暂无描述

DataCite Commons2025-05-26 更新2026-05-04 收录

下载链接：

https://pub.uni-bielefeld.de/record/3003177

下载链接

链接失效反馈

官方服务：

资源简介：

This paper presents a method to use t-SNE for training low-dimensional timbre maps obtained from musical recordings in a way that similar sound spectra are represented at similar location within the map. To use such maps for generating novel sounds and timbre trajectories, we use Kernel Regression Mapping for the inverse transformation from map space to timbre space and Griffin-Lim algorithm for phase reconstruction. With this mix of methods, we achieve a way to both visually explore timbral patterns and compress control data. As an application for navigable timbre maps we introduce a novel method - related to and inspired from Wave Space Sonification (WSS) - for the auditory exploration of patterns in multivariate time-series data by using the semantic sound map as the connecting representation. We demonstrate our approach by sonifying ECG data relating to cardiological pathologies. Code Repository The source code is published as open source in a git repository, available at the GitLab of Bielefeld University here. Sound Examples Saxophone Synthesis S1.1: Trajectory T1 - $\sigma_{krm}$ = 1.0, freq = 220 Hz, duration = 4 seconds /audio> S1.2: Trajectory T2 - $\sigma_{krm}=1.0$, freq = 220 Hz, duration = 2 seconds /audio> S1.3: Trajectory T3 - $\sigma_{krm}=1.0$, freq = 110 Hz, duration = 5 seconds /audio> S1.4: Trajectory T4 - $\sigma_{krm}=1.0$, freq = 220 Hz, duration = 4 seconds /audio> S1.5: Trajectory T5 - $\sigma_{krm}=1.0$, freq = 220 Hz, duration = 1 second /audio> S1.6: Trajectory T6 - $\sigma_{krm}=1.0$, freq = 220 Hz, duration = 3 seconds /audio> Cross Synthesis S2.1: Trajectory T1 - $\sigma_{krm}$ = 0.8, No resampling (freq = None), duration = 1 second /audio> S2.2: Trajectory T2 - $\sigma_{krm}$ = 0.8, freq = 220 Hz, duration = 2.5 seconds /audio> S2.3: Trajectory T3 - $\sigma_{krm}$ = 0.8, freq = 220 Hz, duration = 2.5 seconds /audio> S2.4: Trajectory T4 - $\sigma_{krm}$ = 0.8, freq = 220 Hz, duration = 2.5 seconds /audio> S2.5: Trajectory T5 - $\sigma_{krm}$ = 0.8, freq = 220 Hz, duration = 2.5 seconds /audio> S2.6: Trajectory T6 - $\sigma_{krm}$ = 0.8, freq = 220 Hz, duration = 2.5 seconds /audio> Speech Synthesis S3.1: Trajectory T1 - $\sigma_{krm}$ = 1.0, No resampling (freq = None), duration = 2.5 seconds /audio> S3.2: Trajectory T2 - $\sigma_{krm}$ = 1.0, No resampling (freq = None), duration = 1 second /audio> S3.3: Trajectory T3 - $\sigma_{krm}$ = 1.0, No resampling (freq = None), duration = 1 second /audio> S3.4: Trajectory T4 - $\sigma_{krm}$ = 1.0, No resampling (freq = None), duration = 1 second /audio> S3.5: Trajectory T5 - $\sigma_{krm}$ = 1.0, No resampling (freq = None), duration = 1 second /audio> S3.6: Trajectory T6 - $\sigma_{krm}$ = 1.0, No resampling (freq = None), duration = 1 second /audio> Signal Reconstruction S4.1: Original preprocessed (zero padding) sample /audio> S4.2: Phase Reconstruction using Griffin-Lim /audio> S4.3: Phase Reconstruction using Random Phases /audio> ECG Sonification V5.1: Anterior weak (fast playback) /video> S5.1: Anterior weak (fast playback) /audio> V5.2: Anterior medium (fast playback) /video> S5.2: Anterior medium (fast playback) /audio> V5.3: Anterior strong (fast playback) /video> S5.3: Anterior strong (fast playback) /audio> V5.4: Inferior weak (fast playback) /video> S5.4: Inferior weak (fast playback) /audio> V5.5: Inferior medium (fast playback) /video> S5.5: Inferior medium (fast playback) /audio> V5.6: Inferior strong (fast playback) /video> S5.6: Inferior strong (fast playback) /audio> V5.7: Anterior weak (slow playback) /video> S5.7: Anterior weak (slow playback) /audio> V5.8: Anterior medium (slow playback) /video> S5.8: Anterior medium (slow playback) /audio> V5.9: Anterior strong (slow playback) /video> S5.9: Anterior strong (slow playback) /audio> V5.10: Inferior weak (slow playback) /video> S5.10: Inferior weak (slow playback) /audio> V5.11: Inferior medium (slow playback) /video> S5.11: Inferior medium (slow playback) /audio> V5.12: Inferior strong (slow playback) /video> S5.12: Inferior strong (slow playback) /audio>

提供机构：

Bielefeld University

创建时间：

2025-05-19

5,000+

优质数据集

54 个

任务类型

进入经典数据集