five

Audio Piano Triad Dataset

收藏
Mendeley Data2024-03-27 更新2024-06-28 收录
下载链接:
https://zenodo.org/record/5217057
下载链接
链接失效反馈
官方服务:
资源简介:
Created by: Agustín Macaya Valladares Date: May 5th, 2021 - Dataset contains 43.200 examples of piano triads in .wav format. - Second Version: The audios are the same as the first version, but the octave number in the names were corrected from (2, 3, 4) to (3, 4, 5), respectively. Details: - Sample rate: 16000 Hz. - Data type: 16-bit PCM (int16). - File size: Each example has a file size of 128 kB (5.53 GB for complete dataset). - Duration: 4 seconds. - Sound: Piano (digital). - Chords were played by a human on a velocity-sensitive piano keyboard. - 3 seconds pressed, 1 second released. - 3 octaves (3, 4, 5). - 12 base notes per octave: Cn, Df, Dn, Ef, En, Fn, Gf, Gn, Af, An, Bf, Bn. (n is natural, f is flat). - 4 triad types per note: major (j), minor (n), diminished (d), augmented (a). No inversions. - 3 volumes per triad: forte (f), metsoforte (m), piano (p). - 10 original examples per combination of octave, base note, triad type, and volume. (10*3*12*4*3 = 4.320 examples). - x10 data augmentation for each example (4.320 * 10 = 43.200 total examples). - Data augmentation through random temporal and amplitude shifts. - Metadata is in the name of the chord. For example: "piano_4_Af_d_m_45.wav" is a piano chord, (4) 4th octave, (Af) A flat base note, (d) diminished, (m) metsoforte, 45th example. Note: - The audios are in 16-bit PCM (int16) data type to reduce the file size. This means that the dynamic range of values in the array is -32768 to 32768, integers. To normalize the audios in the range -1 to 1 (float) just divide by 32768. Second Version: The audios are the same as the first version, but the octave number in the names were corrected from (2, 3, 4) to (3, 4, 5), respectively.
创建时间:
2023-06-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作