Audio Piano Triad Dataset
Source: Mendeley Data. Updated: 2024-03-27. Indexed: 2024-06-28.
Download link:
https://zenodo.org/record/5217057
Description:
Created by: Agustín Macaya Valladares
Date: May 5th, 2021

- The dataset contains 43,200 examples of piano triads in .wav format.
- Second version: the audio files are the same as in the first version, but the octave numbers in the file names were corrected from (2, 3, 4) to (3, 4, 5), respectively.

Details:
- Sample rate: 16000 Hz.
- Data type: 16-bit PCM (int16).
- File size: each example is 128 kB (5.53 GB for the complete dataset).
- Duration: 4 seconds.
- Sound: piano (digital).
- Chords were played by a human on a velocity-sensitive piano keyboard.
- Each chord is held for 3 seconds and released for 1 second.
- 3 octaves (3, 4, 5).
- 12 base notes per octave: Cn, Df, Dn, Ef, En, Fn, Gf, Gn, Af, An, Bf, Bn (n is natural, f is flat).
- 4 triad types per note: major (j), minor (n), diminished (d), augmented (a). No inversions.
- 3 volumes per triad: forte (f), mezzo-forte (m), piano (p).
- 10 original examples per combination of octave, base note, triad type, and volume (10 * 3 * 12 * 4 * 3 = 4,320 examples).
- x10 data augmentation for each example (4,320 * 10 = 43,200 total examples).
- Data augmentation was done through random temporal and amplitude shifts.
- Metadata is encoded in the file name of each chord. For example, "piano_4_Af_d_m_45.wav" is a piano chord: (4) 4th octave, (Af) A flat base note, (d) diminished, (m) mezzo-forte, 45th example.

Note:
- The audio is stored as 16-bit PCM (int16) to reduce file size, so sample values are integers in the range -32768 to 32767. To normalize the audio to the range -1 to 1 (float), divide by 32768.
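The file-naming scheme and the int16 normalization described above can be illustrated with a short Python sketch. It is not part of the dataset or the author's tooling: the names `ChordLabel`, `parse_chord_filename`, `normalize_int16`, and `read_wav_normalized` are hypothetical, and the regular expression assumes every file follows the pattern of the one documented example ("piano_4_Af_d_m_45.wav"). Only the Python standard library is used.

```python
import re
import struct
import wave
from typing import List, NamedTuple, Sequence

# Codes taken from the dataset description.
BASE_NOTES = ("Cn", "Df", "Dn", "Ef", "En", "Fn",
              "Gf", "Gn", "Af", "An", "Bf", "Bn")
TRIAD_TYPES = {"j": "major", "n": "minor", "d": "diminished", "a": "augmented"}
VOLUMES = {"f": "forte", "m": "mezzo-forte", "p": "piano"}

# Assumed pattern: piano_<octave>_<base note>_<triad>_<volume>_<example>.wav
_NAME_RE = re.compile(
    r"^piano_([345])_([A-G][nf])_([jnda])_([fmp])_(\d+)\.wav$"
)


class ChordLabel(NamedTuple):
    """Hypothetical container for the metadata encoded in a file name."""
    octave: int
    base_note: str
    triad_type: str
    volume: str
    example: int


def parse_chord_filename(name: str) -> ChordLabel:
    """Decode a file name such as 'piano_4_Af_d_m_45.wav'."""
    match = _NAME_RE.match(name)
    if match is None or match.group(2) not in BASE_NOTES:
        raise ValueError(f"unexpected file name: {name!r}")
    octave, note, triad, volume, example = match.groups()
    return ChordLabel(int(octave), note, TRIAD_TYPES[triad],
                      VOLUMES[volume], int(example))


def normalize_int16(samples: Sequence[int]) -> List[float]:
    """Scale int16 samples to floats by dividing by 32768."""
    return [s / 32768.0 for s in samples]


def read_wav_normalized(path: str) -> List[float]:
    """Read a 16-bit PCM .wav file and return normalized samples.

    If a file has more than one channel, the samples are returned
    interleaved, exactly as stored.
    """
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit PCM audio")
        frames = wav.readframes(wav.getnframes())
    count = len(frames) // 2
    # WAV sample data is little-endian; 'h' is a signed 16-bit integer.
    samples = struct.unpack("<%dh" % count, frames)
    return normalize_int16(samples)


if __name__ == "__main__":
    print(parse_chord_filename("piano_4_Af_d_m_45.wav"))
    # 10 originals * 3 octaves * 12 notes * 4 triads * 3 volumes * 10 augmentations
    print(10 * 3 * 12 * 4 * 3 * 10)
```

Because the largest positive int16 value is 32767, dividing by 32768 yields floats in the half-open range [-1, 1). The stated file size is also consistent with single-channel audio: 4 s * 16000 samples/s * 2 bytes = 128,000 bytes.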
Created:
2023-06-28



