five

JasiekKaczmarczyk/maestro-v1-sustain-masked

收藏
Hugging Face2023-11-30 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/JasiekKaczmarczyk/maestro-v1-sustain-masked
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: midi_filename dtype: string - name: source dtype: string - name: pitch sequence: int16 length: 128 - name: start sequence: float32 length: 128 - name: dstart sequence: float32 length: 128 - name: duration sequence: float32 length: 128 - name: velocity sequence: int16 length: 128 - name: masking_spaces struct: - name: <Random Mask> sequence: bool length: 128 - name: <LH Mask> sequence: bool length: 128 - name: <RH Mask> sequence: bool length: 128 - name: <Harmonic Root Mask> sequence: bool length: 128 - name: <Harmonic Outliers Mask> sequence: bool length: 128 splits: - name: train num_bytes: 108676395 num_examples: 43738 - name: validation num_bytes: 12260534 num_examples: 4931 - name: test num_bytes: 14165318 num_examples: 5695 download_size: 59189080 dataset_size: 135102247 --- # Dataset Card for "maestro-v1-sustain-masked" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

数据集信息: 特征: - 名称:MIDI文件名(midi_filename),数据类型:字符串 - 名称:来源(source),数据类型:字符串 - 名称:音高(pitch),类型:int16类型序列,长度为128 - 名称:起始时间(start),类型:float32类型序列,长度为128 - 名称:起始增量(dstart),类型:float32类型序列,长度为128 - 名称:持续时长(duration),类型:float32类型序列,长度为128 - 名称:音符力度(velocity),类型:int16类型序列,长度为128 - 名称:掩码区域(masking_spaces),结构体包含: - 随机掩码(<Random Mask>):bool类型序列,长度为128 - 左手声部掩码(<LH Mask>):bool类型序列,长度为128 - 右手声部掩码(<RH Mask>):bool类型序列,长度为128 - 和声根音掩码(<Harmonic Root Mask>):bool类型序列,长度为128 - 和声异常值掩码(<Harmonic Outliers Mask>):bool类型序列,长度为128 数据集划分: - 训练集(train):字节数108676395,样本数43738 - 验证集(validation):字节数12260534,样本数4931 - 测试集(test):字节数14165318,样本数5695 下载大小:59189080 数据集总存储大小:135102247 # 数据集卡片:"maestro-v1-sustain-masked" [需补充更多信息](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
JasiekKaczmarczyk
原始信息汇总

数据集概述

数据集信息

  • 名称: maestro-v1-sustain-masked

特征

  • midi_filename: 字符串类型
  • source: 字符串类型
  • pitch: 序列类型,int16,长度为128
  • start: 序列类型,float32,长度为128
  • dstart: 序列类型,float32,长度为128
  • duration: 序列类型,float32,长度为128
  • velocity: 序列类型,int16,长度为128
  • masking_spaces: 结构类型,包含以下子特征:
    • <Random Mask>: 序列类型,bool,长度为128
    • <LH Mask>: 序列类型,bool,长度为128
    • <RH Mask>: 序列类型,bool,长度为128
    • <Harmonic Root Mask>: 序列类型,bool,长度为128
    • <Harmonic Outliers Mask>: 序列类型,bool,长度为128

数据分割

  • train: 字节数为108676395,样本数为43738
  • validation: 字节数为12260534,样本数为4931
  • test: 字节数为14165318,样本数为5695

数据大小

  • 下载大小: 59189080字节
  • 数据集大小: 135102247字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作