espnet/kising-v2-segments
收藏Hugging Face2024-06-11 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/espnet/kising-v2-segments
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- zh
- en
license: cc-by-nc-4.0
multilinguality:
- multilingual
size_categories:
- 10K<n<100K
source_datasets:
- original
task_categories:
- text-to-audio
- audio-to-audio
- automatic-speech-recognition
pretty_name: KiSing-v2
dataset_info:
features:
- name: audio
dtype: audio
- name: segment_id
dtype: string
- name: transcription
dtype: string
- name: singer
dtype: string
- name: label
dtype: string
- name: tempo
dtype: int64
- name: note_midi
sequence: float64
- name: note_phns
sequence: string
- name: note_lyrics
sequence: string
- name: note_start_times
sequence: float64
- name: note_end_times
sequence: float64
- name: phn
sequence: string
- name: phn_start_time
sequence: float64
- name: phn_end_time
sequence: float64
splits:
- name: train
num_bytes: 8843208465.296
num_examples: 19432
- name: validation
num_bytes: 51661360.0
num_examples: 50
- name: test
num_bytes: 1587559262.743
num_examples: 3543
download_size: 10401491812
dataset_size: 10482429088.039
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
---
# Citation Information
```bibtex
@misc{shi2024singing,
title={Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2},
author={Jiatong Shi and Yueqian Lin and Xinyi Bai and Keyi Zhang and Yuning Wu and Yuxun Tang and Yifeng Yu and Qin Jin and Shinji Watanabe},
year={2024},
eprint={2401.17619},
archivePrefix={arXiv},
primaryClass={cs.SD}
}
```
提供机构:
espnet
原始信息汇总
数据集概述
基本信息
- 名称: KiSing-v2
- 语言: 中文、英文
- 许可证: cc-by-nc-4.0
- 多语言性: 多语言
- 大小: 10K<n<100K
- 来源: 原始数据
- 任务类别:
- 文本到音频
- 音频到音频
- 自动语音识别
数据集特征
- 音频: 音频数据
- segment_id: 字符串
- transcription: 字符串
- singer: 字符串
- label: 字符串
- tempo: 整数
- note_midi: 浮点数序列
- note_phns: 字符串序列
- note_lyrics: 字符串序列
- note_start_times: 浮点数序列
- note_end_times: 浮点数序列
- phn: 字符串序列
- phn_start_time: 浮点数序列
- phn_end_time: 浮点数序列
数据集分割
- 训练集:
- 示例数量: 19432
- 字节数: 8843208465.296
- 验证集:
- 示例数量: 50
- 字节数: 51661360.0
- 测试集:
- 示例数量: 3543
- 字节数: 1587559262.743
数据集大小
- 下载大小: 10401491812
- 数据集大小: 10482429088.039
配置信息
- 默认配置:
- 训练集路径: data/train-*
- 验证集路径: data/validation-*
- 测试集路径: data/test-*



