reach-vb/mls-eng-tags-spacy-v2
收藏Hugging Face2024-05-01 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/reach-vb/mls-eng-tags-spacy-v2
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: original_path
dtype: string
- name: begin_time
dtype: float64
- name: end_time
dtype: float64
- name: text
dtype: string
- name: audio_duration
dtype: float64
- name: speaker_id
dtype: string
- name: book_id
dtype: string
- name: utterance_pitch_mean
dtype: float32
- name: utterance_pitch_std
dtype: float32
- name: snr
dtype: float64
- name: c50
dtype: float64
- name: speaking_rate
dtype: float64
- name: phonemes
dtype: string
- name: repunct_text
dtype: string
splits:
- name: test
num_bytes: 3255174
num_examples: 3769
- name: dev
num_bytes: 3273586
num_examples: 3807
- name: train
num_bytes: 9304286370
num_examples: 10808037
download_size: 5101613393
dataset_size: 9310815130
configs:
- config_name: default
data_files:
- split: test
path: data/test-*
- split: dev
path: data/dev-*
- split: train
path: data/train-*
---
提供机构:
reach-vb
原始信息汇总
数据集概述
数据集特征
- original_path (字符串类型)
- begin_time (浮点数类型,64位)
- end_time (浮点数类型,64位)
- text (字符串类型)
- audio_duration (浮点数类型,64位)
- speaker_id (字符串类型)
- book_id (字符串类型)
- utterance_pitch_mean (浮点数类型,32位)
- utterance_pitch_std (浮点数类型,32位)
- snr (浮点数类型,64位)
- c50 (浮点数类型,64位)
- speaking_rate (浮点数类型,64位)
- phonemes (字符串类型)
- repunct_text (字符串类型)
数据集分割
- test
- 数据量: 3255174字节
- 示例数量: 3769
- dev
- 数据量: 3273586字节
- 示例数量: 3807
- train
- 数据量: 9304286370字节
- 示例数量: 10808037
数据集大小
- 下载大小: 5101613393字节
- 数据集总大小: 9310815130字节
配置文件
- config_name: default
- test
- 路径模式: data/test-*
- dev
- 路径模式: data/dev-*
- train
- 路径模式: data/train-*
- test



