perisolb/npsc_dataset_tmp
Source: Hugging Face. Updated: 2023-01-11. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/perisolb/npsc_dataset_tmp
Resource description:
---
dataset_info:
  features:
  - name: speaker_id
    dtype: string
  - name: gender
    dtype: string
  - name: utterance_id
    dtype: string
  - name: language
    dtype: string
  - name: raw_text
    dtype: string
  - name: full_audio_file
    dtype: string
  - name: original_data_split
    dtype: string
  - name: region
    dtype: string
  - name: duration
    dtype: float64
  - name: start
    dtype: float64
  - name: end
    dtype: float64
  - name: utterance_audio_file
    dtype: audio
  - name: standardized_text
    dtype: string
  splits:
  - name: train
    num_bytes: 9050653.0
    num_examples: 50
  - name: test
    num_bytes: 1225074.0
    num_examples: 10
  - name: validation
    num_bytes: 1225074.0
    num_examples: 10
  download_size: 11505743
  dataset_size: 11500801.0
---
# Dataset Card for "npsc_dataset_tmp"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
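
The card gives no usage instructions. As a minimal sketch, assuming the repository is still publicly available on the Hugging Face Hub and that the `datasets` library is installed, a split could be loaded as shown here. The helper name `load_split` is hypothetical and not part of the card.

```python
# Repository id taken from the page title.
REPO_ID = "perisolb/npsc_dataset_tmp"


def load_split(split: str = "train"):
    """Load one split ("train", "test" or "validation") from the Hub.

    Assumption: the repository is public and reachable; this is not
    confirmed by the card itself.
    """
    # Imported lazily so the module can be read without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)
```

If the repository is reachable, `load_split("train").features` should list the 13 columns declared in the metadata above, and `load_split("train").num_rows` should match the `num_examples` value for that split.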
Provider:
perisolb
Summary of original information
Dataset overview
Dataset name
npsc_dataset_tmp
Dataset features
- speaker_id: string
- gender: string
- utterance_id: string
- language: string
- raw_text: string
- full_audio_file: string
- original_data_split: string
- region: string
- duration: float (float64)
- start: float (float64)
- end: float (float64)
- utterance_audio_file: audio
- standardized_text: string
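
The feature list above can be treated as a schema. The following sketch checks a single record against it using only the standard library. The function `schema_errors`, the `PY_TYPES` mapping, and every value in `EXAMPLE_RECORD` are hypothetical illustrations, not data from the dataset.

```python
from typing import Any, Dict, List

# Column names and dtypes copied from the feature list above.
FEATURES = {
    "speaker_id": "string",
    "gender": "string",
    "utterance_id": "string",
    "language": "string",
    "raw_text": "string",
    "full_audio_file": "string",
    "original_data_split": "string",
    "region": "string",
    "duration": "float64",
    "start": "float64",
    "end": "float64",
    "utterance_audio_file": "audio",
    "standardized_text": "string",
}

# Assumed mapping from dtype names to Python types. "audio" is left
# unchecked because its decoded form depends on the loading library.
PY_TYPES = {"string": str, "float64": float}


def schema_errors(record: Dict[str, Any]) -> List[str]:
    """Return a list of human-readable schema violations (empty if none)."""
    errors = []
    for name, dtype in FEATURES.items():
        if name not in record:
            errors.append(f"missing column: {name}")
            continue
        expected = PY_TYPES.get(dtype)
        if expected is not None and not isinstance(record[name], expected):
            got = type(record[name]).__name__
            errors.append(f"{name}: expected {dtype}, got {got}")
    for name in record:
        if name not in FEATURES:
            errors.append(f"unexpected column: {name}")
    return errors


# Made-up record for illustration only; none of these values come from the data.
EXAMPLE_RECORD = {
    "speaker_id": "spk-0",
    "gender": "unknown",
    "utterance_id": "utt-0",
    "language": "xx",
    "raw_text": "example",
    "full_audio_file": "full.wav",
    "original_data_split": "train",
    "region": "unknown",
    "duration": 1.5,
    "start": 0.0,
    "end": 1.5,
    "utterance_audio_file": {"path": "utt-0.wav"},
    "standardized_text": "example",
}
```

The check is strict: an integer in a `float64` column is reported as an error, which may or may not be desirable depending on how the records were produced.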
Dataset splits
- train: 50 examples, 9050653 bytes
- test: 10 examples, 1225074 bytes
- validation: 10 examples, 1225074 bytes
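
To put the split figures in proportion, this short sketch computes each split's share of the examples and its average size per example. The numbers are copied from the list above; the function name `split_summary` is hypothetical.

```python
# (num_examples, num_bytes) per split, copied from the split list above.
SPLITS = {
    "train": (50, 9050653),
    "test": (10, 1225074),
    "validation": (10, 1225074),
}


def split_summary(splits):
    """Return {split: (share_of_examples, mean_bytes_per_example)}."""
    total_examples = sum(n for n, _ in splits.values())
    return {
        name: (n / total_examples, num_bytes / n)
        for name, (n, num_bytes) in splits.items()
    }


for name, (share, mean_bytes) in split_summary(SPLITS).items():
    print(f"{name}: {share:.1%} of examples, {mean_bytes:,.0f} bytes per example")
# train: 71.4% of examples, 181,013 bytes per example
# test: 14.3% of examples, 122,507 bytes per example
# validation: 14.3% of examples, 122,507 bytes per example
```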
Dataset size
- Download size: 11505743 bytes
- Dataset size: 11500801 bytes
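
As a quick consistency check on the figures above, the per-split byte counts should add up to the reported dataset size. The sketch below verifies that and computes the difference between the download size and the dataset size. The variable names are illustrative only.

```python
# Byte counts copied from the metadata above.
SPLIT_BYTES = {"train": 9050653, "test": 1225074, "validation": 1225074}
DATASET_SIZE = 11500801
DOWNLOAD_SIZE = 11505743

# The per-split sizes sum exactly to the reported dataset size.
total_split_bytes = sum(SPLIT_BYTES.values())
print(total_split_bytes == DATASET_SIZE)  # True

# Difference between what is downloaded and the prepared dataset size.
download_overhead = DOWNLOAD_SIZE - DATASET_SIZE
print(download_overhead)  # 4942
```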



