procit002/AddedSpeakerIdSpeakerNameNormalizedTextTofullyRefinedDatasetUpto21May
收藏Hugging Face2024-06-03 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/procit002/AddedSpeakerIdSpeakerNameNormalizedTextTofullyRefinedDatasetUpto21May
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: speaker_id
dtype: string
- name: Accent
dtype: string
- name: Language
dtype: string
- name: Text
dtype: string
- name: Gender
dtype: string
- name: audio
dtype: audio
- name: speaker_name
dtype: string
- name: normalized_text
dtype: string
splits:
- name: train
num_bytes: 603092393.0
num_examples: 2040
download_size: 557807205
dataset_size: 603092393.0
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
The dataset includes multiple features such as speaker_id, Accent, Language, Text, Gender, audio, speaker_name, and normalized_text. It is divided into a training set with 2040 samples, totaling 603092393 bytes. The dataset configuration is named default, and the data file path is data/train-*.
提供机构:
procit002
原始信息汇总
数据集概述
数据集特征
- speaker_id: 字符串类型
- Accent: 字符串类型
- Language: 字符串类型
- Text: 字符串类型
- Gender: 字符串类型
- audio: 音频类型
- speaker_name: 字符串类型
- normalized_text: 字符串类型
数据集分割
- 训练集:
- 数据量: 603,092,393字节
- 样本数: 2040
数据集大小与下载大小
- 数据集大小: 603,092,393字节
- 下载大小: 557,807,205字节



