sanchit-gandhi/edacc
Hugging Face. Updated 2024-02-15; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/sanchit-gandhi/edacc
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
dataset_info:
  features:
  - name: speaker
    dtype: string
  - name: text
    dtype: string
  - name: accent
    dtype: string
  - name: raw_accent
    dtype: string
  - name: gender
    dtype: string
  - name: language
    dtype: string
  - name: audio
    dtype: audio
  splits:
  - name: validation
    num_bytes: 2615426357.928
    num_examples: 9848
  - name: test
    num_bytes: 4926406074.438
    num_examples: 9289
  download_size: 6951142950
  dataset_size: 7541832432.365999
---
# Draft conversion of EdAcc
Final dataset will be moved to the edinburghcstr organisation.
The dataset has a single `default` configuration with two splits, validation and test, stored as sharded files under `data/validation-*` and `data/test-*`. Each example has seven features: `speaker`, `text`, `accent`, `raw_accent`, `gender`, and `language` (all strings), plus `audio`. The validation split contains 9848 examples and the test split contains 9289. The download size is 6951142950 bytes (about 6.95 GB), and the dataset size is 7541832432.365999 bytes (about 7.54 GB).
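As a rough illustration of how the two splits might be read, here is a minimal sketch using the Hugging Face `datasets` library. The helper name `load_edacc_split` and the constants are hypothetical; only the repository id, split names, and example counts come from the card above. Since this is a draft conversion that is expected to move to the edinburghcstr organisation, the repository id may stop resolving.

```python
# Hypothetical helper: only the repository id, the split names, and the
# example counts are taken from the dataset card.
REPO_ID = "sanchit-gandhi/edacc"
SPLIT_SIZES = {"validation": 9848, "test": 9289}


def load_edacc_split(split: str, streaming: bool = True):
    """Load one EdAcc split, streaming by default to avoid the full download."""
    if split not in SPLIT_SIZES:
        raise ValueError(
            f"unknown split {split!r}; expected one of {sorted(SPLIT_SIZES)}"
        )
    # Imported lazily so the split check above works without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split, streaming=streaming)
```

Calling `load_edacc_split("validation")` returns an iterable dataset when `streaming=True`, so examples can be inspected one at a time without fetching the whole download first. Decoding the `audio` column may need additional audio dependencies, depending on the installed `datasets` version.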
Provided by:
sanchit-gandhi
Summary of original information
Dataset overview
Configuration
- Default configuration:
  - Validation split:
    - Path: data/validation-*
  - Test split:
    - Path: data/test-*
Dataset information
- Features:
  - speaker: string
  - text: string
  - accent: string
  - raw_accent: string
  - gender: string
  - language: string
  - audio: audio
- Splits:
  - Validation:
    - Bytes: 2615426357.928
    - Examples: 9848
  - Test:
    - Bytes: 4926406074.438
    - Examples: 9289
- Download size: 6951142950 bytes
- Dataset size: 7541832432.365999 bytes
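
The sizes listed above can be cross-checked with a few lines of arithmetic. This sketch copies the numbers from the metadata; the variable names and the `to_gb` helper are illustrative and not part of the dataset. It suggests that the reported dataset size is the sum of the two split sizes, with the trailing `.365999` being floating-point rounding.

```python
import math

# Values copied from the dataset metadata (bytes).
split_bytes = {"validation": 2615426357.928, "test": 4926406074.438}
download_size = 6951142950
dataset_size = 7541832432.365999

total = sum(split_bytes.values())
# The reported dataset size should equal the sum of the split sizes,
# up to floating-point rounding.
sizes_match = math.isclose(total, dataset_size, rel_tol=0.0, abs_tol=1e-3)


def to_gb(n_bytes: float) -> float:
    """Convert bytes to decimal gigabytes (1 GB = 10**9 bytes)."""
    return n_bytes / 1e9


print(sizes_match)
print(f"download: {to_gb(download_size):.2f} GB, dataset: {to_gb(dataset_size):.2f} GB")
```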



