maxseats/aihub-464-preprocessed-680GB-set-38
收藏Hugging Face2024-07-04 更新2024-07-06 收录
下载链接:
https://hf-mirror.com/datasets/maxseats/aihub-464-preprocessed-680GB-set-38
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频数据和对应的标签,音频数据的采样率为16000Hz。数据集分为训练集、测试集和验证集,分别包含46892、5862和5862个样本。数据集的下载大小为22289353508字节,总大小为66978776061.0字节。
The dataset consists of three main parts: audio, labels, and input features. The audio part has a sampling rate of 16000, the labels part is of string type, and the input features part is a sequence of floats. The dataset is divided into train, test, and valid splits, each with corresponding byte counts and example counts. The total download size and dataset size are also provided.
提供机构:
maxseats
原始信息汇总
数据集概述
数据特征
- audio: 音频数据,采样率为16000。
- labels: 标签数据,数据类型为字符串。
- input_features: 输入特征,数据类型为浮点数序列。
数据集划分
- train: 训练集,包含46892个样本,大小为53582106712.37225字节。
- test: 测试集,包含5862个样本,大小为6698334674.313873字节。
- valid: 验证集,包含5862个样本,大小为6698334674.313873字节。
数据集大小
- 下载大小: 22289353508字节
- 总数据集大小: 66978776061.0字节
配置
- config_name: default
- data_files:
- train: 路径为
data/train-* - test: 路径为
data/test-* - valid: 路径为
data/valid-*
- train: 路径为
- data_files:



