MinhLe999/CombinedSpeak
Source: Hugging Face. Updated: 2025-10-23. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/MinhLe999/CombinedSpeak
Description:
The dataset has four features: the audio waveform (waveform), canonical identifiers (canonical_ids), transcript identifiers (transcript_ids), and error information (error). It is divided into three splits: a training set (train) with 29944 examples, a test set (test) with 599 examples, and a validation set (val) with 459 examples, for 31002 examples in total. The download size is 13562010910 bytes (about 13.56 GB) and the full dataset size is 20453598440 bytes (about 20.45 GB).
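As a rough illustration of the figures above, the following Python sketch recomputes the split proportions and the sizes in gigabytes from the stated numbers. It also includes a hypothetical helper, load_combinedspeak, which assumes the dataset can be loaded with the Hugging Face `datasets` library under the repository id MinhLe999/CombinedSpeak and the split names train, test, and val. The listing does not confirm that loading path, so treat it as an assumption rather than documented usage.

```python
# Figures copied from the dataset description.
SPLIT_SIZES = {"train": 29944, "test": 599, "val": 459}
DOWNLOAD_BYTES = 13562010910
FULL_BYTES = 20453598440


def split_fractions(sizes):
    """Return each split's share of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def to_gb(num_bytes):
    """Convert bytes to decimal gigabytes (1 GB = 10**9 bytes)."""
    return num_bytes / 1e9


def load_combinedspeak(split="train", streaming=True):
    """Hypothetical helper: load one split via the `datasets` library.

    Assumes the repo id and split names match the listing. Streaming
    avoids downloading the full archive before reading any example.
    """
    from datasets import load_dataset  # imported lazily; third-party package

    return load_dataset("MinhLe999/CombinedSpeak", split=split, streaming=streaming)


if __name__ == "__main__":
    for name, frac in split_fractions(SPLIT_SIZES).items():
        print(f"{name}: {SPLIT_SIZES[name]} examples ({frac:.2%})")
    print(f"download: {to_gb(DOWNLOAD_BYTES):.2f} GB, full: {to_gb(FULL_BYTES):.2f} GB")
```

Streaming is used as the default in the helper because the download is over 13 GB; passing streaming=False would fetch and cache the whole split instead.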
Provider:
MinhLe999



