lmms-lab/WenetSpeech|语音识别数据集
收藏hugging_face2025-09-23 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/lmms-lab/WenetSpeech
下载链接
链接失效反馈资源简介:
该数据集包含了utterance的ID、音频文件、对应的文本、音频的开始和结束时间、音频ID以及音频文件的路径。数据集被分为三个部分:开发集、会议测试集和网络测试集,分别用于不同的测试目的。每个部分的大小和样本数量都有详细说明。
The dataset includes utterance ID, audio file, corresponding text, start and end time of the audio, audio ID, and the path to the audio file. The dataset is divided into three parts: development set, meeting test set, and network test set, each for different testing purposes. The size and number of samples for each part are specified in detail.
提供机构:
lmms-lab



