tuandung2812/central_asr
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/tuandung2812/central_asr
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含音频及其相关文本信息的数据集,音频采样率为16000Hz。每个样本包含原始ID、音频数据、真实文本标签以及由多个不同模型(包括original_zipformer、finetuned_zipformer、elevenlabs、gpt和chunkformer)生成的文本。数据集分为训练集(20502个样本)和测试集(1080个样本)。
This dataset contains audio and related text information, with audio sampling rate of 16000Hz. Each sample includes original ID, audio data, ground truth text label, and texts generated by multiple different models (including original_zipformer, finetuned_zipformer, elevenlabs, gpt, and chunkformer). The dataset is divided into training set (20,502 samples) and test set (1,080 samples).
提供机构:
tuandung2812



