Sam04/BlueLionEcho_tran_embedded
收藏Hugging Face2025-11-08 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/Sam04/BlueLionEcho_tran_embedded
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含音频文件和相关信息的语音数据集,音频采样率为16000Hz。数据集包含文件名、文件夹路径、转录文本、置信度、裁剪时间、裁剪原因、是否含有不完整单词、备注、原始路径、批次索引和全局索引等字段。数据集被划分为训练集,共有24369个示例,总大小为7079.71GB。
This is a speech dataset containing audio files and related information, with an audio sampling rate of 16000Hz. The dataset includes fields such as file name, folder path, transcription, confidence, trimming time, trimming reason, presence of incomplete words, notes, original path, batch index, and global index. The dataset is split into a training set with a total of 24,369 examples and a total size of 7079.71GB.
提供机构:
Sam04



