Sam04/yt-aud30_final_with_audio
收藏Hugging Face2025-10-19 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Sam04/yt-aud30_final_with_audio
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含音频文件和相关信息的集合,音频采样率为16000Hz。每个音频文件都附带文件名、文件夹路径、转录文本、是否可转录标记、不可转录的原因、置信度、音频裁剪的起始和结束时间、裁剪原因、是否包含不完整单词、备注和原始路径等信息。数据集分为训练集,共有3168个样本。
This dataset is a collection of audio files and associated information, with an audio sampling rate of 16000Hz. Each audio file is accompanied by a file name, folder path, transcription text, transcribability mark, reason for non-transcribability, confidence, start and end times of audio trimming, trimming reasons, whether it contains incomplete words, notes, and original path, etc. The dataset is split into a training set with a total of 3168 samples.
提供机构:
Sam04



