AdoCleanCode/timit_mfa_aligned
Source: Hugging Face. Updated 2025-12-14; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/AdoCleanCode/timit_mfa_aligned
Description:
This is a speech dataset containing audio files with their transcriptions and with word-level and phoneme-level annotations. Each sample has a unique id and a speaker id, and the audio is sampled at 16 kHz. Transcriptions are provided as strings. Each word-level and phoneme-level annotation gives the text of the unit along with its start and end times. The dataset has a training split only, with 4969 samples and a reported total size of 499397350.603 bytes (roughly 499 MB).
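As a rough illustration of how the word-level and phoneme-level annotations described above might be used together, the sketch below maps an annotated span to sample indices at 16 kHz and groups phonemes under the word that contains them. The `Segment` class, the helper names, the assumption that times are in seconds, and the example values are all hypothetical; they are not taken from the dataset itself, whose actual field names may differ.

```python
from dataclasses import dataclass

SAMPLE_RATE_HZ = 16_000  # sampling rate stated in the dataset description


@dataclass(frozen=True)
class Segment:
    """One annotated unit (word or phoneme). Hypothetical layout."""
    text: str
    start: float  # assumed to be seconds from the start of the audio
    end: float    # assumed to be seconds from the start of the audio


def to_sample_range(seg, sample_rate=SAMPLE_RATE_HZ):
    """Convert a segment's start/end times to (first, last+1) sample indices."""
    return round(seg.start * sample_rate), round(seg.end * sample_rate)


def phonemes_in_word(word, phonemes, tol=1e-6):
    """Return the phonemes whose time span lies inside the word's span."""
    return [
        p for p in phonemes
        if p.start >= word.start - tol and p.end <= word.end + tol
    ]


# Made-up annotations for illustration only; not real dataset values.
words = [Segment("she", 0.10, 0.35), Segment("had", 0.35, 0.60)]
phonemes = [
    Segment("SH", 0.10, 0.22),
    Segment("IY1", 0.22, 0.35),
    Segment("HH", 0.35, 0.42),
    Segment("AE1", 0.42, 0.52),
    Segment("D", 0.52, 0.60),
]

for w in words:
    first, stop = to_sample_range(w)
    labels = [p.text for p in phonemes_in_word(w, phonemes)]
    print(f"{w.text}: samples [{first}, {stop}) phonemes {labels}")
```

With a decoded waveform array, the sample range could be used to slice out the audio for a single word or phoneme, for example `waveform[first:stop]`.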
Provider:
AdoCleanCode



