katyayego/ASCEND-phoneme
Source: Hugging Face. Updated: 2024-07-02. Indexed: 2024-07-06.
Download link:
https://hf-mirror.com/datasets/katyayego/ASCEND-phoneme
Description:
This dataset is a modified version of the ASCEND dataset, which consists of spontaneous Mandarin-English code-switched speech. It adds a phonetic transcription column generated with the eSpeak backend of the phonemizer library. Fields include the audio file path, the loaded audio array, the transcription, the phonetic transcription, datapoint ID, duration, language, speaker ID, session ID, and topic. The dataset is split into train, validation, and test sets containing 9869, 1130, and 1315 utterances respectively. The languages are English and Chinese, and the dataset is intended mainly for automatic speech recognition.
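As a minimal sketch of how the splits described above might be loaded and checked, the following assumes the Hugging Face `datasets` library is installed, that the repository ID matches the page title, and that the split keys are named `train`, `validation`, and `test`. The helper names (`split_fractions`, `load_ascend_phoneme`, `check_split_sizes`) are hypothetical and are not part of the dataset or any library.

```python
# Split sizes as stated in the dataset description.
SPLIT_SIZES = {"train": 9869, "validation": 1130, "test": 1315}


def split_fractions(sizes):
    """Return each split's share of the total number of utterances."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def load_ascend_phoneme():
    """Load all splits from the Hugging Face Hub.

    Requires the `datasets` package and network access. The import is
    done lazily so the other helpers work without it.
    """
    from datasets import load_dataset

    # Repository ID taken from the page title; split key names are an assumption.
    return load_dataset("katyayego/ASCEND-phoneme")


def check_split_sizes(dataset, expected=SPLIT_SIZES):
    """Report, per split, whether its length matches the documented count."""
    return {name: len(dataset[name]) == count for name, count in expected.items()}
```

Calling `load_ascend_phoneme()` and printing the result should list the column names and row counts for each split, which is the safest way to find the exact name of the phonetic transcription column; passing the result to `check_split_sizes` compares it against the counts quoted in the description.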
Provided by:
katyayego



