UBC-NLP/NADI2025_subtask2_ASR
收藏Hugging Face2025-05-30 更新2025-05-31 收录
下载链接:
https://hf-mirror.com/datasets/UBC-NLP/NADI2025_subtask2_ASR
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个多地区的音频数据集,包含阿尔及利亚、埃及、约旦、毛里塔尼亚、摩洛哥、巴勒斯坦、阿联酋和也门的语音数据。每个地区的数据都包括训练集和验证集,每个集合中都有1600个音频样本,每个样本都有唯一的ID、持续时间、转录文本等信息。音频文件的采样率为16000Hz。
This dataset is a multi-regional audio dataset containing speech data from Algeria, Egypt, Jordan, Mauritania, Morocco, Palestine, UAE, and Yemen. Each regions data includes a training set and a validation set, with each set containing 1600 audio samples. Each sample has a unique ID, duration, transcription text, and other information. The sampling rate of the audio files is 16000Hz.
提供机构:
UBC-NLP



