King Saud University Arabic Speech Database

Name: King Saud University Arabic Speech Database
Creator: Linguistic Data Consortium
Published: 2025-02-12 08:55:11
License: No description available

DataCite Commons, updated 2025-02-12, indexed 2025-04-16

Download link:

https://catalog.ldc.upenn.edu/LDC2014S02

Resource description:

<h3>Introduction</h3><br> <p>King Saud University Arabic Speech Database was developed by the Speech Group (SG) at <a href="http://ksu.edu.sa/en/">King Saud University</a> and contains 590 hours of recorded Arabic speech from 269 male and female speakers. The utterances include read and spontaneous speech. The recordings were made in varied environments representing quiet and noisy settings.</p><br> <h3>Data</h3><br> <p>The corpus was designed principally for speaker recognition research. Other possible applications include first language recognition and the study of mobile effects, multichannel effects, and the use of different types of microphones. The speech sources are word lists, sentence lists, paragraphs, and question and answer sessions. Read speech text includes the following:</p><br> <ul><br> <li>Sets of sentences devised to cover allophones of each phoneme, phonetic balance, and differentiation of accents.</li><br> <li>Word lists developed to minimize missing phonemes and to represent nasals, fricatives, commonly used words, and numbers.</li><br> <li>Two paragraphs selected because they included all letters of the alphabet and were easy to read.</li><br> </ul><br> <p>Spontaneous speech was captured through question and answer sessions in which speakers answered questions displayed on screen. The questions were on general topics such as the weather and food and included the speaker name or number.</p><br> <p>The speakers were Saudis and non-Saudis. Among the non-Saudi participants were Arabs and non-Arabs. All female speakers were either Saudis or non-Saudi Arabs. Male speakers included non-Arabs from the Indian subcontinent, Africa, South East Asia and East Europe. Non-Arab participants were required to be able to read Arabic at an acceptable level. Most of the non-Arab speakers were from the fourth level in the <a href="http://ali.ksu.edu.sa/en">Arabic Linguistics Institute</a> at King Saud University. 
The non-Saudi participants represented 28 nationalities and were chosen from clusters of areas or countries.</p><br> <p>Each speaker was recorded in three different environments: in a soundproof room, in an office, and in a cafeteria. The recordings were collected via different microphones and a mobile phone and averaged 16 to 19 minutes. The recordings were done in three sessions with a time gap of approximately 6 weeks.</p><br> <p>The data was verified for missing recordings, problems with the recording system, and errors in the recording process. All files are presented as two-channel, 48 kHz, 16-bit FLAC-compressed PCM wav files. Note that sizes and file names in the documentation are for the uncompressed wav files.</p><br> <h3>Samples</h3><br> <p>Please listen to this <a href="desc/addenda/LDC2014S02.m.wav" rel="nofollow">male sample</a> and this <a href="desc/addenda/LDC2014S02.f.wav" rel="nofollow">female sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p><br> Portions © 2014 King Saud University, © 2014 Trustees of the University of Pennsylvania
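
Because the documented file sizes refer to the uncompressed wav files, it can help to estimate what those sizes should be from the stated format (two channels, 48 kHz, 16-bit). The following is a rough sketch, not part of the corpus documentation: the function name `pcm_bytes` is hypothetical, the WAV header is ignored, and the 16-minute and 19-minute durations are used only as illustrations, since the description does not say how a recording is split across files.

```python
# Format parameters taken from the description above.
SAMPLE_RATE_HZ = 48_000   # 48 kHz
CHANNELS = 2              # two-channel
BYTES_PER_SAMPLE = 2      # 16-bit PCM


def pcm_bytes(duration_s: float) -> int:
    """Approximate size of the raw PCM payload (WAV header excluded).

    Hypothetical helper for illustration only.
    """
    return int(duration_s * SAMPLE_RATE_HZ * CHANNELS * BYTES_PER_SAMPLE)


bytes_per_second = pcm_bytes(1)        # 192,000 bytes per second
size_16_min = pcm_bytes(16 * 60)       # about 184 MB (decimal)
size_19_min = pcm_bytes(19 * 60)       # about 219 MB (decimal)

if __name__ == "__main__":
    for label, size in [("1 s", bytes_per_second),
                        ("16 min", size_16_min),
                        ("19 min", size_19_min)]:
        print(f"{label:>6}: {size:,} bytes ({size / 1e6:.1f} MB)")
```

The FLAC files as distributed will be smaller than these figures, since FLAC is lossless compression and the achieved ratio depends on the audio content.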

Provider:

Linguistic Data Consortium

Created:

2020-11-30
