L2-KSU Native and Non-Native Arabic Speech
收藏DataCite Commons2025-06-03 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2024S11
下载链接
链接失效反馈官方服务:
资源简介:
<h3>Introduction</h3>
<p>L2-KSU Native and Non-Native Arabic Speech was developed by <a href="http://ksu.edu.sa/en/">King Saud University</a> (KSU) and contains approximately six hours of Modern Standard Arabic read speech from 80 subjects, along with transcripts and speaker metadata.</p>
<h3>Data</h3>
<p>The speech data was collected in 2022 from 40 native and 40 non-native speakers. Native speakers were from Saudi Arabia, Egypt, and Palestine. They provided audio recordings through the crowd sourcing platform <a href="https://khamsat.com/">Khamsat</a>. Non-native speakers were Central and West African students enrolled in KSU's Arabic Linguistics Institute; they provided speech recordings on site. All subjects read a series of ten sentences, repeating each sentence multiple times.</p>
<p>Audio is presented as 16-bit 16 kHz wav files. Transcript files in UTF-8 plain text, speaker metadata, and the Arabic sentences with transliteration, English translation and IPA transcription are also included in the documentation accompanying this release.</p>
<h3>Samples</h3>
<p>Please view these samples:</p>
<ul>
<li><a href="desc/addenda/LDC2024S11.wav">Native speaker (wav)</a></li>
<li><a href="desc/addenda/LDC2024S11.nn.wav">Non-native speaker (wav)</a></li>
<li><a href="desc/addenda/LDC2024S11.txt">Transcript (txt)</a></li>
</ul>
<h3>Updates</h3>
<p>None at this time.</p>
<p> </p>
提供机构:
Linguistic Data Consortium
创建时间:
2024-09-16



