five

L2-KSU Native and Non-Native Arabic Speech

收藏
DataCite Commons2025-06-03 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2024S11
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3> <p>L2-KSU Native and Non-Native Arabic Speech was developed by <a href="http://ksu.edu.sa/en/">King Saud University</a>&nbsp;(KSU) and contains approximately six hours of Modern Standard Arabic read speech from 80 subjects, along with transcripts and speaker metadata.</p> <h3>Data</h3> <p>The speech data was collected in 2022 from 40 native and 40 non-native speakers. Native speakers were from Saudi Arabia, Egypt, and Palestine. They provided audio recordings through the crowd sourcing platform&nbsp;<a href="https://khamsat.com/">Khamsat</a>. Non-native speakers were Central and West African students enrolled in KSU's Arabic Linguistics Institute; they provided speech recordings on site. All subjects read a series of ten sentences, repeating each sentence multiple times.</p> <p>Audio is presented as 16-bit 16 kHz wav files. Transcript files in UTF-8 plain text, speaker metadata, and the Arabic sentences with transliteration, English translation and IPA transcription are also included in the documentation accompanying this release.</p> <h3>Samples</h3> <p>Please view these samples:</p> <ul> <li><a href="desc/addenda/LDC2024S11.wav">Native speaker (wav)</a></li> <li><a href="desc/addenda/LDC2024S11.nn.wav">Non-native speaker (wav)</a></li> <li><a href="desc/addenda/LDC2024S11.txt">Transcript (txt)</a></li> </ul> <h3>Updates</h3> <p>None at this time.</p> <p>&nbsp;</p>
提供机构:
Linguistic Data Consortium
创建时间:
2024-09-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作