five

CALLHOME Egyptian Arabic Speech

收藏
DataCite Commons2021-07-01 更新2024-07-13 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC97S45
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>The <a href="../../../Catalog/docs/LDC97T19/index.html" rel="nofollow">CALLHOME Egyptian Arabic</a> corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of Egyptian Colloquial Arabic (ECA), the spoken variety of Arabic found in Egypt. The dialect of ECA that this dictionary represents is Cairene Arabic.</p><br> <h3>Data</h3><br> <p>All calls, which lasted up to 30 minutes, originated in North America and were placed to locations overseas (typically Egypt). Most participants called family members or close friends.</p><br> <p>This corpus contains speech data files ONLY, along with the minimal amount of documentation needed to describe the contents and format of the speech files and the software packages needed to uncompress the speech data. The transcripts and documentation (<a href="http://catalog.ldc.upenn.edu/LDC97T19" rel="nofollow">LDC97T19</a>) are available separately, as is an associated lexicon (<a href="http://catalog.ldc.upenn.edu/LDC99L22" rel="nofollow">LDC99L22</a>).</p><br> <h3>Samples</h3><br> <p>Please listen to this <a href="desc/addenda/LDC97S45.sph">speech sample</a>.</p><br> <h3>Updates</h3><br> <p>The "shorten" and "sphere" directories have been removed.</p><br> <p>The sphere directory contained NIST "SPeech HEader REsources" (SPHERE): C-language source code libraries and utilities for manipulating NIST SPHERE-format waveform files.</p><br> <p>The shorten directory contained files for Tony Robinson's "shorten" software for speech compression.</p><br> <p>A more recent version of the SPHERE utilities is now available on the <a href="http://www.nist.gov/speech/tools/index.htm" rel="nofollow">NIST web site</a>; additional utilities for converting from SPHERE to other waveform file formats is also available at the <a href="http://www.ldc.upenn.edu/Using/" rel="nofollow">LDC web site.</a></p></br> Portions © 1996-1997 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作