Arabic Speech Recognition Pronunciation Dictionary
收藏DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2017L01
下载链接
链接失效反馈官方服务:
资源简介:
<h3>Introduction</h3><br>
<p>Arabic Speech Recognition Pronunciation Dictionary was developed by the <a href="http://qcri.qa/">Qatar Computing Research Institute</a>. It contains approximately two million pronunciation entries for 526,000 Modern Standard Arabic words, for an average of 3.84 pronunciations for each grapheme word.</p><br>
<h3>Data</h3><br>
<p>The dictionary was developed from news archive resources, including the Arabic news website <a href="http://www.aljazeera.net/">Aljazeera.net</a>. The selected words were those that occurred more than once in the news collection. The text was processed using <a href="http://www.ccls.columbia.edu/project/madatokan/">MADA</a>.</p><br>
<p>The dictionary is presented in a single UTF-8 plain text file.</p><br>
<h3>Samples</h3><br>
<p>Please view this <a href="desc/addenda/LDC2017L01.txt">sample</a>.</p><br>
<h3>Updates</h3><br>
<p>None at this time.</p></br>
Portions © 2017 Qatar Computing Research Institute, © 2017 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30



