five

Arabic Broadcast News Speech

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2006S46
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>Arabic Broadcast News Speech was developed by the Linguistic Data Consortium (LDC) and contains eight audio files totalling 10 hours of Arabic broadcast speech. The data was recorded by LDC from Voice of America (VOA) satellite radio news broadcasts in Arabic transmitted between June 2000 and January 2001. The corresponding transcripts for these speech files are available in <a href="../../../LDC2006T20">Arabic Broadcast News Transcripts (LDC2006T20)</a>.</p><br> <p>This work was undertaken in the Networking Data Centers (NetDC) project (MLIS-5017, NSF IIS-9982201) in conjunction with the <a href="http://www.elra.info/en/">European Language Resources Association</a> (ELRA). ELRA collected 22.5 hours of Arabic broadcast data from Radio Orient (France) that is available in <a href="http://catalog.elra.info/product_info.php?products_id=13">NetDC Arabic Broadcast News Speech Corpus (ELRA-S0157)</a>. The goal of the NetDC project was to improve the infrastructure for language resources by designing and implementing new modes of cooperation between LDC and ELRA.</p><br> <h3>Data</h3><br> <p>The recordings were captured from a dedicated satellite receiver and stored as 16-bit PCM, 16-kHz, single-channel, in NIST SPHERE format. The duration of each recording is either 60 minutes or 120 minutes, depending on the VOA broadcast schedule. The date (YYYYMMDD), start-time, and end-time (HHMM EST) for each recording are indicated in the file names. The sample data are not compressed.</p><br> <h3>Samples</h3><br> <p>For an example of the speech in this corpus, please listen to this <a href="desc/addenda/LDC2006S46.wav" rel="nofollow">sample (WAV)</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2000, 2001, 2002, 2005, 2006 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作