
JANA: A Human-Human Dialogues Corpus for Egyptian Dialect

Source: DataCite Commons. Updated: 2021-07-01. Indexed: 2025-04-16.
Download link:
https://catalog.ldc.upenn.edu/LDC2016T24
Resource description:
<h3>Introduction</h3><br> <p>JANA: A Human-Human Dialogues Corpus for Egyptian Dialect was developed by researchers at <a href="https://cu.edu.eg/Home">Cairo University</a>. It consists of 82 transcribed dialogues from call center inquiries annotated for dialogue acts.</p><br> <p>Data was collected from call centers for banks, airlines and mobile network providers as follows: (1) spontaneous spoken dialogues from inquiries to banks and airlines; and (2) instant messaging (chat) dialogues from a mobile network provider's online support system.</p><br> <h3>Data</h3><br> <p>The transcribed dialogues consist of 52 telephone calls and 30 instant messaging conversations, amounting to approximately 20,311 words. The data contains roughly 3,001 conversation turns, with an average of 6.7 words per turn, and 4,725 utterances, with an average of 4.3 words per utterance. The data was transcribed using <a href="http://transag.sourceforge.net/">Transcriber</a>.</p><br> <p>All data is presented as UTF-8 XML.</p><br> <h3>Samples</h3><br> <p>Please view this <a href="desc/addenda/LDC2016T24.xml">sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p><br> <h3>Pricing</h3><br> <p>Not-for-profit organizations may license this data set for US$25.00 under the LDC Not-for-Profit Membership Agreement or under the LDC User Agreement for Non-Members for use in linguistic research, education and non-commercial technology development. For-profit organizations may license this data for US$1650 under the Commercial License Agreement for JANA: A Human-Human Dialogues Corpus for Egyptian Dialect (LDC2016T24).</p><br> <p>Current fees in this catalog entry reflect those pertaining to a for-profit organization license. Not-for-profit organizations should contact LDC's Membership Office to license this data set.</p><br> Portions © 2016 AbdelRahim AbdelSabour AbdelHalim Mohamed Elmadany, © 2016 Trustees of the University of Pennsylvania
Provider:
Linguistic Data Consortium
Created:
2020-11-30