Corpus of Conversational Persian Transcripts
收藏DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2019T11
下载链接
链接失效反馈官方服务:
资源简介:
<h3>Introduction</h3><br>
<p>Corpus of Conversational Persian Transcripts consists of transcripts from approximately 20 hours of naturally occurring informal conversations in the Tehrani dialect of Iranian Persian. The corresponding speech is not included in this release.</p><br>
<h3>Data</h3><br>
<p>This corpus is extracted from 1,201 minutes of conversations among 22 participants, 12 male and 10 female. The participants recorded their daily phone calls and face-to-face interactions in a variety of informal settings. The conversations represent various interaction types, settings, types of relationship, and communicative goals.</p><br>
<p>The transcripts were annotated for gender, age, and recording method and setting. See the included documentation for more information about the annotations and transcription methodology.</p><br>
<p>Each conversation is presented as a UTF-8 encoded XML file.</p><br>
<h3>Samples</h3><br>
<p>Please view this <a href="desc/addenda/LDC2019T11.xml">sample</a>.</p><br>
<h3>Updates</h3><br>
<p>None at this time.</p></br>
Portions © 2019 Ariana N. Mohammadi, © 2019 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30



