five

Corpus of Conversational Persian Transcripts

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2019T11
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>Corpus of Conversational Persian Transcripts consists of transcripts from approximately 20 hours of naturally occurring informal conversations in the Tehrani dialect of Iranian Persian. The corresponding speech is not included in this release.</p><br> <h3>Data</h3><br> <p>This corpus is extracted from 1,201 minutes of conversations among 22 participants, 12 male and 10 female. The participants recorded their daily phone calls and face-to-face interactions in a variety of informal settings. The conversations represent various interaction types, settings, types of relationship, and communicative goals.</p><br> <p>The transcripts were annotated for gender, age, and recording method and setting. See the included documentation for more information about the annotations and transcription methodology.</p><br> <p>Each conversation is presented as a UTF-8 encoded XML file.</p><br> <h3>Samples</h3><br> <p>Please view this <a href="desc/addenda/LDC2019T11.xml">sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2019 Ariana N. Mohammadi, © 2019 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作