Corpus of Conversational Persian Transcripts

Name: Corpus of Conversational Persian Transcripts
Creator: Linguistic Data Consortium
Published: 2021-07-01 16:33:29
License: 暂无描述

DataCite Commons2021-07-01 更新2025-04-16 收录

下载链接：

https://catalog.ldc.upenn.edu/LDC2019T11

下载链接

链接失效反馈

官方服务：

资源简介：

<h3>Introduction</h3><br> <p>Corpus of Conversational Persian Transcripts consists of transcripts from approximately 20 hours of naturally occurring informal conversations in the Tehrani dialect of Iranian Persian. The corresponding speech is not included in this release.</p><br> <h3>Data</h3><br> <p>This corpus is extracted from 1,201 minutes of conversations among 22 participants, 12 male and 10 female. The participants recorded their daily phone calls and face-to-face interactions in a variety of informal settings. The conversations represent various interaction types, settings, types of relationship, and communicative goals.</p><br> <p>The transcripts were annotated for gender, age, and recording method and setting. See the included documentation for more information about the annotations and transcription methodology.</p><br> <p>Each conversation is presented as a UTF-8 encoded XML file.</p><br> <h3>Samples</h3><br> <p>Please view this <a href="desc/addenda/LDC2019T11.xml">sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2019 Ariana N. Mohammadi, © 2019 Trustees of the University of Pennsylvania

提供机构：

Linguistic Data Consortium

创建时间：

2020-11-30

5,000+

优质数据集

54 个

任务类型

进入经典数据集