2001 HUB5 Mandarin Transcripts
Source: DataCite Commons | Updated: 2021-07-01 | Indexed: 2025-04-16
Download link:
https://catalog.ldc.upenn.edu/LDC2003T01
Description:
<h3>Introduction</h3><br>
<p>2001 HUB5 Mandarin Transcripts was developed by the Linguistic Data Consortium (LDC).</p><br>
<p>This publication contains transcripts for twenty CALLHOME Mandarin telephone conversations. These twenty conversations were used in the 2001 HUB5 Non-English evaluation run by the National Institute of Standards and Technology (NIST), and are published as 2001 HUB5 Mandarin Evaluation (<a href="../../../LDC2002S12">LDC2002S12</a>).</p><br>
<h3>Data</h3><br>
<p>There are 20 data files in .txt format.</p><br>
<p>The .txt files are transcript files rendered in Mandarin script orthography, containing the orthographic forms that were used in the original transcription process. These forms also serve as the head-words in the associated CALLHOME Mandarin Lexicon (<a href="http://catalog.ldc.upenn.edu/LDC96L15" rel="nofollow">LDC96L15</a>).</p><br>
<p>Please follow these links for a sample transcript: <a href="desc/addenda/LDC2003T01.txt" rel="nofollow">Mandarin script</a> | <a href="desc/addenda/LDC2003T01.gif" rel="nofollow">GIF format</a>.</p><br>
<h3>Updates</h3><br>
<p>There are no updates at this time.</p><br>
Portions © 2003 Trustees of the University of Pennsylvania.
Provider:
Linguistic Data Consortium
Created:
2020-11-30



