
2001 HUB5 Mandarin Transcripts

Source: DataCite Commons. Updated 2021-07-01; indexed 2025-04-16.
Download link:
https://catalog.ldc.upenn.edu/LDC2003T01
Resource description:
<h3>Introduction</h3><br> <p>2001 HUB5 Mandarin Transcripts was developed by the Linguistic Data Consortium (LDC).</p><br> <p>This publication contains transcripts for twenty CALLHOME Mandarin telephone conversations. These twenty conversations were used in NIST's 2001 HUB5 Non-English evaluation, and are published as 2001 HUB5 Mandarin Evaluation (<a href="../../../LDC2002S12">LDC2002S12</a>).</p><br> <h3>Data</h3><br> <p>There are 20 data files in .txt format.</p><br> <p>The .txt files are transcript files rendered in Mandarin script orthography, containing the orthographic forms that were used in the original transcription process. These forms also serve as the head-words in the associated CALLHOME Mandarin Lexicon (<a href="http://catalog.ldc.upenn.edu/LDC96L15" rel="nofollow">LDC96L15</a>).</p><br> <p>Please follow these links for a sample transcript: <a href="desc/addenda/LDC2003T01.txt" rel="nofollow">Mandarin script</a> | <a href="desc/addenda/LDC2003T01.gif" rel="nofollow">GIF format</a>.</p><br> <h3>Updates</h3><br> <p>There are no updates at this time.</p><br> Portions © 2003 Trustees of the University of Pennsylvania.

Provider:
Linguistic Data Consortium
Created:
2020-11-30