
Korean Telephone Conversations Lexicon

DataCite Commons | Updated: 2021-07-01 | Indexed: 2025-04-16
Download link:
https://catalog.ldc.upenn.edu/LDC2003L02
Description:
Introduction

Korean Telephone Conversations Lexicon was produced by the Linguistic Data Consortium (LDC), catalog number LDC2003L02, ISBN 1-58563-265-1.

The lexicon consists of 25,251 words and provides separate fields with phonological, morphological, and frequency information for each word.

The lexicon covers every token occurring in the 100 telephone conversations transcribed and published as Korean Telephone Conversations Transcripts (LDC2003T08); token coverage is 100%. The corresponding speech is published as Korean Telephone Conversations Speech (LDC2003S03).

Data

Each lexicon entry contains five tab-separated fields:

1. orthographic form in Hangul (head-word), encoded in the KSC-5601 (Wansung) system
2. orthographic form in Yale romanization
3. pronunciation
4. frequency of the word in Korean Telephone Conversations Transcripts
5. morphological analysis of the word

A sample page from the lexicon is available as text (desc/addenda/LDC2003L02.txt) and as an image (desc/addenda/LDC2003L02.gif).

Updates

There are no updates available at this time.

Portions © 2003 Trustees of the University of Pennsylvania.
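The five tab-separated fields described in the Data section map naturally onto a small record type. The following is a minimal Python sketch of a reader, not an official LDC tool. It rests on several assumptions that the description does not confirm: that the file has no header row, that the frequency field is a plain integer, and that Python's `euc_kr` codec (which covers KS C 5601 Wansung) decodes the file correctly. The names `LexiconEntry`, `parse_lexicon_lines`, and `read_lexicon` are hypothetical.

```python
from typing import Iterable, Iterator, List, NamedTuple


class LexiconEntry(NamedTuple):
    """One lexicon row; field order follows the five fields listed above."""
    hangul: str         # head-word in Hangul
    yale: str           # head-word in Yale romanization
    pronunciation: str  # pronunciation field, kept as an opaque string
    frequency: int      # count in the transcripts (assumed to be an integer)
    morphology: str     # morphological analysis, kept as an opaque string


def parse_lexicon_lines(lines: Iterable[str]) -> Iterator[LexiconEntry]:
    """Yield one LexiconEntry per non-empty, tab-separated line."""
    for line_number, line in enumerate(lines, start=1):
        line = line.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        fields = line.split("\t")
        if len(fields) != 5:
            raise ValueError(
                f"line {line_number}: expected 5 fields, got {len(fields)}"
            )
        hangul, yale, pronunciation, frequency, morphology = fields
        yield LexiconEntry(hangul, yale, pronunciation, int(frequency), morphology)


def read_lexicon(path: str, encoding: str = "euc_kr") -> List[LexiconEntry]:
    """Read a lexicon file; the default encoding is an assumption (see above)."""
    with open(path, encoding=encoding) as handle:
        return list(parse_lexicon_lines(handle))
```

If the assumptions hold, `read_lexicon(path)` returns a list whose entries can be sorted or filtered by `frequency`. The pronunciation and morphology fields are left unparsed because their internal notation is not described here.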
Provider:
Linguistic Data Consortium
Created:
2020-11-30