Japanese Business News Text Supplement
收藏DataCite Commons2021-07-01 更新2024-07-13 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC99T34
下载链接
链接失效反馈官方服务:
资源简介:
<p>This corpus consists of newswire text from Nihon Keizai Shimbun, Inc. (NIKKEI), the largest Japanese daily financial newspaper, and Telerate, Inc. (formerly known as Dow Jones/Kyodo News Service), published primarily for managers of Japanese-owned corporations or Japanese employees working in North American financial institutions.</p><br>
<p>The Telerate portion constitutes all newswire text collected by the LDC between December 1994 and September 1998. The Telerate data collected from June 1995 to September 1998 serves as a supplement to the original publication.</p><br>
<p>All NIKKEI data was collected from December 1993 to November 1994 and is also available on the 1995 release of the Japanese Business News Text.</p><br>
<p>The data, including SGML tags, breaks down as follows.</p><br>
<p># of Files Daily Average Size Total Size -------------------------------------------------- NIKKEI 364 514K 188MB Telerate 1060 336K 357MB</p><br>
<p>The NIKKEI text was received on nine-track magnetic tape. The original character encoding was EBCDIC, but was converted to EUC encoding, which the LDC uses for its Japanese publications.</p><br>
<p>The Telerate text was received via a digital transmission service installed at the LDC by Telerate. Custom software was written by the LDC to poll a central database and download articles individually. The character encoding is EUC.</p><br>
<p>LDC added SGML tags automatically in order to identify individual stories within the daily collections.</p><br>
<h3>Additional Licensing Instructions</h3><br>
<p>This 'members-only' corpora is available to current members who can request the data at the listed reduced-license fee. Contact <a href="mailto:ldc@ldc.upenn.edu">ldc@ldc.upenn.edu</a> for information about becoming a member.</p></br>
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30



