five

Japanese Business News Text

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC95T8
下载链接
链接失效反馈
官方服务:
资源简介:
<p>The Linguistic Data Consortium announces the availability of a Japanese language text corpus composed of business and financial news from two sources:</p><br> <ol><br> <li>Approximately 30 million words of text have been made available from the morning edition of Nihon Kezai Shimbun, the largest Japanese financial news daily newspaper; the release this year covers all text that was published during 1994.<br> <p>The data was received at the LDC on nine-track magnetic tape; the character encoding was EBCDIC, but was standardized to EUC, which the LDC has chosen as its standard for Japanese.</p><br> </li><br> <li>A smaller part of the corpus comes from Dow Jones Telerate, which markets its Japanese Language Service. This is a financial newswire produced by Kyodo News Service; its recipients are primarily managers of Japanese owned corporations, or Japanese employees working in North American brokerage houses, banking, etc. The text is received at the LDC via a digital transmission service installed by Telerate; special software was written by the LDC to poll a central database and download articles individually. The character encoding is EUC.</li><br> </ol><br> <p>This corpus is available to LDC members only.</p><br> <p>&nbsp;</p></br> Portions © 1994-1995 Kyodo News Service, © 1995 Nihon Keizai Shimbun, © 1995 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作