
Hong Kong Hansards Parallel Text

Source: DataCite Commons. Updated 2021-07-01. Indexed 2025-04-16.
Download link:
https://catalog.ldc.upenn.edu/LDC2000T50
Description:
<h3>Introduction</h3><br> <p>Hong Kong Hansards Parallel Text was developed by the Linguistic Data Consortium (LDC) and contains excerpts from the Official Record of Proceedings of the Legislative Council of the Hong Kong Special Administrative Region (HKSAR) from October 1995 to April 2000.</p><br> <p>LDC thanks the Hong Kong Special Administrative Region of the People's Republic of China for granting permission to distribute this data to the research community.</p><br> <p>The Legislative Council normally meets every Wednesday afternoon in the Chamber of the Legislative Council Building. Business includes discussion of subsidiary legislation, papers, reports, addresses, statements, questions, the three readings of bills, motions and debates.</p><br> <p>From time to time, the Chief Executive attends a special Council meeting to brief Members on policy issues and to answer questions from Members. All Council meetings are open to the public. The proceedings of the meetings are recorded verbatim in the Official Record of Proceedings of the Legislative Council (Hansard).</p><br> <p>The record of proceedings is first produced in the original language delivered by the speakers (Floor Version). It is then translated separately into English and Chinese versions.</p><br> <h3>Data</h3><br> <p>This corpus contains excerpts from the official record of meetings from October 1995 to April 2000. There are 11.9 million English words and 18.15 million Chinese characters in this release.&nbsp;Chinese text is presented in the traditional script and encoded as BIG5.</p><br> <p>There are 388 files in the data/ subdirectory of this corpus: half (194 files) are in English in the data/english/ subdirectory and half (194 files) are in Chinese in the data/chinese/ subdirectory. Data file names are in the form YYYYMMDD_[ce].doc, where YYYYMMDD indicates the date of the meeting, c=Chinese and e=English.
As an example of the text in this corpus, the <a href="desc/addenda/LDC2000T50_c.gif" rel="nofollow">Chinese sample</a> is part of the Chinese language record of the meeting held on May 24, 1997. The parallel English text is in the <a href="desc/addenda/LDC2000T50_e.html" rel="nofollow">English sample</a>.</p><br> <h3>Copying and Distribution</h3><br> <p>Permission has been granted to the Linguistic Data Consortium to make and distribute copies of the laws, press releases and news of Hong Kong Special Administrative Region provided this copyright notice and permission notice are distributed with all copies.</p><br> <p>Permission has been given to reproduce the laws, press releases, and/or news articles from the Hong Kong Special Administrative Region Government website for research, education, and technology development.</p><br> <h3>Updates</h3><br> <p>There are no updates at this time.</p><br> <h3>Additional Licensing Instructions</h3><br> <p>This 'members-only' corpus is available to current members, who can request the data at the listed reduced-license fee. Contact&nbsp;<a href="mailto:ldc@ldc.upenn.edu">ldc@ldc.upenn.edu</a>&nbsp;for information about becoming a member.</p><br> Portions © 1995-2000 The Government of the Hong Kong Special Administrative Region, © 2000 Trustees of the University of Pennsylvania
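The file layout described under Data (data/english/ and data/chinese/, names of the form YYYYMMDD_[ce].doc, Chinese text in BIG5) suggests a simple way to pair the two language versions of each meeting. The following is a minimal sketch, not an official LDC tool. The helper names `parse_name`, `pair_files` and `read_text` are hypothetical, and two points are assumptions rather than facts from the description: that the .doc files are plain text, and that the English files can be read as Latin-1.

```python
import re
from datetime import date, datetime
from pathlib import Path

# Documented naming scheme: YYYYMMDD_[ce].doc (c = Chinese, e = English).
NAME_RE = re.compile(r"^(\d{8})_([ce])\.doc$")


def parse_name(filename):
    """Return (meeting_date, language), or None if the name does not fit the scheme."""
    match = NAME_RE.match(filename)
    if match is None:
        return None
    try:
        meeting = datetime.strptime(match.group(1), "%Y%m%d").date()
    except ValueError:
        # Eight digits that do not form a real calendar date.
        return None
    language = "chinese" if match.group(2) == "c" else "english"
    return meeting, language


def pair_files(root):
    """Map each meeting date to {'english': Path, 'chinese': Path} under root/data/."""
    pairs = {}
    for lang_dir in ("english", "chinese"):
        for path in sorted((Path(root) / "data" / lang_dir).glob("*.doc")):
            parsed = parse_name(path.name)
            if parsed is None:
                continue
            meeting, language = parsed
            pairs.setdefault(meeting, {})[language] = path
    return pairs


def read_text(path, language):
    """Decode one file. BIG5 for Chinese is documented; Latin-1 for English is an assumption."""
    encoding = "big5" if language == "chinese" else "latin-1"
    return Path(path).read_text(encoding=encoding, errors="replace")
```

Calling `pair_files` on the corpus root and keeping only the dates that have both an `english` and a `chinese` entry gives document-level pairs. Sentence-level alignment is a separate step that this sketch does not attempt.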
Provider:
Linguistic Data Consortium
Created:
2020-11-30