five

2008/2010 NIST Metrics for Machine Translation (MetricsMaTr) GALE Evaluation Set

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2011T05
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>2008/2010 NIST Metrics for Machine Translation (MetricsMaTr) GALE Evaluation Set, Linguistic Data Consortium (LDC) catalog number LDC2011T05 and isbn 1-58563-575-8, is a package containing source data, reference translations, machine translations and associated human judgments used in the NIST 2008 and 2010 MetricsMaTr evaluations. The package was compiled by researchers at NIST, making use of Arabic and Chinese broadcast, newswire and web data and reference translations collected and developed by LDC for Phase 2 and Phase 2.5 of the DARPA <a href="http://projects.ldc.upenn.edu/gale/index.html" rel="nofollow">GALE </a>program.</p><br> <p><a href="http://www.itl.nist.gov/iad/mig/tests/metricsmatr/" rel="nofollow">NIST MetricsMaTr</a> is a series of research challenge events for machine translation (MT) metrology, promoting the development of innovative MT metrics that correlate highly with human assessments of MT quality. Participants submit their metrics to NIST (National Institute of Standards and Technology). NIST runs those metrics on certain held-back test data for which it has human assessments measuring quality and then calculates correlations between the automatic metric scores and the human assessments. Specifically, the goals of MetricsMATR are: to inform other MT technology evaluation campaigns and conferences with regard to improved metrology to establish an infrastructure that encourages the development of innovative metrics to build a diverse community that will bring new perspectives to MT metrology research and to provide a forum for MT metrology discussion and for establishing future directions of MT metrology.</p><br> <p>The first MetricsMaTr challenge was held in <a href="http://www.itl.nist.gov/iad/mig/tests/metricsmatr/2008/" rel="nofollow">2008</a> the development data from the 2008 program is available from LDC, <a href="http://catalog.ldc.upenn.edu/LDC2009T05" rel="nofollow">2008 NIST Metrics for Machine Translation (MetricsMATR08) Development Data LDC2009T05</a>. The <a href="http://www.ldc.upenn.edu/Catalog/docs/LDC2011T05/NISTMetricsMaTr10EvalPlan.pdf" rel="nofollow">MetricsMaTr10 evaluation plan</a> is included in this release.</p><br> <h3>Data</h3><br> <p>This release contains 149 documents with corresponding reference translations (Arabic-to-English and Chinese-to-English), system translations and human assessments. The human assessments include the following: Adequacy7 (a 7-point scale for judging the meaning of a system translation with respect to the reference translation) Adequacy Yes/No (whether the given system segment meant essentially the same as the reference translation) Preference (the judges preference between two candidate translations when compared to a human reference translation) and HTER (Human Targeted Error Rate, human edits to a system translation to have the same meaning as a reference translation).</p><br> <h3>Updates</h3><br> <p>Additional information, updates, bug fixes may be available in the LDC catalog entry for this corpus at <a href="http://catalog.ldc.upenn.edu/LDC2011T05" rel="nofollow">LDC2011T05</a>.</p><br> <h3>Sponsorship</h3><br> <p>This work is supported in part by the Defense Advanced Research Projects Agency, GALE Program Grant No. HR0011-06-1-003. The content of this publication does not necessarily reflect the position or policy of the Government, and no official endorsement should be inferred.</p><br> <h3>Samples</h3><br> <p>Please view this&nbsp;<a href="desc/addenda/LDC2011T05.jpg">sample</a>.</p></br> Portions © 2006 Abu Dhabi TV, Agence France Presse, Al-Ahram, Al Alam News Channel, Al Arabiya, Al Hayat, Al Iraqiyah, Al Quds-Al Arabi, An Nahar, Asharq al-Awsat, China Central TV, China Military Online, Chinanews.com, Guangming Daily, Kuwait TV, New Tang Dynasty TV, Nile TV, PAC, Ltd, Peoples Daily Online, Phoenix TV, Syria TV, Xinhua News Agency, © 2006, 2011 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作