Message Understanding Conference (MUC) 7

DataCite Commons · Updated 2021-07-01 · Indexed 2025-04-16
Download link:
https://catalog.ldc.upenn.edu/LDC2001T02
Resource description:
<h3>Introduction</h3>
<p>Message Understanding Conference (MUC) 7 was produced by the Linguistic Data Consortium (LDC). Its catalog number is LDC2001T02 and its ISBN is 1-58563-205-8.</p>
<p>In the 1990s, the MUC evaluations funded the development of metrics and statistical algorithms to support government evaluations of emerging information extraction technologies. Additional information from NIST can be found <a href="http://www.itl.nist.gov/iaui/894.02/related_projects/muc" rel="nofollow">here</a>.</p>
<h3>Data</h3>
<p>The following table shows the correspondence between versions of the IE task definition and stages of the MUC-7 evaluation.</p>
<table>
<thead>
<tr>
<th>Version #</th>
<th>Stage</th>
</tr>
</thead>
<tbody>
<tr>
<td>4.1</td>
<td>training and dryrun</td>
</tr>
<tr>
<td>4.2</td>
<td>formalrun</td>
</tr>
<tr>
<td>5.1</td>
<td>final</td>
</tr>
</tbody>
</table>
<p>The dryrun and formalrun cover different domains: the dryrun (and training) data consists of air crash scenarios, and the formalrun data consists of missile launch scenarios. The final version mainly updates the Template Relations portion of the guidelines.</p>
<p>Normally, two datasets are provided for each scenario: training and test. When the evaluation cycle begins, the scenario dataset is labeled training. The corresponding test dataset for that same scenario is then used for the dryrun testing. For the formal run, a formal training set is given out four weeks before the test answers are due, and the formal test is given out one week before the test answers are due. After the entire evaluation and meeting have been held, final edits are made if necessary.</p>
<h3>Samples</h3>
<p>Please view this <a href="desc/addenda/LDC2001T02.txt">text sample</a>.</p>
<h3>Updates</h3>
<p>August 22, 2001: This publication was inadvertently released without the guidelines documentation and the scoring software. These documents and programs have now been added to the publication. If you previously purchased this corpus and would like to download a complete copy, please contact <a rel="nofollow">ldc@ldc.upenn.edu</a>.</p>
<p>Portions © 1996 New York Times, © 2001 Trustees of the University of Pennsylvania</p>
Provider:
Linguistic Data Consortium
Created:
2020-11-30