
AIDA Scenario 1 and 2 Reference Knowledge Base

Source: DataCite Commons. Updated: 2025-05-06. Indexed: 2024-07-13.
Download link:
https://catalog.ldc.upenn.edu/LDC2023T10
Resource description:
<h3>Introduction</h3> <p>AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in Venezuela). The KB content was drawn from GeoNames, the CIA World Leaders List and the CIA World Factbook and was supplemented with manually created KB entries developed specifically for AIDA data.</p> <p>The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages.</p> <p>Each phase of the AIDA program focused on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 1 scenario focused on political relations between Russia and Ukraine in the 2010s. The socioeconomic and political crisis in Venezuela since 2010 was the scenario in Phase 2.</p> <h3>Data</h3> <p>This knowledge base supported the AIDA entity detection and linking task for 13 entity types: GPE (Geo-Political Entity), LOC (Location), PER (Person), ORG (Organization), FAC (Facility), MHI (Medical/Health Issue), WEA (Weapon), SID (Side), COM (Commodity), CRM (Crime), LAW (Law), VEH (Vehicle), and BAL (Ballot).</p> <p>There are four inputs to the KB: GPE and LOC entities from&nbsp;<a href="http://www.geonames.org/">GeoNames</a>&nbsp;(GEO), PER entities from the&nbsp;<a href="https://www.cia.gov/resources/world-leaders/">CIA World Leaders List</a>&nbsp;(WLL), ORG entities from&nbsp;<a href="https://www.cia.gov/the-world-factbook/">Appendix B of the CIA World Factbook</a>&nbsp;(APB), and additional entities manually created by LDC. 
The GEO, WLL and APB entries are also found in&nbsp;<a href="../../../LDC2020T10">LORELEI Entity Detection and Linking Knowledge Base (LDC2020T10)</a>.</p> <h3>Acknowledgement</h3> <p>This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013.</p> <h3>Samples</h3> <p>Please view the following samples:</p> <ul> <li><a href="desc/addenda/LDC2023T10.altnames.tab">Alternate Names Sample</a></li> <li><a href="desc/addenda/LDC2023T10.ent.tab">Entities Sample</a></li> <li><a href="desc/addenda/LDC2023T10.ms.tab">Member States Sample</a></li> </ul> <h3>Updates</h3> <p>None at this time.</p>
Provider:
Linguistic Data Consortium
Created:
2023-10-16