five

AIDA Scenario 2 Practice Topic Annotation

收藏
DataCite Commons2025-06-03 更新2024-07-13 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2024T06
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3> <p>AIDA Scenario 2 Practice Topic Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of annotations for 29 English, Russian and Spanish web documents (text, image and video) from <a href="../../../LDC2024T04">AIDA Scenario 2 Practice Topic Source Data (LDC2024T04)</a>.</p> <p>The DARPA AIDA (Active Interpretation of Disparate Alternatives) program aimed to develop a multi-hypothesis semantic engine to generate explicit alternative interpretations of events, situations and trends from a variety of unstructured sources. LDC supported AIDA by collecting, creating and annotating multimodal linguistic resources in multiple languages.</p> <p>Each phase of the AIDA program centered on a specific scenario, or broad topic area, with related subtopics designated as either practice subtopics or evaluation subtopics. The Phase 2 scenario focused on the socioeconomic and political crisis in Venezuela since 2010. This corpus contains annotations for the set of practice documents designated for annotation in Phase 2.</p> <h3>Data</h3> <p>Annotations are presented as tab separated files in the following categories for each topic.</p> <ul> <li>Mentions: single references in source data to a real-world entity or filler, event, or relation. There are three mentions tables for each topic, one for entities and fillers, one for relations, and one for events.</li> <li>Slots: pre-defined roles in an event or relation filled by an argument (entity mention). There are two slots tables per topic, one for relations and one for events.</li> <li>Linking: entity mentions "linked" to entries in the knowledge base as a method of indicating the real-world entity to which an entity referred.</li> </ul> <h3>Sponsorship</h3> <p>This material is based upon work supported by Air Force Research Laboratory (AFRL) and the Defense Advanced Research Projects Agency (DARPA) under Contract No. FA8750-18-C-0013.</p>
提供机构:
Linguistic Data Consortium
创建时间:
2024-06-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作