Spanish Abstract Meaning Representation
Source: arXiv | Updated: 2022-04-16 | Indexed: 2024-08-06
Download link:
http://arxiv.org/abs/2204.07663v1
Description:
The Spanish Abstract Meaning Representation (AMR) dataset was created jointly by Georgetown University and Saarland University to provide a comprehensive AMR annotation framework for Spanish. It contains 586 manually annotated AMRs covering 486 unique sentences, selected from the "Abstract Meaning Representation 2.0 - Four Translations" corpus and drawn mainly from the news domain. To build the dataset, the research team used the Spanish rolesets from the AnCora-Net lexicon and extended English AMR to better capture the semantic features of Spanish. Intended uses include evaluating AMR parsing and generation, and studying the completeness of cross-lingual AMR parsing.
Provided by:
Georgetown University
Created:
2022-04-16



