ACE 05
Source: arXiv. Indexed 2025-09-30.
Download link:
https://www.ldc.upenn.edu/sites/www.ldc.upenn.edu/files/english-relations-guidelines-v5.8.3.pdf
Resource overview:
ACE 05 is a widely used benchmark dataset in natural language processing for entity recognition, relation extraction, and event extraction. It comprises 599 documents annotated with entities, relations, and events. The dataset is accompanied by a preprocessing pipeline that simplifies access to metadata, which in turn supports further tasks such as document-level relation extraction and coreference resolution. It is primarily targeted at relation classification.
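To make the relation classification setting concrete, the sketch below shows one way an annotated sentence could be turned into classification instances. It is an illustration under stated assumptions, not the dataset's distribution format or its preprocessing pipeline: the `Entity` and `Relation` classes, the `relation_instances` helper, the `NONE` label, and the example sentence are all hypothetical. Only the type labels (`PER`, `ORG`, `GPE`, `ORG-AFF`) are taken from the ACE 2005 inventory.

```python
from dataclasses import dataclass
from itertools import permutations


@dataclass(frozen=True)
class Entity:
    """Hypothetical entity mention: token span [start, end) plus a type label."""
    id: str
    type: str
    start: int
    end: int


@dataclass(frozen=True)
class Relation:
    """Hypothetical directed relation between two entity mentions."""
    head: str
    tail: str
    type: str


def relation_instances(entities, relations, no_relation="NONE"):
    """Enumerate every ordered entity pair and attach its gold label.

    Pairs without an annotated relation receive the `no_relation` label,
    which is an assumption of this sketch rather than an ACE 05 category.
    """
    gold = {(r.head, r.tail): r.type for r in relations}
    return [
        (h.id, t.id, gold.get((h.id, t.id), no_relation))
        for h, t in permutations(entities, 2)
    ]


# Invented example sentence; not taken from the corpus.
tokens = "Smith works for Acme in Boston".split()
entities = [
    Entity("e1", "PER", 0, 1),  # Smith
    Entity("e2", "ORG", 3, 4),  # Acme
    Entity("e3", "GPE", 5, 6),  # Boston
]
relations = [Relation("e1", "e2", "ORG-AFF")]

instances = relation_instances(entities, relations)
for head, tail, label in instances:
    print(head, tail, label)
```

Enumerating ordered pairs keeps the direction of each relation explicit. A classifier trained on such instances has to handle the large share of unrelated pairs, which is why a negative label is included here.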
Provided by:
United States government (Automated Content Extraction initiative)
Collected summary
Dataset introduction

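Background and challenges
Background overview
ACE 05 is a benchmark dataset in natural language processing. It contains 599 documents annotated with entities, relations, and events, and is suited to entity recognition, event extraction, and relation extraction. The dataset also provides a preprocessing pipeline that supports document-level relation extraction and coreference resolution, and it is primarily targeted at relation classification. Its distinguishing features are broad applicability and convenient access to metadata.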
The above content was collected and summarized by 遇见数据集.



