FewEvent
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/231sm/Low_Resource_KBP
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为FewEvent,它融合了两个广泛使用的事件检测数据集:ACE-2005语料库和TAC-KBP-2017事件追踪数据。此外,该数据集还加入了特定领域如音乐、电影、体育和教育的外部事件类型。针对少样本设置,该数据集包含了88种事件类型,其中68种用于训练,10种用于验证,另外10种用于测试。值得注意的是,这些不同集合之间的事件类型不存在重叠。在规模上,该数据集为19种事件类型提供了70,852个样本,并为实验数据提供了15,681个样本。该数据集的任务是少样本事件检测。
The dataset named FewEvent integrates two widely used event detection datasets: the ACE-2005 corpus and the TAC-KBP-2017 Event Tracking data. Additionally, this dataset incorporates external event types from specific domains including music, film, sports, and education. For the few-shot setting, this dataset encompasses 88 event types, with 68 allocated for training, 10 for validation, and the remaining 10 for testing. Notably, there is no overlap of event types across these different splits. In terms of scale, this dataset provides 70,852 samples for 19 event types and 15,681 samples for experimental data. This dataset is designed for the few-shot event detection task.



