QCRI/AZERG-Dataset
收藏Hugging Face2025-09-03 更新2025-08-09 收录
下载链接:
https://hf-mirror.com/datasets/QCRI/AZERG-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
AZERG数据集是一个全面的注释型网络安全威胁情报(CTI)报告集合,旨在用于训练和评估STIX实体和关系提取的模型。该数据集由141份真实世界的威胁分析报告构建而成,包含4011个STIX实体和2075个STIX关系。它是为了解决自动化STIX报告生成训练数据的缺乏而策划的,并支持威胁情报提取的多任务方法。数据集的抽取过程分为四个顺序子任务:实体检测、实体类型识别、相关对检测和关系类型识别。
The AZERG-Dataset is a comprehensive collection of annotated cyber threat intelligence (CTI) reports designed for training and evaluating models on STIX entity and relationship extraction. It is constructed from 141 real-world threat analysis reports and contains 4,011 STIX entities and 2,075 STIX relationships. The dataset was curated to address the lack of training data for automated STIX report generation and supports a multi-task approach to threat intelligence extraction. The extraction process is divided into four sequential subtasks: Entity Detection, Entity Type Identification, Related Pair Detection, and Relationship Type Identification.
提供机构:
QCRI



