Data Collection & Requirements
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/14976796
Resource description:
Open-Source Cybersecurity and AI Security Datasets
This project collects open-source datasets covering cybersecurity threats and AI security vulnerabilities. The datasets were selected to align with specific security threats, such as:
Data Exfiltration
Data Poisoning
Model Manipulation
Adversarial Examples
Model Inversion
Model Extraction
Spoofing Attacks
Unauthorized Access
Supply Chain Compromise
Dataset Collection
Each dataset includes a detailed description, source type, purpose, and direct access links for easy retrieval.
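The entries in this list share one plain-text layout: a title line followed by labeled lines (Access Here, Description, Format, Update Frequency, Use Cases). As a minimal sketch, assuming that layout holds for every entry, the list could be read into structured records as shown here. The `DatasetEntry` class and `parse_entries` function are hypothetical names introduced for illustration; they are not part of the project.

```python
from dataclasses import dataclass, field

# Field labels as they appear in this list. The record layout itself is an
# illustrative assumption, not something the project ships.
FIELD_LABELS = ("Access Here", "Description", "Format", "Update Frequency", "Use Cases")


@dataclass
class DatasetEntry:
    name: str
    fields: dict = field(default_factory=dict)


def parse_entries(text: str) -> list:
    """Split a plain-text catalog into entries: a title line, then 'Label: value' lines."""
    entries = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        label, sep, value = line.partition(": ")
        if sep and label in FIELD_LABELS:
            if entries:  # ignore field lines that appear before any title
                entries[-1].fields[label] = value.strip()
        else:
            # Any line that is not a known field starts a new entry.
            entries.append(DatasetEntry(name=line))
    return entries


SAMPLE = """\
DARPA Intrusion Detection Data Sets
Access Here: https://archive.ll.mit.edu/ideval/data/
Description: Simulated network traffic with intrusion scenarios.
Format: PCAP files
Update Frequency: Static
Use Cases: IDS Training
National Vulnerability Database (NVD)
Access Here: https://nvd.nist.gov/
Description: CVEs with severity scores and descriptions.
Format: XML/JSON
Update Frequency: Daily
Use Cases: Vulnerability Management
"""

if __name__ == "__main__":
    for entry in parse_entries(SAMPLE):
        print(entry.name, "|", entry.fields["Format"], "|", entry.fields["Update Frequency"])
```

Keeping the fields in a dictionary rather than fixed attributes lets an entry with a missing label still parse; a stricter version could validate that all five labels are present.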
Comprehensive, Multi-Source Cyber-Security Events
Access Here: https://csr.lanl.gov/data/
Description: 58 days of de-identified LANL network data (authentication, process events, DNS, network flow, red team).
Format: Text files
Update Frequency: Static
Use Cases: Cybersecurity Event Analysis
DARPA Intrusion Detection Data Sets
Access Here: https://archive.ll.mit.edu/ideval/data/
Description: Simulated network traffic with intrusion scenarios.
Format: PCAP files
Update Frequency: Static
Use Cases: IDS Training
MITRE ATT&CK Framework Data
Access Here: https://attack.mitre.org/
Description: Adversary TTPs (tactics, techniques, procedures) in a globally accessible knowledge base.
Format: JSON/STIX
Update Frequency: Quarterly
Use Cases: Threat Intelligence
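For the JSON/STIX format noted above, a minimal sketch of listing techniques from a STIX bundle might look like the following. It assumes the usual ATT&CK convention that techniques are `attack-pattern` objects whose ATT&CK ID sits in an external reference with `source_name` set to `mitre-attack` (the Enterprise domain). The bundle below is hand-written with placeholder values, not real ATT&CK content, and `list_techniques` is a hypothetical helper name.

```python
import json

# Tiny hand-written bundle in the STIX 2.x shape. All names and IDs are
# placeholders for illustration, not real ATT&CK entries.
SAMPLE_BUNDLE = json.dumps({
    "type": "bundle",
    "id": "bundle--00000000-0000-4000-8000-000000000000",
    "objects": [
        {
            "type": "attack-pattern",
            "name": "Example Technique",
            "external_references": [
                {"source_name": "mitre-attack", "external_id": "T0000"}
            ],
        },
        {"type": "intrusion-set", "name": "Example Group"},
        {
            "type": "attack-pattern",
            "name": "Retired Technique",
            "revoked": True,
            "external_references": [
                {"source_name": "mitre-attack", "external_id": "T0001"}
            ],
        },
    ],
})


def list_techniques(bundle_json: str) -> dict:
    """Map technique IDs to names, skipping non-technique and revoked objects."""
    bundle = json.loads(bundle_json)
    techniques = {}
    for obj in bundle.get("objects", []):
        if obj.get("type") != "attack-pattern" or obj.get("revoked"):
            continue
        for ref in obj.get("external_references", []):
            if ref.get("source_name") == "mitre-attack" and "external_id" in ref:
                techniques[ref["external_id"]] = obj.get("name", "")
    return techniques


if __name__ == "__main__":
    print(list_techniques(SAMPLE_BUNDLE))
```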
National Vulnerability Database (NVD)
Access Here: https://nvd.nist.gov/
Description: CVEs with severity scores and descriptions.
Format: XML/JSON
Update Frequency: Daily
Use Cases: Vulnerability Management
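As a rough illustration of working with the JSON format, the sketch below pulls the ID, CVSS v3 score, severity, and English description out of a response shaped like the NVD CVE API 2.0 output. The nesting (`vulnerabilities`, `cve`, `metrics`, `cvssMetricV31`, `cvssData`) is an assumption about that schema and should be checked against the current NVD documentation. The sample records are placeholders, not real CVEs, and `summarize` is a hypothetical helper name.

```python
import json

# Placeholder response in the assumed CVE API 2.0 shape; not real CVE data.
SAMPLE_RESPONSE = json.dumps({
    "vulnerabilities": [
        {"cve": {
            "id": "CVE-0000-0001",
            "descriptions": [{"lang": "en", "value": "Placeholder description."}],
            "metrics": {"cvssMetricV31": [
                {"cvssData": {"baseScore": 9.8, "baseSeverity": "CRITICAL"}}
            ]},
        }},
        {"cve": {
            "id": "CVE-0000-0002",
            "descriptions": [{"lang": "en", "value": "Not yet scored."}],
            "metrics": {},
        }},
    ]
})


def summarize(response_json: str) -> list:
    """Return (id, base score, severity, English description) per CVE record."""
    rows = []
    for item in json.loads(response_json).get("vulnerabilities", []):
        cve = item.get("cve", {})
        score, severity = None, None
        # Prefer CVSS v3.1 and fall back to v3.0; unscored records keep None.
        for key in ("cvssMetricV31", "cvssMetricV30"):
            metrics = cve.get("metrics", {}).get(key, [])
            if metrics:
                data = metrics[0].get("cvssData", {})
                score, severity = data.get("baseScore"), data.get("baseSeverity")
                break
        text = next(
            (d.get("value", "") for d in cve.get("descriptions", []) if d.get("lang") == "en"),
            "",
        )
        rows.append((cve.get("id"), score, severity, text))
    return rows


if __name__ == "__main__":
    for row in summarize(SAMPLE_RESPONSE):
        print(row)
```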
LANL Unified Host and Network Dataset
Access Here: https://csr.lanl.gov/data/
Description: Enterprise-scale dataset with network and host logs, including real-world red-team attacks.
Format: Text files
Update Frequency: Static
Use Cases: Insider Threat Detection
CIC-IDS2017 (Intrusion Detection Dataset)
Access Here: https://www.unb.ca/cic/datasets/ids-2017.html
Description: Network traffic dataset with multiple attack types (DDoS, brute-force, infiltration).
Format: PCAP, CSV
Update Frequency: Static
Use Cases: Intrusion Detection
CIC IoV CAN Bus Dataset 2024
Access Here: https://www.unb.ca/cic/datasets/
Description: Vehicle CAN bus data, including spoofing and DoS attack traces.
Format: CSV, PCAP
Update Frequency: Static
Use Cases: Automotive Security
ASVspoof 2019 (Voice Spoofing Dataset)
Access Here: https://datashare.ed.ac.uk/handle/10283/3336
Description: Evaluates automatic speaker verification systems under spoofing attacks.
Format: WAV files
Update Frequency: Static
Use Cases: Voice Security
ToN_IoT Datasets
Access Here: https://research.unsw.edu.au/projects/toniot-datasets
Description: Federated IoT data sources, including telemetry, OS logs, and network traffic.
Format: CSV, JSON
Update Frequency: Ongoing
Use Cases: Threat Intelligence
ADFA Intrusion Detection Datasets
Access Here: https://research.unsw.edu.au/projects/adfa-ids-datasets
Description: Host-based intrusion detection datasets for Windows and Linux.
Format: CSV, JSON
Update Frequency: Static
Use Cases: Host Intrusion Detection
Security Datasets Project
Access Here: https://github.com/OTRF/Security-Datasets
Description: A community-driven initiative sharing security datasets for research.
Format: JSON, CSV
Update Frequency: Ongoing
Use Cases: Threat Intelligence
CIC-BCCC-NRC Tabular IoT Attack Dataset (2024)
Access Here: https://www.yorku.ca/research/bccc/ucs-technical/cybersecurity-datasets-cds/
Description: A comprehensive IoT network attack dataset for AI-based cybersecurity research.
Format: CSV
Update Frequency: Ongoing
Use Cases: IoT Security
Awesome Cybersecurity Datasets
Access Here: https://github.com/shramos/Awesome-Cybersecurity-Datasets
Description: A curated list of publicly available datasets for cybersecurity research.
Format: Varies
Update Frequency: Varies by dataset
Use Cases: General Cybersecurity Research
ANISENSE Datasets
This curated list provides open-source datasets focused on autonomous driving, AI security, and related research domains.
CarlaScenes Dataset
Access Here: https://github.com/CarlaScenes/CarlaSence
Description: Synthetic dataset for odometry in autonomous driving using the CARLA simulator.
Format: PNG, PLY, BAG
Update Frequency: Ongoing
Use Cases: Autonomous Driving, SLAM, Odometry, AI Security
Realistic Vehicle Trajectories using CARLA
Access Here: https://ieee-dataport.org/documents/realistic-vehicle-trajectories-and-driving-parameters-carla-autonomous-driving-simulator
Description: Realistic vehicle trajectories in CARLA’s simulated urban environments.
Format: CSV, JSON
Update Frequency: Ongoing
Use Cases: Autonomous Driving, Traffic Simulation, AI Security
KITTI Vision Benchmark Suite
Access Here: http://www.cvlibs.net/datasets/kitti/
Description: Comprehensive dataset for training ML models in stereo, odometry, 3D detection, and segmentation.
Format: PNG, BIN, TXT
Update Frequency: Ongoing
Use Cases: Autonomous Driving, 3D Object Detection, Semantic Segmentation
OUSTER Dataset
Access Here: https://ouster.com/downloads/sample-lidar-data
Description: LiDAR data captured by Ouster sensors for super-resolution and 3D mapping.
Format: Point Clouds
Update Frequency: Ongoing
Use Cases: LiDAR Super-Resolution, 3D Mapping, AI Security
SemanticPOSS
Access Here: http://www.poss.pku.edu.cn/
Description: LiDAR dataset with instance-level annotations for dynamic objects (pedestrians, riders, vehicles).
Format: PCD, PNG
Update Frequency: Static
Use Cases: 3D Semantic Segmentation, Autonomous Driving, AI Security
Created:
2025-03-22