NaifAlzanki/kwcyber-ai-agent-dataset-v4
Hugging Face, updated 2026-03-05, indexed 2026-03-29
Download link:
https://hf-mirror.com/datasets/NaifAlzanki/kwcyber-ai-agent-dataset-v4
Description:
---
language:
- ar
- en
tags:
- cybersecurity
- rlhf
- dpo
- ctf
- malware-analysis
- arabic
- bilingual
- kwcyber
license: apache-2.0
size_categories:
- 1M<n<10M
---
# 🔐 kwcyber-ai-agent Dataset v4
**By [NaifAlzanki](https://huggingface.co/NaifAlzanki)**
A comprehensive bilingual (Arabic + English) cybersecurity dataset for training an AI agent.
## 📊 Statistics
- **Total:** 387,971 rows
- **Languages:** Arabic + English
- **Categories:** cybersecurity, ctf, general
## 📦 Sources
| Source | Category | Size |
|---|---|---|
| Anthropic HH-RLHF | General | ~170K |
| UltraFeedback (Argilla) | General | ~200K |
| OpenAssistant OASST2 | General | ~161K |
| Stanford SHP | General | ~385K |
| Cybersecurity Datasets | Cyber | ~50K+ |
| CTF SaTML24 | CTF | ~137K |
| Arabic Translation | Arabic | ~50K |
## 🚀 Usage
```python
from datasets import load_dataset

# Download the dataset from the Hugging Face Hub
ds = load_dataset("NaifAlzanki/kwcyber-ai-agent-dataset-v4")

# Arabic rows only
ar = ds["train"].filter(lambda x: x["lang"] == "ar")

# Cybersecurity rows only
cyber = ds["train"].filter(lambda x: x["category"] == "cybersecurity")
```
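Before training, it can help to check how rows are distributed across languages and categories, and to combine both conditions in one pass. The following is a minimal sketch that assumes only the `lang` and `category` columns used in the usage snippet; the helper name `count_by` and the sample rows are hypothetical and stand in for the real split.

```python
from collections import Counter


def count_by(rows, *keys):
    """Count rows per combination of the given column names."""
    return Counter(tuple(row[k] for k in keys) for row in rows)


# Hypothetical sample rows standing in for ds["train"]; only the
# "lang" and "category" column names come from the dataset card.
sample = [
    {"lang": "ar", "category": "cybersecurity"},
    {"lang": "en", "category": "cybersecurity"},
    {"lang": "ar", "category": "general"},
    {"lang": "en", "category": "ctf"},
]

# Number of rows for each (category, lang) pair
counts = count_by(sample, "category", "lang")

# Rows that are both Arabic and cybersecurity, selected in a single pass
ar_cyber = [
    r for r in sample
    if r["lang"] == "ar" and r["category"] == "cybersecurity"
]
```

On the loaded dataset, the same condition can be passed to `ds["train"].filter(...)` as one lambda. `count_by(ds["train"], "category", "lang")` should also work, since iterating a split yields one dict per row.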
## 🎯 Suitable for
- DPO / RLHF Training
- SFT Fine-tuning
- Reward Model Training
- kwcyber-ai-agent 🔐
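
DPO and reward-model training usually work on triples of a prompt, a preferred response, and a rejected response. The card does not list the dataset's preference columns, so the sketch below treats the column names `prompt`, `chosen`, and `rejected` as assumptions to adjust after inspecting the real schema. The helper `to_dpo_record` is hypothetical.

```python
def to_dpo_record(row, prompt_key="prompt", chosen_key="chosen",
                  rejected_key="rejected"):
    """Map one row to a prompt/chosen/rejected dict, or None if unusable.

    The default key names are assumptions, not the documented schema.
    """
    try:
        prompt = row[prompt_key]
        chosen = row[chosen_key]
        rejected = row[rejected_key]
    except KeyError:
        return None  # row lacks one of the expected columns
    # Skip empty fields and pairs that carry no preference signal
    if not prompt or not chosen or not rejected or chosen == rejected:
        return None
    return {"prompt": prompt, "chosen": chosen, "rejected": rejected}


# Hypothetical rows for illustration only
rows = [
    {"prompt": "What is a port scan?", "chosen": "A probe of open ports.",
     "rejected": "A kind of virus."},
    {"prompt": "Define XSS.", "chosen": "Same text", "rejected": "Same text"},
    {"prompt": "No preference pair here"},
]

# Keep only rows that convert cleanly
pairs = [rec for rec in map(to_dpo_record, rows) if rec is not None]
```

Dropping rows where both responses are identical keeps the preference pairs informative. Whether that filter is needed for this dataset is not stated in the card.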
Provider:
NaifAlzanki



