
haiderkamal23/allaM-offsec-arabic-chat

Source: Hugging Face. Updated 2025-12-10. Indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/haiderkamal23/allaM-offsec-arabic-chat
Resource description:
---
license: apache-2.0
task_categories:
- question-answering
- text-generation
language:
- ar
- en
tags:
- security
- offensive-security
- penetration-testing
- cybersecurity
- arabic
size_categories:
- 10K<n<100K
---

# Arabic Offensive Security Chat Dataset

Bilingual Arabic/English dataset for training offensive security assistants.

## Dataset Details

- **Training examples:** 18,412
- **Validation examples:** 2,000
- **Total:** 20,412
- **Languages:** Arabic (primary) + English (technical terms)
- **Format:** ChatML (`messages` field)
- **Source:** Filtered and processed from WNT3D/Ultimate-Offensive-Red-Team

## Intended Use

This dataset is designed for fine-tuning models to assist with:

- Vulnerability analysis and explanation
- Attack chain reasoning
- Penetration testing report generation
- OWASP/CWE/MITRE ATT&CK mapping
- Security mitigation recommendations

**IMPORTANT:** For authorized security testing and education only.

## Structure

Each example contains:

```json
{
  "messages": [
    {"role": "system", "content": "Arabic security assistant persona"},
    {"role": "user", "content": "Arabic instruction + English security scenario"},
    {"role": "assistant", "content": "Arabic security analysis"}
  ]
}
```

## Usage

```python
from datasets import load_dataset

dataset = load_dataset("haiderkamal23/allaM-offsec-arabic-chat")
```

## License

Apache 2.0 - For authorized security research and testing only.
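
## Checking the example structure

The three-message layout described under Structure can be checked before fine-tuning. The snippet below is a minimal sketch, not part of the dataset's tooling: the helper name `is_valid_example` and the strict system/user/assistant ordering are assumptions drawn from the single JSON example in this card, and the sample record reuses the card's placeholder strings rather than real data.

```python
from __future__ import annotations

from typing import Any

# Role order taken from the JSON example in this card (assumed to hold for every record).
EXPECTED_ROLES = ("system", "user", "assistant")


def is_valid_example(example: dict[str, Any]) -> bool:
    """Return True if `example` has exactly the three-message layout shown in the card."""
    messages = example.get("messages")
    if not isinstance(messages, list) or len(messages) != len(EXPECTED_ROLES):
        return False
    for message, role in zip(messages, EXPECTED_ROLES):
        if not isinstance(message, dict) or message.get("role") != role:
            return False
        content = message.get("content")
        # Require non-empty string content for every turn.
        if not isinstance(content, str) or not content.strip():
            return False
    return True


# Placeholder record copied from the card's Structure section (not real data).
sample = {
    "messages": [
        {"role": "system", "content": "Arabic security assistant persona"},
        {"role": "user", "content": "Arabic instruction + English security scenario"},
        {"role": "assistant", "content": "Arabic security analysis"},
    ]
}

# A record with a missing assistant turn, used to show a rejection.
truncated = {"messages": sample["messages"][:2]}

print(is_valid_example(sample))
print(is_valid_example(truncated))
```

After loading the dataset as shown under Usage, the same function could be passed to `Dataset.filter` on a split (for example `dataset["train"].filter(is_valid_example)`); the split name `train` is an assumption, since the card does not list split names.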
Provided by:
haiderkamal23