AEUPH/synthetic_Jailbreak_Defense_Doorpage_v59
收藏Hugging Face2026-04-07 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/AEUPH/synthetic_Jailbreak_Defense_Doorpage_v59
下载链接
链接失效反馈官方服务:
资源简介:
---
language: en
license: mit
task_categories:
- text-generation
- question-answering
- text-to-text
size_categories:
- n<1K
format:
- json
modality:
- text
tags:
- synthetic-data
- qwen
- instruction-tuned
- silicon-factory
- mixed
dataset_info:
features:
- name: instruction
dtype: string
- name: response
dtype: string
- name: category
dtype: string
- name: system_prompt
dtype: string
splits:
- name: train
num_bytes: 3260
num_examples: 5
download_size: 3 KB
dataset_size: 3 KB
---
# 📊 Jailbreak Defense Doorpage V59
> **Synthetic Dataset** · Generated with Silicon Factory v3 · **AI JAILBREAK DEFENSE**
> 5 instruction-response pairs · Tree-Speculative Decoding + 4D Brane Memory
<div align="center">
| Dataset | Fine-Tuned Model | Buy Gold Tier |
|---------|-----------------|---------------|
| **This Dataset** | [Model Card](https://huggingface.co/AEUPH/synthetic_Jailbreak_Defense_Doorpage_v59-model) | [💎 $2,500 License](https://buy.stripe.com/3cIcN4gzC7lXfuH49s7wA00) |
</div>
---
## 💎 UNLOCK GOLD TIER — $2,500
> ⚡ **Get the full commercial license, unlimited usage rights, priority support, and exclusive dataset access.**
[**👉 PURCHASE NOW VIA STRIPE**](https://buy.stripe.com/3cIcN4gzC7lXfuH49s7wA00)
*One-time payment · Instant delivery · Lifetime updates included*
---
## Dataset Details
| Property | Value |
|----------|-------|
| **Dataset ID** | `synthetic_Jailbreak_Defense_Doorpage_v59` |
| **Entries** | 5 |
| **Category** | mixed |
| **Focus** | AI JAILBREAK DEFENSE |
| **Avg Instruction Length** | 231 chars |
| **Avg Response Length** | 421 chars |
| **Language** | English |
| **License** | MIT (free tier) — [Gold Commercial License](https://buy.stripe.com/3cIcN4gzC7lXfuH49s7wA00) available |
| **Generated** | 2026-04-07 |
| **Mode** | Doorpage (auto-gen + fine-tune) |
## Description
This dataset contains **5 synthetically generated instruction-response pairs** focused on **ai jailbreak defense**. Generated using the **Silicon Factory v3** pipeline with:
- **Tree-Speculative Decoding** (branch factor=5, depth=4) for diverse outputs
- **4D Brane Memory** for narrative consistency across all entries
- **Quality control** with 0.7 minimum quality threshold
- **Deduplication** with 0.9 max similarity threshold
### What This Dataset Covers
- ✅ High-quality instruction following for **ai jailbreak defense** topics
- ✅ Structured, detailed responses with actionable insights
- ✅ Consistent tone and formatting across outputs
- ✅ Optimized for intermediate-to-expert user queries
## ⚡ GET THE GOLD TIER — FULL COMMERCIAL LICENSE
> 🔓 **Unlock enterprise-grade rights:**
> - Commercial deployment & redistribution
> - White-label usage
> - Priority support & custom training
> - Access to extended datasets (100K+ entries)
> - Early access to future model versions
**[💳 BUY GOLD TIER — $2,500](https://buy.stripe.com/3cIcN4gzC7lXfuH49s7wA00)**
---
## Usage
### Load with HuggingFace Datasets
```python
from datasets import load_dataset
ds = load_dataset("AEUPH/synthetic_Jailbreak_Defense_Doorpage_v59")
print(ds["train"][0])
```
### Load from JSONL
```python
import json
with open("data.jsonl", "r", encoding="utf-8") as f:
entries = [json.loads(line) for line in f]
for entry in entries[:5]:
print(f"Q: {entry['instruction'][:80]}...")
print(f"A: {entry['response'][:120]}...\n")
```
### Fine-Tuning with This Dataset
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import LoraConfig, get_peft_model, TaskType
# Load base model
model_name = "Qwen/Qwen2.5-0.5B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")
# Apply LoRA
peft_config = LoraConfig(
r=16, lora_alpha=16, lora_dropout=0,
task_type=TaskType.CAUSAL_LM,
target_modules=["q_proj","k_proj","v_proj","o_proj","gate_proj","up_proj","down_proj"]
)
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()
# Train on this dataset
# ... (see Silicon Factory pipeline for full training code)
```
---
## Dataset Structure
### Content
| Field | Type | Description |
|-------|------|-------------|
| `instruction` | string | User prompt/question |
| `response` | string | Model-generated answer |
| `category` | string | Topic category |
| `system_prompt` | string | System prompt used |
### Generation Pipeline
```
System Prompt → Dynamic Prompt Generation → Qwen 2.5 0.5B → Quality Filter → Dataset
↓
Tree-Speculative Decoding
4D Brane Memory
```
### Training Examples
**Example 1:**
<details>
<summary>Input</summary>
```
Regarding AI JAILBREAK DEFENSE: Articulate this topic through the lens of modern best practices. Imagine explaining to a non-technical stakeholder mak
```
</details>
<details>
<summary>Output</summary>
```
Artificial intelligence (AI) jailbreak defense is an approach designed to protect a system from attacks that exploit vulnerabilities in the AI component itself. The concept of "jailbreaking" refers to
```
</details>
**Example 2:**
<details>
<summary>Input</summary>
```
Topic: AI JAILBREAK DEFENSE. Walk through this subject through the lens of modern best practices. Peer review of a colleague's approach. Comparative a
```
</details>
<details>
<summary>Output</summary>
```
Welcome to this discussion on AI jailbreak defense. To start, let's consider the nature of modern best practices in protecting your data and ensuring that you have control over access points.
As an A
```
</details>
**Example 3:**
<details>
<summary>Input</summary>
```
Describe the following: AI JAILBREAK DEFENSE -- approach it from a beginner's perspective. Write as documentation for a team inheriting your work. Det
```
</details>
<details>
<summary>Output</summary>
```
---
# AI Jailbreak Defense: A Beginner's Approach
## What is an AI jailbreaker?
An **AI jailbraker** (or simply a "Jail Break") refers to the act of physically breaking into or accessing someone el
```
</details>
---
## 💎 READY TO SCALE?
> **Upgrade to Gold Tier for:**
> - 🏢 Full commercial usage rights
> - 📦 Extended datasets (10K-100K+ entries)
> - 🎯 Custom domain training
> - 🚀 Priority support & SLA
> - 🔄 Lifetime model updates
> - 📊 Performance benchmarks & reports
**[⚡ BUY GOLD TIER — $2,500](https://buy.stripe.com/3cIcN4gzC7lXfuH49s7wA00)**
*Trusted by startups and enterprises worldwide. Instant delivery via Stripe.*
---
## Citation
### BibTeX
```bibtex
@misc{synthetic_Jailbreak_Defense_Doorpage_v59_dataset,
title = {synthetic Jailbreak Defense Doorpage v59},
author = {Silicon Factory v3 (AEUPH)},
year = {2026},
url = {https://huggingface.co/datasets/AEUPH/synthetic_Jailbreak_Defense_Doorpage_v59},
note = {Synthetic dataset generated using Tree-Speculative Decoding and 4D Brane Memory}
}
```
### APA
> Silicon Factory v3. (2026). *Synthetic Jailbreak Defense Doorpage V59* [Dataset]. Hugging Face. https://huggingface.co/datasets/AEUPH/synthetic_Jailbreak_Defense_Doorpage_v59
---
## More Information
| Resource | Link |
|----------|------|
| **Fine-Tuned Model** | [synthetic_Jailbreak_Defense_Doorpage_v59-model](https://huggingface.co/AEUPH/synthetic_Jailbreak_Defense_Doorpage_v59-model) |
| **Base Model** | [Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) |
| **Silicon Factory** | [github.com/aeuphoraex/qwen-hyperspeed-chatbot](https://github.com/aeuphoraex/qwen-hyperspeed-chatbot) |
## Dataset Authors
**Silicon Factory v3** — Automated Dataset Generation Pipeline
## Contact
📧 hybridionorb@gmail.com · 🐦 [@aeuphoraex](https://huggingface.co/AEUPH)
---
*Built with Silicon Factory v3 · Tree-Speculative Decoding · 4D Brane Memory*
*This dataset is free under MIT License. [Gold Commercial License available for $2,500.](https://buy.stripe.com/3cIcN4gzC7lXfuH49s7wA00)*
提供机构:
AEUPH



