roseking/openclaw-exposure-dataset
收藏Hugging Face2026-03-04 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/roseking/openclaw-exposure-dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-sa-4.0
task_categories:
- tabular-classification
tags:
- security
- openclaw
- exposure
- cybersecurity
- threat-intel
language:
- en
pretty_name: OpenClaw Exposure Watchboard Dataset (Sanitized)
size_categories:
- 100K<n<1M
---
🇨🇳 [查看中文版 README](./README_ZH.md)
---
> ⚠️ **Unofficial Dataset** — This dataset is **NOT** affiliated with, endorsed by,
> or officially released by `openclaw.allegro.earth`. It is an independent research snapshot.
---
# OpenClaw Exposure Watchboard Dataset (Sanitized)
## 📌 Data Source & Attribution
> **Original Source**: [OpenClaw Exposure Watchboard](https://openclaw.allegro.earth/)
> All original data is collected and maintained by **openclaw.allegro.earth**.
> Full credit and ownership of the source data belong to the original maintainers.
This dataset is an **unofficial, sanitized snapshot** of publicly accessible data from
[openclaw.allegro.earth](https://openclaw.allegro.earth/) as of **2026-03-04**.
It was collected for **non-commercial security research and educational purposes only**.
---
## 🔐 Copyright Notice
```
OpenClaw Exposure Watchboard Data © openclaw.allegro.earth
Snapshot Date: 2026-03-04
This dataset is an unofficial derivative work. All rights to the original data
remain with openclaw.allegro.earth.
```
The underlying monitored data (IP addresses, CVEs, domain names, ASN info) consists of
factual public information and is generally not subject to copyright. However, the
**compilation and curation** of this dataset may constitute a database right.
We assert no ownership over the original data.
---
## 🛡️ Privacy & Sanitization
All IP addresses have been **sanitized**:
- IPv4 last octet replaced with `xxx` (e.g. `1.2.3.4` → `1.2.3.xxx`)
- `endpoint_url` field follows the same rule
- **No personal data** is included in this dataset
> Note: The `asi_domains` field may contain corporate domain names (e.g. `jd.com`,
> `bytedance.com`). These are included **as factual security metadata only** and do not
> imply any endorsement, accusation, or commercial relationship.
---
## 📊 Dataset Overview
| Item | Value |
|------|-------|
| Record count | 240,561 |
| Snapshot date | 2026-03-04 |
| Fields | 20 |
| Format | CSV (UTF-8 BOM) |
## Fields
| Field | Description |
|-------|-------------|
| `endpoint` | Sanitized IP:Port |
| `endpoint_url` | Sanitized URL |
| `Country` | Country / Region |
| `auth_required` | Whether auth is required |
| `is_active` | Whether the instance is active |
| `has_leaked_creds` | Credential leak status |
| `asn` / `asn_name` | Autonomous System info |
| `org` | Organization / ISP |
| `first_seen` / `last_seen` | Detection timestamps |
| `asi_has_breach` | Breach flag |
| `asi_has_threat_actor` | Threat actor flag |
| `asi_threat_actors` | Associated APT groups |
| `asi_cves` | Associated CVE numbers |
| `asi_enriched_at` | Enrichment timestamp |
| `asi_domains` | Related domains (factual metadata only) |
---
## 📖 License & Usage Terms
This dataset is released under **[CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/)**.
| You are free to | Conditions |
|-----------------|-----------|
| ✅ Share and redistribute | ℹ️ Attribute to this dataset **and** original source |
| ✅ Adapt and transform | 🚫 **Non-commercial use only** |
| | 🔄 Share-alike under same license |
**Prohibited uses:**
- ❌ Commercial exploitation
- ❌ Targeted harassment or attacks against any listed organization
- ❌ Re-identification of sanitized IP addresses
---
## 📥 DMCA & Takedown Requests
If you believe this dataset infringes your rights:
1. **Contact us** via Hugging Face discussion tab on this repository
2. **Contact Hugging Face** at: dmca@huggingface.co
3. **Reference**: [Hugging Face DMCA Policy](https://huggingface.co/dmca)
We will respond promptly and cooperate fully with legitimate takedown requests.
---
## ⚖️ Disclaimer
This dataset is provided **"as-is"** for security awareness and academic research only.
- The contributors are **not affiliated** with `openclaw.allegro.earth`
- We make **no warranty** regarding accuracy or completeness
- Contributors are **not responsible** for any misuse of this information
- Mention of any company or organization does **not** imply security accusations or endorsement
提供机构:
roseking



