
roseking/openclaw-exposure-dataset

Hugging Face · Updated 2026-03-04 · Indexed 2026-03-29
Download link:
https://hf-mirror.com/datasets/roseking/openclaw-exposure-dataset
Resource description:
---
license: cc-by-nc-sa-4.0
task_categories:
- tabular-classification
tags:
- security
- openclaw
- exposure
- cybersecurity
- threat-intel
language:
- en
pretty_name: OpenClaw Exposure Watchboard Dataset (Sanitized)
size_categories:
- 100K<n<1M
---

🇨🇳 [View the Chinese README](./README_ZH.md)

---

> ⚠️ **Unofficial Dataset**: This dataset is **NOT** affiliated with, endorsed by,
> or officially released by `openclaw.allegro.earth`. It is an independent research snapshot.

---

# OpenClaw Exposure Watchboard Dataset (Sanitized)

## 📌 Data Source & Attribution

> **Original Source**: [OpenClaw Exposure Watchboard](https://openclaw.allegro.earth/)
> All original data is collected and maintained by **openclaw.allegro.earth**.
> Full credit and ownership of the source data belong to the original maintainers.

This dataset is an **unofficial, sanitized snapshot** of publicly accessible data from [openclaw.allegro.earth](https://openclaw.allegro.earth/) as of **2026-03-04**. It was collected for **non-commercial security research and educational purposes only**.

---

## 🔐 Copyright Notice

```
OpenClaw Exposure Watchboard Data
© openclaw.allegro.earth
Snapshot Date: 2026-03-04

This dataset is an unofficial derivative work.
All rights to the original data remain with openclaw.allegro.earth.
```

The underlying monitored data (IP addresses, CVEs, domain names, ASN info) consists of factual public information and is generally not subject to copyright. However, the **compilation and curation** of this dataset may constitute a database right. We assert no ownership over the original data.

---

## 🛡️ Privacy & Sanitization

All IP addresses have been **sanitized**:

- IPv4 last octet replaced with `xxx` (e.g. `1.2.3.4` → `1.2.3.xxx`)
- `endpoint_url` field follows the same rule
- **No personal data** is included in this dataset

> Note: The `asi_domains` field may contain corporate domain names (e.g. `jd.com`,
> `bytedance.com`). These are included **as factual security metadata only** and do not
> imply any endorsement, accusation, or commercial relationship.

---

## 📊 Dataset Overview

| Item | Value |
|------|-------|
| Record count | 240,561 |
| Snapshot date | 2026-03-04 |
| Fields | 20 |
| Format | CSV (UTF-8 BOM) |

## Fields

| Field | Description |
|-------|-------------|
| `endpoint` | Sanitized IP:Port |
| `endpoint_url` | Sanitized URL |
| `Country` | Country / Region |
| `auth_required` | Whether auth is required |
| `is_active` | Whether the instance is active |
| `has_leaked_creds` | Credential leak status |
| `asn` / `asn_name` | Autonomous System info |
| `org` | Organization / ISP |
| `first_seen` / `last_seen` | Detection timestamps |
| `asi_has_breach` | Breach flag |
| `asi_has_threat_actor` | Threat actor flag |
| `asi_threat_actors` | Associated APT groups |
| `asi_cves` | Associated CVE numbers |
| `asi_enriched_at` | Enrichment timestamp |
| `asi_domains` | Related domains (factual metadata only) |

---

## 📖 License & Usage Terms

This dataset is released under **[CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/)**.

| You are free to | Conditions |
|-----------------|-----------|
| ✅ Share and redistribute | ℹ️ Attribute to this dataset **and** original source |
| ✅ Adapt and transform | 🚫 **Non-commercial use only** |
| | 🔄 Share-alike under same license |

**Prohibited uses:**

- ❌ Commercial exploitation
- ❌ Targeted harassment or attacks against any listed organization
- ❌ Re-identification of sanitized IP addresses

---

## 📥 DMCA & Takedown Requests

If you believe this dataset infringes your rights:

1. **Contact us** via Hugging Face discussion tab on this repository
2. **Contact Hugging Face** at: dmca@huggingface.co
3. **Reference**: [Hugging Face DMCA Policy](https://huggingface.co/dmca)

We will respond promptly and cooperate fully with legitimate takedown requests.

---

## ⚖️ Disclaimer

This dataset is provided **"as-is"** for security awareness and academic research only.

- The contributors are **not affiliated** with `openclaw.allegro.earth`
- We make **no warranty** regarding accuracy or completeness
- Contributors are **not responsible** for any misuse of this information
- Mention of any company or organization does **not** imply security accusations or endorsement
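
The README states two mechanical details that are easy to get wrong when working with the file: the CSV is encoded as UTF-8 with a BOM, and IPv4 addresses have their last octet replaced with `xxx`. The following is a minimal Python sketch of both, not the maintainers' actual tooling. The helper names `sanitize_ipv4` and `parse_rows` are hypothetical, the regular expression is only one possible way to apply the stated rule, and the sample row (including its port number) is invented for illustration.

```python
import csv
import io
import re

# Matches a dotted-quad IPv4 address and captures its first three octets.
# Assumption: this regex is our own reading of the README's rule, not the
# code that was used to produce the dataset.
_IPV4 = re.compile(r"\b(\d{1,3}\.\d{1,3}\.\d{1,3})\.\d{1,3}\b")


def sanitize_ipv4(text: str) -> str:
    """Replace the last octet of every IPv4 address in `text` with 'xxx'."""
    return _IPV4.sub(r"\1.xxx", text)


def parse_rows(raw: bytes) -> list[dict[str, str]]:
    """Decode BOM-prefixed UTF-8 CSV bytes into a list of row dicts.

    Decoding with 'utf-8-sig' drops the leading BOM, so the first header
    is 'endpoint' rather than '\\ufeffendpoint'.
    """
    text = raw.decode("utf-8-sig")
    return list(csv.DictReader(io.StringIO(text, newline="")))


if __name__ == "__main__":
    # Invented sample mimicking the documented format (BOM + two columns).
    sample = b"\xef\xbb\xbfendpoint,Country\r\n1.2.3.xxx:8080,US\r\n"
    for row in parse_rows(sample):
        print(row["endpoint"], row["Country"])
    print(sanitize_ipv4("http://1.2.3.4:8080/"))
```

To read the real file, pass its bytes to `parse_rows`, for example `parse_rows(open(path, "rb").read())`, where `path` is wherever you saved the CSV; the README does not name the file. Values that are already sanitized, such as `1.2.3.xxx`, are left unchanged by `sanitize_ipv4` because the pattern only matches four numeric octets.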
Provided by:
roseking