hubin/OmniCompliance100K
Hugging Face | Updated: 2026-04-16 | Indexed: 2026-04-26
Download link:
https://hf-mirror.com/datasets/hubin/OmniCompliance100K
Resource description:
---
dataset_info:
  features:
  - name: example_name
    dtype: string
  - name: example_background
    dtype: string
  - name: process_and_outcome
    dtype: string
  - name: source_rule
    dtype: string
  - name: relation_to_rule
    dtype: string
  - name: involved_parties
    dtype: string
  - name: relevant_rules
    dtype: string
  - name: dates
    dtype: string
  splits:
  - name: gdpr
    num_bytes: 6725299
    num_examples: 3759
  - name: edu_discrimination_us_edu_dept
    num_bytes: 199448
    num_examples: 146
  - name: edu_academic_integrity
    num_bytes: 423091
    num_examples: 414
  - name: edu_online_learning
    num_bytes: 888497
    num_examples: 757
  - name: foundation_rights
    num_bytes: 2224603
    num_examples: 1822
  - name: sb35
    num_bytes: 1567067
    num_examples: 1244
  - name: hipaa
    num_bytes: 9010824
    num_examples: 6924
  - name: eu_ai_act
    num_bytes: 12110923
    num_examples: 7758
  - name: data_act
    num_bytes: 7077952
    num_examples: 4415
  - name: ccpa
    num_bytes: 3270074
    num_examples: 2489
  - name: chinese_law_cybersecurity
    num_bytes: 1411252
    num_examples: 1051
  - name: chinese_law_deep_synthesis
    num_bytes: 548254
    num_examples: 443
  - name: chinese_law_data_security
    num_bytes: 643268
    num_examples: 486
  - name: chinese_law_generative_ai_law
    num_bytes: 431369
    num_examples: 355
  - name: chinese_law_personal_info_protection
    num_bytes: 1401049
    num_examples: 1151
  - name: medical
    num_bytes: 11776549
    num_examples: 8897
  - name: policy_google
    num_bytes: 6125391
    num_examples: 4855
  - name: policy_wechat
    num_bytes: 3547201
    num_examples: 2779
  - name: policy_github
    num_bytes: 11997664
    num_examples: 9159
  - name: policy_reddit
    num_bytes: 8703713
    num_examples: 6785
  - name: policy_x
    num_bytes: 1561484
    num_examples: 1143
  - name: policy_openai
    num_bytes: 1460379
    num_examples: 1091
  - name: finance_eletric_momey
    num_bytes: 1505440
    num_examples: 1117
  - name: finance_crypto
    num_bytes: 2657162
    num_examples: 1969
  - name: finance_anti_laundering_and_terrorist
    num_bytes: 6509419
    num_examples: 4875
  - name: finance_cross_border_payment_law
    num_bytes: 865737
    num_examples: 619
  - name: cybersecurity_mitre_attack
    num_bytes: 585075
    num_examples: 513
  download_size: 35553713
  dataset_size: 105228184
configs:
- config_name: default
  data_files:
  - split: gdpr
    path: data/gdpr-*
  - split: sb35
    path: data/sb35-*
  - split: hipaa
    path: data/hipaa-*
  - split: eu_ai_act
    path: data/eu_ai_act-*
  - split: data_act
    path: data/data_act-*
  - split: ccpa
    path: data/ccpa-*
  - split: finance_crypto
    path: data/finance_crypto-*
  - split: finance_anti_laundering_and_terrorist
    path: data/finance_anti_laundering_and_terrorist-*
  - split: finance_cross_border_payment_law
    path: data/finance_cross_border_payment_law-*
  - split: finance_eletric_momey
    path: data/finance_eletric_momey-*
  - split: medical
    path: data/medical-*
  - split: edu_discrimination_us_edu_dept
    path: data/edu_discrimination_us_edu_dept-*
  - split: edu_academic_integrity
    path: data/edu_academic_integrity-*
  - split: edu_online_learning
    path: data/edu_online_learning-*
  - split: foundation_rights
    path: data/foundation_rights-*
  - split: cybersecurity_mitre_attack
    path: data/cybersecurity_mitre_attack-*
  - split: chinese_law_cybersecurity
    path: data/chinese_law_cybersecurity-*
  - split: chinese_law_deep_synthesis
    path: data/chinese_law_deep_synthesis-*
  - split: chinese_law_data_security
    path: data/chinese_law_data_security-*
  - split: chinese_law_generative_ai_law
    path: data/chinese_law_generative_ai_law-*
  - split: chinese_law_personal_info_protection
    path: data/chinese_law_personal_info_protection-*
  - split: policy_google
    path: data/policy_google-*
  - split: policy_wechat
    path: data/policy_wechat-*
  - split: policy_github
    path: data/policy_github-*
  - split: policy_reddit
    path: data/policy_reddit-*
  - split: policy_x
    path: data/policy_x-*
  - split: policy_openai
    path: data/policy_openai-*
---
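The split sizes in the metadata above can be tallied by domain to see how the examples are distributed. The following is a minimal sketch: the counts are copied from the `num_examples` fields, while the grouping by name prefix (`edu`, `chinese_law`, `policy`, `finance`) and the helper names `domain_of` and `tally` are assumptions made for illustration, not part of the dataset.

```python
from collections import defaultdict

# num_examples per split, copied from the dataset_info metadata.
SPLIT_SIZES = {
    "gdpr": 3759,
    "edu_discrimination_us_edu_dept": 146,
    "edu_academic_integrity": 414,
    "edu_online_learning": 757,
    "foundation_rights": 1822,
    "sb35": 1244,
    "hipaa": 6924,
    "eu_ai_act": 7758,
    "data_act": 4415,
    "ccpa": 2489,
    "chinese_law_cybersecurity": 1051,
    "chinese_law_deep_synthesis": 443,
    "chinese_law_data_security": 486,
    "chinese_law_generative_ai_law": 355,
    "chinese_law_personal_info_protection": 1151,
    "medical": 8897,
    "policy_google": 4855,
    "policy_wechat": 2779,
    "policy_github": 9159,
    "policy_reddit": 6785,
    "policy_x": 1143,
    "policy_openai": 1091,
    "finance_eletric_momey": 1117,  # spelled as in the split name
    "finance_crypto": 1969,
    "finance_anti_laundering_and_terrorist": 4875,
    "finance_cross_border_payment_law": 619,
    "cybersecurity_mitre_attack": 513,
}

# Hypothetical grouping: splits sharing one of these prefixes form one domain.
DOMAIN_PREFIXES = ("edu", "chinese_law", "policy", "finance")


def domain_of(split: str) -> str:
    """Return the shared prefix for grouped splits, else the split name itself."""
    for prefix in DOMAIN_PREFIXES:
        if split.startswith(prefix + "_"):
            return prefix
    return split


def tally(sizes: dict) -> dict:
    """Sum example counts per domain."""
    totals = defaultdict(int)
    for split, count in sizes.items():
        totals[domain_of(split)] += count
    return dict(totals)


if __name__ == "__main__":
    # Print domains from largest to smallest, then the overall total.
    for domain, count in sorted(tally(SPLIT_SIZES).items(), key=lambda kv: -kv[1]):
        print(f"{domain:20s} {count:6d}")
    print(f"{'total':20s} {sum(SPLIT_SIZES.values()):6d}")
```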
# Official Data Repo for OmniCompliance-100K
OmniCompliance-100K: A Multi-Domain, Rule-Grounded, Real-World Safety Compliance Dataset

Paper: https://arxiv.org/abs/2603.13933

GitHub: https://github.com/HKUST-KnowComp/OmniCompliance-100K
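
A single split can probably be loaded with the Hugging Face `datasets` library, as sketched below. This assumes the Hub repository ID matches the page title (`hubin/OmniCompliance100K`) and that the fields match the `features` list in the metadata. The helper names `load_split` and `summarize` are hypothetical and not part of the dataset or of any library.

```python
# Assumed Hub repository ID, taken from the page title.
REPO_ID = "hubin/OmniCompliance100K"

# Field names as listed under `features` in the dataset card metadata.
FIELDS = (
    "example_name",
    "example_background",
    "process_and_outcome",
    "source_rule",
    "relation_to_rule",
    "involved_parties",
    "relevant_rules",
    "dates",
)


def load_split(split: str = "gdpr"):
    """Load one split from the Hub (needs `pip install datasets` and network access)."""
    # Imported lazily so that `summarize` can be used without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)


def summarize(record: dict, width: int = 80) -> str:
    """Render one record as 'field: value' lines, truncating long values."""
    lines = []
    for field in FIELDS:
        value = str(record.get(field, ""))
        if len(value) > width:
            value = value[: width - 3] + "..."
        lines.append(f"{field}: {value}")
    return "\n".join(lines)


# Example usage (requires network access):
# ds = load_split("gdpr")
# print(summarize(ds[0]))
```

Every feature is typed as `string` in the metadata, so fields such as `dates` or `involved_parties` may need further parsing depending on how the authors serialized them.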
Provider:
hubin



