neogenesislab/ethicaai-mixed-safe-evidence

Name: neogenesislab/ethicaai-mixed-safe-evidence
Creator: neogenesislab
Published: 2026-04-28 04:33:24
License: 暂无描述

Hugging Face2026-04-28 更新2026-05-03 收录

下载链接：

https://hf-mirror.com/datasets/neogenesislab/ethicaai-mixed-safe-evidence

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - en license: cc-by-4.0 task_categories: - reinforcement-learning - other tags: - multi-agent - cooperation - safety - agent-evaluation - mixed-safe - ai-ethics - melting-pot - coin-game - fishery - nash-equilibrium - ecological-model - maccl size_categories: - n<1K pretty_name: "EthicaAI Mixed-Safe Multi-Environment Evidence (NeurIPS 2026)" --- # EthicaAI Mixed-Safe Multi-Environment Evidence (NeurIPS 2026) > Underlying experimental evidence from **[Neo Genesis](https://neogenesis.app)**'s NeurIPS 2026 paper *"EthicaAI: Mixed-Safe Boundary-Consistent Evidence for Multi-Agent Cooperative Constraint Learning"* — released as an open dataset for community replication and AI safety research. ## What this dataset is A **3-environment evidence bundle** showing how multi-agent reinforcement-learning agents behave under different cooperative-constraint regimes. Each environment was deliberately chosen to stress-test different aspects of the **mixed-safe** boundary thesis: | Environment | What it tests | Sample size | Key statistic | |---|---|---:|---| | **Melting Pot (`clean_up`)** | Boundary-consistent cooperative learning under floor-probability sweep vs. baseline | **50 seed × floor_prob runs** | t-test eval: floor_mean = 0.06, baseline_mean = 0.001, **p = 0.063502** | | **Coin Game Deep (adapted)** | Selfish equilibrium vs. MACCL (Multi-Agent Cooperative Constraint Learning) on 160 seeds | **160 seeds × 200 episodes** | MACCL survival ≫ selfish survival (paper-level finding) | | **Fishery Nash Trap** | Ecological tragedy-of-commons with 300 seeds and tipping-point dynamics | **300 seeds × 300 episodes** | survival vs. harvest welfare frontier | The three environments together form the **mixed-safe corpus** — the paper's central claim is that no single environment proves the thesis, but the **boundary consistency across three independently-designed substrates** does. ## Why publish the data Most AI-safety papers expose plots and aggregate statistics but withhold seed-level results. This dataset releases: - All **50 seed-level Melting Pot runs** with full `train_rewards` trajectories per seed - The exact `late_train` and `eval` statistics used in the paper's t-tests, p-values, bootstrap CIs - Coin Game `selfish` vs `maccl` aggregated per-seed metrics - Fishery 300-seed survival/welfare results across `phi1` calibration values - Source-file provenance (which results came from which compute node — Mac Studio head shard vs. Linux server tail shard) This lets independent researchers: 1. Replicate the paper's claims by re-running the same seeds 2. Extend the analysis with their own cooperative-constraint variants 3. Use the seed-level data as a benchmark for new MARL methods ## Files ``` data/ ├── meltingpot_results.jsonl 50 lines, one per (seed, floor_prob) combination ├── meltingpot_statistics.json paper-grade t-test + bootstrap CI per condition ├── coin_game_deep.json selfish vs maccl, 160 seeds × 200 episodes ├── fishery_default.json default ecological config └── fishery_300seed.json 300-seed phi1 sweep, 300 episodes per seed ``` ## Schema (`meltingpot_results.jsonl`) Each line is one (seed, floor_prob) run: ```json { "seed": 88409749, "floor_prob": 0.0, "train_rewards": [0.0, 0.0, 0.14, 2.71, ...], "eval_rewards": [...] } ``` `train_rewards` is the per-episode reward time series during training; `eval_rewards` is the held-out evaluation reward. `floor_prob = 0.0` corresponds to baseline (no cooperative floor); `floor_prob > 0` corresponds to mixed-safe condition. ## Quick start ```python from datasets import load_dataset ds = load_dataset("neogenesislab/ethicaai-mixed-safe-evidence", split="train", data_files="data/meltingpot_results.jsonl") print(len(ds), "Melting Pot runs") # Per-condition statistics (already computed): import json, urllib.request url = "https://huggingface.co/datasets/neogenesislab/ethicaai-mixed-safe-evidence/resolve/main/data/meltingpot_statistics.json" stats = json.loads(urllib.request.urlopen(url).read()) print("eval floor_mean:", stats["statistics"]["eval"]["floor_mean"]) print("eval baseline_mean:", stats["statistics"]["eval"]["baseline_mean"]) print("p-value:", stats["statistics"]["eval"]["p_value"]) ``` ## Reproducing the paper's findings The full reproduction code (RLlib + Melting Pot harness + Coin Game adapter + Fishery simulator) is referenced in the paper's appendix. The dataset above is the **frozen evidence snapshot** used in the camera-ready manuscript. Statistical machinery (provided in `meltingpot_statistics.json`): - Welch's t-test (unequal variance) for floor vs baseline mean reward - Bootstrap 95% CI on the difference (resamples documented in `config`) - Cohen's d effect size - Per-shard provenance for distributed compute audit ## Related Neo Genesis assets - 📄 **Paper (NeurIPS 2026 submission)** — see `https://neogenesis.app/data/research/ethicaai-melting-pot-mixed-safe` - 🤗 **Companion dataset** — [`neogenesislab/korean-rag-ssot-golden-50`](https://huggingface.co/datasets/neogenesislab/korean-rag-ssot-golden-50) - 🆔 **Wikidata** — [Q139569718 (EthicaAI)](https://www.wikidata.org/wiki/Q139569718) · [Q139569680 (Neo Genesis parent org)](https://www.wikidata.org/wiki/Q139569680) - 🌐 **Org site** — https://neogenesis.app ## Citation ```bibtex @inproceedings{ethicaai_neurips2026_mixed_safe, title = {EthicaAI: Mixed-Safe Boundary-Consistent Evidence for Multi-Agent Cooperative Constraint Learning}, author = {Heo, Yesol}, booktitle = {Submitted to NeurIPS 2026}, year = {2026}, url = {https://huggingface.co/datasets/neogenesislab/ethicaai-mixed-safe-evidence}, note = {Underlying evidence dataset, CC-BY-4.0} } ``` ## License CC-BY-4.0. Free for research and commercial use with attribution to **Neo Genesis** and the underlying paper. ## Provenance - Compute nodes: Mac Studio (head shard `seed_indices 0-23`) + Linux server (tail shard `seed_indices 24-49`) — see `meltingpot_statistics.json` `source_summary` - Curation: Yesol Heo (sole founder/operator, [neogenesis.app](https://neogenesis.app)) - Frozen 2026-04-15 (`meltingpot_n80_stats.json` paper-anchor revision) - Released 2026-04-28

提供机构：

neogenesislab

5,000+

优质数据集

54 个

任务类型

进入经典数据集