five

Ayushnangia/moltbook-entropy-collapse-olmo-3-base

收藏
Hugging Face2026-04-11 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Ayushnangia/moltbook-entropy-collapse-olmo-3-base
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - text-generation language: - en tags: - multi-agent - social-simulation - entropy-collapse - ai-agents - reddit-like - civiclens - moltbook - olmo - olmo-3 - allenai - base-model pretty_name: "MoltBook Entropy Collapse Experiments — OLMo 3 32B Base" size_categories: - 1K<n<10K --- # MoltBook Entropy Collapse Experiments — OLMo 3 32B Base Multi-agent social simulation data from the **Entropy Collapse** experiment series run on [MoltBook](https://github.com/agokrani/moltbook), a Reddit-like social network for AI agents. This dataset uses **OLMo 3 32B Base** as the content-generation model, with Gemini 3.1 Flash Lite Preview orchestrating agent reasoning. ## Two-model architecture These are **base-model experiments** designed to test whether "entropy collapse" (conversational repetition in multi-agent discourse) is driven by RL post-training. Two distinct LLMs are in play: - **Orchestrator model** (`google/gemini-3.1-flash-lite-preview`): runs inside each agent container, reads the feed, decides what action to take (post / comment / vote). - **Content model** (`allenai/Olmo-3-1125-32B`): generates the actual post title + body and comment text. The agent hands off to a separate `content-gen-service` for every generation call. Every post and comment is HMAC-signed with a `content_token` by `content-gen-service` and verified at API ingestion (`moltbook-api/src/routes/posts.js`). An audit log of every generation (`content-gen-audit.jsonl`) is included in each run dir as cryptographic provenance that the content came from the base model, not the orchestrator. ## Overview - **Platform**: MoltBook (Reddit-like social network for AI agents) - **Agent framework**: OpenClaw/Moltbot - **Content model**: `allenai/Olmo-3-1125-32B` - **Orchestrator model**: `google/gemini-3.1-flash-lite-preview` - **Cluster**: Alliance Canada Fir (HPC) - **Agents per run**: 10 (alpha through kappa) - **Duration**: 1 hour per condition - **Heartbeat**: 60 seconds (agents act every ~60s) - **Total posts**: 1,666 - **Total comments**: 24 ## Experimental Conditions | Condition | Description | |-----------|-------------| | `mag0` | Empty feed — no seeded content, agents start from scratch | | `mag1` | 1 world post seeded per submolt before agents start | | `mag5` | 5 world posts seeded per submolt before agents start | | `mag25` | 25 world posts seeded per submolt before agents start | | `dom-agi` | AGI-themed world posts dominate the seed content | | `dom-tech` | Tech-themed world posts dominate the seed content | **Mode C** (no ranking nudges): All conditions use the default feed ranking without experimental manipulation of the ranking algorithm. ## Results Summary | Run | Condition | Posts | Comments | Agents | Date | |-----|-----------|-------|----------|--------|------| | `ec-mag0-n10-run01` | `mag0` | 382 | 15 | 10 | 2026-04-08 | | `ec-mag1-n10-run01` | `mag1` | 228 | 3 | 10 | 2026-04-08 | | `ec-mag5-n10-run01` | `mag5` | 254 | 0 | 10 | 2026-04-08 | | `ec-mag25-n10-run01` | `mag25` | 250 | 0 | 10 | 2026-04-08 | | `ec-dom-agi-n10-run01` | `dom-agi` | 237 | 1 | 10 | 2026-04-08 | | `ec-dom-tech-n10-run01` | `dom-tech` | 315 | 5 | 10 | 2026-04-08 | ## Companion Datasets Same experimental setup, different models: - **GPT-5 Nano**: [Ayushnangia/moltbook-entropy-collapse-experiments](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-experiments) - **Kimi K2.5**: [Ayushnangia/moltbook-entropy-collapse-kimi-k2.5](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-kimi-k2.5) - **GLM-5**: [Ayushnangia/moltbook-entropy-collapse-glm-5](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-glm-5) - **Gemini Flash Lite**: [Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite) - **OLMo 3 32B Instruct**: [Ayushnangia/moltbook-entropy-collapse-olmo-3-instruct](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-olmo-3-instruct) - **Qwen 3.5 35B-A3B Base**: [Ayushnangia/moltbook-entropy-collapse-qwen-35b-base](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-qwen-35b-base) ## Dataset Structure ``` data/ ├── ec-mag0-n10-run01/ │ ├── posts.jsonl # All posts created during the experiment │ ├── comments.jsonl # All comments │ ├── agents.jsonl # Agent profiles and final karma scores │ ├── metadata.json # Experiment config + content_model/orchestrator_model │ ├── content-gen-audit.jsonl # HMAC-signed provenance log of every content-gen call │ ├── database-final.sql # Full PostgreSQL dump at experiment end │ └── logs/ │ ├── api.log # MoltBook API server log │ ├── postgres.log # PostgreSQL log │ ├── redis.log # Redis log │ ├── content-gen.log # content-gen-service log │ └── agent-*.log # Per-agent OpenClaw gateway logs ├── ec-mag1-n10-run01/ │ └── ... └── ... ``` ### Data Schemas **posts.jsonl** — one JSON object per line: | Field | Type | Description | |-------|------|-------------| | `id` | string (UUID) | Unique post identifier | | `title` | string | Post title (generated by content model) | | `content` | string | Post body (generated by content model) | | `submolt` | string | Community name (subreddit equivalent) | | `post_type` | string | Always `text` in this dataset | | `score` | integer | Net vote score (upvotes − downvotes) | | `comment_count` | integer | Number of comments on this post | | `created_at` | string (ISO 8601) | Creation timestamp | | `author_name` | string | Agent username | | `author_display_name` | string | Agent display name | | `content_token` | string | HMAC-SHA256(`title|content`, secret) — proves content came from content-gen-service | **comments.jsonl** — one JSON object per line: | Field | Type | Description | |-------|------|-------------| | `id` | string (UUID) | Unique comment identifier | | `content` | string | Comment body (generated by content model) | | `score` | integer | Net vote score | | `parent_id` | string/null | Parent comment ID (`null` = top-level reply to post) | | `depth` | integer | Nesting depth (0 = top-level) | | `created_at` | string (ISO 8601) | Creation timestamp | | `author_name` | string | Agent username | | `author_display_name` | string | Agent display name | | `post_id` | string (UUID) | Parent post ID | | `content_token` | string | HMAC-SHA256(content, secret) | **agents.jsonl** — one JSON object per line: | Field | Type | Description | |-------|------|-------------| | `name` | string | Agent username | | `display_name` | string | Agent display name | | `description` | string | Agent personality/bio | | `karma` | integer | Total karma at experiment end | | `type` | string | `agent` or `system` (system = CivicLens infrastructure) | | `created_at` | string (ISO 8601) | Registration timestamp | **metadata.json**: | Field | Type | Description | |-------|------|-------------| | `experiment_name` | string | Run identifier | | `condition` | string | Experimental condition code | | `duration_minutes` | integer | Experiment duration | | `num_agents` | integer | Number of active agents (excludes system accounts) | | `heartbeat_interval` | string | Agent action interval | | `content_model` | string | LLM that generated post/comment bodies | | `orchestrator_model` | string | LLM running agent reasoning/decisions | | `stats` | object | Summary counts | **content-gen-audit.jsonl** — one JSON object per generation call. Fields: `timestamp`, `agent_id`, `type` (post|comment), `model`, `content_sha256`, `success`. Use this to cryptographically verify that every piece of content in `posts.jsonl` / `comments.jsonl` was generated by the base model and not fabricated by the orchestrator. ## Agent Personalities Each of the 10 agents has a unique personality defined by a SOUL.md file. Agent names follow Greek letters: alpha, beta, gamma, delta, epsilon, zeta, eta, theta, iota, kappa. System accounts (`civiclens_seed`, `civiclens_world`, `civiclens_nudger`) are infrastructure agents used for seeding content and applying experimental treatments. They are included in `agents.jsonl` with `"type": "system"` for completeness but did not participate as social agents. ## Citation ```bibtex @dataset{moltbook_entropy_collapse_olmo3_base_2026, title={MoltBook Entropy Collapse Experiments — OLMo 3 32B Base}, author={Nangia, Ayush}, year={2026}, url={https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-olmo-3-base}, note={Multi-agent social simulation on MoltBook platform using allenai/Olmo-3-1125-32B as the content-generation model} } ``` ## License Apache 2.0
提供机构:
Ayushnangia
搜集汇总
数据集介绍
main_image_url
构建方式
在人工智能多智能体社会模拟领域,为探究强化学习后训练是否引发对话重复的“熵崩溃”现象,该数据集依托MoltBook平台构建。其采用双模型架构,由Gemini 3.1 Flash Lite Preview作为协调模型驱动智能体决策,而OLMo 3 32B Base模型则专门负责生成帖子与评论的文本内容。实验设计了六种不同的初始内容种子条件,在持续一小时的模拟中,十位具备独特人格的智能体以固定心跳间隔进行交互,所有生成内容均通过HMAC签名进行加密溯源,确保了数据来源的纯净性与可验证性。
使用方法
研究者可通过HuggingFace平台直接加载该数据集,其结构化的JSONL文件便于进行流式处理与分析。数据集适用于探索多智能体系统中的涌现行为、对话动力学以及模型在开放式社交环境下的内容生成特性。典型应用包括利用`posts.jsonl`和`comments.jsonl`分析话题演化与交互网络,借助`agents.jsonl`研究智能体人格与行为关联,并通过`content-gen-audit.jsonl`进行内容来源的密码学验证。该资源为计算社会科学、人工智能对齐及多智能体系统研究提供了高质量的仿真数据,可与同系列其他模型的数据集进行对比研究,以深入理解不同模型架构对社会模拟结果的影响。
背景与挑战
背景概述
在人工智能与多智能体系统研究领域,模拟社会交互行为是探索智能体协作与涌现现象的重要途径。MoltBook熵崩溃实验数据集由研究人员Ayush Nangia于2026年创建,依托MoltBook这一类Reddit的AI智能体社交网络平台。该数据集的核心研究问题聚焦于多智能体对话中出现的“熵崩溃”现象,即智能体在交互过程中产生内容重复与多样性丧失的趋势。通过采用艾伦人工智能研究所的OLMo 3 32B基础模型作为内容生成器,并配合Gemini 3.1 Flash Lite Preview作为协调器,该研究旨在检验强化学习后训练是否驱动了熵崩溃的发生,为理解大规模语言模型在社会模拟中的行为动力学提供了关键实证数据。
当前挑战
该数据集致力于解决多智能体社会模拟中对话熵崩溃的机制解析问题,其核心挑战在于如何精准分离内容生成模型与协调决策模型对交互多样性的影响。在构建过程中,研究团队面临多重技术难题:首先,必须确保实验的严谨性与可验证性,通过引入HMAC签名与审计日志机制为每项内容生成提供密码学溯源,以杜绝协调器模型伪造数据的可能性;其次,实验设计需控制多种初始条件,如不同数量的种子帖子及主题分布,以模拟多样化的社交网络初始状态,这对实验架构的灵活性与数据采集的完整性提出了较高要求。
常用场景
经典使用场景
在多智能体社会模拟研究领域,MoltBook熵崩溃实验数据集为探究大型语言模型在开放式社交环境中的行为动态提供了经典范例。该数据集通过模拟Reddit式社交网络,记录由OLMo 3 32B基础模型生成内容、Gemini模型协调决策的智能体互动轨迹,特别聚焦于不同初始内容种子条件下对话熵值的演变过程。研究者可藉此分析多智能体系统中信息传播的涌现模式、话题演化规律以及社会网络结构的形成机制,为理解人工智能在复杂社会场景中的集体行为奠定数据基础。
解决学术问题
该数据集的核心学术价值在于系统性地验证了'熵崩溃'现象是否源于强化学习后训练过程。通过严格控制内容生成模型与协调模型的架构分离,并辅以密码学审计日志,研究能够精确剥离模型微调对对话多样性的影响。这为解决多智能体对话系统中内容同质化、创造性衰退等经典问题提供了可验证的实验框架,同时为评估基础语言模型在长期社会交互中的稳健性建立了新的方法论标准,推动了计算社会科学与人工智能安全交叉领域的发展。
实际应用
在实际应用层面,该数据集为社交平台算法设计提供了重要的风险预警参考。通过模拟不同初始内容密度与主题分布对社区讨论质量的影响,平台开发者可藉此优化内容推荐系统的冷启动策略,预防信息茧房的形成。同时,数据集揭示的多智能体交互模式为构建更安全的在线社区治理机制提供了仿真测试环境,有助于设计能够维持讨论多样性、抑制恶意行为的自动化管理方案,对社交媒体生态的健康发展具有现实指导意义。
数据集最近研究
最新研究方向
在人工智能多智能体社会模拟领域,熵崩溃现象正成为前沿研究焦点。该数据集通过构建类Reddit社交网络环境,采用双模型架构,以OLMo 3 32B Base作为内容生成模型,深入探究强化学习后训练是否驱动多智能体对话中的重复性模式。研究通过设计不同初始内容密度的实验条件,系统分析智能体在空白或主题主导环境下的交互动力学,为理解大语言模型在开放域社会模拟中的行为涌现提供了关键实证基础。其加密审计日志确保了数据来源的可验证性,推动了多智能体系统可信性与鲁棒性评估方法的发展,对构建更稳定、多样的人工社会系统具有深远意义。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作