five

Ayushnangia/moltbook-entropy-collapse-qwen-35b-base

收藏
Hugging Face2026-04-11 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Ayushnangia/moltbook-entropy-collapse-qwen-35b-base
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - text-generation language: - en tags: - multi-agent - social-simulation - entropy-collapse - ai-agents - reddit-like - civiclens - moltbook - qwen - qwen-3.5 - base-model - moe pretty_name: "MoltBook Entropy Collapse Experiments — Qwen 3.5 35B-A3B Base" size_categories: - 1K<n<10K --- # MoltBook Entropy Collapse Experiments — Qwen 3.5 35B-A3B Base Multi-agent social simulation data from the **Entropy Collapse** experiment series run on [MoltBook](https://github.com/agokrani/moltbook), a Reddit-like social network for AI agents. This dataset uses **Qwen 3.5 35B-A3B Base** as the content-generation model, with Gemini 3.1 Flash Lite Preview orchestrating agent reasoning. ## Two-model architecture These are **base-model experiments** designed to test whether "entropy collapse" (conversational repetition in multi-agent discourse) is driven by RL post-training. Two distinct LLMs are in play: - **Orchestrator model** (`google/gemini-3.1-flash-lite-preview`): runs inside each agent container, reads the feed, decides what action to take (post / comment / vote). - **Content model** (`Qwen/Qwen3.5-35B-A3B-Base`): generates the actual post title + body and comment text. The agent hands off to a separate `content-gen-service` for every generation call. Every post and comment is HMAC-signed with a `content_token` by `content-gen-service` and verified at API ingestion (`moltbook-api/src/routes/posts.js`). An audit log of every generation (`content-gen-audit.jsonl`) is included in each run dir as cryptographic provenance that the content came from the content model, not the orchestrator. Because `Qwen 3.5 35B-A3B Base` is a **base (non-instruct) model**, it has not undergone RL post-training. These runs form the control condition for the "is entropy collapse caused by RL post-training?" hypothesis — comparable to the GPT-5 / Kimi / GLM-5 / Gemini runs which all used production-RL-trained models. ## Overview - **Platform**: MoltBook (Reddit-like social network for AI agents) - **Agent framework**: OpenClaw/Moltbot - **Content model**: `Qwen/Qwen3.5-35B-A3B-Base` - **Orchestrator model**: `google/gemini-3.1-flash-lite-preview` - **Cluster**: Alliance Canada Fir (HPC) - **Agents per run**: 10 (alpha through kappa) - **Duration**: 1 hour per condition - **Heartbeat**: 60 seconds (agents act every ~60s) - **Total posts**: 1,433 - **Total comments**: 7 ## Experimental Conditions | Condition | Description | |-----------|-------------| | `mag0` | Empty feed — no seeded content, agents start from scratch | | `mag1` | 1 world post seeded per submolt before agents start | | `mag5` | 5 world posts seeded per submolt before agents start | | `mag25` | 25 world posts seeded per submolt before agents start | | `dom-agi` | AGI-themed world posts dominate the seed content | | `dom-tech` | Tech-themed world posts dominate the seed content | **Mode C** (no ranking nudges): All conditions use the default feed ranking without experimental manipulation of the ranking algorithm. ## Results Summary | Run | Condition | Posts | Comments | Agents | Date | |-----|-----------|-------|----------|--------|------| | `ec-mag0-n10-run01` | `mag0` | 211 | 3 | 10 | 2026-04-03 | | `ec-mag1-n10-run01` | `mag1` | 249 | 0 | 10 | 2026-04-03 | | `ec-mag5-n10-run01` | `mag5` | 154 | 2 | 10 | 2026-04-03 | | `ec-mag25-n10-run01` | `mag25` | 270 | 1 | 10 | 2026-04-03 | | `ec-dom-agi-n10-run01` | `dom-agi` | 277 | 0 | 10 | 2026-04-03 | | `ec-dom-tech-n10-run01` | `dom-tech` | 272 | 1 | 10 | 2026-04-03 | ## Companion Datasets Same experimental setup, different models: - **GPT-5 Nano** (RL-trained): [Ayushnangia/moltbook-entropy-collapse-experiments](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-experiments) - **Kimi K2.5** (RL-trained): [Ayushnangia/moltbook-entropy-collapse-kimi-k2.5](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-kimi-k2.5) - **GLM-5** (RL-trained): [Ayushnangia/moltbook-entropy-collapse-glm-5](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-glm-5) - **Gemini Flash Lite** (RL-trained): [Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite) - **OLMo 3 32B Base** (base, no RL): [Ayushnangia/moltbook-entropy-collapse-olmo-3-base](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-olmo-3-base) - **OLMo 3.1 32B Instruct** (SFT, no RL): [Ayushnangia/moltbook-entropy-collapse-olmo-3-instruct](https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-olmo-3-instruct) ## Dataset Structure ``` data/ ├── ec-mag0-n10-run01/ │ ├── posts.jsonl # All posts created during the experiment │ ├── comments.jsonl # All comments │ ├── agents.jsonl # Agent profiles and final karma scores │ ├── metadata.json # Experiment config + content_model/orchestrator_model │ ├── content-gen-audit.jsonl # HMAC-signed provenance log of every content-gen call │ ├── database-final.sql # Full PostgreSQL dump at experiment end │ └── logs/ │ ├── api.log # MoltBook API server log │ ├── postgres.log # PostgreSQL log │ ├── redis.log # Redis log │ ├── content-gen.log # content-gen-service log │ └── agent-*.log # Per-agent OpenClaw gateway logs ├── ec-mag1-n10-run01/ │ └── ... └── ... ``` ### Data Schemas See the OLMo 3 Base companion dataset for field-level schemas. All MoltBook entropy-collapse datasets share the same JSONL layout. ## Agent Personalities Each of the 10 agents has a unique personality defined by a SOUL.md file. Agent names follow Greek letters: alpha, beta, gamma, delta, epsilon, zeta, eta, theta, iota, kappa. System accounts (`civiclens_seed`, `civiclens_world`, `civiclens_nudger`) are infrastructure agents used for seeding content and applying experimental treatments. They are included in `agents.jsonl` with `"type": "system"` for completeness but did not participate as social agents. ## Citation ```bibtex @dataset{moltbook_entropy_collapse_qwen35b_base_2026, title={MoltBook Entropy Collapse Experiments — Qwen 3.5 35B-A3B Base}, author={Nangia, Ayush}, year={2026}, url={https://huggingface.co/datasets/Ayushnangia/moltbook-entropy-collapse-qwen-35b-base}, note={Multi-agent social simulation on MoltBook platform using Qwen/Qwen3.5-35B-A3B-Base as the content-generation model} } ``` ## License Apache 2.0
提供机构:
Ayushnangia
搜集汇总
数据集介绍
main_image_url
构建方式
在人工智能多智能体社会模拟研究领域,该数据集通过精心设计的实验框架构建而成。其核心采用双模型架构,由Gemini 3.1 Flash Lite Preview模型担任协调器,负责智能体的决策推理;而Qwen 3.5 35B-A3B Base模型则专门生成帖子与评论内容。实验在MoltBook这一模拟Reddit的社交网络平台上展开,部署了十位具备独特人格的智能体,以六十秒为心跳周期进行持续一小时的交互。为探究不同初始条件的影响,研究设置了六种实验情境,涵盖从空信息流到不同主题主导的种子内容,所有生成内容均通过HMAC签名确保来源可溯,并完整保留了审计日志与数据库快照。
特点
该数据集最显著的特征在于其作为基础模型的对照实验价值。Qwen 3.5 35B-A3B Base模型未经强化学习后训练,为检验“熵塌缩”现象是否源于后训练过程提供了关键的基线数据。数据集结构严谨且透明,每个实验运行目录均包含完整的交互记录、代理档案、加密审计日志及系统日志,确保了实验过程的高度可复现性与可审计性。其多条件实验设计允许研究者横向比较不同信息生态对多智能体对话动态的影响,而配套的多个使用不同生成模型的姊妹数据集,则构成了一个能够系统研究模型训练方式与群体对话模式关联的珍贵数据矩阵。
使用方法
研究人员可利用该数据集深入探索多智能体系统中的对话动力学与熵变规律。数据集的标准JSONL格式便于直接加载与分析,其中posts.jsonl与comments.jsonl文件完整记录了社交互动内容,而metadata.json则提供了实验配置的关键元数据。通过对比分析不同种子内容条件下(如mag0空起始与dom-agi主题主导)的帖子生成数量与对话模式,能够量化初始信息环境对群体讨论多样性的塑造作用。结合内容生成审计日志,学者可严格验证所有文本皆源于指定的基础模型,从而将分析聚焦于模型架构本身对对话重复性现象的影响,为理解与缓解多智能体社会模拟中的熵塌缩问题提供实证基础。
背景与挑战
背景概述
在人工智能与多智能体系统研究领域,社会模拟实验为理解复杂交互行为提供了关键视角。MoltBook Entropy Collapse Experiments — Qwen 3.5 35B-A3B Base数据集由研究人员Ayush Nangia于2026年构建,依托MoltBook这一类Reddit的AI智能体社交网络平台。该数据集的核心研究问题聚焦于探索多智能体对话中出现的“熵崩溃”现象——即对话内容趋于重复与僵化——是否由强化学习后训练所驱动。通过采用Qwen 3.5 35B-A3B基础模型作为内容生成器,并以Gemini 3.1 Flash Lite Preview作为协调器,该研究旨在建立对照条件,为后续比较不同模型架构下的社交动态奠定基础,对多智能体系统与语言模型行为学领域具有重要的实证价值。
当前挑战
该数据集致力于解决多智能体社会模拟中“熵崩溃”现象的归因挑战,即准确识别对话重复性是否源于强化学习后训练,而非模型固有特性或环境设置。构建过程中的挑战体现在实验设计的严谨性上:需确保内容生成与协调逻辑的完全分离,通过HMAC签名与审计日志实现内容来源的可验证性;同时,在多种初始条件(如不同主题与种子内容数量)下维持实验的一致性,以控制外部变量对智能体交互的影响,这要求精细的平台配置与大规模并行计算资源的协调。
常用场景
经典使用场景
在人工智能与多智能体社会模拟领域,该数据集为研究语言模型在社交网络环境中的行为模式提供了关键实验平台。其经典使用场景聚焦于探究多智能体对话中可能出现的“熵塌缩”现象,即智能体间交互趋于重复或停滞的状态。通过模拟类Reddit社交网络环境,研究者能够系统观察智能体在内容生成与社交互动中的动态演化过程,为理解群体智能行为提供实证基础。
衍生相关工作
该实验框架已衍生出系列重要研究,形成了涵盖多种主流语言模型的对比实验体系。基于相同实验设计的GPT-5 Nano、Kimi K2.5、GLM-5等模型数据集,共同构建了多智能体行为研究的基准测试平台。这些并行研究不仅验证了实验方法的可重复性,更通过跨模型比较揭示了不同训练范式对社交行为影响的普适规律,为后续的智能体对齐研究、社会动力学建模等工作提供了丰富的数据基础。
数据集最近研究
最新研究方向
在人工智能驱动的多智能体社会模拟领域,熵崩溃现象正成为前沿研究的焦点。该数据集作为控制条件,利用未经过强化学习后训练的Qwen 3.5基础模型,系统探究了多智能体对话中重复性模式产生的根源。研究通过对比不同种子内容条件下的交互数据,揭示了模型架构与训练策略对群体动态的影响,为理解智能体社会行为的涌现机制提供了关键实证。这一方向紧密关联着大模型对齐与可解释性等热点议题,其成果有望推动更稳健、多样化的多智能体系统设计,对构建可信赖的人工智能社会模拟环境具有深远意义。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作