brandburner/westwing-mega-narrative-kg
收藏Hugging Face2026-02-24 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/brandburner/westwing-mega-narrative-kg
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-sa-4.0
task_categories:
- graph-ml
- text-generation
tags:
- narrative
- knowledge-graph
- screenplay
- fabula
- neo4j
- graph-gravity
size_categories:
- 1K<n<10K
---
# The West Wing - Narrative Knowledge Graph
A rich narrative knowledge graph extracted from *The West Wing* screenplays using the
[Fabula](https://fabula.productions) pipeline. Contains characters,
locations, objects, organizations, events, themes, and conflict arcs with full
participation semantics and Graph Gravity importance tiers.
## Dataset Overview
| Metric | Value |
|--------|-------|
| Source database | `westwing.mega` |
| Type | Megagraph (cross-season merged) |
| Episodes | 89 |
| Seasons merged | 1, 2, 3, 4 |
| Total nodes | 24,246 |
| Total edges | 62,796 |
| Schema version | 1.1.0 |
| Exported | 2026-02-24 |
### Entity Breakdown
| Type | Count |
|------|-------|
| Act | 404 |
| Agent | 2,132 |
| ConflictArc | 357 |
| Episode | 89 |
| Event | 3,303 |
| Location | 1,589 |
| Object | 2,939 |
| Organization | 1,347 |
| PlotBeat | 9,689 |
| SceneBoundary | 2,141 |
| Theme | 227 |
| Writer | 29 |
### Graph Gravity Tiers
| Tier | Count | Description |
|------|-------|-------------|
| anchor | 185 | Main characters / key locations |
| planet | 865 | Recurring entities |
| asteroid | 6,957 | Minor / one-off entities |
### Relationship Types
`AFFILIATED_WITH`, `BELONGS_TO_EPISODE`, `CALLBACK`, `CAUSAL`, `CHARACTER_CONTINUITY`, `CONTAINS_BEAT`, `CREDITED_ON`, `EMOTIONAL_ECHO`, `ESCALATION`, `EXEMPLIFIES_THEME`, `FORESHADOWING`, `INVOLVED_IN_ARC`, `INVOLVED_WITH`, `IN_EVENT`, `NARRATIVELY_FOLLOWS`, `OCCURS_IN`, `OWNS`, `PARTICIPATED_AS`, `PART_OF`, `PART_OF_ACT` ... and 6 more
## Files
| File | Description |
|------|-------------|
| `nodes.parquet` | All graph nodes with properties |
| `edges.parquet` | All relationships with properties |
| `positions.parquet` | 3D layout coordinates for visualization |
| `meta.json` | Dataset metadata and entity counts |
## Schema
### Nodes (`nodes.parquet`)
| Column | Type | Description |
|--------|------|-------------|
| `node_id` | string | Unique node identifier (UUID) |
| `primary_label` | string | Node type (Agent, Location, Event, etc.) |
| `name` | string | Display name |
| `description` | string | Foundational description |
| `tier` | string (nullable) | Graph Gravity tier: anchor / planet / asteroid |
| `episode_count` | int (nullable) | Number of distinct episodes entity appears in |
| `first_episode_seq` | int (nullable) | First appearance episode |
| `last_episode_seq` | int (nullable) | Last appearance episode |
| `properties_json` | string | Full node properties as JSON |
### Edges (`edges.parquet`)
| Column | Type | Description |
|--------|------|-------------|
| `source_node_id` | string | Source node UUID |
| `target_node_id` | string | Target node UUID |
| `relationship_type` | string | Relationship type (e.g., PARTICIPATED_AS) |
| `properties_json` | string | Edge properties as JSON |
### Positions (`positions.parquet`)
| Column | Type | Description |
|--------|------|-------------|
| `node_id` | string | Node UUID |
| `x`, `y`, `z` | float | 3D coordinates |
| `size` | float | Node size (Graph Gravity weighted) |
| `r`, `g`, `b` | int | RGB color by entity type |
| `community` | int | Louvain community index |
| `tier` | string (nullable) | Graph Gravity tier |
## Usage
```python
from datasets import load_dataset
import pandas as pd
# Load from HuggingFace
ds = load_dataset("brandburner/westwing-mega-narrative-kg")
# Or load parquet directly
nodes = pd.read_parquet("nodes.parquet")
edges = pd.read_parquet("edges.parquet")
# Filter to anchor characters
anchors = nodes[(nodes['primary_label'] == 'Agent') & (nodes['tier'] == 'anchor')]
# Build a NetworkX graph
import networkx as nx
G = nx.DiGraph()
for _, n in nodes.iterrows():
G.add_node(n['node_id'], label=n['primary_label'], name=n['name'])
for _, e in edges.iterrows():
G.add_edge(e['source_node_id'], e['target_node_id'], type=e['relationship_type'])
```
## Citation
```bibtex
@misc{fabula_westwing_mega,
title = {The West Wing Narrative Knowledge Graph},
author = {Fabula Pipeline},
year = {2026},
publisher = {HuggingFace},
howpublished = {\url{https://huggingface.co/datasets/brandburner/westwing-mega-narrative-kg}}
}
```
## License
CC BY-SA 4.0
提供机构:
brandburner



