five

WorldPT: A Structured Dataset of Fictional Worlds in Portuguese

收藏
Zenodo2026-04-22 更新2026-05-26 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.18989735
下载链接
链接失效反馈
官方服务:
资源简介:
Dataset Description WorldPT: A Structured Dataset of Fictional Worlds in Portuguese is a curated and fully structured dataset designed for the study, generation, and analysis of fictional universes (worldbuilding). The dataset organizes narrative elements into multiple interconnected categories, including characters, locations, cultures, events, organizations, technologies, magic systems, religions, races, and more, each stored as standardized JSON files written entirely in Portuguese. Each world is represented as an independent directory containing entities with consistent metadata fields such as unique ID, title, genre, narrative text, content type, data source, and automatically detected cross-entity relationships. This relational structure enables the dataset to be used for knowledge graph construction, narrative network analysis, AI world generation, and computational creativity research. WorldPT serves as a reference resource for researchers and developers interested in natural language processing (NLP), semantic modeling, and artificial intelligence applied to storytelling and worldbuilding, especially in the context of the Portuguese language, where such resources remain scarce. At its core, WorldPT models each fictional universe as a graph, where: Entities (nodes) represent elements of the world (e.g., characters, places, artifacts). Relationships (edges) represent how these entities are connected. However, unlike conventional knowledge graphs, WorldPT introduces two critical innovations: Directed Relationships with Semantic Grounding All relationships are directional and meaningful. The direction of a connection encodes a dependency: The source entity is understood as being contextually grounded in the target entity. The target entity provides the context, origin, or constraint for the source. For example: A character is grounded in an organization (membership). An event is grounded in a timeline (temporal placement). This principle ensures that the graph reflects how the world is structured, not just which elements are associated. Multilayer Narrative Structure (Hexapartite System) Each relationship exists within one of six distinct semantic layers, forming a multilayer (or multiplex) graph. This allows the same pair of entities to be connected in different ways depending on context. Each layer represents a different dimension of narrative meaning: 1. Structural Layer Captures spatial organization and hierarchical containment.Examples: A city located within a kingdom A character belonging to a region 2. Causal Layer Represents cause-and-effect relationships and transformations.Examples: A technology influencing an economy An event triggering another event 3. Temporal Layer Encodes chronological structure and time anchoring.Examples: An event occurring within a specific timeline Historical sequencing of occurrences 4. Social Layer Models interpersonal relationships and social structures.Examples: Alliances, kinship, or membership Authority and social roles 5. Ontological Layer Defines the fundamental rules and systems governing the world.Examples: A character using a magic system Entities constrained by metaphysical laws 6. Symbolic Layer Represents meaning, ideology, and cultural influence.Examples: A religion shaping a culture Symbols influencing collective beliefs This layered structure allows the dataset to disambiguate complex relationships that would otherwise be conflated in traditional representations. For instance, a character may simultaneously belong to a group (social), be located in a place (structural), and be influenced by an ideology (symbolic). Ontological Classes WorldPT defines a fixed set of 13 ontological classes to categorize all entities in a fictional world. These classes ensure full coverage of narrative elements while minimizing ambiguity. They include: Agents and groups: characters, races, organizations Spatiotemporal anchors: places, events, timelines Societal systems: culture, religion, politics, economy World constraints: magic systems, technology, artifacts This taxonomy ensures that every entity has a clearly defined role within the world, enabling consistent modeling and analysis
提供机构:
Zenodo
创建时间:
2026-03-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作