超高层建筑绿色智慧运维辅助决策知识图谱规范三元组数据
收藏国家基础学科公共科学数据中心2026-01-30 收录
下载链接:
https://nbsdc.cn/general/dataDetail?id=694abc9a195d261fbbe0d8bb&type=1
下载链接
链接失效反馈官方服务:
资源简介:
本数据集主要面向超高层建筑绿色、安全与智能化运维的研究与应用需求建设。数据内容为符合知识图谱规范的语义三元组,其核心知识覆盖结构本体、设备设施、建筑环境、能源利用四个关键领域。
数据来源于中华人民共和国住房和城乡建设部官方网站的“文件库”标准公告,确保了来源的权威性与可追溯性。原始规范文本于2025年8月8日在北京与深圳两地,通过定制化的检索方案采集,并由领域专家核验,确保所有采用的规范均为现行有效版本。
在产生方法上,本项目首先为不同类型的建筑规范设计专用提示词模板,随后利用大模型的深层语义理解能力,从规范文本中精准抽取出结构化三元组知识,最后经由专家团队进行人工质检与消歧,确保知识的准确性与一致性。
数据集最终以UTF-8编码的文本文件格式存储,主要包含了从权威建筑规范中抽取、消歧并验证后的高质量(实体-关系-实体)三元组以及neo4j软件三元组导出数据,体量规模为1.5MB。
This dataset is developed to meet the research and application requirements of green, safe and intelligent operation and maintenance for ultra-high-rise buildings. The data consists of semantic triples conforming to knowledge graph specifications, with its core knowledge covering four key domains: structural ontology, equipment and facilities, built environment, and energy utilization.
The data is sourced from the standard announcements in the "Document Library" section of the official website of the Ministry of Housing and Urban-Rural Development of the People's Republic of China, ensuring the authority and traceability of the data source. The original standard documents were collected on August 8, 2025 in Beijing and Shenzhen via a customized retrieval solution, and verified by domain experts to ensure that all adopted standards are currently valid versions.
For the data generation pipeline, this project first designed dedicated prompt templates for various types of building codes, then utilized the deep semantic understanding ability of large language models (LLMs) to accurately extract structured triple knowledge from the standard texts, and finally performed manual quality inspection and disambiguation via an expert team to guarantee the accuracy and consistency of the knowledge.
The dataset is finally stored in UTF-8 encoded text file format, mainly including high-quality (entity-relation-entity) triples extracted, disambiguated and validated from authoritative building codes, as well as triple export data from Neo4j software, with a total size of 1.5 MB.
提供机构:
北京科技大学
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集为超高层建筑绿色智慧运维辅助决策提供知识图谱规范三元组数据,覆盖结构本体、设备设施、建筑环境和能源利用四个核心领域。数据源自住房和城乡建设部权威标准公告,通过大模型抽取和专家核验确保质量,以文本文件格式存储,体量1.5MB。
以上内容由遇见数据集搜集并总结生成



