文博标准数据集
收藏国家数据集管理服务平台2026-04-28 更新2026-04-29 收录
下载链接:
https://www.ndsms.cn/dataRetrieval/datasetDetail/?id=3f80c9c1832433f516b430b7e1b2c74d
下载链接
链接失效反馈官方服务:
资源简介:
本数据集面向文博领域标准化应用研发团队、文博信息化建设机构及智能审核系统开发者,旨在解决文博相关标准分散、查询效率低、难以批量用于模型训练的问题。数据集成国家标准、行业标准、地方标准三大类别,全面覆盖文物保护、藏品管理、博物馆服务、考古作业、文博信息化等核心业务领域,以结构化文本形式呈现。与传统标准文档库不同,本数据集对各级标准进行了统一格式清洗、条款拆解与分类标注,将非结构化的PDF标准转化为可直接用于模型训练和规则匹配的字段化数据。
This dataset is designed for teams engaged in standardized application R&D in the cultural heritage field, cultural heritage informatization construction institutions, and developers of intelligent audit systems. It aims to address the issues of scattered cultural heritage-related standards, low query efficiency, and difficulty in batch utilization for model training. The dataset integrates three categories of standards: national standards, industry standards, and local standards, comprehensively covering core business domains such as cultural relic protection, collection management, museum services, archaeological operations, and cultural heritage informatization, and is presented in structured text format. Unlike traditional standard document repositories, this dataset has conducted unified format cleaning, clause decomposition and classification annotation on standards at all levels, converting unstructured PDF standard documents into fielded data that can be directly used for model training and rule matching.
提供机构:
上海库帕思科技有限公司
创建时间:
2026-04-27
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集面向文博领域标准化应用研发团队和信息化建设者,旨在解决文博标准分散、查询效率低和难以批量用于模型训练的问题。它整合了国家标准、行业标准和地方标准,覆盖文物保护、藏品管理、博物馆服务等核心业务领域,并以结构化文本形式呈现,通过统一格式清洗和条款标注,将非结构化PDF标准转化为可直接用于模型训练和规则匹配的字段化数据。
以上内容由遇见数据集搜集并总结生成



