five

12345热线智能分析数据库

收藏
国家基础学科公共科学数据中心2026-01-30 收录
下载链接:
https://nbsdc.cn/general/dataDetail?id=696fa68d195d265a6c49a246&type=1
下载链接
链接失效反馈
官方服务:
资源简介:
本数据集源于与数字政府项目合作,以黑龙江省和北京市等地的“12345政务服务热线”真实业务构建政务服务场景下的多维结构化数据库,数据量为614KB,包含企业、政府部门、个人与投诉记录四个核心表,反映政务服务全过程的要素与互动关系。研究条件上,数据采集依托12345热线系统业务数据接口,由合作方汇聚原始数据并脱敏后提供给课题组,采集时间为2024年8月至10月,地点涵盖黑龙江省与北京市,时间精度为秒级,采集后计算和使用方式主要为自然语言处理。在数据处理上,我方接收脱敏数据后,再次对涉及个人隐私、商业机密的敏感字段匿名化或模糊化,确保安全合规,随后进行数据清洗,处理缺失值、修正格式与逻辑错误。质量控制方面,来源采集环节数据由政务信息系统实时生成,权威性高,且遵循统一标准规范,保证格式定义规范;加工处理环节注重数据安全与清洗;质量保证环节重点核查数据完整性、一致性和准确性,并与数据提供方沟通,对存疑数据溯源确认。该数据集潜在利用价值高,能为政务服务研究、政策制定、服务优化等提供有力数据支撑,助力提升政务服务质量和效率。

This dataset is developed in collaboration with a digital government project, constructing a multi-dimensional structured database for government service scenarios based on real business data from the "12345 Government Service Hotline" in Heilongjiang Province, Beijing Municipality and other regions. With a total size of 614 KB, the dataset comprises four core tables: enterprises, government departments, individuals, and complaint records, which reflect the elements and interactive relationships throughout the entire process of government services. In terms of research conditions, the data collection relies on the business data interface of the 12345 Government Service Hotline system. The original data was aggregated by the cooperative partner, anonymized, and then provided to the research team. The data was collected from August to October 2024, covering Heilongjiang Province and Beijing Municipality, with second-level time precision. Post-collection computation and usage primarily adopt natural language processing (NLP) technologies. During data processing, after receiving the pre-anonymized data, we further anonymize or obfuscate sensitive fields involving personal privacy and commercial secrets to ensure compliance and data security, followed by data cleaning operations including handling missing values, correcting format errors and logical inconsistencies. For quality control, the source-collected data is generated in real-time by official government information systems, featuring high authority and compliance with unified standard specifications to guarantee standardized format definitions. The data processing stage prioritizes data security and cleaning. The quality assurance stage mainly verifies data integrity, consistency and accuracy, and communicates with the data provider to trace and confirm any questionable data entries. This dataset has high potential application value, providing robust data support for government service research, policy formulation, service optimization and other related fields, and contributing to the improvement of government service quality and efficiency.
提供机构:
中国人民大学
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
12345热线智能分析数据库是一个基于黑龙江省和北京市政务服务热线真实业务构建的多维结构化数据库,包含企业、政府部门、个人与投诉记录四个核心表,数据经过脱敏和清洗处理,适用于政务服务研究和政策制定等领域。
以上内容由遇见数据集搜集并总结生成
二维码
社区交流群
二维码
科研交流群
商业服务