企业合同全要素台账数据集
收藏贵州省数据知识产权登记平台2026-01-09 更新2026-01-10 收录
下载链接:
https://gzdipp.gzsis.cn:12020/noticeDetail?id=2178&type=1
下载链接
链接失效反馈官方服务:
资源简介:
数据采集环节,从原始台账中完整提取8类核心字段,包括合同编号、对方单位、合同类型、价税合计、签订日期、履行状态、付款方式、备注,确保覆盖合同“基本信息 - 金额 - 时间 - 状态”全要素,无关键信息遗漏;数据加工环节分三步推进:第一步是脱敏处理,采用“首尾字符保留 + 中间替换”规则,在保障信息安全的同时保留辨识度;第二步是数据清洗,剔除重复合同记录,将“签订日期”标准化格式,统一数值单位;第三步是结构化分析,通过“金额阈值分层”(核心客户≥100万元、重要客户20-100万元、潜力客户<20万元)完成客户分层,采用“类别聚合统计法”计算18类合同的数量与金额占比,借助“时间序列分析法”生成月度合同金额趋势,所有算法逻辑可追溯、结果可复现,且加工过程不改变原始数据核心信息。
Data collection stage: Fully extract 8 types of core fields from the original ledger, including contract number, counterparty unit, contract type, total price including tax, signing date, performance status, payment method and remarks, so as to cover all key elements across the four dimensions of contract basic information, amount, time and status, with no critical information omitted. The data processing stage proceeds in three sequential steps: The first step is data desensitization, which adopts the rule of retaining the first and last characters while replacing the middle content, ensuring information security while maintaining recognizability of the data. The second step is data cleaning, which removes duplicate contract records, standardizes the format of the "signing date" field, and unifies the numerical units of relevant data. The third step is structured analysis: implement customer segmentation via the "amount threshold stratification" method (core customers: ≥ 1,000,000 yuan; important customers: 200,000 - 1,000,000 yuan; potential customers: < 200,000 yuan); calculate the quantity and amount proportion of 18 types of contracts using the "category aggregation statistics method"; generate monthly contract amount trends with the aid of the "time series analysis method". All algorithmic logic is traceable and the results are reproducible, and the core information of the original data is not altered during the entire processing procedure.
提供机构:
贵州贵仁实业有限公司
创建时间:
2026-01-07
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集聚焦企业合同管理,涵盖合同编号、对方单位、价税合计等8类核心要素,数据规模为36KB,每年更新。它专为批发和零售业设计,通过系统化脱敏、清洗和分析流程,支持业务、销售、运营和财务部门进行资源优化、客户分层和资金控制,实现全链路业务支撑。
以上内容由遇见数据集搜集并总结生成



