Companies House
收藏Snowflake2026-04-29 更新2026-04-30 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZTDZ12PVPA
下载链接
链接失效反馈官方服务:
资源简介:
Near-real-time snapshot of the UK Companies House register, delivered as Apache Iceberg tables on S3 and queried in place by Snowflake via Glue catalog integration. Zero data copies, zero ETL to maintain on the consumer side — just SELECT.
## What's included
Four normalized, joinable tables covering the UK statutory company register:
**company_info**
One row per active/historic UK company
company_number, company_name, company_status, company_type, date_of_creation, jurisdiction, registered office address fields, sic_codes, last_updated
**financials**
One row per filed accounting fact
company_number, transaction_id, accounting_period_start, accounting_period_end, fact_name, current_year_value, previous_year_value, currency, doc_type, last_updated
**officers**
One row per officer (director/secretary/etc.)
officer_id, name, nationality, occupation, date_of_birth_year, date_of_birth_month, country_of_residence, last_updated
**officer_appointments**
One row per appointment (officer ↔ company)
company_number, officer_id, appointment_id, officer_role, appointed_on, resigned_on, is_active, last_updated
## Why it's useful
- Build the full graph: officers ↔ officer_appointments ↔ company_info gives you the complete network of UK directorships in SQL.
- Standardized financials: financials extracts structured iXBRL facts (revenue, assets, liabilities, employees, etc.) from filed annual accounts — no HTML parsing required.
- Time-traveling: Backed by Apache Iceberg, so you can query historical snapshots (AT (TIMESTAMP => ...)), audit changes, and safely re-run joins against a consistent point in time.
- Always fresh: Sourced directly from the Companies House streaming API and incrementally merged on a daily cadence. last_updated columns let you implement change-data capture on the consumer side.
- Partitioned for performance: Company-centric tables bucket on company_number; financials partition by years(accounting_period_end) for efficient period filters.
## Example queries
## Use cases
- KYC / KYB onboarding — verify legal entity status, registered address, active officers.
- Credit & counterparty risk — track status changes (dissolved, in liquidation) and filed financials.
- Sales intelligence / ABM — segment UK companies by SIC code, incorporation date, jurisdiction.
- AML & investigations — traverse director networks to surface shared control across entities.
- Public sector / research — longitudinal analysis of UK corporate demography.
## Refresh cadence
- Source ingestion: continuous (Companies House streaming API).
- Iceberg MERGE: daily (UTC).
- Typical freshness: < 24 hours for new events; last_updated on every row.
## Data provenance & licensing
- Source: Companies House public data (Crown Copyright, Open Government Licence v3.0).
- Processing: parsed iXBRL/JSON, normalized into four joinable dimensions/facts; no enrichment or inference — every row traces back to a filed event.
## Coverage
UK-registered companies (England & Wales, Scotland, Northern Ireland). Includes active and dissolved entities within the Companies House history window.
提供机构:
Redbeard Analytics
创建时间:
2026-04-17
原始信息汇总
数据集概述:Companies House
- 提供商:Redbeard Analytics
- 价格:免费
- 访问权限:无限访问
- 交付方式:安全共享
数据集描述
该数据集提供英国公司注册处(Companies House)的准实时快照,以 Apache Iceberg 表的形式存储在 S3 上,并通过 Snowflake 的 Glue 目录集成进行就地查询。消费者无需维护数据副本或 ETL 流程。
包含内容
数据集包含四个规范化、可连接的表:
company_info:每行对应一家活跃或历史英国公司。- 字段:
company_number、company_name、company_status、company_type、date_of_creation、jurisdiction、注册办公地址字段、sic_codes、last_updated。
- 字段:
financials:每行对应一个已提交的会计事实。- 字段:
company_number、transaction_id、accounting_period_start、accounting_period_end、fact_name、current_year_value、previous_year_value、currency、doc_type、last_updated。
- 字段:
officers:每行对应一名高管(董事、秘书等)。- 字段:
officer_id、name、nationality、occupation、date_of_birth_year、date_of_birth_month、country_of_residence、last_updated。
- 字段:
officer_appointments:每行对应一次任职记录(高管 ↔ 公司)。- 字段:
company_number、officer_id、appointment_id、officer_role、appointed_on、resigned_on、is_active、last_updated。
- 字段:
用途与价值
- 构建完整关系图:通过
officers↔officer_appointments↔company_info在 SQL 中查询英国董事网络的完整图谱。 - 标准化财务数据:从提交的年度账目中提取结构化的 iXBRL 事实(收入、资产、负债、员工等),无需解析 HTML。
- 时间旅行查询:基于 Apache Iceberg,可查询历史快照(使用
AT (TIMESTAMP => ...)),审计变更,并在一致的时间点安全地重新运行连接。 - 持续更新:直接来源于 Companies House 流式 API,每日增量合并。
last_updated列支持消费者端实现变更数据捕获。 - 性能分区:以公司为中心的表按
company_number分桶;financials表按accounting_period_end年份分区,以便高效过滤时间段。
业务需求
- 客户获取:验证法律实体状态、注册地址和活跃高管。
使用案例
- KYC / KYB 入职:验证法律实体状态、注册地址、活跃高管。
- 信用与交易对手风险:追踪状态变更(解散、清算)和已提交的财务数据。
- 销售情报 / ABM:按 SIC 代码、成立日期、管辖权细分英国公司。
- 反洗钱与调查:遍历董事网络以发现跨实体的共同控制关系。
- 公共部门/研究:对英国企业人口统计进行纵向分析。
刷新频率
- 源数据摄取:持续进行(使用 Companies House 流式 API)。
- Iceberg 合并:每日(UTC 时间)。
- 典型新鲜度:新事件更新延迟小于 24 小时;每行均有
last_updated字段。
数据来源与许可
- 来源:Companies House 公共数据(Crown Copyright, Open Government Licence v3.0)。
- 处理方式:解析 iXBRL/JSON,标准化为四个可连接的维度/事实表;无数据增强或推断,每行数据均可追溯到已提交的事件。
覆盖范围
- 涵盖:英国注册公司(英格兰和威尔士、苏格兰、北爱尔兰),包括活跃和已解散的实体。
类别
- 类别:商业、金融服务、客户获取、财务
Cortex AI 就绪
- 是
更新时间
- 每日
时间覆盖范围
- 过去 12 个月
- 基于事件
法律条款
- 标准



