taXonomy
收藏Databricks2024-08-16 收录
下载链接:
https://marketplace.databricks.com/details/cb8a4cdf-64a3-42d5-b15e-6447f8482d20/HealthVerity_taXonomy
下载链接
链接失效反馈官方服务:
资源简介:
**Overview**
HealthVerity taXonomy is the nation's most comprehensive, consistent, and curated clinical dataset designed to power health economics, outcomes and real-world safety studies.
**Key Features**
1. Comprehensive: The broadest payer claims coverage ever assembled
HealthVerity taXonomy consists of over 245 million covered lives. This is the broadest coverage of any payer claims dataset ever assembled and includes patient data for up to nine years. Longitudinality is critical, particularly when working with rare diseases where you need to be able to discover a particular population with increasingly precise and narrow criteria. This vast closed claims dataset brings together all of the attributes you would expect, plus other elements, such as race and mortality, and represents all age groups, races and geographies.
2. Consistent: Manage data discontinuity
With over 225 payers represented in HealthVerity taXonomy, we see across a broad number of dimensions and ensure a diverse and representative sample that includes commercial, Medicare and Medicaid. Additionally, our data provider matrix maximizes stability in data supply, offering continuity in a market where some long-standing data sources are being retracted almost overnight. That’s because HealthVerity has created a bi-directional trust network between our data partners and end researchers with transparency and clarity into data provenance, data rights and use applications. This visibility allows data partners to be confident on where their data is being delivered and how it is being used. With our open architecture, this stable closed claims dataset provides a foundational view that also allows for new payers to consistently join the dataset.
3. Curated: Research ready from day one
With the feedback that we received from our top pharma clients, we decided to simplify, consolidate and curate our data model and HealthVerity taXonomy does exactly that. It’s highly curated and de-duplicated across sources, providing a visit-centric view that includes standardized costs and reference tables. This new approach pulls together a comprehensive view of patient journeys while still being within the privacy-compliant framework that allows for additional data to be synchronized from across HealthVerity Marketplace, such as EMR, lab results and open claims data, all within a single contract. This eliminates any concern about needing to perform an independent privacy review or taking months to assemble a multi-data-source cohort.
4. Streamlined and simplified data model
We spent extensive time synchronizing the data to inform a thoughtful and meaningful design with 11 pre-assembled, purpose-built and curated tables that allow you to focus on your research while avoiding the struggle and frustration of a traditional claims layout. This simplified data model provides everything you need across medical and pharmacy claims, enrollment, demographics, vital statuses, and reference tables from CMS and FDA. By de-duplicating records and prioritizing completeness of records, we are providing the most comprehensive view of patient visits and encounters. Diagnoses, procedures and prescriptions can easily be contextualized by the provided reference tables. Enrollment and remittance data are organized by payer type and Census region, allowing you to focus your cost analysis on specific groups of people and parts of the country. Instead of needing to contract and link with a number of different providers in a potentially weeks or months long process, we provide demographics and month of death as part of HealthVerity taXonomy, sharply reducing time to insights for overall survival outcomes research.
5. More value with standardized costs
By introducing remit data, we fill the gaps in legacy costs data, providing the result of real-world payer and provider negotiations, not the requested reimbursement amount normally found in open claims. By mapping the remit data to procedures, we are also able to calculate and understand median negotiated rates across hundreds of payers, instead of a single payer or small mix of payers where the costs are subject to their specific formularies and limited to the geographies and pay types they serve.
This approach to cost data will be stored in the procedure table, next to the payer submitted allotment amount, which gives you optionality and the ability to compare and diversify your sources for cost data. For example, in reviewing a single procedure code over the last three years, we see a 10% difference between actual cost and the calculated standard market cost. Providing both side by side, we offer options to take analyses to the next level. The standard market cost is not imputed, it’s the real cost in the real world. It’s a median across visit, year, pay type and Census region, thus making it more representative of the U.S. population receiving care and the most unbiased, real-market cost.
**Additional Details**
This is only a sample dataset for taXonomy representing a small portion of the volume of Real-world Data that is available in this Data Product. This sample dataset is available for free. For access to the full dataset, HealthVerity would welcome the opportunity to learn more about your project and discuss licensing. To get started, please contact us through https://www.healthverity.com/contact/
**概述**
HealthVerity taXonomy是美国当前最全面、最具一致性且经过精心整理(curated)的临床数据集,旨在为卫生经济学、临床疗效研究与真实世界安全性研究提供数据支撑。
**核心特性**
1. 全面覆盖:整合迄今最广泛的支付方索赔(payer claims)数据覆盖范围
HealthVerity taXonomy覆盖超过2.45亿参保人员,是目前已整合的支付方索赔数据集中覆盖范围最广的数据集,包含最长达9年的患者数据。纵向数据追踪能力至关重要,尤其针对罕见病研究——研究人员需通过愈发精准、细化的筛选标准定位特定患者群体。该庞大的闭合式索赔数据集(closed claims dataset)不仅包含所有核心属性,还补充了种族、死亡率等维度信息,覆盖全年龄段、各种族及所有地理区域。
2. 统一规范:解决数据间断性问题
HealthVerity taXonomy涵盖超过225家支付方,覆盖多维度数据,确保样本具备多样性与代表性,包含商业保险、联邦医疗保险(Medicare)及医疗补助计划(Medicaid)三类参保群体。此外,我们的数据供应商矩阵最大化了数据供给的稳定性,在当前部分老牌数据源几乎一夜之间被撤回的市场环境下,保障了数据连续性。这得益于HealthVerity构建了数据合作方与终端研究人员之间的双向信任网络,对数据来源、数据权限及使用场景均具备透明清晰的披露机制。这种透明度让数据合作方能清晰知晓其数据的交付去向与使用方式。依托开放式架构,该稳定的闭合式索赔数据集不仅提供基础数据视图,还可支持新的支付方持续接入数据集。
3. 专业整理:开箱即可用于研究
我们基于顶尖制药客户的反馈,对数据模型进行了简化、整合与专业整理(curated),HealthVerity taXonomy正是这一工作的成果。该数据集经过高度整理并跨源去重,提供以就诊为中心的数据视图,包含标准化成本与参考表。这一新的整合方案全面呈现了患者诊疗历程,同时符合隐私合规框架,可在单一合同框架下同步来自HealthVerity Marketplace的各类数据,例如电子病历(Electronic Medical Record, EMR)、检验结果与公开索赔数据。此举无需再开展独立的隐私审查,也无需花费数月时间整合多数据源队列,彻底解决了相关顾虑。
4. 精简简化的数据模型
我们投入大量时间对数据进行同步优化,打造兼具思考性与实用性的设计,包含11张预组装、专为研究定制且经过专业整理的表格,让研究人员可专注于自身研究,无需应对传统索赔数据布局带来的繁琐与困扰。该简化后的数据模型涵盖医疗与药房索赔、参保登记、人口统计学信息、生命状态以及来自美国医疗保险与医疗补助服务中心(Centers for Medicare & Medicaid Services, CMS)和美国食品药品监督管理局(Food and Drug Administration, FDA)的参考表等所有所需内容。通过对记录进行去重并优先保障记录完整性,我们提供了最全面的患者就诊与就医事件视图。诊断、手术操作与处方信息均可通过配套参考表轻松实现上下文关联。参保登记与付款数据按支付方类型及人口普查区域(Census region)进行组织,方便研究人员针对特定群体及特定地区开展成本分析。我们已将人口统计学信息与死亡月份纳入HealthVerity taXonomy中,无需再通过数周或数月的流程与多家供应商签订合同并进行数据关联,大幅缩短了总生存期疗效研究的洞察产出时间。
5. 标准化成本带来更多价值
通过引入理赔结算数据(remit data),我们填补了传统成本数据的空白,提供的是真实世界中支付方与医疗服务提供商协商后的实际结算结果,而非公开索赔数据中常见的申请报销金额。我们将理赔结算数据与手术操作代码进行映射,因此能够计算并了解数百家支付方的协商费率中位数,而非仅基于单个或少数几家支付方的数据——这类数据往往受限于其特定的药品目录,且仅覆盖其服务的地理区域与支付类型。
该成本数据处理方式将存储在手术操作表中,与支付方提交的拨款金额并列呈现,为研究人员提供灵活性,可对比多样化的成本数据源并进行优化。例如,在回顾过去三年的单一手术操作代码时,我们发现实际成本与计算得出的标准市场成本之间存在10%的差异。我们同时提供两类数据,为研究分析提供升级选项。标准市场成本并非估算值,而是真实世界中的实际成本——它是基于就诊、年份、支付类型及人口普查区域计算得出的中位数,因此更能代表接受诊疗的美国人群,是最具无偏性的真实市场成本。
**补充说明**
本数据集仅为taXonomy的示例数据集,仅代表该数据产品中可用真实世界数据的一小部分。该示例数据集可免费获取。如需获取完整数据集,HealthVerity期待了解您的研究项目并探讨授权合作事宜。如需启动合作,请通过https://www.healthverity.com/contact/联系我们。
提供机构:
HealthVerity
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



