five

Imputation of Mixed Data With Multilevel Singular Value Decomposition

收藏
Taylor & Francis Group2019-10-25 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/Imputation_of_Mixed_Data_With_Multilevel_Singular_Value_Decomposition/7827983/1
下载链接
链接失效反馈
官方服务:
资源简介:
Statistical analysis of large datasets offers new opportunities to better understand underlying processes. Yet, data accumulation often implies relaxing acquisition procedures or compounding diverse sources. As a consequence, datasets often contain mixed data, that is, both quantitative and qualitative, and many missing values. Furthermore, aggregated data present a natural <i>multilevel</i> structure, where individuals or samples are nested within different sites, such as countries or hospitals. Imputation of multilevel data has therefore drawn some attention recently, but current solutions are not designed to handle mixed data, and suffer from important drawbacks, such as their computational cost. In this article, we propose a single imputation method for multilevel data, which can be used to complete either quantitative, categorical, or mixed data. The method is based on multilevel singular value decomposition (SVD), which consists in decomposing the variability of the data into two components, the between and within groups variability, and performing an SVD on both parts. We show on a simulation study that in comparison to competitors, the method has the advantages of handling datasets of various size, and being computationally faster. Furthermore, it is the first so far to handle mixed data. We apply the method to impute a medical dataset resulting from the aggregation of several hospitals datasets. This application falls in the framework of a larger project on Trauma patients. To overcome obstacles associated to the aggregation of medical data, we turn to distributed computation. The method is implemented in the R package <i>missMDA</i>. Supplementary materials for this article are available online.
提供机构:
Balasubramanian Narasimhan
创建时间:
2019-10-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作