five

A Geometric Algorithm for Contrastive Principal Component Analysis in High Dimension

收藏
Taylor & Francis Group2024-01-08 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/A_Geometric_Algorithm_for_Contrastive_Principal_Component_Analysis_in_High_Dimension/24712600/1
下载链接
链接失效反馈
官方服务:
资源简介:
Principal component analysis (PCA) has been widely used in exploratory data analysis. Contrastive PCA (Abid et al.), a generalized method of PCA, is a new tool used to capture features of a target dataset relative to a background dataset while preserving the maximum amount of information contained in the data. With high dimensional data, contrastive PCA becomes impractical due to its high computational requirement of forming the contrastive covariance matrix and associated eigenvalue decomposition for extracting leading components. In this article, we propose a geometric curvilinear-search method to solve this problem and provide a convergence analysis. Our approach offers significant computational efficiencies. Specifically, it reduces the time complexity from O((n∨m)p2) to a more manageable O((n∨m)pr), where <i>n</i>, <i>m</i> are the sample sizes of the target data and background data, respectively, <i>p</i> is the data dimension and <i>r</i> is the number of leading components. Additionally, we streamline the space complexity from O(p2), necessary for storing the contrastive covariance matrix, to a more economical O((n∨m)p), sufficient for storing the data alone. Numerical examples are presented to show the merits of the proposed algorithm. Supplementary materials for this article are available online.
提供机构:
Wang, Shao-Hsuan; Lu, Rung-Sheng; Huang, Su-Yun
创建时间:
2023-12-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作