five

A New Projection Pursuit Index for Big Data

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/A_New_Projection_Pursuit_Index_for_Big_Data/28462333
下载链接
链接失效反馈
官方服务:
资源简介:
Visualization of extremely large datasets, whether in static or dynamic form, poses a significant challenge due to the limitations of most traditional methods in handling big-data problems. To address this challenge, a novel visualization approach for big data is proposed based on Projection Pursuit, Grand and Guided Tours, and Data Nuggets methods. The aim of this new methodology is to discover hidden structures such as clusters, outliers, and other nonlinear structures within large datasets. The Guided Tour, a dynamic graphical tool for high-dimensional data, integrates Projection Pursuit and Grand Tour techniques to present a dynamic sequence of low-dimensional projections obtained by Projection Pursuit index functions to navigate the data space. While various Projection Pursuit indices have been developed in the past, computational constraints arise when applying the original Guided Tour approach to big-data scenarios. A new PP index is developed to be computable for big data, with the help of a data compression method called “Data Nuggets” that reduces large datasets while maintaining the original data structure. The effectiveness of the proposed methodology is demonstrated on simulated datasets. A big data application is presented to illustrate the new method in the real world. The development of static and dynamic graphical tools based on the proposed Projection Pursuit index holds promise for detecting nonlinear structures within big-data contexts, offering valuable insights for big-data analysis.
创建时间:
2025-02-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作