Predictive Subdata Selection for Computer Models
收藏Taylor & Francis Group2022-07-25 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Predictive_Subdata_Selection_for_Computer_Models/20263820/1
下载链接
链接失效反馈官方服务:
资源简介:
An explosion in the availability of rich data from the technological advances is hindering efforts at statistical analysis due to constraints on time and memory storage, regardless of whether researchers employ simple methods (e.g., linear regression) or complex models (e.g., Gaussian processes). A recent approach to overcoming these limits involves information-based optimal subdata selection and Latin hypercube subagging. In the current study, we develop a novel subdata selection method for large-scale computer models based on expected improvement optimization. Numerical and empirical analysis using real-world data are used to select subdata by which to derive accurate predictions. During the optimization procedure, the proposed scheme employs the geometry of the input feature region as well as information related to output values. The data points associated with the largest improvement in prediction accuracy are combined in the construction of a subdataset that can be used to formulate predictions with affordable computing time. Supplementary materials for this article, including proofs of theorems and additional numerical results, are available online.
提供机构:
Chang, Ming-Chung
创建时间:
2022-07-07



