Data Fission: Splitting a Single Data Point
收藏Taylor & Francis Group2023-12-14 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Data_fission_splitting_a_single_data_point/24328745/2
下载链接
链接失效反馈官方服务:
资源简介:
Suppose we observe a random vector <i>X</i> from some distribution in a known family with unknown parameters. We ask the following question: when is it possible to split <i>X</i> into two pieces <i>f</i>(<i>X</i>) and <i>g</i>(<i>X</i>) such that neither part is sufficient to reconstruct X by itself, but both together can recover X fully, and their joint distribution is tractable? One common solution to this problem when multiple samples of X are observed is data splitting, but Rasines and Young offers an alternative approach that uses additive Gaussian noise—this enables post-selection inference in finite samples for Gaussian distributed data and asymptotically when errors are non-Gaussian. In this article, we offer a more general methodology for achieving such a split in finite samples by borrowing ideas from Bayesian inference to yield a (frequentist) solution that can be viewed as a continuous analog of data splitting. We call our method data fission, as an alternative to data splitting, data carving and <i>p</i>-value masking. We exemplify the method on several prototypical applications, such as post-selection inference for trend filtering and other regression problems, and effect size estimation after interactive multiple testing. Supplementary materials for this article are available online.
提供机构:
Duan, Boyan; Leiner, James; Ramdas, Aaditya; Wasserman, Larry
创建时间:
2023-12-14



