Balancing Errors as an Approach Towards Better Use of Larger Samples in Psychological Research

Name: Balancing Errors as an Approach Towards Better Use of Larger Samples in Psychological Research
Creator: ZPID (Leibniz Institute for Psychology Information)
Published: 2019-04-03 12:29:04
License: 暂无描述

PsychArchives2019-04-03 更新2026-04-25 收录

下载链接：

https://hdl.handle.net/20.500.12034/2028

下载链接

链接失效反馈

官方服务：

资源简介：

A well-documented concern about psychological research that has been frequently stressed during the current replicability crisis maintains that studies are often based on too small samples and tests thus yield insufficient statistical power. Correspondingly, it has been repeatedly emphasized that a vital step in overcoming the low replicability of psychological studies is to recruit larger samples whenever possible. However, despite the indubitable importance of larger samples – and the value of corresponding policy changes by editors and reviewers – increasing power will, all else being equal, only reduce one type of error (namely, β) whereas α is held constant at 5%. As a consequence, errors may be severely imbalanced, which is problematic for at least two reasons: First, retaining imbalanced errors implicitly assigns greater importance to one error over the other by affecting the “relative seriousness” of errors. In the extreme, increasing sample sizes and thus statistical power will inadvertently assign greater seriousness to β than to α if the latter is held constant at .05. Second, and more importantly, fixing α at .05 ultimately means that the statistical test cannot achieve consistency, meaning that tests will not point to the true state of the world even in the large sample limit. By implication, the conclusiveness of (non-)significant results will remain limited despite large samples and high power. To demonstrate this, we conducted two simulations comparing the Positive Predictive Value (PPV) and the proportion of correct inferences implied by fixed α versus balanced errors (i.e., α = β). For PPV, simulations showed that once the sample size is sufficiently large to render β < α (i.e., 1–β > .95), adjusting α corresponding to β results in higher PPV than holding α fixed at .05, irrespective of the probability p(H1) that the alternative hypothesis is true. For the proportion of correct inferences, in turn, results imply that balanced errors are to be preferred over fixed α in two situations: (1) whenever β < α (i.e., as soon as the sample size yields β < .05) which holds practically independent of p(H1) and (2) whenever p(H1) > .50, practically irrespective of the absolute magnitude of α and β. Fixing α = .05, by contrast, is only superior whenever β > α and p(H1) < .50, that is, whenever statistical power is not entirely satisfactory and the alternative hypothesis is known to be less likely to hold than the null. Overall, we therefore advocate for extending the calls for higher statistical power by also calling for balanced errors based on straightforward compromise power analyses if samples are large. In other words, to fully exploit the advantages of large samples and to render statistical tests consistent, researchers should not blindly replace a general lack of power with increasingly imbalanced errors, but instead strive for smaller error probabilities in general.

提供机构：

ZPID (Leibniz Institute for Psychology Information)

创建时间：

2019-04-03

5,000+

优质数据集

54 个

任务类型

进入经典数据集