子群分析
虚假关系
样本量测定
医学
临床试验
统计
荟萃分析
数学
内科学
作者
Sara Brookes,Elise Whitely,Matthias Egger,George Davey Smith,Paul A. Mulheran,T. J. Peters
标识
DOI:10.1016/j.jclinepi.2003.08.009
摘要
Abstract Objective Despite guidelines recommending the use of formal tests of interaction in subgroup analyses in clinical trials, inappropriate subgroup-specific analyses continue. Moreover, trials designed to detect overall treatment effects have limited power to detect treatment–subgroup interactions. This article quantifies the error rates associated with subgroup analyses. Study design and setting Simulations quantified the risks of misinterpreting subgroup analyses as evidence of differential subgroup effects and the limited power of the interaction test in trials designed to detect overall treatment effects. Results Although formal interaction tests performed as expected with respect to false positives, subgroup-specific tests were considerably less reliable: A significant effect in one subgroup only was observed in 7% to 64% of simulations depending on trial characteristics. Regarding power of the interaction test, a trial with 80% power for the overall effect had only 29% power to detect an interaction effect of the same magnitude. For interactions of this size to be detected with the same power as the overall effect, sample sizes should be inflated fourfold, increasing dramatically for interactions smaller than 20% of the overall effect. Conclusion Although it is generally recognized that subgroup analyses can produce spurious results, the extent of the problem may be underestimated.
科研通智能强力驱动
Strongly Powered by AbleSci AI