Count data
Nonparametric statistics
Bayesian probability
Statistics
Computer science
Econometrics
Mathematics
Poisson distribution
Authors
Serena Arima, Silvia Polettini, Giuseppe Pasculli, Loreto Gesualdo, Francesco Pesce, Deni-Aldo Procaccini
Source
Journal: Biostatistics
[Oxford University Press]
Date: 2023-09-16
Volume (issue): 25 (3): 904-918
Identifier
DOI: 10.1093/biostatistics/kxad027
Abstract
We propose a nonparametric compound Poisson model for underreported count data that introduces a latent clustering structure for the reporting probabilities. The latter are estimated with the model’s parameters based on experts’ opinion and exploiting a proxy for the reporting process. The proposed model is used to estimate the prevalence of chronic kidney disease in Apulia, Italy, based on a unique statistical database covering information on m = 258 municipalities obtained by integrating multisource register information. Accurate prevalence estimates are needed for monitoring, surveillance, and management purposes; yet, counts are deemed to be considerably underreported, especially in some areas of Apulia, one of the most deprived and heterogeneous regions in Italy. Our results agree with previous findings and highlight interesting geographical patterns of the disease. We compare our model to existing approaches in the literature using simulated as well as real data on early neonatal mortality risk in Brazil, described in previous research: the proposed approach proves to be accurate and particularly suitable when partial information about data quality is available.
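To make the idea of underreported counts concrete, the following is a minimal sketch of generic Poisson thinning, not the authors' model: true counts are drawn from a Poisson distribution and each case is reported independently with an area-specific probability, so the expected reported count equals the true rate times the reporting probability. The rates, the two reporting clusters, the cluster labels, and all function names are hypothetical values chosen for illustration; the nonparametric prior, the expert opinion, and the proxy for the reporting process mentioned in the abstract are not reproduced here.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson(rate) variate with Knuth's method (adequate for small rates)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate_reported_counts(true_rates, reporting_probs, seed=0):
    """Simulate true counts and their binomially thinned (reported) counts per area."""
    rng = random.Random(seed)
    true_counts, reported_counts = [], []
    for rate, prob in zip(true_rates, reporting_probs):
        t = sample_poisson(rate, rng)
        # Each true case is reported independently with probability `prob`.
        y = sum(1 for _ in range(t) if rng.random() < prob)
        true_counts.append(t)
        reported_counts.append(y)
    return true_counts, reported_counts


def corrected_estimate(reported_count, reporting_prob):
    """Simple correction: E[Y] = rate * prob, so the rate is estimated by Y / prob."""
    return reported_count / reporting_prob


# Hypothetical setup: 6 areas, two latent reporting clusters (0.9 and 0.5).
true_rates = [20.0, 35.0, 15.0, 40.0, 25.0, 30.0]
cluster_probs = {0: 0.9, 1: 0.5}
cluster_labels = [0, 0, 1, 1, 0, 1]
reporting_probs = [cluster_probs[c] for c in cluster_labels]

true_counts, reported_counts = simulate_reported_counts(
    true_rates, reporting_probs, seed=1
)
estimates = [corrected_estimate(y, p) for y, p in zip(reported_counts, reporting_probs)]
```

In this sketch the reporting probabilities are treated as known, which is exactly what is not available in practice; the correction `Y / prob` only shows why some outside information about reporting quality is needed to separate the true rate from the reporting probability.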