计算机科学
回归分析
回归
多项式分布
数据挖掘
机器学习
多元统计
预测建模
计数数据
人工智能
Dirichlet分布
多元自适应回归样条
多项式logistic回归
统计
贝叶斯多元线性回归
数学
数学分析
泊松分布
边值问题
作者
Pantea Koochemeshkian,Nuha Zamzami,Nizar Bouguila
标识
DOI:10.1080/01969722.2020.1758464
摘要
Data mining techniques have been successfully utilized in different applications of significant fields, including medical research. With the wealth of data available within the health-care systems, there is a lack of practical analysis tools to discover hidden relationships and trends in data. The complexity of medical data that is unfavorable for most models is a considerable challenge in prediction. The ability of a model to perform accurately and efficiently in disease diagnosis is extremely significant. Thus, the model must be selected to fit the data better, such that the learning from previous data is most efficient, and the diagnosis of the disease is highly accurate. This work is motivated by the limited number of regression analysis tools for multivariate counts in the literature. We propose two regression models for count data based on flexible distributions, namely, the multinomial Beta-Liouville and multinomial scaled Dirichlet, and evaluated the proposed models in the problem of disease diagnosis. The performance is evaluated based on the accuracy of the prediction which depends on the nature and complexity of the dataset. Our results show the efficiency of the two proposed regression models where the prediction performance of both models is competitive to other previously used regression models for count data and to the best results in the literature.
科研通智能强力驱动
Strongly Powered by AbleSci AI