特征选择
人工智能
机器学习
随机森林
计算机科学
二元分类
分类器(UML)
特征工程
特征(语言学)
甲状腺炎
甲状腺疾病
模式识别(心理学)
支持向量机
疾病
甲状腺
深度学习
医学
病理
语言学
哲学
内科学
作者
Rajasekhar Chaganti,Furqan Rustam,Isabel de la Torre Díez,Juan Luís Vidal Mazón,Carmen Lili Rodríguez,Imran Ashraf
出处
期刊:Cancers
[MDPI AG]
日期:2022-08-13
卷期号:14 (16): 3914-3914
被引量:55
标识
DOI:10.3390/cancers14163914
摘要
Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto's thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI