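Machine learning
Computer science
Standardization
Artificial intelligence
Cohen's kappa
Classifier (UML)
Binary classification
Algorithm
Binary number
Data mining
Class (philosophy)
Mathematics
Support vector machine
Arithmetic
Operating system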
Authors
Gireen Naidu, Tranos Zuva, Elias Mmbongeni Sibanda
Source
Journal: Lecture Notes in Networks and Systems
Date: 2023-01-01
Pages: 15-25
Citations: 21
Identifier
DOI: 10.1007/978-3-031-35314-7_2
Abstract
With the increasing adoption of machine learning algorithms across multiple sectors, accurate measurement and assessment are imperative, especially when classifiers are applied to real-world applications. Which evaluation metrics are most appropriate for assessing the performance of binary, multi-class and multi-labelled classifiers needs to be better understood. Another significant challenge for research is that results from similar models cannot be adequately compared if the criteria for measuring and evaluating these models are not standardized. This review paper highlights the various evaluation metrics applied in research and the lack of standardization in the metrics used to measure a model's classification results. Although Accuracy, Precision, Recall and F1-Score are the most widely applied evaluation metrics, each has limitations when considered in isolation. Other metrics such as ROC/AUC and the Kappa statistic have been shown to provide additional insight into an algorithm's adequacy and should also be considered when evaluating binary, multi-class and multi-labelled classifiers. The adoption of a standardized and consistent evaluation methodology should be explored as an area of future work.
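As an illustration of why the abstract cautions against reading Accuracy in isolation, the following minimal sketch computes Accuracy, Precision, Recall, F1-Score and Cohen's kappa from the four counts of a binary confusion matrix. It is not taken from the paper: the function name `binary_metrics` and the example counts are hypothetical, chosen only to show an imbalanced case.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute common binary-classification metrics from confusion-matrix counts.

    tp, fp, fn, tn are the true-positive, false-positive, false-negative
    and true-negative counts. Ratios with a zero denominator fall back to 0.0.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    # Cohen's kappa: observed agreement corrected for the agreement
    # expected by chance from the marginal totals.
    p_observed = accuracy
    p_chance = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / total ** 2
    kappa = (p_observed - p_chance) / (1 - p_chance) if p_chance != 1 else 0.0

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "kappa": kappa,
    }


# Hypothetical imbalanced example: 50 positives and 950 negatives,
# with a classifier that finds only 5 of the positives.
metrics = binary_metrics(tp=5, fp=5, fn=45, tn=945)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With these assumed counts, Accuracy is 0.95 while Recall is only 0.10 and kappa is roughly 0.15, which is the kind of gap that motivates reporting several metrics together.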