计算机科学
朴素贝叶斯分类器
决策树
度量(数据仓库)
接收机工作特性
算法
贝叶斯定理
人工智能
机器学习
数据挖掘
支持向量机
贝叶斯概率
作者
Jin Huang,Charles X. Ling
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2005-03-01
卷期号:17 (3): 299-310
被引量:1721
摘要
The area under the ROC (receiver operating characteristics) curve, or simply AUC, has been traditionally used in medical diagnosis since the 1970s. It has recently been proposed as an alternative single-number measure for evaluating the predictive ability of learning algorithms. However, no formal arguments were given as to why AUC should be preferred over accuracy. We establish formal criteria for comparing two different measures for learning algorithms and we show theoretically and empirically that AUC is a better measure (defined precisely) than accuracy. We then reevaluate well-established claims in machine learning based on accuracy using AUC and obtain interesting and surprising new results. For example, it has been well-established and accepted that Naive Bayes and decision trees are very similar in predictive accuracy. We show, however, that Naive Bayes is significantly better than decision trees in AUC. The conclusions drawn in this paper may make a significant impact on machine learning and data mining applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI