朴素贝叶斯分类器
人工智能
计算机科学
概率分类
多项式分布
分类器(UML)
贝叶斯概率
机器学习
贝叶斯分类器
模式识别(心理学)
概率逻辑
贝叶斯定理
自然语言处理
数学
支持向量机
统计
标识
DOI:10.1177/0165551516677946
摘要
Text classification is the task of assigning predefined categories to natural language documents, and it can provide conceptual views of document collections. The Naïve Bayes (NB) classifier is a family of simple probabilistic classifiers based on a common assumption that all features are independent of each other, given the category variable, and it is often used as the baseline in text classification. However, classical NB classifiers with multinomial, Bernoulli and Gaussian event models are not fully Bayesian. This study proposes three Bayesian counterparts, where it turns out that classical NB classifier with Bernoulli event model is equivalent to Bayesian counterpart. Finally, experimental results on 20 newsgroups and WebKB data sets show that the performance of Bayesian NB classifier with multinomial event model is similar to that of classical counterpart, but Bayesian NB classifier with Gaussian event model is obviously better than classical counterpart.
科研通智能强力驱动
Strongly Powered by AbleSci AI