共同训练
计算机科学
半监督学习
人工智能
机器学习
分类器(UML)
利用
监督学习
标记数据
模式识别(心理学)
人工神经网络
训练集
任务(项目管理)
计算机安全
管理
经济
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2005-09-26
卷期号:17 (11): 1529-1541
被引量:1109
标识
DOI:10.1109/tkde.2005.186
摘要
In many practical data mining applications, such as Web page classification, unlabeled training examples are readily available, but labeled ones are fairly expensive to obtain. Therefore, semi-supervised learning algorithms such as co-training have attracted much attention. In this paper, a new co-training style semi-supervised learning algorithm, named tri-training, is proposed. This algorithm generates three classifiers from the original labeled example set. These classifiers are then refined using unlabeled examples in the tri-training process. In detail, in each round of tri-training, an unlabeled example is labeled for a classifier if the other two classifiers agree on the labeling, under certain conditions. Since tri-training neither requires the instance space to be described with sufficient and redundant views nor does it put any constraints on the supervised learning algorithm, its applicability is broader than that of previous co-training style algorithms. Experiments on UCI data sets and application to the Web page classification task indicate that tri-training can effectively exploit unlabeled data to enhance the learning performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI