人工智能
模式识别(心理学)
聚类分析
相似性(几何)
核(代数)
支持向量机
概率逻辑
计算机科学
相似性度量
数学
图像(数学)
组合数学
作者
Li Zhan,Guohua Geng,Jufu Feng,Jinye Peng,Chao Wen,Junli Liang
标识
DOI:10.1016/j.patrec.2013.11.013
摘要
Previous studies on multiple instance learning (MIL) have shown that the MIL problem holds three characteristics: positive instance clustering, bag structure and instance probabilistic influence to bag label. In this paper, combined with the advantages of these three characteristics, we propose two simple yet effective MIL algorithms, CK_MIL and ck_MIL. We take three steps to convert MIL to a standard supervised learning problem. In the first step, we perform K-means clustering algorithm on the positive and negative sets separately to obtain the cluster centers, further use them to select the most positive instances in bags. Next, we combine three distances, including the maximum, minimum and the average distances from bag to cluster centers, as bag structure. For CK_MIL, we simply compose the positive instance and bag structure to form a new vector as bag representation, then apply RBF kernel to measure bag similarity, while for ck_MIL algorithm we construct a new kernel by introducing a probabilistic coefficient to balance the influences between the positive instance similarity and bag structure similarity. As a result, the MIL problem is converted to a standard supervised learning problem that can be solved directly by SVM method. Experiments on MUSK and COREL image set have shown that our two algorithms perform better than other key existing MIL algorithms on the drug prediction and image classification tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI