计算机科学
互联网
联动装置(软件)
匹配(统计)
代表(政治)
任务(项目管理)
特征(语言学)
班级(哲学)
光学(聚焦)
社交网络(社会语言学)
情报检索
人工智能
身份(音乐)
机器学习
万维网
社会化媒体
法学
管理
化学
物理
经济
哲学
光学
统计
基因
政治
生物化学
语言学
数学
声学
政治学
作者
Yuanyuan Qiao,Yuewei Wu,Fan Duo,Wen‐Hui Lin,Jie Yang
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2019-08-13
卷期号:31 (8): 2741-2751
被引量:26
标识
DOI:10.1109/tnnls.2019.2929575
摘要
Linking online identities of users among countless heterogeneous network services on the Internet can provide an explicit digital representation of users, which can benefit both research and industry. In recent years, user identity linkage (UIL) through the Internet has become an emerging task with great potential and many challenges. Existing works mainly focus on online social networks that consider inconsistent profiles, content, and networks as features or use sparse location-based data sets to link the online behaviors of a real person. To extend the UIL problem to a general scenario, we try to link the web-browsing behaviors of users, which can help to distinguish specific users from others, such as children or malicious users. More specifically, we propose a Siamese neural network (NN) architecture-based UIL (SAUIL) model that learns and compares the highest-level feature representation of input web-browsing behaviors with deep NNs. Although the number of matching and nonmatching pairs for the UIL problem is highly imbalanced, previous studies have not considered imbalanced UIL data sets. Therefore, we further address the imbalanced learning issue by proposing cost-sensitive SAUIL (C-SAUIL) model, which assumes higher costs for misclassifying the minority class. In the experiments, the proposed model is robust and exhibits a good performance on very large, real-world data sets collected from different regions with distinct characteristics.
科研通智能强力驱动
Strongly Powered by AbleSci AI