计算机科学
随机游动
杠杆(统计)
多路复用
利用
理论计算机科学
人工智能
生物
生物信息学
数学
计算机安全
统计
作者
Alberto Valdeolivas,Laurent Tichit,Claire Navarro,Sophie Perrin,Gaëlle Odelin,Nicolas Lévy,Pierre Cau,Élisabeth Rémy,Anaı̈s Baudot
摘要
ABSTRACT Recent years have witnessed an exponential growth in the number of identified interactions between biological molecules. These interactions are usually represented as large and complex networks, calling for the development of appropriated tools to exploit the functional information they contain. Random walk with restart is the state-of-the-art guilt-by-association approach. It explores the network vicinity of gene/protein seeds to study their functions, based on the premise that nodes related to similar functions tend to lie close to each others in the networks. In the present study, we extended the random walk with restart algorithm to multiplex and heterogeneous networks. The walk can now explore different layers of physical and functional interactions between genes and proteins, such as protein-protein interactions and co-expression associations. In addition, the walk can also jump to a network containing different sets of edges and nodes, such as phenotype similarities between diseases. We devised a leave-one-out cross-validation strategy to evaluate the algorithms abilities to predict disease-associated genes. We demonstrate the increased performances of the multiplex-heterogeneous random walk with restart as compared to several random walks on monoplex or heterogeneous networks. Overall, our framework is able to leverage the different interaction sources to outperform current approaches. Finally, we applied the algorithm to predict genes candidate for being involved in the Wiedemann-Rautenstrauch syndrome, and to explore the network vicinity of the SHORT syndrome. The source code and the software are freely available at: https://github.com/alberto-valdeolivas/RWR-MH .
科研通智能强力驱动
Strongly Powered by AbleSci AI