疾病
药品
计算生物学
管道(软件)
药物靶点
集合(抽象数据类型)
药物开发
药物发现
药物反应
生物
生物信息学
计算机科学
医学
癌症研究
药理学
病理
程序设计语言
作者
Zhihui Luo,Li Zhu,Yamin Wang,Sheng Hu Qian,Menglu Li,Wen Zhang,Zhen‐Xia Chen
摘要
Abstract Disease pathogenesis is always a major topic in biomedical research. With the exponential growth of biomedical information, drug effect analysis for specific phenotypes has shown great promise in uncovering disease-associated pathways. However, this method has only been applied to a limited number of drugs. Here, we extracted the data of 4634 diseases, 3671 drugs, 112 809 disease–drug associations and 81 527 drug–gene associations by text mining of 29 168 919 publications. On this basis, we proposed a ‘Drug Set Enrichment Analysis by Text Mining (DSEATM)’ pipeline and applied it to 3250 diseases, which outperformed the state-of-the-art method. Furthermore, diseases pathways enriched by DSEATM were similar to those obtained using the TCGA cancer RNA-seq differentially expressed genes. In addition, the drug number, which showed a remarkable positive correlation of 0.73 with the AUC, plays a determining role in the performance of DSEATM. Taken together, DSEATM is an auspicious and accurate disease research tool that offers fresh insights.
科研通智能强力驱动
Strongly Powered by AbleSci AI