计算机科学
稳健性(进化)
人工智能
试验装置
有效载荷(计算)
机器学习
源代码
集合(抽象数据类型)
代表(政治)
接收机工作特性
实现(概率)
程序设计语言
计算机网络
生物化学
化学
网络数据包
政治
政治学
法学
基因
统计
数学
作者
Liye Chen,Biaoshun Li,Yihao Chen,Mujie Lin,Shipeng Zhang,Chenxin Li,Yu Pang,Ling Wang
出处
期刊:Cornell University - arXiv
日期:2024-01-01
被引量:1
标识
DOI:10.48550/arxiv.2401.09176
摘要
Antibody-drug conjugate (ADC) has revolutionized the field of cancer treatment in the era of precision medicine due to their ability to precisely target cancer cells and release highly effective drug. Nevertheless, the realization of rational design of ADC is very difficult because the relationship between their structures and activities is difficult to understand. In the present study, we introduce a unified deep learning framework called ADCNet to help design potential ADCs. The ADCNet highly integrates the protein representation learning language model ESM-2 and small-molecule representation learning language model FG-BERT models to achieve activity prediction through learning meaningful features from antigen and antibody protein sequences of ADC, SMILES strings of linker and payload, and drug-antibody ratio (DAR) value. Based on a carefully designed and manually tailored ADC data set, extensive evaluation results reveal that ADCNet performs best on the test set compared to baseline machine learning models across all evaluation metrics. For example, it achieves an average prediction accuracy of 87.12%, a balanced accuracy of 0.8689, and an area under receiver operating characteristic curve of 0.9293 on the test set. In addition, cross-validation, ablation experiments, and external independent testing results further prove the stability, advancement, and robustness of the ADCNet architecture. For the convenience of the community, we develop the first online platform (https://ADCNet.idruglab.cn) for the prediction of ADCs activity based on the optimal ADCNet model, and the source code is publicly available at https://github.com/idrugLab/ADCNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI