Semisupervised Learning to Boost hERG, Nav1.5, and Cav1.2 Cardiac Ion Channel Toxicity Prediction by Mining a Large Unlabeled Small Molecule Data Set

赫尔格 化学 心脏毒性 集合(抽象数据类型) 频道(广播) 可用性 计算机科学 小分子 毒性 药物发现 机器学习 虚拟筛选 化学 数据挖掘 人工智能 生物信息学 人机交互 生物 钾通道 医学 生物化学 内分泌学 有机化学 程序设计语言 计算机网络
作者
Issar Arab,Kris Laukens,Wout Bittremieux
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
卷期号:64 (16): 6410-6420 被引量:3
标识
DOI:10.1021/acs.jcim.4c01102
摘要

Predicting drug toxicity is a critical aspect of ensuring patient safety during the drug design process. Although conventional machine learning techniques have shown some success in this field, the scarcity of annotated toxicity data poses a significant challenge in enhancing models' performance. In this study, we explore the potential of leveraging large unlabeled small molecule data sets using semisupervised learning to improve drug cardiotoxicity predictive performance across three cardiac ion channel targets: the voltage-gated potassium channel (hERG), the voltage-gated sodium channel (Nav1.5), and the voltage-gated calcium channel (Cav1.2). We extensively mined the ChEMBL database, comprising approximately 2 million small molecules, and then employed semisupervised learning to construct robust classification models for this purpose. We achieved a performance boost on highly diverse (i.e., structurally dissimilar) test data sets across all three targets. Using our built models, we screened the whole ChEMBL database and a large set of FDA-approved drugs, identifying several compounds with potential cardiac ion channel activity. To ensure broad accessibility and usability for both technical and nontechnical users, we developed a cross-platform graphical user interface that allows users to make predictions and gain insights into the cardiotoxicity of drugs and other small molecules. The software is made available as open source under the permissive MIT license at https://github.com/issararab/CToxPred2.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
Yumii完成签到,获得积分10
刚刚
爆米花应助lkkk采纳,获得10
刚刚
Hysen_L发布了新的文献求助10
1秒前
1秒前
1秒前
2秒前
2秒前
momo发布了新的文献求助10
2秒前
小yy发布了新的文献求助10
3秒前
lseonf发布了新的文献求助10
3秒前
NexusExplorer应助YXL采纳,获得10
5秒前
5秒前
5秒前
chc关注了科研通微信公众号
6秒前
想睡觉发布了新的文献求助10
6秒前
英俊的铭应助DZW采纳,获得10
6秒前
7秒前
7秒前
weikq2001发布了新的文献求助10
7秒前
张鸿杰完成签到,获得积分10
7秒前
7秒前
7秒前
8秒前
8秒前
8秒前
8秒前
dolores发布了新的文献求助10
9秒前
Ava应助Pw采纳,获得10
9秒前
orixero应助天真大神采纳,获得10
9秒前
jzfbx发布了新的文献求助10
9秒前
10秒前
了了完成签到,获得积分10
11秒前
cocomango完成签到,获得积分10
11秒前
11秒前
11秒前
11秒前
12秒前
NexusExplorer应助bb采纳,获得10
12秒前
12秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Introduction to Cosmetic Formulation and Technology, 2nd Edition 400
Petrology and Plate Tectonics,2025 400
Burger's Medicinal Chemistry and Drug Discovery 400
A Step-by-Step Guide to Qualitative Data Coding 2nd Edition 400
Programming for Chemical Engineers Using C, C++, and MATLAB 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6702289
求助须知:如何正确求助?哪些是违规求助? 8443831
关于积分的说明 18037074
捐赠科研通 5938884
什么是DOI,文献DOI怎么找? 2989422
邀请新用户注册赠送积分活动 1965385
关于科研通互助平台的介绍 1909406