TGC-ARG: Predicting Antibiotic Resistance through Transformer-based Modeling and Contrastive Learning

计算机科学 变压器 抗生素耐药性 抗生素 微生物学 工程类 生物 电压 电气工程
作者
Yihan Dong,Xiaowen Hu,Zhijian Huang,Lei Deng
标识
DOI:10.1109/bibm58861.2023.10385506
摘要

The escalating severity of antibiotic resistance poses substantial challenges across diverse sectors, encompassing everyday life, agriculture, and clinical medical interventions. Conventional methods for investigating antibiotic resistance genes (ARGs), such as culture-based techniques and whole-genome sequencing, often suffer from demands of time, labor, and limited accuracy. Moreover, the fragmented nature of existing datasets hampers a comprehensive analysis of antibiotic resistance gene sequences. In this study, we introduce an innovative computational framework known as TGC-ARG, designed to predict potential ARGs. TGC-ARG harnesses protein sequences as input, retrieves protein structures through SCRATCH-1D, and employs a feature extraction module to deduce feature representations for both protein sequences and structures. Subsequently, we integrate a siamese network to establish a contrastive learning paradigm, thus augmenting the model's representational capabilities. The resultant sequence embeddings and structure embeddings are merged and directed into a Multilayer Perceptron (MLP) for predicting ARG presence. To assess the performance, we curate a pioneering publicly available dataset named ARSS (Antibiotic Resistance Sequence Statistics). Our extensive comparative experimental outcomes underscore the superiority of our approach over the current state-of-the-art (SOTA) methodology. Furthermore, through comprehensive case analyses, we demonstrate the efficacy of our approach in predicting potential ARGs. The dataset and source code are accessible at https://github.com/angel1gel/TGC-ARG.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
希望完成签到 ,获得积分10
2秒前
guosien发布了新的文献求助10
2秒前
www完成签到 ,获得积分10
3秒前
WJZ完成签到 ,获得积分10
3秒前
3秒前
饱满的煎饼完成签到 ,获得积分10
3秒前
我是老大应助paz采纳,获得10
4秒前
4秒前
5秒前
zmftl发布了新的文献求助20
6秒前
Atom完成签到,获得积分10
7秒前
王女士发布了新的文献求助10
7秒前
tiptip应助alixy采纳,获得10
7秒前
8秒前
8秒前
Akim应助菲菲采纳,获得10
9秒前
10秒前
无极微光应助无私的梦柏采纳,获得20
11秒前
12秒前
斯文忘幽发布了新的文献求助30
13秒前
Akim应助chenmeimei2012采纳,获得10
14秒前
mxy完成签到,获得积分10
15秒前
16秒前
张逸凡发布了新的文献求助10
17秒前
17秒前
自觉的念云完成签到,获得积分10
19秒前
foct1发布了新的文献求助10
19秒前
sleep君发布了新的文献求助100
21秒前
科目三应助张逸凡采纳,获得10
23秒前
23秒前
24秒前
忐忑的蘑菇完成签到 ,获得积分10
24秒前
25秒前
不个完成签到 ,获得积分10
25秒前
26秒前
11235应助科研通管家采纳,获得10
26秒前
香蕉觅云应助科研通管家采纳,获得10
26秒前
我是老大应助科研通管家采纳,获得10
26秒前
CodeCraft应助科研通管家采纳,获得10
27秒前
李爱国应助科研通管家采纳,获得10
27秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Picture this! Including first nations fiction picture books in school library collections 1500
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
CLSI M100 Performance Standards for Antimicrobial Susceptibility Testing 36th edition 400
Cancer Targets: Novel Therapies and Emerging Research Directions (Part 1) 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6361068
求助须知:如何正确求助?哪些是违规求助? 8174995
关于积分的说明 17220415
捐赠科研通 5416017
什么是DOI,文献DOI怎么找? 2866116
邀请新用户注册赠送积分活动 1843370
关于科研通互助平台的介绍 1691365