INGOT-DR: an interpretable classifier for predicting drug resistance in M. tuberculosis

可解释性 计算机科学 机器学习 人工智能 分类器(UML) 肺结核 结核分枝杆菌 试验装置 数据挖掘 医学 病理
作者
Hooman Zabeti,Nick Dexter,Amir Safari,Nafiseh Sedaghat,Maxwell W. Libbrecht,Leonid Chindelevitch
出处
期刊:Algorithms for Molecular Biology [Springer Nature]
卷期号:16 (1) 被引量:9
标识
DOI:10.1186/s13015-021-00198-1
摘要

Prediction of drug resistance and identification of its mechanisms in bacteria such as Mycobacterium tuberculosis, the etiological agent of tuberculosis, is a challenging problem. Solving this problem requires a transparent, accurate, and flexible predictive model. The methods currently used for this purpose rarely satisfy all of these criteria. On the one hand, approaches based on testing strains against a catalogue of previously identified mutations often yield poor predictive performance; on the other hand, machine learning techniques typically have higher predictive accuracy, but often lack interpretability and may learn patterns that produce accurate predictions for the wrong reasons. Current interpretable methods may either exhibit a lower accuracy or lack the flexibility needed to generalize them to previously unseen data.In this paper we propose a novel technique, inspired by group testing and Boolean compressed sensing, which yields highly accurate predictions, interpretable results, and is flexible enough to be optimized for various evaluation metrics at the same time.We test the predictive accuracy of our approach on five first-line and seven second-line antibiotics used for treating tuberculosis. We find that it has a higher or comparable accuracy to that of commonly used machine learning models, and is able to identify variants in genes with previously reported association to drug resistance. Our method is intrinsically interpretable, and can be customized for different evaluation metrics. Our implementation is available at github.com/hoomanzabeti/INGOT_DR and can be installed via The Python Package Index (Pypi) under ingotdr. This package is also compatible with most of the tools in the Scikit-learn machine learning library.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
bazinga182完成签到,获得积分10
1秒前
3秒前
5秒前
bazinga182发布了新的文献求助10
7秒前
molingyue发布了新的文献求助10
7秒前
ljw完成签到,获得积分10
8秒前
个性的向秋完成签到 ,获得积分10
10秒前
愉快迎荷发布了新的文献求助10
11秒前
ridder完成签到,获得积分20
11秒前
青年才俊发布了新的文献求助10
12秒前
12秒前
Min完成签到,获得积分10
12秒前
13秒前
14秒前
15秒前
15秒前
Fernweh完成签到,获得积分20
16秒前
zhu完成签到,获得积分10
17秒前
nana发布了新的文献求助10
18秒前
神勇的惜文完成签到 ,获得积分10
21秒前
zhu发布了新的文献求助10
21秒前
香蕉觅云应助qqqq采纳,获得10
23秒前
药罐子本罐完成签到,获得积分10
23秒前
25秒前
彭于晏应助xecbouwbcou采纳,获得10
29秒前
38秒前
Fly发布了新的文献求助10
41秒前
41秒前
Jasper应助nana采纳,获得10
42秒前
小白发布了新的文献求助30
42秒前
卷aaaa完成签到,获得积分10
42秒前
呼呼发布了新的文献求助10
43秒前
xecbouwbcou发布了新的文献求助10
48秒前
大饼卷肉完成签到,获得积分10
48秒前
愉快迎荷完成签到,获得积分10
49秒前
53秒前
qq完成签到 ,获得积分20
59秒前
汤汤完成签到,获得积分10
59秒前
1分钟前
上瘾倪妮完成签到,获得积分10
1分钟前
高分求助中
LNG地上式貯槽指針 (JGA指 ; 108) 1000
LNG地下式貯槽指針(JGA指-107)(LNG underground storage tank guidelines) 1000
Generalized Linear Mixed Models 第二版 1000
Preparation and Characterization of Five Amino-Modified Hyper-Crosslinked Polymers and Performance Evaluation for Aged Transformer Oil Reclamation 700
Operative Techniques in Pediatric Orthopaedic Surgery 510
九经直音韵母研究 500
Full waveform acoustic data processing 500
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2927360
求助须知:如何正确求助?哪些是违规求助? 2576453
关于积分的说明 6954189
捐赠科研通 2227470
什么是DOI,文献DOI怎么找? 1183794
版权声明 589339
科研通“疑难数据库(出版商)”最低求助积分说明 579334