Machine learning to detect the SINEs of cancer

癌症 计算生物学 分析物 非整倍体 肿瘤科 DNA测序 生物 内科学 医学 计算机科学 生物信息学 DNA 遗传学 染色体 基因 化学 色谱法
作者
Christopher Douville,Kamel Lahouel,Albert Kuo,Haley Grant,Bracha Erlanger Avigdor,Samuel D. Curtis,Mahmoud Summers,Joshua D. Cohen,Yuxuan Wang,Austin K. Mattox,Jonathan C. Dudley,Lisa Dobbyn,Maria Popoli,Janine Ptak,Nadine T. Nehme,Natalie Silliman,Cheríe Blair,Katharine Romans,Christopher J. Thoburn,Jennifer Gizzi
出处
期刊:Science Translational Medicine [American Association for the Advancement of Science]
卷期号:16 (731) 被引量:14
标识
DOI:10.1126/scitranslmed.adi3883
摘要

We previously described an approach called RealSeqS to evaluate aneuploidy in plasma cell-free DNA through the amplification of ~350,000 repeated elements with a single primer. We hypothesized that an unbiased evaluation of the large amount of sequencing data obtained with RealSeqS might reveal other differences between plasma samples from patients with and without cancer. This hypothesis was tested through the development of a machine learning approach called Alu Profile Learning Using Sequencing (A-PLUS) and its application to 7615 samples from 5178 individuals, 2073 with solid cancer and the remainder without cancer. Samples from patients with cancer and controls were prespecified into four cohorts used for model training, analyte integration, and threshold determination, validation, and reproducibility. A-PLUS alone provided a sensitivity of 40.5% across 11 different cancer types in the validation cohort, at a specificity of 98.5%. Combining A-PLUS with aneuploidy and eight common protein biomarkers detected 51% of the cancers at 98.9% specificity. We found that part of the power of A-PLUS could be ascribed to a single feature—the global reduction of AluS subfamily elements in the circulating DNA of patients with solid cancer. We confirmed this reduction through the analysis of another independent dataset obtained with a different approach (whole-genome sequencing). The evaluation of Alu elements may therefore have the potential to enhance the performance of several methods designed for the earlier detection of cancer.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Adhklu完成签到 ,获得积分10
刚刚
Yan发布了新的文献求助10
刚刚
1秒前
blue完成签到,获得积分10
2秒前
小岚花完成签到 ,获得积分10
2秒前
3秒前
小资发布了新的文献求助10
4秒前
4秒前
000完成签到 ,获得积分10
5秒前
7777完成签到,获得积分10
6秒前
7秒前
英勇念文完成签到,获得积分20
7秒前
大气代珊发布了新的文献求助10
8秒前
8秒前
小宇完成签到 ,获得积分10
9秒前
RU完成签到,获得积分10
9秒前
11秒前
RU发布了新的文献求助10
12秒前
冷静尔云发布了新的文献求助10
13秒前
风中垣完成签到,获得积分10
14秒前
赵佳露完成签到,获得积分10
14秒前
14秒前
15秒前
我是老大应助阿方采纳,获得10
15秒前
16秒前
壮观飞鸟发布了新的文献求助10
17秒前
某人完成签到 ,获得积分10
17秒前
emmm发布了新的文献求助10
19秒前
英姑应助小资采纳,获得10
20秒前
研友_VZG7GZ应助挣钱养刺猬采纳,获得10
21秒前
22秒前
24秒前
24秒前
无限尔蓝发布了新的文献求助10
26秒前
风月行发布了新的文献求助10
29秒前
30秒前
挣钱养刺猬完成签到,获得积分10
30秒前
云海发布了新的文献求助10
33秒前
long发布了新的文献求助10
33秒前
cy完成签到,获得积分20
34秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Picture this! Including first nations fiction picture books in school library collections 1000
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
Photodetectors: From Ultraviolet to Infrared 500
信任代码:AI 时代的传播重构 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6357722
求助须知:如何正确求助?哪些是违规求助? 8172278
关于积分的说明 17207451
捐赠科研通 5413235
什么是DOI,文献DOI怎么找? 2864968
邀请新用户注册赠送积分活动 1842489
关于科研通互助平台的介绍 1690595