IntroUNET: identifying introgressed alleles via semantic segmentation

分割 等位基因 人工智能 自然语言处理 计算机科学 生物 遗传学 进化生物学 基因
作者
Dylan D. Ray,Lex E. Flagel,Daniel R. Schrider
标识
DOI:10.1101/2023.02.07.527435
摘要

A growing body of evidence suggests that gene flow between closely related species is a widespread phenomenon. Alleles that introgress from one species into a close relative are typically neutral or deleterious, but sometimes confer a significant fitness advantage. Given the potential relevance to speciation and adaptation, numerous methods have therefore been devised to identify regions of the genome that have experienced introgression. Recently, supervised machine learning approaches have been shown to be highly effective for detecting introgression. One especially promising approach is to treat population genetic inference as an image classification problem, and feed an image representation of a population genetic alignment as input to a deep neural network that distinguishes among evolutionary models (i.e. introgression or no introgression). However, if we wish to investigate the full extent and fitness effects of introgression, merely identifying genomic regions in a population genetic alignment that harbor introgressed loci is insufficient---ideally we would be able to infer precisely which individuals have introgressed material and at which positions in the genome. Here we adapt a deep learning algorithm for semantic segmentation, the task of correctly identifying the type of object to which each individual pixel in an image belongs, to the task of identifying introgressed alleles. Our trained neural network is thus able to infer, for each individual in a two-population alignment, which of those individual's alleles were introgressed from the other population. We use simulated data to show that this approach is highly accurate, and that it can be readily extended to identify alleles that are introgressed from an unsampled "ghost" population, performing comparably to a supervised learning method tailored specifically to that task. Finally, we apply this method to data from Drosophila , showing that it is able to accurately recover introgressed haplotypes from real data. This analysis reveals that introgressed alleles are typically confined to lower frequencies within genic regions, suggestive of purifying selection, but are found at much higher frequencies in a region previously shown to be affected by adaptive introgression. Our method's success in recovering introgressed haplotypes in challenging real-world scenarios underscores the utility of deep learning approaches for making richer evolutionary inferences from genomic data.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
fosca完成签到,获得积分10
1秒前
4秒前
llc完成签到 ,获得积分10
4秒前
赘婿应助生动半梅采纳,获得10
5秒前
lianhua发布了新的文献求助10
5秒前
5秒前
5秒前
djbj2022发布了新的文献求助10
6秒前
7秒前
钟煜钟煜完成签到,获得积分10
8秒前
qingsuifengqu完成签到,获得积分10
10秒前
雪满头发布了新的文献求助10
11秒前
fsxadada123完成签到,获得积分10
12秒前
周小北完成签到 ,获得积分10
12秒前
妙妙发布了新的文献求助10
12秒前
13秒前
赘婿应助从容山槐采纳,获得30
13秒前
HFH应助qingsuifengqu采纳,获得50
14秒前
隔壁小王完成签到,获得积分10
15秒前
花椒发布了新的文献求助10
15秒前
15秒前
Hongyt发布了新的文献求助60
15秒前
adinike完成签到,获得积分10
17秒前
19秒前
Lucas应助加壹采纳,获得10
19秒前
20秒前
yshj完成签到,获得积分10
23秒前
Hello应助szl采纳,获得10
24秒前
24秒前
水果咔咔咔完成签到,获得积分10
27秒前
27秒前
Jasper应助哈哈哈采纳,获得30
28秒前
严十三发布了新的文献求助10
29秒前
demonapple12完成签到,获得积分10
29秒前
29秒前
科研通AI2S应助橙汁采纳,获得10
30秒前
30秒前
柴丽完成签到,获得积分10
31秒前
szl完成签到,获得积分20
32秒前
千倾发布了新的文献求助10
32秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Developing Genetic Editing Tools for Lysobacter 2000
卤化钙钛矿人工突触的研究 2000
Моделирование процессов самоорганизации в кристаллообразующих системах 1000
History of U.S. Space Surveillance and Satellite Cataloging 1000
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6517275
求助须知:如何正确求助?哪些是违规求助? 8310336
关于积分的说明 17764916
捐赠科研通 5619595
什么是DOI,文献DOI怎么找? 2925917
邀请新用户注册赠送积分活动 1902738
关于科研通互助平台的介绍 1763767