Sequential Optimal Experimental Design of Perturbation Screens Guided by Multi-modal Priors

计算机科学 先验概率 忠诚 摄动(天文学) 机器学习 人工智能 贝叶斯概率 电信 物理 量子力学
作者
Kexin Huang,Romain Lopez,Jan-Christian Hütter,Takamasa Kudo,Antonio Ríos,Aviv Regev
标识
DOI:10.1101/2023.12.12.571389
摘要

Abstract Understanding a cell’s expression response to genetic perturbations helps to address important challenges in biology and medicine, including the function of gene circuits, discovery of therapeutic targets and cell reprogramming and engineering. In recent years, Perturb-seq, pooled genetic screens with single cell RNA-seq (scRNA-seq) readouts, has emerged as a common method to collect such data. However, irrespective of technological advances, because combinations of gene perturbations can have unpredictable, non-additive effects, the number of experimental configurations far exceeds experimental capacity, and for certain cases, the number of available cells. While recent machine learning models, trained on existing Perturb-seq data sets, can predict perturbation outcomes with some degree of accuracy, they are currently limited by sub-optimal training set selection and the small number of cell contexts of training data, leading to poor predictions for unexplored parts of perturbation space. As biologists deploy Perturb-seq across diverse biological systems, there is an enormous need for algorithms to guide iterative experiments while exploring the large space of possible perturbations and their combinations. Here, we propose a sequential approach for designing Perturb-seq experiments that uses the model to strategically select the most informative perturbations at each step for subsequent experiments. This enables a significantly more efficient exploration of the perturbation space, while predicting the effect of the rest of the unseen perturbations with high-fidelity. Analysis of a previous large-scale Perturb-seq experiment reveals that our setting is severely restricted by the number of examples and rounds, falling into a non-conventional active learning regime called “active learning on a budget”. Motivated by this insight, we develop I ter P ert , a novel active learning method that exploits rich and multi-modal prior knowledge in order to efficiently guide the selection of subsequent perturbations. Using prior knowledge for this task is novel, and crucial for successful active learning on a budget. We validate I ter P ert using insilico benchmarking of active learning, constructed from a large-scale CRISPRi Perturb-seq data set. We find that I ter P ert outperforms other active learning strategies by reaching comparable accuracy at only a third of the number of perturbations profiled as the next best method. Overall, our results demonstrate the potential of sequentially designing perturbation screens through I ter P ert .

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
JJ完成签到,获得积分10
刚刚
kimiwanano完成签到,获得积分10
1秒前
阿莳完成签到 ,获得积分10
2秒前
流星雨完成签到 ,获得积分10
2秒前
4秒前
为霜完成签到 ,获得积分10
5秒前
SDLC完成签到,获得积分10
5秒前
早睡早起完成签到,获得积分10
5秒前
乔巴完成签到,获得积分10
6秒前
元舒甜完成签到,获得积分10
7秒前
甜甜蜜蜜小白周完成签到 ,获得积分10
8秒前
射天狼完成签到,获得积分10
9秒前
bkagyin应助whaoe采纳,获得10
9秒前
其实是北北吖完成签到,获得积分10
10秒前
10秒前
儒雅水池完成签到 ,获得积分10
11秒前
MMTI完成签到,获得积分10
11秒前
J18完成签到,获得积分10
12秒前
七龙珠完成签到,获得积分10
14秒前
无辜听兰应助跳跃的鱼采纳,获得10
14秒前
尘曦完成签到,获得积分10
14秒前
user20011125完成签到 ,获得积分10
15秒前
Enquinn完成签到,获得积分10
15秒前
lizishu给高立蕊的求助进行了留言
17秒前
石林完成签到,获得积分10
18秒前
青青完成签到,获得积分10
20秒前
林千万完成签到,获得积分10
21秒前
21秒前
前程似锦完成签到 ,获得积分10
21秒前
22秒前
revew666完成签到,获得积分10
23秒前
大方的慕青完成签到,获得积分10
23秒前
将爱却晚秋完成签到,获得积分10
24秒前
24秒前
梁平完成签到 ,获得积分10
25秒前
苗儿发布了新的文献求助30
26秒前
nexus应助科研通管家采纳,获得10
26秒前
慕青应助科研通管家采纳,获得10
26秒前
华仔应助科研通管家采纳,获得10
26秒前
跳跃的鱼完成签到,获得积分10
26秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
AnnualResearch andConsultation Report of Panorama survey and Investment strategy onChinaIndustry 1000
卤化钙钛矿人工突触的研究 1000
Engineering for calcareous sediments : proceedings of the International Conference on Calcareous Sediments, Perth 15-18 March 1988 / edited by R.J. Jewell, D.C. Andrews 1000
Continuing Syntax 1000
Signals, Systems, and Signal Processing 610
2026 Hospital Accreditation Standards 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6262843
求助须知:如何正确求助?哪些是违规求助? 8084887
关于积分的说明 16891997
捐赠科研通 5333349
什么是DOI,文献DOI怎么找? 2839003
邀请新用户注册赠送积分活动 1816435
关于科研通互助平台的介绍 1670192