生物信息学
突变
荧光蛋白
机器学习
蛋白质工程
集成学习
突变
计算机科学
人工智能
计算生物学
生物
生物化学
遗传学
绿色荧光蛋白
基因
酶
作者
Sarah J. Wait,Marc Expòsit,Sophia Lin,Michael Rappleye,Justin Daho Lee,Samuel A. Colby,Lily Torp,Anthony Asencio,Annette C. Smith,Michael Regnier,Farid Moussavi‐Harami,David Baker,Christina K. Kim,A. Berndt
标识
DOI:10.1038/s43588-024-00611-w
摘要
Here we used machine learning to engineer genetically encoded fluorescent indicators, protein-based sensors critical for real-time monitoring of biological activity. We used machine learning to predict the outcomes of sensor mutagenesis by analyzing established libraries that link sensor sequences to functions. Using the GCaMP calcium indicator as a scaffold, we developed an ensemble of three regression models trained on experimentally derived GCaMP mutation libraries. The trained ensemble performed an in silico functional screen on 1,423 novel, uncharacterized GCaMP variants. As a result, we identified the ensemble-derived GCaMP (eGCaMP) variants, eGCaMP and eGCaMP+, which achieve both faster kinetics and larger ∆F/F0 responses upon stimulation than previously published fast variants. Furthermore, we identified a combinatorial mutation with extraordinary dynamic range, eGCaMP2+, which outperforms the tested sixth-, seventh- and eighth-generation GCaMPs. These findings demonstrate the value of machine learning as a tool to facilitate the efficient engineering of proteins for desired biophysical characteristics. Andre Berndt and colleagues introduce a machine learning approach to enhance the biophysical characteristics of genetically encoded fluorescent indicators, deriving and testing in vitro new GCaMP mutations that surpass the performance of existing fast GCaMP indicators.
科研通智能强力驱动
Strongly Powered by AbleSci AI