清脆的
计算生物学
聚类分析
计算机科学
生物
CRISPR干扰
基因组编辑
基因
遗传学
人工智能
作者
Han Altae-Tran,Soumya Kannan,Anthony J. Suberski,Kepler S. Mears,F. Esra Demircioglu,Lukas Möller,Selin Kocalar,Rachel Oshiro,Kira S. Makarova,Rhiannon K. Macrae,Eugene V. Koonin,Feng Zhang
出处
期刊:Science
[American Association for the Advancement of Science (AAAS)]
日期:2023-11-23
卷期号:382 (6673)
被引量:40
标识
DOI:10.1126/science.adi1910
摘要
Microbial systems underpin many biotechnologies, including CRISPR, but the exponential growth of sequence databases makes it difficult to find previously unidentified systems. In this work, we develop the fast locality-sensitive hashing–based clustering (FLSHclust) algorithm, which performs deep clustering on massive datasets in linearithmic time. We incorporated FLSHclust into a CRISPR discovery pipeline and identified 188 previously unreported CRISPR-linked gene modules, revealing many additional biochemical functions coupled to adaptive immunity. We experimentally characterized three HNH nuclease–containing CRISPR systems, including the first type IV system with a specified interference mechanism, and engineered them for genome editing. We also identified and characterized a candidate type VII system, which we show acts on RNA. This work opens new avenues for harnessing CRISPR and for the broader exploration of the vast functional diversity of microbial proteins.
科研通智能强力驱动
Strongly Powered by AbleSci AI