An atlas of protein homo-oligomerization across domains of life
生物
地图集(解剖学)
进化生物学
计算生物学
遗传学
解剖
作者
Hugo Schweke,Martin Pačesa,Tal Levin,Casper A. Goverde,Prasun Kumar,Yoan Duhoo,L Dornfeld,Benjamin Dubreuil,Sandrine Georgeon,Sergey Ovchinnikov,Derek N. Woolfson,Bruno E. Correia,Sucharita Dey,Emmanuel D. Levy
Protein structures are essential to understanding cellular processes in molecular detail. While advances in artificial intelligence revealed the tertiary structure of proteins at scale, their quaternary structure remains mostly unknown. We devise a scalable strategy based on AlphaFold2 to predict homo-oligomeric assemblies across four proteomes spanning the tree of life. Our results suggest that approximately 45% of an archaeal proteome and a bacterial proteome and 20% of two eukaryotic proteomes form homomers. Our predictions accurately capture protein homo-oligomerization, recapitulate megadalton complexes, and unveil hundreds of homo-oligomer types, including three confirmed experimentally by structure determination. Integrating these datasets with omics information suggests that a majority of known protein complexes are symmetric. Finally, these datasets provide a structural context for interpreting disease mutations and reveal coiled-coil regions as major enablers of quaternary structure evolution in human. Our strategy is applicable to any organism and provides a comprehensive view of homo-oligomerization in proteomes.