计算生物学
蛋白质组
序列(生物学)
生物
计算机科学
编码(社会科学)
细胞器
生物信息学
生物系统
化学
细胞生物学
遗传学
数学
统计
作者
Shihua Li,Kai Yu,Qingfeng Zhang,Zexian Liu,Jia Liu,Huai-Qiang Ju,Zhixiang Zuo,Xiaoxing Li,Zhenlong Wang,Han Cheng
标识
DOI:10.1101/2021.01.30.428971
摘要
ABSTRACT Membrane based cells are the fundamental structure and function units of organisms, while evidences were increasing that liquid-liquid phase separation (LLPS) is associated with the formation of membraneless organelles, such as P-bodies, nucleoli and stress granules. Many studies have been undertaken to explore the functions of protein phase separation, but these studies lacked an effective tool to identify the sequence segments that critical for LLPS (SCOPEs). In this study, we presented a novel software called dSCOPE ( http://dscope.omicsbio.info ) to predict the SCOPEs. To develop the predictor, we curated experimentally identified sequence segments that can drive LLPS from published literature. Then sliding sequence window based physiological, biochemical, structural and coding features were integrated by random forest algorithm to perform prediction. Through rigorous evaluation, dSCOPE was demonstrated to achieve satisfactory performance. Furthermore, large-scale analysis of human proteome based on dSCOPE showed that the predicted SCOPEs enriched various protein post-translational modifications and cancer mutations, and the proteins which contain predicted SCOPEs enriched critical cellular signaling pathways. Taken together, dSCOPE precisely predicted the protein sequence segments critical for LLPS, with various helpful information visualized in the webserver to facilitate LLPS related research.
科研通智能强力驱动
Strongly Powered by AbleSci AI