背景(考古学)
度量(数据仓库)
公制(单位)
排名(信息检索)
聚类分析
多蛋白复合物
蛋白质折叠
功能(生物学)
理论(学习稳定性)
计算机科学
化学
数据挖掘
人工智能
机器学习
生物
遗传学
生物化学
古生物学
运营管理
基因
经济
作者
Joshua Teitz,Jöerg Sander,Hassan Sarker,Carlos Fernández-Patrón
摘要
Abstract Determining the interacting proteins in multiprotein complexes can be technically challenging. An emerging biochemical approach to this end is based on the ‘thermal proximity co-aggregation’ (TPCA) phenomenon. Accordingly, when two or more proteins interact to form a complex, they tend to co-aggregate when subjected to heat-induced denaturation and thus exhibit similar melting curves. Here, we explore the potential of leveraging TPCA for determining protein interactions. We demonstrate that dissimilarity measure-based information retrieval applied to melting curves tends to rank a protein-of-interest’s interactors higher than its non-interactors, as shown in the context of pull-down assay results. Consequently, such rankings can reduce the number of confirmatory biochemical experiments needed to find bona fide protein–protein interactions. In general, rankings based on dissimilarity measures generated through metric learning further reduce the required number of experiments compared to those based on standard dissimilarity measures such as Euclidean distance. When a protein mixture’s melting curves are obtained in two conditions, we propose a scoring function that uses melting curve data to inform how likely a protein pair is to interact in one condition but not another. We show that ranking protein pairs by their scores is an effective approach for determining condition-specific protein–protein interactions. By contrast, clustering melting curve data generally does not inform about the interacting proteins in multiprotein complexes. In conclusion, we report improved methods for dissimilarity measure-based computation of melting curves data that can greatly enhance the determination of interacting proteins in multiprotein complexes.
科研通智能强力驱动
Strongly Powered by AbleSci AI