语义相似性
相似性(几何)
相关性
基因注释
计算机科学
注释
皮尔逊积矩相关系数
聚类分析
表达式(计算机科学)
相似性度量
本体论
基因本体论
相似
人工智能
数据挖掘
基因
数学
基因表达
统计
生物
遗传学
图像(数学)
基因组
哲学
几何学
认识论
程序设计语言
作者
Joseph Sevilla,Víctor Segura,Adam Podhorski,Elizabeth Guruceaga,José M. Mato,Luis Alfonso Martínez‐Cruz,Fernando J. Corrales,Ángel Rubio
摘要
This research analyzes some aspects of the relationship between gene expression, gene function, and gene annotation. Many recent studies are implicitly based on the assumption that gene products that are biologically and functionally related would maintain this similarity both in their expression profiles as well as in their Gene Ontology (GO) annotation. We analyze how accurate this assumption proves to be using real publicly available data. We also aim to validate a measure of semantic similarity for GO annotation. We use the Pearson correlation coefficient and its absolute value as a measure of similarity between expression profiles of gene products. We explore a number of semantic similarity measures (Resnik, Jiang, and Lin) and compute the similarity between gene products annotated using the GO. Finally, we compute correlation coefficients to compare gene expression similarity against GO semantic similarity. Our results suggest that the Resnik similarity measure outperforms the others and seems better suited for use in Gene Ontology. We also deduce that there seems to be correlation between semantic similarity in the GO annotation and gene expression for the three GO ontologies. We show that this correlation is negligible up to a certain semantic similarity value; then, for higher similarity values, the relationship trend becomes almost linear. These results can be used to augment the knowledge provided by clustering algorithms and in the development of bioinformatic tools for finding and characterizing gene products.
科研通智能强力驱动
Strongly Powered by AbleSci AI