平动调节
生物
核糖体分析
计算生物学
生物信息学
翻译(生物学)
五素未翻译区
翻译效率
遗传学
编码区
基因
真核翻译
信使核糖核酸
作者
Jialin He,Lei Xiong,Shaohui Shi,Chengyu Li,Kexuan Chen,Qianchen Fang,Jiuhong Nan,Ke Ding,Jingyun Li,Yuanhui Mao,Carles A. Boix,Xinyang Hu,Manolis Kellis,Xushen Xiong
标识
DOI:10.1101/2024.02.26.582217
摘要
Gene expression involves transcription and translation. Despite large datasets and increasingly powerful methods devoted to calculating genetic variants' effects on transcription, discrepancy between mRNA and protein levels hinders the systematic interpretation of the regulatory effects of disease-associated variants. Accurate models of the sequence determinants of translation are needed to close this gap and to interpret disease-associated variants that act on translation. Here, we present Translatomer, a multimodal transformer framework that predicts cell-type-specific translation from mRNA expression and gene sequence. We train Translatomer on 33 tissues and cell lines, and show that the inclusion of sequence substantially improves the prediction of ribosome profiling signal, indicating that Translatomer captures sequence-dependent translational regulatory information. Translatomer achieves accuracies of 0.72 to 0.80 for de novo prediction of cell-type-specific ribosome profiling. We develop an in silico mutagenesis tool to estimate mutational effects on translation and demonstrate that variants associated with translation regulation are evolutionarily constrained, both within the human population and across species. Notably, we identify cell-type-specific translational regulatory mechanisms independent of eQTLs for 3,041 non-coding and synonymous variants associated with complex diseases, including Alzheimer's disease, schizophrenia, and congenital heart disease. Translatomer accurately models the genetic underpinnings of translation, bridging the gap between mRNA and protein levels, and providing valuable mechanistic insights toward mapping "missing regulation" in disease genetics.
科研通智能强力驱动
Strongly Powered by AbleSci AI