计算生物学
亚细胞定位
钥匙(锁)
基因本体论
水准点(测量)
计算机科学
伪氨基酸组成
生物
基因
人工智能
生物化学
基因表达
地理
计算机安全
大地测量学
作者
Xuan Xiao,Xiang Cheng,Shengchao Su,毛琦 Mao Qi,Kuo‐Chen Chou
出处
期刊:Natural Science
[Scientific Research Publishing, Inc.]
日期:2017-01-01
卷期号:09 (09): 330-349
被引量:47
标识
DOI:10.4236/ns.2017.99032
摘要
The basic unit in life is cell.It contains many protein molecules located at its different organelles.The growth and reproduction of a cell as well as most of its other biological functions are performed via these proteins.But proteins in different organelles or subcellular locations have different functions.Facing the avalanche of protein sequences generated in the postgenomic age, we are challenged to develop high throughput tools for identifying the subcellular localization of proteins based on their sequence information alone.Although considerable efforts have been made in this regard, the problem is far apart from being solved yet.Most existing methods can be used to deal with single-location proteins only.Actually, proteins with multi-locations may have some special biological functions that are particularly important for drug targets.Using the ML-GKR (Multi-Label Gaussian Kernel Regression) method, we developed a new predictor called "pLoc-mGpos" by in-depth extracting the key information from GO (Gene Ontology) into the Chou's general PseAAC (Pseudo Amino Acid Composition) for predicting the subcellular localization of Gram-positive bacterial proteins with both single and multiple location sites.Rigorous cross-validation on a same stringent benchmark dataset indicated that the proposed pLoc-mGpos predictor is remarkably superior to "iLoc-Gpos", the state-of-the-art predictor for the same purpose.To maximize the convenience of most experimental scientists, a user-friendly web-server for the new powerful predictor has been established at
科研通智能强力驱动
Strongly Powered by AbleSci AI