Python(编程语言)
计算机科学
UniProt公司
Web服务器
服务器
蛋白质测序
基因本体论
蛋白质功能预测
数据挖掘
序列数据库
Web服务
相似性(几何)
生物信息学
作者
Maarten J.M.F. Reijnders
出处
期刊:PeerJ
[PeerJ]
日期:2022-02-15
卷期号:10: e12931-e12931
摘要
Background Protein function prediction is an important part of bioinformatics and genomics studies. There are many different predictors available, however most of these are in the form of web-servers instead of open-source locally installable versions. Such local versions are necessary to perform large scale genomics studies due to the presence of limitations imposed by web servers such as queues, prediction speed, and updatability of databases. Methods This paper describes Wei2GO: a weighted sequence similarity and python-based open-source protein function prediction software. It uses DIAMOND and HMMScan sequence alignment searches against the UniProtKB and Pfam databases respectively, transfers Gene Ontology terms from the reference protein to the query protein, and uses a weighing algorithm to calculate a score for the Gene Ontology annotations. Results Wei2GO is compared against the Argot2 and Argot2.5 web servers, which use a similar concept, and DeepGOPlus which acts as a reference. Wei2GO shows an increase in performance according to precision and recall curves, F max scores, and S min scores for biological process and molecular function ontologies. Computational time compared to Argot2 and Argot2.5 is decreased from several hours to several minutes. Availability Wei2GO is written in Python 3, and can be found at https://gitlab.com/mreijnders/Wei2GO .
科研通智能强力驱动
Strongly Powered by AbleSci AI