人病毒体
病毒分类
生物
计算生物学
分类学(生物学)
病毒
基因组
计算机科学
遗传学
基因
植物
作者
Jing-Zhe Jiang,Wenguang Yuan,Jiayu Shang,Yinghui Shi,Liling Yang,Min Liu,Peng Zhu,Tao Jin,Yanni Sun,Lihong Yuan
出处
期刊:Research Square - Research Square
日期:2022-05-20
标识
DOI:10.21203/rs.3.rs-1658089/v1
摘要
Abstract Background: Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Results: Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at family level and supports the visualization of the associations of all families. We evaluate the performance of PhaGCN2 and compare it with the state-of-the-art virus classification tools, such as vConTACT2, CAT, and VPF-Class, using the widely accepted metrics. The results show that PhaGCN2 largely improves the precision and recall of virus classification, increases the number of classifiable virus sequences in the Global Ocean Virome dataset (v2.0) by 4 times, and classifies more than 90% of the Gut Phage Database. Conclusions: Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at family level and supports the conduction of high-throughput and automatic expansion of the database of the International Committee on Taxonomy of Viruses.
科研通智能强力驱动
Strongly Powered by AbleSci AI