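De Bruijn sequence
Contig
De Bruijn graph
Sequence assembly
Immunoglobulin light chain
Computer science
Sequence (biology)
Computational biology
Multiple sequence alignment
Sequence analysis
Graph
Alignment-free sequence analysis
Sequence alignment
Biology
Antibody
Theoretical computer science
Peptide sequence
Gene
Genetics
Mathematics
Combinatorics
Genome
Gene expression
Transcriptome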
Authors
Yi Lu,Guangcun Cheng,Biao Cai,Qing Xu,Ren Kong,Shan Chang
Source
Journal: Mathematical Biosciences and Engineering
[American Institute of Mathematical Sciences]
Date: 2023-01-01
Volume (issue): 20 (4): 6174-6190
Abstract
With the development of next-generation protein sequencing technologies, sequence assembly algorithms have become a key technology in the de novo sequencing process. Existing methods can assemble an unknown single protein chain. However, for monoclonal antibodies, which comprise light and heavy chains, assembly remains an unsolved problem. To address it, we propose a new assembly method, DBAS, which integrates the quality scores and sequence alignment scores of de novo sequenced peptides into a weighted de Bruijn graph to assemble the final protein sequences. The method is used to assemble sequences from two datasets containing mixed light and heavy chains from antibodies. The results show that DBAS can assemble long antibody sequences for both mixed light and heavy chains and single chains. In addition, DBAS is able to distinguish the light and heavy chains by using BLAST sequence alignment. Overall, the algorithm performs well in both target sequence coverage and contig assembly accuracy.
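To make the idea of a weighted de Bruijn graph concrete, the following is a minimal Python sketch. It is not the DBAS implementation: the abstract does not specify how quality and alignment scores are combined or how the graph is traversed, so the single per-peptide score, the summed edge weights, the greedy extension, and the function names `build_weighted_graph` and `greedy_contig` are all assumptions made for illustration.

```python
from collections import defaultdict


def build_weighted_graph(peptides, k):
    """Map each k-mer (an edge from its (k-1)-prefix to its (k-1)-suffix)
    to a weight. Assumption: the weight is the sum of the scores of all
    peptides that contain the k-mer."""
    weights = defaultdict(float)
    for seq, score in peptides:
        for i in range(len(seq) - k + 1):
            weights[seq[i:i + k]] += score
    return dict(weights)


def greedy_contig(weights, k):
    """Grow one contig from the heaviest edge, always following the
    heaviest unused edge to the right and then to the left.
    Assumption: greedy traversal, k >= 2."""
    if not weights:
        return ""
    out_edges = defaultdict(list)  # (k-1)-mer -> k-mers that start with it
    in_edges = defaultdict(list)   # (k-1)-mer -> k-mers that end with it
    for kmer in weights:
        out_edges[kmer[:-1]].append(kmer)
        in_edges[kmer[1:]].append(kmer)

    def rank(kmer):
        # Weight first, then the k-mer itself as a deterministic tie-break.
        return (weights[kmer], kmer)

    seed = max(weights, key=rank)
    used = {seed}
    contig = seed

    # Extend to the right while an unused outgoing edge exists.
    while True:
        candidates = [e for e in out_edges[contig[-(k - 1):]] if e not in used]
        if not candidates:
            break
        best = max(candidates, key=rank)
        used.add(best)
        contig += best[-1]

    # Extend to the left while an unused incoming edge exists.
    while True:
        candidates = [e for e in in_edges[contig[:k - 1]] if e not in used]
        if not candidates:
            break
        best = max(candidates, key=rank)
        used.add(best)
        contig = best[0] + contig

    return contig


# Hypothetical overlapping peptide reads with made-up confidence scores.
reads = [("EVQLVESGG", 0.9), ("VESGGGLVQ", 0.8), ("GGLVQPGGS", 0.7)]
print(greedy_contig(build_weighted_graph(reads, k=4), k=4))
```

With these three made-up reads and k = 4, the sketch stitches the overlaps into the single contig EVQLVESGGGLVQPGGS. Summing scores on shared k-mers means that a branch supported only by a low-confidence peptide loses to a branch supported by higher-confidence ones, which is the intuition behind weighting the graph. Separating light from heavy chains, which the abstract attributes to BLAST alignment, is outside the scope of this sketch.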