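Copy number variation
Outlier
Structural variation
Variation (astronomy)
Computational biology
Computer science
Anomaly detection
Genome
Biology
Genetics
Data mining
Gene
Artificial intelligence
Astrophysics
Physics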
Authors
Chengyou Li, Shiqiang Fan, Haiyong Zhao, Xiaotong Liu
Identifier
DOI:10.1142/s0219720023500269
Abstract
Copy number variation (CNV) is a type of genomic structural variation that accounts for a large proportion of all structural variation. It is associated with the pathogenesis of, and susceptibility to, some human diseases and plays an important role in their development and progression. Next-generation sequencing (NGS) technology provides strong support for the design of CNV detection algorithms. Although many methods have been developed to detect CNVs from NGS data, detecting CNVs in data with low purity and low coverage is still considered a difficult problem. This paper proposes a new computational method, CNV-FB, for detecting CNVs from NGS data. The core idea of CNV-FB is to randomly sample the read depth values of genome fragments, run outlier detection on each sample individually, and then combine the results into a final outlier score. CNV-FB was applied to simulated and real data and compared with five other methods of the same type. The results show that CNV-FB detects CNVs better than the other methods. CNV-FB may therefore be an effective algorithm for detecting genomic mutations.
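The sample-then-combine idea described in the abstract can be illustrated with a minimal sketch. This is not the CNV-FB implementation: the abstract does not say which outlier detector, sample size, or combination rule the authors use. The function name `ensemble_outlier_scores`, the parameters `n_rounds` and `sample_frac`, the robust z-score detector (median and MAD), and the averaging step are all assumptions chosen for illustration.

```python
import numpy as np

def ensemble_outlier_scores(read_depth, n_rounds=50, sample_frac=0.5, seed=0):
    """Average per-bin outlier scores over random subsamples of read depth.

    Hypothetical illustration only: the detector (robust z-score) and the
    combination rule (mean over rounds) are assumptions, not the paper's.
    """
    rd = np.asarray(read_depth, dtype=float)
    n = rd.size
    rng = np.random.default_rng(seed)
    size = min(n, max(2, int(round(sample_frac * n))))

    total = np.zeros(n)   # sum of scores each bin received
    counts = np.zeros(n)  # number of rounds in which each bin was sampled

    for _ in range(n_rounds):
        # Draw a random subset of bins without replacement.
        idx = rng.choice(n, size=size, replace=False)
        sample = rd[idx]

        # Score the sampled bins against the sample's own median and MAD.
        med = np.median(sample)
        mad = np.median(np.abs(sample - med))
        scale = 1.4826 * mad if mad > 0 else 1.0
        total[idx] += np.abs(sample - med) / scale
        counts[idx] += 1

    # Combine: mean score over the rounds that included each bin.
    scores = np.full(n, np.nan)
    seen = counts > 0
    scores[seen] = total[seen] / counts[seen]
    return scores

# Toy read-depth profile: baseline near 100, bins 10 to 12 doubled.
depth = np.array([100 + (i % 5) for i in range(30)], dtype=float)
depth[10:13] = 200.0
scores = ensemble_outlier_scores(depth)
```

In this toy profile the three doubled bins receive higher combined scores than every baseline bin. A real pipeline would also need read-depth normalization (for example GC-content correction) and a rule for turning scores into CNV calls, neither of which the abstract specifies.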