DNA测序
杂交测序
计算生物学
DNA
寡核苷酸
错误检测和纠正
序列(生物学)
结扎测序
计算机科学
DNA纳米球测序
碱基对
标识符
生物
算法
遗传学
基因组文库
基序列
DNA测序器
程序设计语言
作者
Huiran Yeom,Namphil Kim,Amos Chungwon Lee,Jinhyun Kim,Hamin Kim,Hansol Choi,Seo Woo Song,Sunghoon Kwon,Yeongjae Choi
标识
DOI:10.1021/acssynbio.3c00308
摘要
A comprehensive error analysis of DNA-stored data during processing, such as DNA synthesis and sequencing, is crucial for reliable DNA data storage. Both synthesis and sequencing errors depend on the sequence and the transition of bases of nucleotides; ignoring either one of the error sources leads to technical challenges in minimizing the error rate. Here, we present a methodology and toolkit that utilizes an oligonucleotide library generated from a 10-base-shifted sequence array, which is individually labeled with unique molecular identifiers, to delineate and profile DNA synthesis and sequencing errors simultaneously. This methodology enables position- and sequence-independent error profiling of both DNA synthesis and sequencing. Using this toolkit, we report base transitional errors in both synthesis and sequencing in general DNA data storage as well as degenerate-base-augmented DNA data storage. The methodology and data presented will contribute to the development of DNA sequence designs with minimal error.
科研通智能强力驱动
Strongly Powered by AbleSci AI