十字形
概率逻辑
序列(生物学)
计算机科学
DNA测序
DNA
计算生物学
算法
人工智能
生物系统
遗传学
生物
工程类
结构工程
作者
Matej Lexa,Marie Brázdová,Lucie Navrátilová,Karel Nejedlý
标识
DOI:10.5220/0003705701240130
摘要
Sequence-dependent secondary DNA structures, such as cruciform or triplex DNA, are implicated in regulation of gene transcription and other important biological processes at the molecular level. Sequences capable of forming these structures can readily be identified in entire genomes by appropriate searching techniques. However, not every DNA segment containing the proper sequence has equal probability of forming an alternative structure. Calculating the free energy of the potential structures provides an estimate of their stability in vivo, but there are other structural factors, both local and non-local, not taken into account by such simplistic approach. In is paper we present the procedure we currently use to identify potential cruciform structures in DNA sequences. The procedure relies on identification of palindromes (or inverted repeats) and their evaluation by a nucleic acid folding program (UNAFold). We further extended the procedure by adding a modelling step to filter the predicted cruciforms. The model takes into account superhelical density of the analyzed segments of DNA and calculates the probability of cruciforms forming at several locations of the analyzed DNA, based on the sequences in the stem and loop areas of the structures and competition among them.
科研通智能强力驱动
Strongly Powered by AbleSci AI