编码(社会科学)
序列(生物学)
编码区
王国
进化生物学
生物
计算生物学
遗传学
统计
数学
古生物学
基因
作者
Sebu Aboma Temesgen,Bakanina Kissanga Grace-Mercure,Basharat Ahmad,Yan-Ting Jin,Li Liu,Hao Lin
标识
DOI:10.2174/0115748936355149250108083111
摘要
Background: Genetic information about organisms' traits is stored and encoded in deoxyribonucleic acid (DNA) sequences. The fundamental inquiry into the storage mechanisms of this genetic information within genomes has long been of interest to geneticists and biophysicists. Objective: The objective of this study was to investigate the distribution of coding sequence (CDS) lengths in species genomes across different kingdoms. Methods: In this study, we used the maximum entropy principle and the gamma distribution model based on a comprehensive dataset including viruses, archaea, bacteria, and eukaryote species Results: Our study result revealed unique patterns in CDS length distributions among kingdoms and CDS lengths exhibit a right-skewed distribution, with varying preferences among kingdoms. Eukaryotes displayed bimodal distributions, with CDS sequences longer than those of prokaryotes. Fitting the gamma distribution model revealed differences in shape and scale parameters among kingdoms, with eukaryotes exhibiting larger scale parameters, indicating longer CDS sequences. Additionally, analysis of moments highlighted the complexity of eukaryotic genomes relative to prokaryotes. Conclusion: This study result deepens our understanding of genome evolution and provides valuable insights for biological research.Conclusion: This study result deepens our understanding of genome evolution and provides valuable insights for biological research.
科研通智能强力驱动
Strongly Powered by AbleSci AI