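Bioconductor
Count data
Computer science
Computational biology
R package
Inference
Software
Overdispersion
Data mining
Biology
Genetics
Gene
Statistics
Poisson distribution
Mathematics
Artificial intelligence
Computational science
Programming language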
Authors
Mark D. Robinson, Davis J. McCarthy, Gordon K. Smyth
Source
Journal: Bioinformatics
[Oxford University Press]
Date: 2009-11-11
Volume (issue): 26 (1), pages 139-140
Citations: 34345
Identifier
DOI: 10.1093/bioinformatics/btp616
Abstract
Summary: It is expected that emerging digital gene expression (DGE) technologies will overtake microarray technologies in the near future for many functional genomics applications. One of the fundamental data analysis tasks, especially for gene expression studies, involves determining whether there is evidence that counts for a transcript or exon are significantly different across experimental conditions. edgeR is a Bioconductor software package for examining differential expression of replicated count data. An overdispersed Poisson model is used to account for both biological and technical variability. Empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference. The methodology can be used even with the most minimal levels of replication, provided at least one phenotype or experimental condition is replicated. The software may have other applications beyond sequencing data, such as proteome peptide count data.
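The two statistical ideas in the abstract, an overdispersed Poisson model and moderation of per-transcript dispersion, can be illustrated with a small sketch. It is not edgeR's algorithm: it assumes the variance form Var = mu + phi * mu^2 as one common way to write overdispersion, estimates phi by a simple method of moments, and shrinks each estimate toward the across-gene mean with a fixed weight. The function names, the weight of 0.5, and the toy counts are all hypothetical.

```python
from statistics import mean, variance


def moment_dispersion(counts):
    """Method-of-moments estimate of phi, assuming Var = mu + phi * mu**2.

    A pure Poisson model corresponds to phi = 0; estimates are floored at 0.
    """
    mu = mean(counts)
    if mu == 0:
        return 0.0
    s2 = variance(counts)  # sample variance, needs at least two replicates
    return max((s2 - mu) / mu ** 2, 0.0)


def moderate(per_gene, weight=0.5):
    """Shrink each per-gene estimate toward the mean across genes.

    Stand-in for empirical Bayes moderation (assumption: fixed weight,
    whereas a real method would derive the amount of shrinkage from the data).
    """
    common = mean(list(per_gene.values()))
    return {g: weight * common + (1 - weight) * phi for g, phi in per_gene.items()}


# Hypothetical replicate counts for three transcripts in one condition.
counts = {
    "geneA": [10, 12, 8, 11],
    "geneB": [5, 40, 2, 60],
    "geneC": [100, 95, 110, 105],
}

raw = {gene: moment_dispersion(c) for gene, c in counts.items()}
moderated = moderate(raw)

for gene in counts:
    print(f"{gene}: raw={raw[gene]:.3f} moderated={moderated[gene]:.3f}")
```

With few replicates the raw estimates are noisy, so pulling them toward a shared value trades a little bias for much lower variance, which is the motivation the abstract gives for moderation.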