假阳性悖论
错误发现率
计算机科学
假阳性和假阴性
蛋白质组学
公制(单位)
假阳性率
数据挖掘
统计假设检验
真阳性率
集合(抽象数据类型)
数据集
统计
人工智能
数学
生物
经济
基因
生物化学
程序设计语言
运营管理
作者
Suruchi Aggarwal,Amit Kumar Yadav
标识
DOI:10.1007/978-1-4939-3106-4_7
摘要
With the advancement in proteomics separation techniques and improvements in mass analyzers, the data generated in a mass-spectrometry based proteomics experiment is rising exponentially. Such voluminous datasets necessitate automated computational tools for high-throughput data analysis and appropriate statistical control. The data is searched using one or more of the several popular database search algorithms. The matches assigned by these tools can have false positives and statistical validation of these false matches is necessary before making any biological interpretations. Without such procedures, the biological inferences do not hold true and may be outright misleading. There is a considerable overlap between true and false positives. To control the false positives amongst a set of accepted matches, there is a need for some statistical estimate that can reflect the amount of false positives present in the data processed. False discovery rate (FDR) is the metric for global confidence assessment of a large-scale proteomics dataset. This chapter covers the basics of FDR, its application in proteomics, and methods to estimate FDR.
科研通智能强力驱动
Strongly Powered by AbleSci AI