非参数统计
航程(航空)
功能(生物学)
计算机科学
回归
统计
度量(数据仓库)
数学
数据挖掘
生物
进化生物学
工程类
航空航天工程
作者
David N. Reshef,Yakir Reshef,Hilary K. Finucane,Sharon R. Grossman,Gil McVean,Peter J. Turnbaugh,Eric S. Lander,Michael Mitzenmacher,Pardis C. Sabeti
出处
期刊:Science
[American Association for the Advancement of Science (AAAS)]
日期:2011-12-16
卷期号:334 (6062): 1518-1524
被引量:2538
标识
DOI:10.1126/science.1205438
摘要
Identifying interesting relationships between pairs of variables in large data sets is increasingly important. Here, we present a measure of dependence for two-variable relationships: the maximal information coefficient (MIC). MIC captures a wide range of associations both functional and not, and for functional relationships provides a score that roughly equals the coefficient of determination (R(2)) of the data relative to the regression function. MIC belongs to a larger class of maximal information-based nonparametric exploration (MINE) statistics for identifying and classifying relationships. We apply MIC and MINE to data sets in global health, gene expression, major-league baseball, and the human gut microbiota and identify known and novel relationships.
科研通智能强力驱动
Strongly Powered by AbleSci AI