Python(编程语言)
计算机科学
回归分析
真线性模型
逻辑回归
多元自适应回归样条
回归诊断
多项式回归
线性回归
可视化
回归
局部回归
分段回归
数据可视化
数据挖掘
特征选择
机器学习
统计
人工智能
数学
程序设计语言
作者
Laura Igual,Santi Seguí
出处
期刊:Undergraduate topics in computer science
日期:2017-01-01
卷期号:: 97-114
被引量:1
标识
DOI:10.1007/978-3-319-50017-1_6
摘要
In this chapter, we introduce regression analysis and some of its applications in data science using Python tools. We show how regression analysis allows us to understand the behavior of data better, to predict data values (continuous or discrete), and to find important variables by means of building a model from the data. We present four different regression models: simple linear regression, multiple linear regression, polynomial regression and logistic regression. We also emphasize the properties of sparse models in the selection of variables. We use different Python toolboxes to build and apply regression models with ease. Specific visualization tools from Seaborn allow qualitative evaluation; while tools from the Scikit-learn library make quantitative evaluation easier, computing several validation measures. Depending on our aim, visual inspection of the data, statistical analysis or prediction, we chose one tool or another. Regression models are motivated by three real problems that deal with the following questions. Is the climate really changing? Can we predict the price of a new market, given any of its attributes? How many goals makes a football team the winner or the loser?
科研通智能强力驱动
Strongly Powered by AbleSci AI