梯度升压
人工智能
机器学习
决策树
特征选择
Boosting(机器学习)
计算机科学
二元分类
集成学习
降维
数据挖掘
维数之咒
回归
随机森林
人工神经网络
支持向量机
模式识别(心理学)
数学
统计
作者
Sandeep Kumar Singla,Rahul Garg,Om Prakash Dubey
出处
期刊:Intelligent Data Analysis
[IOS Press]
日期:2021-09-15
卷期号:25 (5): 1291-1322
被引量:1
摘要
Recent technological enhancements in the field of information technology and statistical techniques allowed the sophisticated and reliable analysis based on machine learning methods. A number of machine learning data analytical tools may be exploited for the classification and regression problems. These tools and techniques can be effectively used for the highly data-intensive operations such as agricultural and meteorological applications, bioinformatics and stock market analysis based on the daily prices of the market. Machine learning ensemble methods such as Decision Tree (C5.0), Classification and Regression (CART), Gradient Boosting Machine (GBM) and Random Forest (RF) has been investigated in the proposed work. The proposed work demonstrates that temporal variations in the spectral data and computational efficiency of machine learning methods may be effectively used for the discrimination of types of sugarcane. The discrimination has been considered as a binary classification problem to segregate ratoon from plantation sugarcane. Variable importance selection based on Mean Decrease in Accuracy (MDA) and Mean Decrease in Gini (MDG) have been used to create the appropriate dataset for the classification. The performance of the binary classification model based on RF is the best in all the possible combination of input images. Feature selection based on MDA and MDG measures of RF is also important for the dimensionality reduction. It has been observed that RF model performed best with 97% accuracy, whereas the performance of GBM method is the lowest. Binary classification based on the remotely sensed data can be effectively handled using random forest method.
科研通智能强力驱动
Strongly Powered by AbleSci AI