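Artificial intelligence
Machine learning
Computer science
Decision tree
Support vector machine
Random forest
Deep learning
Boosting (machine learning)
Embedding
Raw data
Feature (linguistics)
Gradient boosting
Feature vector
Pattern recognition (psychology)
Data mining
Philosophy
Linguistics
Programming language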
Authors
Viet Cuong Ta, Thi-Linh Hoang, Nhat Trung Doan, Van-Thang Nguyen, Ntawangaheza Jean de Dieu, Thi Thanh Thuy Pham, Nguyễn Đăng Nam
Identifier
DOI:10.1080/10916466.2023.2223623
Abstract
Well log data are raw tabular data with diverse, nonlinear features, which makes feature learning difficult for machine learning models. Decision tree-based algorithms that have recently become popular, such as random forest (RF) and extreme gradient boosting (XGBoost), learn the nonlinear relationships in well log data better than support vector machines (SVMs) and even deep learning models. In this work, we propose using the TabNet model to learn directly from tabular well log data. To our knowledge, this is the first time that TabNet, a state-of-the-art transformer-based model, has been used for this task. We evaluate TabNet-based feature embedding on two tasks: rock facies classification and feature embedding learning. We demonstrate the effectiveness of TabNet experimentally on two small datasets: the public Kansas dataset, with nine wells for training and two wells for testing, and our own dataset, with four wells for training and one well for testing. Although trained on a modest amount of well log data, the proposed TabNet model achieves better classification performance than the tree-based models RF, XGBoost, and LightGBM and the deep learning models MLP, CNN-1D, and ResNet-1D.
KEY POINTS:
TabNet efficiency for facies classification and learning feature embedding from well log data.
A challenge to learn these raw features directly for separating classes of facies.
The superiority of the TabNet network in comparison with other prevailing tree-based methods and deep learning models.
Facies classification and learning feature embeddings for categorical variables of well logs.
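The abstract describes an evaluation in which whole wells are held out for testing (for example, nine wells for training and two for testing). A minimal sketch of such a well-wise split is given below. It is not the authors' code: the function name `split_by_well`, the well names, the field names `gr` and `phi`, and all values are hypothetical and serve only to illustrate the idea.

```python
def split_by_well(rows, test_wells):
    """Split records so that every well is entirely in train or entirely in test."""
    test_wells = set(test_wells)
    train, test = [], []
    for row in rows:
        # Route the record by the well it was logged in, not at random.
        (test if row["well"] in test_wells else train).append(row)
    return train, test


# Hypothetical toy records: well name, two log readings, facies label.
rows = [
    {"well": "W1", "gr": 77.5, "phi": 11.9, "facies": 3},
    {"well": "W1", "gr": 78.3, "phi": 12.6, "facies": 3},
    {"well": "W2", "gr": 61.0, "phi": 9.4, "facies": 2},
    {"well": "W2", "gr": 59.8, "phi": 8.7, "facies": 2},
    {"well": "W3", "gr": 85.1, "phi": 14.2, "facies": 1},
    {"well": "W3", "gr": 83.6, "phi": 13.8, "facies": 1},
]

# Hold out well W3 completely; W1 and W2 form the training set.
train, test = split_by_well(rows, test_wells={"W3"})
```

Splitting by well rather than by individual sample keeps depth-adjacent, highly correlated readings from the same well out of both sets at once, which would otherwise inflate test accuracy.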