过度拟合
碳化物
高熵合金
计算机科学
熵(时间箭头)
人工智能
样本熵
陶瓷
机器学习
数据挖掘
模式识别(心理学)
材料科学
热力学
物理
冶金
人工神经网络
合金
作者
Rahul Mitra,Anurag Bajpai,Krishanu Biswas
标识
DOI:10.1016/j.commatsci.2023.112142
摘要
The lack of appropriate data and data imbalance hindered the development of ML models for identifying novel high-entropy ceramics. To circumvent data imbalance for ML-based ceramic design, we build a semi-synthetic database of high entropy carbides using literature data, atomic environment mapping-based structure plots and adaptive synthetic sampling (ADASYN) technique. A 5-fold cross-validated kNN classifier was trained on both original and balanced datasets. The kNN model trained on a balanced dataset has 95% testing accuracy while controlling for overfitting. SHAP describes the relationship between characteristics and goal variables. This paper shows a new ML approach with a decreased bias to predict high-entropy single-phase carbides preemptively.
科研通智能强力驱动
Strongly Powered by AbleSci AI