带隙
计算机科学
钙钛矿(结构)
非线性系统
人工智能
材料科学
Boosting(机器学习)
机器学习
卤化物
决策树
梯度升压
算法
物理
光电子学
工程类
量子力学
随机森林
无机化学
化学工程
化学
作者
Ruoting Zhao,Bangyu Xing,Huimin Mu,Yuhao Fu,Lijun Zhang
出处
期刊:Chinese Physics B
[IOP Publishing]
日期:2022-05-01
卷期号:31 (5): 056302-056302
被引量:8
标识
DOI:10.1088/1674-1056/ac5d2d
摘要
With the rapid development of artificial intelligence and machine learning (ML) methods, materials science is rapidly entering the era of data-driven materials informatics. ML models serve as the most crucial component, closely bridging material structure and material properties. There is a considerable difference in the prediction performance of different ML methods for material systems. Herein, we evaluated three categories (linear, kernel, and nonlinear methods) of models, with twelve ML algorithms commonly used in the materials field. In addition, halide perovskite was chosen as an example to evaluate the fitting performance of different models. We constructed a total dataset of 540 halide perovskites and 72 features, with formation energy and bandgap as target properties. We found that different categories of ML models show similar trends for different target properties. Among them, the difference between the models is enormous for the formation energy, with the coefficient of determination ( R 2 ) range 0.69–0.953. The fitting performance between the models is closer for bandgap, with the R 2 range 0.941–0.997. The nonlinear-ensemble model shows the best fitting performance for both the formation energy and the bandgap. It shows that the nonlinear-ensemble model, constructed by combining multiple weak learners, effectively describes the nonlinear relationship between material features and target property. In addition, the extreme gradient boosting decision tree model shows the most superior results among all the models and searches for two new descriptors that are crucial for formation energy and bandgap. Our work provides useful guidance for the selection of effective machine learning methods in the data-mining studies of specific material systems. The dataset that supported the findings of this study is available in Science Data Bank, with the link https://www.doi.org/10.11922/sciencedb.01611 .
科研通智能强力驱动
Strongly Powered by AbleSci AI