符号回归
计算机科学
回归
变量(数学)
特征选择
财产(哲学)
回归分析
钥匙(锁)
理论(学习稳定性)
选择(遗传算法)
数学优化
算法
理论计算机科学
人工智能
机器学习
数学
统计
遗传程序设计
认识论
数学分析
哲学
计算机安全
作者
Zhen Guo,Shunbo Hu,Zhongkang Han,Runhai Ouyang
标识
DOI:10.1021/acs.jctc.2c00281
摘要
Symbolic regression offers a promising avenue for describing the structure-property relationships of materials with explicit mathematical expressions, yet it meets challenges when the key variables are unclear because of the high complexity of the problems. In this work, we propose to solve the difficulty by automatically searching for important variables from a large pool of input features. A new algorithm that integrates symbolic regression with iterative variable selection (VS) was designed for optimization of the model with a large amount of input features. Using the recent method SISSO for symbolic regression and random search for variable selection, we show that the VS-assisted SISSO (VS-SISSO) can effectively manage even hundreds of input features that the SISSO alone was computationally hindered, and it fastly converges to (near) optimal solutions when the model complexity is not high. The efficiency of this approach for improving the accuracy of symbolic regression in materials science was demonstrated in the two showcase applications of learning approximate equations for the band gap of inorganic halide perovskites and the stability of single-atom alloy catalysts.
科研通智能强力驱动
Strongly Powered by AbleSci AI