强化学习
计算机科学
过度拟合
图形
等变映射
人工智能
理论计算机科学
人工神经网络
数学
纯数学
作者
Jonas Elsborg,Arghya Bhowmik
标识
DOI:10.1021/acs.jcim.3c00394
摘要
We have developed an actor–critic-type policy-based reinforcement learning (RL) method to find low-energy nanoparticle structures and compared its effectiveness to classical basin-hopping. We took a molecule building approach where nanoalloy particles can be regarded as metallic molecules, albeit with much higher flexibility in structure. We explore the strengths of our approach by tasking an agent with the construction of stable mono- and bimetallic clusters. Following physics, an appropriate reward function and an equivariant molecular graph representation framework is used to learn the policy. The agent succeeds in finding well-known stable configuration for small clusters in both single and multicluster experiments. However, for certain use cases the agent lacks generalization to avoid overfitting. We relate this to the pitfalls of actor–critic methods for molecular design and discuss what learning properties an agent will require to achieve universality.
科研通智能强力驱动
Strongly Powered by AbleSci AI