计算机科学
机器学习
光伏系统
维数(图论)
钙钛矿(结构)
人工智能
任务(项目管理)
预测建模
化学空间
实验数据
数据挖掘
系统工程
数学
工程类
统计
电气工程
纯数学
化学工程
生物信息学
药物发现
生物
作者
Simone Sala,Eleonora Quadrivi,Paolo Biagini,Riccardo Pò
出处
期刊:Solar RRL
[Wiley]
日期:2023-08-31
卷期号:7 (20)
标识
DOI:10.1002/solr.202300490
摘要
Machine learning models have become widespread in materials science research. An open‐access and community‐driven database containing over 40 000 perovskite photovoltaic devices has been recently published. This resource enables the application of predictive data‐driven models to correlate device structure with photovoltaic performance, whereas the literature usually focuses on specific device layers. Herein, the concept of device‐level performance prediction is explored using gradient‐boosted regression trees as the core algorithm and Shapley values analysis to interpret and rationalize the results. The main pitfalls and conceptual limitations of the approach are discussed and correlated with the database structure and dimension, by comparing the performance of different choices of descriptors and dataset size. Evidence suggests that the additional features introduced herein, in particular chemical descriptors of perovskite additives, can boost regression performance at a device level. A specific model is finally trained to predict the performance of unseen devices and tested on experimental data from the literature. This task is found to be particularly challenging, as the ability of the model to generalize to a new chemical space is limited by several factors, including the amount and the quality of available data.
科研通智能强力驱动
Strongly Powered by AbleSci AI