分子描述符
生物系统
计算机科学
集合(抽象数据类型)
人工智能
机器学习
有机太阳能电池
构造(python库)
预测建模
数据挖掘
模式识别(心理学)
数量结构-活动关系
光伏系统
工程类
电气工程
程序设计语言
生物
作者
Zhi‐Wen Zhao,Marcos del Cueto,Yun Geng,Alessandro Troisi
标识
DOI:10.1021/acs.chemmater.0c02325
摘要
In this work, we analyzed a data set formed by 566 donor/acceptor pairs, which are part of organic solar cells recently reported. We explored the effect of different descriptors in machine learning (ML) models to predict the power conversion efficiency (PCE) of these cells. The investigated descriptors are classified into two main categories: structural (topology properties) and physical descriptors (energy levels, molecular size, light absorption, and mixing properties). In line with previous observations, ML predictions are more accurate when using both structural and physical descriptors, as opposed to only using one of them. We observed that ML predictions are also improved by using larger and more varied data sets. Importantly, the structural descriptors are the ones contributing the most to the ML models. Some physical properties are highly correlated with PCE, although they do not improve notably the ML prediction accuracy as they carry information already encoded in the structural descriptors. Given that various descriptors have significantly different computational costs, the analysis presented here can be used as a guide to construct ML models that maximize predictive power and minimize computational costs for screening large sets of candidates.
科研通智能强力驱动
Strongly Powered by AbleSci AI