化学空间
磷化铟
计算机科学
量子点
磷化物
材料科学
机器学习
纳米技术
算法
化学
光电子学
生物化学
药物发现
冶金
镍
砷化镓
作者
Hao Nguyen,Florence Y. Dou,Nayon Park,Shenwei Wu,Harrison Sarsito,Benedicte Diakubama,Helen Larson,Emily Nishiwaki,Micaela Homer,Melanie N. Cash,Brandi M. Cossairt
标识
DOI:10.1021/acs.chemmater.2c00640
摘要
Prediction of chemical reaction outcomes using machine learning (ML) has emerged as a powerful tool for advancing materials synthesis. However, this approach requires large and diverse datasets, which are extremely limited in the field of nanomaterials synthesis due to inconsistent and nonstandardized reporting in the literature and a lack of understanding of synthetic mechanisms. In this study, we extracted parameters of InP quantum dot (QD) syntheses as our inputs and resultant properties (absorption, emission, diameter) as our outputs from 72 publications. We "filled in" missing outputs using a data imputation method to prepare a complete dataset containing 216 entries for training and testing predictive ML models. We defined the descriptor space in two ways (condensed and extended) based on either chemical identity or the role of reagents to explore the best approach for categorizing input features. We achieved mean absolute errors (MAEs) as low as 20.29, 11.46, and 0.33 nm for absorption, emission, and diameter, respectively, with our best ML model. We used these models to deploy an accessible and interactive web app for designing syntheses of InP (https://share.streamlit.io/cossairt-lab/indium-phosphide/Hot_injection/hot_injection_prediction.py). Using this web app, we investigated chemical trends in InP syntheses, such as the effects of common additives, like zinc salts and trioctylphosphine. We also designed and conducted new experiments based on extensions of literature procedures and compared our experimentally measured properties to predictions, thus evaluating the "real-life" accuracy of our models. Conversely, we used inverse design to obtain InP QDs with specific properties. Finally, we applied the same approach to train, test, and launch predictive models for CdSe QDs by expanding a previously published dataset. Altogether, our data preprocessing method and ML implementations demonstrate the ability to design materials with targeted properties and explore underlying reaction mechanisms even when faced with limited data resources.
科研通智能强力驱动
Strongly Powered by AbleSci AI