计算机科学
数据科学
背景(考古学)
大数据
钥匙(锁)
数据驱动
人工智能
期限(时间)
机器学习
数据挖掘
古生物学
生物
物理
量子力学
计算机安全
标识
DOI:10.1002/anie.202219170
摘要
Machine learning (ML) algorithms are currently emerging as powerful tools in all areas of science. Conventionally, ML is understood as a fundamentally data-driven endeavour. Unfortunately, large well-curated databases are sparse in chemistry. In this contribution, I therefore review science-driven ML approaches which do not rely on "big data", focusing on the atomistic modelling of materials and molecules. In this context, the term science-driven refers to approaches that begin with a scientific question and then ask what training data and model design choices are appropriate. As key features of science-driven ML, the automated and purpose-driven collection of data and the use of chemical and physical priors to achieve high data-efficiency are discussed. Furthermore, the importance of appropriate model evaluation and error estimation is emphasized.
科研通智能强力驱动
Strongly Powered by AbleSci AI