计算机科学
分拆(数论)
编码(集合论)
介绍(产科)
领域(数学)
机器学习
选择(遗传算法)
人工智能
理论计算机科学
程序设计语言
数学
医学
组合数学
放射科
集合(抽象数据类型)
纯数学
作者
Christian Henning Gierlich,Stefan Palkovits
标识
DOI:10.1016/j.coche.2022.100840
摘要
Chemical data must be encoded for the use in machine-learning algorithms. In this article, we present a selection of featurization methods. Furthermore, we give an insight into the field of application for which individual methods can be used. The introduction to this topic is facilitated by the presentation of a code example. Our goal was to provide a step-by-step tutorial that provides a basic understanding of ML in chemistry. For this purpose, a dataset consisting of partition coefficients for the extraction of dimethoxymethanol from an aqueous system is provided. We could show that the way of featurizing this dataset plays a crucial role for model performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI