High-throughput discovery of chemical structure-polarity relationships combining automation and machine-learning techniques

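Polarity (international relations), Automation, Computer science, Artificial intelligence, Standardization, Thin-layer chromatography, Throughput, Machine learning, Chemistry, Chromatography, Biological system, Engineering, Operating system, Biology, Mechanical engineering, Telecommunications, Wireless, Cell, Biochemistry
Authors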
Hao Xu,Jinglong Lin,Qianyi Liu,Yuntian Chen,Jianning Zhang,Yang Yang,Michael C. Young,Yan Xu,Dongxiao Zhang,Fanyang Mo
Source
Journal: Chem [Elsevier BV]
Volume (issue): 8 (12): 3202-3214 Citations: 9
Identifier
DOI:10.1016/j.chempr.2022.08.008
Abstract

• An automated platform is invented to conduct high-throughput TLC analysis
• 4,944 standardized Rf values from 387 compounds under 17 solvent conditions
• A machine-learning model facilitates Rf prediction and chromatographic separation
• Higher topological polar surface area (TPSA) contributes to smaller Rf values

As an essential attribute of organic compounds, polarity has a profound influence on many molecular properties. Thin-layer chromatography (TLC) represents a commonly used technique for empirical polarity estimations. Current TLC techniques need repetitive attempts to obtain suitable development conditions and have low reproducibility due to a low degree of standardization. Herein, we describe an automated system to conduct TLC analysis automatically, facilitating high-throughput collection of a large quantity of experimental data under standardized conditions.
Using this dataset, machine-learning (ML) methods are employed to construct surrogate models correlating organic compound structures and their polarity reflected by retardation factor (Rf). The trained ML models are able to predict the Rf value curve of organic compounds in different solvent combinations with high accuracy, thus providing general guidelines for the selection of purification conditions and expediting the generation and analysis of quality TLC data.
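
For readers unfamiliar with the quantity being predicted, the retardation factor is conventionally the distance a compound travels from the baseline divided by the distance the solvent front travels. The sketch below only illustrates that standard definition; the function name and the measurements are hypothetical and are not taken from the paper or its platform.

```python
def retardation_factor(spot_distance_mm: float, front_distance_mm: float) -> float:
    """Rf = distance travelled by the compound / distance travelled by the solvent front.

    Both distances are measured from the baseline (origin line) of the TLC plate.
    """
    if front_distance_mm <= 0:
        raise ValueError("solvent front distance must be positive")
    if not 0 <= spot_distance_mm <= front_distance_mm:
        raise ValueError("spot distance must lie between the baseline and the solvent front")
    return spot_distance_mm / front_distance_mm


# Hypothetical measurements, for illustration only.
rf = retardation_factor(spot_distance_mm=21.0, front_distance_mm=60.0)
print(f"Rf = {rf:.2f}")  # prints "Rf = 0.35"
```

Under this definition Rf always lies between 0 and 1. A more polar compound is retained more strongly on a polar stationary phase such as silica and therefore shows a smaller Rf, which is consistent with the TPSA trend stated in the highlights.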
