Deep learning‐based multi‐omics study reveals the polymolecular phenotypic of diabetic kidney disease

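Biomarker discovery, Disease, Medicine, Diabetes, Omics, Lipidomics, Biomarker, Metabolomics, Bioinformatics, Endocrinology, Proteomics, Internal medicine, Biology, Biochemistry, Gene
Authors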
Huan Zhao,Yuan Yu,Siyu Chen,Yaqi Yao,Chenghao Bi,Chuanxin Liu,Guijiang Sun,Haihua Su,Xinyue Li,Xiaomeng Li,Xingxu Yan,Yubo Li
Source
Journal: Clinical and translational medicine [Wiley]
Volume (issue): 13 (6) Citations: 1
Identifier
DOI:10.1002/ctm2.1301
Abstract

Dear Editor,

Approximately 30% to 40% of patients with type 2 diabetes mellitus (T2DM) develop diabetic kidney disease (DKD), and most will go on to develop end-stage renal disease [1]. The presence of kidney disease complicates the management of patients with T2DM [2]. Therefore, identifying biomarkers for the early diagnosis of DKD, based on circulating molecular factors associated with physiological alterations in patients with T2DM, can effectively reduce and delay the incidence of DKD. We used deep learning (DL) to analyze multi-omics data and establish key molecular characteristics (biomarker panels) that affect the incidence and development of DKD.

Based on strict diagnostic inclusion and exclusion criteria, 405 subjects from two centers in China were included in the discovery (n = 105) and test (n = 300) sets and divided into healthy control (HC), T2DM, and DKD groups (Table 1 and Supplementary Materials). In the discovery set, the combination of lipidomics and data-independent acquisition quantitative proteomics enabled the discovery of additional potential biomarkers and pathological mechanisms related to the occurrence and development of DKD.

Lipidomics revealed that the metabolic profiles of both disease groups changed significantly compared with that of the HC group; however, the metabolic profiles of the T2DM and DKD groups were relatively similar (Figure 1A). Using the criteria of variable importance in projection (VIP) > 1 and p < .05, 70 differential serum metabolites (Table S2) were identified (Figure 1B and Figure S1A). These mainly involved metabolic pathways such as sphingolipid metabolism, steroid hormone biosynthesis, glycerophospholipid metabolism and arachidonic acid metabolism (Figure 1C). In addition, the distribution of lipid abundance and lipid classes among all groups showed that the glycerolipid and glycerophospholipid proportions were the highest.
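
The lipid screening rule quoted above (variable importance in projection > 1 and p < .05) can be illustrated with a minimal sketch. The `Feature` class, the `screen_features` helper and all numbers below are hypothetical and are not taken from the study; the letter does not state which model produced the VIP scores or which statistical test produced the p-values, so both are treated here as precomputed inputs.

```python
from dataclasses import dataclass


@dataclass
class Feature:
    name: str
    vip: float      # variable importance in projection (assumed precomputed)
    p_value: float  # p-value from a group comparison (assumed precomputed)


def screen_features(features, vip_cutoff=1.0, p_cutoff=0.05):
    """Keep features with VIP strictly above vip_cutoff and p strictly below p_cutoff."""
    return [f for f in features if f.vip > vip_cutoff and f.p_value < p_cutoff]


# Hypothetical values for illustration only; not data from the study.
candidates = [
    Feature("lipid_A", vip=1.8, p_value=0.003),  # passes both criteria
    Feature("lipid_B", vip=0.7, p_value=0.010),  # fails the VIP criterion
    Feature("lipid_C", vip=1.2, p_value=0.200),  # fails the p-value criterion
    Feature("lipid_D", vip=1.0, p_value=0.001),  # VIP equals the cutoff, so it fails
]

selected = screen_features(candidates)
print([f.name for f in selected])
```

Both comparisons are strict, matching the "> 1" and "< .05" wording, so a feature sitting exactly on a cutoff is excluded.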

Proteomic data showed that protein content may vary depending on the physiological state of the individual (Figure 1D). With fold change (≥ 1.5 or ≤ .67) and p < .05 as screening criteria, 219 differential proteins were quantified (Figure S1B and Figure 1E–F), most of which were highly expressed in both disease groups (Table S3). In addition, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analyses of the 219 proteins showed that complement and coagulation cascades, focal adhesions and phagosomes were significantly enriched, indicating that the development of DKD was related to pro-inflammatory signals (Figure S1C and D).

Research is increasingly focusing on applying multi-omics to identify ‘at-risk’ profiles [3]. At present, biomarkers for the risk of diabetes progressing to DKD have been identified at the single-molecule level; however, their diagnostic efficacy is poor [4, 5]. DKD is a complex secondary disease, and studies on risk markers at multiple molecular levels would help reflect disease risk [6]. We used support vector machine and convolutional neural network (CNN) models to evaluate the accuracy of single- and multi-omics data and found that the CNN model on multi-omics data showed significant advantages (Table S4), with the highest internal and prediction accuracies (100% and 90.48%, respectively). The neighborhood component analysis algorithm selected 58 fusion features (20%) from the 289 differential features (70 lipids and 219 proteins), including 32 differential proteins and 26 differential lipids. To reveal the intrinsic association of the 58 fusion features with DKD, Pearson correlation coefficient analysis was performed (Figure 2A). Twelve lipid metabolites showed significant association (R > .5) with 26 differentially expressed proteins (Figure 2B).
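
The Pearson correlation step described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: `pearson_r` and `correlated_pairs` are hypothetical helper names, the abundance vectors are made up, and the cutoff is applied to the signed coefficient (R > .5) as the text words it, although the letter does not say whether absolute values were used.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sd_x * sd_y)


def correlated_pairs(lipids, proteins, r_cutoff=0.5):
    """Return (lipid, protein, r) for every pair whose r exceeds r_cutoff."""
    pairs = []
    for lipid_name, lipid_values in lipids.items():
        for protein_name, protein_values in proteins.items():
            r = pearson_r(lipid_values, protein_values)
            if r > r_cutoff:
                pairs.append((lipid_name, protein_name, r))
    return pairs


# Hypothetical abundances across five samples; not data from the study.
lipids = {"lipid_X": [1.0, 2.0, 3.0, 4.0, 5.0]}
proteins = {
    "protein_P": [2.0, 4.0, 6.0, 8.0, 10.0],  # rises with lipid_X
    "protein_Q": [5.0, 4.0, 3.0, 2.0, 1.0],   # falls as lipid_X rises
}

pairs = correlated_pairs(lipids, proteins)
for lipid_name, protein_name, r in pairs:
    print(f"{lipid_name} ~ {protein_name}: r = {r:.2f}")
```

In practice a significance test would accompany each coefficient; it is omitted here to keep the sketch short.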

By plotting the relative abundance of these lipid metabolites, we observed that the vast majority of lipids were significantly more enriched in patients with T2DM than in those with DKD (Figure 2C) and showed a linear increase with disease progression. A strong positive correlation among trihydroxycoprostanoic acid, Cer (d18:1/16:0), and 3α,7α-dihydroxycoprostanic acid was observed (Figure 2D, R > .85, p < .01). These results suggest that DKD-related proteins are associated with changes in serum lipid metabolite levels. In the test set, four lipid metabolites and four proteins among the 58 fusion features showed trends and content changes similar to those in the discovery set (Tables S5 and S6).

Recently, several clinical histological studies have focused on the concept of a “biomarker panel” [2, 7, 8]. Based on the above results, we selected 3α,7α-dihydroxycoprostanic acid and Cer (d18:1/16:0), which had high absolute contributions, to draw the receiver operating characteristic (ROC) curve; the area under the curve (AUC) was .800 (95% confidence interval [CI]: .698–.902) for distinguishing T2DM from DKD (Figure 2E). Subsequently, the remaining six substances were added to obtain the best biomarker panel for predicting the development of DKD, which was composed of 3α,7α-dihydroxycoprostanic acid, Cer (d18:1/16:0), cyclase-associated protein 1 (CAP1) and talin-1 (TLN1) (AUC = .873; 95% CI: .794–.951) (Figure 2F and Figure S2A–B). We applied the obtained biomarker panel to the discovery (AUC = .838, 95% CI: .726–.950) and test (AUC = .938, 95% CI: .867–1.000) sets, where it showed a diagnostic ability far higher than that of serum creatinine (SCR) (AUC = .620, 95% CI: .485–.755) and blood urea nitrogen (BUN) (AUC = .638, 95% CI: .506–.770) (Figure 2G and H). The two lipid metabolites, Cer (d18:1/16:0) and 3α,7α-dihydroxycoprostanic acid, had prominent and robust positive correlations with hemoglobin A1c and glucose levels (Figure S2C).
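
For readers less familiar with the area under the ROC curve (AUC) values reported above, the quantity can be read as the probability that a randomly chosen DKD patient receives a higher panel score than a randomly chosen T2DM patient. The sketch below computes it directly from that definition. The `roc_auc` helper and the scores are hypothetical; the letter does not describe how the four markers were combined into a single score, so a generic precomputed score is assumed.

```python
def roc_auc(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    A pair counts 1 when the positive score is higher and 0.5 when tied.
    """
    if not scores_pos or not scores_neg:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for pos in scores_pos:
        for neg in scores_neg:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical panel scores; not data from the study.
dkd_scores = [0.9, 0.8, 0.6, 0.4]   # positives (DKD)
t2dm_scores = [0.7, 0.5, 0.3, 0.2]  # negatives (T2DM without DKD)

auc = roc_auc(dkd_scores, t2dm_scores)
print(f"AUC = {auc:.4f}")  # 13 of 16 pairs are ranked correctly: 0.8125
```

The pairwise loop is quadratic in the number of samples, which is fine for cohorts of a few hundred subjects; a rank-based formula gives the same value more efficiently.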

In addition, the positive correlations of CAP1 and TLN1 with SCR and BUN were stronger than those of the two lipid metabolites (Figure S2D). Furthermore, all four markers were positively correlated with a history of diabetes to varying degrees, with the two lipid metabolites being particularly significant (Figure S2E). This emphasizes the complementary nature and importance of a biomarker panel.

In conclusion, this study combined multiple bioinformatic tools and learning algorithms to identify an optimal diagnostic biomarker panel for DKD. Our findings provide insights for the integrated modelling of multi-omics data and new research opportunities for T2DM complications. Furthermore, the combined use of two powerful omics techniques, lipidomics and proteomics, provided a comprehensive understanding of this disease [9, 10]. The advent of DL will enable the handling of large amounts of high-dimensional and complex-structured data, further enabling the identification of key metabolic features. Because of constraints on sample collection and time, this study trained models on a small population and validated them in a larger cohort, which may have caused some features to be overlooked. Therefore, in future studies, attention should be paid to cohort settings such as the split ratio (usually 8:1 to 4:1).

The authors would like to thank Shanghai Jiaotong University for providing the proteomics research platform for this study. They thank all participants of the cohort for their contributions to this study.

The authors declare no conflict of interest.