Cohort
Biomarker
Machine learning
Support vector machine
Medicine
Lasso (statistics)
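Artificial intelligence
Algorithm
Cross-validation
Feature selection
Oncology
Gold standard (test)
Sarcoma
Internal medicine
Computer science
Biology
Pathology
Genetics
World Wide Web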
Authors
Yonghua Pang,Jiahui Liang,Yi Deng,Weinan Chen,Yunyan Shen,Jing Li,Xin Wang,Zhiyao Ren
Identifier
DOI:10.3389/fimmu.2025.1449355
Abstract
Introduction: Early diagnosis of Ewing sarcoma (ES) is critical for improving patient prognosis. However, the accurate diagnosis of ES remains challenging, underscoring the need for novel diagnostic biomarkers to enhance diagnostic precision and reliability. This study aimed to identify potential gene expression-based biomarkers for the diagnosis of ES.

Methods: We selected the GSE17679, GSE45544, and GSE68776 datasets from the Gene Expression Omnibus (GEO) database. After correcting for batch effects, we combined ES and normal tissue samples from the GSE17679 and GSE45544 datasets to create a combined cohort. Two-thirds of both the tumor and normal samples from the combined cohort were randomly selected for the training cohort, while the remaining one-third served as the internal validation cohort. Additionally, the GSE68776 dataset was used for external validation. To identify key diagnostic genes, we applied three machine learning algorithms: least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE), and random forest (RF).

Results: HOXC6 was identified as a key diagnostic biomarker for ES. It demonstrated strong diagnostic performance across all cohorts, with area under the curve (AUC) values of 0.956 (95% CI: 0.909–0.990) in the training cohort, 0.995 (95% CI: 0.977–1.000) in the internal validation cohort, and 0.966 (95% CI: 0.910–0.999) in the external validation cohort. Functional validation through HOXC6 knockdown in the RD-ES cell line revealed that its suppression significantly inhibited cell proliferation and migration. Furthermore, transcriptome sequencing suggested potential oncogenic mechanisms underlying HOXC6 function.

Discussion: These findings highlight HOXC6 as a promising diagnostic biomarker for ES, demonstrating robust performance across multiple datasets. Additionally, its functional role suggests potential as a therapeutic target.
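The cohort split and the AUC reporting described in the abstract can be illustrated with a small sketch. The abstract does not say how the random split was implemented or how the 95% confidence intervals were obtained, so the per-class shuffle and the percentile bootstrap below are assumptions, and the function names (`stratified_split`, `auc`, `bootstrap_auc_ci`) are hypothetical. The scores are synthetic stand-ins for a single gene's expression, not values from the GEO datasets.

```python
import random


def stratified_split(labels, train_fraction=2 / 3, seed=0):
    """Sample train_fraction of each class for training; the rest is validation."""
    rng = random.Random(seed)
    train, test = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        cut = round(len(idx) * train_fraction)
        train.extend(idx[:cut])
        test.extend(idx[cut:])
    return sorted(train), sorted(test)


def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(scores, labels, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (an assumed method, see lead-in)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # a resample needs both classes for AUC to be defined
            stats.append(auc([scores[i] for i in idx], ys))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Synthetic example: 60 "tumor" (1) and 30 "normal" (0) samples, one score each.
data_rng = random.Random(42)
labels = [1] * 60 + [0] * 30
scores = [data_rng.gauss(2.0, 1.0) if y == 1 else data_rng.gauss(0.0, 1.0) for y in labels]

train_idx, test_idx = stratified_split(labels)
test_scores = [scores[i] for i in test_idx]
test_labels = [labels[i] for i in test_idx]

point = auc(test_scores, test_labels)
lower, upper = bootstrap_auc_ci(test_scores, test_labels)
print(f"internal validation AUC = {point:.3f} (95% CI: {lower:.3f}-{upper:.3f})")
```

Splitting within each class keeps the tumor-to-normal ratio the same in the training and internal validation cohorts, which matches the abstract's statement that two-thirds of both sample types were drawn for training.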