微阵列分析技术
生物标志物
生物标志物发现
微阵列
癌症
基因
癌症生物标志物
选择(遗传算法)
计算生物学
基因选择
特征选择
DNA微阵列
水准点(测量)
微阵列数据库
计算机科学
生物
生物信息学
基因表达
蛋白质组学
遗传学
机器学习
大地测量学
地理
作者
Han-Jing Jiang,Jinguang Lin,Hongjie Zhu,Yabing Huang
标识
DOI:10.1109/bibm58861.2023.10385464
摘要
Cancer-associated biomarker genes play an indispensable role in the intricate tapestry of cancer development and manifestation. The expression of biomarkers in different types of tumor cells has beneficial implications for shedding light on the development of various cancers, guiding clinical diagnosis, and treatment. Microarray technology enables the expression levels of thousands of genes in samples to be sequenced simultaneously. However, sparse and high-dimensional microarray data present a formidable challenge in identifying biomarker genes. This study presents EREF-NSGA2, a novel method for cancer biomarker selection from microarray data, employing a hybrid gene selection approach. Firstly, the combination of the wrapper and embedded gene selection methods is proposed to filter the microarray data, which efficiently decreases the search space of the algorithm. After that, the improved NSGA-II algorithm is used to search the genes subset obtained from the previous step to reach the optimal subset of cancer biomarker genes. The proposed EREF-NSGA2 is compared with other reported methods on six cancer benchmark gene expression datasets. A detailed biological analysis is performed to analyze the relationship between the selected genes and the cancer data sets they belong to. To summarize, EREF-NSGA2 proves its effectiveness in selecting a feature subset comprising the fewest genes while maintaining the highest classification accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI