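Big data
Computer science
Field (mathematics)
Scarcity
Data science
Data quality
Medicine
Data mining
Public health
Database
Pathology
Engineering
Mathematics
Economics
Microeconomics
Metric (unit)
Pure mathematics
Operations management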
Authors
Wentao Wu,Yuan-Jie Li,Aozi Feng,Li Li,Tao Huang,Chunxue Wang,Jun Lyu
Identifier
DOI:10.1186/s40779-021-00338-z
Abstract
Many high-quality studies have emerged from public databases, such as Surveillance, Epidemiology, and End Results (SEER), National Health and Nutrition Examination Survey (NHANES), The Cancer Genome Atlas (TCGA), and Medical Information Mart for Intensive Care (MIMIC); however, these data are often characterized by a high degree of dimensional heterogeneity, timeliness, scarcity, irregularity, and other characteristics, so their value is not fully utilized. Data-mining technology has been a frontier field in medical research, as it demonstrates excellent performance in evaluating patient risks and assisting clinical decision-making when building disease-prediction models. Data mining therefore has unique advantages in clinical big-data research, especially in large-scale public medical databases. This article introduced the main public medical databases and described the steps, tasks, and models of data mining in simple language. Additionally, we described data-mining methods along with their practical applications. The goal of this work was to help clinical researchers gain a clear and intuitive understanding of how data-mining technology is applied to clinical big data, in order to promote research results that benefit doctors and patients.