B-123 Utilization of Five Data Mining Algorithms Combined with Simplified Preprocessing to Establish Reference Intervals of Thyroid Related Hormones for Nonelderly Adults

算法 数据集 计算机科学 人口 数学 统计 数据挖掘 医学 环境卫生
作者
Jian Zhong,Chaochao Ma,Le Hou,Yufeng Yin,Fang Zhao,Yingying Hu,An Song,Dawei Wang,Li L,Xinqi Cheng,Ling Qiu
出处
期刊:Clinical Chemistry [American Association for Clinical Chemistry]
卷期号:69 (Supplement_1)
标识
DOI:10.1093/clinchem/hvad097.457
摘要

Abstract Background Despite the extensive research on data mining algorithms, there is still a lack of a standard protocol to evaluate the performance of the existing algorithms. Therefore, the study aims to provide a novel procedure that combines data mining algorithms and simplified preprocessing to establish reference intervals (RIs), with the performance of five algorithms assessed objectively as well. Methods The Test data set and the Reference data set are the two data sets derived from the population undergoing a physical examination. After the thyroid-related hormone including thyroid stimulating hormone (TSH), free triiodo-thyronine (FT3), total triiodo-thyronine (TT3), free thyroxine(FT4), and total thyroxine (TT4) were measured by an ADVIA Centaur XP chemiluminescence immunoassay analyzer, five data algorithms were used to calculated RIs. Hoffmann, Bhattacharya, Expectation Maximum (EM), kosmic, and refineR algorithms combined with two-step data preprocessing respectively were implemented in the Test data set to establish RIs for thyroid-related hormones. The first step is to conduct a random sampling strategy to balance the ratio of sex and age, and the second step is to identify the outliers of variables in each subgroup by the Tukey method. Algorithm-calculated RIs were compared with the standard RIs calculated by transformed parametric method from the Reference data set in which reference individuals were selected following strict inclusion and exclusion criteria. RIs partition were comprehensively determined by the multiple linear regression and variance component analysis. Objective assessment of the methods is implemented by the bias ratio (BR) matrix, of which the BR threshold was set to 0.375. Results The levels of the all five thyroid-related hormones are significantly different in sex, with the male having lower TSH and higher FT3, FT4, TT3, and TT4 compared to the female. Further analysis indicates the establishment of sex-specific RIs for FT3 and FT4. Standard RIs derived from the Reference data set by transformed parametric method are 0.801–4.221 μIU/L for TSH, 2.58–3.82 pg/mL for FT3, 0.98–1.53 ng/dL for FT4, 0.80–1.38 ng/mL for TT3, 5.46–10.05 g/dL for TT4, respectively. There is a high consistency between TSH RIs established by the EM algorithm and the standard TSH RIs (BR = 0.063), although EM algorithms seems to perform poor on other hormones with the BR higher than 0.375. RIs calculated by Hoffmann, Bhattacharya, and refineR methods for free and total triiodo-thyronine, free and total thyroxine respectively are close and matched the standard RIs. Conclusion An effective approach for objectively evaluating the performance of the algorithm based on the BR matrix is established. EM algorithm combined with simplified preprocessing can handle data with significant skewness, but its performance is limited in other scenarios. The other four algorithms perform well for data with Gaussian or near-Gaussian distribution. Using the appropriate algorithm based on the data distribution characteristics is recommended.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
lc完成签到,获得积分10
1秒前
4秒前
李健的小迷弟应助anna采纳,获得10
5秒前
量子星尘发布了新的文献求助10
5秒前
7秒前
7秒前
嘀嘀咕咕发布了新的文献求助10
7秒前
大观天下完成签到,获得积分10
7秒前
科研通AI2S应助科研通管家采纳,获得10
8秒前
英姑应助科研通管家采纳,获得10
8秒前
科研通AI5应助科研通管家采纳,获得10
8秒前
8秒前
科研通AI2S应助科研通管家采纳,获得10
8秒前
共享精神应助科研通管家采纳,获得10
8秒前
科目三应助科研通管家采纳,获得10
8秒前
bkagyin应助科研通管家采纳,获得10
8秒前
SciGPT应助科研通管家采纳,获得10
8秒前
CodeCraft应助科研通管家采纳,获得10
8秒前
orixero应助科研通管家采纳,获得10
9秒前
脑洞疼应助科研通管家采纳,获得10
9秒前
9秒前
兴奋千兰发布了新的文献求助10
10秒前
有机发布了新的文献求助10
11秒前
yukang发布了新的文献求助10
11秒前
13秒前
大观天下发布了新的文献求助30
14秒前
14秒前
16秒前
17秒前
小盘子完成签到,获得积分10
17秒前
18秒前
今后应助务实的大神采纳,获得10
18秒前
anna发布了新的文献求助10
21秒前
21秒前
Elaine完成签到,获得积分10
21秒前
23秒前
nolan完成签到 ,获得积分10
23秒前
25秒前
彭于晏应助嘀嘀咕咕采纳,获得10
25秒前
搜集达人应助感动的山槐采纳,获得10
26秒前
高分求助中
A new approach to the extrapolation of accelerated life test data 1000
ACSM’s Guidelines for Exercise Testing and Prescription, 12th edition 500
‘Unruly’ Children: Historical Fieldnotes and Learning Morality in a Taiwan Village (New Departures in Anthropology) 400
Indomethacinのヒトにおける経皮吸収 400
Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 370
基于可调谐半导体激光吸收光谱技术泄漏气体检测系统的研究 350
Robot-supported joining of reinforcement textiles with one-sided sewing heads 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 3989069
求助须知:如何正确求助?哪些是违规求助? 3531351
关于积分的说明 11253589
捐赠科研通 3269939
什么是DOI,文献DOI怎么找? 1804851
邀请新用户注册赠送积分活动 882074
科研通“疑难数据库(出版商)”最低求助积分说明 809073