操作化
计算机科学
数据质量
完备性(序理论)
数据挖掘
数据科学
质量(理念)
数据验证
领域(数学分析)
数据库
工程类
运营管理
数学
认识论
数学分析
哲学
公制(单位)
作者
Kathleen Lee,Nicole G. Weiskopf,Jyotishman Pathak
出处
期刊:PubMed
日期:2017-01-01
卷期号:2017: 1080-1089
被引量:43
摘要
The wide availability of electronic health record (EHR) data for multi-institutional clinical research relies on accurately defined patient cohorts to ensure validity, especially when used in conjunction with open-access research data. There is a growing need to utilize a consensus-driven approach to assess data quality. To achieve this goal, we modified an existing data quality assessment (DQA) framework by re-operationalizing dimensions of quality for a clinical domain of interest - heart failure. We then created an inventory of common phenotype data elements (CPDEs) derived from open-access datasets and evaluated it against the modified DQA framework. We measured our inventory of CPDEs for Conformance, Completeness, and Plausibility. DQA scores were high on Completeness, Value Conformance, and Atemporal and Temporal Plausibility. Our work exhibits a generalizable approach to DQA for clinical research. Future work will 1) map datasets to standard terminologies and 2) create a quantitative DQA tool for research datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI