全基因组关联研究
计算机科学
遗传关联
可视化
人口
并行计算
数据挖掘
单核苷酸多态性
生物
遗传学
基因
基因型
社会学
人口学
作者
Lilin Yin,Haohao Zhang,Zhenshuang Tang,Jingya Xu,Yin Ding,Zhiwu Zhang,Xiaohui Yuan,Mengjin Zhu,Shuhong Zhao,Xinyun Li,Xiaolei Liu
标识
DOI:10.1016/j.gpb.2020.10.007
摘要
Along with the development of high-throughput sequencing technologies, both sample size and SNP number are increasing rapidly in genome-wide association studies (GWAS), and the associated computation is more challenging than ever. Here, we present a memory-efficient, visualization-enhanced, and parallel-accelerated R package called "rMVP" to address the need for improved GWAS computation. rMVP can 1) effectively process large GWAS data, 2) rapidly evaluate population structure, 3) efficiently estimate variance components by Efficient Mixed-Model Association eXpedited (EMMAX), Factored Spectrally Transformed Linear Mixed Models (FaST-LMM), and Haseman-Elston (HE) regression algorithms, 4) implement parallel-accelerated association tests of markers using general linear model (GLM), mixed linear model (MLM), and fixed and random model circulating probability unification (FarmCPU) methods, 5) compute fast with a globally efficient design in the GWAS processes, and 6) generate various visualizations of GWAS-related information. Accelerated by block matrix multiplication strategy and multiple threads, the association test methods embedded in rMVP are significantly faster than PLINK, GEMMA, and FarmCPU_pkg. rMVP is freely available at https://github.com/xiaolei-lab/rMVP.
科研通智能强力驱动
Strongly Powered by AbleSci AI