分析
数据库
组学
数据科学
计算机科学
地理
生物
生物信息学
作者
Parisa Sarzaeim,Hasnat Aslam,Francisco Muñoz‐Arriola
出处
期刊:CERN European Organization for Nuclear Research - Zenodo
日期:2023-01-02
被引量:7
标识
DOI:10.5281/zenodo.7490246
摘要
CLIM4OMICS Analytics and Database is Improved database of G2F data repository that contains OMICs (genetic and phenotypic) and environmental data for maize yield predictability across 84 experimental fields in the U.S. and province of ON in Canada between 2014-2017. The goal of this pipeline is to aggregate, improve, and synthesize multi-dimensional G2F data including Geno-type, Phenotype and Environmental data for GxE modeling. This dataset contains 8,171 phenotype measurements, 376 genotypes of maize lines, environmental data of 84 locations and Python Scripts for Quality control (QC), Consistency control (CC) steps and ML models for GxE interactions. The Environmental data is extracted from NWS, DayMet and NSRDB databases and processed for QC and CC. The environmental dataset contains the minimum temperature (Tmin), average temperature (Tmean), maximum temperature (Tmax), minimum dew point (DPmin), average dew point (DPmean), maximum dew point (DPmax), minimum relative humidity (RHmin), average relative humidity (RHmean), maximum relative humidity (RHmax), minimum solar radiation (SRmin), average solar radiation (SRmean), maximum solar radiation (SRmax), accumulative rainfall (Racc), average wind speed (WSmean), and average wind direction (WDmean). This package also contains the raw G2F data and preprocessing pipeline.
科研通智能强力驱动
Strongly Powered by AbleSci AI