基因组选择
选择(遗传算法)
变压器
农业工程
计算机科学
环境科学
农学
生物
人工智能
工程类
遗传学
单核苷酸多态性
电气工程
基因型
电压
基因
作者
R.R. Chen,Wenwei Han,Haohao Zhang,Hong Su,Z.R. Wang,Xiaolei Liu,Han Jiang,Wanli Ouyang,Nanqing Dong
出处
期刊:Cornell University - arXiv
日期:2024-05-15
标识
DOI:10.48550/arxiv.2405.09585
摘要
Genomic selection (GS), as a critical crop breeding strategy, plays a key role in enhancing food production and addressing the global hunger crisis. The predominant approaches in GS currently revolve around employing statistical methods for prediction. However, statistical methods often come with two main limitations: strong statistical priors and linear assumptions. A recent trend is to capture the non-linear relationships between markers by deep learning. However, as crop datasets are commonly long sequences with limited samples, the robustness of deep learning models, especially Transformers, remains a challenge. In this work, to unleash the unexplored potential of attention mechanism for the task of interest, we propose a simple yet effective Transformer-based framework that enables end-to-end training of the whole sequence. Via experiments on rice3k and wheat3k datasets, we show that, with simple tricks such as k-mer tokenization and random masking, Transformer can achieve overall superior performance against seminal methods on GS tasks of interest.
科研通智能强力驱动
Strongly Powered by AbleSci AI