Lightweight Vision Transformer for damaged wheat detection and classification using spectrograms

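Spectrogram; Artificial intelligence; Computer science; Computer vision; Image processing; Pattern recognition (psychology); Image (mathematics)
Authors
Hao Lin, Min Guo, Miao Ma
Source
Journal: Journal of Electronic Imaging [SPIE - International Society for Optical Engineering]
Volume/Issue: 33 (05)
Identifier
DOI:10.1117/1.jei.33.5.053063
Abstract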

Grain is a basic human necessity, and its quality and safety directly affect dietary health. Various problems arise during grain storage, primarily mold and pest infestation. With the development of artificial intelligence, a growing number of techniques are being applied to grain detection and classification, and transformer-based models are becoming popular for this task. Although transformer models perform well, they are often large and cumbersome, which limits their practical use. We propose a framework named KD-ASF, based on intermediate-layer knowledge distillation and one-shot neural architecture search, to optimize the hyperparameters of a vision transformer (ViT) for detecting and classifying molded wheat kernels (MDK), insect-damaged wheat kernels (IDK), and undamaged wheat kernels (UDK). In KD-ASF, a ViT model serves as the teacher network. We then design a search space containing the adjustable hyperparameters of the transformer building blocks. The super-network stacks the maximum number of transformer building blocks and is trained under the guidance of the teacher network. The trained super-network then undergoes evolutionary search, and the resulting networks are used to classify the different wheat kernels. In experiments using five-fold cross-validation, we obtained an F1 score of 97.13%, and the final model has only 5.94M parameters. The results show that the method not only outperforms most neural networks but is also significantly smaller than most network models. Its lightweight nature makes it easy to deploy and apply. These findings indicate that the KD-ASF framework is feasible and effective.
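
The evolutionary search step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the search-space values, the parameter budget, the `count_params` estimate, and the `toy_fitness` function are all hypothetical stand-ins. In a one-shot setting, the fitness would typically be the validation score of a sub-network that inherits its weights from the trained super-network.

```python
import random

# Hypothetical search space; the abstract does not list the actual choices.
SEARCH_SPACE = {
    "depth": [4, 6, 8],
    "embed_dim": [128, 192, 256],
    "num_heads": [2, 4],
    "mlp_ratio": [2, 3, 4],
}


def count_params(cfg):
    """Rough parameter count of the transformer blocks only (weights and biases)."""
    d = cfg["embed_dim"]
    hidden = d * cfg["mlp_ratio"]
    attn = 4 * d * d + 4 * d           # Q, K, V and output projections
    mlp = 2 * d * hidden + hidden + d  # two linear layers with biases
    norms = 4 * d                      # two LayerNorms (scale and shift)
    return cfg["depth"] * (attn + mlp + norms)


def toy_fitness(cfg):
    """Stand-in score (an assumption): a real search would evaluate the
    sub-network on validation data using weights from the super-network."""
    return cfg["depth"] * cfg["embed_dim"] * cfg["mlp_ratio"] ** 0.5


def mutate(cfg, rng, prob=0.3):
    """Resample each hyperparameter independently with probability `prob`."""
    child = dict(cfg)
    for name, choices in SEARCH_SPACE.items():
        if rng.random() < prob:
            child[name] = rng.choice(choices)
    return child


def crossover(a, b, rng):
    """Pick each hyperparameter from one of the two parents at random."""
    return {name: rng.choice([a[name], b[name]]) for name in SEARCH_SPACE}


def evolutionary_search(fitness, max_params, generations=10,
                        population=20, top_k=5, seed=0):
    rng = random.Random(seed)

    def feasible(cfg):
        return count_params(cfg) <= max_params

    # Guard against a budget that no configuration can satisfy.
    smallest = {name: min(choices) for name, choices in SEARCH_SPACE.items()}
    if not feasible(smallest):
        raise ValueError("max_params is below the smallest configuration")

    # Initial population: random configurations that respect the budget.
    pop = []
    while len(pop) < population:
        cfg = {name: rng.choice(c) for name, c in SEARCH_SPACE.items()}
        if feasible(cfg):
            pop.append(cfg)

    for _ in range(generations):
        parents = sorted(pop, key=fitness, reverse=True)[:top_k]
        children = list(parents)  # keep the best candidates (elitism)
        while len(children) < population:
            if rng.random() < 0.5:
                child = mutate(rng.choice(parents), rng)
            else:
                child = crossover(rng.choice(parents), rng.choice(parents), rng)
            if feasible(child):
                children.append(child)
        pop = children

    return max(pop, key=fitness)
```

Calling `evolutionary_search(toy_fitness, max_params=3_000_000)` returns one configuration dictionary that fits the assumed budget. The budget constraint is what steers the search toward lightweight architectures; the 3M figure is illustrative and unrelated to the 5.94M reported in the abstract.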
