亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

MLP-based multimodal tomato detection in complex scenarios: Insights from task-specific analysis of feature fusion architectures

RGB颜色模型 人工智能 特征(语言学) 计算机科学 编码器 卷积神经网络 深度学习 模式识别(心理学) 特征提取 计算机视觉 操作系统 哲学 语言学
作者
Wenjun Chen,Yuan Rao,Fengyi Wang,Yanwen Zhang,Tan Wang,Xiu Jin,Wenhui Hou,Zhaohui Jiang,Zhang Wu
出处
期刊:Computers and Electronics in Agriculture [Elsevier]
卷期号:221: 108951-108951 被引量:3
标识
DOI:10.1016/j.compag.2024.108951
摘要

Accurate and efficient tomato detection is essential for the practical deployment of robotic picking in practical agricultural applications, but it still remains significantly challenging to detect tomatoes in complex scenarios with fluctuating light, overlapping fruits, and occlusion from branches and leaves when solely using RGB images. The recent development of RGB-D sensors has brought one promising opportunity to adopt multimodal fusion for implementing high-quality fruit detection. However, the feasibility of the existing multimodal fusion and feature extraction architectures for lightweight tomato detection tasks, especially in complex agricultural scenarios, raises questions that need to be explored. As a remedy, we proposed a multimodal fusion encoder that leveraged depth and near-infrared modalities to assist RGB images in making full use of multimodal data. Moreover, the encoder contained a plug-and-play structure capable of being implemented as MLP-based (Multi-Layer Perceptron), ViT-based (Vision Transformer), or CNN-based (Convolutional Neural Networks) architectures. Furthermore, we developed a lightweight experimental detection framework based on YOLOv7-tiny by means of integrating the multimodal fusion encoder, and YOLO-DNA (Depth and Near-infrared Assisted) was put forward based on the MLP-based architecture after conducting comprehensive analysis of the aforementioned three architectures. In addition, a tomato multimodal dataset containing visible, depth, and near-infrared images was established. Experimental results demonstrated that YOLO-DNA achieved mAP0.5 of 98.13% and mAP0.5:0.95 of 74.0%, an average increase of 5.01% in mAP0.5 and 14.55% in mAP0.5:0.95 over mainstream lightweight detection models, with a detection speed of 37.12 FPS, meeting the demand of real-time tomato detection. This finding has the potential to advance research on fruit detection in the field of intelligent agricultural harvesting.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
烟花应助佳佳采纳,获得10
5秒前
13秒前
14秒前
16秒前
佳佳发布了新的文献求助10
19秒前
fff发布了新的文献求助10
20秒前
28秒前
yu发布了新的文献求助10
35秒前
通莲完成签到,获得积分10
37秒前
汉堡包应助jyy采纳,获得10
38秒前
46秒前
细心的如天完成签到 ,获得积分10
51秒前
1分钟前
辛勤的媚颜完成签到 ,获得积分10
1分钟前
是啊余啊发布了新的文献求助10
1分钟前
是啊余啊完成签到,获得积分20
1分钟前
pokemeow完成签到,获得积分10
1分钟前
搜集达人应助科研通管家采纳,获得10
1分钟前
Jasper应助科研通管家采纳,获得10
1分钟前
pokemeow发布了新的文献求助10
2分钟前
YY完成签到,获得积分0
2分钟前
别找了睡觉吧完成签到 ,获得积分10
2分钟前
汉堡包应助hhhhhhhm采纳,获得10
2分钟前
薄荷小新完成签到 ,获得积分10
2分钟前
情怀应助甜心心采纳,获得10
2分钟前
闪闪的从彤完成签到 ,获得积分10
2分钟前
雨天爱吃冰淇淋完成签到 ,获得积分10
3分钟前
在水一方应助壮壮采纳,获得10
3分钟前
大个应助无糖零脂采纳,获得10
3分钟前
3分钟前
壮壮发布了新的文献求助10
3分钟前
未青易完成签到 ,获得积分10
3分钟前
3分钟前
明理丹烟应助科研通管家采纳,获得10
3分钟前
科研通AI2S应助科研通管家采纳,获得10
3分钟前
NexusExplorer应助科研通管家采纳,获得10
3分钟前
科研通AI2S应助科研通管家采纳,获得10
3分钟前
明理丹烟应助科研通管家采纳,获得10
3分钟前
英姑应助科研通管家采纳,获得10
3分钟前
汉堡包应助科研通管家采纳,获得10
3分钟前
高分求助中
中国国际图书贸易总公司40周年纪念文集 大事记1949-1987 2000
TM 5-855-1(Fundamentals of protective design for conventional weapons) 1000
草地生态学 880
Threaded Harmony: A Sustainable Approach to Fashion 799
Basic Modern Theory of Linear Complex Analytic 𝑞-Difference Equations 510
中国有机(类)肥料 500
Queer Politics in Times of New Authoritarianisms: Popular Culture in South Asia 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 催化作用 物理化学 免疫学 量子力学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 3059518
求助须知:如何正确求助?哪些是违规求助? 2715495
关于积分的说明 7445189
捐赠科研通 2361002
什么是DOI,文献DOI怎么找? 1251087
科研通“疑难数据库(出版商)”最低求助积分说明 607698
版权声明 596448