计算机科学
人工智能
注释
机器学习
背景(考古学)
公制(单位)
目标检测
词汇
植物病害
模式识别(心理学)
性能指标
生物技术
生物
工程类
古生物学
运营管理
经济
语言学
哲学
管理
作者
Jinyang Li,Fengting Zhao,Hongmin Zhao,Guoxiong Zhou,Jiaxin Xu,Mingzhou Gao,Xin Li,Weisi Dai,Hongliang Zhou,Yahui Hu,Mingfang He
标识
DOI:10.34133/plantphenomics.0220
摘要
Precise disease detection is crucial in modern precision agriculture, especially in ensuring the health of tomato crops and enhancing agricultural productivity and product quality. Although most existing disease detection methods have helped growers identify tomato leaf diseases to some extent, these methods typically target fixed categories. When faced with new diseases, extensive and costly manual annotation is required to retrain the dataset. To overcome these limitations, this study proposes a multimodal model PDC-VLD based on the open-vocabulary object detection (OVD) technology within the VLDet framework, which can accurately identify new tomato leaf diseases without manual annotation by using only image-text pairs. First, we developed a progressive visual transformer-convolutional pyramid module (PVT-C) that effectively extracts tomato leaf disease features and optimizes anchor box positioning using the self-supervised learning algorithm DINO, suppressing interference from irrelevant backgrounds. Then, a context feature guided module (CFG) was adopted to address the low adaptability and recognition accuracy of the model in data-scarce environments. To validate the model's effectiveness, we constructed a tomato leaf disease image dataset containing 4 base classes and 2 new categories. Experimental results show that the PDC-VLD model achieved 61.2% on the main evaluation metric
科研通智能强力驱动
Strongly Powered by AbleSci AI