Algorithm of wheat disease image identification based on Vision Transformer

doi:10.13733/j.jcam.issn.2095-5553.2024.02.038

Abstract

Abstract: Wheat powdery mildew, head blight, and rust are the three major diseases that reduce wheat yield. To improve the recognition accuracy of wheat disease images, a wheat disease image recognition algorithm based on Vision Transformer was constructed. First, images of wheat diseases, covering powdery mildew, head blight, and rust, were collected by field photography, and the original images were preprocessed to build a wheat disease image recognition dataset. Then, the recognition algorithm was built on an improved Vision Transformer, and the influence of different transfer learning strategies and of data augmentation on recognition performance was analyzed. The experiments showed that full-parameter transfer learning and data augmentation clearly improved both the convergence speed and the recognition accuracy of the Vision Transformer model. Finally, Vision Transformer, AlexNet, and VGG16 were compared on the same dataset under the same time conditions. The results showed that the Vision Transformer model reached an average recognition accuracy of 96.81% on images of the three wheat diseases, which was 6.68% and 4.94% higher than that of the AlexNet and VGG16 models, respectively.

Key words: wheat disease, Vision Transformer, transfer learning, image recognition, data augmentation
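The abstract reports the Vision Transformer accuracy and its margin over the two baselines, but not the baseline accuracies themselves, and it does not say whether the margins are percentage points or relative gains. The short sketch below works out the baseline accuracy implied by each reading. The function and variable names are illustrative, and both readings are assumptions rather than figures stated in the paper.

```python
from decimal import Decimal

# Figures quoted in the abstract, in percent.
VIT_ACCURACY = Decimal("96.81")
REPORTED_GAIN = {"AlexNet": Decimal("6.68"), "VGG16": Decimal("4.94")}


def implied_baseline_absolute(vit: Decimal, gain: Decimal) -> Decimal:
    """Baseline accuracy if the gain is read as percentage points (assumption)."""
    return vit - gain


def implied_baseline_relative(vit: Decimal, gain: Decimal) -> Decimal:
    """Baseline accuracy if the gain is read as relative to the baseline (assumption)."""
    return (vit / (1 + gain / 100)).quantize(Decimal("0.01"))


# Implied baseline accuracy for each model under each reading.
absolute = {
    name: implied_baseline_absolute(VIT_ACCURACY, gain)
    for name, gain in REPORTED_GAIN.items()
}
relative = {
    name: implied_baseline_relative(VIT_ACCURACY, gain)
    for name, gain in REPORTED_GAIN.items()
}

for name in REPORTED_GAIN:
    print(
        f"{name}: {absolute[name]}% if percentage points, "
        f"{relative[name]}% if relative"
    )
```

Under the percentage-point reading the implied baselines are 90.13% for AlexNet and 91.87% for VGG16. The full text of the paper, not this sketch, is the authority on which reading applies.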

CLC number: TP18; S512.1

Bai Yupeng, Feng Yikun, Li Guohou, Zhao Mingfu, Zhou Haoyu, Hou Zhisong. Algorithm of wheat disease image identification based on Vision Transformer[J]. Journal of Chinese Agricultural Mechanization, 2024, 45(2): 267-274.

