融合残差网络与注意力机制的草莓检测

doi:10.13733/j.jcam.issn.2095-5553.2024.01.037

摘要/Abstract

摘要： 针对草莓果实因受到自然光光照、枝叶遮挡、果实间存在遮挡等因素，较难实现成熟草莓果实识别的现状，提出融合深度残差网络与注意力机制的成熟草莓目标检测算法。引用信息表达能力更强的深度残差网络Resnet50对SSD目标检测算法模型基础骨干网络进行替换，对经过残差网络结构和新增卷积特征提取层得到信息特征提取图进行通道和空间方向的注意力机制方法处理，建立能准确实现成熟草莓目标检测的RC-SSD目标检测模型。试验结果表明，本文的RC-SSD算法模型对比Faster R-CNN、YOLOv3、SSD-VGG模型拥有较少的参数量，平均精度均值mAP分别提升46.05%、10.16%、5.77%，其中成熟草莓的识别精度达到99.04%。对比轻量化网络结构模型SSD-Mobilenetv2，RC-SSD算法模型在FPS相对于轻量化网络模型降低25帧的情况下，精度提升20.20%，FPS在GPU运行设备上达到86帧。

关键词: 残差网络, 注意力机制, 损失函数, 目标检测, 草莓图像识别

Abstract: In view of the current situation that it is difficult to recognize ripe strawberry fruit due to the factors such as natural light illumination, branch and leaf shading, and interfruit shading, this paper proposes a ripe strawberry target detection algorithm that combines deep residual network and attention mechanism. In this paper, the deep residual network Resnet50, which had stronger information expression capability, was invoked to replace the backbone network underlying the SSD target detection algorithm model, and the attention mechanism method of channel and spatial direction was processed to obtain the information feature extraction map after the residual network structure and the new convolutional feature extraction layer, and the RC-SSD target detection model that could accurately implement the mature strawberry target detection was established. The experimental results showed that the RC-SSD algorithm model in this paper had less number of parameters than the models Faster R-CNN, YOLOv3 and SSD-VGG models, and the average accuracy mean mAP was improved by 46.05%, 10.16% and 5.77%, respectively, in which the recognition accuracy of mature strawberry reached 99.04%, and compared with the lightweight network structure model SSD-Mobilenetv2, the RC-SSD algorithm model improved the accuracy by 20.20% with a 25 fps reduction in FPS relative to the lightweight network model, and the FPS reached 86 fps on the GPU running device.

Key words: residual network, attention mechanism, loss function, object detection, strawberry image recognition

王瑞彬, 杨世忠, 高升. 融合残差网络与注意力机制的草莓检测[J]. 中国农机化学报, 2024, 45(1): 266-273.

Wang Ruibin, Yang Shizhong, Gao Sheng. Strawberry detection combining residual network with attention mechanism[J]. Journal of Chinese Agricultural Mechanization, 2024, 45(1): 266-273.

参考文献

［1］ Jiao L, Zhang F, Liu F, et al. A survey of deep learningbased object detection ［J］. IEEE Access, 2019, 7: 128837-128868.
［2］徐艺格, 王丽娟. 草莓品质育种研究进展［J］. 北方园艺, 2020(18): 152-157.
Xu Yige, Wang Lijuan. Research progress on strawberry quality breeding ［J］. Northern Horticulture, 2020(18): 152-157.
［3］李长勇, 房爱青, 谭红, 等. 高架草莓采摘机器人系统研究［J］. 机械设计与制造, 2017(6): 245-247, 251.
Li Changyong, Fang Aiqing, Tan Hong, et al. Elevated strawberry picking robot system research ［J］. Machinery Design & Manufacture, 2017(6): 245-247, 251.
［4］毛彦栋, 宫鹤. 基于SVM和DS证据理论融合多特征的玉米病害识别研究［J］. 中国农机化学报, 2020, 41(4): 152-157.
Mao Yandong, Gong He. Corn disease identification study based on SVM and DS evidence theory fusion multifeatures ［J］. Journal of Chinese Agricultural Mechanization, 2020, 41(4): 152-157.
［5］杨英茹, 吴华瑞, 张燕, 等. 基于复杂环境的番茄叶部图像病虫害识别［J］. 中国农机化学报, 2021, 42(9): 177-186.
Yang Yingru, Wu Huarui, Zhang Yan, et al. Tomato disease recognition using leaf image based on complex environment ［J］. Journal of Chinese Agricultural Mechanization, 2021, 42(9): 177-186.
［6］ Le Cun Y, Bengio Y, Hinton G. Deep learning ［J］. Nature, 2015, 521(7553): 436-444.
［7］ Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation［C］. Conference on Computer Vision and Pattern Recognition, Columbus; IEEE, 2014: 580-587.
［8］ Girshick R. Fast RCNN［C］. Conference on Computer Vision and Pattern Recognition, Boston; IEEE, 2015: 1440-1448.
［9］ Ren S, He K, Girshick R, et al. Faster RCNN: Towards realtime object detection with region proposal networks ［J］. Advances in Neural Information Processing Systems, 2015, 28.
［10］ Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, realtime object detection ［C］. Conference on Computer Vision and Pattern Recognition, Las Vegas; IEEE, 2016: 779-788.
［11］ Liu Wei， Anguelov D， Erhan D， et al. SSD: single shot multiBox detector［C］. European Conference on Computer Vision， Amsterdam; Springer, 2016: 21-37.
［12］宋中山, 汪进, 郑禄, 等. 基于二值化的Faster R-CNN柑橘病虫害识别研究［J］. 中国农机化学报, 2022, 43(6): 150-158.
Song Zhongshan, Wang Jin， Zheng Lu， et al. Research on citrus pest identification based on Binary Faster R-CNN ［J］. Journal of Chinese Agricultural Mechanization, 2022, 43(6): 150-158.
［13］李就好, 林乐坚, 田凯, 等. 改进Faster R-CNN的田间苦瓜叶部病害检测［J］. 农业工程学报, 2020, 36(12): 179-185.
Li Jiuhao， Lin Lejian, Tian Kai, et al. Detection of leaf diseases of balsam pear in the field based on improved Faster R-CNN ［J］. Transactions of the Chinese Society of Agricultural Engineering, 2020, 36(12): 179-185.
［14］赵德安, 吴任迪, 刘晓洋, 等. 基于YOLO深度卷积神经网络的复杂背景下机器人采摘苹果定位［J］. 农业工程学报, 2019, 35(3): 164-173.
Zhao Dean， Wu Rendi， Liu Xiaoyang, et al. Apple positioning based on YOLO deep convolutional neural network for picking robot in complex background ［J］. Transactions of the Chinese Society of Agricultural Engineering, 2019, 35(3): 164-173.
［15］李善军, 胡定一, 高淑敏, 等. 基于改进SSD的柑橘实时分类检测［J］. 农业工程学报, 2019, 35(24): 307-313.
Li Shanjun, Hu Dingyi, Gao Shumin， et al. Realtime classification and detection of citrus based on improved single short multibox detecter ［J］. Transactions of the Chinese Society of Agricultural Engineering, 2019, 35(24): 307-313.
［16］ Lu X, Ji J, Xing Z, et al. Attention and feature fusion SSD for remote sensing object detection ［J］. IEEE Transactions on Instrumentation and Measurement, 2021, 70: 1-9.
［17］ He K, Zhang X, Ren S, et al. Deep residual learning for image recognition［C］. Conference on Computer Vision and Pattern Recognition, Las Vegas; IEEE, 2016: 770-778.
［18］付中正, 何潇, 方逵, 等. 基于改进SSD网络的西兰花叶片检测研究［J］. 中国农机化学报, 2020, 41(4): 92-97.
Fu Zhongzheng, He Xiao, Fang Kui, et al. Study on the detection of broccoli leaves based on the improved SSD network ［J］. Journal of Chinese Agricultural Mechanization, 2020, 41(4): 92-97.
［19］郭玥秀, 杨伟, 刘琦, 等. 残差网络研究综述［J］. 计算机应用研究, 2020, 37(5): 1292-1297.
Guo Yuexiu, Yang Wei, Liu Qi, et al. Survey of residual network ［J］. Application Research of Computers, 2020, 37(5): 1292-1297.
［20］任欢, 王旭光. 注意力机制综述［J］. 计算机应用, 2021, 41(S1): 1-6.
Ren Huan, Wang Xuguang. Review of attention mechanism ［J］. Journal of Computer Applications, 2021, 41(S1): 1-6.
［21］ Woo S, Park J, Lee J Y, et al. Cbam: Convolutional block attention module［C］. Proceedings of the European conference on computer vision (ECCV), 2018: 3-19.
［22］洪哲昊, 陈东方, 王晓峰. 基于多任务分支SSD的目标检测算法［J］. 计算机工程与设计, 2022, 43(3): 677-684.
Hong Zhehao, Chen Dongfang, Wang Xiaofeng. Object detection algorithm based on multitask branch SSD ［J］. Computer Engineering and Design, 2022, 43(3): 677-684.

[1]	达措, , 赵启军, , , 高定国, , 索南尖措, 尼玛扎西. 基于注意力网络的长时牦牛个体识别研究[J]. 中国农机化学报, 2024, 45(1): 202-208.
[2]	黎远江, 李云伍, , 赵颖, , 台少瑜, 王克超. 基于改进Deeplabv3+模型的果树语义分割研究[J]. 中国农机化学报, 2024, 45(1): 209-216.
[3]	叶荣, 马自飞, 高泉, 李彤, 邵郭奇, 王白娟. 基于改进YOLOv5sECAASFF算法的茶叶病害目标检测[J]. 中国农机化学报, 2024, 45(1): 244-251.
[4]	林森, , 许童羽, 葛禹豪, 马璟, 孙添龙, 赵春江. 基于改进YOLOv5l的设施番茄3D信息检测方法[J]. 中国农机化学报, 2024, 45(1): 274-284.
[5]	侯炳法, 李小敏, 牟向伟, 姚华平. 圣女果温室巡检机器人系统设计与试验[J]. 中国农机化学报, 2024, 45(1): 285-294.
[6]	何朝霞, 朱嵘涛, 徐俊英. 基于生成对抗网络的田间杂草图像超分辨率重建[J]. 中国农机化学报, 2023, 44(9): 154-160.
[7]	汪健, 梁兴建, 雷刚. 基于深度残差网络与迁移学习的水稻病虫害图像识别[J]. 中国农机化学报, 2023, 44(9): 198-204.
[8]	高芳征, 汤文俊, 陈光明, 黄家才. 基于改进YOLOv3的复杂环境下西红柿成熟果实快速识别[J]. 中国农机化学报, 2023, 44(8): 174-183.
[9]	余胜, 谢莉. 基于迁移学习和卷积视觉转换器的农作物病害识别研究[J]. 中国农机化学报, 2023, 44(8): 191-197.
[10]	马丽, 周巧黎, 赵丽亚, 胡远辉. 基于深度学习的番茄叶片病害分类识别研究[J]. 中国农机化学报, 2023, 44(7): 187-193.
[11]	李宗南, 蒋怡, 王思, 李源洪, 黄平, 魏鹏. 基于YOLOv5模型的飞蓬属入侵植物目标检测[J]. 中国农机化学报, 2023, 44(7): 200-206.
[12]	王春桃, , , , 梁炜健, 郭庆文, 钟浩, 甘雨, 肖德琴, , . 农业害虫智能视觉检测研究综述[J]. 中国农机化学报, 2023, 44(7): 207-213.
[13]	李明, 丁智欢, 赵靖暄, 陈思铭, 李文勇, 杨信廷. 基于改进YOLOv5s的日光温室黄瓜霜霉病孢子囊检测计数方法[J]. 中国农机化学报, 2023, 44(5): 63-70.
[14]	付豪, , 赵学观, 翟长远, , 郑康, 郑申玉, 王秀. 基于深度学习的杂草识别方法研究进展[J]. 中国农机化学报, 2023, 44(5): 198-207.
[15]	段宇飞, 董庚, 孙记委, 王焱清, . 基于SE-ResNet网络的油茶果果壳与茶籽分选模型[J]. 中国农机化学报, 2023, 44(4): 89-95.