基于迁移学习与轻量化YOLOv5s的草莓目标检测方法

doi:10.13733/j.jcam.issn.2095-5553.2025.03.037

摘要/Abstract

摘要：

为实现草莓采摘时精准检测，同时考虑到嵌入式设备内存小、计算能力低下，而当下目标检测模型参数量和计算量巨大的问题，提出一种基于YOLOv5s的轻量化网络模型。首先，对YOLOv5s进行轻量化处理，利用深度卷积（DWConv）替换普通卷积，同时用C3Ghost模块替换原网络模型中的C3模块，降低模型的复杂度。然后，为增强主干网络对特征信息的提取能力，加强输入特征图通道间的信息交互，在主干网络的C3模块中融合高效通道注意力（ECA）结构，在特征融合网络添加无参数注意力模块（SimAM），使网络聚焦更多的有效特征信息，达到不增加模型的参数量，同时又提升模型识别精度的目的。最后，结合迁移学习加快模型收敛速度并进一步提升模型检测精度。结果表明，轻量化后的网络模型体积减小55.8%，计算量减少55.1%，在自制草莓数据集上的平均精度均值mAP@0.75达到74.9%，比原模型提高3.1%，单张图片平均推理时间仅6.4 ms，能够实现在草莓采摘任务中的精准快速检测，为草莓生产智能化提供支持。

关键词: 草莓目标检测, 深度学习, 注意力机制, 轻量化模型, 迁移学习

Abstract:

To achieve the accurate detection of strawberry in agricultural harvesting, a lightweight network model based on YOLOv5s is proposed considering the limited memory and low computational power of embedded devices, as well as the huge parameters and computational demands of current target detection models. First, the YOLOv5s structure is lightweight processed by replacing ordinary convolutions with depthwise convolutions （DWConv） and substituting the C3 module in the original network with the C3Ghost module to reduce the model complexity. Second, to enhance the ability of the backbone network to extract feature information and improve the interaction between channels in the input feature maps, an efficient channel attention （ECA） structure is integrated into the C3 module of the backbone network. Additionally, a parameter-free attention module （SimAM） is added to the feature fusion network, so that the model can focus on more effective feature information without increasing the number of parameters of the model while improving the recognition accuracy. Finally, transfer learning is combined to accelerate the convergence speed of the model and further improve the detection accuracy. The results indicate that the lightweight model reduces network size by 55.8% and computation by 55.1%. The mAP@0.75 tested on a custom strawberry dataset reaches 74.9%, which is 3.1% higher than that of the original model. The average inference time per image is only 6.4 ms. This enables accurate and fast detection in strawberry picking tasks and provides support for the intelligent production of strawberries.

Key words: strawberry target detection, deep learning, attention mechanism, lightweight model, transfer learning

中图分类号:

S126
TP391.4

郭敬涛, 吕凤, 章慧婷, 杨彪, 刘大洋. 基于迁移学习与轻量化YOLOv5s的草莓目标检测方法[J]. 中国农机化学报, 2025, 46(3): 253-260.

Guo Jingtao, Lü Feng, Zhang Huiting, Yang Biao, Liu Dayang. Strawberry target detection method based on transfer learning and lightweight YOLOv5s [J]. Journal of Chinese Agricultural Mechanization, 2025, 46(3): 253-260.

参考文献

［1］　张晓慧. 草莓病害研究进展［J］. 安徽农学通报, 2018, 24（18）: 52-57.
［2］　2023—2029年中国草莓种植与深加工行业市场现状调查及投资方向研究报告［EB/OL］. https://wwwchyxxcom/research/1135804html?bd_vid=8237899329342221593， 2023-08-22.
［3］　王卓, 王健, 王枭雄,等. 基于改进YOLOv4的自然环境苹果轻量级检测方法［J］. 农业机械学报, 2022, 53（8）: 294-302.
Wang Zhuo, Wang Jian, Wang Xiaoxiong, et al. Lightweight realtime apple detection method based on improved YOLOv4［J］. Transactions of the Chinese Society for Agricultural Machinery,2022,53（8）:294-302.
［4］　闫彬, 樊攀, 王美茸, 等. 基于改进YOLOv5m的采摘机器人苹果采摘方式实时识别［J］. 农业机械学报, 2022, 53（9）: 28-38，59.
Yan Bin, Fan Pan, Wang Meirong, et al. Realtime apple picking pattern recognition for picking robot based on improved YOLOv5m ［J］. Transactions of the Chinese Society for Agricultural Machinery,2022,53（9）:28-38，59.
［5］　宋怀波, 王亚男, 王云飞,等. 基于YOLOv5s的自然场景油茶果识别方法［J］. 农业机械学报, 2022, 53（7）: 234-242.
Song Huaibo, Wang Yanan, Wang Yunfei, et al. Camellia oleifera fruit detection in natural scene based on YOLOv5s ［J］. Transactions of the Chinese Society for Agricultural Machinery,2022,53（7）:234-242.
［6］　Bargoti S, Underwood J. Deep fruit detection in orchards ［C］. 2017 IEEE International Conference on Robotics and Automation （ICRA）, 2017: 3626-3633.
［7］　闫建伟, 赵源, 张乐伟, 等. 改进Faster R—CNN自然环境下识别刺梨果实［J］. 农业工程学报, 2019, 35（18）: 143-150.
Yan Jianwei, Zhao Yuan, Zhang Lewei, et al. Recognition of rosa roxbunghii in natural environment based on improved Faster R—CNN ［J］. Transactions of the Chinese Society of Agricultural Engineering, 2019, 35（18）: 143-150.
［8］　Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector ［C］. Computer VisionECCV 2016, 2016: 21-37.
［9］　周桂红, 马帅, 梁芳芳. 基于改进YOLOv4模型的全景图像苹果识别［J］. 农业工程学报, 2022, 38（21）: 159-168.
Zhou Guihong, Ma Shuai, Liang Fangfang. Recognition of the apple in panoramic images based on improved YOLOv4model ［J］. Transactions of the Chinese Society of Agricultural Engineering, 2022, 38（21）: 159-168.
［10］　Fan Y, Zhang S, Feng K, et al. Strawberry maturity recognition algorithm combining dark channel enhancement and YOLOv5［J］. Sensors, 2022, 22（2）: 419.
［11］　Fu L S, Feng Y L, Wu J Z, et al. Fast and accurate detection of kiwifruit in orchard using improved YOLOv3—tiny model ［J］. Precision Agriculture, 2021, 22（3）: 754-776.
［12］　孙俊, 陈义德, 周鑫, 等. 快速精准识别棚内草莓的改进YOLOv4—Tiny模型［J］. 农业工程学报, 2022, 38（18）: 195-203.
Sun Jun, Chen Yide, Zhou Xin, et al. Fast and accurate recognition of the strawberries in greenhouse based on improved YOLOv4—Tiny model ［J］. Transactions of the Chinese Society of Agricultural Engineering, 2022, 38（18）: 195-203.
［13］　陈仁凡, 谢知, 林晨. 基于YOLO—ODM的温室草莓成熟度的快速检测［J］. 华中农业大学学报, 2023, 42（4）: 262-269.
［14］　Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks ［J］. Communications of the ACM, 2017, 60（6）: 84-90.
［15］　Han K, Wang Y, Tian Q, et al. GhostNet: More features from cheap operations ［C］. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）, 2020: 1577-1586.
［16］　Wang Q, Wu B, Zhu P, et al. ECA—Net: Efficient channel attention for deep convolutional neural networks ［C］. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）, 2020: 11531-11539.
［17］　Yang L, Zhang RY, Li L, et al. SimAM: A simple, parameterfree attention module for convolutional neural networks ［C］. Proceedings of the 38th International Conference on Machine Learning, 2021: 11863-11874.

[1]	吕宗旺, 王甜甜, 孙福艳, 祝玉华, . 基于改进YOLOv8m的小麦仓储粮虫检测方法[J]. 中国农机化学报, 2025, 46(3): 108-114.
[2]	吴坚, , 叶梦焱, 张同锋. 茶鲜叶智能分级装置设计与试验[J]. 中国农机化学报, 2025, 46(3): 139-145.
[3]	许毓超, , 吴茜, , 张兵园, , , 周玲莉, , 任妮, , , 张美娜, , . 轻量级深度学习网络在农作物目标检测的应用进展[J]. 中国农机化学报, 2025, 46(3): 261-270.
[4]	颜士军, 朱红梅, 王雅童, 张亮. 基于字词融合和注意力机制的兽药文本命名实体识别#br#[J]. 中国农机化学报, 2025, 46(3): 336-342.
[5]	吴秋兰, 陈雪飞, 陈超, 张峰, 王姝妹, 赵恒. 基于改进YOLOv5s的香菇菌棒污染识别方法[J]. 中国农机化学报, 2025, 46(2): 217-223.
[6]	戴敏, 孙文靖, 缪宏. 基于轻量化CBAM—GoogLeNet的辣椒病虫害识别[J]. 中国农机化学报, 2025, 46(2): 224-229.
[7]	吴阳华, 王建楠, 刘敏基, 游兆延, 谢焕雄, 杜元杰. 基于改进YOLOv5n的花生荚果实时检测方法[J]. 中国农机化学报, 2025, 46(2): 230-236.
[8]	彭雨侬, 柳平增, 张艳, . 基于深度学习的玉米生产过程知识图谱构建[J]. 中国农机化学报, 2025, 46(2): 245-252.
[9]	李涛, 买买提明⋅艾尼, 古丽巴哈尔⋅托乎提, 杨佳雨. 基于改进YOLOv8的小棚架下无核白葡萄果梗识别[J]. 中国农机化学报, 2025, 46(2): 259-263.
[10]	段小勇, 何超, 刘学渊. 基于改进DeepLabV3+的非结构化道路可行驶区域检测[J]. 中国农机化学报, 2025, 46(2): 271-278.
[11]	李先旺, 刘赛虎, 黄忠祥, 章霞东. 基于XLNet—BiLSTM—AFF—CRF的谷物收割机械维修知识命名实体识别[J]. 中国农机化学报, 2025, 46(2): 319-325.
[12]	夏子林, 张新洲, 王文波, 夏先飞, 陈兰, 顾寄南. 基于机器视觉的蚕豆荚高精度检测方法研究[J]. 中国农机化学报, 2025, 46(1): 157-163.
[13]	高芳征, 温鑫, 黄家才, 陈光明, 金少宇, 赵雪迪. 基于AD-YOLOX-Nano的茶叶嫩芽识别算法[J]. 中国农机化学报, 2025, 46(1): 178-184.
[14]	王鑫淼, 张正, 董晓威, 王林烽, 李瑞祥. 基于改进YOLOv8算法的谷子田杂草检测[J]. 中国农机化学报, 2025, 46(1): 185-189.
[15]	高天赐, 王克俭, 陈晨, 韩宪忠, 王超, 李会平, . 基于DCP-ShuffleNetV2的轻量级森林害虫识别方法[J]. 中国农机化学报, 2025, 46(1): 190-197.