
Journal of Chinese Agricultural Mechanization

Journal of Chinese Agricultural Mechanization, 2024, Vol. 45, Issue (4): 168-174. DOI: 10.13733/j.jcam.issn.2095-5553.2024.04.024

• Agricultural Informatization Engineering •

Detection of tea buds based on improved YOLOv5s

Yan Beibei (严蓓蓓)1, Ji Yuanhao (纪元浩)1, Qu Fengfeng (曲凤凤)2, Xu Jinpu (许金普)1

  • Online: 2024-04-15  Published: 2024-04-28
  • Supported by:
    Shandong Province Major Science and Technology Innovation Project (2021LZGC014-3); Laoshan Tea Industry Innovation Team (LSCG2022000017)

Abstract: To improve the accuracy of tea bud recognition, raise the working efficiency of automatic picking robots, and reduce the cost of manual picking, a target detection model for tea buds is proposed. Images of Baihaozao tea buds were captured and screened, yielding 179 images, which were expanded to 716 images by Mosaic data augmentation to build the dataset; the dataset was divided into training, test, and validation sets at a ratio of 7∶2∶1. To address the low recognition accuracy caused by overlapping and occluded tea buds in complex backgrounds, the YOLOv5s model was modified: the SE and CBAM attention modules were added to the backbone network and compared, the Neck was changed from the original PAFPN to a BiFPN capable of bidirectional weighted fusion, and the Head structure was extended with a P2 module from the shallow downsampling layer, yielding a tea bud detection model. Experiments show that the model achieves higher detection accuracy when YOLOv5s is combined with the SE module and BiFPN. With the results verified by ten-fold cross-validation, precision improves by 10.46% over the baseline to 88.30%, and mean average precision (mAP) improves by 6.47% to 85.83%. Finally, comparison with YOLOv5m, Faster RCNN and YOLOv4-tiny on the same dataset with the same preprocessing shows that the proposed method outperforms these classical deep learning methods overall, improves the accuracy of tea bud detection more effectively, and can provide a theoretical basis for automatic tea-picking robots.
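
To make the channel-attention step concrete, the following is a minimal PyTorch sketch of a squeeze-and-excitation (SE) block of the kind added to the YOLOv5s backbone; the class name SEBlock, the reduction ratio of 16, and the example feature-map shape are illustrative assumptions, not the authors' exact implementation.

# Minimal squeeze-and-excitation (SE) channel-attention block.
# Illustrative only: layer names, the reduction ratio of 16, and the
# example input shape are assumptions, not the paper's exact settings.
import torch
import torch.nn as nn

class SEBlock(nn.Module):
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)          # squeeze: global spatial average
        self.fc = nn.Sequential(                     # excitation: two FC layers
            nn.Linear(channels, channels // reduction, bias=False),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels, bias=False),
            nn.Sigmoid(),                            # per-channel weights in (0, 1)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w                                 # reweight feature channels

if __name__ == "__main__":
    feat = torch.randn(1, 128, 40, 40)               # e.g. a backbone feature map
    print(SEBlock(128)(feat).shape)                  # torch.Size([1, 128, 40, 40])

In a YOLOv5s-style backbone, such a block would typically be inserted after a C3 stage so that the channels of that stage's output are reweighted before being passed on to the Neck.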

Key words: tea, bud detection, YOLOv5s, attention mechanism, bidirectional feature pyramid
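
The BiFPN change in the Neck replaces plain feature concatenation with a learnable, fast-normalized weighting of the feature maps being fused. Below is a minimal sketch of that two-input weighted fusion, assuming both inputs have already been resized to the same shape; the module name WeightedFusion2 and the epsilon constant are illustrative choices, not values taken from the paper.

# Fast normalized weighted fusion of two feature maps, as used in BiFPN.
# Illustrative sketch: names, the epsilon constant, and the assumption that
# both inputs were already resized to the same shape are ours, not the paper's.
import torch
import torch.nn as nn

class WeightedFusion2(nn.Module):
    def __init__(self, eps: float = 1e-4):
        super().__init__()
        self.w = nn.Parameter(torch.ones(2))          # learnable fusion weights
        self.eps = eps

    def forward(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        w = torch.relu(self.w)                        # keep weights non-negative
        w = w / (w.sum() + self.eps)                  # fast normalization
        return w[0] * a + w[1] * b                    # weighted sum of the two levels

if __name__ == "__main__":
    p4_td = torch.randn(1, 256, 40, 40)               # top-down feature at one level
    p4_in = torch.randn(1, 256, 40, 40)               # lateral input at the same level
    print(WeightedFusion2()(p4_td, p4_in).shape)      # torch.Size([1, 256, 40, 40])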
