

Journal of Chinese Agricultural Mechanization ›› 2024, Vol. 45 ›› Issue (1): 274-284.DOI: 10.13733/j.jcam.issn.2095-5553.2024.01.038


3D information detection method for facility greenhouse tomato based on improved YOLOv5l

Lin Sen1, 2, Xu Tongyu1, Ge Yuhao2, Ma Jing3, Sun Tianlong2, Zhao Chunjiang2   

  • Online: 2024-01-15    Published: 2024-02-06


  • Supported by:
    Beijing Science and Technology Plan Project (Z211100004621006); sub-project of the "Science and Technology Innovation 2030" Major Project (2021ZD0113602)

Abstract: To solve the problem of inaccurate fruit recognition and positioning caused by occlusion and complex lighting in greenhouse environments, this study combines a deep learning object detection algorithm with the Intel RealSense D435i depth camera and proposes a method for obtaining the coordinate position of tomatoes in three-dimensional space, enabling a greenhouse picking robot to perform tomato localization and picking tasks. Based on the YOLOv5 network, Ghost convolution is used to replace the CSP structure of the original network, and the multi-scale connection scheme of BiFPN is adopted to make full use of the tomato feature information extracted by different feature layers and improve the accuracy of bounding box regression. Different attention mechanisms are compared, and the CBAM attention mechanism is selected and inserted into the model's feature extraction network. The model then obtains the center point of each tomato detected in the two-dimensional video stream from the RGB-D camera and calculates the tomato's spatial coordinates in the camera coordinate system. To minimize the impact of the complex greenhouse environment on target recognition and the final picking result, all depth readings beyond 1.5 m are filtered out, so that the vision algorithm focuses only on recognizing and detecting targets within a 1.5 m range. Experiments show that the mean average precision for red and green tomatoes is 82.4% and 82.2%, respectively. Finally, a method for combining a depth camera with an object detection network to measure the depth of tomato targets is presented, providing theoretical support for the vision system of tomato picking robots.
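The localization step described above reduces to back-projecting a detected bounding-box center through the pinhole camera model using the aligned depth value, then discarding anything beyond the 1.5 m working range. The following is a minimal sketch of that step in Python, not the authors' code: the intrinsics FX, FY, CX, CY are placeholder values standing in for the D435i color-stream calibration, and the helper names are hypothetical.

import numpy as np

# Placeholder intrinsics (assumed); in practice they come from the
# RealSense camera's factory calibration.
FX, FY = 615.0, 615.0      # focal lengths in pixels
CX, CY = 320.0, 240.0      # principal point in pixels
MAX_RANGE_M = 1.5          # working-range cutoff described in the abstract

def box_center(x1, y1, x2, y2):
    # Center pixel of a YOLOv5 bounding box given its corner coordinates.
    return (x1 + x2) / 2.0, (y1 + y2) / 2.0

def deproject(u, v, depth_m):
    # Map pixel (u, v) with aligned depth (meters) to camera coordinates
    # (X, Y, Z); return None if the depth is missing or beyond 1.5 m.
    if depth_m <= 0 or depth_m > MAX_RANGE_M:
        return None
    x = (u - CX) * depth_m / FX
    y = (v - CY) * depth_m / FY
    return np.array([x, y, depth_m])

if __name__ == "__main__":
    u, v = box_center(300, 220, 360, 280)   # example detection box
    print(deproject(u, v, 0.9))             # kept: within the 1.5 m range
    print(deproject(u, v, 2.1))             # None: filtered out as too far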

Key words: tomato, deep learning, picking robot, 3D object detection, YOLOv5

