English

中国农机化学报

中国农机化学报 ›› 2024, Vol. 45 ›› Issue (1): 209-216.DOI: 10.13733/j.jcam.issn.2095-5553.2024.01.029

• 农业信息化工程 • 上一篇    下一篇

基于改进Deeplabv3+模型的果树语义分割研究

黎远江1,李云伍1, 2,赵颖1, 2,台少瑜1,王克超1   

  • 出版日期:2024-01-15 发布日期:2024-02-06
  • 基金资助:
    贵州省科技计划项目(黔科合支撑[2021]一般171)

Research on semantic segmentation of fruit trees based on improved Deeplabv3+ model

Li Yuanjiang1, Li Yunwu1, 2, Zhao Ying1, 2, Tai Shaoyu1, Wang Kechao1   

  • Online:2024-01-15 Published:2024-02-06

摘要: 针对丘陵山区果园存在地形、光线、边界干扰等环境因素对单株果树难以精准识别分割的问题,提出一种改进的高精度Deeplabv3+语义分割网络模型。首先,该模型以ResNet50为主干网络提取特征,引入金字塔拆分注意力(PSA)机制,获得更清晰的果树轮廓边界信息;继而,将条纹池化(SP)模块串联到解码部分,通过SP加强特征提取,分别沿水平和垂直维度获取丰富的上下文信息,扩大感受野范围并保证信息完整性和连续性。通过语义分割可得以下结论:在使用Labelme工具进行自主图像标注的丘陵山区果树树冠图像数据集中,果树单株识别分割准确率PA为98.91%,果树分割的平均交并比MIoU为74.94%,相较于PSPNet、UNet、FCN和Deeplabv3+,PA分别提高2.5%、1.88%、1.03%和1.85%,MIoU分别提高10.93%、8.19%、2.78%、5.73%,有较明显的数据提升。该研究成果可为智能农业装备在果园对靶喷药、长势识别等精细化作业方面提供数据支撑。

关键词: 果树, 树冠分割, Deeplabv3+, 语义分割, 条状池化, 注意力机制

Abstract: In order to solve the problem that it was difficult to accurately identify and segment individual fruit trees in hilly and mountainous orchards due to environmental factors such as terrain, light and boundary interference, an improved highprecision Deeplabv3+ semantic segmentation network model was proposed. Firstly, features were extracted from ResNet50 main trunk network, and pyramid splitting attention (PSA) mechanism was introduced to obtain clearer fruit tree contour boundary information. Then, the stripe pooling (SP) module was connected to the decoding part in series, and the feature extraction was enhanced by SP to obtain rich context information along the horizontal and vertical dimensions respectively, which expanded the range of sensitivity field and ensures the integrity and continuity of information. Through semantic segmentation, it could be concluded that in the tree crown image data set of fruit trees in hilly and mountainous areas with autonomous image annotation using Labelme tool, the identification and segmentation accuracy of individual fruit trees was 98.91%, and the average intersection ratio of fruit tree segmentation was 74.94%. Compared with PSPNet, UNet, FCN and Deeplabv3+, PA was increased by 2.5%, 1.88%, 1.03% and 1.85% respectively, while MIoU was increased by 10.93%, 8.19%, 2.78% and 5.73% respectively, there was obvious improvement data. The research results could provide data support for intelligent agricultural equipment in fine operations such as target spraying and growth identification in orchards.

Key words: fruit tree, crown segmentation, Deeplabv3+, semantic segmentation, strip pooling, attention mechanism

中图分类号: