基于改进U-Net的深水网箱网衣结节及纲线识别方法

Detection of netting knots and main lines in deep-sea cages using an improved U-Net

  • 摘要: 针对深海网箱养殖中水下网衣清洗机器人视觉导航与网衣维护的迫切需求,提出一种基于U-Net模型改进的网衣结构特征识别方法。该方法通过分割网衣关键结构特征——网线交点 (简称结节) 与纲线,为水下网衣清洗机器人提供可靠的视觉定位基准和路径跟踪依据,同时辅助网衣破损污损的检测。具体而言,通过水下网衣清洗机器人采集网衣图像数据,并对网衣结节和纲线进行标注,构建网衣结构特征数据集。以轻量级语义分割模型U-Net为基线模型,创新性集成空洞空间金字塔池化 (Atrous Spatial Pyramid Pooling, ASPP) 与交叉注意力机制 (Criss-Cross Attention, CCA),强化模型对网衣结构特征的语义表达能力;同时设计适配网衣结构特征的专项评价指标。通过系统对比不同中间层通道数的U-Net以及改进模型的性能。结果表明,改进模型在网衣数据集上表现最优,其中结节Dice系数为0.60 (F1分数:0.92),纲线Dice系数为0.74 (F1分数:0.62),显著优于基线模型。该方法可为后续网衣结节和纲线的跟踪研究提供参考。

     

    Abstract: Addressing the urgent requirements for visual navigation of underwater net-cleaning robots and netting maintenance in deep-sea cage aquaculture, this paper proposes a method for recognizing netting structural features, which is an improved method based on the U-Net model. This method provides reliable visual positioning references and path-tracking foundations for underwater net-cleaning robots by segmenting the key features of netting structure, namely the intersections of mesh wires (Referred to as net knots) and cable lines, while facilitating the detection of damage and fouling on the netting structure. First, underwater netting image data were collected using an underwater net-cleaning robot, and annotations of net knots and cable lines in the images were performed to construct a dataset for netting structural features. Then, using the lightweight semantic segmentation model U-Net as the baseline model, innovations were made by integrating Atrous Spatial Pyramid Pooling (ASPP) and Criss-Cross Attention (CCA) mechanisms. This integration enhanced the model's capability to capture multi-scale contextual information and focus on critical structural regions, thus strengthening the semantic representation of netting features. Additionally, we designed task-specific evaluation metrics tailored to net knots and cable lines, and conducted systematic performance comparisons between U-Net variants with different intermediate-layer channel numbers and the proposed improved model. Experimental results demonstrate that the improved model achieved optimal performance on the constructed netting dataset. Notably, net knots segmentation achieved 0.60 Dice and 0.92 F1, while cable line segmentation attained 0.74 Dice and 0.62 F1, representing significant improvements over baseline configurations. The improved U-Net model integrating ASPP and CCA mechanisms provides an effective solution for precise segmentation of netting knots and cable lines in underwater environments, which also provides a reference for future research on net knot and cable tracking.

     

/

返回文章
返回