EAAI Journal 2026 Journal Article
An enhanced segmentation network built upon the you only look once framework for precise weed recognition in early-stage cotton
- Peng Qin
- Jiajia Wang
- Zhenhong Jia
- Gang Zhou
- Wei Chen
Effective weed management in cotton fields is essential for precision agriculture, where accurate segmentation technologies enable site-specific herbicide application. To facilitate early recognition and timely control of weeds, a self-constructed dataset was established from early-stage cotton fields containing 11 weed species. Considering the challenges posed by occlusion, boundary ambiguity, and the high cost of pixel-level annotations, an enhanced instance segmentation network (ESNet) was developed on the basis of the You-Only-Look-Once version 11 segmentation (YOLO11-seg) framework to improve segmentation performance, and an active learning strategy was further introduced to reduce annotation workload. Specifically, the network integrates the Dynamic-Ghost Enhanced C3k2 Module (C3k2_DG) for lightweight and diverse feature extraction, the Plant-Shaped Enhanced Convolution (PSEC) for downsampling with orientation- and scale-aware modeling, the Dual-Branch Progressive Attention Fusion (DBPAF) for progressive multi-level feature integration, and the Local Importance Attention (LIA) for boundary refinement. The experimental results demonstrated that the mean Intersection over Union (mIoU) was 0. 818, a Mask Precision was 0. 943, and mean Average Precision (mAP50) was 0. 917. Within the active learning framework, a strategy combining Bayesian Active Learning by Disagreement (BALD) uncertainty estimation, Core-Set–based diversity sampling, and class-balanced weighting was adopted to identify representative samples more efficiently. Using only half of the training data, this approach retained 97. 9 % of the mIoU and 98. 1 % of the mAP50 achieved with the full dataset. To further demonstrate its practical applicability, the network was also tested through deployment on mobile devices, validating its feasibility for agricultural perception terminals.