透過您的圖書館登入
IP:3.22.77.117
  • 期刊

Maritime Small Ship Detection in Complex Ocean Environment Based on Improved Yolov3

摘要


Maritime small ship detection is a challenge problem in computer vision. At present, YOLOv3 network is widely used for object detection, but it gets low recall rate and detection accuracy for small objects in the complex ocean environment. Addressing this problem, we improve the backbone and predicted network of YOLOv3 network for detecting maritime small ship. Firstly, we build a maritime small ship dataset including four kinds of scenes: small traffic flow and heavy traffic flow in sunny and foggy weather. Secondly, we use K‐means to re‐cluster the anchor box for matching the shape of maritime ship. Thirdly, we introduce spatial pyramid pooling (SPP) module and frequency channel attention (FCA) module, and redesign the structure of YOLOv3 network, called it as SPP‐FCA‐YOLOv3. Here SPP module is used to fuse local features with global features and enriches the expression capability of the feature maps. FCA module emphasizes important object feature and suppresses unnecessary noise. Experimental results show that proposed SPP‐FCA‐YOLOv3 has higher detection accuracy for maritime small ship detection, getting a 2.2% improvement in average precision compared with YOLOv3, and a 1.2% improvement in average precision as well as higher speed compared with YOLOv5.

參考文獻


Chen C, Zhang Y, Lv Q, et al. 2019. Rrnet: A hybrid detector for object detection in drone-captured images. Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops.
Chen L, Zhang H, Xiao J, et al. 2017. Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning. Proceedings of the IEEE conference on computer vision and pattern recognition, 5659-5667.
Deng C, Wang M, Liu L, et al. 2021. Extended feature pyramid network for small object detection. IEEE Transactions on Multimedia.
Girshick R, Donahue J, Darrell T, et al. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 580-587.
Girshick R. 2015. Fast r-cnn. Proceedings of the IEEE international conference on computer vision, 1440-1448.

延伸閱讀