
Semantic Segmentation of Street Scene Based on Multi-Scale and Attention Mechanism

Abstract


Street scene images contain objects at widely varying scales, and segmentation models that extract and fuse features at a single scale do not segment them well. This paper therefore proposes a semantic segmentation model based on multi-scale feature fusion and an attention mechanism. First, an asymmetric atrous spatial pyramid pooling (ASPP) structure is used to improve feature extraction from street scene images at different levels and scales. Second, an attention mechanism is applied to the feature maps at each scale so that the network focuses on the salient features of each level. Finally, all feature maps are resized to the same size and fused, so that the key feature information for objects at each scale is extracted and the objects are segmented effectively. Experimental results on the Cityscapes dataset show that the proposed model further improves segmentation accuracy and the quality of the segmentation results.
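
The three steps in the abstract (multi-rate atrous convolution, attention over the resulting feature maps, fusion at a common size) can be illustrated with a toy sketch. This is not the authors' network. It assumes a single-channel image, one shared 3x3 kernel, the hypothetical dilation rates (1, 2, 4), and a simple softmax gate that gives each branch one weight, standing in for whatever attention module the paper actually uses. All function names are invented for illustration. Zero padding keeps every branch at the input size, so no explicit resizing step is needed before fusion.

```python
import math


def atrous_conv2d(image, kernel, rate):
    """3x3 dilated (atrous) convolution with zero padding; output keeps the input size."""
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for ky in range(3):
                for kx in range(3):
                    # Kernel taps are spaced `rate` pixels apart.
                    sy = y + (ky - 1) * rate
                    sx = x + (kx - 1) * rate
                    if 0 <= sy < h and 0 <= sx < w:
                        acc += kernel[ky][kx] * image[sy][sx]
            out[y][x] = acc
    return out


def branch_attention(branches):
    """Assumed stand-in for attention: softmax over each branch's mean response."""
    scores = [sum(map(sum, b)) / (len(b) * len(b[0])) for b in branches]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(branches, weights):
    """Weighted sum of same-sized feature maps."""
    h, w = len(branches[0]), len(branches[0][0])
    return [
        [sum(wt * b[y][x] for wt, b in zip(weights, branches)) for x in range(w)]
        for y in range(h)
    ]


def multi_scale_features(image, kernel, rates=(1, 2, 4)):
    """Run one atrous branch per rate, weight the branches, and fuse them."""
    branches = [atrous_conv2d(image, kernel, r) for r in rates]
    weights = branch_attention(branches)
    return fuse(branches, weights), weights
```

Larger rates widen the receptive field without adding kernel weights, which is why ASPP-style modules run several rates in parallel to cover objects of different sizes.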

