当前位置>主页 > 期刊在线 > 计算机技术 >

计算机技术2020年4期

​基于全局注意力机制的语义分割方法研究
彭启伟,冯杰,吕进,余磊,程鼎
(南京南瑞信息通信科技有限公司,江苏 南京 210003)

摘  要:如何捕获更长距离的上下文信息成为语义分割的一个研究热点,但已有的方法无法捕获到全局的上下文信息。为此,文章提出了一种全局注意力模块,其通过计算每个像素和其他像素之间的关系生成一个全局关系注意力谱,然后通过该全局注意力谱来对深层卷积特征进行重新聚合,加强其中的有用信息,抑制无用的噪声信息。在具有挑战性的 Cityscapes 和PASCAL VOC 2012 数据集上验证了所提出的方法具有有效性其优于现有的方法。


关键词:语义分割;注意力机制;全局信息



中图分类号:TP391.41         文献标识码:A         文章编号:2096-4706(2020)04-0102-03


Research on Semantic Segmentation Based on Global Attention Mechanism

PENG Qiwei,FENG Jie,LYU Jin,YU Lei,CHENG Ding

(Nanjing Nari Information and Communication Technology Co.,Ltd.,Nanjing 210003,China)

Abstract:How to capture the context information with longer distance has become a research hotspot of semantic segmentation,but the existing methods can not capture the global context information. This paper proposes a global attention module,which generates a global relation attention spectrum by calculating the relationship between each pixel and other pixels,and then reaggregates the deep convolution features through the global attention spectrum to strengthen the useful information and suppress the useless noise information. The validity of the proposed method is verified on the challenging Cityscape and PASCAL VOC 2012 datasets,which is superior to the existing methods.

Keywords:semantic segmentation;attention mechanism;global information


参考文献:

[1] LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation [C]//Proceedings of the IEEE conference on computer vision and pattern recognition,2015:3431-3440.

[2] CHEN L C,PAPANDREOU G,KOKKINOS I,et al.Deeplab:Semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected crfs [J].IEEE transactions on pattern analysis and machine intelligence,2017,40(4):834-848.

[3] LIN G S,MILAN A,SHEN C H,et al.Refinenet:Multipath refinement networks for high-resolution semantic segmentation [C]//Proceedings of the IEEE conference on computer vision and pattern recognition,2017.

[4] ZHANG R,TANG S,ZHANG Y,et al. Scale-Adaptive Convolutions for Scene Parsing [C]//2017 IEEE International Conference on Computer Vision (ICCV).IEEE,2017.

[5] PENG C,ZHANG X Y,YU G,et al.Large Kernel Matters——Improve Semantic Segmentation by Global Convolutional Network [C]//The IEEE Conference on Computer Vision and Pattern Recognition,2017.


作者简介:彭启伟(1984-),男,汉族,安徽六安人,高级工程师,硕士研究生,主要研究方向:视频处理。