一种有效融合多尺度特征的图像语义分割方法

首页 > 过刊浏览>2022年第33卷第3期 >264-271

一种有效融合多尺度特征的图像语义分割方法
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        
                        
                    
作者单位:(安徽理工大学 计算机科学与工程学院, 安徽 淮南 232001)
作者简介:许光宇(1976－),男,博士,硕士生导师,主 要从事数字图像处理方面的研究.
通讯作者:
中图分类号:
基金项目:国家自然科学基金(61471004)和安徽理工大学博士专项基金(ZX942)资助项目

An image semantic segmentation method effectively fusing multi-scale features

Author:

Affiliation:

(School of Computer Science and Engineering,Anhui University of Science and Tech nology,Huainan,Anhui 232001, China)

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

卷积神经网络在高级计算机视觉任务中展现出强大的特征学习能力,已经在图像语义分割任务中取得了显著的效果。然而,如何有效地利用多尺度的特征信息一直是个难点。本文提出一种有效融合多尺度特征的图像语义分割方法。该方法包含4个基础模块,分别为特征融合模块(feature fusion module,FFM)、空间信息模块(spatial information module,SIM)、全局池化模块(global pooling module,GPM)和边界细化模块(boundary refinement module,BRM)。FFM采用了注意力机制和残差结构,以提高融合多尺度特征的效率,SIM由卷积和平均池化组成,为模型提供额外的空间细节信息以辅助定位对象的边缘信息,GPM提取图像的全局信息,能够显著提高模型的性能,BRM以残差结构为核心,对特征图进行边界细化。本文在全卷积神经网络中添加4个基础模块, 从而有效地利用多尺度的特征信息。在PASCAL VOC 2012数据集上的实验结果表明该方法相比全卷积神经网络的平均交并比提高了8.7%,在同一框架下与其他方法的对比结果也验证了其性能的有效性。

Abstract:

Convolutional neural networks show strong feature learning ability in a dvanced computer vision and have achieved remarkable effect in image semantic segmentation tasks.Ho wever,how to use the multi-scale feature information effectively is always a difficulty.This paper proposes an effective image semantic segmentation method which integrates multi-scale features.The propose d method consists of four basic modules,which are feature fusion module (FFM),spatial information module (SIM),global pooling module (GPM) and boundary refinement module (BRM).FFM adopts attention mechanis m and residual structure to improve the efficiency of multi-scale feature fusion.SIM includes convolution and average pooling opearaitons,and its purpose is to assist in locating the edge informati on of the object by providing additional spatial details.GPM extracts the global information of the image,wh ich can significantly improve the performance of the model.BRM takes the residual structure as the co re to refine the boundary of the feature map.Four basic modules are added into the full convolutional neu ral network to effectively utilize the multi-scale feature information.Experimental results on PASCAL VOC 2012dataset show that mean intersection over union of the proposed method is 8.7% higher than that of full convolutional neural network.The results of comparison with other methods in the same framework also verify the effectiveness of the proposed method.

参考文献

相似文献

引证文献

引用本文

许光宇,汤伟建.一种有效融合多尺度特征的图像语义分割方法[J].光电子激光,2022,33(3):264~271

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2021-06-07
最后修改日期:
录用日期:
在线发布日期: 2022-04-27
出版日期:

首页

期刊介绍

编委会

投稿指南

期刊订阅

下载中心

公告

联系我们

引用本文

相关视频

分享

文章指标

历史

文章二维码