一种新的多尺度深度学习图像语义理解方法研究

首页 > 过刊浏览>2016年第27卷第2期 >224-230

一种新的多尺度深度学习图像语义理解方法研究
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        
                        
                    
作者单位:(天津理工大学 天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教 育部重点实验室,天津 300384)
作者简介:张桦(1962－),女,四川人,博士,教授, 博士生导师,主要研究方向为图形图像和模式识别.
通讯作者:
中图分类号:
基金项目:国家自然科学基金(61202168,1,61472278)、天津市自然科学基金重点(14JCZDJC31700)和天津市高校发展基金(20120802,4)资助项目 (天津理工大学 天津市智能计算及软件新技术重点实验室,计算机视觉与系统省部共建教育部重点实验室,天津 300384)

A new multi-scale image semantic understanding method based on deep learning

Author:

Affiliation:

(Key Laboratory of Computer Vision and System,Ministry of Education of China,Tia njin Key Laboratory of Intelligence Computing and Novel Software Technology,Tian jin Universi ty of Technology,Tianjin 300384,China)

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

如何在深度学习中融合图像的多尺度信息,是基于深度学习的视觉算法需要解决的一个关键问题。本文提出一种基于多尺度交替迭代训练的深度学习方法,并应用于图像的语义理解。算法采用卷积神经网络(CNN)从原始图像中提取稠密性特征来编码以每个像素为中心的矩形区域,将多个尺度图像交替迭代训练,能够捕获不同尺度下的纹理、颜色和边缘等重要信息。在深度学习提取特征分类结果的基础上,提出了一种结合超像素分割的方法,统计超像素块的主导类别,来校正分类错误的像素类别,同时描绘出目标区域边界轮廓,完成最终的语义理解。在Stanford Background Dataset 8类数据集上验证了本文方法的有效性,准确率达到77.4%。

Abstract:

How to fuse multi-scale information o f image in deep learning is a key problem to be solved.To solve these problems,this paper proposed a deep learning method base d on multi-scale iterative training for image semantic understanding.The algorithm uses the convolution neu ral network (CNN) to extract dense feature vectors from raw pixel for encoding regions centered on each pixel.The multi-scale iterative training captures different scales of textures,colors,edges and other important informa tion.A new method combined with superpixel segmentation is proposed,to estimate the leading category of superpi xel block and to correct the pixel classification error.It can depict the outline of the target area boundary a nd complete the final semantic understanding.The experiments on Stanford Background Dataset-8verify the effectiveness of the proposed method,and the accuracy rate is 77.4%.

参考文献

相似文献

引证文献

引用本文

蒋应锋,张桦,薛彦兵,周冕,徐光平,高赞.一种新的多尺度深度学习图像语义理解方法研究[J].光电子激光,2016,27(2):224~230

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2015-09-25
最后修改日期:
录用日期:
在线发布日期: 2016-03-15
出版日期:

首页

期刊介绍

编委会

投稿指南

期刊订阅

下载中心

公告

联系我们

引用本文

分享

文章指标

历史

文章二维码