Semantic Segmentation Method for Complex Traffic Scene Based on DenseNet

doi:10.16451/j.cnki.issn1003-6059.201905010


Abstract: Existing semantic segmentation methods for traffic scenes suffer from large parameter counts, low computational efficiency, and insufficient accuracy. To address these problems, an end-to-end multi-scale semantic segmentation model based on a fully convolutional DenseNet is proposed. First, convolution layers with hybrid dilated convolution are stacked to build a densely connected module, and the modules are cascaded along the channel dimension to extract image features. Next, multi-scale visual information is collected and passed back to the original channels as a supervisory signal. Finally, the prediction output is obtained by bilinear interpolation. Experiments on the Cityscapes dataset show that the proposed method parses complex traffic scenes well and achieves high prediction accuracy and segmentation efficiency.

Keywords: Traffic Scene, Image Semantic Segmentation, Dilated Convolution, Multi-scale Feature Fusion
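The three building blocks named in the abstract (hybrid dilated convolution, dense concatenation along the channel dimension, and bilinear interpolation) can be illustrated with a small pure-Python sketch. This is not the authors' implementation: the function names, the dilation rates (1, 2, 5), the growth rate, and the channel counts are all hypothetical values chosen for illustration, since the abstract does not state them.

```python
from itertools import product


def covered_offsets(dilation_rates, kernel_size=3):
    """1-D input offsets that reach one output unit through stacked dilated convs."""
    half = kernel_size // 2
    # Each layer taps positions k * rate for k in [-half, half].
    taps = [[k * r for k in range(-half, half + 1)] for r in dilation_rates]
    return sorted({sum(combo) for combo in product(*taps)})


def has_gaps(offsets):
    """True if the receptive field skips input positions (the gridding effect)."""
    return any(b - a > 1 for a, b in zip(offsets, offsets[1:]))


def dense_block_channels(in_channels, growth_rate, num_layers):
    """Input channel count of each layer when all earlier outputs are concatenated.

    The last entry is the channel count leaving the block.
    """
    return [in_channels + i * growth_rate for i in range(num_layers + 1)]


def bilinear_upsample(grid, out_h, out_w):
    """Resize a 2-D list of floats by bilinear interpolation (corners aligned)."""
    in_h, in_w = len(grid), len(grid[0])
    out = []
    for i in range(out_h):
        y = i * (in_h - 1) / (out_h - 1) if out_h > 1 else 0.0
        y0 = int(y)
        y1 = min(y0 + 1, in_h - 1)
        wy = y - y0
        row = []
        for j in range(out_w):
            x = j * (in_w - 1) / (out_w - 1) if out_w > 1 else 0.0
            x0 = int(x)
            x1 = min(x0 + 1, in_w - 1)
            wx = x - x0
            # Interpolate along x on the two neighbouring rows, then along y.
            top = grid[y0][x0] * (1 - wx) + grid[y0][x1] * wx
            bottom = grid[y1][x0] * (1 - wx) + grid[y1][x1] * wx
            row.append(top * (1 - wy) + bottom * wy)
        out.append(row)
    return out


# Hypothetical settings, not taken from the paper.
hybrid = covered_offsets((1, 2, 5))    # mixed rates: contiguous coverage
uniform = covered_offsets((2, 2, 2))   # one repeated rate: only even offsets
channels = dense_block_channels(in_channels=48, growth_rate=16, num_layers=4)
upsampled = bilinear_upsample([[0.0, 2.0], [4.0, 6.0]], 3, 3)
```

With these illustrative settings, the mixed rates cover every offset from -8 to 8, while three layers of rate 2 only touch even offsets, which is the gap pattern that mixing dilation rates is meant to avoid. The channel list grows by the growth rate at every layer because each layer's output is concatenated onto its input rather than added to it.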

Received: 2018-09-15

CLC number: TP 391

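Funding: Supported by the Young Scientists Fund of the National Natural Science Foundation of China (No. 61702176) and the Natural Science Foundation of Hunan Province (No. 2017JJ3038)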

Corresponding author: JIANG Bin, Ph.D., associate professor. Research interests: big data technology, computer vision, and machine learning. E-mail: jiangbin@hnu.edu.cn

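About the authors:
TU Wenxuan, master's student. Research interests: computer vision and machine learning. E-mail: twx@hnu.edu.cn
YANG Chao, Ph.D., associate professor. Research interests: big data technology, social network computing, and intelligent information processing. E-mail: yangchaoedu@hnu.edu.cn
LIU Hongyu, master's student. Research interests: computer vision and machine learning. E-mail: www884886@126.com
ZHAO Zilong, master's student. Research interests: computer vision and machine learning. E-mail: zerahhah@gmail.com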

Cite this article:

JIANG Bin, TU Wenxuan, YANG Chao, LIU Hongyu, ZHAO Zilong. Semantic Segmentation Method for Complex Traffic Scene Based on DenseNet[J]. Pattern Recognition and Artificial Intelligence, 2019, 32(5): 472-480.

Link to this article:

http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.201905010 or http://manu46.magtech.com.cn/Jweb_prai/CN/Y2019/V32/I5/472
