Calculation Method of RGBD Scene Flow Combining Gaussian Mixture Model and Multi-channel Bilateral Filtering

doi:10.16451/j.cnki.issn1003-6059.202303003


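Abstract: Existing RGBD scene flow calculation methods show low accuracy and reliability in complex motion scenes such as large displacement and motion occlusion. To address this problem, this paper proposes an RGBD scene flow calculation method that combines a Gaussian mixture model with multi-channel bilateral filtering. First, an optical flow clustering segmentation model based on the Gaussian mixture model is constructed; it extracts target motion information from the optical flow and refines the layered segmentation of the depth map layer by layer, yielding high-confidence depth-motion layered segmentation information. Then, multi-channel bilateral filtering optimization is introduced into the scene flow computation, and an RGBD scene flow calculation model combining the Gaussian mixture model with multi-channel bilateral filtering is established to overcome edge blurring in scene flow computation. Finally, experiments on the Middlebury and MPI-Sintel datasets show that the proposed method achieves high scene flow accuracy and robustness in complex motion scenes such as large displacement and motion occlusion, and preserves edge regions particularly well.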
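The first step described above, clustering optical flow vectors with a Gaussian mixture model, can be sketched as follows. This is a minimal NumPy illustration and not the authors' model: the abstract does not give the formulation, so the function name `fit_gmm_flow`, the farthest-point initialization, the number of components, and the synthetic flow field are all assumptions, and the layer-by-layer refinement of the depth segmentation is not reproduced.

```python
import numpy as np

def fit_gmm_flow(flow, k=2, iters=30):
    """Cluster per-pixel flow vectors (H, W, 2) with a k-component GMM via EM.

    Hypothetical helper for illustration. Returns (labels (H, W), means (k, 2)).
    """
    h, w, _ = flow.shape
    x = flow.reshape(-1, 2).astype(np.float64)
    n = x.shape[0]

    # Deterministic farthest-point initialization of the component means.
    centers = [x[np.argmin(((x - x.mean(axis=0)) ** 2).sum(axis=1))]]
    for _ in range(1, k):
        d2 = np.min([((x - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(x[np.argmax(d2)])
    means = np.array(centers)

    # Start from isotropic covariances and uniform mixing weights.
    var0 = x.var(axis=0).mean() + 1e-6
    covs = np.tile(var0 * np.eye(2), (k, 1, 1))
    weights = np.full(k, 1.0 / k)

    for _ in range(iters):
        # E-step: responsibilities, computed in the log domain for stability.
        log_p = np.empty((n, k))
        for j in range(k):
            d = x - means[j]
            maha = np.einsum("ni,ij,nj->n", d, np.linalg.inv(covs[j]), d)
            log_det = np.linalg.slogdet(covs[j])[1]
            log_p[:, j] = np.log(weights[j]) - 0.5 * (maha + log_det)
        log_p -= log_p.max(axis=1, keepdims=True)
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)

        # M-step: update weights, means, and full covariances.
        nk = resp.sum(axis=0) + 1e-12
        weights = nk / n
        means = (resp.T @ x) / nk[:, None]
        for j in range(k):
            d = x - means[j]
            covs[j] = (resp[:, j, None] * d).T @ d / nk[j] + 1e-6 * np.eye(2)

    return resp.argmax(axis=1).reshape(h, w), means

# Synthetic example (assumed data): a static background and one moving square.
rng = np.random.default_rng(1)
flow = rng.normal(0.0, 0.1, size=(20, 20, 2))
flow[5:15, 5:15] += np.array([5.0, -3.0])
labels, means = fit_gmm_flow(flow, k=2)
```

In this toy scene the label map separates the moving square from the background; in a real pipeline such motion labels would then be compared against a depth-based layering, which is the part the abstract attributes to the proposed method.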

Authors: WANG Zige, LI Yingying, GE Liyue, CHEN Zhen, ZHANG Congxuan

Keywords: RGBD scene flow, Gaussian mixture model, multi-channel bilateral filtering, optical flow, edge blur

Abstract: To improve the accuracy and robustness of existing RGBD scene flow calculation methods in complex motion scenarios such as large displacement and motion occlusion, an RGBD scene flow calculation method combining a Gaussian mixture model and multi-channel bilateral filtering is proposed. Firstly, an optical flow clustering segmentation model based on the Gaussian mixture model is constructed to extract target motion information from the optical flow and to optimize the depth map segmentation layer by layer, yielding high-confidence depth-motion hierarchical segmentation information. Then, multi-channel bilateral filtering optimization is introduced into the scene flow computation, and an RGBD scene flow estimation model combining the Gaussian mixture model and multi-channel bilateral filtering is established to overcome edge blurring in scene flow computation. Finally, experiments on the Middlebury and MPI-Sintel datasets demonstrate that the proposed method exhibits higher accuracy and robustness in complex motion scenarios such as large displacement and motion occlusion, and preserves edges particularly well.
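The edge-preserving step can be illustrated with a joint bilateral filter whose weights combine spatial distance with differences in several guide channels. The following is a minimal sketch, assuming the guide channels are color and depth; the abstract does not specify the channels or the weighting, so `multichannel_bilateral`, its `sigmas` parameter, and the synthetic step-edge scene are all hypothetical.

```python
import numpy as np

def multichannel_bilateral(field, guides, sigmas, radius=2, sigma_s=2.0):
    """Smooth `field` (H, W, C) with a joint bilateral filter.

    Hypothetical helper for illustration. Each weight is the product of a
    spatial Gaussian and one range Gaussian per guide channel in `guides`
    (arrays of shape (H, W) or (H, W, D)), with range scales `sigmas`.
    """
    h, w = field.shape[:2]
    field = field.astype(np.float64)
    guides = [g.reshape(h, w, -1).astype(np.float64) for g in guides]

    def pad(a):
        # Replicate border pixels so every window is fully defined.
        return np.pad(a, ((radius, radius), (radius, radius), (0, 0)), mode="edge")

    f_pad = pad(field)
    g_pad = [pad(g) for g in guides]
    num = np.zeros(field.shape, dtype=np.float64)
    den = np.zeros((h, w, 1), dtype=np.float64)

    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            ys = slice(radius + dy, radius + dy + h)
            xs = slice(radius + dx, radius + dx + w)
            # Spatial term: identical for every pixel at this offset.
            log_w = np.full((h, w, 1), -(dx * dx + dy * dy) / (2.0 * sigma_s ** 2))
            # Range terms: penalize neighbours that differ in any guide channel.
            for g, gp, s in zip(guides, g_pad, sigmas):
                diff2 = ((gp[ys, xs] - g) ** 2).sum(axis=2, keepdims=True)
                log_w -= diff2 / (2.0 * s ** 2)
            wgt = np.exp(log_w)
            num += wgt * f_pad[ys, xs]
            den += wgt

    # The centre pixel always contributes weight 1, so `den` is never zero.
    return num / den

# Synthetic example (assumed data): a depth and color step edge with noisy flow.
rng = np.random.default_rng(0)
h = w = 16
depth = np.ones((h, w)); depth[:, 8:] = 5.0
color = np.zeros((h, w, 3)); color[:, 8:] = 1.0
clean = np.zeros((h, w, 3)); clean[:, 8:, 0] = 2.0
noisy = clean + rng.normal(0.0, 0.05, size=clean.shape)
smoothed = multichannel_bilateral(noisy, [color, depth], sigmas=[0.1, 0.1])
```

Because neighbours across the color and depth discontinuity receive near-zero weight, the noise is averaged out within each side while the motion boundary stays sharp, which is the behaviour the abstract describes as overcoming edge blurring.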

Key words: RGBD Scene Flow, Gaussian Mixture Model, Multi-channel Bilateral Filtering, Optical Flow, Edge Blur

Received: 2022-12-06

CLC number: TP391

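Funding: Supported by the National Key Research and Development Program of China (No.2020YFC2003800), the National Natural Science Foundation of China (No.62222206, 62272209, 61866026, 61772255, 61866025), the Key Project of the Natural Science Foundation of Jiangxi Province (No.20202ACB214007), the Technology Innovation Guidance Program Project of Jiangxi Province (No.20212AEI91005), the Advantage Science and Technology Innovation Team Project of Jiangxi Province (No.20165BCB19007), the Science and Technology Project of the Education Department of Jiangxi Province (No.GJJ210910), the Open Fund of the Jiangxi Key Laboratory of Image Processing and Pattern Recognition (No.ET202104413), and the Postgraduate Innovation Fund of Jiangxi Province (provincial level) (No.YC2021-S689)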

Corresponding author: ZHANG Congxuan, Ph.D., professor. His research interests include image processing and computer vision. E-mail: zcxdsg@163.com.

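About the authors: WANG Zige, master's student. Research interests: computer vision. E-mail: wangzggg@163.com. LI Yingying, master's student. Research interests: computer vision. E-mail: 1203602578@qq.com. GE Liyue, Ph.D. candidate, assistant experimentalist. Research interests: machine vision and intelligent perception. E-mail: lygeah@163.com. CHEN Zhen, Ph.D., professor. Research interests: image processing and computer vision. E-mail: dr_chenzhen@163.com.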

Cite this article:

WANG Zige, LI Yingying, GE Liyue, CHEN Zhen, ZHANG Congxuan. Calculation Method of RGBD Scene Flow Combining Gaussian Mixture Model and Multi-channel Bilateral Filtering[J]. Pattern Recognition and Artificial Intelligence, 2023, 36(3): 225-241.

Link to this article:

http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.202303003 or http://manu46.magtech.com.cn/Jweb_prai/CN/Y2023/V36/I3/225
