基于可变形卷积的多人人体姿态估计

doi:10.16451/j.cnki.issn1003-6059.202010009

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (5282 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要目前针对人体姿态估计的深度神经网络都是在特征图的固定位置上进行采样,无法对人体姿态的几何变换进行建模,当人体实例在尺寸、姿势、拍摄角度等方面发生变化后,网络泛化能力较差.因此,文中提出基于可变形卷积的多人人体姿态估计方法.利用可变形卷积对目标几何变换建模能力较强的特性,设计特征提取模块,可在人体关键点几何变化的条件下保证检测的准确性.为了进一步提高网络性能,利用预训练残差网络.模型的预测值与二维高斯模型生成的真值用于计算损失,并迭代训练模型,能在拍摄视角、附着物及人物尺度变化等复杂条件下有效检测人体关键点.实验表明,文中模型可有效提升人体关键点检测的准确性.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	赵云霄
	钱宇华
	王克琪

关键词 ：计算机视觉, 人体姿态估计, 人体关键点检测, 深度学习

Abstract：Deep neural networks for human pose estimation all sample at the fixed position of the feature map, and therefore it is difficult to model the geometric transformation of human pose. The generalization ability of the network is poor with the variation of the size, pose and shooting angle of the human instance. To solve this problem, multi-person human pose estimation based on deformable convolution is proposed.Based on the strong ability of deformable convolution in modeling geometric transformation of targets, a feature extraction module is designed to ensure the detection accuracy under the geometric changes of human key points. To further improve the performance of the network, the prediction value of the model and the truth value generated by the two-dimensional Gaussian model are employed to calculate the loss, and the model is trained iteratively. The human key points are detected effectively by the proposed model under the complex conditions, such as shooting angle, attachment and character scale changes. The experiment shows that the proposed model effectively improves the accuracy of human key point detection.

Key words： Computer Vision Human Pose Estimation Human Keypoints Detection Deep Learning

收稿日期: 2020-06-15

ZTFLH:

TP391

基金资助:国家自然科学基金项目(No.61672332)、山西省回国留学人员科研项目(No.2017023)资助

通讯作者: 钱宇华,博士,教授,主要研究方向为模式识别、特征选择、粗糙集理论、粒计算、人工智能.E-maill:jinchengqyh@126.com.

作者简介: 赵云霄,硕士研究生,主要研究方向为模式识别、深度学习.E-mail:384498377@163.com. 王克琪,硕士,主要研究方向为机器学习、计算机视觉、计算理论.E-mail:76190504@qq.com.

引用本文:

赵云霄, 钱宇华, 王克琪. 基于可变形卷积的多人人体姿态估计[J]. 模式识别与人工智能, 2020, 33(10): 944-950. ZHAO Yunxiao, QIAN Yuhua, WANG Keqi. Multi-person Human Pose Estimation Based on Deformable Convolution. , 2020, 33(10): 944-950.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.202010009 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2020/V33/I10/944