一种自动的唇部定位及唇轮廓提取、跟踪方法<sup>*</sup>

Abstract
Figure/Table
References
Related Citation (15)

Download: PDF (1460 KB) HTML (1 KB)
Export: BibTeX | EndNote (RIS)

Abstract ;An automatic approach to lip localization, contour extraction and tracking is presented, which combines CbCr color space, Fisher transform and deformable templates. Firstly, a skincolor model is constructed for skin detection in CbCr color space so that the approximate lip region can be obtained based on the geometry features of human face. Then, the color difference between lip and skin is enhanced by Fisher transform. Preprocessing of brightness is carried out before segmentation, and the threshold is obtained by Otsu method. Next, the lip color model is used to validate the segmentation result of accurate localization and the deformable templates are used for lip contour extraction. Based on the segmentation result, a method of curves fitting for edges of inner mouth is presented to extract the inner contour robustly. Finally, locating result of previous frame is predicted as the next lip region, in which lip localization and contour extraction are executed for lip tracking.

Key words： LipReading Face Detection Color Space Fisher Transform Localization Deformable Template Contour Extraction Tracking

Received: 16 January 2006

ZTFLH:

TP391.41

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	WANG XiaoPing
	HAO YuFeng
	FU DeGang
	YUAN ChunWei

Cite this article:

WANG XiaoPing,HAO YuFeng,FU DeGang等. An Automatic Approach to Lip Localization, Contour Extraction and Tracking[J]. , 2007, 20(4): 485-491.

URL:

http://manu46.magtech.com.cn/Jweb_prai/EN/ OR http://manu46.magtech.com.cn/Jweb_prai/EN/Y2007/V20/I4/485

[1] Hennecke M E, Prasad K V, Stork D G. Automatic Speech Recognition System Using Acoustic and Visual Signals // Proc of the 29th Asilomar Conference on Signals, Systems and Computers. Pacific Grove, USA, 1995, Ⅱ: 12141218
[2] Lanitis A, Taylor C J, Cootes T F. An Automatic Face Identification System Using Flexible Appearance Models. Image and Vision Computing, 1995, 13(5): 393401
[3] Turk M, Pentland A. Eigenfaces for Recognition. Journal of Cognitive Neuroscience, 1991, 3(1): 7186
[4] Yang Jie, Waibel A. A Realtime Face Tracker // Proc of the 3rd IEEE Workshop on Applications of Computer Vision. Sarasota, USA, 1996: 142147
[5] Chai D, Ngan K N. Face Segmentation Using SkinColor Map in Videophone Application. IEEE Trans on Circuits and Systems for Video Technology, 1999, 9(4): 551564
[6] Lin Fuzong. Fundamentals of Multimedia Technology. 2nd Edition. Beijing, China: Tsinghua University Press, 2002 (in Chinese)
(林福宗.多媒体技术基础.第2版.北京:清华大学出版社, 2002)
[7] Yao Hongxun, Liu Mingbao, Gao Wen, et al. Method of Face Locating and Tracking Based on Chromatic Coordinates Transformation of Color Images. Chinese Journal of Computers, 2000, 23(2): 158165 (in Chinese)
(姚鸿勋,刘明宝,高文,等.基于彩色图像的色系坐标变换的面部定位与跟踪法.计算机学报, 2000, 23(2): 158165)
[8] Wang Rui, Gao Wen, Ma Jiyong. An Approach to Robust and Fast Locating of Lip Motion. Chinese Journal of Computers, 2001, 24(8): 866871 (in Chinese)
(王瑞,高文,马继涌.一种快速、鲁棒的唇动检测与定位方法.计算机学报, 2001, 24(8): 866871)
[9] Bian Zhaoqi, Zhang Xuegong. Pattern Recognition. 2nd Edition. Beijing, China: Tsinghua University Press, 2002 (in Chinese)
(边肇祺, 张学工.模式识别.第2版.北京:清华大学出版社, 2002)
[10] Otsu N. A Threshold Selection Method from GrayLevel Histogram. IEEE Trans on Systems, Man and Cybernetics, 1979, 9(1): 6266
[11] Hennecke M E, Prasad K V, Stork D G. Using Deformable Templates to Infer Visual Speech Dynamics // Proc of the 28th Annual Asilomar Conference on Signals, Systems and Computers. Pacific Grove, USA, 1994, Ⅰ: 578582
[12] Kass M, Witkin A, Terzopoulus D. Snakes: Active Contour Models. International Journal of Computer Vision, 1988, 1(4): 321331
[13] Bregler C, Konig Y. Eigenlips for Robust Speech Recognition // Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Adelaide, Australia, 1994: 669672
[14] Iwano K, Tamura S, Furui S. Bimodal Speech Recognition Using Lip Movement Measured by OpticalFlow Analysis // Proc of the International Workshop on HandsFree Speech Communication. Kyoto, Japan, 2001: 187190
[15] Luettin J, Thacker N A, Beet S W. Speechreading Using Shape and Intensity Information // Proc of the IEEE International Conference on Spoken Language. Philadelphia, USA, 1996, Ⅰ: 5861
[16] Cootes T F, Edwards G J, Taylor C J. Active Appearance Models. IEEE Trans on Pattern Analysis and Machine Intelligence, 2001, 23(6): 681685
[17] Aleksic P S, Katsaggelos K. Comparison of MPEG4 Facial Animation Parameter Groups with Respect to AudioVisual Speech Recognition Performance // Proc of the IEEE International Conference on Image Processing. Genoa, Italy, 2005, Ⅲ: 501504
[18] Xue Yi. The Principles and Methods for Optimization. Beijing, China: Beijing Industry University Press, 2001 (in Chinese)
(薛毅.最优化原理与方法.北京:北京工业大学出版社, 2001)
[19] Gonzalez R C, Woods R E. Digital Image Processing. 2nd Edition. Upper Saddle River, USA: Prentice Hall, 2002