模式识别与人工智能
Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence
22 Judgement and Disposal of Academic Misconduct Article
22 Copyright Transfer Agreement
22 Proof of Confidentiality
22 Requirements for Electronic Version
More....
22 Chinese Association of Automation
22 National ResearchCenter for Intelligent Computing System
22 Institute of Intelligent Machines,Chinese Academy of Sciences
More....
 
 
2010 Vol.23 Issue.6, Published 2010-12-31

Orignal Article   
   
Orignal Article
745 SVM Active Feedback Scheme Using Semi-Supervised Ensemble with Bias
WU Jun,DUAN Jing,LU Ming-Yu
Most SVM-based active learning methods are challenged by the small sample problem and the asymmetric distribution problems. A SVM-based active relevance feedback scheme is presented which deals with SVM ensemble under semi-supervised setting to augment the diversity among the individual SVM classifiers, thus a powerful ensemble classification model is obtained. Meanwhile, the powerful ensemble model is helpful to identify the most informative images for active learning. Moreover, aggregation method, termed as bias-weighting, is used within the semi-supervised ensemble framework to tackle the asymmetric distribution between positive and negative samples. Under the influence of bias-weighting, the ensemble classification model pays more attention on the positive samples than the negative ones. Experimental results validate the superiority of the presented scheme over several existing active learning methods.
2010 Vol. 23 (6): 745-751 [Abstract] ( 1047 ) [HTML 1KB] [ PDF 499KB] ( 632 )
752 A Computational Model of Salient Edges Detection on Object Contour in Natural Images
BO Yi-Hang,LUO Si-Wei,ZOU Qi
How to simulate the human visual perception system in order to build a robust and unsupervised computational model for salient edges detection on object contours in natural images is the main content in this paper. Firstly, the salient object sub-region is determined. Secondly, the low-level visual features of texture and color are analyzed to obtain a group of potential salient contour edges. Next, a graph model is established by analyzing the closure between any two potential salient edges. Finally, Dijkstras algorithm is used to obtain the salient contour edges. The experimental results of natural images show the proposed model is feasible and its biological rationality is verified.
2010 Vol. 23 (6): 752-758 [Abstract] ( 690 ) [HTML 1KB] [ PDF 463KB] ( 560 )
759 Digital Video Steganalysis Algorithm Based on Motion Estimation
SUN Yi-Feng,LIU Fen-Lin
A motion estimation based video steganalysis algorithm is proposed. The effect of information embedding on video motion estimation is investigated by the change of block mean square errors. The motion vectors are sensitive to information embedding, and the sensitivity increases with the decrease of the block size. The motion vector fields which reflect the time-variable characteristic of videos are used as the representation of video data in detecting stego-video. Firstly, a block size is given, and the motion vector fields are obtained by the minimum mean square error block matched motion estimation algorithm. Then the co-occurrence matrixes of three directional adjacent elements in the motions vector fields are calculated. The main diagonal elements and their neighbors of the co-occurrence matrixes are selected as the features. The support vector machine is adopted as the classifier. The experimental results show that the performance of the proposed algorithm is better compared with that of Budhia algorithm.
2010 Vol. 23 (6): 759-771 [Abstract] ( 678 ) [HTML 1KB] [ PDF 473KB] ( 674 )
772 Clustering Ensembles Based Classification Method for Imbalanced Data Sets
CHEN Si,GUO Gong-De,CHEN Li-Fei
Recently, classification of imbalanced data sets becomes a research hotspot in data mining and machine learning. A type of novel classification methods for imbalanced data sets based on clustering ensembles is proposed, which aims to provide a better training platform for classification methods by introducing clustering consistency index to find the cluster boundary minority examples and the cluster center majority examples. And the improved synthetic minority over-sampling technique (SMOTE) and the modified random under-sampling method are used respectively to deal with imbalanced data sets. The classifications of eight methods on some public data sets are compared. Experimental results show that the proposed methods perform better for both minority and majority classes, and are effective and feasible to deal with the imbalanced data sets.
2010 Vol. 23 (6): 772-775 [Abstract] ( 743 ) [HTML 1KB] [ PDF 656KB] ( 1008 )
776 Decision-Tree Model Research Based on Privacy-Preserving
FANG Wei-Wei,YANG Bing-Ru,YANG Jun,ZHOU Chang-Sheng

How to realize privacy-preserving data mining becomes a research hotspot in a distributed environment. A model is proposed to realize privacy-preserving decision-tree classifying when data are vertically partitioned. In this model, a privacy-preserving decision-tree is proposed, which is composed of Global-Table stored by an obvious semi-honest partner and several local decision-trees stored by different sites. By using indexed array and private data comparison protocol, decision-tree generation and classification can be realized without uncovering the original information. Theoretical analysis and experimental results demonstrate the proposed model provides good capabilities of privacies preserving, accuracy and efficiency.

2010 Vol. 23 (6): 776-780 [Abstract] ( 814 ) [HTML 1KB] [ PDF 403KB] ( 689 )
781 Maximum Generation for User to Keep Rationality in Interactive Evolutionary Computation
HAO Guo-Sheng,HUANG Yong-Qing,YAN Zhi-Gang,WEI Kai-Xia,GAO Yan,JIA Jing-Jing
To keep user rationality is a key element in interactive evolutionary computation to converge to the global solution. The maximum generation must be designed appropriately to help user keep rationality. Firstly, three different kinds of definition of the maximum generation are proposed. Secondly, the methods to calculate the maximum generation for six kinds of fitness-assignment methods are given. Both theory analysis and experimental results show that the most-satisfactory-identified fitness-assignment and the scale fitness-assignment practically help user keep rationality in more generations. The research provides references to select appropriate fitness-assignment methods.
2010 Vol. 23 (6): 781-785 [Abstract] ( 528 ) [HTML 1KB] [ PDF 348KB] ( 538 )
786 A Stereo Matching Based Construction Algorithm for Low Texture Images
SHAO Jing,DA Fei-Peng,HE Fu
Aiming at the mismatching problem in low texture areas and the well-known streaking effect of dynamic programming, an improved algorithm based on binocular stereo matching technology is proposed to generate three-dimensional reconstruction model for low texture images. Firstly, matching cost is computed based on the distinctiveness of pixels and the similarity among them. Secondly, an adaptive polygon-based support window is adopted in the matching cost aggregation, and a simple tree structure dynamic programming is introduced to guide the pixel to pixel matching. Finally, a simple and efficient method is presented to refine the mismatching pixels detected according to the left-right consistency constraint. To testify the applicability of the proposed algorithm, it is applied to low texture gray images captured in the real situation, and the experimental results show that smooth and vivid 3D points cloud models are generated.
2010 Vol. 23 (6): 786-793 [Abstract] ( 673 ) [HTML 1KB] [ PDF 546KB] ( 794 )
794 Genetic Algorithm for Rectangle Layout Optimization with Equilibrium Constraints
XU Yi-Chun ,DONG Fang-Min,LIU Yong,XIAO Ren-Bin
The 2-dimensional layout optimization problem is studied, where the unequal weighted rectangles are required to be placed in a circular container without overlap. There are two objectives, minimum of the radius of the circle and equilibrium of the system. In most of the literatures, the local search heuristics is applied to the problem. However, the performance of the local search heuristics is not satisfactory. A constructive heuristics is proposed, named orderly positioning technique (OPT). A rectangle is placed close to an already deployed rectangle. Around the deployed rectangle, only finite configurations are considered. Then the time complexity of OPT is polynomial. The output layout of OPT is often with good performance, nevertheless the positioning order of OPT affects the quality of the layout a lot. Thus, a genetic algorithm (GA) to search for the optimal positioning order is designed. In the GA, the crossover operator and mutation operator are specially designed to keep the offspring having a valid placing order. The proposed algorithm is tested on the benchmarks with large-scale instances. The numerical results show that the proposed algorithm has better performance than the local search heuristics from literatures.
2010 Vol. 23 (6): 794-801 [Abstract] ( 574 ) [HTML 1KB] [ PDF 497KB] ( 953 )
802 ACV Constraint Based Sequential Pattern Mining Algorithm
YE Hong-Yun,NI Zhi-Wei,NI Li-Ping
An aggregate constraint with items of varying values (ACV) is introduced. The ACV constraint is used to express users requirement on the aggregate feature of target patterns. An algorithm for mining frequent sequential patterns with the ACV constraint is proposed. It exploits the computational properties of ACV to effectively prune the search space. Experimental results on both the synthetic sequential data generated by IBM data generator and a real world data set show that the proposed algorithm utilizes ACV constraints to prune the useless candidate sequential patterns, and it reduces the redundant search space to improve the mining efficiency.
2010 Vol. 23 (6): 802-808 [Abstract] ( 479 ) [HTML 1KB] [ PDF 476KB] ( 680 )
809 Facial Beauty Classification Based on Geometric Features and C4.5
MAO Hui-Yun,JIN Lian-Wen,DU Ming-Hui
An automated Chinese female facial beauty classification approach is presented through the application of machine learning algorithm of C4.5. Seventeen geometric features are designed to abstractly represent each facial image. With large set of 510 Chinese female facial images, high average accuracy of 94.1% is obtained for two-level classification-beautiful or not, and the average accuracy of 4-level classification is 71.6%. The results show that the notion of beauty perceived by human can also be learned by machine through using machine learning techniques.
2010 Vol. 23 (6): 809-814 [Abstract] ( 722 ) [HTML 1KB] [ PDF 435KB] ( 1524 )
815 Adaptive Neighborhood Selection Algorithm Based on Deflection Angle of Local Tangent Space
YAN De-Qin,LIU Sheng-Lan
An adaptive neighborhood selection algorithm is proposed based on deflection angle of local tangent space by using the geometric properties of local tangent space. It computes the angle between local centralized samples and its tangent space based on the orthogonal projection of local tangent space. It depicts the properties of local tangent space better, and discriminates the samples which do not belong to this neighborhood and possesses better antinoise ability. The proposed algorithm is a modification to local tangent space alignment with manifold learning function of local high curvature. Experimental results show that the proposed algorithm is effective.
2010 Vol. 23 (6): 815-821 [Abstract] ( 482 ) [HTML 1KB] [ PDF 408KB] ( 679 )
822 Minimum Generation Error Based Optimization of HMM Model Clustering for Speech Synthesis
LU Heng,LING Zhen-Hua,LEI Ming,DAI Li-Rong,WANG Ren-Hua
To improve the decision tree clustering and avoid possible clustered model over-training and less-training, a minimal generation error criterion and cross-validation (CV) based minimal description length factor optimizing method is introduced. CV based generation error is calculated to optimize the scale of the decision tree. Results of both subjective and objective tests show that synthesized speech by the proposed method outperforms the synthesized speech by the baseline one system in both quality and naturalness.
2010 Vol. 23 (6): 822-828 [Abstract] ( 400 ) [HTML 1KB] [ PDF 478KB] ( 762 )
829 Kernel-Based Dual-Space Method and Its Fast Solution Algorithm
ZHOU Xiao-Yan,ZHENG Wen-Ming,ZOU Cai-Rong,ZHAO Li
To overcome the drawbacks of the dual-space linear discriminant analysis method, a kernel-based dual-space discriminant analysis method, namely KDS-DA is proposed. A fast algorithm based on the inverse operator of bordered matrix is also proposed for solving the discriminant vectors of KDS-DA. The algorithm utilizes the fact that the inverse computation of higher order bordered matrix can be converted to the inverse computation of a lower order matrix. To solve the (r+1)-th discriminant vector in the principal subspace of the within-class scatter matrix, the computational results obtained in computing the r-th discriminant vector are fully used, which significantly reduces the computational cost. The experimental results on ORL and AR face databases demonstrate the effectiveness of the proposed methods.
2010 Vol. 23 (6): 829-835 [Abstract] ( 476 ) [HTML 1KB] [ PDF 415KB] ( 586 )
836 A Robust Supervised Manifold Learning Algorithm and Its Application to Plant Leaf Classification
ZHANG Shan-Wen,HUANG De-Shuang
By combining the class information, local information and reliability of the original data, a geodesic distance measure is given. Based on the distance measure, a robust supervised Isomap (RS-Isomap) is proposed and is applied to the plant leaf classification. Plant leaf image sets are firstly projected into the low-dimensional manifold subspace by RS-Isomap, and then the SVM classifier is applied to plant. Finally, the experiments are implemented on the 300 leaf images of 20 plant species. The experimental results show that the proposed method is effective and feasible.
2010 Vol. 23 (6): 836-841 [Abstract] ( 733 ) [HTML 1KB] [ PDF 381KB] ( 681 )
842 Self-Regulation of Neighborhood Parameter for Locally Linear Embedding
HUI Kang-Hua,XIAO Bai-Hua,WANG Chun-Heng
The locally linear embedding (LLE) algorithm is considered as a powerful method for the problem of nonlinear dimensionality reduction. A method called self-regulated LLE is proposed. It finds the locally linear patch by analyzing the locally linear reconstruction errors of each sample in the dataset. Then, according to samples on the locally linear patch, it selects the appropriate neighborhood parameter for LLE. The experimental results show that LLE with self-regulation performs better than LLE based on different evaluation criteria and spends less time on several datasets.
2010 Vol. 23 (6): 842-856 [Abstract] ( 572 ) [HTML 1KB] [ PDF 334KB] ( 566 )
847 Web Information Extraction Based on Probabilistic Model
WANG Jing,LIU Zhi-Jing
According to the structure and the content features of web pages, a model named tree-structured hierarchical conditional random fields (TH-CRFs) is proposed. Firstly, a multi-feature vector space model is proposed to represent the features of the web pages from the facets of the page structure and the content. Secondly, the Boolean model and multi-rules are introduced to denote the features for a better representation of the web objects. Thirdly, an optimal web objects information extraction based on the TH-CRFs is performed to find out the recruitment knowledge and optimize the efficiency of the training. Finally, the proposed model is compared with the existing approaches for web objects information extraction. The experimental results show that the accuracy of the TH-CRFs for the web objects information extraction is significantly improved, and the time complexity is decreased.
2010 Vol. 23 (6): 847-855 [Abstract] ( 504 ) [HTML 1KB] [ PDF 575KB] ( 662 )
856 K-L Divergence Based Model Clustering Method for Fast Speaker Identification
WANG Huan-Liang,HAN Ji-Qing,ZHENG Gui-Bin
With the increase of enrolled speakers and audio data to be recognized, the conventional speaker identification methods can not meet the real-time demand for internet application environment. A K-L divergence based speaker model clustering method is proposed to construct a hierarchical identification system, which remarkably improves the recognition efficiency. Moreover, the confidence measure using class-level identification information is also investigated to effectively exclude out-of-set speaker as early as possible. The experimental results show the proposed method averagely increases the identification speed by 3.2 times while the error rate of closed-set identification only increases about 0.9% compared with the conventional method. The open-set identification can be speeded up by using class-level confidence measure and a relatively 5.1% error rate reduction can be achieved on out-of-set speakers identification while keeping the identification performance of in-set speakers unchanged.
2010 Vol. 23 (6): 856-861 [Abstract] ( 607 ) [HTML 1KB] [ PDF 410KB] ( 682 )
862 Multi-Label GRF Algorithm with Spearman Correlation
FENG Yuan-Ji,LI Mei-Yi,WANG Wei
An improved multi-label Gaussian random field algorithm is proposed to reduce the uncertainty of temporary labels. The spearman correlation matrix is used to build a label-relevant module instead of temporary labels. The results of comparative experiments show that the proposed algorithm is stable for temporary labels with tolerance and disturbance and it increases the accuracy of classification.
2010 Vol. 23 (6): 862-866 [Abstract] ( 590 ) [HTML 1KB] [ PDF 356KB] ( 1821 )
867 Fast Algorithm for Maximum Fuzzy Entropy Thresholding Method
LEI Bo,LAN Rong,FAN Jiu-Lun

Aiming at the large computation of the maximum fuzzy entropy thresholding method, a fast algorithm for the maximum fuzzy entropy thresholding method is presented. It is based on the analysis of the character of the S-type function and the properties of the fuzzy entropy. The fast algorithm reduces the time complexity from O(L4) to O(L3). Meanwhile, the fast algorithm avoids the defaults of reaching the local extrema by the optimization methods. Therefore, the fast algorithm raises the speed and maintains the segmentation performance of the maximum fuzzy entropy thresholding method.

2010 Vol. 23 (6): 867-873 [Abstract] ( 621 ) [HTML 1KB] [ PDF 405KB] ( 1047 )
874 Variable Structure Radial Basis Function Network and Its Application to On-Line Chaotic Time Series Prediction
YIN Jian-Chuan,HU Jiang-Qiang,HE Qing-Hua

To improve the accuracy and the speed of on-line chaotic time series prediction via radial basis function (RBF) network, a sequential learning algorithm is presented for on-line constructing variable structure RBF network. A sliding window is constructed. By learning real-time updated data in the window, the parameters of the connecting weights, number of hidden units and center locations are dynamically tuned. The algorithm achieves parsimonious RBF network quickly, while only a small number of tuning parameters are employed. The variable structure network is applied to Mackey-Glass chaotic time series on-line prediction. The results demonstrate that network possesses satisfactory on-line dynamic identification and prediction performance.

2010 Vol. 23 (6): 874-879 [Abstract] ( 454 ) [HTML 1KB] [ PDF 379KB] ( 560 )
880 Image Thresholding Based on Minimax Probability Criterion
WANG Jun,WANG Shi-Tong,DENG Zhao-Hong,QI Yun-Song

The minimax probabilistic machine is a classifier based on minimizing the misclassification probability. The problem of 1-dimensional minimax probability machine is firstly discussed. Then a theory on minimax probabilistic image segmentation is presented. A method for developing criterion function for image thresholding is proposed. Meanwhile, the minimax probabilistic image thresholding algorithm is also proposed and it ensures the maximal lower bound for correctly classifying pixels. Experimental results show the effectiveness of the proposed algorithm.

2010 Vol. 23 (6): 880-884 [Abstract] ( 576 ) [HTML 1KB] [ PDF 275KB] ( 652 )
885 Robust Online Process Modeling Method Based on Non-Bias LSSVM
ZHOU Xin-Ran, TENG Zhao-Sheng, JIANG Xing-Jun

The accuracy of least squares support vector machine(LSSVM) is influenced easily by gross errors and noises superimposed on value measurement of plant output when LSSVM is applied to the dynamic process online modeling directly. Aiming at that problem, robust online process modeling method using non-bias LSSVM is presented after the characteristics of sample sequence structure and noise action are analyzed. During the prediction period, abnormal measure data are recognized and recovered, and measure data containing noises are detected and rectified according to the relation between the predicting error and the set threshold value. Consequently, noises in samples are decreased,and online LSSVM tracks dynamics of process better. The modeling method is robust, and it decreases the effect of gross error and Gaussian white noise on the prediction accuracy of LSSVM to improve the prediction accuracy. The numerical simulation shows the validity and advantage of the proposed method.

2010 Vol. 23 (6): 885-890 [Abstract] ( 563 ) [HTML 1KB] [ PDF 523KB] ( 608 )
模式识别与人工智能
 

Supervised by
China Association for Science and Technology
Sponsored by
Chinese Association of Automation
NationalResearchCenter for Intelligent Computing System
Institute of Intelligent Machines, Chinese Academy of Sciences
Published by
Science Press
 
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn