模式识别与人工智能
Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence
22 Judgement and Disposal of Academic Misconduct Article
22 Copyright Transfer Agreement
22 Proof of Confidentiality
22 Requirements for Electronic Version
More....
22 Chinese Association of Automation
22 National ResearchCenter for Intelligent Computing System
22 Institute of Intelligent Machines,Chinese Academy of Sciences
More....
 
 
2018 Vol.31 Issue.1, Published 2018-01-31

   
0
2018 Vol. 31 (1): 0- [Abstract] ( 1336 ) [HTML 1KB] [ PDF 259KB] ( 745 )
1 Interpretable Structured Multi-modal Deep Neural Network
XIONG Hongkai, GAO Xing, LI Shaohui, XU Yuhui, WANG Yongzhuang,YU Haoyang, LIU Xin, ZHANG Yunfei

Deep learning methods achieve excellent performance in the fields of computer vision and natural language processing through end-to-end supervised training dependent on large scale labeled datasets. However, the existing methods are often targeted for single modal data, ignoring the inherent structure of the data with the lack of theoretical support. Therefore, the wavelet theory based deep convolution networks, the structured deep learning and the multi-modal deep learning are discussed in this paper to demonstrate the potential methods of the combination of deep learning techniques, wavelet theory and structure prediction, and the viable mechanism for extending to multi-modal data is explored as well.

2018 Vol. 31 (1): 1-11 [Abstract] ( 1146 ) [HTML 1KB] [ PDF 1220KB] ( 1948 )
12 Binary Representation Learning and Its Applications
LU Jiwen, DUAN Yueqi, CHEN Zhixiang, ZHOU Jie

With the rapid development of internet and other information technologies, the amount of visual data grows explosively. It is an essential research topic for efficient visual data mining in the big data era. In this paper, the technology of binary representation learning and its applications are reviewed. Binary representation shows efficiency in storage, transmission and matching, and it is successfully applied to several visual analysis tasks. The applications on visual search and recognition are summarized, and the future trend of binary representation learning is pointed out. From the aspect of methodology, the image-based hashing and the video-based hashing are illustrated, respectively. From the aspect of applications, the successful applications of binary representation learning on varying tasks are discussed including facial analysis, image categorization, image matching and visual tracking. Finally, the future trend of binary representation learning is introduced.

2018 Vol. 31 (1): 12-22 [Abstract] ( 623 ) [HTML 1KB] [ PDF 1022KB] ( 1032 )
23 Research Progress of Low-Rank Matrix Approximation and #br# Optimization Problem
ZHANG Hengmin, YANG Jian, ZHENG Wei

Based on the compression and recovery of high-dimensional data, the development process from the theory of Shannon sampling to sparse representation and compression perception and then to low-rank matrix problem is described. Then, the importance of low rank matrix relaxation and optimization problem is discussed. Subsequently, a detailed review of the existing methods is introduced from three aspects of low rank matrix minimization, decomposition, optimization and applications. Finally, some reasonable suggestions on the deficiencies of current research and the future research direction are put forward.

2018 Vol. 31 (1): 23-36 [Abstract] ( 898 ) [HTML 1KB] [ PDF 1797KB] ( 2499 )
37 A Review and Comparison Study on Face Sketch Synthesis
WANG Nannan, LI Jie, GAO Xinbo

Face sketch synthesis refers to generating a sketch from an input face photo with some face sketch-photo pairs as the training set. An experimental study on the existing methods is nontrivial. The comprehensive review and the comparative study on representative face sketch synthesis methods are conducted. These methods are grouped into two categories: data-driven methods, also known as exemplar-based methods, and model-driven methods. Generally, data-driven face sketch synthesis consists of three sub-categories: subspace learning-based methods, sparse representation-based methods and Bayesian inference-based methods. Model-driven methods explicitly learn the mapping from face photos to face sketches. Some previously unknown conclusions are drawn as well.

2018 Vol. 31 (1): 37-48 [Abstract] ( 841 ) [HTML 1KB] [ PDF 1811KB] ( 1025 )
49 Location Prediction via Generative Adversarial Network with #br# Spatial Temporal Embedding
KONG Dejiang, TANG Siliang, WU Fei

The wide use of positioning technology makes the mining of the people movements easy and plenty of trajectory data are recorded. How to efficiently handle these data for location prediction is a popular research topic as it is fundamental to location-based services(LBS). The existing methods focus either on long time (days or months) visit prediction(i.e. point of interest recommendation) or on real time location prediction(i.e. trajectory prediction). In this paper, the location prediction problem in weak real time conditions is discussed to predict users′ movement in next minutes or hours. A spatial-temporal long-short term memory model(ST-LSTM) combining spatial-temporal influence into LSTM model naturally is proposed to mitigate the data sparse problem. Furthermore, following the idea of generative adversarial network(GAN) for seq2seq learning, the ST-GAN model is proposed, and it takes the proposed ST-LSTM as the generator and the proposed spatial-temporal convolutional neural network(ST-CNN) as the discriminator. The minimax game of ST-GAN can produce more real enough data to train a better prediction model. The proposed ST-GAN is evaluated on a real world trajectory dataset and the results demonstrate the effectiveness of the proposed model.

2018 Vol. 31 (1): 49-60 [Abstract] ( 724 ) [HTML 1KB] [ PDF 925KB] ( 1378 )
61 Visual Object Tracking: A Survey
LU Huchuan, LI Peixia, WANG Dong

Online object tracking is a fundamental problem in computer vision and it is crucial to application in numerous fileds such as guided missile, video surveillance and unmanned aerial vehicle. Despite many studies on visual tracking, there are still many challenges during the tracking process including illumination variation, rotation, scale change, deformation, occlusion and camera motion. To make a clear understanding of visual tracking, visual tracking algorithms are summarized in this paper. Firstly, the meaning and the related work are briefly introduced. Secondly, the typical algorithms are classified, summarized and analyzed from two aspects: traditional algorithms and deep learning algorithms. Finally, the problems and the prediction of the future of visual tracking are discussed.

2018 Vol. 31 (1): 61-76 [Abstract] ( 2082 ) [HTML 1KB] [ PDF 2074KB] ( 4191 )
77 Research on Educational Data Mining for Online Intelligent Learning
LIU Qi, CHEN Enhong, ZHU Tianyu, HUANG Zhenya, WU Runze, SU Yu, HU Guoping

With the rapid informationization of education, extensive data records from online education of students are accumulated, and it provides a good opportunity for both data-driven educational assessment and intelligent tutoring. However, existing models are hard to accurately analyze the characteristics of questions and the academic levels of students from the massive and sparse data with high noise. Meanwhile, it is difficult for these models to satisfy the personalized needs of students and teachers. In this paper, educational data mining studies on these problems are summarized. To improve the student academic level, these studies focus on modeling three objects in education (i.e., questions, students and teachers) and apply effective techniques, such as personalized recommendation methods, combined with the domain knowledge from education. Specifically, a question text embedding framework is presented for question analysis and question retrieval. Then, personalized recommendation methods on learning resources are illustrated based on the cognitive diagnosis of students. Moreover, the way of providing effective guidance and suggestions for teachers is showed. Some of these research achievements are applied to the online educational system “ZHIXUE” in iFlyTek. Finally, the possible research directions in the future are discussed.

2018 Vol. 31 (1): 77-90 [Abstract] ( 1070 ) [HTML 1KB] [ PDF 2485KB] ( 1743 )
91 Semantic Communications: Outcome of the Intelligence Era
SHI Guangming, LI Yingyu, XIE Xuemei

Communication technology develops greatly, especially in source and channel coding, modulation mechanisms and ultra-wideband communications. From the aspect of signal processing, current technologies have already approached the Shannon capacity. Directions for future researches in traditional data and signal transmission-based communication industry become unclear. According to the recent state-of-the-art artificial intelligence technologies and their influences on the revolution of the communication industry, a new communication mechanism, namely semantic communications, is proposed. Compared with the traditional communication technology based on pattern transmission, the key point of semantic communications is idea-passing communication, and it can also be referred to as content transmission. In such a revolutionary communication mechanism, the transmission is conducted upon ideas instead of data through the construction of a certain knowledge library. The error tolerance of the channel can also be improved via the matching between the idea transmitter and receiver. It could be considered as the artificial intelligence induced communication with its true meaning since it is a brain-resembling communication mechanism. The introduced brain-resembling communication mechanism based on idea-passing significantly reduces the data amount to be transmitted, and it is an efficient way to address the challenge to the communication industry due to the arrival of big data era. In this paper, the tentative idea of semantic communications is presented and the fundamental elements, semantic coding and decoding, future research directions and major challenges in semantic communications are discussed.

2018 Vol. 31 (1): 91-99 [Abstract] ( 1347 ) [HTML 1KB] [ PDF 763KB] ( 2212 )
模式识别与人工智能
 

Supervised by
China Association for Science and Technology
Sponsored by
Chinese Association of Automation
NationalResearchCenter for Intelligent Computing System
Institute of Intelligent Machines, Chinese Academy of Sciences
Published by
Science Press
 
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn