Pattern Recognition and Artificial Intelligence

模式识别与人工智能

Home About Journal Editorial Board Instructions Ethics Statement Contact Us 中文

	Judgement and Disposal of Academic Misconduct Article

	Copyright Transfer Agreement

	Proof of Confidentiality

	Requirements for Electronic Version

More....

	Chinese Association of Automation

	National ResearchCenter for Intelligent Computing System

	Institute of Intelligent Machines,Chinese Academy of Sciences

More....

	2026 Vol.39 Issue.4, Published 2026-04-25

	Papers and Reports Surveys and Reviews

	289


		2026 Vol. 39 (4): 289-290 [Abstract] ( 98 ) [HTML 1KB] [ PDF 134KB] ( 126 )

Papers and Reports

	291	Underwater Image Enhancement Network Based on Octave Convolution with Physical Model Constraints
		WU Cheng, LIU Xiaohua, TANG Guijin
		In underwater image enhancement tasks,deep learning struggles with the lack of physical interpretability and the spatial redundancy in feature representation. To address these issues, an underwater image enhancement network based on Octave convolution with physical model constraints（OCPMNet） is proposed. First, an octave feature extraction block（OFEB） is designed to explicitly decompose the input image into a high-frequency branch and a low-frequency branch via octave convolution. The receptive field of the low-frequency branch is expanded through downsampling to capture global degradation, and meanwhile the computational redundancy is effectively reduced. Then, a multi-scale background light module （MBLM） is constructed to extract deep low-frequency features from the bottleneck layer of the network. Three branches are utilized to extract local, neighbor and global background light information, and the information is fused to obtain an estimated map of the ambient background light. Finally, a physics-constrained dual-domain restoration module（PDRM） is introduced. Octave convolution is adopted to fuse terminal features, yielding the estimations of the direct transmission map and the backscatter transmission map. Moreover, an initial enhanced image is reconstructed under the constraints of the Sea-Thru physical model with the background light estimated by MBLM. Subsequently, residual correction is performed simultaneously in both spatial and frequency domains to compensate for the fitting bias of the physical model. Experiments on multiple datasets demonstrate the superiority of OCPMNet in terms of objective metrics and subjective image quality. Furthermore, OCPMNet shows strong real-time processing capability with engineering application value.
		2026 Vol. 39 (4): 291-310 [Abstract] ( 149 ) [HTML 1KB] [ PDF 4289KB] ( 233 )

	311	Transformer Network with Multimodal Attention Perception and Adjacent-Scale Modeling
		SONG Xiaogang, ZHANG Haoze, ZHANG Xiaolong, ZHAO Qin, HEI Xinhong, HE Min
		RGB-D salient object detection aims to identify the most visually attractive objects from paired color images and depth images, and the key challenge is the effective fusion of multimodal and multiscale features. The existing methods still need the improvement in modal complementary information representation, edge detail preservation and utilization of cross-scale association during the fusion of RGB features and depth features. Therefore, a Transformer network with multimodal attention perception and adjacent-scale modeling（MATNet） is proposed. Multilevel RGB features and depth features are extracted by dual-branch pyramid pooling Transformer encoders. A multimodal attention fusion module is introduced at each stage. The modal complementary information representation and semantic consistency in key regions are jointly enhanced by channel attention and spatial attention. Then, an adjacent-scale modeling module is constructed to aggregate adjacent-scale features progressively in a top-down manner. High-level semantic information and low-level edge texture information are fused effectively. The structural integrity and boundary representation capability of salient objects are improved. Finally, an end-to-end detection framework is constructed by combining multi-scale prediction and the supervision mechanism. Experiments on five public datasets demonstrate that MATNet is effective and stable in improving detection accuracy and edge preservation capability.
		2026 Vol. 39 (4): 311-329 [Abstract] ( 143 ) [HTML 1KB] [ PDF 2271KB] ( 163 )

	330	EEG Signal Classificaiton Method Based on Fuzzy Confidence Causal Power and Multi-stream Ensemble Neural Networks
		FAN Min, CHEN Qiuyu, LI Jinhai
		The automatic identification of epileptic electroencephalogram（EEG） signals is of paramount importance for clinical diagnosis. However, high-dimensional redundancy, non-stationarity, and difficulties in cross-scenario generalization are commonly encountered in automatic EEG detection. To address these issues, a EEG signal classificaiton method based on fuzzy confident causal power and multi-stream ensemble neural networks is proposed. First, a feature selection algorithm based on Rényi entropy and fuzzy confident causal power（FCCP） is proposed to mitigate high-dimensional redundancy. Fuzzy Rényi entropy and the theory of confident causal power are introduced to quantify the bidirectional causal effects between features and labels. Key features with strong causal interpretability are selected by utilizing data structure constraints. Second, the formal context and the space of causal power attribute network are constructed to mine underlying topological features. Thus, network-based modeling of unstructured data is achieved. Finally, a multi-stream ensemble neural network（MENN） classifier is developed. The deep fully-connected stream, the wide stream and the narrow-deep stream are integrated to simultaneously capture local details and global dependencies. Multi-scale feature representations are fused through a concatenation layer. The experiments on Bonn and CHB-MIT clinical datasets demonstrate that the proposed method achieves stable and superior performance under multi-task settings, exhibiting significant robustness in distinguishing seizure signals. The effectiveness of the combination of confident causal power-based feature selection with multi-stream integrated learning is verified.
		2026 Vol. 39 (4): 330-347 [Abstract] ( 124 ) [HTML 1KB] [ PDF 926KB] ( 141 )

Surveys and Reviews

	348	Research Progress of Single Object Tracking Based on Transformer
		ZHANG Dawei, XU Dongsheng, YU Zhechen, JIANG Kaiwei, TIAN Weigang, ZHENG Zhonglong
		Single object tracking（SOT） is recognized as one of the fundamental tasks in the field of computer vision. However, traditional correlation filters and Siamese network architectures struggle to meet the growing demands for accuracy and robustness in complex and dynamic environments. Transformer exhibits significant advantages in SOT by virtue of its powerful global modeling capability. Therefore, the recent research advances in Transformer-based SOT are reviewed systematically. Based on the overall pipeline design, existing tracking algorithms can be categorized into two primary types： two-stream two-stage algorithms and one-stream one-stage algorithms. Representative algorithms of each category are analyzed in depth to highlight their relations and characteristics, while the research status of lightweight Transformer-based tracking methods is summarized. In addition, recent emerging trends, such as Mamba-based tracking algorithms and unified model architectures, are further investigated and their promising potential in model efficiency and generalizability is discussed as well. The performance of different Transformer-based tracking methods is comprehensively analyzed and evaluated on multiple mainstream datasets. Finally, several promising directions for future research in SOT, including lightweight models, multimodal fusion, long-term tracking and foundation models-driven tracking, are outlined, providing valuable references for the research and development of SOT.
		2026 Vol. 39 (4): 348-378 [Abstract] ( 153 ) [HTML 1KB] [ PDF 1907KB] ( 211 )

模式识别与人工智能

Supervised by
China Association for Science and Technology
Sponsored by
Chinese Association of Automation
NationalResearchCenter for Intelligent Computing System
Institute of Intelligent Machines, Chinese Academy of Sciences
Published by
Science Press

Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech 　Email:support@magtech.com.cn