Abstract:To improve the modeling capability of image classification networks for critical information and enhance the completeness and discriminability of feature representation, a dual wavelet-enhanced network for image classification(DWENet) is proposed. First, a wavelet-gated convolution module is constructed during the shallow feature extraction stage of the backbone network. By integrating wavelet frequency domain decomposition and a gating mechanism, edge and texture information are effectively captured with key details preserved and redundant noise suppressed. Second, a wavelet kernel attention module is introduced after the max-pooling layer. Frequency awareness and large receptive field spatial modeling are integrated to enhance structural representation and long-range dependency perception. As a result, the information loss caused by pooling operations is compensated. Furthermore, a dual-path feature enhancement module(DFEM) is incorporated into the deep network layers. DFEM enhances mid-and high-level semantic representation and key region responsiveness by reorganizing features through spatial-frequency decomposition and incorporating channel attention mechanisms. While maintaining computational efficiency, DWENet effectively mitigates core issues such as shallow feature decay and limited spatial perception. Experiments on CIFAR-10, CIFAR-100, SVHN, Imagenette, and Imagewoof datasets demonstrate that DWENet improves classification accuracy.
[1] LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-Based Lear-ning Applied to Document Recognition. Proceedings of the IEEE, 1998, 86(11): 2278-2324. [2] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet Classification with Deep Convolutional Neural Networks// Proc of the 25th International Conference on Neural Information Processing Systems. Cambridge, USA: MIT Press, 2012: 1097-1105. [3] SIMONYAN K, ZISSERMAN A. Very Deep Convolutional Networks for Large-Scale Image Recognition[C/OL]. [2025-07-19]. https://arxiv.org/pdf/1409.1556. [4] SZEGEDY C, LIU W, JIA Y Q, et al. Going Deeper with Convolutions// Proc of the IEEE Conference on Computer Vision and Pa-ttern Recognition. Washington, USA: IEEE, 2015. DOI: 10.1109/CVPR.2015.7298594. [5] HE K M, ZHANG X Y, REN S Q, et al. Deep Residual Learning for Image Recognition// Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 770-778. [6] FUJIEDA S, TAKAYAMA K, HACHISUKA T. Wavelet Convolutional Neural Network[C/OL]. [2025-07-19]. https://arxiv.org/pdf/1805.08620. [7] YANG K Z, HU T, DAI K X, et al. CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement Task// Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Washington, USA: IEEE, 2024: 6086-6096. [8] CHEN L W, FU Y, GU L, et al. Frequency-Aware Feature Fusion for Dense Image Prediction. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(12): 10763-10780. [9] HU J, SHEN L, SUN G.Squeeze-and-Excitation Networks// Proc of the IEEE/CVF Conference on Computer Vision and Pattern Re-cognition. Washington, USA: IEEE, 2018: 7132-7141. [10] WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional Block Attention Module// Proc of the European Conference on Computer Vision. Berlin, Germany: Springer. 2018: 3-19. [11] QIN Z Q, ZHANG P Y, WU F, et al. FcaNet: Frequency Cha-nnel Attention Networks// Proc of the IEEE/CVF International Conference on Computer Vision. Washington,USA: IEEE, 2021: 763-772. [12] YEE T Z, QASEM A. MEGANet-W: A Wavelet-Driven Edge-Guided Attention Framework for Weak Boundary Polyp Detection[C/OL]. [2025-07-19]. https://arxiv.org/pdf/2507.02668. [13] JING Y X, ZHAO J F, ZHANG T P, et al. SWAN: Synergistic Wavelet-Attention Network for Infrared Small Target Detection[C/OL].[2025-07-19]. https://arxiv.org/pdf/2508.01322. [14] HUANG J, HUANG R, XU J H, et al. Wavelet-Assisted Multi-frequency Attention Network for Pansharpening. Proceedings of the AAAI Conference on Artificial Intelligence, 2025, 39(4): 3662-3670. [15] FINDER S E, AMOYAL R, TREISTER E, et al. Wavelet Convolutions for Large Receptive Fields// Proc of the European Confe-rence on Computer Vision. Berlin, Germany: Springer, 2024: 363-380. [16] GUO M H, LU C Z, LIU Z N, et al. Visual Attention Network. Computational Visual Media, 2023, 9(4): 733-752. [17] TAN M X, LE Q V.EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks// Proc of the 36th International Conference on Machine Learning. San Diego, USA: JMLR, 2019: 6105-6114. [18] HAN K, WANG Y H, TIAN Q, et al. GhostNet: More Features from Cheap Operations// Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2020: 1577-1586. [19] HUANG G, LIU Z, VAN DER MAATEN L, et al. Densely Connected Convolutional Networks// Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2017: 2261-2269. [20] ZAGORUYKO S, KOMODAKIS N. Wide Residual Networks[C/OL]. [2025-07-19]. https://arxiv.org/pdf/1605.07146. [21] LAN H, WANG X H, WEI X. Couplformer: Rethinking Vision Transformer with Coupling Attention Map[C/OL]. [2025-07-19]. https://arxiv.org/pdf/2112.05425. [22] CHOROMANSKI K, LIKHOSHERSTOV V, DOHAN D, et al. Rethinking Attention with Performers[C/OL].[2025-07-19]. https://arxiv.org/pdf/2009.14794. [23] ABDI M, NAHAVANDI S. Multi-residual Networks: Improving the Speed and Accuracy of Residual Networks[C/OL]. [2025-07-19]. https://arxiv.org/pdf/1609.05672. [24] WU X D, GAO S Q, ZHANG Z Y, et al. Auto-Train-Once: Controller Network Guided Automatic Network Pruning from Scratch// Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2024: 16163-16173. [25] 赵琳琳. 双分支多注意力机制的锐度感知图像分类网络.硕士学位论文.阜新:辽宁工程技术大学, 2024. (ZHAO L L. Sharpness-Aware Image Classification Network of Dual-Branch Multi-attention Mechanism. Master Dissertation. FuxinFuxin, China: Liaoning Technical University, 2024.) [26] SANDLER M, HOWARD A, ZHU M L, et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks// Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 4510-4520. [27] FU Y, LIU H R, ZOU Y H, et al. Category-Level Band Learning-Based Feature Extraction for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, 2023, 62. DOI: 10.1109/TGRS.2023.3340517. [28] BUI N T, HOANG D H, NGUYEN Q T, et al. MEGANet: Multi-scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation// Proc of the IEEE/CVF Winter Conference on Applications of Computer Vision. Washington, USA: IEEE, 2024: 7970-7979. [29] OUYANG D L, HE S, ZHANG G Z, et al. Efficient Multi-scale Attention Module with Cross-Spatial Learning// Proc of the IEEE International Conference on Acoustics, Speech and Signal Proce-ssing. Washington, USA: IEEE, 2023. DOI: 10.1109/ICASSP49357.2023.10096516. [30] XIANG X, WANG Z H, ZHANG J, et al. AGCA: An Adaptive Graph Channel Attention Module for Steel Surface Defect Detection. IEEE Transactions on Instrumentation and Measurement, 2023, 72. DOI: 10.1109/TIM.2023.3248111.