模式识别与人工智能
Saturday, May. 3, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2022, Vol. 35 Issue (2): 106-115    DOI: 10.16451/j.cnki.issn1003-6059.202202002
Papers and Reports Current Issue| Next Issue| Archive| Adv Search |
AdaBelief Based Heavy-Ball Momentum Method
ZHANG Zedong1, LONG Sheng1, BAO Lei1, TAO Qing1
1. Department of Information Engineering, Army Academy of Artillery and Air Defense of PLA, Hefei 230031

Download: PDF (791 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Adaptive moment estimation algorithms with momentum and adaptive step techniques are widely applied in deep learning. However, these algorithms cannot achieve the optimal performance in both theory and experiment. To solve the problem, an AdaBelief based heavy-ball momentum method, AdaBHB, is proposed. The AdaBelief technique of adjusting step size flexibly is introduced to improve the algorithm performance in experiments. The heavy ball momentum method with step size adjusted by exponential moving average strategy is employed to accelerate convergence. According to the convergence analysis techniques of AdaBelief and Heavy-ball momentum methods, time-varying step size and momentum coefficient are selected skillfully and the momentum term and adaptive matrix are added. It is proved that AdaBHB gains the optimal individual convergence rate for non-smooth general convex optimization problems. Finally, the correctness of the theoretical analysis of the proposed algorithm is verified by experiments on convex optimization problems and deep neural networks, and AdaBHB is validated to obtain the optimal convergence in theory with performance improved.
Key wordsAdaBelief      Heavy-Ball Momentum Method      Individual Convergence Rate      Deep Neural Network     
Received: 24 May 2021     
ZTFLH: TP 181  
Fund:Supported by National Natural Science Foundation of China(No.62076252)
Corresponding Authors: TAO Qing, Ph.D., professor. His research interests include pa-ttern recognition, machine learning and applied mathematics.   
About author:: ZHANG Zedong, master student. His research interests include pattern recognition and machine learning;LONG Sheng, master student. His research interests include pattern recognition and machine learning;BAO Lei, Ph.D., lecturer. Her research interests include pattern recognition and computer vision.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
ZHANG Zedong
LONG Sheng
BAO Lei
TAO Qing
Cite this article:   
ZHANG Zedong,LONG Sheng,BAO Lei等. AdaBelief Based Heavy-Ball Momentum Method[J]. Pattern Recognition and Artificial Intelligence, 2022, 35(2): 106-115.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202202002      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2022/V35/I2/106
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn