Algorithms, Vol. 15, No. 1 (2022)
ARTICLE
TITLE

Accelerating Symmetric Rank-1 Quasi-Newton Method with Nesterov's Gradient for Training Neural Networks

S. Indrapriyadarsini    
Shahrzad Mahboubi    
Hiroshi Ninomiya    
Takeshi Kamio and Hideki Asai    

Abstract

Gradient-based methods are widely used to train neural networks and can be broadly categorized into first-order and second-order methods. Second-order methods have been shown to converge better than first-order methods, especially on highly nonlinear problems. The BFGS quasi-Newton method is the most commonly studied second-order method for neural network training. Recent methods have been shown to speed up the convergence of the BFGS method using Nesterov's accelerated gradient and momentum terms. The Symmetric Rank-1 (SR1) quasi-Newton method, though less commonly used in training neural networks, is known to have interesting properties and to provide good Hessian approximations when used with a trust-region approach. Thus, this paper investigates accelerating the SR1 quasi-Newton method with Nesterov's gradient for training neural networks and briefly discusses its convergence. The performance of the proposed method is evaluated on a function approximation problem and an image classification problem.
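To make the idea in the abstract concrete, the following is a minimal sketch of an SR1 update combined with a Nesterov-style look-ahead gradient. It is not the authors' algorithm: the abstract mentions a trust-region approach, whereas this sketch assumes a unit step, a dense inverse-Hessian approximation, and a small convex quadratic as the objective. The names `sr1_update`, `nesterov_sr1`, and `quad_grad`, the momentum value `mu=0.8`, and the tolerance are all hypothetical choices made for illustration. The skip rule with `r=1e-8` is the commonly used safeguard against a near-zero SR1 denominator.

```python
import math

def mat_vec(M, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * xj for m, xj in zip(row, x)) for row in M]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sr1_update(H, s, y, r=1e-8):
    """SR1 update of an inverse-Hessian approximation H.

    H_new = H + u u^T / (u^T y), with u = s - H y.
    The update is skipped when the denominator is too small.
    """
    Hy = mat_vec(H, y)
    u = [si - hyi for si, hyi in zip(s, Hy)]
    denom = dot(u, y)
    if abs(denom) <= r * math.sqrt(dot(y, y)) * math.sqrt(dot(u, u)):
        return H
    n = len(u)
    return [[H[i][j] + u[i] * u[j] / denom for j in range(n)] for i in range(n)]

def nesterov_sr1(grad, w0, mu=0.8, max_iter=50, tol=1e-10):
    """Assumed simplification: unit step size, no trust region."""
    n = len(w0)
    w = list(w0)
    v = [0.0] * n
    H = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(max_iter):
        # Nesterov look-ahead point: the gradient is evaluated at w + mu * v.
        w_hat = [wi + mu * vi for wi, vi in zip(w, v)]
        g_hat = grad(w_hat)
        Hg = mat_vec(H, g_hat)
        v = [mu * vi - hgi for vi, hgi in zip(v, Hg)]
        w = [wi + vi for wi, vi in zip(w, v)]
        g_new = grad(w)
        if math.sqrt(dot(g_new, g_new)) < tol:
            break
        # Curvature pair measured from the look-ahead point.
        s = [a - b for a, b in zip(w, w_hat)]
        y = [a - b for a, b in zip(g_new, g_hat)]
        H = sr1_update(H, s, y)
    return w

def quad_grad(w):
    """Gradient of 0.5 * w^T A w - b^T w for a fixed 2x2 test problem."""
    A = [[3.0, 1.0], [1.0, 2.0]]
    b = [1.0, 1.0]
    return [ax - bi for ax, bi in zip(mat_vec(A, w), b)]

w_min = nesterov_sr1(quad_grad, [0.0, 0.0])
```

For this test quadratic the exact minimizer solves A w = b, which gives w = (0.2, 0.4), so `w_min` can be checked against that point. On non-quadratic objectives such as neural network losses, an SR1 approximation can become indefinite, which is one reason a trust-region safeguard like the one the abstract refers to is normally used instead of the unit step assumed here.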

Similar articles

       
 
Diya Wang, Yonglin Zhang, Lixin Wu, Yupeng Tai, Haibin Wang, Jun Wang, Fabrice Meriaudeau and Fan Yang    
In recent years, the study of deep learning techniques for underwater acoustic channel estimation has gained widespread attention. However, existing neural network channel estimation methods often overfit to training dataset noise levels, leading to dimi...

 
Mark A. Denisenko, Alina S. Isaeva, Alexander S. Sinyukin and Andrey V. Kovalev    
The fast, convenient, and accurate determination of railroad cars' load mass is critical to ensure safety and allow asset counting in railway infrastructure. In this paper, we propose a method for modeling the mechanical deformations that occur in the ra...
Journal: Infrastructures

 
Seokjoon Kwon, Jae-Hyeon Park, Hee-Deok Jang, Hyunwoo Nam and Dong Eui Chang    
Deep learning algorithms are widely used for pattern recognition in electronic noses, which are sensor arrays for gas mixtures. One of the challenges of using electronic noses is sensor drift, which can degrade the accuracy of the system over time, even ...
Journal: Applied Sciences

 
Károly Héberger    
Background: The development and application of machine learning (ML) methods have become so fast that almost nobody can follow their developments in every detail. It is no wonder that numerous errors and inconsistencies in their usage have also spread wi...
Journal: Algorithms

 
Sheng Zhang, Guangzhong Liu and Chen Cheng    
Over the past few decades, unmanned surface vehicles (USV) have drawn a lot of attention. But because of the wind, waves, currents, and other sporadic disturbances, it is challenging to understand and collect correct data about USV dynamics. In this pape...