Gradient methods with memory

Nesterov, Yurii;Florea, Mihai
(2022) Optimization Methods and Software — Vol. 37, n° 3, p. 936-953 (2022)

Files

CORE_RP_3230.pdf
  • Open Access
  • Adobe PDF
  • 652.82 KB

Details

Authors
  • Nesterov, Yuriiorcid-logoUCLouvain
    Author
  • Florea, Mihaiorcid-logoUCLouvain
    Author
Abstract
In this paper, we consider gradient methods for minimizing smooth convex functions, which employ the information obtained at the previous iterations in order to accelerate the convergence towards the optimal solution. This information is used in the form of a piece-wise linear model of the objective function, which provides us with much better prediction abilities as compared with the standard linear model. To the best of our knowledge, this approach was never really applied in Convex Minimization to differentiable functions in view of the high complexity of the corresponding auxiliary problems. However, we show that all necessary computations can be done very efficiently. Consequently, we get new optimization methods, which are better than the usual Gradient Methods both in the number of oracle calls and in the computational time. Our theoretical conclusions are confirmed by preliminary computational experiments.
Affiliations

Citations

Nesterov, Y., & Florea, M. (2022). Gradient methods with memory. Optimization Methods and Software, 37(3), 936-953. https://doi.org/10.1080/10556788.2020.1858831 (Original work published 2022)