Skip to main content

Showing 1–5 of 5 results for author: Castera, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18222  [pdf, other

    cs.LG math.OC

    From Learning to Optimize to Learning Optimization Algorithms

    Authors: Camille Castera, Peter Ochs

    Abstract: Towards designing learned optimization algorithms that are usable beyond their training setting, we identify key principles that classical algorithms obey, but have up to now, not been used for Learning to Optimize (L2O). Following these principles, we provide a general design pipeline, taking into account data, architecture and learning strategy, and thereby enabling a synergy between classical o… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2311.10053  [pdf, other

    math.OC cs.LG math.DS

    Near-optimal Closed-loop Method via Lyapunov Dam** for Convex Optimization

    Authors: Severin Maier, Camille Castera, Peter Ochs

    Abstract: We introduce an autonomous system with closed-loop dam** for first-order convex optimization. While, to this day, optimal rates of convergence are almost exclusively achieved by non-autonomous methods via open-loop dam** (e.g., Nesterov's algorithm), we show that our system, featuring a closed-loop dam**, exhibits a rate arbitrarily close to the optimal one. We do so by coupling the dam**… ▽ More

    Submitted 15 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  3. Inertial Newton Algorithms Avoiding Strict Saddle Points

    Authors: Camille Castera

    Abstract: We study the asymptotic behavior of second-order algorithms mixing Newton's method and inertial gradient descent in non-convex landscapes. We show that, despite the Newtonian behavior of these methods, they almost always escape strict saddle points. We also evidence the role played by the hyper-parameters of these methods in their qualitative behavior near critical points. The theoretical results… ▽ More

    Submitted 12 February, 2024; v1 submitted 8 November, 2021; originally announced November 2021.

    Journal ref: Journal of Optimization Theory and Applications (2023) 199(12):881--903

  4. Second-order step-size tuning of SGD for non-convex optimization

    Authors: Camille Castera, Jérôme Bolte, Cédric Févotte, Edouard Pauwels

    Abstract: In view of a direct and simple improvement of vanilla SGD, this paper presents a fine-tuning of its step-sizes in the mini-batch case. For doing so, one estimates curvature, based on a local quadratic model and using only noisy gradient approximations. One obtains a new stochastic first-order method (Step-Tuned SGD), enhanced by second-order information, which can be seen as a stochastic version o… ▽ More

    Submitted 21 November, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: To appear in Neural Processing Letters (accepted Nov. 2021)

    Journal ref: Neural Processing Letters (2022)

  5. arXiv:1905.12278  [pdf, other

    cs.LG math.OC stat.ML

    An Inertial Newton Algorithm for Deep Learning

    Authors: Camille Castera, Jérôme Bolte, Cédric Févotte, Edouard Pauwels

    Abstract: We introduce a new second-order inertial optimization method for machine learning called INNA. It exploits the geometry of the loss function while only requiring stochastic approximations of the function values and the generalized gradients. This makes INNA fully implementable and adapted to large-scale optimization problems such as the training of deep neural networks. The algorithm combines both… ▽ More

    Submitted 28 July, 2021; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: To appear in Journal of Machine Learning Research (JMLR), Volume 22, acceptance date: 5/21

    Journal ref: Journal of Machine Learning Research (JMLR), v22(134):1-31, 2021