Skip to main content

Showing 1–3 of 3 results for author: Seccia, R

.
  1. arXiv:2212.01848  [pdf, other

    math.OC cs.LG

    Convergence of ease-controlled Random Reshuffling gradient Algorithms under Lipschitz smoothness

    Authors: Ruggiero Seccia, Corrado Coppola, Giampaolo Liuzzi, Laura Palagi

    Abstract: In this work, we consider minimizing the average of a very large number of smooth and possibly non-convex functions, and we focus on two widely used minibatch frameworks to tackle this optimization problem: Incremental Gradient (IG) and Random Reshuffling (RR). We define ease-controlled modifications of the IG/RR schemes, which require a light additional computational effort {but} can be proved to… ▽ More

    Submitted 20 May, 2024; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: Add author, add references, correct typos, improve imoplementation

    MSC Class: 90.C.XX ACM Class: G.4.1

  2. Block Layer Decomposition schemes for training Deep Neural Networks

    Authors: Laura Palagi, Ruggiero Seccia

    Abstract: Deep Feedforward Neural Networks' (DFNNs) weights estimation relies on the solution of a very large nonconvex optimization problem that may have many local (no global) minimizers, saddle points and large plateaus. As a consequence, optimization algorithms can be attracted toward local minimizers which can lead to bad solutions or can slow down the optimization process. Furthermore, the time needed… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: 23 pages. J Glob Optim (2019)

  3. A gray-box approach for curriculum learning

    Authors: Francesco Foglino, Matteo Leonetti, Simone Sagratella, Ruggiero Seccia

    Abstract: Curriculum learning is often employed in deep reinforcement learning to let the agent progress more quickly towards better behaviors. Numerical methods for curriculum learning in the literature provides only initial heuristic solutions, with little to no guarantee on their quality. We define a new gray-box function that, including a suitable scheduling problem, can be effectively used to reformula… ▽ More

    Submitted 16 June, 2019; originally announced June 2019.

    Comments: 10 pages, 1 figure

    Journal ref: Optimization of Complex Systems: Theory, Models, Algorithms and Applications, 2020, pp 720-729