Skip to main content

Showing 1–3 of 3 results for author: Oldewage, E T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.14901  [pdf, other

    cs.LG stat.ML

    Series of Hessian-Vector Products for Tractable Saddle-Free Newton Optimisation of Neural Networks

    Authors: Elre T. Oldewage, Ross M. Clarke, José Miguel Hernández-Lobato

    Abstract: Despite their popularity in the field of continuous optimisation, second-order quasi-Newton methods are challenging to apply in machine learning, as the Hessian matrix is intractably large. This computational burden is exacerbated by the need to address non-convexity, for instance by modifying the Hessian's eigenvalues as in Saddle-Free Newton methods. We propose an optimisation algorithm which ad… ▽ More

    Submitted 27 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 37 pages, 10 figures, 5 tables. To appear in TMLR. First two authors' order randomised

  2. arXiv:2211.12990  [pdf, other

    cs.LG cs.CR

    Adversarial Attacks are a Surprisingly Strong Baseline for Poisoning Few-Shot Meta-Learners

    Authors: Elre T. Oldewage, John Bronskill, Richard E. Turner

    Abstract: This paper examines the robustness of deployed few-shot meta-learning systems when they are fed an imperceptibly perturbed few-shot dataset. We attack amortized meta-learners, which allows us to craft colluding sets of inputs that are tailored to fool the system's learning algorithm when used as training data. Jointly crafted adversarial inputs might be expected to synergistically manipulate a cla… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted at I Can't Believe It's Not Better Workshop, Neurips 2022

  3. arXiv:2110.10461  [pdf, other

    cs.LG stat.ML

    Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation

    Authors: Ross M. Clarke, Elre T. Oldewage, José Miguel Hernández-Lobato

    Abstract: Machine learning training methods depend plentifully and intricately on hyperparameters, motivating automated strategies for their optimisation. Many existing algorithms restart training for each new hyperparameter choice, at considerable computational cost. Some hypergradient-based one-pass methods exist, but these either cannot be applied to arbitrary optimiser hyperparameters (such as learning… ▽ More

    Submitted 21 April, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 41 pages, 19 figures, 15 tables; minor CIFAR-10 normalisation updates from ICLR 2022 camera-ready version