Skip to main content

Showing 1–10 of 10 results for author: Esperança, P M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02987  [pdf, other

    cs.LG cs.AI cs.CL

    LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models

    Authors: Hayder Elesedy, Pedro M. Esperança, Silviu Vlad Oprea, Mete Ozay

    Abstract: Guardrails have emerged as an alternative to safety alignment for content moderation of large language models (LLMs). Existing model-based guardrails have not been designed for resource-constrained computational portable devices, such as mobile phones, more and more of which are running LLM-based applications locally. We introduce LoRA-Guard, a parameter-efficient guardrail adaptation method that… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2203.08887  [pdf, other

    stat.ML cs.AI cs.LG

    On Redundancy and Diversity in Cell-based Neural Architecture Search

    Authors: Xingchen Wan, Binxin Ru, Pedro M. Esperança, Zhenguo Li

    Abstract: Searching for the architecture cells is a dominant paradigm in NAS. However, little attention has been devoted to the analysis of the cell-based search spaces even though it is highly important for the continual development of NAS. In this work, we conduct an empirical post-hoc analysis of architectures from the popular cell-based search spaces and find that the existing search spaces contain a hi… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: ICLR 2022. 10 pages, 10 figures, 2 tables (25 pages, 34 figures, 3 tables including references and appendices)

  3. arXiv:2112.06741  [pdf, other

    cs.CV cs.LG

    Long-tail Recognition via Compositional Knowledge Transfer

    Authors: Sarah Parisot, Pedro M. Esperanca, Steven McDonagh, Tamas J. Madarasz, Yongxin Yang, Zhenguo Li

    Abstract: In this work, we introduce a novel strategy for long-tail recognition that addresses the tail classes' few-shot problem via training-free knowledge transfer. Our objective is to transfer knowledge acquired from information-rich common classes to semantically similar, and yet data-hungry, rare classes in order to obtain stronger tail class representations. We leverage the fact that class prototypes… ▽ More

    Submitted 12 April, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022

  4. arXiv:2111.04670  [pdf, other

    cs.LG cs.CV stat.ML

    Approximate Neural Architecture Search via Operation Distribution Learning

    Authors: Xingchen Wan, Binxin Ru, Pedro M. Esperança, Fabio M. Carlucci

    Abstract: The standard paradigm in Neural Architecture Search (NAS) is to search for a fully deterministic architecture with specific operations and connections. In this work, we instead propose to search for the optimal operation distribution, thus providing a stochastic and approximate solution, which can be used to sample architectures of arbitrary length. We propose and show, that given an architectural… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: WACV 2022. 10 pages, 3 figures and 5 tables (15 pages, 7 figures and 6 tables including appendices)

  5. arXiv:2111.03555  [pdf, other

    cs.LG

    AUTOKD: Automatic Knowledge Distillation Into A Student Architecture Family

    Authors: Roy Henha Eyono, Fabio Maria Carlucci, Pedro M Esperança, Binxin Ru, Phillip Torr

    Abstract: State-of-the-art results in deep learning have been improving steadily, in good part due to the use of larger models. However, widespread use is constrained by device hardware limitations, resulting in a substantial performance gap between state-of-the-art models and those that can be effectively deployed on small devices. While Knowledge Distillation (KD) theoretically enables small student model… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 12 pages, 8 figures

  6. arXiv:1912.12522  [pdf, other

    cs.LG cs.CV stat.ML

    NAS evaluation is frustratingly hard

    Authors: Antoine Yang, Pedro M. Esperança, Fabio M. Carlucci

    Abstract: Neural Architecture Search (NAS) is an exciting new field which promises to be as much as a game-changer as Convolutional Neural Networks were in 2012. Despite many great works leading to substantial improvements on a variety of tasks, comparison between different methods is still very much an open issue. While most algorithms are tested on the same datasets, there is no shared experimental protoc… ▽ More

    Submitted 13 February, 2020; v1 submitted 28 December, 2019; originally announced December 2019.

    Comments: Published as a conference paper at ICLR2020; 13 pages; 10 figures

  7. arXiv:1909.01051  [pdf, other

    cs.CV cs.LG cs.MA

    MANAS: Multi-Agent Neural Architecture Search

    Authors: Vasco Lopes, Fabio Maria Carlucci, Pedro M Esperança, Marco Singh, Victor Gabillon, Antoine Yang, Hang Xu, Zewei Chen, Jun Wang

    Abstract: The Neural Architecture Search (NAS) problem is typically formulated as a graph search problem where the goal is to learn the optimal operations over edges in order to maximise a graph-level global objective. Due to the large architecture parameter space, efficiency is a key bottleneck preventing NAS from its practical use. In this paper, we address the issue by framing NAS as a multi-agent proble… ▽ More

    Submitted 12 January, 2023; v1 submitted 3 September, 2019; originally announced September 2019.

  8. arXiv:1703.00839  [pdf, other

    stat.ML cs.LG

    Encrypted accelerated least squares regression

    Authors: Pedro M. Esperança, Louis J. M. Aslett, Chris C. Holmes

    Abstract: Information that is stored in an encrypted format is, by definition, usually not amenable to statistical analysis or machine learning methods. In this paper we present detailed analysis of coordinate and accelerated gradient descent algorithms which are capable of fitting least squares and penalised ridge regression models, using data encrypted under a fully homomorphic encryption scheme. Gradient… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.

    Comments: Accepted for AISTATS 2017

  9. arXiv:1508.06845  [pdf, other

    stat.ML cs.CR cs.LG stat.ME

    Encrypted statistical machine learning: new privacy preserving methods

    Authors: Louis J. M. Aslett, Pedro M. Esperança, Chris C. Holmes

    Abstract: We present two new statistical machine learning methods designed to learn on fully homomorphic encrypted (FHE) data. The introduction of FHE schemes following Gentry (2009) opens up the prospect of privacy preserving statistical machine learning analysis and modelling of encrypted data without compromising security constraints. We propose tailored algorithms for applying extremely random forests,… ▽ More

    Submitted 27 August, 2015; originally announced August 2015.

    Comments: 39 pages

  10. arXiv:1508.06574  [pdf, ps, other

    stat.ML cs.CR cs.LG

    A review of homomorphic encryption and software tools for encrypted statistical machine learning

    Authors: Louis J. M. Aslett, Pedro M. Esperança, Chris C. Holmes

    Abstract: Recent advances in cryptography promise to enable secure statistical computation on encrypted data, whereby a limited set of operations can be carried out without the need to first decrypt. We review these homomorphic encryption schemes in a manner accessible to statisticians and machine learners, focusing on pertinent limitations inherent in the current state of the art. These limitations restric… ▽ More

    Submitted 26 August, 2015; originally announced August 2015.

    Comments: 21 pages, technical report