Skip to main content

Showing 1–15 of 15 results for author: Triantafillou, E

.
  1. arXiv:2407.00106  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI

    Authors: Ilia Shumailov, Jamie Hayes, Eleni Triantafillou, Guillermo Ortiz-Jimenez, Nicolas Papernot, Matthew Jagielski, Itay Yona, Heidi Howard, Eugene Bagdasaryan

    Abstract: Exact unlearning was first introduced as a privacy mechanism that allowed a user to retract their data from machine learning models on request. Shortly after, inexact schemes were proposed to mitigate the impractical costs associated with exact unlearning. More recently unlearning is often discussed as an approach for removal of impermissible knowledge i.e. knowledge that the model should not poss… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  2. arXiv:2406.09073  [pdf, other

    cs.LG

    Are we making progress in unlearning? Findings from the first NeurIPS unlearning competition

    Authors: Eleni Triantafillou, Peter Kairouz, Fabian Pedregosa, Jamie Hayes, Meghdad Kurmanji, Kairan Zhao, Vincent Dumoulin, Julio Jacques Junior, Ioannis Mitliagkas, Jun Wan, Lisheng Sun Hosoya, Sergio Escalera, Gintare Karolina Dziugaite, Peter Triantafillou, Isabelle Guyon

    Abstract: We present the findings of the first NeurIPS competition on unlearning, which sought to stimulate the development of novel algorithms and initiate discussions on formal and robust evaluation methodologies. The competition was highly successful: nearly 1,200 teams from across the world participated, and a wealth of novel, imaginative solutions with different characteristics were contributed. In thi… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.01257  [pdf, other

    cs.LG

    What makes unlearning hard and what to do about it

    Authors: Kairan Zhao, Meghdad Kurmanji, George-Octavian Bărbulescu, Eleni Triantafillou, Peter Triantafillou

    Abstract: Machine unlearning is the problem of removing the effect of a subset of training data (the ''forget set'') from a trained model without damaging the model's utility e.g. to comply with users' requests to delete their data, or remove mislabeled, poisoned or otherwise problematic data. With unlearning research still being at its infancy, many fundamental open questions exist: Are there interpretable… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.10425  [pdf, other

    cs.LG

    Data Selection for Transfer Unlearning

    Authors: Nazanin Mohammadi Sepahvand, Vincent Dumoulin, Eleni Triantafillou, Gintare Karolina Dziugaite

    Abstract: As deep learning models are becoming larger and data-hungrier, there are growing ethical, legal and technical concerns over use of data: in practice, agreements on data use may change over time, rendering previously-used training data impermissible for training purposes. These issues have driven increased attention to machine unlearning: removing "the influence of" a subset of training data from a… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  5. arXiv:2404.16436  [pdf

    cs.SD cs.AI cs.LG eess.AS

    Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics

    Authors: Ben Williams, Bart van Merriënboer, Vincent Dumoulin, Jenny Hamer, Eleni Triantafillou, Abram B. Fleishman, Matthew McKown, Jill E. Munger, Aaron N. Rice, Ashlee Lillis, Clemency E. White, Catherine A. D. Hobbs, Tries B. Razak, Kate E. Jones, Tom Denton

    Abstract: Machine learning has the potential to revolutionize passive acoustic monitoring (PAM) for ecological assessments. However, high annotation and compute costs limit the field's efficacy. Generalizable pretrained networks can overcome these costs, but high-quality pretraining requires vast annotated libraries, limiting its current applicability primarily to bird taxa. Here, we identify the optimum pr… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures

  6. arXiv:2403.01218  [pdf, other

    cs.LG cs.CR

    Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy

    Authors: Jamie Hayes, Ilia Shumailov, Eleni Triantafillou, Amr Khalifa, Nicolas Papernot

    Abstract: The high cost of model training makes it increasingly desirable to develop techniques for unlearning. These techniques seek to remove the influence of a training example without having to retrain the model from scratch. Intuitively, once a model has unlearned, an adversary that interacts with the model should no longer be able to tell whether the unlearned example was included in the model's train… ▽ More

    Submitted 21 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  7. arXiv:2312.07439  [pdf, other

    cs.LG

    BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics

    Authors: Jenny Hamer, Eleni Triantafillou, Bart van Merriënboer, Stefan Kahl, Holger Klinck, Tom Denton, Vincent Dumoulin

    Abstract: The ability for a machine learning model to cope with differences in training and deployment conditions--e.g. in the presence of distribution shift or the generalization to new classes altogether--is crucial for real-world use cases. However, most empirical work in this area has focused on the image domain with artificial benchmarks constructed to measure individual aspects of generalization. We p… ▽ More

    Submitted 13 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  8. arXiv:2311.17276  [pdf, other

    cs.DB

    Machine Unlearning in Learned Databases: An Experimental Analysis

    Authors: Meghdad Kurmanji, Eleni Triantafillou, Peter Triantafillou

    Abstract: Machine learning models based on neural networks (NNs) are enjoying ever-increasing attention in the DB community. However, an important issue has been largely overlooked, namely the challenge of dealing with the highly dynamic nature of DBs, where data updates are fundamental, highly-frequent operations. Although some recent research has addressed the issues of maintaining updated NN models in th… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted as a conference paper at SIGMOD 2024

  9. arXiv:2302.09880  [pdf, other

    cs.LG cs.CR

    Towards Unbounded Machine Unlearning

    Authors: Meghdad Kurmanji, Peter Triantafillou, Jamie Hayes, Eleni Triantafillou

    Abstract: Deep machine unlearning is the problem of `removing' from a trained neural network a subset of its training set. This problem is very timely and has many applications, including the key tasks of removing biases (RB), resolving confusion (RC) (caused by mislabelled data in trained models), as well as allowing users to exercise their `right to be forgotten' to protect User Privacy (UP). This paper i… ▽ More

    Submitted 30 October, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

  10. arXiv:2302.06658  [pdf, other

    cs.LG

    In Search for a Generalizable Method for Source Free Domain Adaptation

    Authors: Malik Boudiaf, Tom Denton, Bart van Merriënboer, Vincent Dumoulin, Eleni Triantafillou

    Abstract: Source-free domain adaptation (SFDA) is compelling because it allows adapting an off-the-shelf model to a new domain using only unlabelled data. In this work, we apply existing SFDA techniques to a challenging set of naturally-occurring distribution shifts in bioacoustics, which are very different from the ones commonly studied in computer vision. We find existing methods perform differently relat… ▽ More

    Submitted 24 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  11. arXiv:2105.07029  [pdf, other

    cs.LG cs.CV

    Learning a Universal Template for Few-shot Dataset Generalization

    Authors: Eleni Triantafillou, Hugo Larochelle, Richard Zemel, Vincent Dumoulin

    Abstract: Few-shot dataset generalization is a challenging variant of the well-studied few-shot classification problem where a diverse training set of several datasets is given, for the purpose of training an adaptable model that can then learn classes from new datasets using only a few examples. To this end, we propose to utilize the diverse training set to construct a universal template: a partial model t… ▽ More

    Submitted 21 June, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

  12. arXiv:2012.05895  [pdf, other

    cs.LG cs.CV stat.ML

    Probing Few-Shot Generalization with Attributes

    Authors: Mengye Ren, Eleni Triantafillou, Kuan-Chieh Wang, James Lucas, Jake Snell, Xaq Pitkow, Andreas S. Tolias, Richard Zemel

    Abstract: Despite impressive progress in deep learning, generalizing far beyond the training distribution is an important open challenge. In this work, we consider few-shot classification, and aim to shed light on what makes some novel classes easier to learn than others, and what types of learned representations generalize better. To this end, we define a new paradigm in terms of attributes -- simple build… ▽ More

    Submitted 30 May, 2022; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: Technical report, 26 pages

  13. arXiv:1903.03096  [pdf, other

    cs.LG stat.ML

    Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

    Authors: Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle

    Abstract: Few-shot classification refers to learning a classifier for new classes given only a few examples. While a plethora of models have emerged to tackle it, we find the procedure and datasets that are used to assess their progress lacking. To address this limitation, we propose Meta-Dataset: a new benchmark for training and evaluating models that is large-scale, consists of diverse datasets, and prese… ▽ More

    Submitted 8 April, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: Code available at https://github.com/google-research/meta-dataset

    Journal ref: International Conference on Learning Representations (2020)

  14. arXiv:1803.00676  [pdf, other

    cs.LG cs.CV stat.ML

    Meta-Learning for Semi-Supervised Few-Shot Classification

    Authors: Mengye Ren, Eleni Triantafillou, Sachin Ravi, Jake Snell, Kevin Swersky, Joshua B. Tenenbaum, Hugo Larochelle, Richard S. Zemel

    Abstract: In few-shot classification, we are interested in learning algorithms that train a classifier from only a handful of labeled examples. Recent progress in few-shot classification has featured meta-learning, in which a parameterized model for a learning algorithm is defined and trained on episodes representing different classification problems, each with a small labeled training set and its correspon… ▽ More

    Submitted 1 March, 2018; originally announced March 2018.

    Comments: Published as a conference paper at ICLR 2018. 15 pages

  15. arXiv:1707.02610  [pdf, other

    cs.LG

    Few-Shot Learning Through an Information Retrieval Lens

    Authors: Eleni Triantafillou, Richard Zemel, Raquel Urtasun

    Abstract: Few-shot learning refers to understanding new concepts from only a few examples. We propose an information retrieval-inspired approach for this problem that is motivated by the increased importance of maximally leveraging all the available information in this low-data regime. We define a training objective that aims to extract as much information as possible from each training batch by effectively… ▽ More

    Submitted 14 November, 2017; v1 submitted 9 July, 2017; originally announced July 2017.