Showing 1–27 of 27 results for author: Louizos, C

  1. arXiv:2405.07925  [pdf, other]

    cs.LG cs.AI cs.DC

    Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data

    Authors: Mahdi Morafah, Matthias Reisser, Bill Lin, Christos Louizos

    Abstract: The proliferation of edge devices has brought Federated Learning (FL) to the forefront as a promising paradigm for decentralized and collaborative model training while preserving the privacy of clients' data. However, FL struggles with a significant performance reduction and poor convergence when confronted with Non-Independent and Identically Distributed (Non-IID) data distributions among partici…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: International Workshop on Federated Foundation Models for the Web 2024 (FL@FM-TheWebConf'24)

  2. arXiv:2405.02140  [pdf, other]

    cs.LG cs.IT stat.ML

    An Information Theoretic Perspective on Conformal Prediction

    Authors: Alvaro H. C. Correia, Fabio Valerio Massoli, Christos Louizos, Arash Behboodi

    Abstract: Conformal Prediction (CP) is a distribution-free uncertainty estimation framework that constructs prediction sets guaranteed to contain the true answer with a user-specified probability. Intuitively, the size of the prediction set encodes a general notion of uncertainty, with larger sets associated with higher degrees of uncertainty. In this work, we leverage information theory to connect conforma…

    Submitted 26 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2405.02081  [pdf, other]

    cs.LG

    A Mutual Information Perspective on Federated Contrastive Learning

    Authors: Christos Louizos, Matthias Reisser, Denis Korzhenkov

    Abstract: We investigate contrastive learning in the federated setting through the lens of SimCLR and multi-view mutual information maximization. In doing so, we uncover a connection between contrastive representation learning and user verification; by adding a user verification loss to each client's local SimCLR loss we recover a lower bound to the global multi-view mutual information. To accommodate for t…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Published as a conference paper at ICLR 2024

  4. arXiv:2404.13381  [pdf, other]

    cs.LG cs.CR cs.MA q-bio.PE

    DNA: Differentially private Neural Augmentation for contact tracing

    Authors: Rob Romijnders, Christos Louizos, Yuki M. Asano, Max Welling

    Abstract: The COVID19 pandemic had enormous economic and societal consequences. Contact tracing is an effective way to reduce infection rates by detecting potential virus carriers early. However, this was not generally adopted in the recent pandemic, and privacy concerns are cited as the most important reason. We substantially improve the privacy guarantees of the current state of the art in decentralized c…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: Privacy Regulation and Protection in Machine Learning Workshop at ICLR 2024

  5. arXiv:2402.16848  [pdf, other]

    cs.LG

    InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning

    Authors: Babak Ehteshami Bejnordi, Gaurav Kumar, Amelie Royer, Christos Louizos, Tijmen Blankevoort, Mohsen Ghafoorian

    Abstract: Jointly learning multiple tasks with a unified model can improve accuracy and data efficiency, but it faces the challenge of task interference, where optimizing one task objective may inadvertently compromise the performance of another. A solution to mitigate this issue is to allocate task-specific parameters, free from interference, on top of shared features. However, manually designing such arch…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Under review

  6. arXiv:2401.02609  [pdf, other]

    cs.IT

    Importance Matching Lemma for Lossy Compression with Side Information

    Authors: Buu Phan, Ashish Khisti, Christos Louizos

    Abstract: We propose two extensions to existing importance sampling based methods for lossy compression. First, we introduce an importance sampling based compression scheme that is a variant of ordered random coding (Theis and Ahmed, 2022) and is amenable to direct evaluation of the achievable compression rate for a finite number of samples. Our second and major contribution is the importance matching lemma…

    Submitted 8 March, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

  7. arXiv:2312.11581  [pdf, other]

    cs.CR cs.AI cs.LG

    Protect Your Score: Contact Tracing With Differential Privacy Guarantees

    Authors: Rob Romijnders, Christos Louizos, Yuki M. Asano, Max Welling

    Abstract: The pandemic in 2020 and 2021 had enormous economic and societal consequences, and studies show that contact tracing algorithms can be key in the early containment of the virus. While large strides have been made towards more effective contact tracing algorithms, we argue that privacy concerns currently hold deployment back. The essence of a contact tracing algorithm constitutes the communication…

    Submitted 15 February, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted to The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  8. arXiv:2304.14766  [pdf, other]

    cs.LG stat.ML

    Hyperparameter Optimization through Neural Network Partitioning

    Authors: Bruno Mlodozeniec, Matthias Reisser, Christos Louizos

    Abstract: Well-tuned hyperparameters are crucial for obtaining good generalization behavior in neural networks. They can enforce appropriate inductive biases, regularize the model and improve performance -- especially in the presence of limited data. In this work, we propose a simple and efficient way for optimizing hyperparameters inspired by the marginal likelihood, an optimization objective that requires…

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Published as a conference paper at ICLR 2023

  9. arXiv:2206.10844  [pdf, other]

    cs.LG cs.DC

    Quantization Robust Federated Learning for Efficient Inference on Heterogeneous Devices

    Authors: Kartik Gupta, Marios Fournarakis, Matthias Reisser, Christos Louizos, Markus Nagel

    Abstract: Federated Learning (FL) is a machine learning paradigm to distributively learn machine learning models from decentralized data that remains on-device. Despite the success of standard Federated optimization methods, such as Federated Averaging (FedAvg) in FL, the energy demands and hardware induced constraints for on-device learning have not been considered sufficiently in the literature. Specifica…

    Submitted 22 June, 2022; originally announced June 2022.

  10. arXiv:2111.10192  [pdf, other]

    cs.LG stat.ML

    An Expectation-Maximization Perspective on Federated Learning

    Authors: Christos Louizos, Matthias Reisser, Joseph Soriaga, Max Welling

    Abstract: Federated learning describes the distributed training of models across multiple clients while keeping the data private on-device. In this work, we view the server-orchestrated federated learning process as a hierarchical latent variable model where the server provides the parameters of a prior distribution over the client-specific model parameters. We show that with simple Gaussian priors and a ha…

    Submitted 19 November, 2021; originally announced November 2021.

  11. arXiv:2111.05454  [pdf, other]

    cs.LG cs.CR stat.ML

    DP-REC: Private & Communication-Efficient Federated Learning

    Authors: Aleksei Triastcyn, Matthias Reisser, Christos Louizos

    Abstract: Privacy and communication efficiency are important challenges in federated training of neural networks, and combining them is still an open problem. In this work, we develop a method that unifies highly compressed communication and differential privacy (DP). We introduce a compression technique based on Relative Entropy Coding (REC) to the federated setting. With a minor modification to REC, we ob…

    Submitted 7 December, 2021; v1 submitted 9 November, 2021; originally announced November 2021.

  12. arXiv:2107.06724  [pdf, other]

    cs.LG cs.DC

    Federated Mixture of Experts

    Authors: Matthias Reisser, Christos Louizos, Efstratios Gavves, Max Welling

    Abstract: Federated learning (FL) has emerged as the predominant approach for collaborative training of neural network models across multiple users, without the need to gather the data at a central location. One of the important challenges in this setting is data heterogeneity, i.e. different users have different data characteristics. For this reason, training and using a single global model might be subopt…

    Submitted 14 July, 2021; originally announced July 2021.

  13. arXiv:2104.08776  [pdf, other]

    cs.LG cs.CR

    Federated Learning of User Verification Models Without Sharing Embeddings

    Authors: Hossein Hosseini, Hyunsin Park, Sungrack Yun, Christos Louizos, Joseph Soriaga, Max Welling

    Abstract: We consider the problem of training User Verification (UV) models in federated setting, where each user has access to the data of only one class and user embeddings cannot be shared with the server or other users. To address this problem, we propose Federated User Verification (FedUV), a framework in which users jointly learn a set of vectors and maximize the correlation of their instance embeddin…

    Submitted 7 June, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

  14. arXiv:2008.10880  [pdf, other]

    cs.LG cs.AI stat.ML

    Improving Fair Predictions Using Variational Inference In Causal Models

    Authors: Rik Helwegen, Christos Louizos, Patrick Forré

    Abstract: The importance of algorithmic fairness grows with the increasing impact machine learning has on people's lives. Recent work on fairness metrics shows the need for causal reasoning in fairness constraints. In this work, a practical method named FairTrade is proposed for creating flexible prediction models which integrate fairness constraints on sensitive causal paths. The method uses recent advance…

    Submitted 25 August, 2020; originally announced August 2020.

  15. arXiv:2007.04618  [pdf, other]

    cs.LG stat.ML

    Federated Learning of User Authentication Models

    Authors: Hossein Hosseini, Sungrack Yun, Hyunsin Park, Christos Louizos, Joseph Soriaga, Max Welling

    Abstract: Machine learning-based User Authentication (UA) models have been widely deployed in smart devices. UA models are trained to map input data of different users to highly separable embedding vectors, which are then used to accept or reject new inputs at test time. Training UA models requires having direct access to the raw inputs and embedding vectors of users, both of which are privacy-sensitive inf…

    Submitted 9 July, 2020; originally announced July 2020.

  16. arXiv:2005.07093  [pdf, other]

    cs.LG cs.CV stat.ML

    Bayesian Bits: Unifying Quantization and Pruning

    Authors: Mart van Baalen, Christos Louizos, Markus Nagel, Rana Ali Amjad, Ying Wang, Tijmen Blankevoort, Max Welling

    Abstract: We introduce Bayesian Bits, a practical method for joint mixed precision quantization and pruning through gradient based optimization. Bayesian Bits employs a novel decomposition of the quantization operation, which sequentially considers doubling the bit width. At each new bit width, the residual error between the full precision value and the previously rounded value is quantized. We then decide…

    Submitted 27 October, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

  17. arXiv:2004.10568  [pdf, other]

    cs.LG cs.CV stat.ML

    Up or Down? Adaptive Rounding for Post-Training Quantization

    Authors: Markus Nagel, Rana Ali Amjad, Mart van Baalen, Christos Louizos, Tijmen Blankevoort

    Abstract: When quantizing neural networks, assigning each floating-point weight to its nearest fixed-point value is the predominant approach. We find that, perhaps surprisingly, this is not the best we can do. In this paper, we propose AdaRound, a better weight-rounding mechanism for post-training quantization that adapts to the data and the task loss. AdaRound is fast, does not require fine-tuning of the n…

    Submitted 30 June, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: Published as a conference paper at ICML 2020

  18. arXiv:2002.07520  [pdf, other]

    cs.LG stat.ML

    Gradient $\ell_1$ Regularization for Quantization Robustness

    Authors: Milad Alizadeh, Arash Behboodi, Mart van Baalen, Christos Louizos, Tijmen Blankevoort, Max Welling

    Abstract: We analyze the effect of quantizing weights and activations of neural networks on their loss and derive a simple regularization scheme that improves robustness against post-training quantization. By training quantization-ready networks, our approach enables storing a single set of weights that can be quantized on-demand to different bit-widths as energy and memory requirements of the application c…

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: ICLR 2020

  19. arXiv:1906.08324  [pdf, ps, other]

    cs.LG stat.ML

    The Functional Neural Process

    Authors: Christos Louizos, Xiahan Shi, Klamer Schutte, Max Welling

    Abstract: We present a new family of exchangeable stochastic processes, the Functional Neural Processes (FNPs). FNPs model distributions over functions by learning a graph of dependencies on top of latent representations of the points in the given dataset. In doing so, they define a Bayesian model without explicitly positing a prior distribution over latent global parameters; they instead adopt priors over…

    Submitted 4 November, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: Published as a conference paper at NeurIPS 2019

  20. arXiv:1905.10427  [pdf, other]

    stat.ML cs.LG

    DIVA: Domain Invariant Variational Autoencoders

    Authors: Maximilian Ilse, Jakub M. Tomczak, Christos Louizos, Max Welling

    Abstract: We consider the problem of domain generalization, namely, how to learn representations given data from a set of domains that generalize to data from a previously unseen domain. We propose the Domain Invariant Variational Autoencoder (DIVA), a generative model that tackles this problem by learning three independent latent subspaces, one for the domain, one for the class, and one for any residual va…

    Submitted 7 October, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Code available at https://github.com/AMLab-Amsterdam/DIVA

  21. arXiv:1810.01875  [pdf, other]

    cs.LG stat.ML

    Relaxed Quantization for Discretized Neural Networks

    Authors: Christos Louizos, Matthias Reisser, Tijmen Blankevoort, Efstratios Gavves, Max Welling

    Abstract: Neural network quantization has become an important research area due to its great impact on deployment of large models on resource constrained devices. In order to train networks that can be effectively discretized without loss of performance, we introduce a differentiable quantization procedure. Differentiability can be achieved by transforming continuous distributions over the weights and activ…

    Submitted 3 October, 2018; originally announced October 2018.

  22. arXiv:1712.01312  [pdf, other]

    stat.ML cs.LG

    Learning Sparse Neural Networks through $L_0$ Regularization

    Authors: Christos Louizos, Max Welling, Diederik P. Kingma

    Abstract: We propose a practical method for $L_0$ norm regularization for neural networks: pruning the network during training by encouraging weights to become exactly zero. Such regularization is interesting since (1) it can greatly speed up training and inference, and (2) it can improve generalization. AIC and BIC, well-known model selection criteria, are special cases of $L_0$ regularization. However, si…

    Submitted 22 June, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: Published as a conference paper at the International Conference on Learning Representations (ICLR) 2018

  23. arXiv:1705.08821  [pdf, other]

    stat.ML cs.LG

    Causal Effect Inference with Deep Latent-Variable Models

    Authors: Christos Louizos, Uri Shalit, Joris Mooij, David Sontag, Richard Zemel, Max Welling

    Abstract: Learning individual-level causal effects from observational data, such as inferring the most effective medication for a specific patient, is a problem of growing importance for policy makers. The most important aspect of inferring causal effects from observational data is the handling of confounders, factors that affect both an intervention and its outcome. A carefully designed observational study…

    Submitted 6 November, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

    Comments: Published as a conference paper at NIPS 2017

  24. arXiv:1705.08665  [pdf, other]

    stat.ML cs.LG

    Bayesian Compression for Deep Learning

    Authors: Christos Louizos, Karen Ullrich, Max Welling

    Abstract: Compression and computational efficiency in deep learning have become a problem of great significance. In this work, we argue that the most principled and effective way to attack this problem is by adopting a Bayesian point of view, where through sparsity inducing priors we prune large parts of the network. We introduce two novelties in this paper: 1) we use hierarchical priors to prune nodes inst…

    Submitted 6 November, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

    Comments: Published as a conference paper at NIPS 2017

  25. arXiv:1703.01961  [pdf, other]

    stat.ML cs.LG

    Multiplicative Normalizing Flows for Variational Bayesian Neural Networks

    Authors: Christos Louizos, Max Welling

    Abstract: We reinterpret multiplicative noise in neural networks as auxiliary random variables that augment the approximate posterior in a variational setting for Bayesian neural networks. We show that through this interpretation it is both efficient and straightforward to improve the approximation by employing normalizing flows while still allowing for local reparametrizations and a tractable lower bound.…

    Submitted 12 June, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: Appearing at the International Conference on Machine Learning (ICML) 2017

  26. arXiv:1603.04733  [pdf, other]

    stat.ML cs.LG

    Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors

    Authors: Christos Louizos, Max Welling

    Abstract: We introduce a variational Bayesian neural network where the parameters are governed via a probability distribution on random matrices. Specifically, we employ a matrix variate Gaussian \cite{gupta1999matrix} parameter posterior distribution where we explicitly model the covariance among the input and output dimensions of each layer. Furthermore, with approximate covariance matrices we can achieve…

    Submitted 23 June, 2016; v1 submitted 15 March, 2016; originally announced March 2016.

    Comments: Updated results with the original folds in the regression experiments. Appearing in the International Conference on Machine Learning (ICML) 2016

  27. arXiv:1511.00830  [pdf, other]

    stat.ML cs.LG

    The Variational Fair Autoencoder

    Authors: Christos Louizos, Kevin Swersky, Yujia Li, Max Welling, Richard Zemel

    Abstract: We investigate the problem of learning representations that are invariant to certain nuisance or sensitive factors of variation in the data while retaining as much of the remaining information as possible. Our model is based on a variational autoencoding architecture with priors that encourage independence between sensitive and latent factors of variation. Any subsequent processing, such as classi…

    Submitted 9 August, 2017; v1 submitted 3 November, 2015; originally announced November 2015.

    Comments: Fixed typo in eq. 3 and 4