Skip to main content

Showing 1–22 of 22 results for author: Charpentier, B

.
  1. arXiv:2406.14404  [pdf, other

    cs.LG

    Predicting Probabilities of Error to Combine Quantization and Early Exiting: QuEE

    Authors: Florence Regol, Joud Chataoui, Bertrand Charpentier, Mark Coates, Pablo Piantanida, Stephan Gunnemann

    Abstract: Machine learning models can solve complex tasks but often require significant computational resources during inference. This has led to the development of various post-training computation reduction methods that tackle this issue in different ways, such as quantization which reduces the precision of weights and arithmetic operations, and dynamic networks which adapt computation to the sample at ha… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2405.01462  [pdf, other

    cs.LG

    Uncertainty for Active Learning on Graphs

    Authors: Dominik Fuchsgruber, Tom Wollschläger, Bertrand Charpentier, Antonio Oroz, Stephan Günnemann

    Abstract: Uncertainty Sampling is an Active Learning strategy that aims to improve the data efficiency of machine learning models by iteratively acquiring labels of data points with the highest uncertainty. While it has proven effective for independent data its applicability to graphs remains under-explored. We propose the first extensive study of Uncertainty Sampling for node classification: (1) We benchma… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  3. arXiv:2403.18955  [pdf, other

    cs.LG cs.CV

    Structurally Prune Anything: Any Architecture, Any Framework, Any Time

    Authors: Xun Wang, John Rachwan, Stephan Günnemann, Bertrand Charpentier

    Abstract: Neural network pruning serves as a critical technique for enhancing the efficiency of deep learning models. Unlike unstructured pruning, which only sets specific parameters to zero, structured pruning eliminates entire channels, thus yielding direct computational and storage benefits. However, the diverse patterns for coupling parameters, such as residual connections and group convolutions, the di… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  4. arXiv:2402.15978  [pdf, other

    cs.LG stat.ML

    Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks Using the Marginal Likelihood

    Authors: Rayen Dhahri, Alexander Immer, Betrand Charpentier, Stephan Günnemann, Vincent Fortuin

    Abstract: Neural network sparsification is a promising avenue to save computational time and memory costs, especially in an age where many successful AI models are becoming too large to naïvely deploy on consumer hardware. While much work has focused on different weight pruning criteria, the overall sparsifiability of the network, i.e., its capacity to be pruned without quality loss, has often been overlook… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  5. arXiv:2306.15427  [pdf, other

    cs.LG

    Adversarial Training for Graph Neural Networks: Pitfalls, Solutions, and New Directions

    Authors: Lukas Gosch, Simon Geisler, Daniel Sturm, Bertrand Charpentier, Daniel Zügner, Stephan Günnemann

    Abstract: Despite its success in the image domain, adversarial training did not (yet) stand out as an effective defense for Graph Neural Networks (GNNs) against graph structure perturbations. In the pursuit of fixing adversarial training (1) we show and overcome fundamental theoretical as well as practical limitations of the adopted graph learning setting in prior work; (2) we reveal that more flexible GNNs… ▽ More

    Submitted 2 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Published as a conference paper at NeurIPS 2023

  6. arXiv:2306.14916  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph stat.ML

    Uncertainty Estimation for Molecules: Desiderata and Methods

    Authors: Tom Wollschläger, Nicholas Gao, Bertrand Charpentier, Mohamed Amine Ketata, Stephan Günnemann

    Abstract: Graph Neural Networks (GNNs) are promising surrogates for quantum mechanical calculations as they establish unprecedented low errors on collections of molecular dynamics (MD) trajectories. Thanks to their fast inference times they promise to accelerate computational chemistry applications. Unfortunately, despite low in-distribution (ID) errors, such GNNs might be horribly wrong for out-of-distribu… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Published as conference paper at ICML 2023

  7. arXiv:2305.10498  [pdf, other

    cs.LG cs.SI

    Edge Directionality Improves Learning on Heterophilic Graphs

    Authors: Emanuele Rossi, Bertrand Charpentier, Francesco Di Giovanni, Fabrizio Frasca, Stephan Günnemann, Michael Bronstein

    Abstract: Graph Neural Networks (GNNs) have become the de-facto standard tool for modeling relational data. However, while many real-world graphs are directed, the majority of today's GNN models discard this information altogether by simply making the graph undirected. The reasons for this are historical: 1) many early variants of spectral GNNs explicitly required undirected graphs, and 2) the first benchma… ▽ More

    Submitted 28 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  8. arXiv:2304.00897  [pdf, other

    cs.LG

    Accuracy is not the only Metric that matters: Estimating the Energy Consumption of Deep Learning Models

    Authors: Johannes Getzner, Bertrand Charpentier, Stephan Günnemann

    Abstract: Modern machine learning models have started to consume incredible amounts of energy, thus incurring large carbon footprints (Strubell et al., 2019). To address this issue, we have created an energy estimation pipeline1, which allows practitioners to estimate the energy needs of their models in advance, without actually running or training them. We accomplished this, by collecting high-quality ener… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  9. arXiv:2303.05796  [pdf, other

    cs.LG

    Training, Architecture, and Prior for Deterministic Uncertainty Methods

    Authors: Bertrand Charpentier, Chenxiang Zhang, Stephan Günnemann

    Abstract: Accurate and efficient uncertainty estimation is crucial to build reliable Machine Learning (ML) models capable to provide calibrated uncertainty estimates, generalize and detect Out-Of-Distribution (OOD) datasets. To this end, Deterministic Uncertainty Methods (DUMs) is a promising model family capable to perform uncertainty estimation in a single forward pass. This work investigates important de… ▽ More

    Submitted 28 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  10. arXiv:2207.04227  [pdf, other

    cs.LG

    On the Robustness and Anomaly Detection of Sparse Neural Networks

    Authors: Morgane Ayle, Bertrand Charpentier, John Rachwan, Daniel Zügner, Simon Geisler, Stephan Günnemann

    Abstract: The robustness and anomaly detection capability of neural networks are crucial topics for their safe adoption in the real-world. Moreover, the over-parameterization of recent networks comes with high computational costs and raises questions about its influence on robustness and anomaly detection. In this work, we show that sparsity can make networks more robust and better anomaly detectors. To mot… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

  11. arXiv:2206.10451  [pdf, other

    cs.LG

    Winning the Lottery Ahead of Time: Efficient Early Network Pruning

    Authors: John Rachwan, Daniel Zügner, Bertrand Charpentier, Simon Geisler, Morgane Ayle, Stephan Günnemann

    Abstract: Pruning, the task of sparsifying deep neural networks, received increasing attention recently. Although state-of-the-art pruning methods extract highly sparse models, they neglect two main challenges: (1) the process of finding these sparse models is often very expensive; (2) unstructured pruning does not provide benefits in terms of GPU memory, training time, or carbon emissions. We propose Early… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  12. arXiv:2206.01558  [pdf, other

    cs.LG

    Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement Learning

    Authors: Bertrand Charpentier, Ransalu Senanayake, Mykel Kochenderfer, Stephan Günnemann

    Abstract: Characterizing aleatoric and epistemic uncertainty on the predicted rewards can help in building reliable reinforcement learning (RL) systems. Aleatoric uncertainty results from the irreducible environment stochasticity leading to inherently risky states and actions. Epistemic uncertainty results from the limited information accumulated during learning to make informed decisions. Characterizing al… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  13. arXiv:2203.08509  [pdf, other

    cs.LG stat.ML

    Differentiable DAG Sampling

    Authors: Bertrand Charpentier, Simon Kibler, Stephan Günnemann

    Abstract: We propose a new differentiable probabilistic model over DAGs (DP-DAG). DP-DAG allows fast and differentiable DAG sampling suited to continuous optimization. To this end, DP-DAG samples a DAG by successively (1) sampling a linear ordering of the node and (2) sampling edges consistent with the sampled linear ordering. We further propose VI-DP-DAG, a new method for DAG learning from observational da… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

  14. arXiv:2110.14012  [pdf, other

    stat.ML cs.LG

    Graph Posterior Network: Bayesian Predictive Uncertainty for Node Classification

    Authors: Maximilian Stadler, Bertrand Charpentier, Simon Geisler, Daniel Zügner, Stephan Günnemann

    Abstract: The interdependence between nodes in graphs is key to improve class predictions on nodes and utilized in approaches like Label Propagation (LP) or in Graph Neural Networks (GNN). Nonetheless, uncertainty estimation for non-independent node-level predictions is under-explored. In this work, we explore uncertainty quantification for node classification in three ways: (1) We derive three axioms expli… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Neurips 2021

  15. arXiv:2107.08785  [pdf, other

    cs.LG

    On Out-of-distribution Detection with Energy-based Models

    Authors: Sven Elflein, Bertrand Charpentier, Daniel Zügner, Stephan Günnemann

    Abstract: Several density estimation methods have shown to fail to detect out-of-distribution (OOD) samples by assigning higher likelihoods to anomalous data. Energy-based models (EBMs) are flexible, unnormalized density models which seem to be able to improve upon this failure mode. In this work, we provide an extensive study investigating OOD detection with EBMs trained with different approaches on tabula… ▽ More

    Submitted 3 July, 2021; originally announced July 2021.

    Comments: Accepted to ICML 2021 Workshop on Uncertainty & Robustness in Deep Learning

  16. arXiv:2105.04471  [pdf, other

    cs.LG stat.ML

    Natural Posterior Network: Deep Bayesian Uncertainty for Exponential Family Distributions

    Authors: Bertrand Charpentier, Oliver Borchert, Daniel Zügner, Simon Geisler, Stephan Günnemann

    Abstract: Uncertainty awareness is crucial to develop reliable machine learning models. In this work, we propose the Natural Posterior Network (NatPN) for fast and high-quality uncertainty estimation for any task where the target distribution belongs to the exponential family. Thus, NatPN finds application for both classification and general regression settings. Unlike many previous approaches, NatPN does n… ▽ More

    Submitted 16 March, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

  17. arXiv:2010.14986  [pdf, other

    cs.LG stat.ML

    Evaluating Robustness of Predictive Uncertainty Estimation: Are Dirichlet-based Models Reliable?

    Authors: Anna-Kathrin Kopetzki, Bertrand Charpentier, Daniel Zügner, Sandhya Giri, Stephan Günnemann

    Abstract: Dirichlet-based uncertainty (DBU) models are a recent and promising class of uncertainty-aware models. DBU models predict the parameters of a Dirichlet distribution to provide fast, high-quality uncertainty estimates alongside with class predictions. In this work, we present the first large-scale, in-depth study of the robustness of DBU models under adversarial attacks. Our results suggest that un… ▽ More

    Submitted 11 June, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Published at ICML 2021

  18. arXiv:2009.07660  [pdf, other

    cs.SI

    Scikit-network: Graph Analysis in Python

    Authors: Thomas Bonald, Nathan de Lara, Quentin Lutz, Bertrand Charpentier

    Abstract: Scikit-network is a Python package inspired by scikit-learn for the analysis of large graphs. Graphs are represented by their adjacency matrix in the sparse CSR format of SciPy. The package provides state-of-the-art algorithms for ranking, clustering, classifying, embedding and visualizing the nodes of a graph. High performance is achieved through a mix of fast matrix-vector products (using SciPy)… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

    Journal ref: Journal of Machine Learning Research, Microtome Publishing, In press

  19. arXiv:2006.09239  [pdf, other

    cs.LG stat.ML

    Posterior Network: Uncertainty Estimation without OOD Samples via Density-Based Pseudo-Counts

    Authors: Bertrand Charpentier, Daniel Zügner, Stephan Günnemann

    Abstract: Accurate estimation of aleatoric and epistemic uncertainty is crucial to build safe and reliable systems. Traditional approaches, such as dropout and ensemble methods, estimate uncertainty by sampling probability predictions from different submodels, which leads to slow uncertainty estimation at inference time. Recent works address this drawback by directly predicting parameters of prior distribut… ▽ More

    Submitted 22 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Neurips 2020

  20. arXiv:1911.05503  [pdf, other

    cs.LG stat.ML

    Uncertainty on Asynchronous Time Event Prediction

    Authors: Marin Biloš, Bertrand Charpentier, Stephan Günnemann

    Abstract: Asynchronous event sequences are the basis of many applications throughout different industries. In this work, we tackle the task of predicting the next event (given a history), and how this prediction changes with the passage of time. Since at some time points (e.g. predictions far into the future) we might not be able to predict anything with confidence, capturing uncertainty in the predictions… ▽ More

    Submitted 8 January, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: Neurips 2019 (Spotlight)

  21. arXiv:1807.05087  [pdf, ps, other

    cs.SI cs.LG stat.ML

    Learning Graph Representations by Dendrograms

    Authors: Thomas Bonald, Bertrand Charpentier

    Abstract: Hierarchical graph clustering is a common technique to reveal the multi-scale structure of complex networks. We propose a novel metric for assessing the quality of a hierarchical clustering. This metric reflects the ability to reconstruct the graph from the dendrogram, which encodes the hierarchy. The optimal representation of the graph defines a class of reducible linkages leading to regular dend… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

  22. arXiv:1806.01664  [pdf, other

    cs.SI cs.AI

    Hierarchical Graph Clustering using Node Pair Sampling

    Authors: Thomas Bonald, Bertrand Charpentier, Alexis Galland, Alexandre Hollocou

    Abstract: We present a novel hierarchical graph clustering algorithm inspired by modularity-based clustering techniques. The algorithm is agglomerative and based on a simple distance between clusters induced by the probability of sampling node pairs. We prove that this distance is reducible, which enables the use of the nearest-neighbor chain to speed up the agglomeration. The output of the algorithm is a r… ▽ More

    Submitted 22 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

    ACM Class: I.5.2