
Showing 1–50 of 143 results for author: Günnemann, S

  1. arXiv:2406.14404  [pdf, other]

    cs.LG

    Predicting Probabilities of Error to Combine Quantization and Early Exiting: QuEE

    Authors: Florence Regol, Joud Chataoui, Bertrand Charpentier, Mark Coates, Pablo Piantanida, Stephan Gunnemann

    Abstract: Machine learning models can solve complex tasks but often require significant computational resources during inference. This has led to the development of various post-training computation reduction methods that tackle this issue in different ways, such as quantization which reduces the precision of weights and arithmetic operations, and dynamic networks which adapt computation to the sample at ha…

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.11390  [pdf, other]

    physics.flu-dyn cs.LG

    Unfolding Time: Generative Modeling for Turbulent Flows in 4D

    Authors: Abdullah Saydemir, Marten Lienen, Stephan Günnemann

    Abstract: A recent study in turbulent flow simulation demonstrated the potential of generative diffusion models for fast 3D surrogate modeling. This approach eliminates the need for specifying initial states or performing lengthy simulations, significantly accelerating the process. While adept at sampling individual frames from the learned manifold of turbulent flow states, the previous model lacks the capa…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: AI4Science Workshop @ ICML 2024

  3. arXiv:2406.10513  [pdf, other]

    cs.LG q-bio.BM

    Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space

    Authors: Mohamed Amine Ketata, Nicholas Gao, Johanna Sommer, Tom Wollschläger, Stephan Günnemann

    Abstract: We introduce a new framework for molecular graph generation with 3D molecular generative models. Our Synthetic Coordinate Embedding (SyCo) framework maps molecular graphs to Euclidean point clouds via synthetic conformer coordinates and learns the inverse map using an E(n)-Equivariant Graph Neural Network (EGNN). The induced point cloud-structured latent space is well-suited to apply existing 3D m…

    Submitted 15 June, 2024; originally announced June 2024.

  4. arXiv:2406.08210  [pdf, other]

    cs.LG

    Expressivity and Generalization: Fragment-Biases for Molecular GNNs

    Authors: Tom Wollschläger, Niklas Kemper, Leon Hetzel, Johanna Sommer, Stephan Günnemann

    Abstract: Although recent advances in higher-order Graph Neural Networks (GNNs) improve the theoretical expressiveness and molecular property predictive performance, they often fall short of the empirical performance of models that explicitly use fragment information as inductive bias. However, for these approaches, there exists no theoretic expressivity study. In this work, we propose the Fragment-WL test,…

    Submitted 12 June, 2024; originally announced June 2024.

    Journal ref: International Conference on Machine Learning. 2024. Oral

  5. arXiv:2406.06417  [pdf, other]

    cs.LG cs.AI

    Explainable Graph Neural Networks Under Fire

    Authors: Zhong Li, Simon Geisler, Yuhang Wang, Stephan Günnemann, Matthijs van Leeuwen

    Abstract: Predictions made by graph neural networks (GNNs) usually lack interpretability due to their complex computational behavior and the abstract nature of graphs. In an attempt to tackle this, many GNN explanation methods have emerged. Their goal is to explain a model's predictions and thereby obtain trust when GNN models are deployed in decision critical applications. Most GNN explanation methods work…

    Submitted 10 June, 2024; originally announced June 2024.

  6. arXiv:2406.04043  [pdf, other]

    cs.LG stat.ML

    Energy-based Epistemic Uncertainty for Graph Neural Networks

    Authors: Dominik Fuchsgruber, Tom Wollschläger, Stephan Günnemann

    Abstract: In domains with interdependent data, such as graphs, quantifying the epistemic uncertainty of a Graph Neural Network (GNN) is challenging as uncertainty can arise at different structural scales. Existing techniques neglect this issue or only distinguish between structure-aware and structure-agnostic uncertainty without combining them into a single measure. We propose GEBM, an energy-based model (E…

    Submitted 1 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2405.19121  [pdf, other]

    cs.LG cs.AI

    Spatio-Spectral Graph Neural Networks

    Authors: Simon Geisler, Arthur Kosmala, Daniel Herbst, Stephan Günnemann

    Abstract: Spatial Message Passing Graph Neural Networks (MPGNNs) are widely used for learning on graph-structured data. However, key limitations of l-step MPGNNs are that their "receptive field" is typically limited to the l-hop neighborhood of a node and that information exchange between distant nodes is limited by over-squashing. Motivated by these limitations, we propose Spatio-Spectral Graph Neural Netw…

    Submitted 2 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: 46 pages, 27 figures, 12 tables

  8. arXiv:2405.17951  [pdf, other]

    cs.LG

    Efficient Time Series Processing for Transformers and State-Space Models through Token Merging

    Authors: Leon Götz, Marcel Kollovieh, Stephan Günnemann, Leo Schwinn

    Abstract: Transformer architectures have shown promising results in time series processing. However, despite recent advances in subquadratic attention mechanisms or state-space models, processing very long sequences still imposes significant computational requirements. Token merging, which involves replacing multiple tokens with a single one calculated as their linear combination, has shown to considerably…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 19 pages in total, 14 figures

  9. arXiv:2405.15589  [pdf, other]

    cs.LG cs.CR

    Efficient Adversarial Training in LLMs with Continuous Attacks

    Authors: Sophie Xhonneux, Alessandro Sordoni, Stephan Günnemann, Gauthier Gidel, Leo Schwinn

    Abstract: Large language models (LLMs) are vulnerable to adversarial attacks that can bypass their safety guardrails. In many domains, adversarial training has proven to be one of the most promising methods to reliably improve robustness against such attacks. Yet, in the context of LLMs, current methods for adversarial training are hindered by the high computational costs required to perform discrete advers…

    Submitted 21 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 19 pages, 4 figures

  10. arXiv:2405.14762  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph quant-ph

    Neural Pfaffians: Solving Many Many-Electron Schrödinger Equations

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Neural wave functions accomplished unprecedented accuracies in approximating the ground state of many-electron systems, though at a high computational cost. Recent works proposed amortizing the cost by learning generalized wave functions across different structures and compounds instead of solving each problem independently. Enforcing the permutation antisymmetry of electrons in such generalized n…

    Submitted 6 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  11. arXiv:2405.11337  [pdf, other]

    cs.CV

    A Unified Approach Towards Active Learning and Out-of-Distribution Detection

    Authors: Sebastian Schmidt, Leonard Schenk, Leo Schwinn, Stephan Günnemann

    Abstract: When applying deep learning models in open-world scenarios, active learning (AL) strategies are crucial for identifying label candidates from a nearly infinite amount of unlabeled data. In this context, robust out-of-distribution (OOD) detection mechanisms are essential for handling data outside the target distribution of the application. However, current works investigate both problems separately…

    Submitted 25 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  12. arXiv:2405.01462  [pdf, other]

    cs.LG

    Uncertainty for Active Learning on Graphs

    Authors: Dominik Fuchsgruber, Tom Wollschläger, Bertrand Charpentier, Antonio Oroz, Stephan Günnemann

    Abstract: Uncertainty Sampling is an Active Learning strategy that aims to improve the data efficiency of machine learning models by iteratively acquiring labels of data points with the highest uncertainty. While it has proven effective for independent data its applicability to graphs remains under-explored. We propose the first extensive study of Uncertainty Sampling for node classification: (1) We benchma…

    Submitted 2 May, 2024; originally announced May 2024.

  13. arXiv:2404.07664  [pdf, other]

    cs.CV cs.AI

    Finding Dino: A plug-and-play framework for unsupervised detection of out-of-distribution objects using prototypes

    Authors: Poulami Sinhamahapatra, Franziska Schwaiger, Shirsha Bose, Huiyu Wang, Karsten Roscher, Stephan Guennemann

    Abstract: Detecting and localising unknown or Out-of-distribution (OOD) objects in any scene can be a challenging task in vision. Particularly, in safety-critical cases involving autonomous systems like automated vehicles or trains. Supervised anomaly segmentation or open-world object detection models depend on training on exhaustively annotated datasets for every domain and still struggle in distinguishing…

    Submitted 11 April, 2024; originally announced April 2024.

  14. arXiv:2404.02830  [pdf, other]

    cs.CV cs.AI

    Enhancing Interpretability of Vertebrae Fracture Grading using Human-interpretable Prototypes

    Authors: Poulami Sinhamahapatra, Suprosanna Shit, Anjany Sekuboyina, Malek Husseini, David Schinz, Nicolas Lenhart, Joern Menze, Jan Kirschke, Karsten Roscher, Stephan Guennemann

    Abstract: Vertebral fracture grading classifies the severity of vertebral fractures, which is a challenging task in medical imaging and has recently attracted Deep Learning (DL) models. Only a few works attempted to make such models human-interpretable despite the need for transparency and trustworthiness in critical use cases like DL-assisted medical diagnosis. Moreover, such models either rely on post-hoc…

    Submitted 3 April, 2024; originally announced April 2024.

  15. arXiv:2403.18955  [pdf, other]

    cs.LG cs.CV

    Structurally Prune Anything: Any Architecture, Any Framework, Any Time

    Authors: Xun Wang, John Rachwan, Stephan Günnemann, Bertrand Charpentier

    Abstract: Neural network pruning serves as a critical technique for enhancing the efficiency of deep learning models. Unlike unstructured pruning, which only sets specific parameters to zero, structured pruning eliminates entire channels, thus yielding direct computational and storage benefits. However, the diverse patterns for coupling parameters, such as residual connections and group convolutions, the di…

    Submitted 3 March, 2024; originally announced March 2024.

  16. arXiv:2403.05249  [pdf, other]

    quant-ph cs.LG physics.chem-ph physics.comp-ph

    On Representing Electronic Wave Functions with Sign Equivariant Neural Networks

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Recent neural networks demonstrated impressively accurate approximations of electronic ground-state wave functions. Such neural networks typically consist of a permutation-equivariant neural network followed by a permutation-antisymmetric operation to enforce the electronic exchange symmetry. While accurate, such neural networks are computationally expensive. In this work, we explore the flipped a…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Published at Workshop on AI4DifferentialEquations in Science at ICLR 2024

  17. arXiv:2403.04867  [pdf, other]

    cs.CR cs.LG stat.ML

    Unified Mechanism-Specific Amplification by Subsampling and Group Privacy Amplification

    Authors: Jan Schuchardt, Mihail Stoian, Arthur Kosmala, Stephan Günnemann

    Abstract: Amplification by subsampling is one of the main primitives in machine learning with differential privacy (DP): Training a model on random batches instead of complete datasets results in stronger privacy. This is traditionally formalized via mechanism-agnostic subsampling guarantees that express the privacy parameters of a subsampled mechanism as a function of the original mechanism's privacy param…

    Submitted 10 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  18. arXiv:2402.15978  [pdf, other]

    cs.LG stat.ML

    Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks Using the Marginal Likelihood

    Authors: Rayen Dhahri, Alexander Immer, Bertrand Charpentier, Stephan Günnemann, Vincent Fortuin

    Abstract: Neural network sparsification is a promising avenue to save computational time and memory costs, especially in an age where many successful AI models are becoming too large to naïvely deploy on consumer hardware. While much work has focused on different weight pruning criteria, the overall sparsifiability of the network, i.e., its capacity to be pruned without quality loss, has often been overlook…

    Submitted 24 February, 2024; originally announced February 2024.

  19. arXiv:2402.09154  [pdf, other]

    cs.LG

    Attacking Large Language Models with Projected Gradient Descent

    Authors: Simon Geisler, Tom Wollschläger, M. H. I. Abdalla, Johannes Gasteiger, Stephan Günnemann

    Abstract: Current LLM alignment methods are readily broken through specifically crafted adversarial prompts. While crafting adversarial prompts using discrete optimization is highly effective, such attacks typically use more than 100,000 LLM calls. This high computational cost makes them unsuitable for, e.g., quantitative analyses and adversarial training. To remedy this, we revisit Projected Gradient Desce…

    Submitted 14 February, 2024; originally announced February 2024.

  20. arXiv:2402.09063  [pdf, other]

    cs.LG

    Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space

    Authors: Leo Schwinn, David Dobre, Sophie Xhonneux, Gauthier Gidel, Stephan Gunnemann

    Abstract: Current research in adversarial robustness of LLMs focuses on discrete input manipulations in the natural language space, which can be directly transferred to closed-source models. However, this approach neglects the steady progression of open-source models. As open-source models advance in capability, ensuring their safety also becomes increasingly imperative. Yet, attacks tailored to open-source…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Trigger Warning: the appendix contains LLM-generated text with violence and harassment

  21. arXiv:2312.05502  [pdf, other]

    cs.LG

    Poisoning $\times$ Evasion: Symbiotic Adversarial Robustness for Graph Neural Networks

    Authors: Ege Erdogan, Simon Geisler, Stephan Günnemann

    Abstract: It is well-known that deep learning models are vulnerable to small input perturbations. Such perturbed instances are called adversarial examples. Adversarial examples are commonly crafted to fool a model either at training time (poisoning) or test time (evasion). In this work, we study the symbiosis of poisoning and evasion. We show that combining both threat models can substantially improve the d…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)

  22. arXiv:2312.05340  [pdf, other]

    q-bio.QM cs.LG

    Transition Path Sampling with Boltzmann Generator-based MCMC Moves

    Authors: Michael Plainer, Hannes Stärk, Charlotte Bunne, Stephan Günnemann

    Abstract: Sampling all possible transition paths between two 3D states of a molecular system has various applications ranging from catalyst design to drug discovery. Current approaches to sample transition paths use Markov chain Monte Carlo and rely on time-intensive molecular dynamics simulations to find new paths. Our approach operates in the latent space of a normalizing flow that maps from the molecule'…

    Submitted 28 May, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Spotlight at NeurIPS 2023 Generative AI and Biology Workshop

  23. arXiv:2312.02708  [pdf, other]

    cs.LG cs.CR stat.ML

    Provable Adversarial Robustness for Group Equivariant Tasks: Graphs, Point Clouds, Molecules, and More

    Authors: Jan Schuchardt, Yan Scholten, Stephan Günnemann

    Abstract: A machine learning model is traditionally considered robust if its prediction remains (almost) constant under input perturbations with small norm. However, real-world tasks like molecular property prediction or point cloud segmentation have inherent equivariances, such as rotation or permutation equivariance. In such tasks, even perturbations with large norm do not necessarily change an input's se…

    Submitted 15 January, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted at NeurIPS 2023

  24. arXiv:2311.17853  [pdf, other]

    cs.LG

    On the Adversarial Robustness of Graph Contrastive Learning Methods

    Authors: Filippo Guerranti, Zinuo Yi, Anna Starovoit, Rafiq Kamel, Simon Geisler, Stephan Günnemann

    Abstract: Contrastive learning (CL) has emerged as a powerful framework for learning representations of images and text in a self-supervised manner while enhancing model robustness against adversarial attacks. More recently, researchers have extended the principles of contrastive learning to graph-structured data, giving birth to the field of graph contrastive learning (GCL). However, whether GCL methods ca…

    Submitted 30 November, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)

  25. arXiv:2311.06481  [pdf, other]

    cs.RO cs.LG

    Topology-Matching Normalizing Flows for Out-of-Distribution Detection in Robot Learning

    Authors: Jianxiang Feng, Jongseok Lee, Simon Geisler, Stephan Gunnemann, Rudolph Triebel

    Abstract: To facilitate reliable deployments of autonomous robots in the real world, Out-of-Distribution (OOD) detection capabilities are often required. A powerful approach for OOD detection is based on density estimation with Normalizing Flows (NFs). However, we find that prior work with NFs attempts to match the complex target distribution topologically with naive base distributions leading to adverse im…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: Accepted at CoRL 2023

  26. arXiv:2311.01139  [pdf, other]

    cs.LG stat.ML

    Add and Thin: Diffusion for Temporal Point Processes

    Authors: David Lüdke, Marin Biloš, Oleksandr Shchur, Marten Lienen, Stephan Günnemann

    Abstract: Autoregressive neural networks within the temporal point process (TPP) framework have become the standard for modeling continuous-time event data. Even though these models can expressively capture event sequences in a one-step-ahead fashion, they are inherently limited for long-term forecasting applications due to the accumulation of errors caused by their sequential nature. To overcome these limi…

    Submitted 20 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  27. arXiv:2310.19737  [pdf, other]

    cs.AI

    Adversarial Attacks and Defenses in Large Language Models: Old and New Threats

    Authors: Leo Schwinn, David Dobre, Stephan Günnemann, Gauthier Gidel

    Abstract: Over the past decade, there has been extensive research aimed at enhancing the robustness of neural networks, yet this problem remains vastly unsolved. Here, one major impediment has been the overestimation of the robustness of new defense approaches due to faulty defense evaluations. Flawed robustness evaluations necessitate rectifications in subsequent works, dangerously slowing down the researc…

    Submitted 30 October, 2023; originally announced October 2023.

  28. arXiv:2310.16221  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Hierarchical Randomized Smoothing

    Authors: Yan Scholten, Jan Schuchardt, Aleksandar Bojchevski, Stephan Günnemann

    Abstract: Real-world data is complex and often consists of objects that can be decomposed into multiple entities (e.g. images into pixels, graphs into interconnected nodes). Randomized smoothing is a powerful framework for making models provably robust against small changes to their inputs - by guaranteeing robustness of the majority vote when randomly adding noise before classification. Yet, certifying rob…

    Submitted 15 January, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  29. arXiv:2310.04285  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Assessing Robustness via Score-Based Adversarial Image Generation

    Authors: Marcel Kollovieh, Lukas Gosch, Yan Scholten, Marten Lienen, Stephan Günnemann

    Abstract: Most adversarial attacks and defenses focus on perturbations within small $\ell_p$-norm constraints. However, $\ell_p$ threat models cannot capture all relevant semantic-preserving perturbations, and hence, the scope of robustness evaluations is limited. In this work, we introduce Score-Based Adversarial Generation (ScoreAG), a novel framework that leverages the advancements in score-based generat…

    Submitted 6 October, 2023; originally announced October 2023.

  30. arXiv:2309.05517  [pdf, other]

    cs.CV cs.LG

    Stream-based Active Learning by Exploiting Temporal Properties in Perception with Temporal Predicted Loss

    Authors: Sebastian Schmidt, Stephan Günnemann

    Abstract: Active learning (AL) reduces the amount of labeled data needed to train a machine learning model by intelligently choosing which instances to label. Classic pool-based AL requires all data to be present in a datacenter, which can be challenging with the increasing amounts of data needed in deep learning. However, AL on mobile devices and robots, like autonomous cars, can filter the data from perce…

    Submitted 26 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

  31. arXiv:2308.08173  [pdf, other]

    cs.LG

    Expressivity of Graph Neural Networks Through the Lens of Adversarial Robustness

    Authors: Francesco Campi, Lukas Gosch, Tom Wollschläger, Yan Scholten, Stephan Günnemann

    Abstract: We perform the first adversarial robustness study into Graph Neural Networks (GNNs) that are provably more powerful than traditional Message Passing Neural Networks (MPNNs). In particular, we use adversarial robustness as a tool to uncover a significant gap between their theoretically possible and empirically achieved expressive power. To do so, we focus on the ability of GNNs to count specific su…

    Submitted 3 July, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Published in 2nd AdvML Frontiers workshop at 40th International Conference on Machine Learning (ICML)

    ACM Class: I.2.6

  32. arXiv:2308.05239  [pdf, other]

    cs.SE cs.LG

    Machine Learning-Enabled Software and System Architecture Frameworks

    Authors: Armin Moin, Atta Badii, Stephan Günnemann, Moharram Challenger

    Abstract: Various architecture frameworks for software, systems, and enterprises have been proposed in the literature. They identified several stakeholders and defined modeling perspectives, architecture viewpoints, and views to frame and address stakeholder concerns. However, the stakeholders with data science and Machine Learning (ML) related concerns, such as data scientists and data engineers, are yet t…

    Submitted 26 June, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Journal manuscript

  33. arXiv:2307.08423  [pdf, other]

    cs.LG physics.comp-ph

    Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

    Authors: Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, YuQing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence, et al. (38 additional authors not shown)

    Abstract: Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences. Today, AI has started to advance natural sciences by improving, accelerating, and enabling our understanding of natural phenomena at a wide range of spatial and temporal scales, giving rise to a new area of research known as AI for science (AI4Science). Being an emerging research paradigm, AI4Sc…

    Submitted 15 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  34. arXiv:2307.04533  [pdf, other]

    cs.CV cs.AI

    Preventing Errors in Person Detection: A Part-Based Self-Monitoring Framework

    Authors: Franziska Schwaiger, Andrea Matic, Karsten Roscher, Stephan Günnemann

    Abstract: The ability to detect learned objects regardless of their appearance is crucial for autonomous systems in real-world applications. Especially for detecting humans, which is often a fundamental task in safety-critical applications, it is vital to prevent errors. To address this challenge, we propose a self-monitoring framework that allows for the perception system to perform plausibility checks at…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Accepted for the 35th IEEE Intelligent Vehicles Symposium (IV 2023), 9 pages

  35. arXiv:2307.01317  [pdf, other]

    cs.RO cs.LG

    Density-based Feasibility Learning with Normalizing Flows for Introspective Robotic Assembly

    Authors: Jianxiang Feng, Matan Atad, Ismael Rodríguez, Maximilian Durner, Stephan Günnemann, Rudolph Triebel

    Abstract: Machine Learning (ML) models in Robotic Assembly Sequence Planning (RASP) need to be introspective on the predicted solutions, i.e. whether they are feasible or not, to circumvent potential efficiency degradation. Previous works need both feasible and infeasible examples during training. However, the infeasible ones are hard to collect sufficiently when re-training is required for swift adaptation…

    Submitted 6 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted to the RSS 2023 Robotic Assembly Workshop

  36. arXiv:2306.17246  [pdf, other]

    cs.LG q-bio.BM stat.AP

    The power of motifs as inductive bias for learning molecular distributions

    Authors: Johanna Sommer, Leon Hetzel, David Lüdke, Fabian Theis, Stephan Günnemann

    Abstract: Machine learning for molecules holds great potential for efficiently exploring the vast chemical space and thus streamlining the drug discovery process by facilitating the design of new therapeutic molecules. Deep generative models have shown promising results for molecule generation, but the benefits of specific inductive biases for learning distributions over small graphs are unclear. Our study…

    Submitted 4 April, 2023; originally announced June 2023.

    Comments: Accepted for publication at the MLDD workshop, ICLR 2023

  37. arXiv:2306.15427  [pdf, other]

    cs.LG

    Adversarial Training for Graph Neural Networks: Pitfalls, Solutions, and New Directions

    Authors: Lukas Gosch, Simon Geisler, Daniel Sturm, Bertrand Charpentier, Daniel Zügner, Stephan Günnemann

    Abstract: Despite its success in the image domain, adversarial training did not (yet) stand out as an effective defense for Graph Neural Networks (GNNs) against graph structure perturbations. In the pursuit of fixing adversarial training (1) we show and overcome fundamental theoretical as well as practical limitations of the adopted graph learning setting in prior work; (2) we reveal that more flexible GNNs…

    Submitted 2 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Published as a conference paper at NeurIPS 2023

  38. arXiv:2306.14916  [pdf, other]

    physics.chem-ph cs.LG physics.comp-ph stat.ML

    Uncertainty Estimation for Molecules: Desiderata and Methods

    Authors: Tom Wollschläger, Nicholas Gao, Bertrand Charpentier, Mohamed Amine Ketata, Stephan Günnemann

    Abstract: Graph Neural Networks (GNNs) are promising surrogates for quantum mechanical calculations as they establish unprecedented low errors on collections of molecular dynamics (MD) trajectories. Thanks to their fast inference times they promise to accelerate computational chemistry applications. Unfortunately, despite low in-distribution (ID) errors, such GNNs might be horribly wrong for out-of-distribu…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Published as conference paper at ICML 2023

  39. arXiv:2306.01776  [pdf, other]

    physics.flu-dyn cs.LG

    From Zero to Turbulence: Generative Modeling for 3D Flow Simulation

    Authors: Marten Lienen, David Lüdke, Jan Hansen-Palmus, Stephan Günnemann

    Abstract: Simulations of turbulent flows in 3D are one of the most expensive simulations in computational fluid dynamics (CFD). Many works have been written on surrogate models to replace numerical solvers for fluid flows with faster, learned, autoregressive models. However, the intricacies of turbulence in three dimensions necessitate training these models with very small time steps, while generating reali…

    Submitted 14 March, 2024; v1 submitted 29 May, 2023; originally announced June 2023.

    Comments: Published at ICLR 2024

  40. arXiv:2305.19303  [pdf, other]

    physics.chem-ph cs.LG

    MAGNet: Motif-Agnostic Generation of Molecules from Shapes

    Authors: Leon Hetzel, Johanna Sommer, Bastian Rieck, Fabian Theis, Stephan Günnemann

    Abstract: Recent advances in machine learning for molecules exhibit great potential for facilitating drug discovery from in silico predictions. Most models for molecule generation rely on the decomposition of molecules into frequently occurring substructures (motifs), from which they generate novel compounds. While motif representations greatly aid in learning molecular distributions, such methods struggle…

    Submitted 7 November, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  41. arXiv:2305.10498  [pdf, other]

    cs.LG cs.SI

    Edge Directionality Improves Learning on Heterophilic Graphs

    Authors: Emanuele Rossi, Bertrand Charpentier, Francesco Di Giovanni, Fabrizio Frasca, Stephan Günnemann, Michael Bronstein

    Abstract: Graph Neural Networks (GNNs) have become the de-facto standard tool for modeling relational data. However, while many real-world graphs are directed, the majority of today's GNN models discard this information altogether by simply making the graph undirected. The reasons for this are historical: 1) many early variants of spectral GNNs explicitly required undirected graphs, and 2) the first benchma…

    Submitted 28 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  42. arXiv:2305.00851  [pdf, other]

    cs.LG

    Revisiting Robustness in Graph Machine Learning

    Authors: Lukas Gosch, Daniel Sturm, Simon Geisler, Stephan Günnemann

    Abstract: Many works show that node-level predictions of Graph Neural Networks (GNNs) are unrobust to small, often termed adversarial, changes to the graph structure. However, because manual inspection of a graph is difficult, it is unclear if the studied perturbations always preserve a core assumption of adversarial examples: that of unchanged semantic content. To address this problem, we introduce a more…

    Submitted 2 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at ICLR 2023. Preliminary version accepted as an oral at the NeurIPS 2022 TSRML workshop and at the NeurIPS 2022 ML safety workshop

  43. arXiv:2305.00472  [pdf, other]

    quant-ph cs.LG

    Efficient MILP Decomposition in Quantum Computing for ReLU Network Robustness

    Authors: Nicola Franco, Tom Wollschläger, Benedikt Poggel, Stephan Günnemann, Jeanette Miriam Lorenz

    Abstract: Emerging quantum computing technologies, such as Noisy Intermediate-Scale Quantum (NISQ) devices, offer potential advancements in solving mathematical optimization problems. However, limitations in qubit availability, noise, and errors pose challenges for practical implementation. In this study, we examine two decomposition methods for Mixed-Integer Linear Programming (MILP) designed to reduce the…

    Submitted 11 October, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

  44. arXiv:2304.02902  [pdf, other]

    stat.ML cs.LG

    Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry

    Authors: Jonas Gregor Wiese, Lisa Wimmer, Theodore Papamarkou, Bernd Bischl, Stephan Günnemann, David Rügamer

    Abstract: Bayesian inference in deep neural networks is challenging due to the high-dimensional, strongly multi-modal parameter posterior density landscape. Markov chain Monte Carlo approaches asymptotically recover the true posterior but are considered prohibitively expensive for large modern architectures. Local methods, which have emerged as a popular alternative, focus on specific parameter regions that…

    Submitted 6 April, 2023; originally announced April 2023.

  45. arXiv:2304.00897  [pdf, other]

    cs.LG

    Accuracy is not the only Metric that matters: Estimating the Energy Consumption of Deep Learning Models

    Authors: Johannes Getzner, Bertrand Charpentier, Stephan Günnemann

    Abstract: Modern machine learning models have started to consume incredible amounts of energy, thus incurring large carbon footprints (Strubell et al., 2019). To address this issue, we have created an energy estimation pipeline, which allows practitioners to estimate the energy needs of their models in advance, without actually running or training them. We accomplished this by collecting high-quality ener…

    Submitted 3 April, 2023; originally announced April 2023.

  46. arXiv:2303.14961  [pdf, other]

    cs.LG cs.AI cs.CV

    Diffusion Denoised Smoothing for Certified and Adversarial Robust Out-Of-Distribution Detection

    Authors: Nicola Franco, Daniel Korth, Jeanette Miriam Lorenz, Karsten Roscher, Stephan Guennemann

    Abstract: As the use of machine learning continues to expand, the importance of ensuring its safety cannot be overstated. A key concern in this regard is the ability to identify whether a given sample is from the training distribution, or is an "Out-Of-Distribution" (OOD) sample. In addition, adversaries can manipulate OOD samples in ways that lead a classifier to make a confident prediction. In this study,…

    Submitted 10 August, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  47. arXiv:2303.05796  [pdf, other]

    cs.LG

    Training, Architecture, and Prior for Deterministic Uncertainty Methods

    Authors: Bertrand Charpentier, Chenxiang Zhang, Stephan Günnemann

    Abstract: Accurate and efficient uncertainty estimation is crucial to build reliable Machine Learning (ML) models capable of providing calibrated uncertainty estimates, generalizing, and detecting Out-Of-Distribution (OOD) datasets. To this end, Deterministic Uncertainty Methods (DUMs) are a promising model family capable of performing uncertainty estimation in a single forward pass. This work investigates important de…

    Submitted 28 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  48. arXiv:2303.04791  [pdf, other]

    cs.LG cond-mat.mtrl-sci physics.chem-ph physics.comp-ph

    Ewald-based Long-Range Message Passing for Molecular Graphs

    Authors: Arthur Kosmala, Johannes Gasteiger, Nicholas Gao, Stephan Günnemann

    Abstract: Neural architectures that learn potential energy surfaces from molecular data have undergone fast improvement in recent years. A key driver of this success is the Message Passing Neural Network (MPNN) paradigm. Its favorable scaling with system size partly relies upon a spatial distance limit on messages. While this focus on locality is a useful inductive bias, it also impedes the learning of long…

    Submitted 6 June, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Published at the 40th International Conference on Machine Learning (ICML 2023)

  49. arXiv:2302.04168  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph quant-ph

    Generalizing Neural Wave Functions

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Recent neural network-based wave functions have achieved state-of-the-art accuracies in modeling ab-initio ground-state potential energy surface. However, these networks can only solve different spatial arrangements of the same set of atoms. To overcome this limitation, we present Graph-learned orbital embeddings (Globe), a neural network-based reparametrization method that can adapt neural wave f…

    Submitted 31 May, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: Published at the 40th International Conference on Machine Learning (ICML 2023)

  50. arXiv:2302.02829  [pdf, other]

    cs.LG cs.CR

    Collective Robustness Certificates: Exploiting Interdependence in Graph Neural Networks

    Authors: Jan Schuchardt, Aleksandar Bojchevski, Johannes Gasteiger, Stephan Günnemann

    Abstract: In tasks like node classification, image segmentation, and named-entity recognition we have a classifier that simultaneously outputs multiple predictions (a vector of labels) based on a single input, i.e. a single graph, image, or document respectively. Existing adversarial robustness certificates consider each prediction independently and are thus overly pessimistic for such tasks. They implicitl…

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at ICLR 2021 (https://openreview.net/forum?id=ULQdiUTHe3y). Uploaded to arxiv to fix Google Scholar indexing