Skip to main content

Showing 1–6 of 6 results for author: Sturm, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.14970  [pdf, other

    cond-mat.soft cs.LG

    Learning locally dominant force balances in active particle systems

    Authors: Dominik Sturm, Suryanarayana Maddu, Ivo F. Sbalzarini

    Abstract: We use a combination of unsupervised clustering and sparsity-promoting inference algorithms to learn locally dominant force balances that explain macroscopic pattern formation in self-organized active particle systems. The self-organized emergence of macroscopic patterns from microscopic interactions between self-propelled particles can be widely observed nature. Although hydrodynamic theories hel… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  2. arXiv:2306.15427  [pdf, other

    cs.LG

    Adversarial Training for Graph Neural Networks: Pitfalls, Solutions, and New Directions

    Authors: Lukas Gosch, Simon Geisler, Daniel Sturm, Bertrand Charpentier, Daniel Zügner, Stephan Günnemann

    Abstract: Despite its success in the image domain, adversarial training did not (yet) stand out as an effective defense for Graph Neural Networks (GNNs) against graph structure perturbations. In the pursuit of fixing adversarial training (1) we show and overcome fundamental theoretical as well as practical limitations of the adopted graph learning setting in prior work; (2) we reveal that more flexible GNNs… ▽ More

    Submitted 2 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Published as a conference paper at NeurIPS 2023

  3. arXiv:2305.00851  [pdf, other

    cs.LG

    Revisiting Robustness in Graph Machine Learning

    Authors: Lukas Gosch, Daniel Sturm, Simon Geisler, Stephan Günnemann

    Abstract: Many works show that node-level predictions of Graph Neural Networks (GNNs) are unrobust to small, often termed adversarial, changes to the graph structure. However, because manual inspection of a graph is difficult, it is unclear if the studied perturbations always preserve a core assumption of adversarial examples: that of unchanged semantic content. To address this problem, we introduce a more… ▽ More

    Submitted 2 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at ICLR 2023. Preliminary version accepted as an oral at the NeurIPS 2022 TSRML workshop and at the NeurIPS 2022 ML safety workshop

  4. arXiv:2210.10851  [pdf, other

    cs.AR

    Scalable Coherent Optical Crossbar Architecture using PCM for AI Acceleration

    Authors: Daniel Sturm, Sajjad Moazeni

    Abstract: Optical computing has been recently proposed as a new compute paradigm to meet the demands of future AI/ML workloads in datacenters and supercomputers. However, proposed implementations so far suffer from lack of scalability, large footprints and high power consumption, and incomplete system-level architectures to become integrated within existing datacenter architecture for real-world application… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 6 Pages, 8 figures

  5. arXiv:2107.00940  [pdf, ps, other

    cs.LG math.NA physics.comp-ph q-bio.QM

    Inverse-Dirichlet Weighting Enables Reliable Training of Physics Informed Neural Networks

    Authors: Suryanarayana Maddu, Dominik Sturm, Christian L. Müller, Ivo F. Sbalzarini

    Abstract: We characterize and remedy a failure mode that may arise from multi-scale dynamics with scale imbalances during training of deep neural networks, such as Physics Informed Neural Networks (PINNs). PINNs are popular machine-learning templates that allow for seamless integration of physical equation models with data. Their training amounts to solving an optimization problem over a weighted sum of dat… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  6. arXiv:2101.06182  [pdf, other

    math.NA cs.LG

    STENCIL-NET: Data-driven solution-adaptive discretization of partial differential equations

    Authors: Suryanarayana Maddu, Dominik Sturm, Bevan L. Cheeseman, Christian L. Müller, Ivo F. Sbalzarini

    Abstract: Numerical methods for approximately solving partial differential equations (PDE) are at the core of scientific computing. Often, this requires high-resolution or adaptive discretization grids to capture relevant spatio-temporal features in the PDE solution, e.g., in applications like turbulence, combustion, and shock propagation. Numerical approximation also requires knowing the PDE in order to co… ▽ More

    Submitted 18 January, 2021; v1 submitted 15 January, 2021; originally announced January 2021.