Showing 1–9 of 9 results for author: Sorrenson, P

  1. arXiv:2407.09297  [pdf, other]

    cs.LG stat.ML

    Learning Distances from Data with Normalizing Flows and Score Matching

    Authors: Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr, Ullrich Köthe

    Abstract: Density-based distances (DBDs) offer an elegant solution to the problem of metric learning. By defining a Riemannian metric which increases with decreasing probability density, shortest paths naturally follow the data manifold and points are clustered according to the modes of the data. We show that existing methods to estimate Fermat distances, a particular choice of DBD, suffer from poor converg…

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2312.09852  [pdf, other]

    cs.LG stat.ML

    Learning Distributions on Manifolds with Free-form Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Ullrich Köthe

    Abstract: Many real world data, particularly in the natural sciences and computer vision, lie on known Riemannian manifolds such as spheres, tori or the group of rotation matrices. The predominant approaches to learning a distribution on such a manifold require solving a differential equation in order to sample from the model and evaluate densities. The resulting sampling times are slowed down by a high num…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Preprint, under review

  3. arXiv:2310.16624  [pdf, other]

    cs.LG stat.ML

    Free-form Flows: Make Any Architecture a Normalizing Flow

    Authors: Felix Draxler, Peter Sorrenson, Lea Zimmermann, Armand Rousselot, Ullrich Köthe

    Abstract: Normalizing Flows are generative models that directly maximize the likelihood. Previously, the design of normalizing flows was largely constrained by the need for analytical invertibility. We overcome this constraint by a training procedure that uses an efficient estimator for the gradient of the change of variables formula. This enables any dimension-preserving neural network to serve as a genera…

    Submitted 24 April, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Camera-ready version: accepted at AISTATS 2024

  4. arXiv:2306.01843  [pdf, other]

    cs.LG

    Lifting Architectural Constraints of Injective Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Lea Zimmermann, Ullrich Köthe

    Abstract: Normalizing Flows explicitly maximize a full-dimensional likelihood on the training data. However, real data is typically only supported on a lower-dimensional manifold, leading the model to expend significant compute on modeling noise. Injective Flows fix this by jointly learning a manifold and the distribution on it. So far, they have been limited by restrictive architectures and/or high computat…

    Submitted 27 June, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Camera-ready version: accepted to ICLR 2024

  5. arXiv:2305.10475  [pdf, other]

    hep-ph

    Jet Diffusion versus JetGPT -- Modern Networks for the LHC

    Authors: Anja Butter, Nathan Huetsch, Sofia Palacios Schweitzer, Tilman Plehn, Peter Sorrenson, Jonas Spinner

    Abstract: We introduce two diffusion models and an autoregressive transformer for LHC physics simulations. Bayesian versions allow us to control the networks and capture training uncertainties. After illustrating their different density estimation methods for simple toy models, we discuss their advantages for Z plus jets event generation. While diffusion networks excel through their precision, the transform…

    Submitted 22 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 37 pages, 17 figures

  6. arXiv:2206.14225  [pdf, other]

    hep-ph

    A Normalized Autoencoder for LHC Triggers

    Authors: Barry M. Dillon, Luigi Favaro, Tilman Plehn, Peter Sorrenson, Michael Krämer

    Abstract: Autoencoders are an effective analysis tool for the LHC, as they represent one of its main goals of finding physics beyond the Standard Model. The key challenge is that out-of-distribution anomaly searches based on the compressibility of features do not apply to the LHC, while existing density-based searches lack performance. We present the first autoencoder which identifies anomalous jets symmetri…

    Submitted 22 June, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: 26 pages, 11 figures; update based on referees report

  7. Symmetries, Safety, and Self-Supervision

    Authors: Barry M. Dillon, Gregor Kasieczka, Hans Olischlager, Tilman Plehn, Peter Sorrenson, Lorenz Vogel

    Abstract: Collider searches face the challenge of defining a representation of high-dimensional data such that physical symmetries are manifest, the discriminating features are retained, and the choice of representation is new-physics agnostic. We introduce JetCLR to solve the mapping from low-level data to optimized observables through self-supervised contrastive learning. As an example, we construct a data…

    Submitted 9 August, 2021; originally announced August 2021.

    Journal ref: SciPost Phys. 12, 188 (2022)

  8. Better Latent Spaces for Better Autoencoders

    Authors: Barry M. Dillon, Tilman Plehn, Christof Sauer, Peter Sorrenson

    Abstract: Autoencoders as tools behind anomaly searches at the LHC have the structural problem that they only work in one direction, extracting jets with higher complexity but not the other way around. To address this, we derive classifiers from the latent space of (variational) autoencoders, specifically in Gaussian mixture and Dirichlet latent spaces. In particular, the Dirichlet setup solves the problem…

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 25 pages

    Journal ref: SciPost Phys. 11, 061 (2021)

  9. arXiv:2001.04872  [pdf, other]

    cs.LG stat.ML

    Disentanglement by Nonlinear ICA with General Incompressible-flow Networks (GIN)

    Authors: Peter Sorrenson, Carsten Rother, Ullrich Köthe

    Abstract: A central question of representation learning asks under which conditions it is possible to reconstruct the true latent variables of an arbitrarily complex generative process. Recent breakthrough work by Khemakhem et al. (2019) on nonlinear ICA has answered this question for a broad class of conditional generative processes. We extend this important result in a direction relevant for application t…

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 23 pages, 15 figures, ICLR 2020