Skip to main content

Showing 1–50 of 52 results for author: Richter, L

.
  1. arXiv:2407.07873  [pdf, other

    cs.LG math.DS math.OC math.PR stat.ML

    Dynamical Measure Transport and Neural PDE Solvers for Sampling

    Authors: **gtong Sun, Julius Berner, Lorenz Richter, Marius Zeinhofer, Johannes Müller, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: The task of sampling from a probability density can be approached as transporting a tractable density function to the target, known as dynamical measure transport. In this work, we tackle it through a principled unified framework using deterministic or stochastic evolutions described by partial differential equations (PDEs). This framework incorporates prior trajectory-based sampling methods, such… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2406.11292  [pdf, other

    cs.RO

    Daedalus 2: Autorotation Entry, Descent and Landing Experiment on REXUS29

    Authors: Philip Bergmann, Clemens Riegler, Zuri Klaschka, Tobias Herbst, Jan M. Wolf, Maximilian Reigl, Niels Koch, Sarah Menninger, Jan von Pichowski, Cedric Bös, Bence Barthó, Frederik Dunschen, Johanna Mehringer, Ludwig Richter, Lennart Werner

    Abstract: In recent years, interplanetary exploration has gained significant momentum, leading to a focus on the development of launch vehicles. However, the critical technology of edl mechanisms has not received the same level of attention and remains less mature and capable. To address this gap, we took advantage of the REXUS program to develop a pioneering edl mechanism. We propose an alternative to conv… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 figures

  3. arXiv:2406.04940  [pdf, other

    cs.LG cs.AI

    CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux Modelling

    Authors: Matthew Fortier, Mats L. Richter, Oliver Sonnentag, Chris Pal

    Abstract: Terrestrial carbon fluxes provide vital information about our biosphere's health and its capacity to absorb anthropogenic CO$_2$ emissions. The importance of predicting carbon fluxes has led to the emerging field of data-driven carbon flux modelling (DDCFM), which uses statistical techniques to predict carbon fluxes from biophysical data. However, the field lacks a standardized dataset to promote… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 9 content pages, 11 reference pages, 9 appendix pages

  4. arXiv:2405.03549  [pdf, other

    stat.ML cs.LG math.DS math.PR

    Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models

    Authors: Ludwig Winkler, Lorenz Richter, Manfred Opper

    Abstract: Generative modeling via stochastic processes has led to remarkable empirical results as well as to recent advances in their theoretical understanding. In principle, both space and time of the processes can be discrete or continuous. In this work, we study time-continuous Markov jump processes on discrete state spaces and investigate their correspondence to state-continuous diffusion processes give… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  5. arXiv:2403.15881  [pdf, other

    cs.LG stat.ML

    Fast and Unified Path Gradient Estimators for Normalizing Flows

    Authors: Lorenz Vaitl, Ludwig Winkler, Lorenz Richter, Pan Kessel

    Abstract: Recent work shows that path gradient estimators for normalizing flows have lower variance compared to standard estimators for variational inference, resulting in improved training. However, they are often prohibitively more expensive from a computational point of view and cannot be applied to maximum likelihood training in a scalable manner, which severely hinders their widespread adoption. In thi… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  6. arXiv:2403.08763  [pdf, other

    cs.LG cs.AI cs.CL

    Simple and Scalable Strategies to Continually Pre-train Large Language Models

    Authors: Adam Ibrahim, Benjamin Thérien, Kshitij Gupta, Mats L. Richter, Quentin Anthony, Timothée Lesort, Eugene Belilovsky, Irina Rish

    Abstract: Large language models (LLMs) are routinely pre-trained on billions of tokens, only to start the process over again once new data becomes available. A much more efficient solution is to continually pre-train these models, saving significant compute compared to re-training. However, the distribution shift induced by new data typically results in degraded performance on previous data or poor adaptati… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  7. arXiv:2312.07341  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Unconventional crystal structure of the high-pressure superconductor La$_3$Ni$_2$O$_7$

    Authors: Pascal Puphal, Pascal Reiss, Niklas Enderlein, Yu-Mi Wu, Giniyat Khaliullin, Vignesh Sundaramurthy, Tim Priessnitz, Manuel Knauft, Lea Richter, Masahiko Isobe, Peter A. van Aken, Hidenori Takagi, Bernhard Keimer, Y. Eren Suyolcu, Björn Wehinger, Philipp Hansmann, Matthias Hepting

    Abstract: The discovery of high-temperature superconductivity in La$_3$Ni$_2$O$_7$ at pressures above 14 GPa has spurred extensive research efforts. Yet, fundamental aspects of the superconducting phase, including the possibility of a filamentary character, are currently subjects of controversial debates. Conversely, a crystal structure with NiO$_6$ octahedral bilayers stacked along the $c$-axis direction w… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  8. arXiv:2312.07275  [pdf, other

    astro-ph.GA

    The SARAO MeerKAT 1.3 GHz Galactic Plane Survey

    Authors: S. Goedhart, W. D. Cotton, F. Camilo, M. A. Thompson, G. Umana, M. Bietenholz, P. A. Woudt, L. D. Anderson, C. Bordiu, D. A. H. Buckley, C. S. Buemi, F. Bufano, F. Cavallaro, H. Chen, J. O. Chibueze, D. Egbo, B. S. Frank, M. G. Hoare, A. Ingallinera, T. Irabor, R. C. Kraan-Korteweg, S. Kurapati, P. Leto, S. Loru, M. Mutale , et al. (105 additional authors not shown)

    Abstract: We present the SARAO MeerKAT Galactic Plane Survey (SMGPS), a 1.3 GHz continuum survey of almost half of the Galactic Plane (251°$\le l \le$ 358°and 2°$\le l \le$ 61°at $|b| \le 1.5°$). SMGPS is the largest, most sensitive and highest angular resolution 1 GHz survey of the Plane yet carried out, with an angular resolution of 8" and a broadband RMS sensitivity of $\sim$10--20 $μ$ Jy/beam. Here we d… ▽ More

    Submitted 2 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in MNRAS. The data release is live and links can be found in the Data Availability Statement in the paper

  9. arXiv:2311.04100  [pdf, other

    quant-ph

    Graph-controlled Permutation Mixers in QAOA for the Flexible Job-Shop Problem

    Authors: Lilly Palackal, Leonhard Richter, Maximilian Hess

    Abstract: One of the most promising attempts towards solving optimization problems with quantum computers in the noisy intermediate scale era of quantum computing are variational quantum algorithms. The Quantum Alternating Operator Ansatz provides an algorithmic framework for constrained, combinatorial optimization problems. As opposed to the better known standard QAOA protocol, the constraints of the optim… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to the HICSS 2024 for possible publication

  10. arXiv:2308.04014  [pdf, other

    cs.CL cs.LG

    Continual Pre-Training of Large Language Models: How to (re)warm your model?

    Authors: Kshitij Gupta, Benjamin Thérien, Adam Ibrahim, Mats L. Richter, Quentin Anthony, Eugene Belilovsky, Irina Rish, Timothée Lesort

    Abstract: Large language models (LLMs) are routinely pre-trained on billions of tokens, only to restart the process over again once new data becomes available. A much cheaper and more efficient solution would be to enable the continual pre-training of these models, i.e. updating pre-trained models with new data instead of re-training them from scratch. However, the distribution shift induced by novel data t… ▽ More

    Submitted 6 September, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  11. arXiv:2307.15496  [pdf, other

    cs.LG math.NA math.PR stat.ML

    From continuous-time formulations to discretization schemes: tensor trains and robust regression for BSDEs and parabolic PDEs

    Authors: Lorenz Richter, Leon Sallandt, Nikolas Nüsken

    Abstract: The numerical approximation of partial differential equations (PDEs) poses formidable challenges in high dimensions since classical grid-based methods suffer from the so-called curse of dimensionality. Recent attempts rely on a combination of Monte Carlo methods and variational formulations, using neural networks for function approximation. Extending previous work (Richter et al., 2021), we argue… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  12. arXiv:2307.02454  [pdf, other

    cs.LG

    Transgressing the boundaries: towards a rigorous understanding of deep learning and its (non-)robustness

    Authors: Carsten Hartmann, Lorenz Richter

    Abstract: The recent advances in machine learning in various fields of applications can be largely attributed to the rise of deep learning (DL) methods and architectures. Despite being a key technology behind autonomous cars, image processing, speech recognition, etc., a notorious problem remains the lack of theoretical understanding of DL and related interpretability and (adversarial) robustness issues. Un… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  13. arXiv:2307.01198  [pdf, other

    cs.LG math.OC math.PR stat.ML

    Improved sampling via learned diffusions

    Authors: Lorenz Richter, Julius Berner

    Abstract: Recently, a series of papers proposed deep learning-based approaches to sample from target distributions using controlled diffusion processes, being trained only on the unnormalized target densities without access to samples. Building on previous work, we identify these approaches as special cases of a generalized Schrödinger bridge problem, seeking a stochastic evolution between a given prior dis… ▽ More

    Submitted 23 May, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted at ICLR 2024

    Journal ref: International Conference on Learning Representations, 2024

  14. arXiv:2306.00637  [pdf, other

    cs.CV

    Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

    Authors: Pablo Pernias, Dominic Rampas, Mats L. Richter, Christopher J. Pal, Marc Aubreville

    Abstract: We introduce Würstchen, a novel architecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models. A key contribution of our work is to develop a latent diffusion technique in which we learn a detailed but extremely compact semantic image representation used to guide the diffusion process. This highly… ▽ More

    Submitted 29 September, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Corresponding to "Würstchen v2"

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR), 2024

  15. arXiv:2301.06684  [pdf, other

    math.LO

    Co-analytic Counterexamples to Marstrand's Projection Theorem

    Authors: Linus Richter

    Abstract: Assuming $V=L$, we construct a plane set $E$ of Hausdorff dimension $1$ whose every orthogonal projection onto straight lines through the origin has Hausdorff dimension $0$. This is a counterexample to J. M. Marstrand's seminal projection theorem. While counterexamples had already been constructed decades ago, initially by R. O. Davies, the novelty of our result lies in the fact that $E$ is co-ana… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: 35 pages, 3 figures

    MSC Class: 03D32 (Primary) 28A75; 28A80 (Secondary)

  16. arXiv:2212.12364  [pdf, other

    physics.ins-det nucl-ex

    The new APD-Based Readout of the Crystal Barrel Calorimeter -- An Overview

    Authors: CBELSA/TAPS Collaboration, :, C. Honisch, P. Klassen, J. Müllers, M. Urban, F. Afzal, J. Bieling, S. Ciupka, J. Hartmann, P. Hoffmeister, M. Lang, D. Schaab, C. Schmidt, M. Steinacher, D. Walther, R. Beck, K. -T. Brinkmann, V. Crede, H. Dutz, D. Elsner, W. Erni, E. Fix, F. Frommberger, M. Grüner , et al. (26 additional authors not shown)

    Abstract: The Crystal Barrel is an electromagnetic calorimeter consisting of 1380 CsI(Tl) scintillators, and is currently installed at the CBELSA/TAPS experiment where it is used to detect decay products from photoproduction of mesons. The readout of the Crystal Barrel has been upgraded in order to integrate the detector into the first level of the trigger and to increase its sensitivity for neutral final s… ▽ More

    Submitted 16 January, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

  17. arXiv:2211.14487  [pdf, other

    cs.CV cs.AI cs.LG

    Receptive Field Refinement for Convolutional Neural Networks Reliably Improves Predictive Performance

    Authors: Mats L. Richter, Christopher Pal

    Abstract: Minimal changes to neural architectures (e.g. changing a single hyperparameter in a key layer), can lead to significant gains in predictive performance in Convolutional Neural Networks (CNNs). In this work, we present a new approach to receptive field analysis that can yield these types of theoretical and empirical performance gains across twenty well-known CNN architectures examined in our experi… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

  18. arXiv:2211.11183  [pdf, other

    cs.LG

    Causal Fairness Assessment of Treatment Allocation with Electronic Health Records

    Authors: Linying Zhang, Lauren R. Richter, Yixin Wang, Anna Ostropolets, Noemie Elhadad, David M. Blei, George Hripcsak

    Abstract: Healthcare continues to grapple with the persistent issue of treatment disparities, sparking concerns regarding the equitable allocation of treatments in clinical practice. While various fairness metrics have emerged to assess fairness in decision-making processes, a growing focus has been on causality-based fairness concepts due to their capacity to mitigate confounding effects and reason about b… ▽ More

    Submitted 7 January, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

  19. arXiv:2211.01517  [pdf

    cond-mat.mtrl-sci cond-mat.soft

    Hydration of a side-chain-free n-type semiconducting ladder polymer driven by electrochemical do**

    Authors: Jiajie Guo, Lucas Q. Flagg, Duyen K. Tran, Shinya E. Chen, Ruipeng Li, Nagesh B. Kolhe, Rajiv Giridharagopal, Samson A. Jenekhe, Lee J. Richter, David S. Ginger

    Abstract: We study the organic electrochemical transistors (OECTs) performance of the ladder polymer, poly(benzimidazobenzophenanthroline) (BBL) in an attempt to better understand how an apparently hydrophobic side-chain-free polymer is able to operate as an OECT with favorable redox kinetics in an aqueous environment. We examine two BBLs of different molecular masses from different sources. Both BBLs show… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 24 pages, 5 figures

  20. arXiv:2211.01364  [pdf, other

    cs.LG math.OC stat.ML

    An optimal control perspective on diffusion-based generative modeling

    Authors: Julius Berner, Lorenz Richter, Karen Ullrich

    Abstract: We establish a connection between stochastic optimal control and generative models based on stochastic differential equations (SDEs), such as recently developed diffusion probabilistic models. In particular, we derive a Hamilton-Jacobi-Bellman equation that governs the evolution of the log-densities of the underlying SDE marginals. This perspective allows to transfer methods from optimal control t… ▽ More

    Submitted 26 March, 2024; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted for oral presentation at NeurIPS 2022 Workshop on Score-Based Methods

    Journal ref: Transactions on Machine Learning Research, 2024

  21. arXiv:2206.10588  [pdf, other

    cs.LG math.NA stat.ML

    Robust SDE-Based Variational Formulations for Solving Linear PDEs via Deep Learning

    Authors: Lorenz Richter, Julius Berner

    Abstract: The combination of Monte Carlo methods and deep learning has recently led to efficient algorithms for solving partial differential equations (PDEs) in high dimensions. Related learning problems are often stated as variational formulations based on associated stochastic differential equations (SDEs), which allow the minimization of corresponding losses using gradient-based optimization methods. In… ▽ More

    Submitted 5 August, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Accepted at ICML 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, 2022, pp. 18649-18666

  22. Improving control based importance sampling strategies for metastable diffusions via adapted metadynamics

    Authors: Enric Ribera Borrell, Jannes Quer, Lorenz Richter, Christof Schütte

    Abstract: Sampling rare events in metastable dynamical systems is often a computationally expensive task and one needs to resort to enhanced sampling methods such as importance sampling. Since we can formulate the problem of finding optimal importance sampling controls as a stochastic optimization problem, this then brings additional numerical challenges and the convergence of corresponding algorithms might… ▽ More

    Submitted 3 October, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    MSC Class: 49-XX; 62-XX; 68-XX

  23. arXiv:2112.03749  [pdf, other

    math.NA math.PR stat.ML

    Interpolating between BSDEs and PINNs: deep learning for elliptic and parabolic boundary value problems

    Authors: Nikolas Nüsken, Lorenz Richter

    Abstract: Solving high-dimensional partial differential equations is a recurrent challenge in economics, science and engineering. In recent years, a great number of computational approaches have been developed, most of them relying on a combination of Monte Carlo sampling and deep learning based approximation. For elliptic and parabolic problems, existing methods can broadly be classified into those resting… ▽ More

    Submitted 29 January, 2023; v1 submitted 7 December, 2021; originally announced December 2021.

  24. Observation of a structure in the M$_{pη}$ invariant mass distribution near 1700 MeV/$c^2$ in the $\mathbf{γp \rightarrow p π^0 η} $ reaction

    Authors: V. Metag, M. Nanova, J. Hartmann, P. Mahlberg, F. Afzal, C. Bartels, D. Bayadilov, R. Beck, M. Becker, E. Blanke, K. -T. Brinkmann, S. Ciupka, V. Crede, M. Dieterle, H. Dutz, D. Elsner, F. Frommberger, A. Gridnev, M. Gottschall, M. Grüner, Ch. Hammann, J. Hannappel, W. Hillert, J. Hoff, Ph. Hoffmeister , et al. (52 additional authors not shown)

    Abstract: The reaction $γp \rightarrow p π^0 η$ has been studied with the CBELSA/TAPS detector at the electron stretcher accelerator ELSA in Bonn for incident photon energies from threshold up to 3.1 GeV. This paper has been motivated by the recently claimed observation of a narrow structure in the M$_{Nη}$ invariant mass distribution at a mass of 1678 MeV/$c^2$. The existence of this structure cannot be co… ▽ More

    Submitted 18 November, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 16 pages, 18 figure

  25. arXiv:2106.12307  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Should You Go Deeper? Optimizing Convolutional Neural Network Architectures without Training by Receptive Field Analysis

    Authors: Mats L. Richter, Julius Schöning, Anna Wiedenroth, Ulf Krumnack

    Abstract: When optimizing convolutional neural networks (CNN) for a specific image-based task, specialists commonly overshoot the number of convolutional layers in their designs. By implication, these CNNs are unnecessarily resource intensive to train and deploy, with diminishing beneficial effects on the predictive performance. The features a convolutional layer can process are strictly limited by its re… ▽ More

    Submitted 5 October, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Preprint

  26. arXiv:2106.09526  [pdf, other

    cs.LG cs.AI

    Exploring the Properties and Evolution of Neural Network Eigenspaces during Training

    Authors: Mats L. Richter, Leila Malihi, Anne-Kathrin Patricia Windler, Ulf Krumnack

    Abstract: In this work we explore the information processing inside neural networks using logistic regression probes \cite{probes} and the saturation metric \cite{featurespace_saturation}. We show that problem difficulty and neural network capacity affect the predictive performance in an antagonistic manner, opening the possibility of detecting over- and under-parameterization of neural networks for a given… ▽ More

    Submitted 27 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

  27. arXiv:2102.11830  [pdf, other

    stat.ML cs.LG math.NA math.PR

    Solving high-dimensional parabolic PDEs using the tensor train format

    Authors: Lorenz Richter, Leon Sallandt, Nikolas Nüsken

    Abstract: High-dimensional partial differential equations (PDEs) are ubiquitous in economics, science and engineering. However, their numerical treatment poses formidable challenges since traditional grid-based methods tend to be frustrated by the curse of dimensionality. In this paper, we argue that tensor trains provide an appealing approximation framework for parabolic PDEs: the combination of reformulat… ▽ More

    Submitted 17 July, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

  28. arXiv:2102.09606  [pdf, other

    math.ST math.NA math.PR

    Nonasymptotic bounds for suboptimal importance sampling

    Authors: Carsten Hartmann, Lorenz Richter

    Abstract: Importance sampling is a popular variance reduction method for Monte Carlo estimation, where a notorious question is how to design good proposal distributions. While in most cases optimal (zero-variance) estimators are theoretically possible, in practice only suboptimal proposal distributions are available and it can often be observed numerically that those can reduce statistical performance signi… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

  29. Size Matters

    Authors: Mats L. Richter, Wolf Byttner, Ulf Krumnack, Ludwdig Schallner, Justin Shenk

    Abstract: Fully convolutional neural networks can process input of arbitrary size by applying a combination of downsampling and pooling. However, we find that fully convolutional image classifiers are not agnostic to the input size but rather show significant differences in performance: presenting the same image at different scales can result in different outcomes. A closer look reveals that there is no sim… ▽ More

    Submitted 9 February, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: Preprint

    Journal ref: Artificial Neural Networks and Machine Learning ICANN 2021 133-144

  30. arXiv:2010.10436  [pdf, other

    stat.ML cs.LG math.ST

    VarGrad: A Low-Variance Gradient Estimator for Variational Inference

    Authors: Lorenz Richter, Ayman Boustati, Nikolas Nüsken, Francisco J. R. Ruiz, Ömer Deniz Akyildiz

    Abstract: We analyse the properties of an unbiased gradient estimator of the ELBO for variational inference, based on the score function method with leave-one-out control variates. We show that this gradient estimator can be obtained using a new loss, defined as the variance of the log-ratio between the exact posterior and the variational approximation, which we call the $\textit{log-variance loss}$. Under… ▽ More

    Submitted 29 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

  31. arXiv:2008.12288  [pdf, other

    math.OC math.NA

    Model Order Reduction for (Stochastic-) Delay Equations With Error Bounds

    Authors: Simon Becker, Lorenz Richter

    Abstract: We analyze a structure-preserving model order reduction technique for delay and stochastic delay equations based on the balanced truncation method and provide a system theoretic interpretation. Transferring error bounds based on Hankel operators to delay systems, we find error estimates for the difference between the dynamics of the full and reduced model. This analysis also yields new error bound… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

  32. arXiv:2006.08679  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Feature Space Saturation during Training

    Authors: Mats L. Richter, Justin Shenk, Wolf Byttner, Anders Arpteg, Mikael Huss

    Abstract: We propose layer saturation - a simple, online-computable method for analyzing the information processing in neural networks. First, we show that a layer's output can be restricted to the eigenspace of its variance matrix without performance loss. We propose a computationally lightweight method for approximating the variance matrix during training. From the dimension of its lossless eigenspace we… ▽ More

    Submitted 22 November, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 45 pages, 41 figures; author order changed in v5 to reflect additional contribution; for code see http://github.com/MLRichter/phd-lab and http://github.com/delve-team/delve

    MSC Class: 68T07 ACM Class: I.2.6

    Journal ref: British Machine Vision Conference (BMVC) 2021

  33. arXiv:2005.05409  [pdf, other

    math.OC cs.LG math.NA math.PR stat.ML

    Solving high-dimensional Hamilton-Jacobi-Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space

    Authors: Nikolas Nüsken, Lorenz Richter

    Abstract: Optimal control of diffusion processes is intimately connected to the problem of solving certain Hamilton-Jacobi-Bellman equations. Building on recent machine learning inspired approaches towards high-dimensional PDEs, we investigate the potential of $\textit{iterative diffusion optimisation}$ techniques, in particular considering applications in importance sampling and rare event simulation, and… ▽ More

    Submitted 29 January, 2023; v1 submitted 11 May, 2020; originally announced May 2020.

  34. The 1.28 GHz MeerKAT DEEP2 Image

    Authors: T. Mauch, W. D. Cotton, J. J. Condon, A. M. Matthews, T. D. Abbott, R. M. Adam, M. A. Aldera, K. M. B. Asad, E. F. Bauermeister, T. G. H. Bennett, H. Bester, D. H. Botha, L. R. S. Brederode, Z. B. Brits, S. J. Buchner, J. P. Burger, F. Camilo, J. M. Chalmers, T. Cheetham, D. de Villiers, M. S. de Villiers, M. A. Dikgale-Mahlakoana, L. J. du Toit, S. W. P. Esterhuyse, G. Fadana , et al. (79 additional authors not shown)

    Abstract: We present the confusion-limited 1.28 GHz MeerKAT DEEP2 image covering one $\approx 68'$ FWHM primary beam area with $7.6''$ FWHM resolution and $0.55 \pm 0.01$ $μ$Jy/beam rms noise. Its J2000 center position $α=04^h 13^m 26.4^s$, $δ=-80^\circ 00' 00''$ was selected to minimize artifacts caused by bright sources. We introduce the new 64-element MeerKAT array and describe commissioning observations… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 20 pages, 18 figures. Accepted for publication in ApJ

  35. arXiv:1912.06113  [pdf, other

    math.OC math.DS math.NA

    Error bounds for model reduction of feedback-controlled linear stochastic dynamics on Hilbert spaces

    Authors: Simon Becker, Carsten Hartmann, Martin Redmann, Lorenz Richter

    Abstract: We analyze structure-preserving model order reduction methods for Ornstein-Uhlenbeck processes and linear S(P)DEs with multiplicative noise based on balanced truncation. For the first time, we include in this study the analysis of non-zero initial conditions. We moreover allow for feedback-controlled dynamics for solving stochastic optimal control problems with reduced-order models and prove novel… ▽ More

    Submitted 17 March, 2022; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: comments welcome

  36. arXiv:1907.08589  [pdf, other

    cs.LG stat.ML

    Spectral Analysis of Latent Representations

    Authors: Justin Shenk, Mats L. Richter, Anders Arpteg, Mikael Huss

    Abstract: We propose a metric, Layer Saturation, defined as the proportion of the number of eigenvalues needed to explain 99% of the variance of the latent representations, for analyzing the learned representations of neural network layers. Saturation is based on spectral analysis and can be computed efficiently, making live analysis of the representations practical during training. We provide an outlook fo… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: 13 pages, 16 figures, code: https://github.com/delve-team/delve

  37. arXiv:1901.09195  [pdf, other

    math.PR math.OC

    Variational approach to rare event simulation using least-squares regression

    Authors: Carsten Hartmann, Omar Kebiri, Lara Neureither, Lorenz Richter

    Abstract: We propose an adaptive importance sampling scheme for the simulation of rare events when the underlying dynamics is given by a diffusion. The scheme is based on a Gibbs variational principle that is used to determine the optimal (i.e. zero-variance) change of measure and exploits the fact that the latter can be rephrased as a stochastic optimal control problem. The control problem can be solved by… ▽ More

    Submitted 16 April, 2019; v1 submitted 26 January, 2019; originally announced January 2019.

    Comments: 28 pages, 7 figures

    MSC Class: 65C05 (primary); 65C30; 92C40 (secondary)

  38. The Stripe 82 1-2 GHz Very Large Array Snapshot Survey: Multiwavelength Counterparts

    Authors: Matthew Prescott, I. H. Whittam, M. J. Jarvis, K. McAlpine, L. L. Richter, S. Fine, T. Mauch, I. Heywood, M. Vaccari

    Abstract: We have combined spectrosopic and photometric data from the Sloan Digital Sky Survey (SDSS) with $1.4$ GHz radio observations, conducted as part of the Stripe 82 $1-2$ GHz Snapshot Survey using the Karl G. Jansky Very Large Array (VLA), which covers $\sim100$ sq degrees, to a flux limit of 88 $μ$Jy rms. Cross-matching the $11\,768$ radio source components with optical data via visual inspection re… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: 17 pages, 19 figures. Resubmitted to MNRAS after the initial comments

  39. Revival of the magnetar PSR J1622-4950: observations with MeerKAT, Parkes, XMM-Newton, Swift, Chandra, and NuSTAR

    Authors: F. Camilo, P. Scholz, M. Serylak, S. Buchner, M. Merryfield, V. M. Kaspi, R. F. Archibald, M. Bailes, A. Jameson, W. van Straten, J. Sarkissian, J. E. Reynolds, S. Johnston, G. Hobbs, T. D. Abbott, R. M. Adam, G. B. Adams, T. Alberts, R. Andreas, K. M. B. Asad, D. E. Baker, T. Baloyi, E. F. Bauermeister, T. Baxana, T. G. H. Bennett , et al. (183 additional authors not shown)

    Abstract: New radio (MeerKAT and Parkes) and X-ray (XMM-Newton, Swift, Chandra, and NuSTAR) observations of PSR J1622-4950 indicate that the magnetar, in a quiescent state since at least early 2015, reactivated between 2017 March 19 and April 5. The radio flux density, while variable, is approximately 100x larger than during its dormant state. The X-ray flux one month after reactivation was at least 800x la… ▽ More

    Submitted 5 April, 2018; originally announced April 2018.

    Comments: Published in ApJ (2018 April 5); 13 pages, 4 figures

    Journal ref: ApJ 856 (2018) 180

  40. arXiv:1709.01289  [pdf, other

    astro-ph.GA

    The MeerKAT Fornax Survey

    Authors: P. Serra, W. J. G. de Blok, G. L. Bryan, S. Colafrancesco, R. -J. Dettmar, B. S. Frank, F. Govoni, G. I. G. Józsa, R. C. Kraan-Korteweg, S. I. Loubser, F. M. Maccagni, M. Murgia, T. A. Oosterloo, R. F. Peletier, R. Pizzo, M. Ramatsoku, L. Richter, M. W. L. Smith, S. C. Trager, J. H. van Gorkom, M. A. W. Verheijen

    Abstract: We present the science case and observations plan of the MeerKAT Fornax Survey, an HI and radio continuum survey of the Fornax galaxy cluster to be carried out with the SKA precursor MeerKAT. Fornax is the second most massive cluster within 20 Mpc and the largest nearby cluster in the southern hemisphere. Its low X-ray luminosity makes it representative of the environment where most galaxies live… ▽ More

    Submitted 5 September, 2017; originally announced September 2017.

    Comments: Proceedings of Science, "MeerKAT Science: On the Pathway to the SKA", Stellenbosch, 25-27 May 2016

  41. arXiv:1703.09921  [pdf, other

    astro-ph.HE astro-ph.GA hep-ph

    Dark matter in the Reticulum II dSph: a radio search

    Authors: Marco Regis, Laura Richter, Sergio Colafrancesco

    Abstract: We present a deep radio search in the Reticulum II dwarf spheroidal (dSph) galaxy performed with the Australia Telescope Compact Array. Observations were conducted at 16 cm wavelength, with an rms sensitivity of 0.01 mJy/beam, and with the goal of searching for synchrotron emission induced by annihilation or decay of weakly interacting massive particles (WIMPs). Data were complemented with observa… ▽ More

    Submitted 25 July, 2017; v1 submitted 29 March, 2017; originally announced March 2017.

    Comments: 24 pages, 13 figure panels, 2 tables. v2 to match published version

    Journal ref: JCAP 07 (2017) 025

  42. Engineering and Science Highlights of the KAT-7 Radio Telescope

    Authors: A. R. Foley, T. Alberts, R P. Armstrong, A. Barta, E. F. Bauermeister, H. Bester, S. Blose, R. S. Booth, D. H. Botha, S. J. Buchner, C. Carignan, T. Cheetham, K. Cloete, G. Coreejes, R. C. Crida, S. D. Cross, F. Curtolo, A. Dikgale, M. S. de Villiers, L. J. du Toit, S. W. P. Esterhuyse, B. Fanaroff, R. P. Fender, M. Fijalkowski, D. Fourie , et al. (78 additional authors not shown)

    Abstract: The construction of the KAT-7 array in the Karoo region of the Northern Cape in South Africa was intended primarily as an engineering prototype for technologies and techniques applicable to the MeerKAT telescope. This paper looks at the main engineering and scien- tific highlights from this effort, and discusses their applicability to both MeerKAT and other next-generation radio telescopes. In par… ▽ More

    Submitted 9 June, 2016; originally announced June 2016.

  43. arXiv:1605.09572  [pdf, ps, other

    astro-ph.SR astro-ph.GA

    Simultaneous VLBA polarimetric observations of the v=$\{$1,2$\}$ J=1-0 and v=1, J=2-1 SiO maser emission toward VY CMa II: component-level polarization analysis

    Authors: L. Richter, A. Kemball, J. Jonas

    Abstract: This paper presents a component-level comparison of the polarized v=1 J =1-0, v=2 J=1-0 and v=1 J=2-1 SiO maser emission towards the supergiant star VY CMa at milliarcsecond-scale, as observed using the VLBA at $λ=7$mm and $λ=3$mm. An earlier paper considered overall maser morphology and constraints on SiO maser excitation and pum** derived from these data. The goal of the current paper is to us… ▽ More

    Submitted 31 May, 2016; originally announced May 2016.

  44. An expanded evaluation of protein function prediction methods shows an improvement in accuracy

    Authors: Yuxiang Jiang, Tal Ronnen Oron, Wyatt T Clark, Asma R Bankapur, Daniel D'Andrea, Rosalba Lepore, Christopher S Funk, Indika Kahanda, Karin M Verspoor, Asa Ben-Hur, Emily Koo, Duncan Penfold-Brown, Dennis Shasha, Noah Youngs, Richard Bonneau, Alexandra Lin, Sayed ME Sahraeian, Pier Luigi Martelli, Giuseppe Profiti, Rita Casadio, Renzhi Cao, Zhaolong Zhong, Jianlin Cheng, Adrian Altenhoff, Nives Skunca , et al. (122 additional authors not shown)

    Abstract: Background: The increasing volume and variety of genotypic and phenotypic data is a major defining characteristic of modern biomedical sciences. At the same time, the limitations in technology for generating data and the inherently stochastic nature of biomolecular events have led to the discrepancy between the volume of data and the amount of knowledge gleaned from it. A major bottleneck in our a… ▽ More

    Submitted 2 January, 2016; originally announced January 2016.

    Comments: Submitted to Genome Biology

  45. arXiv:1407.5482  [pdf, other

    astro-ph.GA astro-ph.CO

    Local Group dSph radio survey with ATCA (II): Non-thermal diffuse emission

    Authors: M. Regis, L. Richter, S. Colafrancesco, S. Profumo, W. J. G. de Blok, M. Massardi

    Abstract: Our closest neighbours, the Local Group dwarf spheroidal (dSph) galaxies, are extremely quiescent and dim objects, where thermal and non-thermal diffuse emissions lack, so far, of detection. In order to possibly study the dSph interstellar medium, deep observations are required. They could reveal non-thermal emissions associated with the very-low level of star formation, or to particle dark matter… ▽ More

    Submitted 9 February, 2015; v1 submitted 18 July, 2014; originally announced July 2014.

    Comments: 21 pages, 11 figure panels. Companion papers: arXiv:1407.5479 and arXiv:1407.4948. v3: minor revision, matches version accepted in MNRAS

    Journal ref: MNRAS 448, 3747-3765 (2015)

  46. arXiv:1407.5479  [pdf, other

    astro-ph.GA astro-ph.CO

    Local Group dSph radio survey with ATCA (I): Observations and background sources

    Authors: M. Regis, L. Richter, S. Colafrancesco, M. Massardi, W. J. G. de Blok, S. Profumo, N. Orford

    Abstract: Dwarf spheroidal (dSph) galaxies are key objects in near-field cosmology, especially in connection to the study of galaxy formation and evolution at small scales. In addition, dSphs are optimal targets to investigate the nature of dark matter. However, while we begin to have deep optical photometric observations of the stellar population in these objects, little is known so far about their diffuse… ▽ More

    Submitted 9 February, 2015; v1 submitted 18 July, 2014; originally announced July 2014.

    Comments: 18 pages, 9 figure panels. Companion papers: arXiv:1407.5482 and arXiv:1407.4948. v3: minor revision, matches version accepted in MNRAS

    Journal ref: MNRAS 448, 3731-3746 (2015)

  47. arXiv:1407.4948  [pdf, ps, other

    astro-ph.CO astro-ph.GA hep-ph

    Local Group dSph radio survey with ATCA (III): Constraints on Particle Dark Matter

    Authors: M. Regis, S. Colafrancesco, S. Profumo, W. J. G. de Blok, M. Massardi, L. Richter

    Abstract: We performed a deep search for radio synchrotron emissions induced by weakly interacting massive particles (WIMPs) annihilation or decay in six dwarf spheroidal (dSph) galaxies of the Local Group. Observations were conducted with the Australia Telescope Compact Array (ATCA) at 16 cm wavelength, with an rms sensitivity better than 0.05 mJy/beam in each field. In this work, we first discuss the unce… ▽ More

    Submitted 10 October, 2014; v1 submitted 18 July, 2014; originally announced July 2014.

    Comments: 17 pages, 6 figure panels. Companion papers: arXiv:1407.5479 and arXiv:1407.5482. v3: minor revision, matches published version

    Journal ref: JCAP 10 (2014) 016

  48. Simultaneous VLBA polarimetric observations of the v={1,2} J=1-0 and v=1, J=2-1 SiO maser emission toward VY CMa: maser morphology and pum**

    Authors: Laura Richter, Athol Kemball, Justin Jonas

    Abstract: This paper presents a milliarcsecond-scale comparison of the polarised component-level v=1 J=1-0, v=2 J=1-0 and v=1 J=2-1 SiO maser emission toward the supergiant star VY CMa. These observations used the VLBA at λ=7mm and λ=3mm over two epochs. The goal is to use the relative characteristics and spatial distribution of the transitions in individual resolved maser components to provide observationa… ▽ More

    Submitted 9 September, 2013; originally announced September 2013.

    Comments: 14 pages, 14 figures, 1 table

  49. A return to strong radio flaring by Circinus X-1 observed with the Karoo Array Telescope test array KAT-7

    Authors: R. P. Armstrong, R. P. Fender, G. D. Nicolson, S. Ratcliffe, M. Linares, J. Horrell, L. Richter, M. P. E. Schurch, M. Coriat, P. Woudt, J. Jonas, R. Booth, B. Fanaroff

    Abstract: Circinus X-1 is a bright and highly variable X-ray binary which displays strong and rapid evolution in all wavebands. Radio flaring, associated with the production of a relativistic jet, occurs periodically on a ~17-day timescale. A longer-term envelope modulates the peak radio fluxes in flares, ranging from peaks in excess of a Jansky in the 1970s to an historic low of milliJanskys during the yea… ▽ More

    Submitted 15 May, 2013; originally announced May 2013.

    Comments: 7 pages, 5 figures, accepted for publication in MNRAS 14 May 2013

  50. Electric vector rotations of π/2 in polarized circumstellar SiO maser emission

    Authors: A. J. Kemball, P. J. Diamond, L. Richter, I. Gonidakis, R. Xue

    Abstract: This paper examines the detailed sub-milliarcsecond polarization properties of an individual SiO maser feature displaying a rotation in polarization electric vector position angle of approximately π/2 across the feature. Such rotations are a characteristic observational signature of circumstellar SiO masers detected toward a number of late-type, evolved stars. We employ a new calibration method fo… ▽ More

    Submitted 23 October, 2011; originally announced October 2011.

    Comments: 14 pages, 11 figures; to appear in Ap. J