-
Multidimensional Contrast Limited Adaptive Histogram Equalization
Authors:
Vincent Stimper,
Stefan Bauer,
Ralph Ernstorfer,
Bernhard Schölkopf,
R. Patrick Xian
Abstract:
Contrast enhancement is an important preprocessing technique for improving the performance of downstream tasks in image processing and computer vision. Among the existing approaches based on nonlinear histogram transformations, contrast limited adaptive histogram equalization (CLAHE) is a popular choice for dealing with 2D images obtained in natural and scientific settings. The recent hardware upgrade in data acquisition systems results in significant increase in data complexity, including their sizes and dimensions. Measurements of densely sampled data higher than three dimensions, usually composed of 3D data as a function of external parameters, are becoming commonplace in various applications in the natural sciences and engineering. The initial understanding of these complex multidimensional datasets often requires human intervention through visual examination, which may be hampered by the varying levels of contrast permeating through the dimensions. We show both qualitatively and quantitatively that using our multidimensional extension of CLAHE (MCLAHE) simultaneously on all dimensions of the datasets allows better visualization and discernment of multidimensional image features, as demonstrated using cases from 4D photoemission spectroscopy and fluorescence microscopy. Our implementation of multidimensional CLAHE in Tensorflow is publicly accessible and supports parallelization with multiple CPUs and various other hardware accelerators, including GPUs.
Submitted 9 November, 2019; v1 submitted 26 June, 2019;
originally announced June 2019.
-
On the Transfer of Inductive Bias from Simulation to the Real World: a New Disentanglement Dataset
Authors:
Muhammad Waleed Gondal,
Manuel Wüthrich,
Đorđe Miladinović,
Francesco Locatello,
Martin Breidt,
Valentin Volchkov,
Joel Akpo,
Olivier Bachem,
Bernhard Schölkopf,
Stefan Bauer
Abstract:
Learning meaningful and compact representations with disentangled semantic aspects is considered to be of key importance in representation learning. Since real-world data is notoriously costly to collect, many recent state-of-the-art disentanglement models have heavily relied on synthetic toy data-sets. In this paper, we propose a novel data-set which consists of over one million images of physical 3D objects with seven factors of variation, such as object color, shape, size and position. In order to be able to control all the factors of variation precisely, we built an experimental platform where the objects are being moved by a robotic arm. In addition, we provide two more datasets which consist of simulations of the experimental setup. These datasets provide for the first time the possibility to systematically investigate how well different disentanglement methods perform on real data in comparison to simulation, and how simulated data can be leveraged to build better representations of the real world. We provide a first experimental study of these questions and our results indicate that learned models transfer poorly, but that model and hyperparameter selection is an effective means of transferring information to the real world.
Submitted 25 November, 2019; v1 submitted 7 June, 2019;
originally announced June 2019.
-
Disentangled State Space Representations
Authors:
Đorđe Miladinović,
Muhammad Waleed Gondal,
Bernhard Schölkopf,
Joachim M. Buhmann,
Stefan Bauer
Abstract:
Sequential data often originates from diverse domains across which statistical regularities and domain specifics exist. To specifically learn cross-domain sequence representations, we introduce disentangled state space models (DSSM), a class of SSM in which domain-invariant state dynamics is explicitly disentangled from domain-specific information governing that dynamics. We analyze how such separation can improve knowledge transfer to new domains, and enable robust prediction, sequence manipulation and domain characterization. We furthermore propose an unsupervised VAE-based training procedure to implement DSSM in the form of Bayesian filters. In our experiments, we applied the VAE-DSSM framework to achieve competitive performance in online ODE system identification and regression across experimental settings, and controlled generation and prediction of bouncing ball video sequences across varying gravitational influences.
Submitted 7 June, 2019;
originally announced June 2019.
-
On the Fairness of Disentangled Representations
Authors:
Francesco Locatello,
Gabriele Abbati,
Tom Rainforth,
Stefan Bauer,
Bernhard Schölkopf,
Olivier Bachem
Abstract:
Recently there has been a significant interest in learning disentangled representations, as they promise increased interpretability, generalization to unseen scenarios and faster learning on downstream tasks. In this paper, we investigate the usefulness of different notions of disentanglement for improving the fairness of downstream prediction tasks based on representations. We consider the setting where the goal is to predict a target variable based on the learned representation of high-dimensional observations (such as images) that depend on both the target variable and an unobserved sensitive variable. We show that in this setting both the optimal and empirical predictions can be unfair, even if the target variable and the sensitive variable are independent. Analyzing the representations of more than 12600 trained state-of-the-art disentangled models, we observe that several disentanglement scores are consistently correlated with increased fairness, suggesting that disentanglement may be a useful property to encourage fairness when sensitive variables are not observed.
Submitted 29 October, 2019; v1 submitted 31 May, 2019;
originally announced May 2019.
-
Disentangling Factors of Variation Using Few Labels
Authors:
Francesco Locatello,
Michael Tschannen,
Stefan Bauer,
Gunnar Rätsch,
Bernhard Schölkopf,
Olivier Bachem
Abstract:
Learning disentangled representations is considered a cornerstone problem in representation learning. Recently, Locatello et al. (2019) demonstrated that unsupervised disentanglement learning without inductive biases is theoretically impossible and that existing inductive biases and unsupervised methods do not allow one to consistently learn disentangled representations. However, in many practical settings, one might have access to a limited amount of supervision, for example through manual labeling of (some) factors of variation in a few training examples. In this paper, we investigate the impact of such supervision on state-of-the-art disentanglement methods and perform a large scale study, training over 52000 models under well-defined and reproducible experimental conditions. We observe that a small number of labeled examples (0.01-0.5% of the data set), with potentially imprecise and incomplete labels, is sufficient to perform model selection on state-of-the-art unsupervised models. Further, we investigate the benefit of incorporating supervision into the training process. Overall, we empirically validate that with little and imprecise supervision it is possible to reliably learn disentangled representations.
Submitted 14 February, 2020; v1 submitted 3 May, 2019;
originally announced May 2019.
-
Wave-like Properties of Phasor Fields: Experimental Demonstrations
Authors:
Syed Azer Reza,
Marco La Manna,
Sebastian Bauer,
Andreas Velten
Abstract:
Recently, an optical meta concept called the Phasor Field (P-Field) was proposed that yields great quality in the reconstruction of hidden objects imaged by non-line-of-sight (NLOS) imaging. It is based on virtual sinusoidal modulation of the light with frequencies in the MHz range. Phasor Field propagation was shown to be described by the Rayleigh-Sommerfeld diffraction integral. We extend this concept and stress the analogy between electric field and Phasor Field. We introduce Phasor Field optical elements and present experiments demonstrating the validity of the approach. Straightforward use of the Phasor Field concept in real-world applications is also discussed.
Submitted 2 April, 2019;
originally announced April 2019.
-
4MOST: Project overview and information for the First Call for Proposals
Authors:
R. S. de Jong,
O. Agertz,
A. Agudo Berbel,
J. Aird,
D. A. Alexander,
A. Amarsi,
F. Anders,
R. Andrae,
B. Ansarinejad,
W. Ansorge,
P. Antilogus,
H. Anwand-Heerwart,
A. Arentsen,
A. Arnadottir,
M. Asplund,
M. Auger,
N. Azais,
D. Baade,
G. Baker,
S. Baker,
E. Balbinot,
I. K. Baldry,
M. Banerji,
S. Barden,
P. Barklem,
et al. (313 additional authors not shown)
Abstract:
We introduce the 4-metre Multi-Object Spectroscopic Telescope (4MOST), a new high-multiplex, wide-field spectroscopic survey facility under development for the four-metre-class Visible and Infrared Survey Telescope for Astronomy (VISTA) at Paranal. Its key specifications are: a large field of view (FoV) of 4.2 square degrees and a high multiplex capability, with 1624 fibres feeding two low-resolution spectrographs ($R = λ/Δλ\sim 6500$), and 812 fibres transferring light to the high-resolution spectrograph ($R \sim 20\,000$). After a description of the instrument and its expected performance, a short overview is given of its operational scheme and planned 4MOST Consortium science; these aspects are covered in more detail in other articles in this edition of The Messenger. Finally, the processes, schedules, and policies concerning the selection of ESO Community Surveys are presented, commencing with a singular opportunity to submit Letters of Intent for Public Surveys during the first five years of 4MOST operations.
Submitted 1 April, 2019; v1 submitted 6 March, 2019;
originally announced March 2019.
-
Orthogonal Structure Search for Efficient Causal Discovery from Observational Data
Authors:
Anant Raj,
Luigi Gresele,
Michel Besserve,
Bernhard Schölkopf,
Stefan Bauer
Abstract:
The problem of inferring the direct causal parents of a response variable among a large set of explanatory variables is of high practical importance in many disciplines. Recent work exploits stability of regression coefficients or invariance properties of models across different experimental conditions for reconstructing the full causal graph. These approaches generally do not scale well with the number of the explanatory variables and are difficult to extend to nonlinear relationships. Contrary to existing work, we propose an approach which even works for observational data alone, while still offering theoretical guarantees including the case of partially nonlinear relationships. Our algorithm requires only one estimation for each variable and in our experiments we apply our causal discovery algorithm even to large graphs, demonstrating significant improvements compared to well established approaches.
Submitted 6 July, 2020; v1 submitted 6 March, 2019;
originally announced March 2019.
-
A novel ppm-precise absolute calibration method for precision high-voltage dividers
Authors:
O. Rest,
D. Winzen,
S. Bauer,
R. Berendes,
J. Meisner,
T. Thümmler,
S. Wüstling,
C. Weinheimer
Abstract:
The most common method to measure direct current high voltage (HV) down to the ppm-level is to use resistive high-voltage dividers. Such devices scale the HV into a range where it can be compared with precision digital voltmeters to reference voltage sources, which can be traced back to Josephson voltage standards. So far the calibration of the scale factors of HV dividers for voltages above 1 kV could only be done at metrology institutes and sometimes involves round-robin tests among several institutions to get reliable results. Here we present a novel absolute calibration method based on the measurement of a differential scale factor, which can be performed with commercial equipment and outside metrology institutes. We demonstrate that reproducible measurements up to 35 kV can be performed with relative uncertainties below $1\cdot10^{-6}$. This method is not restricted to metrology institutes and offers the possibility to determine the linearity of high-voltage dividers for a wide range of applications.
Submitted 28 February, 2019;
originally announced March 2019.
-
AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs
Authors:
Gabriele Abbati,
Philippe Wenk,
Michael A Osborne,
Andreas Krause,
Bernhard Schölkopf,
Stefan Bauer
Abstract:
Stochastic differential equations are an important modeling class in many disciplines. Consequently, there exist many methods relying on various discretization and numerical integration schemes. In this paper, we propose a novel, probabilistic model for estimating the drift and diffusion given noisy observations of the underlying stochastic system. Using state-of-the-art adversarial and moment matching inference techniques, we avoid the discretization schemes of classical approaches. This leads to significant improvements in parameter accuracy and robustness given random initial guesses. On four established benchmark systems, we compare the performance of our algorithms to state-of-the-art solutions based on extended Kalman filtering and Gaussian processes.
Submitted 28 May, 2019; v1 submitted 22 February, 2019;
originally announced February 2019.
-
ODIN: ODE-Informed Regression for Parameter and State Inference in Time-Continuous Dynamical Systems
Authors:
Philippe Wenk,
Gabriele Abbati,
Michael A Osborne,
Bernhard Schölkopf,
Andreas Krause,
Stefan Bauer
Abstract:
Parameter inference in ordinary differential equations is an important problem in many applied sciences and in engineering, especially in a data-scarce setting. In this work, we introduce a novel generative modeling approach based on constrained Gaussian processes and leverage it to build a computationally and data efficient algorithm for state and parameter inference. In an extensive set of experiments, our approach outperforms the current state of the art for parameter inference both in terms of accuracy and computational cost. It also shows promising results for the much more challenging problem of model selection.
Submitted 5 December, 2019; v1 submitted 17 February, 2019;
originally announced February 2019.
-
Bayesian Online Prediction of Change Points
Authors:
Diego Agudelo-España,
Sebastian Gomez-Gonzalez,
Stefan Bauer,
Bernhard Schölkopf,
Jan Peters
Abstract:
Online detection of instantaneous changes in the generative process of a data sequence generally focuses on retrospective inference of such change points without considering their future occurrences. We extend the Bayesian Online Change Point Detection algorithm to also infer the number of time steps until the next change point (i.e., the residual time). This makes it possible to handle observation models which depend on the total segment duration, which is useful to model data sequences with temporal scaling. The resulting inference algorithm for segment detection can be deployed in an online fashion, and we illustrate applications to synthetic and to two medical real-world data sets.
Submitted 24 June, 2020; v1 submitted 12 February, 2019;
originally announced February 2019.
-
Learning Counterfactual Representations for Estimating Individual Dose-Response Curves
Authors:
Patrick Schwab,
Lorenz Linhardt,
Stefan Bauer,
Joachim M. Buhmann,
Walter Karlen
Abstract:
Estimating what would be an individual's potential response to varying levels of exposure to a treatment is of high practical relevance for several important fields, such as healthcare, economics and public policy. However, existing methods for learning to estimate counterfactual outcomes from observational data are either focused on estimating average dose-response curves, or limited to settings with only two treatments that do not have an associated dosage parameter. Here, we present a novel machine-learning approach towards learning counterfactual representations for estimating individual dose-response curves for any number of treatments with continuous dosage parameters with neural networks. Building on the established potential outcomes framework, we introduce performance metrics, model selection criteria, model architectures, and open benchmarks for estimating individual dose-response curves. Our experiments show that the methods developed in this work set a new state-of-the-art in estimating individual dose-response.
Submitted 10 December, 2020; v1 submitted 3 February, 2019;
originally announced February 2019.
-
User Space Network Drivers
Authors:
Paul Emmerich,
Maximilian Pudelko,
Simon Bauer,
Stefan Huber,
Thomas Zwickl,
Georg Carle
Abstract:
The rise of user space packet processing frameworks like DPDK and netmap makes low-level code more accessible to developers and researchers. Previously, driver code was hidden in the kernel and rarely modified, or even looked at, by developers working at higher layers. These barriers are gone nowadays, yet developers still treat user space drivers as black-boxes magically accelerating applications. We want to change this: every researcher building high-speed network applications should understand the intricacies of the underlying drivers, especially if they impact performance. We present ixy, a user space network driver designed for simplicity and educational purposes to show that fast packet IO is not black magic but careful engineering. ixy focuses on the bare essentials of user space packet processing: a packet forwarder including the whole NIC driver uses less than 1,000 lines of C code.
This paper is partially written in tutorial style on the case study of our implementations of drivers for both the Intel 82599 family and for virtual VirtIO NICs. The former allows us to reason about driver and framework performance on a stripped-down implementation to assess individual optimizations in isolation. VirtIO support ensures that everyone can run it in a virtual machine.
Our code is available as free and open source under the BSD license at https://github.com/emmericp/ixy
Submitted 8 September, 2019; v1 submitted 29 January, 2019;
originally announced January 2019.
-
Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations
Authors:
Francesco Locatello,
Stefan Bauer,
Mario Lucic,
Gunnar Rätsch,
Sylvain Gelly,
Bernhard Schölkopf,
Olivier Bachem
Abstract:
The key idea behind the unsupervised learning of disentangled representations is that real-world data is generated by a few explanatory factors of variation which can be recovered by unsupervised learning algorithms. In this paper, we provide a sober look at recent progress in the field and challenge some common assumptions. We first theoretically show that the unsupervised learning of disentangled representations is fundamentally impossible without inductive biases on both the models and the data. Then, we train more than 12000 models covering most prominent methods and evaluation metrics in a reproducible large-scale experimental study on seven different data sets. We observe that while the different methods successfully enforce properties "encouraged" by the corresponding losses, well-disentangled models seemingly cannot be identified without supervision. Furthermore, increased disentanglement does not seem to lead to a decreased sample complexity of learning for downstream tasks. Our results suggest that future work on disentanglement learning should be explicit about the role of inductive biases and (implicit) supervision, investigate concrete benefits of enforcing disentanglement of the learned representations, and consider a reproducible experimental setup covering several data sets.
Submitted 18 June, 2019; v1 submitted 29 November, 2018;
originally announced November 2018.
-
Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge
Authors:
Spyridon Bakas,
Mauricio Reyes,
Andras Jakab,
Stefan Bauer,
Markus Rempfler,
Alessandro Crimi,
Russell Takeshi Shinohara,
Christoph Berger,
Sung Min Ha,
Martin Rozycki,
Marcel Prastawa,
Esther Alberts,
Jana Lipkova,
John Freymann,
Justin Kirby,
Michel Bilello,
Hassan Fathallah-Shaykh,
Roland Wiest,
Jan Kirschke,
Benedikt Wiestler,
Rivka Colen,
Aikaterini Kotrotsou,
Pamela Lamontagne,
Daniel Marcus,
Mikhail Milchenko,
et al. (402 additional authors not shown)
Abstract:
Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles disseminated across multi-parametric magnetic resonance imaging (mpMRI) scans, reflecting varying biological properties. Their heterogeneous shape, extent, and location are some of the factors that make these tumors difficult to resect, and in some cases inoperable. The amount of resected tumor is a factor also considered in longitudinal scans, when evaluating the apparent tumor for potential diagnosis of progression. Furthermore, there is mounting evidence that accurate segmentation of the various tumor sub-regions can offer the basis for quantitative image analysis towards prediction of patient overall survival. This study assesses the state-of-the-art machine learning (ML) methods used for brain tumor image analysis in mpMRI scans, during the last seven instances of the International Brain Tumor Segmentation (BraTS) challenge, i.e., 2012-2018. Specifically, we focus on i) evaluating segmentations of the various glioma sub-regions in pre-operative mpMRI scans, ii) assessing potential tumor progression by virtue of longitudinal growth of tumor sub-regions, beyond use of the RECIST/RANO criteria, and iii) predicting the overall survival from pre-operative mpMRI scans of patients that underwent gross total resection. Finally, we investigate the challenge of identifying the best ML algorithms for each of these tasks, considering that apart from being diverse on each instance of the challenge, the multi-institutional mpMRI BraTS dataset has also been a continuously evolving/growing dataset.
Submitted 23 April, 2019; v1 submitted 5 November, 2018;
originally announced November 2018.
-
Robustly Disentangled Causal Mechanisms: Validating Deep Representations for Interventional Robustness
Authors:
Raphael Suter,
Đorđe Miladinović,
Bernhard Schölkopf,
Stefan Bauer
Abstract:
The ability to learn disentangled representations that split underlying sources of variation in high dimensional, unstructured data is important for data efficient and robust use of neural networks. While various approaches aiming towards this goal have been proposed in recent times, a commonly accepted definition and validation procedure is missing. We provide a causal perspective on representation learning which covers disentanglement and domain shift robustness as special cases. Our causal framework allows us to introduce a new metric for the quantitative evaluation of deep latent variable models. We show how this metric can be estimated from labeled observational data and further provide an efficient estimation algorithm that scales linearly in the dataset size.
Submitted 13 May, 2019; v1 submitted 31 October, 2018;
originally announced November 2018.
-
Learning stable and predictive structures in kinetic systems: Benefits of a causal approach
Authors:
Niklas Pfister,
Stefan Bauer,
Jonas Peters
Abstract:
Learning kinetic systems from data is one of the core challenges in many fields. Identifying stable models is essential for the generalization capabilities of data-driven inference. We introduce a computationally efficient framework, called CausalKinetiX, that identifies structure from discrete time, noisy observations, generated from heterogeneous experiments. The algorithm assumes the existence of an underlying, invariant kinetic model, a key criterion for reproducible research. Results on both simulated and real-world examples suggest that learning the structure of kinetic systems benefits from a causal perspective. The identified variables and models allow for a concise description of the dynamics across multiple experimental settings and can be used for prediction in unseen experiments. We observe significant improvements compared to well established approaches focusing solely on predictive performance, especially for out-of-sample generalization.
Submitted 28 November, 2019; v1 submitted 28 October, 2018;
originally announced October 2018.
-
Weck's Selection Theorem: The Maxwell Compactness Property for Bounded Weak Lipschitz Domains with Mixed Boundary Conditions in Arbitrary Dimensions
Authors:
Sebastian Bauer,
Dirk Pauly,
Michael Schomburg
Abstract:
It is proved that the space of differential forms with weak exterior and co-derivative is compactly embedded into the space of square integrable differential forms. Mixed boundary conditions on weak Lipschitz domains are considered. Furthermore, canonical applications such as Maxwell estimates, Helmholtz decompositions and a static solution theory are proved. As a side product and crucial tool for our proofs we show the existence of regular potentials and regular decompositions as well.
Submitted 30 April, 2019; v1 submitted 4 September, 2018;
originally announced September 2018.
-
Can Stellar Discs in a Cosmological Setting Avoid Forming Strong Bars?
Authors:
Jacob S. Bauer,
Lawrence M. Widrow
Abstract:
We investigate the connection between the vertical structure of stellar discs and the formation of bars using high-resolution simulations of galaxies in isolation and in the cosmological context. In particular, we simulate a suite of isolated galaxy models that have the same Toomre Q parameter and swing amplification parameter but that differ in the vertical scale height and velocity dispersion. We find that the onset of bar formation occurs more slowly in models with thicker discs. Moreover, thicker discs, as well as discs evolved in simulations with larger force softening, appear to be more resilient to buckling, which acts to regulate the length and strength of bars. We also simulate disc-halo systems in the cosmological environment using a disc-insertion technique developed in a previous paper. In this case, bar formation is driven by the stochastic effects of a triaxial halo and subhalo-disc interactions and the initial growth of bars appears to be relatively insensitive to the thickness of the disc. On the other hand, thin discs in cosmological halos do appear to be more susceptible to buckling than thick ones and therefore bar strength correlates with disc thickness as in the isolated case. More to the point, one can form discs in cosmological simulations with relatively weak bars or no bars at all provided the discs are as thin as the discs we observe and the softening length is smaller than the disc scale height.
Submitted 31 August, 2018;
originally announced September 2018.
-
Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models
Authors:
Alexander Neitz,
Giambattista Parascandolo,
Stefan Bauer,
Bernhard Schölkopf
Abstract:
We introduce a method which enables a recurrent dynamics model to be temporally abstract. Our approach, which we call Adaptive Skip Intervals (ASI), is based on the observation that in many sequential prediction tasks, the exact time at which events occur is irrelevant to the underlying objective. Moreover, in many situations, there exist prediction intervals which result in particularly easy-to-predict transitions. We show that there are prediction tasks for which we gain both computational efficiency and prediction accuracy by allowing the model to make predictions at a sampling rate which it can choose itself.
Submitted 12 December, 2018; v1 submitted 14 August, 2018;
originally announced August 2018.
-
Viewpoint Planning for Quantitative Coronary Angiography
Authors:
Alexander Preuhs,
Martin Berger,
Sebastian Bauer,
Thomas Redel,
Mathias Unberath,
Stephan Achenbach,
Andreas Maier
Abstract:
In coronary angiography the condition of myocardial blood supply is assessed by analyzing 2-D X-ray projections of contrasted coronary arteries. This is done using a flexible C-arm system. Due to the X-ray immanent dimensionality reduction projecting the 3-D scene onto a 2-D image, the viewpoint is critical to guarantee an appropriate view onto the affected artery and, thus, enable reliable diagnosis. In this work we introduce an algorithm computing optimal viewpoints for the assessment of coronary arteries without the need for 3-D models. We introduce the concept of optimal viewpoint planning solely based on a single angiographic X-ray image. The subsequent viewpoint is computed such that it is rotated precisely around a vessel, while minimizing foreshortening. Our algorithm reduces foreshortening substantially compared to the input view and completely eliminates it for 90 degree rotations. Rotations around iso-centered foreshortening-free vessels passing the isocenter are exact. The precision, however, decreases when the vessel is off-centered or foreshortened. We evaluate worst case boundaries, providing insight into the maximal inaccuracies to be expected. This can be utilized to design viewpoints guaranteeing desired requirements, e.g. a true rotation around the vessel of at least 30 degrees.
Submitted 2 August, 2018;
originally announced August 2018.
-
Laser and Radio Tracking for Planetary Science Missions - A Comparison
Authors:
Dominic Dirkx,
Ivan Prochazka,
Sven Bauer,
Pieter Visser,
Ron Noomen,
Leonid I. Gurvits,
Bert Vermeersen
Abstract:
At present, tracking data for planetary missions largely consists of radio observables: range-rate, range and angular position. Future planetary missions may use Interplanetary Laser Ranging (ILR) as a tracking observable. Two-way ILR will provide range data that are about 2 orders of magnitude more accurate than radio-based range data. ILR does not produce Doppler data, however. In this article, we compare the relative strength of radio Doppler and laser range data for the retrieval of parameters of interest in planetary missions, to clarify and quantify the science case of ILR, with a focus on geodetic observables.
We first provide an overview of the near-term attainable quality of ILR, in terms of both the realization of the observable and the models used to process the measurements. Subsequently, we analyze the sensitivity of radio-Doppler and laser-range measurements in representative mission scenarios. We use both an analytical approximation and numerical analyses of the relative sensitivity of ILR and radio Doppler observables for more general cases.
We show that mm-precise range normal points are feasible for ILR, but mm-level accuracy and stability are unlikely to be attained, due to a combination of instrumental and model errors. ILR has the potential for superior performance in observing signatures in the data with a characteristic period of greater than 0.33-1.65 hours. This indicates that Doppler tracking will typically remain the method of choice for gravity field determination and spacecraft orbit determination in planetary missions. Laser ranging data, however, are shown to have a significant advantage for the retrieval of rotational and tidal characteristics from landers. Similarly, laser ranging data will be superior for the construction of planetary ephemerides and the improvement of solar system tests of gravitation.
Submitted 24 July, 2018;
originally announced July 2018.
-
Linear Tree Constraints
Authors:
Sabine Bauer,
Martin Hofmann
Abstract:
Linear tree constraints were introduced by Hofmann and Rodriguez in the context of amortized resource analysis for object oriented programs. More precisely, they gave a reduction from inference of resource types to constraint solving. Thus, once we have found an algorithm to solve the constraints generated from a program, we can read off the resource consumption from their solutions.
These constraints have the form of pointwise linear inequalities between infinite trees labeled with nonnegative rational numbers. We are interested in the question if a system of such constraints is simultaneously satisfiable. Bauer and Hofmann have recently identified a fragment of the tree constraint problem (UTC) that is still sufficient for program analysis and they proved that the list case of UTC is decidable, whereas the case with trees of degree at least two remained open. In this paper, we solve this problem. We give a decision procedure that covers the entire range of constraints needed for resource analysis.
Submitted 26 June, 2018;
originally announced June 2018.
-
NLP-assisted software testing: A systematic mapping of the literature
Authors:
Vahid Garousi,
Sara Bauer,
Michael Felderer
Abstract:
Context: To reduce manual effort of extracting test cases from natural-language requirements, many approaches based on Natural Language Processing (NLP) have been proposed in the literature. Given the large number of approaches in this area, and since many practitioners are eager to utilize such techniques, it is important to synthesize and provide an overview of the state-of-the-art in this area. Objective: Our objective is to summarize the state-of-the-art in NLP-assisted software testing which could benefit practitioners to potentially utilize those NLP-based techniques. Moreover, this can benefit researchers in providing an overview of the research landscape. Method: To address the above need, we conducted a survey in the form of a systematic literature mapping (classification). After compiling an initial pool of 95 papers, we conducted a systematic voting, and our final pool included 67 technical papers. Results: This review paper provides an overview of the contribution types presented in the papers, types of NLP approaches used to assist software testing, types of required input requirements, and a review of tool support in this area. Some key results we have detected are: (1) only four of the 38 tools (11%) presented in the papers are available for download; (2) a larger ratio of the papers (30 of 67) provided a shallow exposure to the NLP aspects (almost no details). Conclusion: This paper would benefit both practitioners and researchers by serving as an "index" to the body of knowledge in this area. The results could help practitioners utilizing the existing NLP-based techniques; this in turn reduces the cost of test-case design and decreases the amount of human resources spent on test activities. After sharing this review with some of our industrial collaborators, initial insights show that this review can indeed be useful and beneficial to practitioners.
Submitted 21 March, 2020; v1 submitted 2 June, 2018;
originally announced June 2018.
-
Reduction of stored-particle background by a magnetic pulse method at the KATRIN experiment
Authors:
KATRIN Collaboration,
M. Arenz,
W. -J. Baek,
S. Bauer,
M. Beck,
A. Beglarian,
J. Behrens,
R. Berendes,
T. Bergmann,
A. Berlev,
U. Besserer,
K. Blaum,
T. Bode,
B. Bornschein,
L. Bornschein,
T. Brunst,
W. Buglak,
N. Buzinsky,
S. Chilingaryan,
W. Q. Choi,
M. Deffert,
P. J. Doe,
O. Dragoun,
G. Drexlin,
S. Dyba,
et al. (105 additional authors not shown)
Abstract:
The KATRIN experiment aims to determine the effective electron neutrino mass with a sensitivity of $0.2\,\text{eV}/c^2$ (90% C.L.) by precision measurement of the shape of the tritium $\beta$-spectrum in the endpoint region. The energy analysis of the decay electrons is achieved by a MAC-E filter spectrometer. A common background source in this setup is the decay of short-lived isotopes, such as $^{219}$Rn and $^{220}$Rn, in the spectrometer volume. Active and passive countermeasures have been implemented and tested at the KATRIN main spectrometer. One of these is the magnetic pulse method, which employs the existing air coil system to reduce the magnetic guiding field in the spectrometer on a short timescale in order to remove low- and high-energy stored electrons. Here we describe the working principle of this method and present results from commissioning measurements at the main spectrometer. Simulations with the particle-tracking software Kassiopeia were carried out to gain a detailed understanding of the electron storage conditions and removal processes.
Submitted 3 May, 2018;
originally announced May 2018.
-
Fast Gaussian Process Based Gradient Matching for Parameter Identification in Systems of Nonlinear ODEs
Authors:
Philippe Wenk,
Alkis Gotovos,
Stefan Bauer,
Nico Gorbach,
Andreas Krause,
Joachim M. Buhmann
Abstract:
Parameter identification and comparison of dynamical systems is a challenging task in many fields. Bayesian approaches based on Gaussian process regression over time-series data have been successfully applied to infer the parameters of a dynamical system without explicitly solving it. While the benefits in computational cost are well established, a rigorous mathematical framework has been missing. We offer a novel interpretation which leads to a better understanding and improvements in state-of-the-art performance in terms of accuracy for nonlinear dynamical systems.
Submitted 1 March, 2019; v1 submitted 12 April, 2018;
originally announced April 2018.
-
Disc-Halo Interactions in ΛCDM
Authors:
Jacob S. Bauer,
Lawrence M. Widrow,
Denis Erkal
Abstract:
We present a new method for embedding a stellar disc in a cosmological dark matter halo and provide a worked example from a ΛCDM zoom-in simulation. The disc is inserted into the halo at a redshift z = 3 as a zero-mass rigid body. Its mass and size are then increased adiabatically while its position, velocity, and orientation are determined from rigid-body dynamics. At z = 1, the rigid disc is replaced by an N-body disc whose particles sample a three-integral distribution function (DF). The simulation then proceeds to z = 0 with live disc and halo particles. By comparison, other methods assume one or more of the following: the centre of the rigid disc during the growth phase is pinned to the minimum of the halo potential, the orientation of the rigid disc is fixed, or the live N-body disc is constructed from a two rather than three-integral DF. In general, the presence of a disc makes the halo rounder, more centrally concentrated, and smoother, especially in the innermost regions. We find that methods in which the disc is pinned to the minimum of the halo potential tend to overestimate the amount of adiabatic contraction. Additionally, the effect of the disc on the subhalo distribution appears to be rather insensitive to the disc insertion method. The live disc in our simulation develops a bar that is consistent with the bars seen in late-type spiral galaxies. In addition, particles from the disc are launched or "kicked up" to high galactic latitudes.
Submitted 10 January, 2018;
originally announced January 2018.
-
Technical design and commissioning of the KATRIN large-volume air coil system
Authors:
M. Erhard,
J. Behrens,
S. Bauer,
A. Beglarian,
R. Berendes,
G. Drexlin,
F. Glück,
R. Gumbsheimer,
J. Hergenhan,
B. Leiber,
S. Mertens,
A. Osipowicz,
P. Plischke,
J. Reich,
T. Thümmler,
N. Wandkowsky,
C. Weinheimer,
S. Wüstling
Abstract:
The KATRIN experiment is a next-generation direct neutrino mass experiment with a sensitivity of 0.2 eV (90% C.L.) to the effective mass of the electron neutrino. It measures the tritium $β$-decay spectrum close to its endpoint with a spectrometer based on the MAC-E filter technique. The $β$-decay electrons are guided by a magnetic field that operates in the mT range in the central spectrometer volume; it is fine-tuned by a large-volume air coil system surrounding the spectrometer vessel. The purpose of the system is to provide optimal transmission properties for signal electrons and to achieve efficient magnetic shielding against background. In this paper we describe the technical design of the air coil system, including its mechanical and electrical properties. We outline the importance of its versatile operation modes in background investigation and suppression techniques. We compare magnetic field measurements in the inner spectrometer volume during system commissioning with corresponding simulations, which allows us to verify the system's functionality in fine-tuning the magnetic field configuration. This is of major importance for a successful neutrino mass measurement at KATRIN.
Submitted 4 December, 2017;
originally announced December 2017.
-
Modeling Dynamic Helium Release as a Tracer of Rock Deformation
Authors:
W. Payton Gardner,
Stephen J. Bauer,
Kristopher L. Kuhlman,
Jason E. Heath
Abstract:
We use helium released during mechanical deformation of shales as a signal to explore the effects of deformation and failure on material transport properties. A dynamic dual-permeability model with evolving pore and fracture networks is used to simulate gases released from shale during deformation and failure. Changes in material properties required to reproduce experimentally observed gas signals are explored. We model two different experiments of $^4$He flow rate measured from shale undergoing mechanical deformation, a core parallel to bedding and a core perpendicular to bedding. We find that the helium signal is sensitive to fracture development and evolution as well as changes in the matrix transport properties. We constrain the timing and effective fracture aperture, as well as the increase in matrix porosity and permeability. Increases in matrix permeability are required to explain gas flow prior to macroscopic failure, and the short-term gas flow post failure. Increased matrix porosity is required to match the long-term, post-failure gas flow. Our model provides the first quantitative interpretation of helium release as a result of mechanical deformation. The sensitivity of this model to changes in the fracture network, as well as to matrix properties during deformation, indicates that helium release can be used as a quantitative tool to evaluate the state of stress and strain in earth materials.
Submitted 11 October, 2017;
originally announced October 2017.
-
A stencil scaling approach for accelerating matrix-free finite element implementations
Authors:
Simon Bauer,
Daniel Drzisga,
Marcus Mohr,
Ulrich Ruede,
Christian Waluga,
Barbara Wohlmuth
Abstract:
We present a novel approach to fast on-the-fly low order finite element assembly for scalar elliptic partial differential equations of Darcy type with variable coefficients optimized for matrix-free implementations. Our approach introduces a new operator that is obtained by appropriately scaling the reference stiffness matrix from the constant coefficient case. Assuming sufficient regularity, an a priori analysis shows that solutions obtained by this approach are unique and have asymptotically optimal order convergence in the $H^1$- and the $L^2$-norm on hierarchical hybrid grids. For the pre-asymptotic regime, we present a local modification that guarantees uniform ellipticity of the operator. Cost considerations show that our novel approach requires roughly one third of the floating-point operations compared to a classical finite element assembly scheme employing nodal integration. Our theoretical considerations are illustrated by numerical tests that confirm the expectations with respect to accuracy and run-time. A large scale application with more than a hundred billion ($1.6\cdot10^{11}$) degrees of freedom executed on 14,310 compute cores demonstrates the efficiency of the new scaling approach.
Submitted 23 July, 2018; v1 submitted 20 September, 2017;
originally announced September 2017.
-
Scalable Variational Inference for Dynamical Systems
Authors:
Nico S. Gorbach,
Stefan Bauer,
Joachim M. Buhmann
Abstract:
Gradient matching is a promising tool for learning parameters and state dynamics of ordinary differential equations. It is a grid-free inference approach which, for fully observable systems, is at times competitive with numerical integration. However, for many real-world applications, only sparse observations are available or even unobserved variables are included in the model description. In these cases most gradient matching methods are difficult to apply or simply do not provide satisfactory results. That is why, despite the high computational cost, numerical integration is still the gold standard in many applications. Using an existing gradient matching approach, we propose a scalable variational inference framework which can infer states and parameters simultaneously, offers computational speedups, improved accuracy and works well even under model misspecifications in a partially observable system.
Submitted 10 April, 2018; v1 submitted 19 May, 2017;
originally announced May 2017.
-
MRI-based Surgical Planning for Lumbar Spinal Stenosis
Authors:
Gabriele Abbati,
Stefan Bauer,
Peter J. Schüffler,
Jakob Burgstaller,
Ulrike Held,
Sebastian Winklhofer,
Johann Steurer,
Joachim M. Buhmann
Abstract:
The most common reason for spinal surgery in elderly patients is lumbar spinal stenosis (LSS). For LSS, treatment decisions based on clinical and radiological information as well as personal experience of the surgeon show large variance. Thus a standardized support system is of high value for a more objective and reproducible decision. In this work, we develop an automated algorithm to localize the stenosis causing the symptoms of the patient in magnetic resonance imaging (MRI). With 22 MRI features of each of five spinal levels of 321 patients, we show it is possible to predict the location of the lesion triggering the symptoms. To support this hypothesis, we conduct an automated analysis of labeled and unlabeled MRI scans extracted from 788 patients. We confirm quantitatively the importance of radiological information and provide an algorithmic pipeline for working with raw MRI scans.
Submitted 21 March, 2017;
originally announced March 2017.
-
Aeronomical constraints to the minimum mass and maximum radius of hot low-mass planets
Authors:
L. Fossati,
N. V. Erkaev,
H. Lammer,
P. E. Cubillos,
P. Odert,
I. Juvan,
K. G. Kislyakova,
M. Lendl,
D. Kubyshkina,
S. J. Bauer
Abstract:
Stimulated by the discovery of a number of close-in low-density planets, we generalise the Jeans escape parameter taking hydrodynamic and Roche lobe effects into account. We furthermore define $Λ$ as the value of the Jeans escape parameter calculated at the observed planetary radius and mass for the planet's equilibrium temperature and considering atomic hydrogen, independently of the atmospheric temperature profile. We consider 5 and 10 $M_{\oplus}$ planets with an equilibrium temperature of 500 and 1000 K, orbiting early G-, K-, and M-type stars. Assuming a clear atmosphere and by comparing escape rates obtained from the energy-limited formula, which only accounts for the heating induced by the absorption of the high-energy stellar radiation, and from a hydrodynamic atmosphere code, which also accounts for the bolometric heating, we find that planets whose $Λ$ is smaller than 15-35 lie in the "boil-off" regime, where the escape is driven by the atmospheric thermal energy and low planetary gravity. We find that the atmosphere of hot (i.e. $T_{\rm eq}\gtrapprox$ 1000 K) low-mass ($M_{\rm pl}\lessapprox$ 5 $M_{\oplus}$) planets with $Λ$ < 15-35 shrinks to smaller radii so that their $Λ$ evolves to values higher than 15-35, hence out of the boil-off regime, in less than $\approx$500 Myr. Because of their small Roche lobe radius, we find the same result also for hot (i.e. $T_{\rm eq}\gtrapprox$ 1000 K) higher mass ($M_{\rm pl}\lessapprox$ 10 $M_{\oplus}$) planets with $Λ$ < 15-35, when they orbit M-dwarfs. For old, hydrogen-dominated planets in this range of parameters, $Λ$ should therefore be $\geq$15-35, which provides a strong constraint on the planetary minimum mass and maximum radius and can be used to predict the presence of aerosols and/or constrain planetary masses, for example.
Submitted 16 December, 2016;
originally announced December 2016.
-
Mean-Field Variational Inference for Gradient Matching with Gaussian Processes
Authors:
Nico S. Gorbach,
Stefan Bauer,
Joachim M. Buhmann
Abstract:
Gradient matching with Gaussian processes is a promising tool for learning parameters of ordinary differential equations (ODE's). The essence of gradient matching is to model the prior over state variables as a Gaussian process which implies that the joint distribution given the ODE's and GP kernels is also Gaussian distributed. The state-derivatives are integrated out analytically since they are modelled as latent variables. However, the state variables themselves are also latent variables because they are contaminated by noise. Previous work sampled the state variables since integrating them out is not analytically tractable. In this paper we use mean-field approximation to establish tight variational lower bounds that decouple state variables and are therefore, in contrast to the integral over state variables, analytically tractable and even concave for a restricted family of ODE's, including nonlinear and periodic ODE's. Such variational lower bounds facilitate "hill climbing" to determine the maximum a posteriori estimate of ODE parameters. An additional advantage of our approach over sampling methods is the determination of a proxy to the intractable posterior distribution over state variables given observations and the ODE's.
Submitted 21 October, 2016;
originally announced October 2016.
-
Model Selection for Gaussian Process Regression by Approximation Set Coding
Authors:
Benjamin Fischer,
Nico Gorbach,
Stefan Bauer,
Yatao Bian,
Joachim M. Buhmann
Abstract:
Gaussian processes are powerful, yet analytically tractable models for supervised learning. A Gaussian process is characterized by a mean function and a covariance function (kernel), which are determined by a model selection criterion. The functions to be compared do not just differ in their parametrization but in their fundamental structure. It is often not clear which function structure to choose, for instance to decide between a squared exponential and a rational quadratic kernel. Based on the principle of approximation set coding, we develop a framework for model selection to rank kernels for Gaussian process regression. In our experiments approximation set coding shows promise to become a model selection criterion competitive with maximum evidence (also called marginal likelihood) and leave-one-out cross-validation.
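As a rough illustration of the kernel choice mentioned in this abstract, the sketch below defines a squared exponential and a rational quadratic kernel and scores them with the log marginal likelihood (the maximum-evidence baseline named above). It does not implement approximation set coding. All function names, hyperparameter values, and the toy data are assumptions made for illustration.

```python
import numpy as np


def squared_exponential(x1, x2, length_scale=1.0, variance=1.0):
    """Squared exponential kernel on 1D inputs."""
    d2 = (x1[:, None] - x2[None, :]) ** 2
    return variance * np.exp(-0.5 * d2 / length_scale**2)


def rational_quadratic(x1, x2, length_scale=1.0, variance=1.0, alpha=1.0):
    """Rational quadratic kernel; tends to the squared exponential as alpha grows."""
    d2 = (x1[:, None] - x2[None, :]) ** 2
    return variance * (1.0 + d2 / (2.0 * alpha * length_scale**2)) ** (-alpha)


def log_marginal_likelihood(K, y, noise=1e-2):
    """Log evidence of a zero-mean GP with Gaussian observation noise."""
    n = len(y)
    L = np.linalg.cholesky(K + noise * np.eye(n))
    a = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return -0.5 * y @ a - np.log(np.diag(L)).sum() - 0.5 * n * np.log(2.0 * np.pi)


if __name__ == "__main__":
    # Toy data (hypothetical): noisy samples of a sine wave.
    rng = np.random.default_rng(0)
    x = np.linspace(0.0, 5.0, 30)
    y = np.sin(x) + 0.1 * rng.standard_normal(x.size)
    for name, kernel in [("SE", squared_exponential), ("RQ", rational_quadratic)]:
        print(name, log_marginal_likelihood(kernel(x, x), y))
```

Ranking the kernels by the printed evidence gives the baseline criterion; the paper's own criterion would replace `log_marginal_likelihood` in this comparison.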
Submitted 4 October, 2016;
originally announced October 2016.
-
A two-scale approach for efficient on-the-fly operator assembly in massively parallel high performance multigrid codes
Authors:
Simon Bauer,
Marcus Mohr,
Ulrich Rüde,
Jens Weismüller,
Markus Wittmann,
Barbara Wohlmuth
Abstract:
Matrix-free finite element implementations of massively parallel geometric multigrid save memory and are often significantly faster than implementations using classical sparse matrix techniques. They are especially well suited for hierarchical hybrid grids on polyhedral domains. In the case of constant coefficients all fine grid node stencils in the interior of a coarse macro element are equal. However, for non-polyhedral domains the situation changes. Then even for the Laplace operator, the non-linear element mapping leads to fine grid stencils that can vary from grid point to grid point. This observation motivates a new two-scale approach that exploits a piecewise polynomial approximation of the fine grid operator with respect to the coarse mesh size. The low-cost evaluation of these surrogate polynomials results in an efficient stencil assembly on-the-fly for non-polyhedral domains that can be significantly more efficient than matrix-free techniques that are based on an element-wise assembly. The performance analysis and additional hardware-aware code optimizations are based on the Execution-Cache-Memory model. Several aspects such as two-scale a priori error bounds and double discretization techniques are presented. Weak and strong scaling results illustrate the benefits of the new technique when used within large scale PDE solvers.
Submitted 23 August, 2016;
originally announced August 2016.
-
Confining Metal-Halide Perovskites in Nanoporous Thin Films
Authors:
Stepan Demchyshyn,
Janina Melanie Roemer,
Heiko Groiß,
Herwig Heilbrunner,
Christoph Ulbricht,
Dogukan Apaydin,
Uta Rütt,
Florian Bertram,
Günter Hesser,
Markus Scharber,
Bert Nickel,
Niyazi Serdar Sariciftci,
Siegfried Bauer,
Eric Daniel Głowacki,
Martin Kaltenbrunner
Abstract:
Controlling size and shape of semiconducting nanocrystals advances nanoelectronics and photonics. Quantum confined, inexpensive, solution derived metal halide perovskites offer narrow band, color-pure emitters as integral parts of next-generation displays and optoelectronic devices. We use nanoporous silicon and alumina thin films as templates for the growth of perovskite nanocrystallites directly within device-relevant architectures without the use of colloidal stabilization. We find significantly blue shifted photoluminescence emission by reducing the pore size; normally infrared-emitting materials become visibly red, green-emitting materials cyan and blue. Confining perovskite nanocrystals within porous oxide thin films drastically increases photoluminescence stability as the templates auspiciously serve as encapsulation. We quantify the template-induced size of the perovskite crystals in nanoporous silicon with microfocus high-energy X-ray depth profiling in transmission geometry, verifying the growth of perovskite nanocrystals throughout the entire thickness of the nanoporous films. Low-voltage electroluminescent diodes with narrow, blue-shifted emission fabricated from nanocrystalline perovskites grown in embedded nanoporous alumina thin films substantiate our general concept for next generation photonic devices.
Submitted 18 August, 2017; v1 submitted 13 May, 2016;
originally announced July 2016.
-
The Potsdam MRS Spectrograph - heritage of MUSE and the impact of cross-innovation in the process of technology transfer
Authors:
Benito Moralejo,
Martin M. Roth,
Philippe Godefroy,
Thomas Fechner,
Svend M. Bauer,
Elmar Schmälzlin,
Andreas Kelz,
Roger Haynes
Abstract:
After having demonstrated that an IFU, attached to a microscope rather than to a telescope, is capable of differentiating complex organic tissue with spatially resolved Raman spectroscopy, we have launched a clinical validation program that utilizes a novel optimized fiber-coupled multi-channel spectrograph whose layout is based on the modular MUSE spectrograph concept. The new design features a telecentric input and has an extended blue performance, but otherwise maintains the properties of high throughput and excellent image quality over an octave of wavelength coverage with modest spectral resolution. We present the opto-mechanical layout and details of its optical performance.
Submitted 6 July, 2016; v1 submitted 5 July, 2016;
originally announced July 2016.
-
Multi-Organ Cancer Classification and Survival Analysis
Authors:
Stefan Bauer,
Nicolas Carion,
Peter Schüffler,
Thomas Fuchs,
Peter Wild,
Joachim M. Buhmann
Abstract:
Accurate and robust cell nuclei classification is the cornerstone for a wider range of tasks in digital and Computational Pathology. However, most machine learning systems require extensive labeling from expert pathologists for each individual problem at hand, with no or limited abilities for knowledge transfer between datasets and organ sites. In this paper we implement and evaluate a variety of deep neural network models and model ensembles for nuclei classification in renal cell cancer (RCC) and prostate cancer (PCa). We propose a convolutional neural network system based on residual learning which significantly improves over the state-of-the-art in cell nuclei classification. Finally, we show that the combination of tissue types during training increases not only classification accuracy but also overall survival analysis.
Submitted 2 December, 2016; v1 submitted 2 June, 2016;
originally announced June 2016.
-
Road Detection through Supervised Classification
Authors:
Yasamin Alkhorshid,
Kamelia Aryafar,
Sven Bauer,
Gerd Wanielik
Abstract:
Autonomous driving is a rapidly evolving technology. Autonomous vehicles are capable of sensing their environment and navigating without human input through sensory information such as radar, lidar, GNSS, vehicle odometry, and computer vision. This sensory input provides a rich dataset that can be used in combination with machine learning models to tackle multiple problems in supervised settings. In this paper we focus on road detection through gray-scale images as the sole sensory input. Our contributions are twofold: first, we introduce an annotated dataset of urban roads for machine learning tasks; second, we introduce a road detection framework on this dataset through supervised classification and hand-crafted feature vectors.
Submitted 10 May, 2016;
originally announced May 2016.
-
A non-relativistic Model of Plasma Physics Containing a Radiation Reaction Term
Authors:
Sebastian Bauer
Abstract:
While a fully relativistic collisionless plasma is modeled by the Vlasov-Maxwell system, a good approximation in the non-relativistic limit is given by the Vlasov-Poisson system. We modify the Vlasov-Poisson system so that damping due to the relativistic effect of radiation reaction is included. We prove the existence and uniqueness as well as the higher regularity of local classical solutions. These theorems also include the higher regularity of classical solutions of the Vlasov-Poisson system depending on the regularity of the initial datum.
Submitted 20 April, 2016;
originally announced April 2016.
-
Strong spin-orbit fields and Dyakonov-Perel spin dephasing in supported metallic films
Authors:
Nguyen H. Long,
Phivos Mavropoulos,
David S. G. Bauer,
Bernd Zimmermann,
Yuriy Mokrousov,
Stefan Blügel
Abstract:
Spin dephasing by the Dyakonov-Perel mechanism in metallic films deposited on insulating substrates is revealed, and quantitatively examined by means of density functional calculations combined with a kinetic equation. The surface-to-substrate asymmetry, probed by the metal wave functions in thin films, is found to produce strong spin-orbit fields and a fast Larmor precession, giving a dominant contribution to spin decay over the Elliott-Yafet spin relaxation up to a thickness of 70 nm. The spin dephasing is oscillatory in time with a rapid (sub-picosecond) initial decay. However, parts of the Fermi surface act as spin traps, causing a persistent tail signal lasting 1000 times longer than the initial decay time. It is also found that the decay depends on the direction of the initial spin polarization, resulting in a spin-dephasing anisotropy of 200% in the examined cases.
Submitted 6 April, 2016;
originally announced April 2016.
-
Commissioning of the vacuum system of the KATRIN Main Spectrometer
Authors:
M. Arenz,
M. Babutzka,
M. Bahr,
J. P. Barrett,
S. Bauer,
M. Beck,
A. Beglarian,
J. Behrens,
T. Bergmann,
U. Besserer,
J. Blümer,
L. I. Bodine,
K. Bokeloh,
J. Bonn,
B. Bornschein,
L. Bornschein,
S. Büsch,
T. H. Burritt,
S. Chilingaryan,
T. J. Corona,
L. De Viveiros,
P. J. Doe,
O. Dragoun,
G. Drexlin,
S. Dyba
, et al. (125 additional authors not shown)
Abstract:
The KATRIN experiment will probe the neutrino mass by measuring the beta-electron energy spectrum near the endpoint of tritium beta-decay. An integral energy analysis will be performed by an electro-static spectrometer (Main Spectrometer), an ultra-high vacuum vessel with a length of 23.2 m, a volume of 1240 m^3, and a complex inner electrode system with about 120000 individual parts. The strong magnetic field that guides the beta-electrons is provided by super-conducting solenoids at both ends of the spectrometer. Its influence on turbo-molecular pumps and vacuum gauges had to be considered. A system consisting of 6 turbo-molecular pumps and 3 km of non-evaporable getter strips has been deployed and was tested during the commissioning of the spectrometer. In this paper the configuration, the commissioning with bake-out at 300°C, and the performance of this system are presented in detail. The vacuum system has to maintain a pressure in the 10^{-11} mbar range. It is demonstrated that the performance of the system is already close to these stringent functional requirements for the KATRIN experiment, which will start at the end of 2016.
Submitted 3 March, 2016;
originally announced March 2016.
-
The Arrow of Time in Multivariate Time Series
Authors:
Stefan Bauer,
Bernhard Schölkopf,
Jonas Peters
Abstract:
We prove that a time series satisfying a (linear) multivariate autoregressive moving average (VARMA) model satisfies the same model assumption in the reversed time direction, too, if all innovations are normally distributed. This reversibility breaks down if the innovations are non-Gaussian. This means that under the assumption of a VARMA process with non-Gaussian noise, the arrow of time becomes detectable. Our work thereby provides a theoretic justification of an algorithm that has been used for inferring the direction of video snippets. We present a slightly modified practical algorithm that estimates the time direction for a given sample and prove its consistency. We further investigate how the performance of the algorithm depends on sample size, number of dimensions of the time series and the order of the process. An application to real world data from economics shows that considering multivariate processes instead of univariate processes can be beneficial for estimating the time direction. Our result extends earlier work on univariate time series. It relates to the concept of causal inference, where recent methods exploit non-Gaussianity of the error terms for causal structure learning.
Submitted 2 March, 2016;
originally announced March 2016.
-
On Korn's First Inequality for Mixed Tangential and Normal Boundary Conditions on Bounded Lipschitz-Domains in $\mathbb{R}^N$
Authors:
Sebastian Bauer,
Dirk Pauly
Abstract:
We prove that for bounded Lipschitz domains in $\mathbb{R}^N$ Korn's first inequality holds for vector fields satisfying homogeneous mixed normal and tangential boundary conditions.
Submitted 19 August, 2016; v1 submitted 28 December, 2015;
originally announced December 2015.
-
Photon Drag Effect in (Bi$_{1-x}$Sb$_{x}$)$_{2}$Te$_{3}$ Three Dimensional Topological Insulators
Authors:
H. Plank,
L. E. Golub,
S. Bauer,
V. V. Bel'kov,
T. Herrmann,
P. Olbrich,
M. Eschbach,
L. Plucinski,
J. Kampmeier,
M. Lanius,
G. Mussler,
D. Grützmacher,
S. D. Ganichev
Abstract:
We report on the observation of a terahertz radiation induced photon drag effect in epitaxially grown $n$- and $p$-type (Bi$_{1-x}$Sb$_{x}$)$_{2}$Te$_{3}$ three dimensional topological insulators with different antimony concentrations $x$ varying from 0 to 1. We demonstrate that the excitation with polarized terahertz radiation results in a $dc$ electric photocurrent. While at normal incidence a current arises due to the photogalvanic effect in the surface states, at oblique incidence it is outweighed by the trigonal photon drag effect. The developed microscopic model and theory show that the photon drag photocurrent is due to the dynamical momentum alignment by time and space dependent radiation electric field and implies the radiation induced asymmetric scattering in the electron momentum space.
Submitted 22 December, 2015;
originally announced December 2015.
-
The Maxwell Compactness Property in Bounded Weak Lipschitz Domains with Mixed Boundary Conditions
Authors:
Sebastian Bauer,
Dirk Pauly,
Michael Schomburg
Abstract:
For a bounded weak Lipschitz domain we show the so-called `Maxwell compactness property', that is, the space of square integrable vector fields having square integrable weak rotation and divergence and satisfying mixed tangential and normal boundary conditions is compactly embedded into the space of square integrable vector fields. We will also prove some canonical applications, such as Maxwell estimates, Helmholtz decompositions and a static solution theory. Furthermore, a Fredholm alternative for the underlying time-harmonic Maxwell problem and all corresponding and related results for exterior domains formulated in weighted Sobolev spaces are straightforward.
Submitted 23 January, 2019; v1 submitted 20 November, 2015;
originally announced November 2015.
-
PEPSI: The high-resolution echelle spectrograph and polarimeter for the Large Binocular Telescope
Authors:
K. G. Strassmeier,
I. Ilyin,
A. Järvinen,
M. Weber,
M. Woche,
S. I. Barnes,
S. -M. Bauer,
E. Beckert,
W. Bittner,
R. Bredthauer,
T. A. Carroll,
C. Denker,
F. Dionies,
I. DiVarano,
D. Döscher,
T. Fechner,
D. Feuerstein,
T. Granzer,
T. Hahn,
G. Harnisch,
A. Hofmann,
M. Lesser,
J. Paschke,
S. Pankratow,
V. Plank
, et al. (4 additional authors not shown)
Abstract:
PEPSI is the bench-mounted, two-arm, fibre-fed and stabilized Potsdam Echelle Polarimetric and Spectroscopic Instrument for the 2x8.4 m Large Binocular Telescope (LBT). Three spectral resolutions of either 43 000, 120 000 or 270 000 can cover the entire optical/red wavelength range from 383 to 907 nm in three exposures. Two 10.3kx10.3k CCDs with 9-μm pixels and peak quantum efficiencies of 96 % record a total of 92 echelle orders. We introduce a new variant of a wave-guide image slicer with 3, 5, and 7 slices and peak efficiencies between 96 %. A total of six cross dispersers cover the six wavelength settings of the spectrograph, two of them always simultaneously. These are made of a VPH-grating sandwiched by two prisms. The peak efficiency of the system, including the telescope, is 15% at 650 nm, and still 11% and 10% at 390 nm and 900 nm, respectively. In combination with the 110 m$^2$ light-collecting capability of the LBT, we expect a limiting magnitude of 20th mag in V in the low-resolution mode. The R=120 000 mode can also be used with two, dual-beam Stokes IQUV polarimeters. The 270 000-mode is made possible with the 7-slice image slicer and a 100-μm fibre through a projected sky aperture of 0.74", comparable to the median seeing of the LBT site. The 43 000-mode with 12-pixel sampling per resolution element is our bad-seeing or faint-object mode. Any of the three resolution modes can either be used with sky fibers for simultaneous sky exposures or with light from a stabilized Fabry-Perot etalon for ultra-precise radial velocities. CCD-image processing is performed with the dedicated data-reduction and analysis package PEPSI-S4S. A solar feed makes use of PEPSI during day time and a 500-m feed from the 1.8 m VATT can be used when the LBT is busy otherwise. In this paper, we present the basic instrument design, its realization, and its characteristics.
Submitted 24 May, 2015;
originally announced May 2015.
-
On Korn's First Inequality for Tangential or Normal Boundary Conditions with Explicit Constants
Authors:
Sebastian Bauer,
Dirk Pauly
Abstract:
We will prove that for piecewise smooth and concave domains Korn's first inequality holds for vector fields satisfying homogeneous normal or tangential boundary conditions with explicit Korn constant square root of 2.
Submitted 21 February, 2016; v1 submitted 25 March, 2015;
originally announced March 2015.