-
A convergent stochastic scalar auxiliary variable method
Authors:
Stefan Metzger
Abstract:
We discuss an extension of the scalar auxiliary variable approach, which was originally introduced by Shen et al. ([Shen, Xu, Yang, J. Comput. Phys., 2018]) for the discretization of deterministic gradient flows. By introducing an additional scalar auxiliary variable, this approach allows one to derive a linear scheme, while still maintaining unconditional stability. Our extension augments the approximation of the evolution of this scalar auxiliary variable with higher order terms, which enables its application to stochastic partial differential equations. Using the stochastic Allen--Cahn equation as a prototype for nonlinear stochastic partial differential equations with multiplicative noise, we propose an unconditionally energy stable, linear, fully discrete finite element scheme based on our augmented scalar auxiliary variable method. Recovering a discrete version of the energy estimate and establishing Nikolskii estimates with respect to time, we are able to prove convergence of discrete solutions towards pathwise unique martingale solutions by applying Jakubowski's generalization of Skorokhod's theorem. A generalization of the Gyöngy--Krylov characterization of convergence in probability to quasi-Polish spaces finally provides convergence of fully discrete solutions towards strong solutions of the stochastic Allen--Cahn equation. Finally, we present numerical simulations underlining the practicality of the scheme and the importance of the introduced augmentation terms.
Submitted 15 June, 2024; v1 submitted 14 August, 2023;
originally announced August 2023.
-
A convergent SAV scheme for Cahn--Hilliard equations with dynamic boundary conditions
Authors:
Stefan Metzger
Abstract:
The Cahn-Hilliard equation is one of the most common models to describe phase separation processes in mixtures of two materials. For a better description of short-range interactions between the material and the boundary, various dynamic boundary conditions for this equation have been proposed. Recently, a family of models using Cahn-Hilliard-type equations on the boundary of the domain to describe adsorption processes was analysed (cf. Knopf, Lam, Liu, Metzger, ESAIM: Math. Model. Numer. Anal., 2021). This family of models includes the case of instantaneous adsorption processes studied by Goldstein, Miranville, and Schimperna (Physica D, 2011) as well as the case of vanishing adsorption rates which was investigated by Liu and Wu (Arch. Ration. Mech. Anal., 2019). In this paper, we are interested in the numerical treatment of these models and propose an unconditionally stable, linear, fully discrete finite element scheme based on the scalar auxiliary variable approach. Furthermore, we establish the convergence of discrete solutions towards suitable weak solutions of the original model. Thereby, when passing to the limit, we are able to remove the auxiliary variables introduced in the discrete setting completely. Finally, we present simulations based on the proposed linear scheme and compare them to results obtained using a stable, non-linear scheme to underline the practicality of our scheme.
Submitted 23 March, 2022;
originally announced March 2022.
-
Existence of nonnegative solutions to stochastic thin-film equations in two space dimensions
Authors:
Stefan Metzger,
Günther Grün
Abstract:
We prove the existence of martingale solutions to stochastic thin-film equations in the physically relevant space dimension $d=2$. Conceptually, we rely on a stochastic Faedo-Galerkin approach using tensor-product linear finite elements in space. Augmenting the physical energy on the approximate level by a curvature term weighted by positive powers of the spatial discretization parameter $h$, we combine Ito's formula with inverse estimates and appropriate stopping time arguments to derive stochastic counterparts of the energy and entropy estimates known from the deterministic setting. In the limit $h\searrow 0$, we prove our strictly positive finite element solutions to converge towards nonnegative martingale solutions -- making use of compactness arguments based on Jakubowski's generalization of Skorokhod's theorem and subtle exhaustion arguments to identify third-order spatial derivatives in the flux terms.
Submitted 22 December, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.
-
Self-Supervised Pretraining Improves Self-Supervised Pretraining
Authors:
Colorado J. Reed,
Xiangyu Yue,
Ani Nrusimha,
Sayna Ebrahimi,
Vivek Vijaykumar,
Richard Mao,
Bo Li,
Shanghang Zhang,
Devin Guillory,
Sean Metzger,
Kurt Keutzer,
Trevor Darrell
Abstract:
While self-supervised pretraining has proven beneficial for many computer vision tasks, it requires expensive and lengthy computation, large amounts of data, and is sensitive to data augmentation. Prior work demonstrates that models pretrained on datasets dissimilar to their target data, such as chest X-ray models trained on ImageNet, underperform models trained from scratch. Users that lack the resources to pretrain must use existing models with lower performance. This paper explores Hierarchical PreTraining (HPT), which decreases convergence time and improves accuracy by initializing the pretraining process with an existing pretrained model. Through experimentation on 16 diverse vision datasets, we show HPT converges up to 80x faster, improves accuracy across tasks, and improves the robustness of the self-supervised pretraining process to changes in the image augmentation policy or amount of pretraining data. Taken together, HPT provides a simple framework for obtaining better pretrained representations with less computational resources.
Submitted 24 March, 2021; v1 submitted 23 March, 2021;
originally announced March 2021.
-
SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning
Authors:
Colorado J Reed,
Sean Metzger,
Aravind Srinivas,
Trevor Darrell,
Kurt Keutzer
Abstract:
A common practice in unsupervised representation learning is to use labeled data to evaluate the quality of the learned representations. This supervised evaluation is then used to guide critical aspects of the training process such as selecting the data augmentation policy. However, guiding an unsupervised training process through supervised evaluations is not possible for real-world data that does not actually contain labels (which may be the case, for example, in privacy sensitive fields such as medical imaging). Therefore, in this work we show that evaluating the learned representations with a self-supervised image rotation task is highly correlated with a standard set of supervised evaluations (rank correlation $> 0.94$). We establish this correlation across hundreds of augmentation policies, training settings, and network architectures and provide an algorithm (SelfAugment) to automatically and efficiently select augmentation policies without using supervised evaluations. Despite not using any labeled data, the learned augmentation policies perform comparably with augmentation policies that were determined using exhaustive supervised evaluations.
Submitted 17 May, 2021; v1 submitted 16 September, 2020;
originally announced September 2020.
-
Phase-field dynamics with transfer of materials: The Cahn--Hilliard equation with reaction rate dependent dynamic boundary conditions
Authors:
Patrik Knopf,
Kei Fong Lam,
Chun Liu,
Stefan Metzger
Abstract:
The Cahn--Hilliard equation is one of the most common models to describe phase separation processes of a mixture of two materials. For a better description of short-range interactions between the material and the boundary, various dynamic boundary conditions for the Cahn--Hilliard equation have been proposed and investigated in recent times. Of particular interest are the model by Goldstein, Miranville and Schimperna (Physica D, 2011) and the model by Liu and Wu (Arch. Ration. Mech. Anal., 2019). Both of these models satisfy similar physical properties but differ greatly in their mass conservation behaviour. In this paper we introduce a new model which interpolates between these previous models, and investigate analytical properties such as the existence of unique solutions and convergence to the previous models mentioned above in both the weak and the strong sense. For the strong convergences we also establish rates in terms of the interpolation parameter, which are supported by numerical simulations obtained from a fully discrete, unconditionally stable and convergent finite element scheme for the new interpolation model.
Submitted 24 April, 2021; v1 submitted 29 March, 2020;
originally announced March 2020.
-
On a novel approach for modeling liquid crystalline flows
Authors:
Stefan Metzger
Abstract:
In this paper, we derive a new model for the description of liquid crystalline flows. While microscopic Doi type models suffer from the high dimensionality of the underlying product space, the more macroscopic Ericksen--Leslie type models describe only the long time behavior of the flow and are valid only close to equilibrium. By applying an energetic variational approach, we derive a new macroscopic model which shall provide an improved description far from equilibrium. The novelty of our approach lies in the way the energy is minimized. Distinguishing between the velocities of particles and fluid allows us to define the energy dissipation not in terms of chemical potentials but in terms of friction induced by the discrepancies in the considered velocities. We conclude this publication by establishing the existence of weak solutions to the newly derived model.
Submitted 27 March, 2020;
originally announced March 2020.
-
Homogenization of two-phase flow in porous media from Pore to Darcy Scale: A phase-field approach
Authors:
Stefan Metzger,
Peter Knabner
Abstract:
We extend the two-scale expansion approach of periodic homogenization to include time scales and thus can tackle the full instationary Navier-Stokes-Cahn-Hilliard model at the pore scale as microscale. Time scale separation allows us to keep microscale dynamics, responsible e.g. for hysteresis, and arrive at a numerically tractable micro-macro model including coupled generalized Darcy's laws.
Submitted 30 January, 2020;
originally announced February 2020.
-
An efficient and convergent finite element scheme for Cahn--Hilliard equations with dynamic boundary conditions
Authors:
Stefan Metzger
Abstract:
The Cahn--Hilliard equation is a widely used model that describes amongst others phase separation processes of binary mixtures or two-phase flows. In recent years, different types of boundary conditions for the Cahn--Hilliard equation were proposed and analyzed. In this publication, we are concerned with the numerical treatment of a recent model which introduces an additional Cahn--Hilliard type equation on the boundary as closure for the Cahn--Hilliard equation in the domain [C. Liu, H. Wu, Arch. Ration. Mech. An., 2019]. By identifying a mapping between the phase-field parameter and the chemical potential inside of the domain, we are able to postulate an efficient, unconditionally energy stable finite element scheme. Furthermore, we establish the convergence of discrete solutions towards suitable weak solutions of the original model. This serves also as an additional pathway to establish existence of weak solutions. Furthermore, we present simulations underlining the practicality of the proposed scheme and investigate its experimental order of convergence.
Submitted 20 October, 2020; v1 submitted 13 August, 2019;
originally announced August 2019.
-
Relating prepotentials and quantum vacua of N=1 gauge theories with different tree-level superpotentials
Authors:
Adel Bilal,
Steffen Metzger
Abstract:
We consider N=1 supersymmetric U(N) gauge theories with Z_k symmetric tree-level superpotentials W for an adjoint chiral multiplet. We show that (for integer 2N/k) this Z_k symmetry survives in the quantum effective theory as a corresponding symmetry of the effective superpotential W_eff(S_i) under permutations of the S_i. For W(x)=^W(h(x)) with h(x)=x^k, this allows us to express the prepotential F_0 and effective superpotential W_eff on certain submanifolds of the moduli space in terms of an ^F_0 and ^W_eff of a different theory with tree-level superpotential ^W. In particular, if the Z_k symmetric polynomial W(x) is of degree 2k, then ^W is gaussian and we obtain very explicit formulae for F_0 and W_eff. Moreover, in this case, every vacuum of the effective Veneziano-Yankielowicz superpotential ^W_eff is shown to give rise to a vacuum of W_eff. Somewhat surprisingly, at the level of the prepotential F_0(S_i) the permutation symmetry only holds for k=2, while it is anomalous for k>2 due to subtleties related to the non-compact period integrals. Some of these results are also extended to general polynomial relations h(x) between the tree-level superpotentials.
Submitted 30 May, 2006; v1 submitted 31 January, 2006;
originally announced January 2006.
-
Supersymmetric Gauge Theories from String Theory
Authors:
Steffen Metzger
Abstract:
The subject of this thesis are various ways to construct four-dimensional quantum field theories from string theory. In a first part we study the generation of a supersymmetric Yang-Mills theory, coupled to an adjoint chiral superfield, from type IIB string theory on non-compact Calabi-Yau manifolds, with D-branes wrapping certain subcycles. The low energy limit of this non-Abelian gauge theory can be obtained from a second non-compact Calabi-Yau geometry, which is related to the first one through a geometric transition. In particular, the effective superpotential governing the vacuum structure of the gauge theory can be obtained from integrals on a Calabi-Yau manifold. These integrals in turn are related to matrix model quantities and one therefore can use the matrix model to learn something about the gauge theory vacua. The second part of this work covers the generation of four-dimensional supersymmetric gauge theories, carrying several important characteristic features of the standard model, from compactifications of eleven-dimensional supergravity on G_2-manifolds. We discuss anomaly cancellation through inflow in the case of conical singularities, present an explicit compact manifold with two conical singularities and weak G_2-holonomy, and review the anomaly cancellation mechanism in the context of M-theory on the interval.
Submitted 22 December, 2005;
originally announced December 2005.
-
Special geometry of local Calabi-Yau manifolds and superpotentials from holomorphic matrix models
Authors:
Adel Bilal,
Steffen Metzger
Abstract:
We analyse the (rigid) special geometry of a class of local Calabi-Yau manifolds given by hypersurfaces in C^4 as W'(x)^2+f_0(x)+v^2+w^2+z^2=0, that arise in the study of the large N duals of four-dimensional N=1 supersymmetric SU(N) Yang-Mills theories with adjoint field Φ and superpotential W(Φ). The special geometry relations are deduced from the planar limit of the corresponding holomorphic matrix model. The set of cycles is split into a bulk sector, for which we obtain the standard rigid special geometry relations, and a set of relative cycles, that come from the non-compactness of the manifold, for which we find cut-off dependent corrections to the usual special geometry relations. The (cut-off independent) prepotential is identified with the (analytically continued) free energy of the holomorphic matrix model in the planar limit. On the way, we clarify various subtleties pertaining to the saddle point approximation of the holomorphic matrix model. A formula for the superpotential of IIB string theory with background fluxes on these local Calabi-Yau manifolds is proposed that is based on pairings similar to the ones of relative cohomology.
Submitted 23 March, 2005;
originally announced March 2005.
-
M-theory compactifications, G_2-manifolds and anomalies
Authors:
Steffen Metzger
Abstract:
This diploma thesis has three major objectives. Firstly, we give an elementary introduction to M-theory compactifications, which are obtained from an analysis of its low-energy effective theory, eleven-dimensional supergravity. In particular, we show how the requirement of N=1 supersymmetry in four dimensions leads to compactifications on G_2-manifolds. We also examine the Freund-Rubin solution as well as the M2- and M5-brane. Secondly, we review the construction of realistic theories in four dimensions from compactifications on G_2-manifolds. It turns out that this can only be achieved if the manifolds are allowed to carry singularities of various kinds. Thirdly, we are interested in the concept of anomalies in the framework of M-theory. We present some basic material on anomalies and examine three cases where anomalies play a prominent role in M-theory. We review M-theory on R^10 x S^1/Z_2 where anomalies are a major ingredient leading to the duality between M-theory and the E_8 x E_8 heterotic string. A detailed calculation of the tangent and normal bundle anomaly in the case of the M5-brane is also included. It is known that in this case the normal bundle anomaly can only be cancelled if the topological term of eleven-dimensional supergravity is modified in a suitable way. Finally, we present a new mechanism to cancel anomalies which are present if M-theory is compactified on G_2-manifolds carrying singularities of codimension seven. In order to establish local anomaly cancellation we once again have to modify the topological term of eleven-dimensional supergravity as well as the Green-Schwarz term.
Submitted 13 August, 2003;
originally announced August 2003.
-
Anomaly cancellation in M-theory: a critical review
Authors:
Adel Bilal,
Steffen Metzger
Abstract:
We carefully review the basic examples of anomaly cancellation in M-theory: the 5-brane anomalies and the anomalies on S^1/Z_2. This involves cancellation between quantum anomalies and classical inflow from topological terms. To correctly fix all coefficients and signs, proper attention is paid to issues of orientation, chirality and the Euclidean continuation. Independent of the conventions chosen, the Chern-Simons and Green-Schwarz terms must always have the same sign. The reanalysis of the reduction to the heterotic string on S^1/Z_2 yields a surprise: a previously neglected factor forces us to slightly modify the Chern-Simons term, similar to what is needed for cancelling the normal bundle anomaly of the 5-brane. This modification leads to a local cancellation of the anomaly, while maintaining the periodicity on S^1.
Submitted 17 July, 2003;
originally announced July 2003.
-
Anomalies in M-theory on singular G_2-manifolds
Authors:
Adel Bilal,
Steffen Metzger
Abstract:
When M-theory is compactified on G_2-holonomy manifolds with conical singularities, charged chiral fermions are present and the low-energy four-dimensional theory is potentially anomalous. We reconsider the issue of anomaly cancellation, first studied by Witten. We propose a mechanism that provides local cancellation of all gauge and mixed gauge-gravitational anomalies, i.e. separately for each conical singularity. It is similar in spirit to the one used to cancel the normal bundle anomaly in the presence of five-branes. It involves smoothly cutting off all fields close to the conical singularities, resulting in an anomalous variation of the 3-form C and of the non-abelian gauge fields present if there are also ADE singularities.
Submitted 4 September, 2003; v1 submitted 27 March, 2003;
originally announced March 2003.
-
Compact weak G_2-manifolds with conical singularities
Authors:
Adel Bilal,
Steffen Metzger
Abstract:
We construct 7-dimensional compact Einstein spaces with conical singularities that preserve 1/8 of the supersymmetries of M-theory. Mathematically they have weak G_2-holonomy. We show that for every non-compact G_2-holonomy manifold which is asymptotic to a cone on a 6-manifold Y, there is a corresponding weak G_2-manifold with two conical singularities which, close to the singularities, looks like a cone on Y. Our construction provides explicit metrics on these weak G_2-manifolds. We completely determine the cohomology of these manifolds in terms of the cohomology of Y.
Submitted 25 April, 2003; v1 submitted 4 February, 2003;
originally announced February 2003.
-
Spin measurement retrodiction revisited
Authors:
Steffen Metzger
Abstract:
The retrodiction of spin measurements along a set of different axes is revisited in detail. The problem is presented in two different pictures, a geometric and a general algebraic one. Explicit measurement operators that allow the retrodiction are given for the case of three and four axes. For the Vaidman-Aharonov-Albert case of three orthogonal axes the quantum network is constructed for two different initial Bell states.
Submitted 4 February, 2001; v1 submitted 25 June, 2000;
originally announced June 2000.