-
Multimodal Conditional 3D Face Geometry Generation
Authors:
Christopher Otto,
Prashanth Chandran,
Sebastian Weiss,
Markus Gross,
Gaspard Zoss,
Derek Bradley
Abstract:
We present a new method for multimodal conditional 3D face geometry generation that allows user-friendly control over the output identity and expression via a number of different conditioning signals. Within a single model, we demonstrate 3D faces generated from artistic sketches, 2D face landmarks, Canny edges, FLAME face model parameters, portrait photos, or text prompts. Our approach is based o…
▽ More
We present a new method for multimodal conditional 3D face geometry generation that allows user-friendly control over the output identity and expression via a number of different conditioning signals. Within a single model, we demonstrate 3D faces generated from artistic sketches, 2D face landmarks, Canny edges, FLAME face model parameters, portrait photos, or text prompts. Our approach is based on a diffusion process that generates 3D geometry in a 2D parameterized UV domain. Geometry generation passes each conditioning signal through a set of cross-attention layers (IP-Adapter), one set for each user-defined conditioning signal. The result is an easy-to-use 3D face generation tool that produces high resolution geometry with fine-grain user control.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
First high peak and average power single-pass THz FEL based on high brightness photoinjector
Authors:
M. Krasilnikov,
Z. Aboulbanine,
G. Adhikari,
N. Aftab,
A. Asoyan,
P. Boonpornprasert,
H. Davtyan,
G. Georgiev,
J. Good,
A. Grebinyk,
M. Gross,
A. Hoffmann,
E. Kongmon,
X. -K. Li,
A. Lueangaramwong,
D. Melkumyan,
S. Mohanty,
R. Niemczyk,
A. Oppelt,
H. Qian,
C. Richard,
F. Stephan,
G. Vashchenko,
T. Weilbach,
X. Zhang
, et al. (9 additional authors not shown)
Abstract:
Advanced experiments using THz pump and X-ray probe pulses at modern free-electron lasers (FELs) like the European X-ray FEL require a frequency-tunable, high-power, narrow-band THz source maintaining the repetition rate and pulse structure of the X-ray pulses. This paper reports the first results from a THz source, that is based on a single-pass high-gain THz FEL operating with a central waveleng…
▽ More
Advanced experiments using THz pump and X-ray probe pulses at modern free-electron lasers (FELs) like the European X-ray FEL require a frequency-tunable, high-power, narrow-band THz source maintaining the repetition rate and pulse structure of the X-ray pulses. This paper reports the first results from a THz source, that is based on a single-pass high-gain THz FEL operating with a central wavelength of 100 micrometers. The THz FEL prototype is currently in operation at the Photo Injector Test facility at DESY in Zeuthen (PITZ) and uses the same type of electron source as the European XFEL photo injector. A self-amplified spontaneous emission (SASE) FEL was envisioned as the main mechanism for generating the THz pulses. Although the THz FEL at PITZ is supposed to use the same mechanism as at X-ray facilities, it cannot be considered as a simple scaling of the radiation wavelength because there is a large difference in the number of electrons per radiation wavelength, which is five orders of magnitude higher for the THz case. The bunching factor arising from the electron beam current profile contributes strongly to the initial spontaneous emission starting the FEL process. Proof-of-principle experiments were done at PITZ using an LCLS-I undulator to generate the first high-power, high-repetition-rate single-pass THz FEL radiation. Electron bunches with a beam energy of ~17 MeV and a bunch charge of up to several nC are used to generate THz pulses with a pulse energy of several tens of microjoules. For example, for an electron beam with a charge of ~2.4 nC, more than 100 microjoules were generated at a central wavelength of 100 micrometers. The narrowband spectrum was also demonstrated by spectral measurements. These proof-of-principle experiments pave the way for a tunable, high-repetition-rate THz source providing pulses with energies in the millijoule range.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Preheating with deep learning
Authors:
Jong-Hyun Yoon,
Simon Cléry,
Mathieu Gross,
Yann Mambrini
Abstract:
We apply deep learning techniques to the late-time turbulent regime in a post-inflationary model where a real scalar inflaton field and the standard model Higgs doublet interact with renormalizable couplings between them. After inflation, the inflaton decays into the Higgs through a trilinear coupling and the Higgs field subsequently thermalizes with gauge bosons via its $SU(2)\times U(1)$ gauge i…
▽ More
We apply deep learning techniques to the late-time turbulent regime in a post-inflationary model where a real scalar inflaton field and the standard model Higgs doublet interact with renormalizable couplings between them. After inflation, the inflaton decays into the Higgs through a trilinear coupling and the Higgs field subsequently thermalizes with gauge bosons via its $SU(2)\times U(1)$ gauge interaction. Depending on the strength of the trilinear interaction and the Higgs self-coupling, the effective mass squared of Higgs can become negative, leading to the tachyonic production of Higgs particles. These produced Higgs particles would then share their energy with gauge bosons, potentially indicating thermalization. Since the model entails different non-perturbative effects, it is necessary to resort to numerical and semi-classical techniques. However, simulations require significant costs in terms of time and computational resources depending on the model used. Particularly, when $SU(2)$ gauge interactions are introduced, this becomes evident as the gauge field redistributes particle energies through rescattering processes, leading to an abundance of UV modes that disrupt simulation stability. This necessitates very small lattice spacings, resulting in exceedingly long simulation runtimes. Furthermore, the late-time behavior of preheating dynamics exhibits a universal form by wave kinetic theory. Therefore, we analyze patterns in the flow of particle numbers and predict future behavior using CNN-LSTM (Convolutional Neural Network combined with Long Short-Term Memory) time series analysis. In this way, we can reduce our dependence on simulations by orders of magnitude in terms of time and computational resources.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Lossy Image Compression with Foundation Diffusion Models
Authors:
Lucas Relic,
Roberto Azevedo,
Markus Gross,
Christopher Schroers
Abstract:
Incorporating diffusion models in the image compression domain has the potential to produce realistic and detailed reconstructions, especially at extremely low bitrates. Previous methods focus on using diffusion models as expressive decoders robust to quantization errors in the conditioning signals, yet achieving competitive results in this manner requires costly training of the diffusion model an…
▽ More
Incorporating diffusion models in the image compression domain has the potential to produce realistic and detailed reconstructions, especially at extremely low bitrates. Previous methods focus on using diffusion models as expressive decoders robust to quantization errors in the conditioning signals, yet achieving competitive results in this manner requires costly training of the diffusion model and long inference times due to the iterative generative process. In this work we formulate the removal of quantization error as a denoising task, using diffusion to recover lost information in the transmitted image latent. Our approach allows us to perform less than 10\% of the full diffusion generative process and requires no architectural changes to the diffusion model, enabling the use of foundation models as a strong prior without additional fine tuning of the backbone. Our proposed codec outperforms previous methods in quantitative realism metrics, and we verify that our reconstructions are qualitatively preferred by end users, even when other methods use twice the bitrate.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Learning a Generalized Physical Face Model From Data
Authors:
Lingchen Yang,
Gaspard Zoss,
Prashanth Chandran,
Markus Gross,
Barbara Solenthaler,
Eftychios Sifakis,
Derek Bradley
Abstract:
Physically-based simulation is a powerful approach for 3D facial animation as the resulting deformations are governed by physical constraints, allowing to easily resolve self-collisions, respond to external forces and perform realistic anatomy edits. Today's methods are data-driven, where the actuations for finite elements are inferred from captured skin geometry. Unfortunately, these approaches h…
▽ More
Physically-based simulation is a powerful approach for 3D facial animation as the resulting deformations are governed by physical constraints, allowing to easily resolve self-collisions, respond to external forces and perform realistic anatomy edits. Today's methods are data-driven, where the actuations for finite elements are inferred from captured skin geometry. Unfortunately, these approaches have not been widely adopted due to the complexity of initializing the material space and learning the deformation model for each character separately, which often requires a skilled artist followed by lengthy network training. In this work, we aim to make physics-based facial animation more accessible by proposing a generalized physical face model that we learn from a large 3D face dataset in a simulation-free manner. Once trained, our model can be quickly fit to any unseen identity and produce a ready-to-animate physical face model automatically. Fitting is as easy as providing a single 3D face scan, or even a single face image. After fitting, we offer intuitive animation controls, as well as the ability to retarget animations across characters. All the while, the resulting animations allow for physical effects like collision avoidance, gravity, paralysis, bone resha** and more.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles
Authors:
Rui Song,
Chenwei Liang,
Hu Cao,
Zhiran Yan,
Walter Zimmer,
Markus Gross,
Andreas Festag,
Alois Knoll
Abstract:
Collaborative perception in automated vehicles leverages the exchange of information between agents, aiming to elevate perception results. Previous camera-based collaborative 3D perception methods typically employ 3D bounding boxes or bird's eye views as representations of the environment. However, these approaches fall short in offering a comprehensive 3D environmental prediction. To bridge this…
▽ More
Collaborative perception in automated vehicles leverages the exchange of information between agents, aiming to elevate perception results. Previous camera-based collaborative 3D perception methods typically employ 3D bounding boxes or bird's eye views as representations of the environment. However, these approaches fall short in offering a comprehensive 3D environmental prediction. To bridge this gap, we introduce the first method for collaborative 3D semantic occupancy prediction. Particularly, it improves local 3D semantic occupancy predictions by hybrid fusion of (i) semantic and occupancy task features, and (ii) compressed orthogonal attention features shared between vehicles. Additionally, due to the lack of a collaborative perception dataset designed for semantic occupancy prediction, we augment a current collaborative perception dataset to include 3D collaborative semantic occupancy labels for a more robust evaluation. The experimental findings highlight that: (i) our collaborative semantic occupancy predictions excel above the results from single vehicles by over 30%, and (ii) models anchored on semantic occupancy outpace state-of-the-art collaborative 3D detection techniques in subsequent perception applications, showcasing enhanced accuracy and enriched semantic-awareness in road environments.
△ Less
Submitted 25 April, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
An Implicit Physical Face Model Driven by Expression and Style
Authors:
Lingchen Yang,
Gaspard Zoss,
Prashanth Chandran,
Paulo Gotardo,
Markus Gross,
Barbara Solenthaler,
Eftychios Sifakis,
Derek Bradley
Abstract:
3D facial animation is often produced by manipulating facial deformation models (or rigs), that are traditionally parameterized by expression controls. A key component that is usually overlooked is expression 'style', as in, how a particular expression is performed. Although it is common to define a semantic basis of expressions that characters can perform, most characters perform each expression…
▽ More
3D facial animation is often produced by manipulating facial deformation models (or rigs), that are traditionally parameterized by expression controls. A key component that is usually overlooked is expression 'style', as in, how a particular expression is performed. Although it is common to define a semantic basis of expressions that characters can perform, most characters perform each expression in their own style. To date, style is usually entangled with the expression, and it is not possible to transfer the style of one character to another when considering facial animation. We present a new face model, based on a data-driven implicit neural physics model, that can be driven by both expression and style separately. At the core, we present a framework for learning implicit physics-based actuations for multiple subjects simultaneously, trained on a few arbitrary performance capture sequences from a small set of identities. Once trained, our method allows generalized physics-based facial animation for any of the trained identities, extending to unseen performances. Furthermore, it grants control over the animation style, enabling style transfer from one character to another or blending styles of different characters. Lastly, as a physics-based model, it is capable of synthesizing physical effects, such as collision handling, setting our method apart from conventional approaches.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
Implicit Neural Representation for Physics-driven Actuated Soft Bodies
Authors:
Lingchen Yang,
Byungsoo Kim,
Gaspard Zoss,
Baran Gözcü,
Markus Gross,
Barbara Solenthaler
Abstract:
Active soft bodies can affect their shape through an internal actuation mechanism that induces a deformation. Similar to recent work, this paper utilizes a differentiable, quasi-static, and physics-based simulation layer to optimize for actuation signals parameterized by neural networks. Our key contribution is a general and implicit formulation to control active soft bodies by defining a function…
▽ More
Active soft bodies can affect their shape through an internal actuation mechanism that induces a deformation. Similar to recent work, this paper utilizes a differentiable, quasi-static, and physics-based simulation layer to optimize for actuation signals parameterized by neural networks. Our key contribution is a general and implicit formulation to control active soft bodies by defining a function that enables a continuous map** from a spatial point in the material space to the actuation value. This property allows us to capture the signal's dominant frequencies, making the method discretization agnostic and widely applicable. We extend our implicit model to mandible kinematics for the particular case of facial animation and show that we can reliably reproduce facial expressions captured with high-quality capture systems. We apply the method to volumetric soft bodies, human poses, and facial expressions, demonstrating artist-friendly properties, such as simple control over the latent space and resolution invariance at test time.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Artist-Friendly Relightable and Animatable Neural Heads
Authors:
Yingyan Xu,
Prashanth Chandran,
Sebastian Weiss,
Markus Gross,
Gaspard Zoss,
Derek Bradley
Abstract:
An increasingly common approach for creating photo-realistic digital avatars is through the use of volumetric neural fields. The original neural radiance field (NeRF) allowed for impressive novel view synthesis of static heads when trained on a set of multi-view images, and follow up methods showed that these neural representations can be extended to dynamic avatars. Recently, new variants also su…
▽ More
An increasingly common approach for creating photo-realistic digital avatars is through the use of volumetric neural fields. The original neural radiance field (NeRF) allowed for impressive novel view synthesis of static heads when trained on a set of multi-view images, and follow up methods showed that these neural representations can be extended to dynamic avatars. Recently, new variants also surpassed the usual drawback of baked-in illumination in neural representations, showing that static neural avatars can be relit in any environment. In this work we simultaneously tackle both the motion and illumination problem, proposing a new method for relightable and animatable neural heads. Our method builds on a proven dynamic avatar approach based on a mixture of volumetric primitives, combined with a recently-proposed lightweight hardware setup for relightable neural fields, and includes a novel architecture that allows relighting dynamic neural avatars performing unseen expressions in any environment, even with nearfield illumination and viewpoints.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
Spatially Adaptive Cloth Regression with Implicit Neural Representations
Authors:
Lei Shu,
Vinicius Azevedo,
Barbara Solenthaler,
Markus Gross
Abstract:
The accurate representation of fine-detailed cloth wrinkles poses significant challenges in computer graphics. The inherently non-uniform structure of cloth wrinkles mandates the employment of intricate discretization strategies, which are frequently characterized by high computational demands and complex methodologies. Addressing this, the research introduced in this paper elucidates a novel anis…
▽ More
The accurate representation of fine-detailed cloth wrinkles poses significant challenges in computer graphics. The inherently non-uniform structure of cloth wrinkles mandates the employment of intricate discretization strategies, which are frequently characterized by high computational demands and complex methodologies. Addressing this, the research introduced in this paper elucidates a novel anisotropic cloth regression technique that capitalizes on the potential of implicit neural representations of surfaces. Our first core contribution is an innovative mesh-free sampling approach, crafted to reduce the reliance on traditional mesh structures, thereby offering greater flexibility and accuracy in capturing fine cloth details. Our second contribution is a novel adversarial training scheme, which is designed meticulously to strike a harmonious balance between the sampling and simulation objectives. The adversarial approach ensures that the wrinkles are represented with high fidelity, while also maintaining computational efficiency. Our results showcase through various cloth-object interaction scenarios that our method, given the same memory constraints, consistently surpasses traditional discrete representations, particularly when modelling highly-detailed localized wrinkles.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Weight fluctuations in (deep) linear neural networks and a derivation of the inverse-variance flatness relation
Authors:
Markus Gross,
Arne P. Raulf,
Christoph Räth
Abstract:
We investigate the stationary (late-time) training regime of single- and two-layer underparameterized linear neural networks within the continuum limit of stochastic gradient descent (SGD) for synthetic Gaussian data. In the case of a single-layer network in the weakly underparameterized regime, the spectrum of the noise covariance matrix deviates notably from the Hessian, which can be attributed…
▽ More
We investigate the stationary (late-time) training regime of single- and two-layer underparameterized linear neural networks within the continuum limit of stochastic gradient descent (SGD) for synthetic Gaussian data. In the case of a single-layer network in the weakly underparameterized regime, the spectrum of the noise covariance matrix deviates notably from the Hessian, which can be attributed to the broken detailed balance of SGD dynamics. The weight fluctuations are in this case generally anisotropic, but effectively experience an isotropic loss. For an underparameterized two-layer network, we describe the stochastic dynamics of the weights in each layer and analyze the associated stationary covariances. We identify the inter-layer coupling as a distinct source of anisotropy for the weight fluctuations. In contrast to the single-layer case, the weight fluctuations are effectively subject to an anisotropic loss, the flatness of which is inversely related to the fluctuation variance. We thereby provide an analytical derivation of the recently observed inverse variance-flatness relation in a model of a deep linear neural network.
△ Less
Submitted 23 June, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
GroomGen: A High-Quality Generative Hair Model Using Hierarchical Latent Representations
Authors:
Yuxiao Zhou,
Menglei Chai,
Alessandro Pepe,
Markus Gross,
Thabo Beeler
Abstract:
Despite recent successes in hair acquisition that fits a high-dimensional hair model to a specific input subject, generative hair models, which establish general embedding spaces for encoding, editing, and sampling diverse hairstyles, are way less explored. In this paper, we present GroomGen, the first generative model designed for hair geometry composed of highly-detailed dense strands. Our appro…
▽ More
Despite recent successes in hair acquisition that fits a high-dimensional hair model to a specific input subject, generative hair models, which establish general embedding spaces for encoding, editing, and sampling diverse hairstyles, are way less explored. In this paper, we present GroomGen, the first generative model designed for hair geometry composed of highly-detailed dense strands. Our approach is motivated by two key ideas. First, we construct hair latent spaces covering both individual strands and hairstyles. The latent spaces are compact, expressive, and well-constrained for high-quality and diverse sampling. Second, we adopt a hierarchical hair representation that parameterizes a complete hair model to three levels: single strands, sparse guide hairs, and complete dense hairs. This representation is critical to the compactness of latent spaces, the robustness of training, and the efficiency of inference. Based on this hierarchical latent representation, our proposed pipeline consists of a strand-VAE and a hairstyle-VAE that encode an individual strand and a set of guide hairs to their respective latent spaces, and a hybrid densification step that populates sparse guide hairs to a dense hair model. GroomGen not only enables novel hairstyle sampling and plausible hairstyle interpolation, but also supports interactive editing of complex hairstyles, or can serve as strong data-driven prior for hairstyle reconstruction from images. We demonstrate the superiority of our approach with qualitative examples of diverse sampled hairstyles and quantitative evaluation of generation quality regarding every single component and the entire pipeline.
△ Less
Submitted 16 November, 2023; v1 submitted 3 November, 2023;
originally announced November 2023.
-
A Perceptual Shape Loss for Monocular 3D Face Reconstruction
Authors:
Christopher Otto,
Prashanth Chandran,
Gaspard Zoss,
Markus Gross,
Paulo Gotardo,
Derek Bradley
Abstract:
Monocular 3D face reconstruction is a wide-spread topic, and existing approaches tackle the problem either through fast neural network inference or offline iterative reconstruction of face geometry. In either case carefully-designed energy functions are minimized, commonly including loss terms like a photometric loss, a landmark reprojection loss, and others. In this work we propose a new loss fun…
▽ More
Monocular 3D face reconstruction is a wide-spread topic, and existing approaches tackle the problem either through fast neural network inference or offline iterative reconstruction of face geometry. In either case carefully-designed energy functions are minimized, commonly including loss terms like a photometric loss, a landmark reprojection loss, and others. In this work we propose a new loss function for monocular face capture, inspired by how humans would perceive the quality of a 3D face reconstruction given a particular image. It is widely known that shading provides a strong indicator for 3D shape in the human visual system. As such, our new 'perceptual' shape loss aims to judge the quality of a 3D face estimate using only shading cues. Our loss is implemented as a discriminator-style neural network that takes an input face image and a shaded render of the geometry estimate, and then predicts a score that perceptually evaluates how well the shaded render matches the given image. This 'critic' network operates on the RGB image and geometry render alone, without requiring an estimate of the albedo or illumination in the scene. Furthermore, our loss operates entirely in image space and is thus agnostic to mesh topology. We show how our new perceptual shape loss can be combined with traditional energy terms for monocular 3D face optimization and deep neural network regression, improving upon current state-of-the-art results.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Middle-mile optimization for next-day delivery
Authors:
Konstantinos Benidis,
Georgios Paschos,
Martin Gross,
George Iosifidis
Abstract:
We consider an e-commerce retailer operating a supply chain that consists of middle- and last-mile transportation, and study its ability to deliver products stored in warehouses within a day from customer's order time. Successful next-day delivery requires inventory availability and timely truck schedules in the middle-mile and in this paper we assume a fixed inventory position and focus on optimi…
▽ More
We consider an e-commerce retailer operating a supply chain that consists of middle- and last-mile transportation, and study its ability to deliver products stored in warehouses within a day from customer's order time. Successful next-day delivery requires inventory availability and timely truck schedules in the middle-mile and in this paper we assume a fixed inventory position and focus on optimizing the middle-mile. We formulate a novel optimization problem which decides the departure of the last middle-mile truck at each (potential) network connection in order to maximize the number of next-day deliveries. We show that the respective \emph{next-day delivery optimization} is a combinatorial problem that is $NP$-hard to approximate within $(1-1/e)\cdot\texttt{opt}\approx 0.632\cdot\texttt{opt}$, hence every retailer that offers one-day deliveries has to deal with this complexity barrier. We study three variants of the problem motivated by operational constraints that different retailers encounter, and propose solutions schemes tailored to each problem's properties. To that end, we rely on greedy submodular maximization, pipage rounding techniques, and Lagrangian heuristics. The algorithms are scalable, offer optimality gap guarantees, and evaluated in realistic datasets and network scenarios were found to achieve near-optimal results.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
DualStream: Spatially Sharing Selves and Surroundings using Mobile Devices and Augmented Reality
Authors:
Rishi Vanukuru,
Suibi Che-Chuan Weng,
Krithik Ranjan,
Torin Hopkins,
Amy Banic,
Mark D. Gross,
Ellen Yi-Luen Do
Abstract:
In-person human interaction relies on our spatial perception of each other and our surroundings. Current remote communication tools partially address each of these aspects. Video calls convey real user representations but without spatial interactions. Augmented and Virtual Reality (AR/VR) experiences are immersive and spatial but often use virtual environments and characters instead of real-life r…
▽ More
In-person human interaction relies on our spatial perception of each other and our surroundings. Current remote communication tools partially address each of these aspects. Video calls convey real user representations but without spatial interactions. Augmented and Virtual Reality (AR/VR) experiences are immersive and spatial but often use virtual environments and characters instead of real-life representations. Bridging these gaps, we introduce DualStream, a system for synchronous mobile AR remote communication that captures, streams, and displays spatial representations of users and their surroundings. DualStream supports transitions between user and environment representations with different levels of visuospatial fidelity, as well as the creation of persistent shared spaces using environment snapshots. We demonstrate how DualStream can enable spatial communication in real-world contexts, and support the creation of blended spaces for collaboration. A formative evaluation of DualStream revealed that users valued the ability to interact spatially and move between representations, and could see DualStream fitting into their own remote communication practices in the near future. Drawing from these findings, we discuss new opportunities for designing more widely accessible spatial communication tools, centered around the mobile phone.
△ Less
Submitted 2 September, 2023;
originally announced September 2023.
-
Effects of Fragmentation on Post-Inflationary Reheating
Authors:
Marcos A. G. Garcia,
Mathieu Gross,
Yann Mambrini,
Keith A. Olive,
Mathias Pierre,
Jong-Hyun Yoon
Abstract:
We consider the effects of fragmentation on the post-inflationary epoch of reheating. In simple single field models of inflation, an inflaton condensate undergoes an oscillatory phase once inflationary expansion ends. The equation of state of the condensate depends on the shape of the scalar potential, $V(φ)$, about its minimum. Assuming $V(φ) \sim φ^k$, the equation of state parameter is given by…
▽ More
We consider the effects of fragmentation on the post-inflationary epoch of reheating. In simple single field models of inflation, an inflaton condensate undergoes an oscillatory phase once inflationary expansion ends. The equation of state of the condensate depends on the shape of the scalar potential, $V(φ)$, about its minimum. Assuming $V(φ) \sim φ^k$, the equation of state parameter is given by $w = P_φ/ρ_φ= (k-2)/(k+2)$. The evolution of condensate and the reheating process depend on $k$. For $k \ge 4$, inflaton self-interactions may lead to the fragmentation of the condensate and alter the reheating process. Indeed, these self-interactions lead to the production of a massless gas of inflaton particles as $w$ relaxes to 1/3. If reheating occurs before fragmentation, the effects of fragmentation are harmless. We find, however, that the effects of fragmentation depend sensitively to the specific reheating process. Reheating through the decays to fermions is largely excluded since perturbative couplings would imply that fragmentation occurs before reheating and in fact could prevent reheating from completion. Reheating through the decays to boson is relatively unaffected by fragmentation and reheating through scatterings results in a lower reheating temperature.
△ Less
Submitted 18 December, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Temperature Evolution of Magnon Propagation Length in Tm$_3$Fe$_5$O$_{12}$ Thin Films: Roles of Magnetic Anisotropy and Gilbert Dam**
Authors:
Amit Chanda,
Christian Holzmann,
Noah Schulz,
Aladin Ullrich,
Manfred Albrecht,
Miela J. Gross,
Caroline A. Ross,
Dario. A. Arena,
Manh-Huong Phan,
Hariharan Srikanth
Abstract:
The magnon propagation length ($\langleξ\rangle$) of a ferro/ferrimagnet (FM) is one of the key factors that controls the generation and propagation of thermally-driven spin current in FM/heavy metal (HM) bilayer based spincaloritronic devices. Theory predicts that for the FM layer, $\langleξ\rangle$ is inversely proportional to the Gilbert dam** ($α$) and the square root of the effective magnet…
▽ More
The magnon propagation length ($\langleξ\rangle$) of a ferro/ferrimagnet (FM) is one of the key factors that controls the generation and propagation of thermally-driven spin current in FM/heavy metal (HM) bilayer based spincaloritronic devices. Theory predicts that for the FM layer, $\langleξ\rangle$ is inversely proportional to the Gilbert dam** ($α$) and the square root of the effective magnetic anisotropy constant ($K_{\rm eff}$). However, direct experimental evidence of this relationship is lacking. To experimentally confirm this prediction, we employ a combination of longitudinal spin Seebeck effect (LSSE), transverse susceptibility, and ferromagnetic resonance experiments to investigate the temperature evolution of $\langleξ\rangle$ and establish its correlation with the effective magnetic anisotropy field, $H_K^{\rm eff}$ ($\propto K_{\rm eff}$) and $α$ in Tm$_3$Fe$_5$O$_{12}$ (TmIG)/Pt bilayers. We observe concurrent drops in the LSSE voltage and $\langleξ\rangle$ below 200$^\circ$K in TmIG/Pt bilayers regardless of TmIG film thickness and substrate choice and attribute it to the noticeable increases in $H_K^{\rm eff}$ and $α$ that occur within the same temperature range. From the TmIG thickness dependence of the LSSE voltage, we determined the temperature dependence of $\langleξ\rangle$ and highlighted its correlation with the temperature-dependent $H_K^{\rm eff}$ and $α$ in TmIG/Pt bilayers, which will be beneficial for the development of rare-earth iron garnet-based efficient spincaloritronic nanodevices.
△ Less
Submitted 13 February, 2024; v1 submitted 14 August, 2023;
originally announced August 2023.
-
Remarks on gluing punctured logarithmic maps
Authors:
Mark Gross
Abstract:
We consider some well-behaved cases of the gluing formalism for punctured stable log maps of Abramovich-Chen-Gross-Siebert. This gives a gluing formula for log Gromov-Witten invariants in a diverse set of cases; in particular, the gluing formulae of Li-Ruan, Jun Li and Kim-Lho-Ruddat become an easy special case. The last section gives an application of this gluing formalism to canonical wall struc…
▽ More
We consider some well-behaved cases of the gluing formalism for punctured stable log maps of Abramovich-Chen-Gross-Siebert. This gives a gluing formula for log Gromov-Witten invariants in a diverse set of cases; in particular, the gluing formulae of Li-Ruan, Jun Li and Kim-Lho-Ruddat become an easy special case. The last section gives an application of this gluing formalism to canonical wall structures for K3 surfaces as constructed by Gross and Siebert in "The canonical wall structure and intrinsic mirror symmetry."
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Controllable Inversion of Black-Box Face Recognition Models via Diffusion
Authors:
Manuel Kansy,
Anton Raël,
Graziana Mignone,
Jacek Naruniec,
Christopher Schroers,
Markus Gross,
Romann M. Weber
Abstract:
Face recognition models embed a face image into a low-dimensional identity vector containing abstract encodings of identity-specific facial features that allow individuals to be distinguished from one another. We tackle the challenging task of inverting the latent space of pre-trained face recognition models without full model access (i.e. black-box setting). A variety of methods have been propose…
▽ More
Face recognition models embed a face image into a low-dimensional identity vector containing abstract encodings of identity-specific facial features that allow individuals to be distinguished from one another. We tackle the challenging task of inverting the latent space of pre-trained face recognition models without full model access (i.e. black-box setting). A variety of methods have been proposed in literature for this task, but they have serious shortcomings such as a lack of realistic outputs and strong requirements for the data set and accessibility of the face recognition model. By analyzing the black-box inversion problem, we show that the conditional diffusion model loss naturally emerges and that we can effectively sample from the inverse distribution even without an identity-specific loss. Our method, named identity denoising diffusion probabilistic model (ID3PM), leverages the stochastic nature of the denoising diffusion process to produce high-quality, identity-preserving face images with various backgrounds, lighting, poses, and expressions. We demonstrate state-of-the-art performance in terms of identity preservation and diversity both qualitatively and quantitatively, and our method is the first black-box face recognition model inversion method that offers intuitive control over the generation process.
△ Less
Submitted 30 September, 2023; v1 submitted 22 March, 2023;
originally announced March 2023.
-
Coherent magnon-induced domain wall motion in a magnetic insulator channel
Authors:
Yabin Fan,
Miela J. Gross,
Takian Fakhrul,
Joseph Finley,
Justin T. Hou,
Luqiao Liu,
Caroline A. Ross
Abstract:
Advancing the development of spin-wave devices requires high-quality low-dam** magnetic materials where magnon spin currents can propagate efficiently and interact effectively with local magnetic textures. We show that magnetic domain walls (DW) can modulate spin-wave transport in perpendicularly magnetized channels of Bi-doped yttrium-iron-garnet (BiYIG). Conversely, we demonstrate that the mag…
▽ More
Advancing the development of spin-wave devices requires high-quality low-dam** magnetic materials where magnon spin currents can propagate efficiently and interact effectively with local magnetic textures. We show that magnetic domain walls (DW) can modulate spin-wave transport in perpendicularly magnetized channels of Bi-doped yttrium-iron-garnet (BiYIG). Conversely, we demonstrate that the magnon spin current can drive DW motion in the BiYIG channel device by means of magnon spin-transfer torque. The DW can be reliably moved over 15 um distances at zero applied magnetic field by a magnon spin current excited by an RF pulse as short as 1 ns. The required energy for driving DW motion is orders of magnitude smaller than those reported for metallic systems. These results facilitate low-switching-energy magnonic devices and circuits where magnetic domains can be efficiently reconfigured by magnon spin currents flowing within magnetic channels.
△ Less
Submitted 2 December, 2022;
originally announced December 2022.
-
Tracer particle in a confined correlated medium: an adiabatic elimination method
Authors:
Davide Venturelli,
Markus Gross
Abstract:
We present a simple and systematic procedure to determine the effective dynamics of a Brownian particle coupled to a rapidly fluctuating correlated medium, modeled as a scalar Gaussian field, under spatial confinement. The method allows us, in particular, to address the case in which the fluctuations of the medium are suppressed in the vicinity of the particle, as described by a quadratic coupling…
▽ More
We present a simple and systematic procedure to determine the effective dynamics of a Brownian particle coupled to a rapidly fluctuating correlated medium, modeled as a scalar Gaussian field, under spatial confinement. The method allows us, in particular, to address the case in which the fluctuations of the medium are suppressed in the vicinity of the particle, as described by a quadratic coupling in the underlying Hamiltonian. As a consequence of the confinement of the correlated medium, the resulting effective Fokker-Planck equation features spatially dependent drift and diffusion coefficients. We apply our method to simplified fluid models of binary mixtures and microemulsions near criticality containing a colloidal particle, and we analyze the corrections to the stationary distribution of the particle position and the diffusion coefficient.
△ Less
Submitted 29 December, 2022; v1 submitted 22 September, 2022;
originally announced September 2022.
-
Open FJRW Theory and Mirror Symmetry
Authors:
Mark Gross,
Tyler L. Kelly,
Ran J. Tessler
Abstract:
We construct an open enumerative theory for the Landau-Ginzburg (LG) model $(\mathbb{C}^2, μ_r\times μ_s, x^r+y^s)$. The invariants are defined as integrals of multisections of a Witten bundle with descendents over a moduli space that is a real orbifold with corners. In turn, a generating function for these open invariants yields the mirror LG model and a versal deformation of it with flat coordin…
▽ More
We construct an open enumerative theory for the Landau-Ginzburg (LG) model $(\mathbb{C}^2, μ_r\times μ_s, x^r+y^s)$. The invariants are defined as integrals of multisections of a Witten bundle with descendents over a moduli space that is a real orbifold with corners. In turn, a generating function for these open invariants yields the mirror LG model and a versal deformation of it with flat coordinates. After establishing an open topological recursion result, we prove an LG/LG open mirror symmetry theorem in dimension two with all descendents. The open invariants we define are not unique but depend on boundary conditions that, when altered, exhibit wall-crossing phenomena for the invariants. We describe an LG wall-crossing group classifying the wall-crossing transformations that can occur.
△ Less
Submitted 15 August, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Mirror Symmetry for open r-spin invariants
Authors:
Mark Gross,
Tyler L. Kelly,
Ran J. Tessler
Abstract:
We show that a generating function for open $r$-spin enumerative invariants produces a universal unfolding of the polynomial $x^r$. Further, the coordinates parametrizing this universal unfolding are flat coordinates on the Frobenius manifold associated to the Landau-Ginzburg model $(\mathbb{C},x^r)$ via Saito-Givental theory. This result provides evidence for the same phenomenon to occur in highe…
▽ More
We show that a generating function for open $r$-spin enumerative invariants produces a universal unfolding of the polynomial $x^r$. Further, the coordinates parametrizing this universal unfolding are flat coordinates on the Frobenius manifold associated to the Landau-Ginzburg model $(\mathbb{C},x^r)$ via Saito-Givental theory. This result provides evidence for the same phenomenon to occur in higher dimension, proven in a sequel paper.
△ Less
Submitted 4 March, 2022;
originally announced March 2022.
-
Analysis of photoinjector transverse phase space in action and phase coordinates
Authors:
Houjun Qian,
Mikhail Krasilnikov,
Zakaria Aboulbanine,
Gowri Adhikari,
Namra Aftab,
Prach Boonpornpras,
Georgi Georgiev,
James Good,
Matthias Gross,
Christian Koschitzki,
Xiangkun Li,
Osip Lishilin,
Anusorn Lueangaramwong,
Raffael Niemczyk,
Anne Oppelt,
Guan Shu,
Frank Stephan,
Grygorii Vashchenko,
Tobias Weilbach
Abstract:
Photoinjectors are the main high brightness electron sources for X-ray free electron lasers (XFEL). Photoinjector emittance reduction is one of the key knobs for improving XFEL lasing, so precise emittance measurement is critical. It's well known that rms emittance is very sensitive to low intensity tails of particle distributions in the phase space, whose measurement depend on the signal to noise…
▽ More
Photoinjectors are the main high brightness electron sources for X-ray free electron lasers (XFEL). Photoinjector emittance reduction is one of the key knobs for improving XFEL lasing, so precise emittance measurement is critical. It's well known that rms emittance is very sensitive to low intensity tails of particle distributions in the phase space, whose measurement depend on the signal to noise ratio (SNR) and image processing procedures. Such sensitivities make the interpretations of beam transverse brightness challenging, leading to different emittance definitions to reduce the impact of tail particles. In this paper, transverse phase space is analyzed in action and phase coordinates for both analytical models and experiments, which give a more intuitive way to calculate the beam core brightness.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Slice energy spread measurement in the low energy photoinjector
Authors:
Houjun Qian,
Mikhail Krasilnikov,
Anusorn Lueangaramwong,
Xiangkun Li,
Osip Lishilin,
Zakaria Aboulbanine,
Gowri Adhikari,
Namra Aftab,
Prach Boonpornprasert,
Georgi Georgiev,
James Good,
Matthias Gross,
Christian Koschitzki,
Raffael Niemczyk,
Anne Oppelt,
Guan Shu,
Frank Stephan,
Grygorii Vashchenko,
Tobias Weilbach
Abstract:
Slice energy spread is one of the key parameters in free electron laser optimizations, but its accurate measurement is not straightforward. Two recent studies from high energy ($>$100 MeV) photoinjectors at SwissFEL and European XFEL have reported much higher slice energy spread than expected at their XFEL working points (200 - 250 pC). In this paper, a new method for measuring slice energy spread…
▽ More
Slice energy spread is one of the key parameters in free electron laser optimizations, but its accurate measurement is not straightforward. Two recent studies from high energy ($>$100 MeV) photoinjectors at SwissFEL and European XFEL have reported much higher slice energy spread than expected at their XFEL working points (200 - 250 pC). In this paper, a new method for measuring slice energy spread at a lower beam energy ($\sim$20 MeV) is proposed and demonstrated at the PhotoInjector Test facility at DESY Zeuthen (PITZ), and the results for 250 pC and 500 pC are much lower than those measured at high energy injectors.
△ Less
Submitted 31 January, 2022;
originally announced January 2022.
-
Microdosing: Knowledge Distillation for GAN based Compression
Authors:
Leonhard Helminger,
Roberto Azevedo,
Abdelaziz Djelouah,
Markus Gross,
Christopher Schroers
Abstract:
Recently, significant progress has been made in learned image and video compression. In particular the usage of Generative Adversarial Networks has lead to impressive results in the low bit rate regime. However, the model size remains an important issue in current state-of-the-art proposals and existing solutions require significant computation effort on the decoding side. This limits their usage…
▽ More
Recently, significant progress has been made in learned image and video compression. In particular the usage of Generative Adversarial Networks has lead to impressive results in the low bit rate regime. However, the model size remains an important issue in current state-of-the-art proposals and existing solutions require significant computation effort on the decoding side. This limits their usage in realistic scenarios and the extension to video compression. In this paper, we demonstrate how to leverage knowledge distillation to obtain equally capable image decoders at a fraction of the original number of parameters. We investigate several aspects of our solution including sequence specialization with side information for image coding. Finally, we also show how to transfer the obtained benefits into the setting of video compression. Overall, this allows us to reduce the model size by a factor of 20 and to achieve 50% reduction in decoding time.
△ Less
Submitted 7 January, 2022;
originally announced January 2022.
-
Automating Speedrun Routing: Overview and Vision
Authors:
Matthias Groß,
Dietlind Zühlke,
Boris Naujoks
Abstract:
Speedrunning in general means to play a video game fast, i.e. using all means at one's disposal to achieve a given goal in the least amount of time possible. To do so, a speedrun must be planned in advance, or routed, as referred to by the community. This paper focuses on discovering challenges and defining models needed when trying to approach the problem of routing algorithmically. To do so, thi…
▽ More
Speedrunning in general means to play a video game fast, i.e. using all means at one's disposal to achieve a given goal in the least amount of time possible. To do so, a speedrun must be planned in advance, or routed, as referred to by the community. This paper focuses on discovering challenges and defining models needed when trying to approach the problem of routing algorithmically. To do so, this paper is split in two parts. The first part provides an overview of relevant speedrunning literature, extracting vital information and formulating criticism. Important categorizations are pointed out and a nomenclature is built to support professional discussion. The second part of this paper then refers to the actual speedrun routing optimization problem. Different concepts of graph representations are presented and their potential is discussed. Visions both for problem modeling as well as solving are presented and assessed regarding suitability and expected challenges. Finally, a first assessment of the applicability of existing optimization methods to the defined problem is made, including metaheuristics/EA and Deep Learning methods.
△ Less
Submitted 21 April, 2022; v1 submitted 2 June, 2021;
originally announced June 2021.
-
The canonical wall structure and intrinsic mirror symmetry
Authors:
Mark Gross,
Bernd Siebert
Abstract:
As announced "Intrinsic mirror symmetry and punctured invariants" in 2016, we construct and prove consistency of the canonical wall structure. This construction starts with a log Calabi-Yau pair (X,D) and produces a wall structure, as defined by Gross-Hacking-Siebert. Roughly put, the canonical wall structure is a data structure which encodes an algebro-geometric analogue of counts of Maslov index…
▽ More
As announced "Intrinsic mirror symmetry and punctured invariants" in 2016, we construct and prove consistency of the canonical wall structure. This construction starts with a log Calabi-Yau pair (X,D) and produces a wall structure, as defined by Gross-Hacking-Siebert. Roughly put, the canonical wall structure is a data structure which encodes an algebro-geometric analogue of counts of Maslov index zero disks. These enumerative invariants are defined in terms of the punctured invariants of Abramovich-Chen-Gross-Siebert. There are then two main theorems of the paper. First, we prove consistency of the canonical wall structure, so that the canonical wall structure gives rise to a mirror family. Second, we prove that this mirror family coincides with the intrinsic mirror constructed in our paper "Intrinsic mirror symmetry". While the setup of this paper is narrower than that of the latter paper, it gives a more detailed description of the mirror.
△ Less
Submitted 6 May, 2022; v1 submitted 6 May, 2021;
originally announced May 2021.
-
DuctTake: Spatiotemporal Video Compositing
Authors:
Jan Rueegg,
Oliver Wang,
Aljoscha Smolic,
Markus Gross
Abstract:
DuctTake is a system designed to enable practical compositing of multiple takes of a scene into a single video. Current industry solutions are based around object segmentation, a hard problem that requires extensive manual input and cleanup, making compositing an expensive part of the film-making process. Our method instead composites shots together by finding optimal spatiotemporal seams using mo…
▽ More
DuctTake is a system designed to enable practical compositing of multiple takes of a scene into a single video. Current industry solutions are based around object segmentation, a hard problem that requires extensive manual input and cleanup, making compositing an expensive part of the film-making process. Our method instead composites shots together by finding optimal spatiotemporal seams using motion-compensated 3D graph cuts through the video volume. We describe in detail the required components, decisions, and new techniques that together make a usable, interactive tool for compositing HD video, paying special attention to running time and performance of each section. We validate our approach by presenting a wide variety of examples and by comparing result quality and creation time to composites made by professional artists using current state-of-the-art tools.
△ Less
Submitted 12 January, 2021;
originally announced January 2021.
-
Dynamics and steady states of a tracer particle in a confined critical fluid
Authors:
Markus Gross
Abstract:
The dynamics and the steady states of a point-like tracer particle immersed in a confined critical fluid are studied. The fluid is modeled field-theoretically in terms of an order parameter (concentration or density field) obeying dissipative or conservative equilibrium dynamics and (non-)symmetry-breaking boundary conditions. The tracer, which represents, e.g., a colloidal particle, interacts wit…
▽ More
The dynamics and the steady states of a point-like tracer particle immersed in a confined critical fluid are studied. The fluid is modeled field-theoretically in terms of an order parameter (concentration or density field) obeying dissipative or conservative equilibrium dynamics and (non-)symmetry-breaking boundary conditions. The tracer, which represents, e.g., a colloidal particle, interacts with the fluid by locally modifying its chemical potential or its correlations. The coupling between tracer and fluid gives rise to a nonlinear and non-Markovian tracer dynamics, which is investigated here analytically and via numerical simulations for a one-dimensional system. From the coupled Langevin equations for the tracer-fluid system we derive an effective Fokker-Planck equation for the tracer by means of adiabatic elimination as well as perturbation theory within a weak-coupling approximation. The effective tracer dynamics is found to be governed by a fluctuation-induced (Casimir) potential, a spatially dependent mobility, and a spatially dependent (multiplicative) noise, the characteristics of which depend on the interaction and the boundary conditions. The steady-state distribution of the tracer is typically inhomogeneous. Notably, when detailed balance is broken, the driving of the temporally correlated noise can induce an effective attraction of the tracer towards a boundary.
△ Less
Submitted 31 December, 2022; v1 submitted 6 January, 2021;
originally announced January 2021.
-
Fluctuations of the critical Casimir force
Authors:
Markus Gross,
Andrea Gambassi,
S. Dietrich
Abstract:
The critical Casimir force (CCF) arises from confining fluctuations in a critical fluid and thus it is a fluctuating quantity itself. While the mean CCF is universal, its (static) variance has previously been found to depend on the microscopic details of the system which effectively set a large-momentum cutoff in the underlying field theory, rendering it potentially large. This raises the question…
▽ More
The critical Casimir force (CCF) arises from confining fluctuations in a critical fluid and thus it is a fluctuating quantity itself. While the mean CCF is universal, its (static) variance has previously been found to depend on the microscopic details of the system which effectively set a large-momentum cutoff in the underlying field theory, rendering it potentially large. This raises the question how the properties of the force variance are reflected in experimentally observable quantities, such as the thickness of a wetting film or the position of a suspended colloidal particle. Here, based on a rigorous definition of the instantaneous force, we analyze static and dynamic correlations of the CCF for a conserved fluid in film geometry for various boundary conditions within the Gaussian approximation. We find that the dynamic correlation function of the CCF is independent of the momentum cutoff and decays algebraically in time. Within the Gaussian approximation, the associated exponent depends only on the dynamic universality class but not on the boundary conditions. We furthermore consider a fluid film, the thickness of which can fluctuate under the influence of the time-dependent CCF. The latter gives rise to an effective non-Markovian noise in the equation of motion of the film boundary and induces a distinct contribution to the position variance. Within the approximations used here, at short times, this contribution grows algebraically in time whereas, at long times, it saturates and contributes to the steady-state variance of the film thickness.
△ Less
Submitted 17 June, 2021; v1 submitted 24 November, 2020;
originally announced November 2020.
-
Punctured logarithmic maps
Authors:
Dan Abramovich,
Qile Chen,
Mark Gross,
Bernd Siebert
Abstract:
We introduce a variant of stable logarithmic maps, which we call punctured logarithmic maps. They allow an extension of logarithmic Gromov-Witten theory in which marked points have a negative order of tangency with boundary divisors.
As a main application we develop a gluing formalism which reconstructs stable logarithmic maps and their virtual cycles without expansions of the target, with tropi…
▽ More
We introduce a variant of stable logarithmic maps, which we call punctured logarithmic maps. They allow an extension of logarithmic Gromov-Witten theory in which marked points have a negative order of tangency with boundary divisors.
As a main application we develop a gluing formalism which reconstructs stable logarithmic maps and their virtual cycles without expansions of the target, with tropical geometry providing the underlying combinatorics.
Punctured Gromov-Witten invariants also play a pivotal role in the intrinsic construction of mirror partners by the last two authors in arXiv:1909.07649, conjecturally relating to symplectic cohomology, and in the logarithmic gauged linear sigma model in upcoming work of the second author with Felix Janda and Yongbin Ruan.
△ Less
Submitted 22 April, 2024; v1 submitted 16 September, 2020;
originally announced September 2020.
-
Blind Image Restoration with Flow Based Priors
Authors:
Leonhard Helminger,
Michael Bernasconi,
Abdelaziz Djelouah,
Markus Gross,
Christopher Schroers
Abstract:
Image restoration has seen great progress in the last years thanks to the advances in deep neural networks. Most of these existing techniques are trained using full supervision with suitable image pairs to tackle a specific degradation. However, in a blind setting with unknown degradations this is not possible and a good prior remains crucial. Recently, neural network based approaches have been pr…
▽ More
Image restoration has seen great progress in the last years thanks to the advances in deep neural networks. Most of these existing techniques are trained using full supervision with suitable image pairs to tackle a specific degradation. However, in a blind setting with unknown degradations this is not possible and a good prior remains crucial. Recently, neural network based approaches have been proposed to model such priors by leveraging either denoising autoencoders or the implicit regularization captured by the neural network structure itself. In contrast to this, we propose using normalizing flows to model the distribution of the target content and to use this as a prior in a maximum a posteriori (MAP) formulation. By expressing the MAP optimization process in the latent space through the learned bijective map**, we are able to obtain solutions through gradient descent. To the best of our knowledge, this is the first work that explores normalizing flows as prior in image enhancement problems. Furthermore, we present experimental results for a number of different degradations on data sets varying in complexity and show competitive results when comparing with the deep image prior approach.
△ Less
Submitted 9 September, 2020;
originally announced September 2020.
-
Lossy Image Compression with Normalizing Flows
Authors:
Leonhard Helminger,
Abdelaziz Djelouah,
Markus Gross,
Christopher Schroers
Abstract:
Deep learning based image compression has recently witnessed exciting progress and in some cases even managed to surpass transform coding based approaches that have been established and refined over many decades. However, state-of-the-art solutions for deep image compression typically employ autoencoders which map the input to a lower dimensional latent space and thus irreversibly discard informat…
▽ More
Deep learning based image compression has recently witnessed exciting progress and in some cases even managed to surpass transform coding based approaches that have been established and refined over many decades. However, state-of-the-art solutions for deep image compression typically employ autoencoders which map the input to a lower dimensional latent space and thus irreversibly discard information already before quantization. Due to that, they inherently limit the range of quality levels that can be covered. In contrast, traditional approaches in image compression allow for a larger range of quality levels. Interestingly, they employ an invertible transformation before performing the quantization step which explicitly discards information. Inspired by this, we propose a deep image compression method that is able to go from low bit-rates to near lossless quality by leveraging normalizing flows to learn a bijective map** from the image space to a latent representation. In addition to this, we demonstrate further advantages unique to our solution, such as the ability to maintain constant quality results through re-encoding, even when performed multiple times. To the best of our knowledge, this is the first work to explore the opportunities for leveraging normalizing flows for lossy image compression.
△ Less
Submitted 24 August, 2020;
originally announced August 2020.
-
RoomShift: Room-scale Dynamic Haptics for VR with Furniture-moving Swarm Robots
Authors:
Ryo Suzuki,
Hooman Hedayati,
Clement Zheng,
James Bohn,
Daniel Szafir,
Ellen Yi-Luen Do,
Mark D. Gross,
Daniel Leithinger
Abstract:
RoomShift is a room-scale dynamic haptic environment for virtual reality, using a small swarm of robots that can move furniture. RoomShift consists of nine shape-changing robots: Roombas with mechanical scissor lifts. These robots drive beneath a piece of furniture to lift, move and place it. By augmenting virtual scenes with physical objects, users can sit on, lean against, place and otherwise in…
▽ More
RoomShift is a room-scale dynamic haptic environment for virtual reality, using a small swarm of robots that can move furniture. RoomShift consists of nine shape-changing robots: Roombas with mechanical scissor lifts. These robots drive beneath a piece of furniture to lift, move and place it. By augmenting virtual scenes with physical objects, users can sit on, lean against, place and otherwise interact with furniture with their whole body; just as in the real world. When the virtual scene changes or users navigate within it, the swarm of robots dynamically reconfigures the physical environment to match the virtual content. We describe the hardware and software implementation, applications in virtual tours and architectural design and interaction techniques.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Enriching Video Captions With Contextual Text
Authors:
Philipp Rimle,
Pelin Dogan,
Markus Gross
Abstract:
Understanding video content and generating caption with context is an important and challenging task. Unlike prior methods that typically attempt to generate generic video captions without context, our architecture contextualizes captioning by infusing extracted information from relevant text data. We propose an end-to-end sequence-to-sequence model which generates video captions based on visual i…
▽ More
Understanding video content and generating caption with context is an important and challenging task. Unlike prior methods that typically attempt to generate generic video captions without context, our architecture contextualizes captioning by infusing extracted information from relevant text data. We propose an end-to-end sequence-to-sequence model which generates video captions based on visual input, and mines relevant knowledge such as names and locations from contextual text. In contrast to previous approaches, we do not preprocess the text further, and let the model directly learn to attend over it. Guided by the visual input, the model is able to copy words from the contextual text via a pointer-generator network, allowing to produce more specific video captions. We show competitive performance on the News Video Dataset and, through ablation studies, validate the efficacy of contextual video captioning as well as individual design choices in our model architecture.
△ Less
Submitted 29 July, 2020;
originally announced July 2020.
-
The Higher Dimensional Tropical Vertex
Authors:
Hülya Argüz,
Mark Gross
Abstract:
We study log Calabi-Yau varieties obtained as a blow-up of a toric variety along hypersurfaces in its toric boundary. Mirrors to such varieties are constructed by Gross-Siebert from a canonical scattering diagram built by using punctured log Gromov-Witten invariants of Abramovich-Chen-Gross-Siebert. We show that there is a piecewise linear isomorphism between the canonical scattering diagram and a…
▽ More
We study log Calabi-Yau varieties obtained as a blow-up of a toric variety along hypersurfaces in its toric boundary. Mirrors to such varieties are constructed by Gross-Siebert from a canonical scattering diagram built by using punctured log Gromov-Witten invariants of Abramovich-Chen-Gross-Siebert. We show that there is a piecewise linear isomorphism between the canonical scattering diagram and a scattering diagram defined algortihmically, following a higher dimensional generalisation of the Kontsevich-Soibelman construction. We deduce that the punctured log Gromov-Witten invariants of the log Calabi-Yau variety can be captured from this algorithmic construction. As a particular example, we compute these invariants for a non-toric blow-up of the three dimensional projective space along two lines. This generalizes previous results of Gross-Pandharipande-Siebert on "The Tropical Vertex" to higher dimensions.
△ Less
Submitted 16 December, 2021; v1 submitted 16 July, 2020;
originally announced July 2020.
-
Shapley Value as Principled Metric for Structured Network Pruning
Authors:
Marco Ancona,
Cengiz Öztireli,
Markus Gross
Abstract:
Structured pruning is a well-known technique to reduce the storage size and inference cost of neural networks. The usual pruning pipeline consists of ranking the network internal filters and activations with respect to their contributions to the network performance, removing the units with the lowest contribution, and fine-tuning the network to reduce the harm induced by pruning. Recent results sh…
▽ More
Structured pruning is a well-known technique to reduce the storage size and inference cost of neural networks. The usual pruning pipeline consists of ranking the network internal filters and activations with respect to their contributions to the network performance, removing the units with the lowest contribution, and fine-tuning the network to reduce the harm induced by pruning. Recent results showed that random pruning performs on par with other metrics, given enough fine-tuning resources. In this work, we show that this is not true on a low-data regime when fine-tuning is either not possible or not effective. In this case, reducing the harm caused by pruning becomes crucial to retain the performance of the network. First, we analyze the problem of estimating the contribution of hidden units with tools suggested by cooperative game theory and propose Shapley values as a principled ranking metric for this task. We compare with several alternatives proposed in the literature and discuss how Shapley values are theoretically preferable. Finally, we compare all ranking metrics on the challenging scenario of low-data pruning, where we demonstrate how Shapley values outperform other heuristics.
△ Less
Submitted 2 June, 2020;
originally announced June 2020.
-
Lagrangian Neural Style Transfer for Fluids
Authors:
Byungsoo Kim,
Vinicius C. Azevedo,
Markus Gross,
Barbara Solenthaler
Abstract:
Artistically controlling the shape, motion and appearance of fluid simulations pose major challenges in visual effects production. In this paper, we present a neural style transfer approach from images to 3D fluids formulated in a Lagrangian viewpoint. Using particles for style transfer has unique benefits compared to grid-based techniques. Attributes are stored on the particles and hence are triv…
▽ More
Artistically controlling the shape, motion and appearance of fluid simulations pose major challenges in visual effects production. In this paper, we present a neural style transfer approach from images to 3D fluids formulated in a Lagrangian viewpoint. Using particles for style transfer has unique benefits compared to grid-based techniques. Attributes are stored on the particles and hence are trivially transported by the particle motion. This intrinsically ensures temporal consistency of the optimized stylized structure and notably improves the resulting quality. Simultaneously, the expensive, recursive alignment of stylization velocity fields of grid approaches is unnecessary, reducing the computation time to less than an hour and rendering neural flow stylization practical in production settings. Moreover, the Lagrangian representation improves artistic control as it allows for multi-fluid stylization and consistent color transfer from images, and the generality of the method enables stylization of smoke and liquids likewise.
△ Less
Submitted 2 May, 2020;
originally announced May 2020.
-
Single shot cathode transverse momentum imaging in high brightness photoinjectors
Authors:
Peng-Wei Huang,
Houjun Qian,
Ye Chen,
Daniele Filippetto,
Matthias Gross,
Igor Isaev,
Christian Koschitzki,
Mikhail Krasilnikov,
Shankar Lal,
Xiangkun Li,
Osip Lishilin,
David Melkumyan,
Raffael Niemczyk,
Anne Oppelt,
Fernando Sannibale,
Hamed Shaker,
Guan Shu,
Frank Stephan,
Chuanxiang Tang,
Grygorii Vashchenko,
Weishi Wan
Abstract:
In state of the art photoinjector electron sources, thermal emittance from photoemission dominates the final injector emittance. Therefore, low thermal emittance cathode developments and diagnostics are very important. Conventional thermal emittance measurements for the high gradient gun are time-consuming and thus thermal emittance is not measured as frequently as quantum efficiency during the li…
▽ More
In state of the art photoinjector electron sources, thermal emittance from photoemission dominates the final injector emittance. Therefore, low thermal emittance cathode developments and diagnostics are very important. Conventional thermal emittance measurements for the high gradient gun are time-consuming and thus thermal emittance is not measured as frequently as quantum efficiency during the lifetime of photocathodes, although both are important properties for the photoinjector optimizations. In this paper, a single shot measurement of photoemission transverse momentum, i.e., thermal emittance per rms laser spot size, is proposed for photocathode RF guns. By tuning the gun solenoid focusing, the electrons transverse momenta at the cathode are imaged to a downstream screen, which enables a single shot measurement of both the rms value and the detailed spectra of the photoelectrons transverse momenta. Both simulations and proof of principle experiments are reported.
△ Less
Submitted 17 February, 2020;
originally announced February 2020.
-
LiftTiles: Constructive Building Blocks for Prototy** Room-scale Shape-changing Interfaces
Authors:
Ryo Suzuki,
Ryosuke Nakayama,
Dan Liu,
Yasuaki Kakehi,
Mark D. Gross,
Daniel Leithinger
Abstract:
Large-scale shape-changing interfaces have great potential, but creating such systems requires substantial time, cost, space, and efforts, which hinders the research community to explore interactions beyond the scale of human hands. We introduce modular inflatable actuators as building blocks for prototy** room-scale shape-changing interfaces. Each actuator can change its height from 15cm to 150…
▽ More
Large-scale shape-changing interfaces have great potential, but creating such systems requires substantial time, cost, space, and efforts, which hinders the research community to explore interactions beyond the scale of human hands. We introduce modular inflatable actuators as building blocks for prototy** room-scale shape-changing interfaces. Each actuator can change its height from 15cm to 150cm, actuated and controlled by air pressure. Each unit is low-cost (8 USD), lightweight (10 kg), compact (15 cm), and robust, making it well-suited for prototy** room-scale shape transformations. Moreover, our modular and reconfigurable design allows researchers and designers to quickly construct different geometries and to explore various applications. This paper contributes to the design and implementation of highly extendable inflatable actuators, and demonstrates a range of scenarios that can leverage this modular building block.
△ Less
Submitted 8 January, 2020;
originally announced January 2020.
-
Geometry of twisted Kähler-Einstein metrics and collapsing
Authors:
Mark Gross,
Valentino Tosatti,
Yuguang Zhang
Abstract:
We prove that the twisted Kahler-Einstein metrics that arise on the base of certain holomorphic fiber space with Calabi-Yau fibers have conical-type singularities along the discriminant locus. These fiber spaces arise naturally when studying the collapsing of Ricci-flat Kahler metrics on Calabi-Yau manifolds, and of the Kahler-Ricci flow on compact Kahler manifolds with semiample canonical bundle…
▽ More
We prove that the twisted Kahler-Einstein metrics that arise on the base of certain holomorphic fiber space with Calabi-Yau fibers have conical-type singularities along the discriminant locus. These fiber spaces arise naturally when studying the collapsing of Ricci-flat Kahler metrics on Calabi-Yau manifolds, and of the Kahler-Ricci flow on compact Kahler manifolds with semiample canonical bundle and intermediate Kodaira dimension. Our results allow us to understand their collapsed Gromov-Hausdorff limits when the base is smooth and the discriminant has simple normal crossings.
△ Less
Submitted 20 November, 2020; v1 submitted 17 November, 2019;
originally announced November 2019.
-
A note on the experiment parameters for the non-resonant streaming instability: competition between left and right circularly polarized modes
Authors:
Chun-Sung Jao,
Sergei Vafin,
Ye Chen,
Matthias Gross,
Mikhail Krasilnikov,
Gregor Loisch,
Timon Mehrling,
Jacek Niemiec,
Anne Oppelt,
Alberto Martinez de la Ossa,
Jens Osterhoff,
Martin Pohl,
Frank Stephan
Abstract:
A non-resonant streaming instability driven by cosmic-ray currents, also called Bell's instability, is proposed as a candidate for providing the required magnetic turbulence of efficient diffusive shock accelerations. To demonstrate the saturation level and mechanism of the non-resonant streaming instability in a laboratory environment, we attempt to develop an experiment at the Photo Injector Tes…
▽ More
A non-resonant streaming instability driven by cosmic-ray currents, also called Bell's instability, is proposed as a candidate for providing the required magnetic turbulence of efficient diffusive shock accelerations. To demonstrate the saturation level and mechanism of the non-resonant streaming instability in a laboratory environment, we attempt to develop an experiment at the Photo Injector Test Facility at DESY, Zeuthen site (PITZ). As an electron beam is used to replace the proton beam to carry the cosmic-ray current in our experiment, the polarization of the non-resonant streaming instability will be modified from the left-handed (LH) mode to the right-handed (RH) mode. The theoretical instability analysis shows that the growth rate of this RH non-resonant mode may be smaller than it of the LH resonant mode. However the LH resonant mode can be ignored in our experiment while the expected wavelength is longer than the used plasma cell. The results of PIC simulations will also support this contention and the occurrence of non-resonant streaming instability in our experiment.
△ Less
Submitted 30 October, 2019;
originally announced October 2019.
-
A blockchain-orchestrated Federated Learning architecture for healthcare consortia
Authors:
Jonathan Passerat-Palmbach,
Tyler Farnan,
Robert Miller,
Marielle S. Gross,
Heather Leigh Flannery,
Bill Gleim
Abstract:
We propose a novel architecture for federated learning within healthcare consortia. At the heart of the solution is a unique integration of privacy preserving technologies, built upon native enterprise blockchain components available in the Ethereum ecosystem. We show how the specific characteristics and challenges of healthcare consortia informed our design choices, notably the conception of a ne…
▽ More
We propose a novel architecture for federated learning within healthcare consortia. At the heart of the solution is a unique integration of privacy preserving technologies, built upon native enterprise blockchain components available in the Ethereum ecosystem. We show how the specific characteristics and challenges of healthcare consortia informed our design choices, notably the conception of a new Secure Aggregation protocol assembled with a protected hardware component and an encryption toolkit native to Ethereum. Our architecture also brings in a privacy preserving audit trail that logs events in the network without revealing identities.
△ Less
Submitted 12 October, 2019;
originally announced October 2019.
-
The mirror of the cubic surface
Authors:
Mark Gross,
Paul Hacking,
Sean Keel,
Bernd Siebert
Abstract:
This paper expands on a remark in the paper "Mirror Symmetry for Log Calabi-Yau Surfaces I" of the first three authors of this paper, explaining fully how various constructions of the authors apply to give the mirror to the cubic surface. We give a full description of the scattering diagram associated to the cubic surface: this is a particularly nice diagram in which rays of every rational slope o…
▽ More
This paper expands on a remark in the paper "Mirror Symmetry for Log Calabi-Yau Surfaces I" of the first three authors of this paper, explaining fully how various constructions of the authors apply to give the mirror to the cubic surface. We give a full description of the scattering diagram associated to the cubic surface: this is a particularly nice diagram in which rays of every rational slope occur, but they may all be described. The equation of the mirror cubic family is then derived in two ways, first by using broken lines and then by using more recent constructions involving a direct calculation of Gromov-Witten invariants.
△ Less
Submitted 18 October, 2019;
originally announced October 2019.
-
Intrinsic Mirror Symmetry
Authors:
Mark Gross,
Bernd Siebert
Abstract:
We associate a ring R to a log Calabi-Yau pair (X,D) or a degeneration of Calabi-Yau manifolds X->B. The vector space underlying R is determined by the tropicalization of (X,D) or X->B, while the product rule is defined using punctured Gromov-Witten invariants, defined in joint work with Abramovich and Chen. In the log Calabi-Yau case, if D is maximally degenerate, then we propose that Spec R is t…
▽ More
We associate a ring R to a log Calabi-Yau pair (X,D) or a degeneration of Calabi-Yau manifolds X->B. The vector space underlying R is determined by the tropicalization of (X,D) or X->B, while the product rule is defined using punctured Gromov-Witten invariants, defined in joint work with Abramovich and Chen. In the log Calabi-Yau case, if D is maximally degenerate, then we propose that Spec R is the mirror to X\D, while in the Calabi-Yau degeneration case, if the degeneration is maximally unipotent, the mirror is expected to be Proj R. The main result in this paper is that R as defined is an associative, commutative ring with unit, with associativity the most difficult part.
△ Less
Submitted 10 June, 2021; v1 submitted 17 September, 2019;
originally announced September 2019.
-
ShapeBots: Shape-changing Swarm Robots
Authors:
Ryo Suzuki,
Clement Zheng,
Yasuaki Kakehi,
Tom Yeh,
Ellen Yi-Luen Do,
Mark D. Gross,
Daniel Leithinger
Abstract:
We introduce shape-changing swarm robots. A swarm of self-transformable robots can both individually and collectively change their configuration to display information, actuate objects, act as tangible controllers, visualize data, and provide physical affordances. ShapeBots is a concept prototype of shape-changing swarm robots. Each robot can change its shape by leveraging small linear actuators t…
▽ More
We introduce shape-changing swarm robots. A swarm of self-transformable robots can both individually and collectively change their configuration to display information, actuate objects, act as tangible controllers, visualize data, and provide physical affordances. ShapeBots is a concept prototype of shape-changing swarm robots. Each robot can change its shape by leveraging small linear actuators that are thin (2.5 cm) and highly extendable (up to 20cm) in both horizontal and vertical directions. The modular design of each actuator enables various shapes and geometries of self-transformation. We illustrate potential application scenarios and discuss how this type of interface opens up possibilities for the future of ubiquitous and distributed shape-changing interfaces.
△ Less
Submitted 7 September, 2019;
originally announced September 2019.
-
Data-Driven Physical Face Inversion
Authors:
Yeara Kozlov,
Hongyi Xu,
Moritz Bächer,
Derek Bradley,
Markus Gross,
Thabo Beeler
Abstract:
Facial animation is one of the most challenging problems in computer graphics, and it is often solved using linear heuristics like blend-shape rigging. More expressive approaches like physical simulation have emerged, but these methods are very difficult to tune, especially when simulating a real actor's face. We propose to use a simple finite element simulation approach for face animation, and pr…
▽ More
Facial animation is one of the most challenging problems in computer graphics, and it is often solved using linear heuristics like blend-shape rigging. More expressive approaches like physical simulation have emerged, but these methods are very difficult to tune, especially when simulating a real actor's face. We propose to use a simple finite element simulation approach for face animation, and present a novel method for recovering the required simulation parameters in order to best match a real actor's face motion. Our method involves reconstructing a very small number of head poses of the actor in 3D, where the head poses span different configurations of force directions due to gravity. Our algorithm can then automatically recover both the gravity-free rest shape of the face as well as the spatially-varying physical material stiffness such that a forward simulation will match the captured targets as closely as possible. As a result, our system can produce actor-specific, physical parameters that can be immediately used in recent physical simulation methods for faces. Furthermore, as the simulation results depend heavily on the chosen spatial layout of material clusters, we analyze and compare different spatial layouts.
△ Less
Submitted 24 July, 2019;
originally announced July 2019.
-
Transport-Based Neural Style Transfer for Smoke Simulations
Authors:
Byungsoo Kim,
Vinicius C. Azevedo,
Markus Gross,
Barbara Solenthaler
Abstract:
Artistically controlling fluids has always been a challenging task. Optimization techniques rely on approximating simulation states towards target velocity or density field configurations, which are often handcrafted by artists to indirectly control smoke dynamics. Patch synthesis techniques transfer image textures or simulation features to a target flow field. However, these are either limited to…
▽ More
Artistically controlling fluids has always been a challenging task. Optimization techniques rely on approximating simulation states towards target velocity or density field configurations, which are often handcrafted by artists to indirectly control smoke dynamics. Patch synthesis techniques transfer image textures or simulation features to a target flow field. However, these are either limited to adding structural patterns or augmenting coarse flows with turbulent structures, and hence cannot capture the full spectrum of different styles and semantically complex structures. In this paper, we propose the first Transport-based Neural Style Transfer (TNST) algorithm for volumetric smoke data. Our method is able to transfer features from natural images to smoke simulations, enabling general content-aware manipulations ranging from simple patterns to intricate motifs. The proposed algorithm is physically inspired, since it computes the density transport from a source input smoke to a desired target configuration. Our transport-based approach allows direct control over the divergence of the stylization velocity field by optimizing incompressible and irrotational potentials that transport smoke towards stylization. Temporal consistency is ensured by transporting and aligning subsequent stylized velocities, and 3D reconstructions are computed by seamlessly merging stylizations from different camera viewpoints.
△ Less
Submitted 4 September, 2019; v1 submitted 17 May, 2019;
originally announced May 2019.
-
Dynamics of the critical Casimir force for a conserved order parameter after a critical quench
Authors:
Markus Gross,
Christian M. Rohwer,
S. Dietrich
Abstract:
Fluctuation-induced forces occur generically when long-ranged correlations (e.g., in fluids) are confined by external bodies. In classical systems, such correlations require specific conditions, e.g., a medium close to a critical point. On the other hand, long-ranged correlations appear more commonly in certain non-equilibrium systems with conservation laws. Consequently, a variety of non-equilibr…
▽ More
Fluctuation-induced forces occur generically when long-ranged correlations (e.g., in fluids) are confined by external bodies. In classical systems, such correlations require specific conditions, e.g., a medium close to a critical point. On the other hand, long-ranged correlations appear more commonly in certain non-equilibrium systems with conservation laws. Consequently, a variety of non-equilibrium fluctuation phenomena, including fluctuation-induced forces, have been discovered and explored recently. Here, we address a long-standing problem of non-equilibrium critical Casimir forces emerging after a quench to the critical point in a confined fluid with order-parameter-conserving dynamics and non-symmetry-breaking boundary conditions. The interplay of inherent (critical) fluctuations and dynamical non-local effects (due to density conservation) gives rise to striking features, including correlation functions and forces exhibiting oscillatory time-dependences. Complex transient regimes arise, depending on initial conditions and the geometry of the confinement. Our findings pave the way for exploring a wealth of non-equilibrium processes in critical fluids (e.g., fluctuation-mediated self-assembly or aggregation). In certain regimes, our results are applicable to active matter.
△ Less
Submitted 29 July, 2019; v1 submitted 1 May, 2019;
originally announced May 2019.