-
Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data
Authors:
Bastian Boll,
Daniel Gonzalez-Alvarado,
Stefania Petra,
Christoph Schnörr
Abstract:
We introduce a novel generative model for the representation of joint probability distributions of a possibly large number of discrete random variables. The approach uses measure transport by randomized assignment flows on the statistical submanifold of factorizing distributions, which also enables efficient sampling from the target distribution and assessment of the likelihood of unseen data points. The embedding of the flow via the Segre map in the meta-simplex of all discrete joint distributions ensures that any target distribution can be represented in principle, with a complexity that in practice depends only on the parametrization of the affinity function of the dynamical assignment flow system. Our model can be trained in a simulation-free manner without integration by conditional Riemannian flow matching, using the training data encoded as geodesics in closed form with respect to the e-connection of information geometry. By projecting high-dimensional flow matching in the meta-simplex of joint distributions to the submanifold of factorizing distributions, our approach has strong motivation from first principles of modeling coupled discrete variables. Numerical experiments devoted to distributions of structured image labelings demonstrate the applicability to large-scale problems, which may include discrete distributions in other application areas. Performance measures show that our approach scales better with the increasing number of classes than recent related work.
Submitted 6 June, 2024;
originally announced June 2024.
-
A Geometric Embedding Approach to Multiple Games and Multiple Populations
Authors:
Bastian Boll,
Jonas Cassel,
Peter Albers,
Stefania Petra,
Christoph Schnörr
Abstract:
This paper studies a meta-simplex concept and geometric embedding framework for multi-population replicator dynamics. Central results are two embedding theorems which constitute a formal reduction of multi-population replicator dynamics to single-population ones. In conjunction with a robust mathematical formalism, this provides a toolset for analyzing complex multi-population models. Our framework provides a unifying perspective on different population dynamics in the literature which, in particular, makes it possible to establish a formal link between multi-population and multi-game dynamics.
Submitted 11 January, 2024;
originally announced January 2024.
-
Accelerated Bregman divergence optimization with SMART: an information geometry point of view
Authors:
Maren Raus,
Yara Elshiaty,
Stefania Petra
Abstract:
We investigate the problem of minimizing Kullback-Leibler divergence between a linear model $Ax$ and a positive vector $b$ in different convex domains (positive orthant, $n$-dimensional box, probability simplex). Our focus is on the SMART method that employs efficient multiplicative updates. We explore the exponentiated gradient method, which can be viewed as a Bregman proximal gradient method and as a Riemannian gradient descent on the parameter manifold of a corresponding distribution of the exponential family. This dual interpretation enables us to establish connections and achieve accelerated SMART iterates while smoothly incorporating constraints. The performance of the proposed acceleration schemes is demonstrated by large-scale numerical examples.
Submitted 10 January, 2024;
originally announced January 2024.
-
Multilevel Geometric Optimization for Regularised Constrained Linear Inverse Problems
Authors:
Sebastian Müller,
Stefania Petra,
Matthias Zisler
Abstract:
We present a geometric multilevel optimization approach that smoothly incorporates box constraints. Given a box-constrained optimization problem, we consider a hierarchy of models with varying discretization levels. Finer models are accurate but expensive to compute, while coarser models are less accurate but cheaper to compute. When working at the fine level, multilevel optimization computes the search direction based on a coarser model, which speeds up updates at the fine level. Moreover, by exploiting the geometry induced by the hierarchy, the feasibility of the updates is preserved. In particular, our approach extends classical components of multigrid methods like restriction and prolongation to the Riemannian structure of our constraints.
Submitted 22 April, 2024; v1 submitted 11 July, 2022;
originally announced July 2022.
-
Self-Certifying Classification by Linearized Deep Assignment
Authors:
Bastian Boll,
Alexander Zeilmann,
Stefania Petra,
Christoph Schnörr
Abstract:
We propose a novel class of deep stochastic predictors for classifying metric data on graphs within the PAC-Bayes risk certification paradigm. Classifiers are realized as linearly parametrized deep assignment flows with random initial conditions. Building on the recent PAC-Bayes literature and data-dependent priors, this approach makes it possible (i) to use risk bounds as training objectives for learning posterior distributions on the hypothesis space and (ii) to compute tight out-of-sample risk certificates of randomized classifiers more efficiently than related work. Comparison with empirical test set errors illustrates the performance and practicality of this self-certifying classification method.
Submitted 18 February, 2022; v1 submitted 26 January, 2022;
originally announced January 2022.
-
Learning Linearized Assignment Flows for Image Labeling
Authors:
Alexander Zeilmann,
Stefania Petra,
Christoph Schnörr
Abstract:
We introduce a novel algorithm for estimating optimal parameters of linearized assignment flows for image labeling. An exact formula is derived for the parameter gradient of any loss function that is constrained by the linear system of ODEs determining the linearized assignment flow. We show how to efficiently evaluate this formula using a Krylov subspace and a low-rank approximation. This enables us to perform parameter learning by Riemannian gradient descent in the parameter space, without the need to backpropagate errors or to solve an adjoint equation. Experiments demonstrate that our method performs as well as highly tuned machine learning software using automatic differentiation. Unlike methods employing automatic differentiation, our approach yields a low-dimensional representation of internal parameters and their dynamics, which helps to understand how assignment flows and, more generally, neural networks work and perform.
Submitted 4 April, 2022; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Multi-Channel Potts-Based Reconstruction for Multi-Spectral Computed Tomography
Authors:
Lukas Kiefer,
Stefania Petra,
Martin Storath,
Andreas Weinmann
Abstract:
We consider reconstructing multi-channel images from measurements performed by photon-counting and energy-discriminating detectors in the setting of multi-spectral X-ray computed tomography (CT). Our aim is to exploit the strong structural correlation that is known to exist between the channels of multi-spectral CT images. To that end, we adopt the multi-channel Potts prior to jointly reconstruct all channels. This prior produces piecewise constant solutions with strongly correlated channels. In particular, edges are enforced to have the same spatial position across channels which is a benefit over TV-based methods. We consider the Potts prior in two frameworks: (a) in the context of a variational Potts model, and (b) in a Potts-superiorization approach that perturbs the iterates of a basic iterative least squares solver. We identify an alternating direction method of multipliers (ADMM) approach as well as a Potts-superiorized conjugate gradient method as particularly suitable. In numerical experiments, we compare the Potts prior based approaches to existing TV-type approaches on realistically simulated multi-spectral CT data and obtain improved reconstruction for compound solid bodies.
Submitted 10 March, 2021; v1 submitted 12 September, 2020;
originally announced September 2020.
-
Superiorization vs. Accelerated Convex Optimization: The Superiorized/Regularized Least-Squares Case
Authors:
Yair Censor,
Stefania Petra,
Christoph Schnörr
Abstract:
We conduct a study and comparison of superiorization and optimization approaches for the reconstruction problem of superiorized/regularized least-squares solutions of underdetermined linear equations with nonnegativity variable bounds. Regarding superiorization, the state of the art is examined for this problem class, and a novel approach is proposed that employs proximal mappings and is structurally similar to the established forward-backward optimization approach. Regarding convex optimization, accelerated forward-backward splitting with inexact proximal maps is worked out and applied to both the natural splitting least-squares term/regularizer and to the reverse splitting regularizer/least-squares term. Our numerical findings suggest that superiorization can approach the solution of the optimization problem and leads to comparable results at significantly lower costs, after appropriate parameter tuning. On the other hand, applying accelerated forward-backward optimization to the reverse splitting slightly outperforms superiorization, which suggests that convex optimization can approach superiorization too, using a suitable problem splitting.
Submitted 1 April, 2020; v1 submitted 13 November, 2019;
originally announced November 2019.
-
Self-Assignment Flows for Unsupervised Data Labeling on Graphs
Authors:
Matthias Zisler,
Artjom Zern,
Stefania Petra,
Christoph Schnörr
Abstract:
This paper extends the recently introduced assignment flow approach for supervised image labeling to unsupervised scenarios where no labels are given. The resulting self-assignment flow takes a pairwise data affinity matrix as input data and maximizes the correlation with a low-rank matrix that is parametrized by the variables of the assignment flow, which entails an assignment of the data to themselves through the formation of latent labels (feature prototypes). A single user parameter, the neighborhood size for the geometric regularization of assignments, drives the entire process. By smooth geodesic interpolation between different normalizations of self-assignment matrices on the positive definite matrix manifold, a one-parameter family of self-assignment flows is defined. Accordingly, our approach can be characterized from different viewpoints, e.g. as performing spatially regularized, rank-constrained discrete optimal transport, or as computing spatially regularized normalized spectral cuts. Regarding combinatorial optimization, our approach successfully determines completely positive factorizations of self-assignments in large-scale scenarios, subject to spatial regularization. Various experiments, including the unsupervised learning of patch dictionaries using a locally invariant distance function, illustrate the properties of the approach.
Submitted 24 March, 2020; v1 submitted 8 November, 2019;
originally announced November 2019.
-
Learning Adaptive Regularization for Image Labeling Using Geometric Assignment
Authors:
Ruben Hühnerbein,
Fabrizio Savarino,
Stefania Petra,
Christoph Schnörr
Abstract:
We study the inverse problem of model parameter learning for pixelwise image labeling, using the linear assignment flow and training data with ground truth. This is accomplished by a Riemannian gradient flow on the manifold of parameters that determine the regularization properties of the assignment flow. Using the symplectic partitioned Runge-Kutta method for numerical integration, it is shown that deriving the sensitivity conditions of the parameter learning problem and its discretization commute. A convenient property of our approach is that learning is based on exact inference. Carefully designed experiments demonstrate the performance of our approach, the expressiveness of the mathematical model as well as its limitations, from the viewpoint of statistical learning and optimal control.
Submitted 25 June, 2020; v1 submitted 22 October, 2019;
originally announced October 2019.
-
Unsupervised Assignment Flow: Label Learning on Feature Manifolds by Spatially Regularized Geometric Assignment
Authors:
Artjom Zern,
Matthias Zisler,
Stefania Petra,
Christoph Schnörr
Abstract:
This paper introduces the unsupervised assignment flow that couples the assignment flow for supervised image labeling with Riemannian gradient flows for label evolution on feature manifolds. The latter component of the approach encompasses extensions of state-of-the-art clustering approaches to manifold-valued data. Coupling label evolution with the spatially regularized assignment flow induces a sparsifying effect that makes it possible to learn compact label dictionaries in an unsupervised manner. Our approach alleviates the requirement for supervised labeling to have proper labels at hand, because an initial set of labels can evolve and adapt to better values while being assigned to given data. The separation between feature and assignment manifolds enables flexible application, which is demonstrated for three scenarios with manifold-valued features. Experiments demonstrate a beneficial effect in both directions: adaptivity of labels improves image labeling, and steering label evolution by spatially regularized assignments leads to proper labels, because the assignment flow for supervised labeling is used exactly, without any approximation, for label learning.
Submitted 16 December, 2019; v1 submitted 24 April, 2019;
originally announced April 2019.
-
Performance Bounds For Co-/Sparse Box Constrained Signal Recovery
Authors:
Jan Kuske,
Stefania Petra
Abstract:
The recovery of structured signals from a few linear measurements is a central point in both compressed sensing (CS) and discrete tomography. In CS the signal structure is described by means of a low complexity model e.g. co-/sparsity. The CS theory shows that any signal/image can be undersampled at a rate dependent on its intrinsic complexity. Moreover, in such undersampling regimes, the signal can be recovered by sparsity promoting convex regularization like $\ell_1$- or total variation (TV-) minimization. Precise relations between many low complexity measures and the sufficient number of random measurements are known for many sparsity promoting norms. However, a precise estimate of the undersampling rate for the TV seminorm is still lacking. We address this issue by: a) providing dual certificates testing uniqueness of a given cosparse signal with bounded signal values, b) approximating the undersampling rates via the statistical dimension of the TV descent cone and c) showing empirically that the provided rates also hold for tomographic measurements.
Submitted 23 December, 2018;
originally announced December 2018.
-
Geometric Numerical Integration of the Assignment Flow
Authors:
Alexander Zeilmann,
Fabrizio Savarino,
Stefania Petra,
Christoph Schnörr
Abstract:
The assignment flow is a smooth dynamical system that evolves on an elementary statistical manifold and performs contextual data labeling on a graph. We derive and introduce the linear assignment flow that evolves nonlinearly on the manifold, but is governed by a linear ODE on the tangent space. Various numerical schemes adapted to the mathematical structure of these two models are designed and studied, for the geometric numerical integration of both flows: embedded Runge-Kutta-Munthe-Kaas schemes for the nonlinear flow, adaptive Runge-Kutta schemes and exponential integrators for the linear flow. All algorithms are parameter free, except for setting a tolerance value that specifies adaptive step size selection by monitoring the local integration error, or fixing the dimension of the Krylov subspace approximation. These algorithms provide a basis for applying the assignment flow to machine learning scenarios beyond supervised labeling, including unsupervised labeling and learning from controlled assignment flows.
Submitted 5 October, 2018;
originally announced October 2018.
-
A Novel Convex Relaxation for Non-Binary Discrete Tomography
Authors:
Jan Kuske,
Paul Swoboda,
Stefania Petra
Abstract:
We present a novel convex relaxation and a corresponding inference algorithm for the non-binary discrete tomography problem, that is, reconstructing discrete-valued images from few linear measurements. In contrast to state of the art approaches that split the problem into a continuous reconstruction problem for the linear measurement constraints and a discrete labeling problem to enforce discrete-valued reconstructions, we propose a joint formulation that addresses both problems simultaneously, resulting in a tighter convex relaxation. For this purpose a constrained graphical model is set up and evaluated using a novel relaxation optimized by dual decomposition. We evaluate our approach experimentally and show superior solutions both mathematically (tighter relaxation) and experimentally in comparison to previously proposed relaxations.
Submitted 10 March, 2017;
originally announced March 2017.
-
Image Labeling by Assignment
Authors:
Freddie Åström,
Stefania Petra,
Bernhard Schmitzer,
Christoph Schnörr
Abstract:
We introduce a novel geometric approach to the image labeling problem. Abstracting from specific labeling applications, a general objective function is defined on a manifold of stochastic matrices, whose elements assign prior data that are given in any metric space, to observed image measurements. The corresponding Riemannian gradient flow entails a set of replicator equations, one for each data point, that are spatially coupled by geometric averaging on the manifold. Starting from uniform assignments at the barycenter as natural initialization, the flow terminates at some global maximum, each of which corresponds to an image labeling that uniquely assigns the prior data. Our geometric variational approach constitutes a smooth non-convex inner approximation of the general image labeling problem, implemented with sparse interior-point numerics in terms of parallel multiplicative updates that converge efficiently.
Submitted 16 March, 2016;
originally announced March 2016.
-
Single Projection Kaczmarz Extended Algorithms
Authors:
Stefania Petra,
Constantin Popa
Abstract:
To find the least squares solution of a very large and inconsistent system of equations, one can employ the extended Kaczmarz algorithm. This method simultaneously removes the error term, such that a consistent system is asymptotically obtained, and applies Kaczmarz iterations for the current approximation of this system. For random corrections of the right hand side and Kaczmarz updates selected at random, convergence to the least squares solution has been shown. We consider the deterministic control strategies, and show convergence to a least squares solution when row and column updates are chosen according to the almost-cyclic or maximal-residual choice.
Submitted 1 April, 2015;
originally announced April 2015.
-
Phase Transitions and Cosparse Tomographic Recovery of Compound Solid Bodies from Few Projections
Authors:
Andreea Deniţiu,
Stefania Petra,
Claudius Schnörr,
Christoph Schnörr
Abstract:
We study unique recovery of cosparse signals from limited-angle tomographic measurements of two- and three-dimensional domains. Admissible signals belong to the union of subspaces defined by all cosupports of maximal cardinality $\ell$ with respect to the discrete gradient operator. We relate $\ell$ both to the number of measurements and to a nullspace condition with respect to the measurement matrix, so as to achieve unique recovery by linear programming. These results are supported by comprehensive numerical experiments that show a high correlation of performance in practice and theoretical predictions. Despite poor properties of the measurement matrix from the viewpoint of compressed sensing, the class of uniquely recoverable signals basically seems large enough to cover practical applications, like contactless quality inspection of compound solid bodies composed of few materials.
Submitted 2 November, 2013;
originally announced November 2013.
-
Critical Parameter Values and Reconstruction Properties of Discrete Tomography: Application to Experimental Fluid Dynamics
Authors:
Stefania Petra,
Christoph Schnörr,
Andreas Schröder
Abstract:
We analyze representative ill-posed scenarios of tomographic PIV with a focus on conditions for unique volume reconstruction. Based on sparse random seedings of a region of interest with small particles, the corresponding systems of linear projection equations are probabilistically analyzed in order to determine (i) the ability of unique reconstruction in terms of the imaging geometry and the critical sparsity parameter, and (ii) sharpness of the transition to non-unique reconstruction with ghost particles when choosing the sparsity parameter improperly. The sparsity parameter directly relates to the seeding density used for PIV in experimental fluid dynamics, which is chosen empirically to date. Our results provide a basic mathematical characterization of the PIV volume reconstruction problem that is an essential prerequisite for any algorithm used to actually compute the reconstruction. Moreover, we connect the sparse volume function reconstruction problem from few tomographic projections to major developments in compressed sensing.
Submitted 19 September, 2012;
originally announced September 2012.
-
Average Case Recovery Analysis of Tomographic Compressive Sensing
Authors:
Stefania Petra,
Christoph Schnörr
Abstract:
The reconstruction of three-dimensional sparse volume functions from few tomographic projections constitutes a challenging problem in image reconstruction and turns out to be a particular instance of compressive sensing. The tomographic measurement matrix encodes the incidence relation of the imaging process, and therefore is not subject to design up to small perturbations of non-zero entries. We present an average case analysis of the recovery properties and a corresponding tail bound to establish weak thresholds, in excellent agreement with numerical experiments. Our results improve the state of the art of tomographic imaging in experimental fluid dynamics by a factor of three.
Submitted 30 August, 2012; v1 submitted 29 August, 2012;
originally announced August 2012.
-
Influence of the lattice topography on a three-dimensional, controllable Brownian motor
Authors:
H. Hagman,
C. M. Dion,
P. Sjolund,
S. J. H. Petra,
A. Kastberg
Abstract:
We study the influence of the lattice topography and the coupling between motion in different directions for a three-dimensional Brownian motor based on cold atoms in a double optical lattice. Due to controllable relative spatial phases between the lattices, our Brownian motor can induce drifts in arbitrary directions. Since the lattices couple the different directions, the relation between the phase shifts and the directionality of the induced drift is nontrivial. This relation is therefore investigated experimentally by systematically varying the relative spatial phase in two dimensions, while monitoring the vertically induced drift and the temperature. A relative spatial phase range of $2\pi \times 2\pi$ is covered. We show that a drift, controllable both in speed and direction, can be achieved by varying the phase both parallel and perpendicular to the direction of the measured induced drift. The experimental results are qualitatively reproduced by numerical simulations of a simplified, classical model of the system.
Submitted 19 November, 2007; v1 submitted 28 May, 2007;
originally announced May 2007.
-
Characterisation of a three-dimensional Brownian motor in optical lattices
Authors:
P. Sjolund,
S. J. H. Petra,
C. M. Dion,
H. Hagman,
S. Jonsell,
A. Kastberg
Abstract:
We present here a detailed study of the behaviour of a three-dimensional Brownian motor based on cold atoms in a double optical lattice [P. Sjolund et al., Phys. Rev. Lett. 96, 190602 (2006)]. This includes both experiments and numerical simulations of a Brownian particle. The potentials used are spatially and temporally symmetric, but combined spatiotemporal symmetry is broken by phase shifts and asymmetric transfer rates between potentials. The diffusion of atoms in the optical lattices is rectified and controlled both in direction and speed along three dimensions. We explore a large range of experimental parameters, where irradiances and detunings of the optical lattice lights are varied within the dissipative regime. Induced drift velocities on the order of one atomic recoil velocity have been achieved.
Submitted 30 April, 2007;
originally announced April 2007.
-
Controllable 3D atomic Brownian motor in optical lattices
Authors:
Claude M. Dion,
Peder Sjolund,
Stefan J. H. Petra,
Svante Jonsell,
Mats Nylen,
Laurent Sanchez-Palencia,
Anders Kastberg
Abstract:
We study a Brownian motor, based on cold atoms in optical lattices, where atomic motion can be induced in a controlled manner in an arbitrary direction, by rectification of isotropic random fluctuations. In contrast with ratchet mechanisms, our Brownian motor operates in a potential that is spatially and temporally symmetric, in apparent contradiction to the Curie principle. Simulations, based on the Fokker-Planck equation, allow us to gain knowledge on the qualitative behaviour of our Brownian motor. Studies of Brownian motors, and in particular ones with unique control properties, are of fundamental interest because of the role they play in protein motors and their potential applications in nanotechnology. In particular, our system opens the way to the study of quantum Brownian motors.
Submitted 30 November, 2006;
originally announced November 2006.
-
A nonadiabatic semi-classical method for dynamics of atoms in optical lattices
Authors:
S. Jonsell,
C. M. Dion,
M. Nylén,
S. J. H. Petra,
P. Sjölund,
A. Kastberg
Abstract:
We develop a semi-classical method to simulate the motion of atoms in a dissipative optical lattice. Our method treats the internal states of the atom quantum mechanically, including all nonadiabatic couplings, while position and momentum are treated as classical variables. We test our method in the one-dimensional case. Excellent agreement with fully quantum mechanical simulations is found. Our results are much more accurate than those of earlier semi-classical methods based on the adiabatic approximation.
Submitted 16 December, 2005;
originally announced December 2005.
-
Generation of multiple power-balanced laser beams for quantum-state manipulation experiments with phase-stable double optical lattices
Authors:
S. J. H. Petra,
P. Sjolund,
A. Kastberg
Abstract:
We present a method to obtain power-balanced laser beams for quantum-state manipulation experiments with phase-stable double optical lattices. Double optical lattices are constructed using four pairs of overlapped laser beams with different frequencies. Our optical scheme provides a phase stability between the optical lattices of 5 mrad/s and laser beams with a very clean polarisation state, resulting in a power imbalance in the individual laser beams of less than 1%.
Submitted 11 April, 2006; v1 submitted 16 December, 2005;
originally announced December 2005.
-
Demonstration of a controllable three-dimensional Brownian motor in symmetric potentials
Authors:
Peder Sjolund,
Stefan J. H. Petra,
Claude M. Dion,
Svante Jonsell,
Mats Nylen,
Laurent Sanchez-Palencia,
Anders Kastberg
Abstract:
We demonstrate a Brownian motor, based on cold atoms in optical lattices, where isotropic random fluctuations are rectified in order to induce controlled atomic motion in arbitrary directions. In contrast to earlier demonstrations of ratchet effects, our Brownian motor operates in potentials that are spatially and temporally symmetric, but where spatiotemporal symmetry is broken by a phase shift between the potentials and asymmetric transfer rates between them. The Brownian motor is demonstrated in three dimensions and the noise-induced drift is controllable in our system.
Submitted 24 April, 2006; v1 submitted 15 December, 2005;
originally announced December 2005.
-
Time dependence of laser cooling in optical lattices
Authors:
Claude M. Dion,
Peder Sjolund,
Stefan J. H. Petra,
Svante Jonsell,
Anders Kastberg
Abstract:
We study the dynamics of the cooling of a gas of caesium atoms in an optical lattice, both experimentally and with 1D full-quantum Monte Carlo simulations. We find that, contrary to the standard interpretation of the Sisyphus model, the cooling process does not work by a continuous decrease of the average kinetic energy of the atoms in the lattice. Instead, we show that the momentum of the atoms follows a bimodal distribution, the atoms being gradually transferred from a hot to a cold mode. We suggest that the cooling mechanism should be depicted in terms of a rate model, describing the transfer between the two modes along with the processes occurring within each mode.
Submitted 5 September, 2005; v1 submitted 14 June, 2005;
originally announced June 2005.
-
Atom lithography with two-dimensional optical masks
Authors:
S. J. H. Petra,
K. A. H. van Leeuwen,
L. Feenstra,
W. Hogervorst,
W. Vassen
Abstract:
With a two-dimensional (2D) optical mask, nanoscale patterns are created for the first time in an atom lithography process using metastable helium atoms. The internal energy of the atoms is used to locally damage a hydrophobic resist layer, which is removed in a wet etching process. Experiments have been performed with several polarizations for the optical mask, resulting in different intensity patterns and corresponding nanoscale structures. The results for a linearly polarized light field show an array of holes with a diameter of 260 nm, in agreement with a computed pattern. With a circularly polarized light field, a line pattern is observed with a spacing of 766 nm. Simulations taking into account many possible experimental imperfections cannot explain this pattern.
Submitted 27 February, 2004; v1 submitted 26 February, 2004;
originally announced February 2004.
-
Nanolithography with metastable helium atoms in a high-power standing-wave light field
Authors:
S. J. H. Petra,
L. Feenstra,
W. Hogervorst,
W. Vassen
Abstract:
We have created periodic nanoscale structures in a gold substrate with a lithography process using metastable triplet helium atoms that damage a hydrophobic resist layer on top of the substrate. A beam of metastable helium atoms is transversely cooled and guided through an intense standing-wave light field. Compared to commonly used low-power optical masks, a high-power light field (saturation parameter of 10E7) increases the confinement of the atoms in the standing wave considerably, and makes the alignment of the experimental setup less critical. Due to the high internal energy of the metastable helium atoms (20 eV), a dose of only one atom per resist molecule is required. With an exposure time of only eight minutes, parallel lines with a separation of 542 nm and a width of 100 nm (1/11th of the wavelength used for the optical mask) are created.
Submitted 19 August, 2003;
originally announced August 2003.
-
Numerical simulations on the motion of atoms travelling through a standing-wave light field
Authors:
S. J. H. Petra,
K. A. H. van Leeuwen,
L. Feenstra,
W. Hogervorst,
W. Vassen
Abstract:
The motion of metastable helium atoms travelling through a standing light wave is investigated with a semi-classical numerical model. The results of a calculation including the velocity dependence of the dipole force are compared with those of the commonly used approach, which assumes a conservative dipole force. The comparison is made for two atom guiding regimes that can be used for the production of nanostructure arrays: a low-power regime, where the atoms are focused in a standing wave by the dipole force, and a higher-power regime, in which the atoms channel along the potential minima of the light field. In the low-power regime the differences between the two models are negligible, and both models show that, for lithography purposes, pattern widths of 150 nm can be achieved. In the high-power channelling regime the conservative force model, predicting 100 nm features, is shown to break down. The model that incorporates velocity dependence, resulting in a structure size of 40 nm, remains valid, as demonstrated by a comparison with quantum Monte Carlo wavefunction calculations.
Submitted 17 June, 2003;
originally announced June 2003.