-
Vision-Language Models as a Source of Rewards
Authors:
Kate Baumli,
Satinder Baveja,
Feryal Behbahani,
Harris Chan,
Gheorghe Comanici,
Sebastian Flennerhag,
Maxime Gazeau,
Kristian Holsheimer,
Dan Horgan,
Michael Laskin,
Clare Lyle,
Hussain Masoom,
Kay McKinney,
Volodymyr Mnih,
Alexander Neitz,
Fabio Pardo,
Jack Parker-Holder,
John Quan,
Tim Rocktäschel,
Himanshu Sahni,
Tom Schaul,
Yannick Schroecker,
Stephen Spencer,
Richie Steigerwald,
Luyu Wang
, et al. (1 additional authors not shown)
Abstract:
Building generalist agents that can accomplish many goals in rich open-ended environments is one of the research frontiers for reinforcement learning. A key limiting factor for building generalist agents with RL has been the need for a large number of reward functions for achieving different goals. We investigate the feasibility of using off-the-shelf vision-language models, or VLMs, as sources of…
▽ More
Building generalist agents that can accomplish many goals in rich open-ended environments is one of the research frontiers for reinforcement learning. A key limiting factor for building generalist agents with RL has been the need for a large number of reward functions for achieving different goals. We investigate the feasibility of using off-the-shelf vision-language models, or VLMs, as sources of rewards for reinforcement learning agents. We show how rewards for visual achievement of a variety of language goals can be derived from the CLIP family of models, and used to train RL agents that can achieve a variety of language goals. We showcase this approach in two distinct visual domains and present a scaling trend showing how larger VLMs lead to more accurate rewards for visual goal achievement, which in turn produces more capable RL agents.
△ Less
Submitted 21 February, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
In-context Reinforcement Learning with Algorithm Distillation
Authors:
Michael Laskin,
Luyu Wang,
Junhyuk Oh,
Emilio Parisotto,
Stephen Spencer,
Richie Steigerwald,
DJ Strouse,
Steven Hansen,
Angelos Filos,
Ethan Brooks,
Maxime Gazeau,
Himanshu Sahni,
Satinder Singh,
Volodymyr Mnih
Abstract:
We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transf…
▽ More
We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transformer is trained by autoregressively predicting actions given their preceding learning histories as context. Unlike sequential policy prediction architectures that distill post-learning or expert sequences, AD is able to improve its policy entirely in-context without updating its network parameters. We demonstrate that AD can reinforcement learn in-context in a variety of environments with sparse rewards, combinatorial task structure, and pixel-based observations, and find that AD learns a more data-efficient RL algorithm than the one that generated the source data.
△ Less
Submitted 25 October, 2022;
originally announced October 2022.
-
A*Net: A Scalable Path-based Reasoning Approach for Knowledge Graphs
Authors:
Zhaocheng Zhu,
Xinyu Yuan,
Mikhail Galkin,
Sophie Xhonneux,
Ming Zhang,
Maxime Gazeau,
Jian Tang
Abstract:
Reasoning on large-scale knowledge graphs has been long dominated by embedding methods. While path-based methods possess the inductive capacity that embeddings lack, their scalability is limited by the exponential number of paths. Here we present A*Net, a scalable path-based method for knowledge graph reasoning. Inspired by the A* algorithm for shortest path problems, our A*Net learns a priority f…
▽ More
Reasoning on large-scale knowledge graphs has been long dominated by embedding methods. While path-based methods possess the inductive capacity that embeddings lack, their scalability is limited by the exponential number of paths. Here we present A*Net, a scalable path-based method for knowledge graph reasoning. Inspired by the A* algorithm for shortest path problems, our A*Net learns a priority function to select important nodes and edges at each iteration, to reduce time and memory footprint for both training and inference. The ratio of selected nodes and edges can be specified to trade off between performance and efficiency. Experiments on both transductive and inductive knowledge graph reasoning benchmarks show that A*Net achieves competitive performance with existing state-of-the-art path-based methods, while merely visiting 10% nodes and 10% edges at each iteration. On a million-scale dataset ogbl-wikikg2, A*Net not only achieves a new state-of-the-art result, but also converges faster than embedding methods. A*Net is the first path-based method for knowledge graph reasoning at such scale.
△ Less
Submitted 8 November, 2023; v1 submitted 6 June, 2022;
originally announced June 2022.
-
Higher Order Generalization Error for First Order Discretization of Langevin Diffusion
Authors:
Mufan Bill Li,
Maxime Gazeau
Abstract:
We propose a novel approach to analyze generalization error for discretizations of Langevin diffusion, such as the stochastic gradient Langevin dynamics (SGLD). For an $ε$ tolerance of expected generalization error, it is known that a first order discretization can reach this target if we run $Ω(ε^{-1} \log (ε^{-1}) )$ iterations with $Ω(ε^{-1})$ samples. In this article, we show that with additio…
▽ More
We propose a novel approach to analyze generalization error for discretizations of Langevin diffusion, such as the stochastic gradient Langevin dynamics (SGLD). For an $ε$ tolerance of expected generalization error, it is known that a first order discretization can reach this target if we run $Ω(ε^{-1} \log (ε^{-1}) )$ iterations with $Ω(ε^{-1})$ samples. In this article, we show that with additional smoothness assumptions, even first order methods can achieve arbitrarily runtime complexity. More precisely, for each $N>0$, we provide a sufficient smoothness condition on the loss function such that a first order discretization can reach $ε$ expected generalization error given $Ω( ε^{-1/N} \log (ε^{-1}) )$ iterations with $Ω(ε^{-1})$ samples.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
An Empirical Study of Large-Batch Stochastic Gradient Descent with Structured Covariance Noise
Authors:
Yeming Wen,
Kevin Luk,
Maxime Gazeau,
Guodong Zhang,
Harris Chan,
Jimmy Ba
Abstract:
The choice of batch-size in a stochastic optimization algorithm plays a substantial role for both optimization and generalization. Increasing the batch-size used typically improves optimization but degrades generalization. To address the problem of improving generalization while maintaining optimal convergence in large-batch training, we propose to add covariance noise to the gradients. We demonst…
▽ More
The choice of batch-size in a stochastic optimization algorithm plays a substantial role for both optimization and generalization. Increasing the batch-size used typically improves optimization but degrades generalization. To address the problem of improving generalization while maintaining optimal convergence in large-batch training, we propose to add covariance noise to the gradients. We demonstrate that the learning performance of our method is more accurately captured by the structure of the covariance matrix of the noise rather than by the variance of gradients. Moreover, over the convex-quadratic, we prove in theory that it can be characterized by the Frobenius norm of the noise matrix. Our empirical studies with standard deep learning model-architectures and datasets shows that our method not only improves generalization performance in large-batch training, but furthermore, does so in a way where the optimization performance remains desirable and the training duration is not elongated.
△ Less
Submitted 28 February, 2020; v1 submitted 21 February, 2019;
originally announced February 2019.
-
A general system of differential equations to model first order adaptive algorithms
Authors:
André Belotto da Silva,
Maxime Gazeau
Abstract:
First order optimization algorithms play a major role in large scale machine learning. A new class of methods, called adaptive algorithms, were recently introduced to adjust iteratively the learning rate for each coordinate. Despite great practical success in deep learning, their behavior and performance on more general loss functions are not well understood. In this paper, we derive a non-autonom…
▽ More
First order optimization algorithms play a major role in large scale machine learning. A new class of methods, called adaptive algorithms, were recently introduced to adjust iteratively the learning rate for each coordinate. Despite great practical success in deep learning, their behavior and performance on more general loss functions are not well understood. In this paper, we derive a non-autonomous system of differential equations, which is the continuous time limit of adaptive optimization methods. We prove global well-posedness of the system and we investigate the numerical time convergence of its forward Euler approximation. We study, furthermore, the convergence of its trajectories and give conditions under which the differential system, underlying all adaptive algorithms, is suitable for optimization. We discuss convergence to a critical point in the non-convex case and give conditions for the dynamics to avoid saddle points and local maxima. For convex and deterministic loss function, we introduce a suitable Lyapunov functional which allow us to study its rate of convergence. Several other properties of both the continuous and discrete systems are briefly discussed. The differential system studied in the paper is general enough to encompass many other classical algorithms (such as Heavy ball and Nesterov's accelerated method) and allow us to recover several known results for these algorithms.
△ Less
Submitted 30 September, 2019; v1 submitted 31 October, 2018;
originally announced October 2018.
-
Scalable Recommender Systems through Recursive Evidence Chains
Authors:
Elias Tragas,
Calvin Luo,
Maxime Gazeau,
Kevin Luk,
David Duvenaud
Abstract:
Recommender systems can be formulated as a matrix completion problem, predicting ratings from user and item parameter vectors. Optimizing these parameters by subsampling data becomes difficult as the number of users and items grows. We develop a novel approach to generate all latent variables on demand from the ratings matrix itself and a fixed pool of parameters. We estimate missing ratings using…
▽ More
Recommender systems can be formulated as a matrix completion problem, predicting ratings from user and item parameter vectors. Optimizing these parameters by subsampling data becomes difficult as the number of users and items grows. We develop a novel approach to generate all latent variables on demand from the ratings matrix itself and a fixed pool of parameters. We estimate missing ratings using chains of evidence that link them to a small set of prototypical users and items. Our model automatically addresses the cold-start and online learning problems by combining information across both users and items. We investigate the scaling behavior of this model, and demonstrate competitive results with respect to current matrix factorization techniques in terms of accuracy and convergence speed.
△ Less
Submitted 5 July, 2018;
originally announced July 2018.
-
Implicit Manifold Learning on Generative Adversarial Networks
Authors:
Kry Yik Chau Lui,
Yanshuai Cao,
Maxime Gazeau,
Kelvin Shuangjian Zhang
Abstract:
This paper raises an implicit manifold learning perspective in Generative Adversarial Networks (GANs), by studying how the support of the learned distribution, modelled as a submanifold $\mathcal{M}_θ$, perfectly match with $\mathcal{M}_{r}$, the support of the real data distribution. We show that optimizing Jensen-Shannon divergence forces $\mathcal{M}_θ$ to perfectly match with…
▽ More
This paper raises an implicit manifold learning perspective in Generative Adversarial Networks (GANs), by studying how the support of the learned distribution, modelled as a submanifold $\mathcal{M}_θ$, perfectly match with $\mathcal{M}_{r}$, the support of the real data distribution. We show that optimizing Jensen-Shannon divergence forces $\mathcal{M}_θ$ to perfectly match with $\mathcal{M}_{r}$, while optimizing Wasserstein distance does not. On the other hand, by comparing the gradients of the Jensen-Shannon divergence and the Wasserstein distances ($W_1$ and $W_2^2$) in their primal forms, we conjecture that Wasserstein $W_2^2$ may enjoy desirable properties such as reduced mode collapse. It is therefore interesting to design new distances that inherit the best from both distances.
△ Less
Submitted 30 October, 2017;
originally announced October 2017.
-
VUV-absorption cross section of carbon dioxide from 150 to 800 K and applications to warm exoplanetary atmospheres
Authors:
Olivia Venot,
Yves Bénilan,
Nicolas Fray,
Marie-Claire Gazeau,
Franck Lefèvre,
Et-touhami Es-sebbar,
Eric Hébrard,
Martin Schwell,
Chiheb Bahrini,
Franck Montmessin,
Maxence Lefèvre,
Ingo P. Waldmann
Abstract:
Most exoplanets detected so far have atmospheric T significantly higher than 300K. Often close to their star, they receive an intense UV photons flux that triggers important photodissociation processes. The T dependency of VUV absorption cross sections are poorly known, leading to an undefined uncertainty in atmospheric models. Similarly, data measured at low T similar to that of the high atmosphe…
▽ More
Most exoplanets detected so far have atmospheric T significantly higher than 300K. Often close to their star, they receive an intense UV photons flux that triggers important photodissociation processes. The T dependency of VUV absorption cross sections are poorly known, leading to an undefined uncertainty in atmospheric models. Similarly, data measured at low T similar to that of the high atmosphere of Mars, Venus, and Titan are often lacking. Our aim is to quantify the T dependency of the abs. cross section of important molecules in planetary atmospheres. We want to provide both high-resolution data at T prevailing in these media and a simple parameterization of the absorption in order to simplify its use in photochemical models. This study focuses on carbon dioxide. We performed experimental measurements of CO$_2$ absorption cross section with synchrotron radiation for the wavelength range (115--200nm). For longer wavelengths (195--230nm), we used a deuterium lamp and a 1.5m Jobin-Yvon spectrometer. We used these data in our 1D thermo-photochemical model in order to study their impact on the predicted atmospheric compositions. The cross section of CO$_2$ increases with T. It can be separated in two parts: a continuum and a fine structure superimposed on the continuum. The variation of the continuum of absorption can be represented by the sum of three gaussian functions. Using data at high T in thermo-photochemical models modifies significantly the abundance and the photodissociation rates of many species, in addition to CO$_2$, such as methane and ammonia. These deviations have an impact on synthetic transmission spectra, leading to variations of up to 5 ppm. We present a full set of HR ($Δλ$=0.03nm) absorption cross sections of CO$_2$ from 115 to 230nm for T ranging from 150 to 800K.
△ Less
Submitted 25 September, 2017;
originally announced September 2017.
-
Bloch theory and spectral gaps for linearized water waves
Authors:
Walter Craig,
Maxime Gazeau,
Christophe Lacave,
Catherine Sulem
Abstract:
The system of equations for water waves, when linearized about equilibrium of a fluid body with a varying bottom boundary, is described by a spectral problem for the Dirichlet -- Neumann operator of the unperturbed free surface. This spectral problem is fundamental in questions of stability, as well as to the perturbation theory of evolution of the free surface in such settings. In addition, the D…
▽ More
The system of equations for water waves, when linearized about equilibrium of a fluid body with a varying bottom boundary, is described by a spectral problem for the Dirichlet -- Neumann operator of the unperturbed free surface. This spectral problem is fundamental in questions of stability, as well as to the perturbation theory of evolution of the free surface in such settings. In addition, the Dirichlet -- Neumann operator is self-adjoint when given an appropriate definition and domain, and it is a novel but very natural spectral problem for a nonlocal operator. In the case in which the bottom boundary varies periodically, $\{y = -h + b(x)\}$ where $b(x+γ) = b(x)$, $γ\in Γ$ a lattice, this spectral problem admits a Bloch decomposition in terms of spectral band functions and their associated band-parametrized eigenfunctions. In this article we describe this analytic construction in the case of a spatially periodic bottom variation from constant depth in two space dimensional water waves problem, giving a construction of the Bloch eigenfunctions and eigenvalues as a function of the band parameters and a description of the Dirichlet -- Neumann operator in terms of the bathymetry $b(x)$. One of the consequences of this description is that the spectrum consists of a series of bands separated by spectral gaps which are zones of forbidden energies. For a given generic periodic bottom profile $b(x)=\varepsilon β(x)$, every gap opens for a sufficiently small value of the perturbation parameter $\varepsilon$.
△ Less
Submitted 27 February, 2018; v1 submitted 22 June, 2017;
originally announced June 2017.
-
Characterization of aromaticity in analogues of titan's atmospheric aerosols with two-step laser desorption ionization mass spectrometry
Authors:
Ahmed Mahjoub,
Martin Schwell,
Nathalie Carrasco,
Yves Benilan,
Guy Cernogora,
Cyril Szopa,
Marie-Claire Gazeau
Abstract:
The role of polycyclic aromatic hydrocarbons (PAH) and Nitrogen containing PAH (PANH) as intermediates of aerosol production in the atmosphere of Titan has been a subject of controversy for a long time. An analysis of the atmospheric emission band observed by the Visible and Infrared Map** Spectrometer (VIMS) at 3.28 micrometer suggests the presence of neutral polycyclic aromatic species in the…
▽ More
The role of polycyclic aromatic hydrocarbons (PAH) and Nitrogen containing PAH (PANH) as intermediates of aerosol production in the atmosphere of Titan has been a subject of controversy for a long time. An analysis of the atmospheric emission band observed by the Visible and Infrared Map** Spectrometer (VIMS) at 3.28 micrometer suggests the presence of neutral polycyclic aromatic species in the upper atmosphere of Titan. These molecules are seen as the counter part of negative and positive aromatics ions suspected by the Plasma Spectrometer onboard the Cassini spacecraft, but the low resolution of the instrument hinders any molecular speciation.
In this work we investigate the specific aromatic content of Titan's atmospheric aerosols through laboratory simulations. We report here the selective detection of aromatic compounds in tholins, Titan's aerosol analogues, produced with a capacitively coupled plasma in a N2:CH4 95:5 gas mixture. For this purpose, Two-Step Laser Desorption Ionization Time-of-Flight Mass Spectrometry (L2DI-TOF-MS) technique is used to analyze the so produced analogues. This analytical technique is based on the ionization of molecules by Resonance Enhanced Multi-Photon Ionization (REMPI) using a λ=248 nm wavelength laser which is selective for aromatic species. This allows for the selective identification of compounds having at least one aromatic ring. Our experiments show that tholins contain a trace amount of small PAHs with one to three aromatic rings. Nitrogen containing PAHs (PANHs) are also detected as constituents of tholins. Molecules relevant to astrobiology are detected as is the case of the substituted DNA base adenine.
△ Less
Submitted 19 May, 2016;
originally announced May 2016.
-
VUV-absorption cross section of CO2 at high temperatures and impact on exoplanet atmospheres
Authors:
Olivia Venot,
Nicolas Fray,
Yves Bénilan,
Marie-Claire Gazeau,
Eric Hébrard,
Gwenaelle Larcher,
Martin Schwell,
Michel Dobrijevic,
Franck Selsis
Abstract:
Ultraviolet (UV) absorption cross sections are an essential ingredient of photochemical atmosphere models. Exoplanet searches have unveiled a large population of short-period objects with hot atmospheres, very different from what we find in our solar system. Transiting exoplanets whose atmospheres can now be studied by transit spectroscopy receive extremely strong UV fluxes and have typical temper…
▽ More
Ultraviolet (UV) absorption cross sections are an essential ingredient of photochemical atmosphere models. Exoplanet searches have unveiled a large population of short-period objects with hot atmospheres, very different from what we find in our solar system. Transiting exoplanets whose atmospheres can now be studied by transit spectroscopy receive extremely strong UV fluxes and have typical temperatures ranging from 400 to 2500 K. At these temperatures, UV photolysis cross section data are severely lacking. Our goal is to provide high-temperature absorption cross sections and their temperature dependency for important atmospheric compounds. This study is dedicated to CO2, which is observed and photodissociated in exoplanet atmospheres. We performed these measurements for the 115 - 200 nm range at 300, 410, 480, and 550 K. In the 195 - 230 nm range, we worked at seven temperatures between 465 and 800 K. We found that the absorption cross section of CO2 is very sensitive to temperature, especially above 160 nm. Within the studied range of temperature, the CO2 cross section can vary by more than two orders of magnitude. This, in particular, makes the absorption of CO2 significant up to wavelengths as high as 230 nm, while it is negligible above 200 nm at 300 K. To investigate the influence of these new data on the photochemistry of exoplanets, we implemented the measured cross section into a 1D photochemical model. The model predicts that accounting for this temperature dependency of CO2 cross section can affect the computed abundances of NH3, CO2, and CO by one order of magnitude in the atmospheres of hot Jupiter and hot Neptune.
△ Less
Submitted 26 February, 2015;
originally announced February 2015.
-
Analysis and simulation of rare events for SPDE
Authors:
Charles-Edouard Bréhier,
Maxime Gazeau,
Ludovic Goudenège,
Mathias Rousset
Abstract:
In this work, we consider the numerical estimation of the probability for a stochastic process to hit a set B before reaching another set A. This event is assumed to be rare. We consider reactive trajectories of the stochastic Allen-Cahn partial differential evolution equation (with double well potential) in dimension 1. Reactive trajectories are defined as the probability distribution of the traj…
▽ More
In this work, we consider the numerical estimation of the probability for a stochastic process to hit a set B before reaching another set A. This event is assumed to be rare. We consider reactive trajectories of the stochastic Allen-Cahn partial differential evolution equation (with double well potential) in dimension 1. Reactive trajectories are defined as the probability distribution of the trajectories of a stochastic process, conditioned by the event of hitting B before A. We investigate the use of the so-called Adaptive Multilevel Splitting algorithm in order to estimate the rare event and simulate reactive trajectories. This algorithm uses a \emph{reaction coordinate} (a real valued function of state space defining level sets), and is based on (i) the selection, among several replicas of the system having hit A before B, of those with maximal reaction coordinate; (ii) iteration of the latter step. We choose for the reaction coordinate the average magnetization, and for B the minimum of the well opposite to the initial condition. We discuss the context, prove that the algorithm has a sense in the usual functional setting, and numerically test the method (estimation of rare event, and transition state sampling).
△ Less
Submitted 7 January, 2014;
originally announced January 2014.
-
Strong order of convergence of a semidiscrete scheme for the stochastic Manakov equation
Authors:
Maxime Gazeau
Abstract:
It is well accepted by physicists that the Manakov PMD equation is a good model to describe the evolution of nonlinear electric fields in optical fibers with randomly varying birefringence. In the regime of the diffusion approximation theory, an effective asymptotic dynamics has recently been obtained to describe this evolution. This equation is called the stochastic Manakov equation. In this arti…
▽ More
It is well accepted by physicists that the Manakov PMD equation is a good model to describe the evolution of nonlinear electric fields in optical fibers with randomly varying birefringence. In the regime of the diffusion approximation theory, an effective asymptotic dynamics has recently been obtained to describe this evolution. This equation is called the stochastic Manakov equation. In this article, we propose a semidiscrete version of a Crank Nicolson scheme for this limit equation and we analyze the strong error. Allowing sufficient regularity of the initial data, we prove that the numerical scheme has strong order 1/2.
△ Less
Submitted 8 August, 2013; v1 submitted 7 August, 2013;
originally announced August 2013.
-
High-temperature measurements of VUV-absorption cross sections of CO2 and their application to exoplanets
Authors:
Olivia Venot,
Nicolas Fray,
Yves Bénilan,
Marie-Claire Gazeau,
Eric Hébrard,
Gwenaelle Larcher,
Martin Schwell,
Michel Dobrijevic,
Franck Selsis
Abstract:
UV absorption cross sections are an essential ingredient of photochemical atmosphere models. Exoplanet searches have unveiled a large population of short-period objects with hot atmospheres, very different from what we find in our solar system. Transiting exoplanets whose atmospheres can now be studied by transit spectroscopy receive extremely strong UV fluxes and have typical temperatures ranging…
▽ More
UV absorption cross sections are an essential ingredient of photochemical atmosphere models. Exoplanet searches have unveiled a large population of short-period objects with hot atmospheres, very different from what we find in our solar system. Transiting exoplanets whose atmospheres can now be studied by transit spectroscopy receive extremely strong UV fluxes and have typical temperatures ranging from 400 to 2500 K. At these temperatures, UV photolysis cross section data are severely lacking. Aims. Our goal is to provide high-temperature absorption cross sections and their temperature dependency for important atmospheric compounds. This study is dedicated to CO2, which is observed and photodissociated in exoplanet atmospheres. We also investigate the influence of these new data on the photochemistry of some exoplanets. We performed these measurements for the 115 - 200 nm range at 300, 410, 480, and 550 K. In the 195 - 230 nm range, we worked at seven temperatures between 465 and 800 K. We implemented the measured cross section into a 1D photochemical model. For wavelengths > 170 nm, the wavelength dependence of ln(cross-section_CO2(wavelength, T)x1/Qv(T)) can be parametrized with a linear law. Thus, we can interpolate cross-section_CO2(wavelength, T) at any temperature between 300 and 800 K. Within the studied range of temperature, the CO2 cross section can vary by more than two orders of magnitude. This, in particular, makes the absorption of CO2 significant up to wavelengths as high as 230 nm. The absorption cross section of CO2 is very sensitive to temperature. The model predicts that accounting for this temperature dependency of CO2 cross section can affect the computed abundances of NH3, CO2, and CO by one order of magnitude in the atmospheres of hot Jupiter and hot Neptune. This effect will be more important in hot CO2-dominated atmospheres.
△ Less
Submitted 12 February, 2013; v1 submitted 11 February, 2013;
originally announced February 2013.
-
A diffusion approximation theorem for a nonlinear PDE with application to random birefringent optical fibers
Authors:
A. de Bouard,
M. Gazeau
Abstract:
In this article we propose a generalization of the theory of diffusion approximation for random ODE to a nonlinear system of random Schrödinger equations. This system arises in the study of pulse propagation in randomly birefringent optical fibers. We first show existence and uniqueness of solutions for the random PDE and the limiting equation. We follow the work of Garnier and Marty [Wave Motion…
▽ More
In this article we propose a generalization of the theory of diffusion approximation for random ODE to a nonlinear system of random Schrödinger equations. This system arises in the study of pulse propagation in randomly birefringent optical fibers. We first show existence and uniqueness of solutions for the random PDE and the limiting equation. We follow the work of Garnier and Marty [Wave Motion 43 (2006) 544-560], Marty [Problèmes d'évolution en milieux aléatoires: Théorèmes limites, schémas numériques et applications en optique (2005) Univ. Paul Sabatier], where a linear electric field is considered, and we get an asymptotic dynamic for the nonlinear electric field.
△ Less
Submitted 13 December, 2012; v1 submitted 20 May, 2011;
originally announced May 2011.