Skip to main content

Showing 1–6 of 6 results for author: Orlova, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.01187  [pdf, other

    cs.LG math.DS

    Training neural operators to preserve invariant measures of chaotic attractors

    Authors: Ruoxi Jiang, Peter Y. Lu, Elena Orlova, Rebecca Willett

    Abstract: Chaotic systems make long-horizon forecasts difficult because small perturbations in initial conditions cause trajectories to diverge at an exponential rate. In this setting, neural operators trained to minimize squared error losses, while capable of accurate short-term forecasts, often fail to reproduce statistical or structural properties of the dynamics over longer time horizons and can yield d… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2023

  2. arXiv:2305.19685  [pdf, other

    cs.LG quant-ph stat.ML

    Deep Stochastic Mechanics

    Authors: Elena Orlova, Aleksei Ustimenko, Ruoxi Jiang, Peter Y. Lu, Rebecca Willett

    Abstract: This paper introduces a novel deep-learning-based approach for numerical simulation of a time-evolving Schrödinger equation inspired by stochastic mechanics and generative diffusion models. Unlike existing approaches, which exhibit computational complexity that scales exponentially in the problem dimension, our method allows us to adapt to the latent low-dimensional structure of the wave function… ▽ More

    Submitted 4 June, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

  3. arXiv:2211.15856  [pdf, other

    cs.LG physics.ao-ph

    Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting

    Authors: Elena Orlova, Haokun Liu, Raphael Rossellini, Benjamin A. Cash, Rebecca Willett

    Abstract: Producing high-quality forecasts of key climate variables, such as temperature and precipitation, on subseasonal time scales has long been a gap in operational forecasting. This study explores an application of machine learning (ML) models as post-processing tools for subseasonal forecasting. Lagged numerical ensemble forecasts (i.e., an ensemble where the members have different initialization dat… ▽ More

    Submitted 3 June, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

  4. arXiv:2012.07163  [pdf, other

    cs.LG cs.PF

    Comparing the costs of abstraction for DL frameworks

    Authors: Maksim Levental, Elena Orlova

    Abstract: High level abstractions for implementing, training, and testing Deep Learning (DL) models abound. Such frameworks function primarily by abstracting away the implementation details of arbitrary neural architectures, thereby enabling researchers and engineers to focus on design. In principle, such frameworks could be "zero-cost abstractions"; in practice, they incur translation and indirection overh… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

  5. arXiv:1901.10787  [pdf, other

    cs.CL cs.LG

    Tensorized Embedding Layers for Efficient Model Compression

    Authors: Oleksii Hrinchuk, Valentin Khrulkov, Leyla Mirvakhabova, Elena Orlova, Ivan Oseledets

    Abstract: The embedding layers transforming input words into real vectors are the key components of deep neural networks used in natural language processing. However, when the vocabulary is large, the corresponding weight matrices can be enormous, which precludes their deployment in a limited resource setting. We introduce a novel way of parametrizing embedding layers based on the Tensor Train (TT) decompos… ▽ More

    Submitted 19 February, 2020; v1 submitted 30 January, 2019; originally announced January 2019.

  6. arXiv:1812.01319  [pdf, other

    physics.data-an cs.LG

    Generative Models for Fast Calorimeter Simulation.LHCb case

    Authors: Viktoria Chekalina, Elena Orlova, Fedor Ratnikov, Dmitry Ulyanov, Andrey Ustyuzhanin, Egor Zakharov

    Abstract: Simulation is one of the key components in high energy physics. Historically it relies on the Monte Carlo methods which require a tremendous amount of computation resources. These methods may have difficulties with the expected High Luminosity Large Hadron Collider (HL LHC) need, so the experiment is in urgent need of new fast simulation techniques. We introduce a new Deep Learning framework based… ▽ More

    Submitted 6 April, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: Proceedings of the presentation at CHEP 2018 Conference