Skip to main content

Showing 1–11 of 11 results for author: Kovachki, N B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06486  [pdf, other

    cs.LG math.NA

    Continuum Attention for Neural Operators

    Authors: Edoardo Calvello, Nikola B. Kovachki, Matthew E. Levine, Andrew M. Stuart

    Abstract: Transformers, and the attention mechanism in particular, have become ubiquitous in machine learning. Their success in modeling nonlocal, long-range correlations has led to their widespread adoption in natural language processing, computer vision, and time-series problems. Neural operators, which map spaces of functions into spaces of functions, are necessarily both nonlinear and nonlocal if they a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2405.15992  [pdf, ps, other

    cs.LG math.NA

    Data Complexity Estimates for Operator Learning

    Authors: Nikola B. Kovachki, Samuel Lanthaler, Hrushikesh Mhaskar

    Abstract: Operator learning has emerged as a new paradigm for the data-driven approximation of nonlinear operators. Despite its empirical success, the theoretical underpinnings governing the conditions for efficient operator learning remain incomplete. The present work develops theory to study the data complexity of operator learning, complementing existing research on the parametric complexity. We investig… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2402.15715  [pdf, other

    cs.LG math.NA

    Operator Learning: Algorithms and Analysis

    Authors: Nikola B. Kovachki, Samuel Lanthaler, Andrew M. Stuart

    Abstract: Operator learning refers to the application of ideas from machine learning to approximate (typically nonlinear) operators map** between Banach spaces of functions. Such operators often arise from physical models expressed in terms of partial differential equations (PDEs). In this context, such approximate operators hold great potential as efficient surrogate models to complement traditional nume… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  4. arXiv:2309.00583  [pdf, other

    cs.LG math.NA

    Geometry-Informed Neural Operator for Large-Scale 3D PDEs

    Authors: Zongyi Li, Nikola Borislavov Kovachki, Chris Choy, Boyi Li, Jean Kossaifi, Shourya Prakash Otta, Mohammad Amin Nabian, Maximilian Stadler, Christian Hundt, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We propose the geometry-informed neural operator (GINO), a highly efficient approach to learning the solution operator of large-scale partial differential equations with varying geometries. GINO uses a signed distance function and point-cloud representations of the input shape and neural operators based on graph and Fourier architectures to learn the solution operator. The graph neural operator ha… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  5. arXiv:2302.07400  [pdf, other

    cs.LG math.FA stat.ML

    Score-based Diffusion Models in Function Space

    Authors: Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar

    Abstract: Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many… ▽ More

    Submitted 22 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages

    MSC Class: 46B09 (Primary); 60J22 (Secondary) ACM Class: I.2.6; J.2

  6. arXiv:2108.12515  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Convergence Rates for Learning Linear Operators from Noisy Data

    Authors: Maarten V. de Hoop, Nikola B. Kovachki, Nicholas H. Nelsen, Andrew M. Stuart

    Abstract: This paper studies the learning of linear operators between infinite-dimensional Hilbert spaces. The training data comprises pairs of random input vectors in a Hilbert space and their noisy images under an unknown self-adjoint linear operator. Assuming that the operator is diagonalizable in a known basis, this work solves the equivalent inverse problem of estimating the operator's eigenvalues give… ▽ More

    Submitted 2 November, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: To appear in SIAM/ASA Journal on Uncertainty Quantification (JUQ); 34 pages, 5 figures, 2 tables

    MSC Class: 62G20; 62C10; 68T05; 47A62

    Journal ref: SIAM/ASA J. Uncertainty Quantification Vol. 11 No. 2 (2023) pp. 480-513

  7. arXiv:2006.06755  [pdf, other

    stat.ML cs.LG stat.CO

    Conditional Sampling with Monotone GANs: from Generative Models to Likelihood-Free Inference

    Authors: Ricardo Baptista, Bamdad Hosseini, Nikola B. Kovachki, Youssef Marzouk

    Abstract: We present a novel framework for conditional sampling of probability measures, using block triangular transport maps. We develop the theoretical foundations of block triangular transport in a Banach space setting, establishing general conditions under which conditional sampling can be achieved and drawing connections between monotone block triangular maps and optimal transport. Based on this theor… ▽ More

    Submitted 5 June, 2023; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Major expansion of earlier version, with new theoretical results. 33 pages, 8 figures, 1 table

  8. arXiv:2005.03180  [pdf, other

    math.NA cs.LG stat.ML

    Model Reduction and Neural Networks for Parametric PDEs

    Authors: Kaushik Bhattacharya, Bamdad Hosseini, Nikola B. Kovachki, Andrew M. Stuart

    Abstract: We develop a general framework for data-driven approximation of input-output maps between infinite-dimensional spaces. The proposed approach is motivated by the recent successes of neural networks and deep learning, in combination with ideas from model reduction. This combination results in a neural network approximation which, in principle, is defined on infinite-dimensional spaces and, in practi… ▽ More

    Submitted 17 June, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 39 pages, 13 figures

    MSC Class: 65N75; 62M45; 68T05; 60H30; 60H15

  9. arXiv:1909.02041  [pdf, other

    physics.chem-ph cs.LG

    Regression-clustering for Improved Accuracy and Training Cost with Molecular-Orbital-Based Machine Learning

    Authors: Lixue Cheng, Nikola B. Kovachki, Matthew Welborn, Thomas F. Miller III

    Abstract: Machine learning (ML) in the representation of molecular-orbital-based (MOB) features has been shown to be an accurate and transferable approach to the prediction of post-Hartree-Fock correlation energies. Previous applications of MOB-ML employed Gaussian Process Regression (GPR), which provides good prediction accuracy with small training sets; however, the cost of GPR training scales cubically w… ▽ More

    Submitted 23 October, 2019; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: 31 pages, 10 figures, with an SI

  10. arXiv:1906.04285  [pdf, other

    cs.LG math.NA stat.ML

    Continuous Time Analysis of Momentum Methods

    Authors: Nikola B. Kovachki, Andrew M. Stuart

    Abstract: Gradient descent-based optimization methods underpin the parameter training of neural networks, and hence comprise a significant component in the impressive test results found in a number of applications. Introducing stochasticity is key to their success in practical problems, and there is some understanding of the role of stochastic gradient descent in this context. Momentum modifications of grad… ▽ More

    Submitted 28 May, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: 40 pages, 7 figures

    Journal ref: Journal of Machine Learning Research 21 (2020) 1-40

  11. arXiv:1808.03620  [pdf, other

    cs.LG math.OC stat.ML

    Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks

    Authors: Nikola B. Kovachki, Andrew M. Stuart

    Abstract: The standard probabilistic perspective on machine learning gives rise to empirical risk-minimization tasks that are frequently solved by stochastic gradient descent (SGD) and variants thereof. We present a formulation of these tasks as classical inverse or filtering problems and, furthermore, we propose an efficient, gradient-free algorithm for finding a solution to these problems using ensemble K… ▽ More

    Submitted 10 August, 2018; originally announced August 2018.

    Comments: 41 pages, 14 figures

    MSC Class: 68T20; 65L09; 65K10; 49M15 ACM Class: I.2.6