Skip to main content

Showing 1–9 of 9 results for author: Gulian, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2303.11379  [pdf, other

    stat.ML cs.LG math.OC

    Solving High-Dimensional Inverse Problems with Auxiliary Uncertainty via Operator Learning with Limited Data

    Authors: Joseph Hart, Mamikon Gulian, Indu Manickam, Laura Swiler

    Abstract: In complex large-scale systems such as climate, important effects are caused by a combination of confounding processes that are not fully observable. The identification of sources from observations of system state is vital for attribution and prediction, which inform critical policy decisions. The difficulty of these types of inverse problems lies in the inability to isolate sources and the cost o… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 29 pages, 10 figures

  2. arXiv:2204.10909  [pdf, other

    cs.LG stat.ML

    Error-in-variables modelling for operator learning

    Authors: Ravi G. Patel, Indu Manickam, Myoungkyu Lee, Mamikon Gulian

    Abstract: Deep operator learning has emerged as a promising tool for reduced-order modelling and PDE model discovery. Leveraging the expressive power of deep neural networks, especially in high dimensions, such methods learn the map** between functional state variables. While proposed methods have assumed noise only in the dependent variables, experimental and numerical data for operator learning typicall… ▽ More

    Submitted 19 July, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 23 pages, 10 figures

  3. arXiv:2110.11531  [pdf, other

    math.AP physics.flu-dyn stat.AP

    Fractional Modeling in Action: A Survey of Nonlocal Models for Subsurface Transport, Turbulent Flows, and Anomalous Materials

    Authors: Jorge Suzuki, Mamikon Gulian, Mohsen Zayernouri, Marta D'Elia

    Abstract: Modeling of phenomena such as anomalous transport via fractional-order differential equations has been established as an effective alternative to partial differential equations, due to the inherent ability to describe large-scale behavior with greater efficiency than fully-resolved classical models. In this review article, we first provide a broad overview of fractional-order derivatives with a cl… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 75 pages, 16 figures

    Report number: SAND2021-11291 R

  4. arXiv:2107.03066  [pdf, other

    cs.LG stat.ML

    Probabilistic partition of unity networks: clustering based deep approximation

    Authors: Nat Trask, Mamikon Gulian, Andy Huang, Kook** Lee

    Abstract: Partition of unity networks (POU-Nets) have been shown capable of realizing algebraic convergence rates for regression and solution of PDEs, but require empirical tuning of training parameters. We enrich POU-Nets with a Gaussian noise model to obtain a probabilistic generalization amenable to gradient-based minimization of a maximum likelihood loss. The resulting architecture provides spatial repr… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 12 pages, 6 figures

  5. arXiv:2101.11256  [pdf, other

    cs.LG math.NA stat.ML

    Partition of unity networks: deep hp-approximation

    Authors: Kook** Lee, Nathaniel A. Trask, Ravi G. Patel, Mamikon A. Gulian, Eric C. Cyr

    Abstract: Approximation theorists have established best-in-class optimal approximation rates of deep neural networks by utilizing their ability to simultaneously emulate partitions of unity and monomials. Motivated by this, we propose partition of unity networks (POUnets) which incorporate these elements directly into the architecture. Classification architectures of the type used to learn probability measu… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: 8 pages, 5 figures

  6. arXiv:2006.10123  [pdf, other

    cs.LG stat.ML

    A block coordinate descent optimizer for classification problems exploiting convexity

    Authors: Ravi G. Patel, Nathaniel A. Trask, Mamikon A. Gulian, Eric C. Cyr

    Abstract: Second-order optimizers hold intriguing potential for deep learning, but suffer from increased cost and sensitivity to the non-convexity of the loss surface as compared to gradient-based approaches. We introduce a coordinate descent method to train deep neural networks for classification tasks that exploits global convexity of the cross-entropy loss in the weights of the linear layer. Our hybrid N… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 10 pages, 4 figures

  7. A Survey of Constrained Gaussian Process Regression: Approaches and Implementation Challenges

    Authors: Laura Swiler, Mamikon Gulian, Ari Frankel, Cosmin Safta, John Jakeman

    Abstract: Gaussian process regression is a popular Bayesian framework for surrogate modeling of expensive data sources. As part of a broader effort in scientific machine learning, many recent works have incorporated physical constraints or other a priori information within Gaussian process regression to supplement limited data and regularize the behavior of the model. We provide an overview and survey of se… ▽ More

    Submitted 6 January, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 42 pages, 3 figures. Version 3: DOI & Reference added; appeared in Journal of Machine Learning for Modeling and Computing. Version 2 includes minor additions, clarifications and improvements to notation

    Journal ref: Journal of Machine Learning for Modeling and Computing, 1(2):119-156 (2020)

  8. arXiv:1912.04862  [pdf, other

    cs.LG math.NA stat.ML

    Robust Training and Initialization of Deep Neural Networks: An Adaptive Basis Viewpoint

    Authors: Eric C. Cyr, Mamikon A. Gulian, Ravi G. Patel, Mauro Perego, Nathaniel A. Trask

    Abstract: Motivated by the gap between theoretical optimal approximation rates of deep neural networks (DNNs) and the accuracy realized in practice, we seek to improve the training of DNNs. The adoption of an adaptive basis viewpoint of DNNs leads to novel initializations and a hybrid least squares/gradient descent optimizer. We provide analysis of these techniques and illustrate via numerical examples dram… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: 26 pages

  9. arXiv:1808.00931  [pdf, other

    cs.LG stat.ML

    Machine Learning of Space-Fractional Differential Equations

    Authors: Mamikon Gulian, Maziar Raissi, Paris Perdikaris, George Karniadakis

    Abstract: Data-driven discovery of "hidden physics" -- i.e., machine learning of differential equation models underlying observed data -- has recently been approached by embedding the discovery problem into a Gaussian Process regression of spatial data, treating and discovering unknown equation parameters as hyperparameters of a modified "physics informed" Gaussian Process kernel. This kernel includes the p… ▽ More

    Submitted 2 August, 2019; v1 submitted 2 August, 2018; originally announced August 2018.

    Comments: 26 pages, 10 figures. In v2, a minor change to the formatting of a handful of references was made in the bibliography; the main text was unchanged. In v3, minor improvements were made to the exposition; more details about motivation, examples, optimization, and relation to previous works were given

    MSC Class: 35R11; 65N21; 62M10; 62F15; 60G15; 60G52