Skip to main content

Showing 1–8 of 8 results for author: Trask, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.18471  [pdf, other

    cs.LG cs.AI stat.ML

    Causal disentanglement of multimodal data

    Authors: Elise Walker, Jonas A. Actor, Carianne Martinez, Nathaniel Trask

    Abstract: Causal representation learning algorithms discover lower-dimensional representations of data that admit a decipherable interpretation of cause and effect; as achieving such interpretable representations is challenging, many causal learning algorithms utilize elements indicating prior information, such as (linear) structural causal models, interventional data, or weak supervision. Unfortunately, in… ▽ More

    Submitted 8 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    MSC Class: 68T07

  2. arXiv:2202.03242  [pdf, other

    cs.LG stat.ML

    Unsupervised physics-informed disentanglement of multimodal data for high-throughput scientific discovery

    Authors: Nathaniel Trask, Carianne Martinez, Kook** Lee, Brad Boyce

    Abstract: We introduce physics-informed multimodal autoencoders (PIMA) - a variational inference framework for discovering shared information in multimodal scientific datasets representative of high-throughput testing. Individual modalities are embedded into a shared latent space and fused through a product of experts formulation, enabling a Gaussian mixture prior to identify shared features. Sampling from… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  3. arXiv:2107.03066  [pdf, other

    cs.LG stat.ML

    Probabilistic partition of unity networks: clustering based deep approximation

    Authors: Nat Trask, Mamikon Gulian, Andy Huang, Kook** Lee

    Abstract: Partition of unity networks (POU-Nets) have been shown capable of realizing algebraic convergence rates for regression and solution of PDEs, but require empirical tuning of training parameters. We enrich POU-Nets with a Gaussian noise model to obtain a probabilistic generalization amenable to gradient-based minimization of a maximum likelihood loss. The resulting architecture provides spatial repr… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 12 pages, 6 figures

  4. arXiv:2101.11256  [pdf, other

    cs.LG math.NA stat.ML

    Partition of unity networks: deep hp-approximation

    Authors: Kook** Lee, Nathaniel A. Trask, Ravi G. Patel, Mamikon A. Gulian, Eric C. Cyr

    Abstract: Approximation theorists have established best-in-class optimal approximation rates of deep neural networks by utilizing their ability to simultaneously emulate partitions of unity and monomials. Motivated by this, we propose partition of unity networks (POUnets) which incorporate these elements directly into the architecture. Classification architectures of the type used to learn probability measu… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: 8 pages, 5 figures

  5. arXiv:2009.11992  [pdf, other

    physics.comp-ph cs.LG math.NA stat.ML

    A physics-informed operator regression framework for extracting data-driven continuum models

    Authors: Ravi G. Patel, Nathaniel A. Trask, Mitchell A. Wood, Eric C. Cyr

    Abstract: The application of deep learning toward discovery of data-driven models requires careful application of inductive biases to obtain a description of physics which is both accurate and robust. We present here a framework for discovering continuum models from high fidelity molecular simulation data. Our approach applies a neural network parameterization of governing physics in modal space, allowing a… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 37 pages, 15 figures

  6. arXiv:2006.10123  [pdf, other

    cs.LG stat.ML

    A block coordinate descent optimizer for classification problems exploiting convexity

    Authors: Ravi G. Patel, Nathaniel A. Trask, Mamikon A. Gulian, Eric C. Cyr

    Abstract: Second-order optimizers hold intriguing potential for deep learning, but suffer from increased cost and sensitivity to the non-convexity of the loss surface as compared to gradient-based approaches. We introduce a coordinate descent method to train deep neural networks for classification tasks that exploits global convexity of the cross-entropy loss in the weights of the linear layer. Our hybrid N… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 10 pages, 4 figures

  7. arXiv:1912.04862  [pdf, other

    cs.LG math.NA stat.ML

    Robust Training and Initialization of Deep Neural Networks: An Adaptive Basis Viewpoint

    Authors: Eric C. Cyr, Mamikon A. Gulian, Ravi G. Patel, Mauro Perego, Nathaniel A. Trask

    Abstract: Motivated by the gap between theoretical optimal approximation rates of deep neural networks (DNNs) and the accuracy realized in practice, we seek to improve the training of DNNs. The adoption of an adaptive basis viewpoint of DNNs leads to novel initializations and a hybrid least squares/gradient descent optimizer. We provide analysis of these techniques and illustrate via numerical examples dram… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: 26 pages

  8. arXiv:1909.05371  [pdf, other

    cs.LG math.DS physics.data-an stat.ML

    GMLS-Nets: A framework for learning from unstructured data

    Authors: Nathaniel Trask, Ravi G. Patel, Ben J. Gross, Paul J. Atzberger

    Abstract: Data fields sampled on irregularly spaced points arise in many applications in the sciences and engineering. For regular grids, Convolutional Neural Networks (CNNs) have been successfully used to gaining benefits from weight sharing and invariances. We generalize CNNs by introducing methods for data on unstructured point clouds based on Generalized Moving Least Squares (GMLS). GMLS is a non-parame… ▽ More

    Submitted 13 September, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

    Journal ref: AAAI-MLPS Proceedings, (2020)