Showing 1–9 of 9 results for author: Marwah, T

  1. arXiv:2403.07187  [pdf, other]

    cs.LG

    UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation

    Authors: Junhong Shen, Tanya Marwah, Ameet Talwalkar

    Abstract: We present Unified PDE Solvers (UPS), a data- and compute-efficient approach to developing unified neural operators for diverse families of spatiotemporal PDEs from various domains, dimensions, and resolutions. UPS embeds different PDEs into a shared representation space and processes them using a FNO-transformer architecture. Rather than training the network from scratch, which is data-demanding…

    Submitted 23 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  2. arXiv:2312.00234  [pdf, other]

    cs.LG math.NA stat.ML

    Deep Equilibrium Based Neural Operators for Steady-State PDEs

    Authors: Tanya Marwah, Ashwini Pokle, J. Zico Kolter, Zachary C. Lipton, Jianfeng Lu, Andrej Risteski

    Abstract: Data-driven machine learning approaches are being increasingly used to solve partial differential equations (PDEs). They have shown particularly striking successes when training an operator, which takes as input a PDE in some family, and outputs its solution. However, the architectural design space, especially given structural knowledge of the PDE family of interest, is still poorly understood. We…

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  3. arXiv:2211.15853  [pdf, other]

    cs.LG

    Disentangling the Mechanisms Behind Implicit Regularization in SGD

    Authors: Zachary Novack, Simran Kaur, Tanya Marwah, Saurabh Garg, Zachary C. Lipton

    Abstract: A number of competing hypotheses have been proposed to explain why small-batch Stochastic Gradient Descent (SGD) leads to improved generalization over the full-batch regime, with recent work crediting the implicit regularization of various quantities throughout training. However, to date, empirical evidence assessing the explanatory power of these hypotheses is lacking. In this paper, we conduct an…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted as Spotlight at the NeurIPS 2022 Workshop for Higher Order Optimization in Machine Learning

  4. arXiv:2210.12101  [pdf, ps, other]

    cs.LG math.NA

    Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective

    Authors: Tanya Marwah, Zachary C. Lipton, Jianfeng Lu, Andrej Risteski

    Abstract: A burgeoning line of research leverages deep neural networks to approximate the solutions to high dimensional PDEs, opening lines of theoretical inquiry focused on explaining how it is that these models appear to evade the curse of dimensionality. However, most prior theoretical analyses have been limited to linear PDEs. In this work, we take a step towards studying the representational power of n…

    Submitted 27 March, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  5. arXiv:2103.02138  [pdf, ps, other]

    cs.LG math.NA stat.ML

    Parametric Complexity Bounds for Approximating PDEs with Neural Networks

    Authors: Tanya Marwah, Zachary C. Lipton, Andrej Risteski

    Abstract: Recent experiments have shown that deep networks can approximate solutions to high-dimensional PDEs, seemingly escaping the curse of dimensionality. However, questions regarding the theoretical basis for such approximations, including the required network size, remain open. In this paper, we investigate the representational power of neural networks for approximating solutions to linear elliptic PD…

    Submitted 6 July, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

  6. arXiv:1908.08147  [pdf, other]

    cs.SI cs.IR

    Sentiment Dynamics in Social Media News Channels

    Authors: Nagendra Kumar, Rakshita Nagalla, Tanya Marwah, Manish Singh

    Abstract: Social media is currently one of the most important means of news communication. Since people are consuming a large fraction of their daily news through social media, most of the traditional news channels are using social media to catch the attention of users. Each news channel has its own strategies to attract more users. In this paper, we analyze how the news channels use sentiment to garner use…

    Submitted 21 August, 2019; originally announced August 2019.

  7. arXiv:1905.03743  [pdf, other]

    cs.CV cs.AI cs.LG

    Interactive Image Generation Using Scene Graphs

    Authors: Gaurav Mittal, Shubham Agrawal, Anuva Agarwal, Sushant Mehta, Tanya Marwah

    Abstract: Recent years have witnessed some exciting developments in the domain of generating images from scene-based text descriptions. These approaches have primarily focused on generating images from a static text description and are limited to generating images in a single pass. They are unable to generate an image interactively based on an incrementally additive text description (something that is more…

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: Published at ICLR 2019 Deep Generative Models for Highly Structured Data Workshop

  8. arXiv:1708.05980  [pdf, other]

    cs.CV

    Attentive Semantic Video Generation using Captions

    Authors: Tanya Marwah, Gaurav Mittal, Vineeth N. Balasubramanian

    Abstract: This paper proposes a network architecture to perform variable length semantic video generation using captions. We adopt a new perspective towards video generation where we allow the captions to be combined with the long-term and short-term dependencies between video frames and thus generate a video in an incremental manner. Our experiments demonstrate our network architecture's ability to disting…

    Submitted 21 October, 2017; v1 submitted 20 August, 2017; originally announced August 2017.

    Journal ref: Presented at ICCV 2017 (International Conference on Computer Vision)

  9. Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures

    Authors: Gaurav Mittal, Tanya Marwah, Vineeth N. Balasubramanian

    Abstract: This paper introduces a novel approach for generating videos called Synchronized Deep Recurrent Attentive Writer (Sync-DRAW). Sync-DRAW can also perform text-to-video generation which, to the best of our knowledge, makes it the first approach of its kind. It combines a Variational Autoencoder (VAE) with a Recurrent Attention Mechanism in a novel manner to create a temporally dependent sequence of…

    Submitted 21 October, 2017; v1 submitted 30 November, 2016; originally announced November 2016.