Showing 1–3 of 3 results for author: Jalagam, S

Searching in archive cs.
  1. arXiv:2312.17227  [pdf, other]

    cs.LG cs.AI

    Gradient-based Planning with World Models

    Authors: Jyothir S V, Siddhartha Jalagam, Yann LeCun, Vlad Sobal

    Abstract: The enduring challenge in the field of artificial intelligence has been the control of systems to achieve desired behaviours. While for systems governed by straightforward dynamics equations, methods like Linear Quadratic Regulation (LQR) have historically proven highly effective, most real-world tasks, which require a general problem-solver, demand world models with dynamics that cannot be easily…

    Submitted 28 December, 2023; originally announced December 2023.

  2. arXiv:2309.04516  [pdf, ps, other]

    eess.AS cs.LG cs.SD

    End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model Pretraining

    Authors: Saksham Bassi, Giulio Duregon, Siddhartha Jalagam, David Roth

    Abstract: The SOTA in transcription of disfluent and conversational speech has in recent years favored two-stage models, with separate transcription and cleaning stages. We believe that previous attempts at end-to-end disfluency removal have fallen short because of the representational advantage that large-scale language model pretraining has given to lexical models. Until recently, the high dimensionality…

    Submitted 8 September, 2023; originally announced September 2023.

  3. arXiv:2211.10831  [pdf, other]

    cs.LG

    Joint Embedding Predictive Architectures Focus on Slow Features

    Authors: Vlad Sobal, Jyothir S V, Siddhartha Jalagam, Nicolas Carion, Kyunghyun Cho, Yann LeCun

    Abstract: Many common methods for learning a world model for pixel-based environments use generative architectures trained with pixel-level reconstruction objectives. Recently proposed Joint Embedding Predictive Architectures (JEPA) offer a reconstruction-free alternative. In this work, we analyze performance of JEPA trained with VICReg and SimCLR objectives in the fully offline setting without access to re…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 4 pages (3 figures), short paper for the SSL Theory and Practice workshop at NeurIPS 2022. Code is available at https://github.com/vladisai/JEPA_SSL_NeurIPS_2022