Showing 1–50 of 67 results for author: Chaudhari, P

  1. arXiv:2406.17249  [pdf, other]

    cs.RO

    SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation

    Authors: Xu Liu, Jiuzhou Lei, Ankit Prabhu, Yuezhan Tao, Igor Spasojevic, Pratik Chaudhari, Nikolay Atanasov, Vijay Kumar

    Abstract: This paper develops a real-time decentralized metric-semantic Simultaneous Localization and Mapping (SLAM) approach that leverages a sparse and lightweight object-based representation to enable a heterogeneous robot team to autonomously explore 3D environments featuring indoor, urban, and forested areas without relying on GPS. We use a hierarchical metric-semantic representation of the environment… ▽ More

    Submitted 2 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Preliminary release

  2. arXiv:2406.09631  [pdf, other]

    cs.RO

    Optimal Convex Cover as Collision-free Space Approximation for Trajectory Generation

    Authors: Yuwei Wu, Igor Spasojevic, Pratik Chaudhari, Vijay Kumar

    Abstract: We propose an online iterative algorithm to find a suitable convex cover to under-approximate the free space for autonomous navigation to delineate Safe Flight Corridors (SFC). The convex cover consists of a set of polytopes such that the union of the polytopes represents obstacle-free space, allowing us to find trajectories for robots that lie within the convex cover. In order to find the SFC tha… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.07431  [pdf, other]

    cs.MA cs.CV

    Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments

    Authors: Christopher D. Hsu, Pratik Chaudhari

    Abstract: We study pursuit-evasion games in highly occluded urban environments, e.g. tall buildings in a city, where a scout (quadrotor) tracks multiple dynamic targets on the ground. We show that we can build a neural radiance field (NeRF) representation of the city -- online -- using RGB and depth images from different vantage points. This representation is used to calculate the information gain to both e… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 8 pages, 8 figures, 1 table

  4. arXiv:2406.07361  [pdf, other]

    cs.CV cs.LG eess.IV

    Deep Implicit Optimization for Robust and Flexible Image Registration

    Authors: Rohit Jena, Pratik Chaudhari, James C. Gee

    Abstract: Deep Learning in Image Registration (DLIR) methods have been tremendously successful in image registration due to their speed and ability to incorporate weak label supervision at training time. However, DLIR methods forego many of the benefits of classical optimization-based methods. The functional nature of deep networks does not guarantee that the predicted transformation is a local minimum of the… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2405.14061  [pdf, other]

    cs.AI cs.CL cs.LG

    Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI

    Authors: Tian Yu Liu, Stefano Soatto, Matteo Marchi, Pratik Chaudhari, Paulo Tabuada

    Abstract: We tackle the question of whether Large Language Models (LLMs), viewed as dynamical systems with state evolving in the embedding space of symbolic tokens, are observable. That is, whether there exist multiple 'mental' state trajectories that yield the same sequence of generated tokens, or sequences that belong to the same Nerode equivalence class ('meaning'). If not observable, mental state trajec… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  6. arXiv:2405.09717  [pdf, other]

    cs.CV

    From NeRFs to Gaussian Splats, and Back

    Authors: Siming He, Zach Osman, Pratik Chaudhari

    Abstract: For robotics applications where there is a limited number of (typically ego-centric) views, parametric representations such as neural radiance fields (NeRFs) generalize better than non-parametric ones such as Gaussian splatting (GS) to views that are very different from those in the training data; GS however can render much faster than NeRFs. We develop a procedure to convert back and forth betwee… ▽ More

    Submitted 10 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  7. arXiv:2404.02325  [pdf, ps, other]

    cs.LG eess.SY math.OC

    Heat Death of Generative Models in Closed-Loop Learning

    Authors: Matteo Marchi, Stefano Soatto, Pratik Chaudhari, Paulo Tabuada

    Abstract: Improvement and adoption of generative machine learning models is rapidly accelerating, as exemplified by the popularity of LLMs (Large Language Models) for text, and diffusion models for image generation. As generative models become widespread, data they generate is incorporated into shared content through the public web. This opens the question of what happens when data generated by a model is fe… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  8. arXiv:2404.01249  [pdf, other]

    cs.CV

    FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Registration

    Authors: Rohit Jena, Pratik Chaudhari, James C. Gee

    Abstract: Diffeomorphic Image Registration is a critical part of the analysis in various imaging modalities and downstream tasks like image translation, segmentation, and atlas building. Registration algorithms based on optimization have stood the test of time in terms of accuracy, reliability, and robustness across a wide spectrum of modalities and acquisition settings. However, these algorithms converge s… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  9. arXiv:2404.00769  [pdf, other]

    cs.RO

    An Active Perception Game for Robust Autonomous Exploration

    Authors: Siming He, Yuezhan Tao, Igor Spasojevic, Vijay Kumar, Pratik Chaudhari

    Abstract: We formulate active perception for an autonomous agent that explores an unknown environment as a two-player zero-sum game: the agent aims to maximize information gained from the environment while the environment aims to minimize the information gained by the agent. In each episode, the environment reveals a set of actions with their potentially erroneous information gain. In order to select the be… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  10. arXiv:2312.02521  [pdf, other]

    cs.CV cs.AI

    Retrieving Conditions from Reference Images for Diffusion Models

    Authors: Haoran Tang, Xin Zhou, Jieren Deng, Zhihong Pan, Hao Tian, Pratik Chaudhari

    Abstract: Newly developed diffusion-based techniques have showcased phenomenal abilities in producing a wide range of high-quality images, sparking considerable interest in various applications. A prevalent scenario is to generate new images based on a subject from reference images. This subject could be face identity for styled avatars, body and clothing for virtual try-on and so on. Satisfying this requir… ▽ More

    Submitted 15 March, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

  11. arXiv:2311.10812  [pdf, other]

    cs.CV cs.GR cs.LG

    SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos

    Authors: Rohit Jena, Ganesh Subramanian Iyer, Siddharth Choudhary, Brandon Smith, Pratik Chaudhari, James Gee

    Abstract: We propose SplatArmor, a novel approach for recovering detailed and animatable human models by `armoring' a parameterized body model with 3D Gaussians. Our approach represents the human as a set of 3D Gaussians within a canonical space, whose articulation is defined by extending the skinning of the underlying SMPL geometry to arbitrary locations in the canonical space. To account for pose-dependen… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  12. arXiv:2310.09892  [pdf, other]

    cs.RO

    Active Perception using Neural Radiance Fields

    Authors: Siming He, Christopher D. Hsu, Dexter Ong, Yifei Simon Shao, Pratik Chaudhari

    Abstract: We study active perception from first principles to argue that an autonomous agent performing active perception should maximize the mutual information that past observations possess about future ones. Doing so requires (a) a representation of the scene that summarizes past observations and the ability to update this representation to incorporate new observations (state estimation and mapping), (b)… ▽ More

    Submitted 30 March, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Report number: Proc. of the American Control Conference (ACC) 2024

  13. arXiv:2310.02162  [pdf, other]

    cs.RO

    TreeScope: An Agricultural Robotics Dataset for LiDAR-Based Mapping of Trees in Forests and Orchards

    Authors: Derek Cheng, Fernando Cladera Ojeda, Ankit Prabhu, Xu Liu, Alan Zhu, Patrick Corey Green, Reza Ehsani, Pratik Chaudhari, Vijay Kumar

    Abstract: Data collection for forestry, timber, and agriculture currently relies on manual techniques which are labor-intensive and time-consuming. We seek to demonstrate that robotics offers improvements over these techniques and accelerate agricultural research, beginning with semantic segmentation and diameter estimation of trees in forests and orchards. We present TreeScope v1.0, the first robotics data… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) for review

  14. arXiv:2309.14254  [pdf, other]

    physics.data-an hep-ex

    End-to-end deep learning inference with CMSSW via ONNX using docker

    Authors: Purva Chaudhari, Shravan Chaudhari, Ruchi Chudasama, Sergei Gleyzer

    Abstract: Deep learning techniques have been proven to provide excellent performance for a variety of high-energy physics applications, such as particle identification, event reconstruction and trigger operations. Recently, we developed an end-to-end deep learning approach to identify various particles using low-level detector information from high-energy collisions. These models will be incorporated in the… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 9 pages, 7 figures, CHEP2023 proceedings, submitted to EPJ Web of Conferences

    Report number: CMS CR-2023/161

  15. arXiv:2309.13720  [pdf, other]

    cs.RO

    Design and Evaluation of Motion Planners for Quadrotors in Environments with Varying Complexities

    Authors: Yifei Simon Shao, Yuwei Wu, Laura Jarin-Lipschitz, Pratik Chaudhari, Vijay Kumar

    Abstract: Motion planning techniques for quadrotors have advanced significantly over the past decade. Most successful planners have two stages: a front-end that determines a path that incorporates geometric (or kinematic or input) constraints and specifies the homotopy class of the trajectory, and a back-end that optimizes this path to respect dynamics and input constraints. While there are many different c… ▽ More

    Submitted 7 March, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

  16. arXiv:2309.09165  [pdf]

    cs.ET cs.AR physics.app-ph

    Analog Content-Addressable Memory from Complementary FeFETs

    Authors: Xiwen Liu, Keshava Katti, Yunfei He, Paul Jacob, Claudia Richter, Uwe Schroeder, Santosh Kurinec, Pratik Chaudhari, Deep Jariwala

    Abstract: To address the increasing computational demands of artificial intelligence (AI) and big data, compute-in-memory (CIM) integrates memory and processing units into the same physical location, reducing the time and energy overhead of the system. Despite advancements in non-volatile memory (NVM) for matrix multiplication, other critical data-intensive operations, like parallel search, have been overlo… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

  17. arXiv:2308.03175  [pdf, other]

    cs.LG eess.IV q-bio.QM

    Adapting Machine Learning Diagnostic Models to New Populations Using a Small Amount of Data: Results from Clinical Neuroscience

    Authors: Rongguang Wang, Guray Erus, Pratik Chaudhari, Christos Davatzikos

    Abstract: Machine learning (ML) has shown great promise for revolutionizing a number of areas, including healthcare. However, it is also facing a reproducibility crisis, especially in medicine. ML models that are carefully constructed from and evaluated on a training set might not generalize well on data from different patient populations or acquisition instrument settings and protocols. We tackle this prob… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  18. arXiv:2307.06328  [pdf, other]

    cs.LG cs.AI

    Budgeting Counterfactual for Offline RL

    Authors: Yao Liu, Pratik Chaudhari, Rasool Fakoor

    Abstract: The main challenge of offline reinforcement learning, where data is limited, arises from a sequence of counterfactual reasoning dilemmas within the realm of potential actions: What if we were to choose a different course of action? These circumstances frequently give rise to extrapolation errors, which tend to accumulate exponentially with the problem horizon. Hence, it becomes crucial to acknowle… ▽ More

    Submitted 21 May, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Published at NeurIPS 2023

  19. arXiv:2305.18449  [pdf, other]

    cs.AI cs.CL cs.LG eess.SY

    Taming AI Bots: Controllability of Neural States in Large Language Models

    Authors: Stefano Soatto, Paulo Tabuada, Pratik Chaudhari, Tian Yu Liu

    Abstract: We tackle the question of whether an agent can, by suitable choice of prompts, control an AI bot to any state. To that end, we first introduce a formal definition of ``meaning'' that is amenable to analysis. Then, we characterize ``meaningful data'' on which large language models (LLMs) are ostensibly trained, and ``well-trained LLMs'' through conditions that are largely met by today's LLMs. While… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: TLDR: AI Bots are stochastic dynamical systems whose mental state can be controlled by both the user and the designer. The space of meanings, defined as equivalence classes of sentences, is learned during fine-tuning with human supervision, and safeguarding can be designed into the bot by establishing controls both at its input and output

  20. arXiv:2305.17332  [pdf, other]

    cs.LG cs.IT

    Learning Capacity: A Measure of the Effective Dimensionality of a Model

    Authors: Daiwei Chen, Weikai Chang, Pratik Chaudhari

    Abstract: We exploit a formal correspondence between thermodynamics and inference, where the number of samples can be thought of as the inverse temperature, to define a "learning capacity" which is a measure of the effective dimensionality of a model. We show that the learning capacity is a tiny fraction of the number of parameters for many deep networks trained on typical datasets, depends upon the number… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  21. arXiv:2305.01604  [pdf, other]

    cs.LG cond-mat.dis-nn

    The Training Process of Many Deep Networks Explores the Same Low-Dimensional Manifold

    Authors: Jialin Mao, Itay Griniasty, Han Kheng Teoh, Rahul Ramesh, Rubing Yang, Mark K. Transtrum, James P. Sethna, Pratik Chaudhari

    Abstract: We develop information-geometric techniques to analyze the trajectories of the predictions of deep networks during training. By examining the underlying high-dimensional probabilistic models, we reveal that the training process explores an effectively low-dimensional manifold. Networks with a wide range of architectures, sizes, trained using different optimization methods, regularization technique… ▽ More

    Submitted 19 March, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

    Journal ref: Proceedings of the National Academy of Sciences 121.12 (2024)

  22. arXiv:2304.11446  [pdf, other]

    cs.CV cs.AI

    Fast Diffusion Probabilistic Model Sampling through the lens of Backward Error Analysis

    Authors: Yansong Gao, Zhihong Pan, Xin Zhou, Le Kang, Pratik Chaudhari

    Abstract: Denoising diffusion probabilistic models (DDPMs) are a class of powerful generative models. The past few years have witnessed the great success of DDPMs in generating high-fidelity samples. A significant limitation of the DDPMs is the slow sampling procedure. DDPMs generally need hundreds or thousands of sequential function evaluations (steps) of neural networks to generate a sample. This paper ai… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: text overlap with arXiv:2101.12176 by other authors

  23. arXiv:2303.08808  [pdf, other]

    cs.CV

    Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos

    Authors: Rohit Jena, Pratik Chaudhari, James Gee, Ganesh Iyer, Siddharth Choudhary, Brandon M. Smith

    Abstract: Human reconstruction and synthesis from monocular RGB videos is a challenging problem due to clothing, occlusion, texture discontinuities and sharpness, and frame-specific pose changes. Many methods employ deferred rendering, NeRFs and implicit methods to represent clothed humans, on the premise that mesh-based representations cannot capture complex clothing and textures from RGB, silhouettes, and… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  24. arXiv:2210.17011  [pdf, other]

    cs.LG

    A picture of the space of typical learnable tasks

    Authors: Rahul Ramesh, Jialin Mao, Itay Griniasty, Rubing Yang, Han Kheng Teoh, Mark Transtrum, James P. Sethna, Pratik Chaudhari

    Abstract: We develop information geometric techniques to understand the representations learned by deep networks when they are trained on different tasks using supervised, meta-, semi-supervised and contrastive learning. We shed light on the following phenomena that relate to the structure of the space of tasks: (1) the manifold of probabilistic models trained on different tasks using different representati… ▽ More

    Submitted 21 July, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

  25. arXiv:2210.01422  [pdf, other]

    cs.LG

    Time-Varying Propensity Score to Bridge the Gap between the Past and Present

    Authors: Rasool Fakoor, Jonas Mueller, Zachary C. Lipton, Pratik Chaudhari, Alexander J. Smola

    Abstract: Real-world deployment of machine learning models is challenging because data evolves over time. While no model can work when data evolves in an arbitrary fashion, if there is some pattern to these changes, we might be able to design methods to address it. This paper addresses situations when data evolves gradually. We introduce a time-varying propensity score that can detect gradual shifts in the… ▽ More

    Submitted 2 May, 2024; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Published at ICLR 2024

  26. arXiv:2208.10967  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    The Value of Out-of-Distribution Data

    Authors: Ashwin De Silva, Rahul Ramesh, Carey E. Priebe, Pratik Chaudhari, Joshua T. Vogelstein

    Abstract: We expect the generalization error to improve with more samples from a similar task, and to deteriorate with more samples from an out-of-distribution (OOD) task. In this work, we show a counter-intuitive phenomenon: the generalization error of a task can be a non-monotonic function of the number of OOD samples. As the number of OOD samples increases, the generalization error on the target task imp… ▽ More

    Submitted 13 July, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Previous versions of this work have been presented at the Out-of-Distribution Generalization in Computer Vision (OOD-CV) Workshop (ECCV 2022) and the Workshop on Distribution Shifts (NeurIPS 2022)

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:7366-7389, 2023

  27. arXiv:2208.01430  [pdf, other]

    cs.MA

    A Model for Multi-Agent Heterogeneous Interaction Problems

    Authors: Christopher D. Hsu, Mulugeta A. Haile, Pratik Chaudhari

    Abstract: We introduce a model for multi-agent interaction problems to understand how a heterogeneous team of agents should organize its resources to tackle a heterogeneous team of attackers. This model is inspired by how the human immune system tackles a diverse set of pathogens. The key property of this model is a "cross-reactivity" kernel which enables a particular defender type to respond strongly to so… ▽ More

    Submitted 15 October, 2023; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: 8 pages, 7 figures

    Report number: Proc. of the American Control Conference (ACC), 2024

  28. arXiv:2207.01614  [pdf, other]

    cs.CV cs.LG

    Beyond mAP: Towards better evaluation of instance segmentation

    Authors: Rohit Jena, Lukas Zhornyak, Nehal Doiphode, Pratik Chaudhari, Vivek Buch, James Gee, Jianbo Shi

    Abstract: Correctness of instance segmentation constitutes counting the number of objects, correctly localizing all predictions and classifying each localized prediction. Average Precision is the de-facto metric used to measure all these constituents of segmentation. However, this metric does not penalize duplicate predictions in the high-recall range, and cannot distinguish instances that are localized cor… ▽ More

    Submitted 20 March, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted at CVPR 2023

  29. arXiv:2206.05575  [pdf]

    eess.IV cs.CV cs.DC cs.LG

    MammoFL: Mammographic Breast Density Estimation using Federated Learning

    Authors: Ramya Muthukrishnan, Angelina Heyler, Keshava Katti, Sarthak Pati, Walter Mankowski, Aprupa Alahari, Michael Sanborn, Emily F. Conant, Christopher Scott, Stacey Winham, Celine Vachon, Pratik Chaudhari, Despina Kontos, Spyridon Bakas

    Abstract: In this study, we automate quantitative mammographic breast density estimation with neural networks and show that this tool is a strong use case for federated learning on multi-institutional datasets. Our dataset included bilateral CC-view and MLO-view mammographic images from two separate institutions. Two U-Nets were separately trained on algorithm-generated labels to perform segmentation of the… ▽ More

    Submitted 13 December, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: Deep learning, federated learning, mammography, breast density, risk assessment

  30. Bias in Machine Learning Models Can Be Significantly Mitigated by Careful Training: Evidence from Neuroimaging Studies

    Authors: Rongguang Wang, Pratik Chaudhari, Christos Davatzikos

    Abstract: Despite the great promise that machine learning has offered in many fields of medicine, it has also raised concerns about potential biases and poor generalization across genders, age distributions, races and ethnicities, hospitals, and data acquisition equipment and protocols. In the current study, and in the context of three brain diseases, we provide evidence which suggests that when properly tr… ▽ More

    Submitted 30 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Journal ref: Proceedings of the National Academy of Sciences 120.6 (2023)

  31. arXiv:2202.12482  [pdf, other]

    stat.ML cs.LG math.ST

    Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity

    Authors: Shiyun Xu, Zhiqi Bu, Pratik Chaudhari, Ian J. Barnett

    Abstract: Interpretable machine learning has demonstrated impressive performance while preserving explainability. In particular, neural additive models (NAM) offer the interpretability to the black-box deep learning and achieve state-of-the-art accuracy among the large family of generalized additive models. In order to empower NAM with feature selection and improve the generalization, we propose the sparse… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  32. arXiv:2202.00187  [pdf, other]

    stat.ML cs.LG

    Deep Reference Priors: What is the best way to pretrain a model?

    Authors: Yansong Gao, Rahul Ramesh, Pratik Chaudhari

    Abstract: What is the best way to exploit extra data -- be it unlabeled data from the same task, or labeled data from a related task -- to learn a given task? This paper formalizes the question using the theory of reference priors. Reference priors are objective, uninformative Bayesian priors that maximize the mutual information between the task and the weights of the model. Such priors enable the task to m… ▽ More

    Submitted 15 June, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: 24 pages

  33. arXiv:2201.07372  [pdf, other]

    cs.LG cs.AI

    Prospective Learning: Principled Extrapolation to the Future

    Authors: Ashwin De Silva, Rahul Ramesh, Lyle Ungar, Marshall Hussain Shuler, Noah J. Cowan, Michael Platt, Chen Li, Leyla Isik, Seung-Eon Roh, Adam Charles, Archana Venkataraman, Brian Caffo, Javier J. How, Justus M Kebschull, John W. Krakauer, Maxim Bichuch, Kaleab Alemayehu Kinfu, Eva Yezerets, Dinesh Jayaraman, Jong M. Shin, Soledad Villar, Ian Phillips, Carey E. Priebe, Thomas Hartung, Michael I. Miller, et al. (18 additional authors not shown)

    Abstract: Learning is a process which can update decision rules, based on past experience, such that future performance improves. Traditionally, machine learning is often evaluated under the assumption that the future will be identical to the past in distribution or change adversarially. But these assumptions can be either too optimistic or pessimistic for many problems in the real world. Real world scenari… ▽ More

    Submitted 13 July, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted at the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

  34. arXiv:2110.14163  [pdf, other]

    cs.LG stat.ML

    Does the Data Induce Capacity Control in Deep Learning?

    Authors: Rubing Yang, Jialin Mao, Pratik Chaudhari

    Abstract: We show that the input correlation matrix of typical classification datasets has an eigenspectrum where, after a sharp initial drop, a large number of small eigenvalues are distributed uniformly over an exponentially large range. This structure is mirrored in a network trained on this data: we show that the Hessian and the Fisher Information Matrix (FIM) have eigenvalues that are spread uniformly… ▽ More

    Submitted 22 June, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

  35. arXiv:2106.06845  [pdf, other]

    cs.LG eess.IV stat.ML

    Harmonization with Flow-based Causal Inference

    Authors: Rongguang Wang, Pratik Chaudhari, Christos Davatzikos

    Abstract: Heterogeneity in medical data, e.g., from data collected at different sites and with different protocols in a clinical study, is a fundamental hurdle for accurate prediction using machine learning models, as such models often fail to generalize well. This paper leverages a recently proposed normalizing-flow-based method to perform counterfactual inference upon a structural causal model (SCM), in o… ▽ More

    Submitted 10 July, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

  36. arXiv:2106.03027  [pdf, other]

    cs.LG

    Model Zoo: A Growing "Brain" That Learns Continually

    Authors: Rahul Ramesh, Pratik Chaudhari

    Abstract: This paper argues that continual learning methods can benefit by splitting the capacity of the learner across multiple models. We use statistical learning theory and experimental analysis to show how multiple tasks can interact with each other in a non-trivial fashion when a single model is trained on them. The generalization error on a particular task can improve when it is trained with synergist… ▽ More

    Submitted 15 June, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Report number: Proc. of the International Conference of Learning Representations (ICLR) 2022

  37. arXiv:2103.16172  [pdf, other]

    cond-mat.stat-mech quant-ph

    Zeno crossovers in the entanglement speed of spin chains with noisy impurities

    Authors: Abhijit P. Chaudhari, Shane P. Kelly, Riccardo J. Valencia-Tortora, Jamir Marino

    Abstract: We use a noisy signal with finite correlation time to drive a spin (dissipative impurity) in the quantum XY spin chain and calculate the dynamics of entanglement entropy of a bipartition of spins, for a stochastic quantum trajectory. We compute the noise averaged entanglement entropy of a bipartition of spins and observe that its speed of spreading decreases at strong dissipation, as a result of t… ▽ More

    Submitted 5 October, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: 14 pages, 5 figures

    Journal ref: J. Stat. Mech. (2022) 103101

  38. arXiv:2103.14184  [pdf, other]

    cs.CV

    Deformable Linear Object Prediction Using Locally Linear Latent Dynamics

    Authors: Wenbo Zhang, Karl Schmeckpeper, Pratik Chaudhari, Kostas Daniilidis

    Abstract: We propose a framework for deformable linear object prediction. Prediction of deformable objects (e.g., rope) is challenging due to their non-linear dynamics and infinite-dimensional configuration spaces. By mapping the dynamics from a non-linear space to a linear space, we can use the good properties of linear dynamics for easier learning and more efficient prediction. We learn a locally linear,… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

  39. Embracing the Disharmony in Medical Imaging: A Simple and Effective Framework for Domain Adaptation

    Authors: Rongguang Wang, Pratik Chaudhari, Christos Davatzikos

    Abstract: Domain shift, the mismatch between training and testing data characteristics, causes significant degradation in the predictive performance in multi-source imaging scenarios. In medical imaging, the heterogeneity of population, scanners and acquisition protocols at different sites presents a significant domain shift challenge and has limited the widespread clinical adoption of machine learning mode… ▽ More

    Submitted 20 December, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Journal ref: Medical Image Analysis 76:102309 (2022)

  40. arXiv:2102.09225  [pdf, other]

    cs.LG stat.ML

    Continuous Doubly Constrained Batch Reinforcement Learning

    Authors: Rasool Fakoor, Jonas Mueller, Kavosh Asadi, Pratik Chaudhari, Alexander J. Smola

    Abstract: Reliant on too many experiments to learn good actions, current Reinforcement Learning (RL) algorithms have limited applicability in real-world settings, which can be too expensive to allow exploration. We propose an algorithm for batch RL, where effective policies are learned using only a fixed offline dataset instead of online interactions with the environment. The limited data in batch RL produc… ▽ More

    Submitted 6 December, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021 conference paper

  41. arXiv:2011.08055  [pdf, other]

    cs.MA cs.LG

    Scalable Reinforcement Learning Policies for Multi-Agent Control

    Authors: Christopher D. Hsu, Heejin Jeong, George J. Pappas, Pratik Chaudhari

    Abstract: We develop a Multi-Agent Reinforcement Learning (MARL) method to learn scalable control policies for target tracking. Our method can handle an arbitrary number of pursuers and targets; we show results for tasks consisting of up to 1000 pursuers tracking 1000 targets. We use a decentralized, partially-observable Markov Decision Process framework to model pursuers as agents receiving partial observatio… ▽ More

    Submitted 10 November, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: 8 pages, 10 figures, contributed paper at IROS 2021

  42. arXiv:2011.00613  [pdf, other]

    cs.LG stat.ML

    An Information-Geometric Distance on the Space of Tasks

    Authors: Yansong Gao, Pratik Chaudhari

    Abstract: This paper prescribes a distance between learning tasks modeled as joint distributions on data and labels. Using tools in information geometry, the distance is defined to be the length of the shortest weight trajectory on a Riemannian manifold as a classifier is fitted on an interpolated task. The interpolated task evolves from the source to the target task using an optimal transport formulation.…

    Submitted 24 February, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Report number: Proc. of the International Conference on Machine Learning (ICML) 2021

  43. arXiv:2008.07081  [pdf, other]

    cs.LG cs.RO stat.ML

    MIDAS: Multi-agent Interaction-aware Decision-making with Adaptive Strategies for Urban Autonomous Navigation

    Authors: Xiaoyi Chen, Pratik Chaudhari

    Abstract: Autonomous navigation in crowded, complex urban environments requires interacting with other agents on the road. A common solution to this problem is to use a prediction model to guess the likely future actions of other agents. While this is reasonable, it leads to overly conservative plans because it does not explicitly model the mutual influence of the actions of interacting agents. This paper b…

    Submitted 23 March, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: Code available at https://github.com/sherrychen1120/MIDAS. To be presented at IEEE International Conference on Robotics and Automation (ICRA), 2021

  44. arXiv:2008.00759  [pdf, other]

    cs.LG stat.ML

    Proximal Deterministic Policy Gradient

    Authors: Marco Maggipinto, Gian Antonio Susto, Pratik Chaudhari

    Abstract: This paper introduces two simple techniques to improve off-policy Reinforcement Learning (RL) algorithms. First, we formulate off-policy RL as a stochastic proximal point iteration. The target network plays the role of the variable of optimization and the value network computes the proximal operator. Second, we exploit the two value functions commonly employed in state-of-the-art off-policy algor…

    Submitted 3 August, 2020; originally announced August 2020.

  45. arXiv:2006.15199  [pdf, other]

    cs.LG stat.ML

    DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning

    Authors: Rasool Fakoor, Pratik Chaudhari, Alexander J. Smola

    Abstract: This paper prescribes a suite of techniques for off-policy Reinforcement Learning (RL) that simplify the training process and reduce the sample complexity. First, we show that simple Deterministic Policy Gradient works remarkably well as long as the overestimation bias is controlled. This is in contrast to existing literature, which creates sophisticated off-policy techniques. Second, we pinpoint trai…

    Submitted 26 June, 2020; originally announced June 2020.

  46. arXiv:2006.14284  [pdf, other]

    cs.LG stat.ML

    Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation

    Authors: Rasool Fakoor, Jonas Mueller, Nick Erickson, Pratik Chaudhari, Alexander J. Smola

    Abstract: Automated machine learning (AutoML) can produce complex model ensembles by stacking, bagging, and boosting many individual models like trees, deep networks, and nearest neighbor estimators. While highly accurate, the resulting predictors are large, slow, and opaque as compared to their constituents. To improve the deployment of AutoML on tabular data, we propose FAST-DAD to distill arbitrarily com…

    Submitted 25 June, 2020; originally announced June 2020.

    Journal ref: NeurIPS 2020

  47. arXiv:2005.04755  [pdf, other]

    cs.RO cs.LG eess.SY

    BayesRace: Learning to race autonomously using prior experience

    Authors: Achin Jain, Matthew O'Kelly, Pratik Chaudhari, Manfred Morari

    Abstract: Autonomous race cars require perception, estimation, planning, and control modules which work together asynchronously while driving at the limit of a vehicle's handling capability. A fundamental challenge encountered in designing these software components lies in predicting the vehicle's future state (e.g. position, orientation, and speed) with high accuracy. The root cause is the difficulty in id…

    Submitted 15 November, 2020; v1 submitted 10 May, 2020; originally announced May 2020.

    Journal ref: 4th Conference on Robot Learning (CoRL 2020)

  48. arXiv:2004.02441  [pdf, other]

    cs.LG stat.ML

    TraDE: Transformers for Density Estimation

    Authors: Rasool Fakoor, Pratik Chaudhari, Jonas Mueller, Alexander J. Smola

    Abstract: We present TraDE, a self-attention-based architecture for auto-regressive density estimation with continuous and discrete valued data. Our model is trained using a penalized maximum likelihood objective, which ensures that samples from the density estimate resemble the training data distribution. The use of self-attention means that the model need not retain conditional sufficient statistics durin…

    Submitted 14 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

  49. arXiv:2002.12406  [pdf, other]

    cs.LG stat.ML

    A Free-Energy Principle for Representation Learning

    Authors: Yansong Gao, Pratik Chaudhari

    Abstract: This paper employs a formal connection of machine learning with thermodynamics to characterize the quality of learnt representations for transfer learning. We discuss how information-theoretic functionals such as rate, distortion and classification loss of a model lie on a convex, so-called equilibrium surface. We prescribe dynamical processes to traverse this surface under constraints, e.g., an iso…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 21 pages, 14 figures

  50. arXiv:2002.11770  [pdf, other]

    cs.CV cs.LG stat.ML

    Rethinking the Hyperparameters for Fine-tuning

    Authors: Hao Li, Pratik Chaudhari, Hao Yang, Michael Lam, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

    Abstract: Fine-tuning from pre-trained ImageNet models has become the de-facto standard for various computer vision tasks. Current practices for fine-tuning typically involve selecting an ad-hoc choice of hyperparameters and keeping them fixed to values normally used for training from scratch. This paper re-examines several common practices of setting hyperparameters for fine-tuning. Our findings are based…

    Submitted 19 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper at ICLR 2020