Skip to main content

Showing 1–50 of 284 results for author: Anandkumar, A

.
  1. arXiv:2407.01521  [pdf, other

    cs.LG cs.AI cs.CV

    Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing

    Authors: Bingliang Zhang, Wenda Chu, Julius Berner, Chenlin Meng, Anima Anandkumar, Yang Song

    Abstract: Diffusion models have recently achieved success in solving Bayesian inverse problems with learned data priors. Current methods build on top of the diffusion sampling process, where each denoising step makes small modifications to samples from the previous step. However, this process struggles to correct errors from earlier sampling steps, leading to worse performance in complicated nonlinear inver… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.15669  [pdf, other

    q-bio.BM

    CARE: a Benchmark Suite for the Classification and Retrieval of Enzymes

    Authors: Jason Yang, Ariane Mora, Shengchao Liu, Bruce J. Wittmann, Anima Anandkumar, Frances H. Arnold, Yisong Yue

    Abstract: Enzymes are important proteins that catalyze chemical reactions. In recent years, machine learning methods have emerged to predict enzyme function from sequence; however, there are no standardized benchmarks to evaluate these methods. We introduce CARE, a benchmark and dataset suite for the Classification And Retrieval of Enzymes (CARE). CARE centers on two tasks: (1) classification of a protein s… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.13301  [pdf, other

    cs.CV cs.RO

    ARDuP: Active Region Video Diffusion for Universal Policies

    Authors: Shuaiyi Huang, Mara Levy, Zhenyu Jiang, Anima Anandkumar, Yuke Zhu, Linxi Fan, De-An Huang, Abhinav Shrivastava

    Abstract: Sequential decision-making can be formulated as a text-conditioned video generation problem, where a video planner, guided by a text-defined goal, generates future frames visualizing planned actions, from which control actions are subsequently derived. In this work, we introduce Active Region Video Diffusion for Universal Policies (ARDuP), a novel framework for video-based policy learning that emp… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.03494  [pdf, other

    cs.LG math.NA stat.ML

    Solving Poisson Equations using Neural Walk-on-Spheres

    Authors: Hong Chul Nam, Julius Berner, Anima Anandkumar

    Abstract: We propose Neural Walk-on-Spheres (NWoS), a novel neural PDE solver for the efficient solution of high-dimensional Poisson equations. Leveraging stochastic representations and Walk-on-Spheres methods, we develop novel losses for neural networks based on the recursive solution of Poisson equations on spheres inside the domain. The resulting method is highly parallelizable and does not require spati… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024

  5. arXiv:2405.17216  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    Autoformalizing Euclidean Geometry

    Authors: Logan Murphy, Kaiyu Yang, Jialiang Sun, Zhaoyu Li, Anima Anandkumar, Xujie Si

    Abstract: Autoformalization involves automatically translating informal math into formal theorems and proofs that are machine-verifiable. Euclidean geometry provides an interesting and controllable domain for studying autoformalization. In this paper, we introduce a neuro-symbolic framework for autoformalizing Euclidean geometry, which combines domain knowledge, SMT solvers, and large language models (LLMs)… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024. The first two authors contributed equally

  6. arXiv:2404.12534  [pdf, other

    cs.AI cs.LG cs.LO stat.ML

    Towards Large Language Models as Copilots for Theorem Proving in Lean

    Authors: Peiyang Song, Kaiyu Yang, Anima Anandkumar

    Abstract: Theorem proving is an important challenge for large language models (LLMs), as formal proofs can be checked rigorously by proof assistants such as Lean, leaving no room for hallucination. Existing LLM-based provers try to prove theorems in a fully autonomous mode without human intervention. In this mode, they struggle with novel and challenging theorems, for which human insights may be critical. I… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: All code open-sourced at https://github.com/lean-dojo/LeanCopilot

  7. arXiv:2404.01990  [pdf, other

    cs.CV

    What is Point Supervision Worth in Video Instance Segmentation?

    Authors: Shuaiyi Huang, De-An Huang, Zhiding Yu, Shiyi Lan, Subhashree Radhakrishnan, Jose M. Alvarez, Abhinav Shrivastava, Anima Anandkumar

    Abstract: Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment, and track objects in videos. Conventional VIS methods rely on densely-annotated object masks which are expensive. We reduce the human annotations to only one point for each object in a video frame during training, and obtain high-quality mask predictions close to fully supervised models. Our proposed train… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  8. arXiv:2403.14148  [pdf, other

    cs.CV cs.LG

    Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

    Authors: Sihyun Yu, Weili Nie, De-An Huang, Boyi Li, **woo Shin, Anima Anandkumar

    Abstract: Video diffusion models have recently made great progress in generation quality, but are still limited by the high memory and computational requirements. This is because current video diffusion models often attempt to process high-dimensional videos directly. To tackle this issue, we propose content-motion latent diffusion model (CMD), a novel efficient extension of pretrained image diffusion model… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: ICLR 2024. Project page: https://sihyun.me/CMD

  9. arXiv:2403.12553  [pdf, other

    cs.LG

    Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs

    Authors: Md Ashiqur Rahman, Robert Joseph George, Mogab Elleithy, Daniel Leibovici, Zongyi Li, Boris Bonev, Colin White, Julius Berner, Raymond A. Yeh, Jean Kossaifi, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Existing neural operator architectures face challenges when solving multiphysics problems with coupled partial differential equations (PDEs), due to complex geometries, interactions between physical variables, and the lack of large amounts of high-resolution training data. To address these issues, we propose Codomain Attention Neural Operator (CoDA-NO), which tokenizes functions along the codomain… ▽ More

    Submitted 5 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  10. arXiv:2403.09230  [pdf, other

    cs.CV

    Improving Distant 3D Object Detection Using 2D Box Supervision

    Authors: Zetong Yang, Zhiding Yu, Chris Choy, Renhao Wang, Anima Anandkumar, Jose M. Alvarez

    Abstract: Improving the detection of distant 3d objects is an important yet challenging task. For camera-based 3D perception, the annotation of 3d bounding relies heavily on LiDAR for accurate depth information. As such, the distance of annotation is often limited due to the sparsity of LiDAR points on distant objects, which hampers the capability of existing detectors for long-range scenarios. We address t… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  11. arXiv:2403.03542  [pdf, other

    cs.LG math.NA

    DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training

    Authors: Zhongkai Hao, Chang Su, Songming Liu, Julius Berner, Chengyang Ying, Hang Su, Anima Anandkumar, Jian Song, Jun Zhu

    Abstract: Pre-training has been investigated to improve the efficiency and performance of training neural operators in data-scarce settings. However, it is largely in its infancy due to the inherent complexity and diversity, such as long trajectories, multiple scales and varying dimensions of partial differential equations (PDEs) data. In this paper, we present a new auto-regressive denoising pre-training s… ▽ More

    Submitted 6 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  12. arXiv:2403.03507  [pdf, other

    cs.LG

    GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

    Authors: Jiawei Zhao, Zhenyu Zhang, Beidi Chen, Zhangyang Wang, Anima Anandkumar, Yuandong Tian

    Abstract: Training Large Language Models (LLMs) presents significant memory challenges, predominantly due to the growing size of weights and optimizer states. Common memory-reduction approaches, such as low-rank adaptation (LoRA), add a trainable low-rank matrix to the frozen pre-trained weight in each layer, reducing trainable parameters and optimizer states. However, such approaches typically underperform… ▽ More

    Submitted 2 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: ICML 2024 (Oral)

  13. arXiv:2402.16845  [pdf, other

    cs.LG cs.AI math.NA

    Neural Operators with Localized Integral and Differential Kernels

    Authors: Miguel Liu-Schiaffini, Julius Berner, Boris Bonev, Thorsten Kurth, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Neural operators learn map**s between function spaces, which is practical for learning solution operators of PDEs and other scientific modeling applications. Among them, the Fourier neural operator (FNO) is a popular architecture that performs global convolutions in the Fourier space. However, such global operations are often prone to over-smoothing and may fail to capture local details. In cont… ▽ More

    Submitted 8 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted at 2024 International Conference on Machine Learning

  14. arXiv:2402.14167  [pdf, other

    cs.CV cs.LG

    T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching

    Authors: Zizheng Pan, Bohan Zhuang, De-An Huang, Weili Nie, Zhiding Yu, Chaowei Xiao, Jianfei Cai, Anima Anandkumar

    Abstract: Sampling from diffusion probabilistic models (DPMs) is often expensive for high-quality image generation and typically requires many steps with a large model. In this paper, we introduce sampling Trajectory Stitching T-Stitch, a simple yet efficient technique to improve the sampling efficiency with little or no generation degradation. Instead of solely using a large DPM for the entire sampling tra… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  15. arXiv:2402.11764  [pdf, other

    cs.CL cs.AI cs.CY

    ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

    Authors: Pengrui Han, Rafal Kocielnik, Adhithya Saravanan, Roy Jiang, Or Sharir, Anima Anandkumar

    Abstract: Large Language models (LLMs), while powerful, exhibit harmful social biases. Debiasing is often challenging due to computational costs, data constraints, and potential degradation of multi-task language capabilities. This work introduces a novel approach utilizing ChatGPT to generate synthetic training data, aiming to enhance the debiasing of LLMs. We propose two strategies: Targeted Prompting, wh… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024 Workshop on Language Technology for Equality, Diversity, Inclusion (LT-EDI-2024)

    MSC Class: 68T50 ACM Class: I.2.7; K.4.1

  16. arXiv:2402.01960  [pdf, other

    cs.LG

    Calibrated Uncertainty Quantification for Operator Learning via Conformal Prediction

    Authors: Ziqi Ma, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Operator learning has been increasingly adopted in scientific and engineering applications, many of which require calibrated uncertainty quantification. Since the output of operator learning is a continuous function, quantifying uncertainty simultaneously at all points in the domain is challenging. Current methods consider calibration at a single point or over one scalar function or make strong as… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  17. arXiv:2401.15122  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM stat.ML

    A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics

    Authors: Shengchao Liu, Weitao Du, Yan**g Li, Zhuoxinran Li, Vignesh Bhethanabotla, Nakul Rampal, Omar Yaghi, Christian Borgs, Anima Anandkumar, Hongyu Guo, Jennifer Chayes

    Abstract: In drug discovery, molecular dynamics (MD) simulation for protein-ligand binding provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites. There has been a long history of improving the efficiency of MD simulations through better numerical methods and, more recently, by utilizing machine learning (ML) methods. Yet, challenges remain, s… ▽ More

    Submitted 1 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  18. arXiv:2401.11037  [pdf, other

    cs.LG math.NA q-bio.QM

    Equivariant Graph Neural Operator for Modeling 3D Dynamics

    Authors: Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

    Abstract: Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics. Machine learning methods have achieved good success by learning graph neural networks to model spatial interactions. However, these approaches do not faithfully capture temporal correlations since the… ▽ More

    Submitted 2 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024. Copyright 2024 by the author(s)

  19. arXiv:2401.09421  [pdf, other

    quant-ph

    Towards large-scale quantum optimization solvers with few qubits

    Authors: Marco Sciorilli, Lucas Borges, Taylor L. Patti, Diego García-Martín, Giancarlo Camilo, Anima Anandkumar, Leandro Aolita

    Abstract: We introduce a variational quantum solver for combinatorial optimizations over $m=\mathcal{O}(n^k)$ binary variables using only $n$ qubits, with tunable $k>1$. The number of parameters and circuit depth display mild linear and sublinear scalings in $m$, respectively. Moreover, we analytically prove that the specific qubit-efficient encoding brings in a super-polynomial mitigation of barren plateau… ▽ More

    Submitted 25 March, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Report number: LA-UR-24-20389

  20. arXiv:2401.03844  [pdf, other

    cs.CV

    Fully Attentional Networks with Self-emerging Token Labeling

    Authors: Bingyin Zhao, Zhiding Yu, Shiyi Lan, Yutao Cheng, Anima Anandkumar, Yingjie Lao, Jose M. Alvarez

    Abstract: Recent studies indicate that Vision Transformers (ViTs) are robust against out-of-distribution scenarios. In particular, the Fully Attentional Network (FAN) - a family of ViT backbones, has achieved state-of-the-art robustness. In this paper, we revisit the FAN models and improve their pre-training with a self-emerging token labeling (STL) framework. Our method contains a two-stage training framew… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 5585-5595

  21. arXiv:2312.10065  [pdf, other

    cs.CY cs.AI

    Exploring Social Bias in Downstream Applications of Text-to-Image Foundation Models

    Authors: Adhithya Prakash Saravanan, Rafal Kocielnik, Roy Jiang, Pengrui Han, Anima Anandkumar

    Abstract: Text-to-image diffusion models have been adopted into key commercial workflows, such as art generation and image editing. Characterising the implicit social biases they exhibit, such as gender and racial stereotypes, is a necessary first step in avoiding discriminatory outcomes. While existing studies on social bias focus on image generation, the biases exhibited in alternate applications of diffu… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    ACM Class: F.2.2; I.2.7

  22. arXiv:2312.09323  [pdf, other

    cs.AI cs.LG

    Perspectives on the State and Future of Deep Learning - 2023

    Authors: Micah Goldblum, Anima Anandkumar, Richard Baraniuk, Tom Goldstein, Kyunghyun Cho, Zachary C Lipton, Melanie Mitchell, Preetum Nakkiran, Max Welling, Andrew Gordon Wilson

    Abstract: The goal of this series is to chronicle opinions and issues in the field of machine learning as they stand today and as they change over time. The plan is to host this survey periodically until the AI singularity paperclip-frenzy-driven doomsday, kee** an updated list of topical questions and interviewing new community members for each edition. In this issue, we probed people's opinions on inter… ▽ More

    Submitted 18 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

  23. arXiv:2312.03231  [pdf, other

    cs.LG cs.AI cs.CV cs.HC eess.AS

    Deep Multimodal Fusion for Surgical Feedback Classification

    Authors: Rafal Kocielnik, Elyssa Y. Wong, Timothy N. Chu, Lydia Lin, De-An Huang, Jiayun Wang, Anima Anandkumar, Andrew J. Hung

    Abstract: Quantification of real-time informal feedback delivered by an experienced surgeon to a trainee during surgery is important for skill improvements in surgical training. Such feedback in the live operating room is inherently multimodal, consisting of verbal conversations (e.g., questions and answers) as well as non-verbal elements (e.g., through visual cues like pointing to anatomic elements). In th… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Journal ref: Published in Proceedings of Machine Learning for Health 2024

  24. arXiv:2311.05967  [pdf, other

    physics.plasm-ph cs.LG

    Plasma Surrogate Modelling using Fourier Neural Operators

    Authors: Vignesh Gopakumar, Stanislas Pamela, Lorenzo Zanisi, Zongyi Li, Ander Gray, Daniel Brennand, Nitesh Bhatia, Gregory Stathopoulos, Matt Kusner, Marc Peter Deisenroth, Anima Anandkumar, JOREK Team, MAST Team

    Abstract: Predicting plasma evolution within a Tokamak reactor is crucial to realizing the goal of sustainable fusion. Capabilities in forecasting the spatio-temporal evolution of plasma rapidly and accurately allow us to quickly iterate over design and control strategies on current Tokamak devices and future reactors. Modelling plasma evolution using numerical solvers is often expensive, consuming many hou… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Journal ref: Nucl. Fusion 64 056025 (2024)

  25. arXiv:2310.15466  [pdf

    cs.LG

    EKGNet: A 10.96μW Fully Analog Neural Network for Intra-Patient Arrhythmia Classification

    Authors: Benyamin Haghi, Lin Ma, Sahin Lale, Anima Anandkumar, Azita Emami

    Abstract: We present an integrated approach by combining analog computing and deep learning for electrocardiogram (ECG) arrhythmia classification. We propose EKGNet, a hardware-efficient and fully analog arrhythmia classification architecture that archives high accuracy with low power consumption. The proposed architecture leverages the energy efficiency of transistors operating in the subthreshold region,… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted on IEEE Biomedical Circuits and Systems (BioCAS) 2023

  26. arXiv:2310.12931  [pdf, other

    cs.RO cs.AI cs.LG

    Eureka: Human-Level Reward Design via Coding Large Language Models

    Authors: Yecheng Jason Ma, William Liang, Guanzhi Wang, De-An Huang, Osbert Bastani, Dinesh Jayaraman, Yuke Zhu, Linxi Fan, Anima Anandkumar

    Abstract: Large Language Models (LLMs) have excelled as high-level semantic planners for sequential decision-making tasks. However, harnessing them to learn complex low-level manipulation tasks, such as dexterous pen spinning, remains an open problem. We bridge this fundamental gap and present Eureka, a human-level reward design algorithm powered by LLMs. Eureka exploits the remarkable zero-shot generation,… ▽ More

    Submitted 30 April, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. Project website and open-source code: https://eureka-research.github.io/

  27. arXiv:2310.04610  [pdf, other

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri , et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  28. arXiv:2310.00120  [pdf, other

    cs.LG

    Multi-Grid Tensorized Fourier Neural Operator for High-Resolution PDEs

    Authors: Jean Kossaifi, Nikola Kovachki, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Memory complexity and data scarcity have so far prohibited learning solution operators of partial differential equations (PDEs) at high resolutions. We address these limitations by introducing a new data efficient and highly parallelizable operator learning approach with reduced memory requirement and better generalization, called multi-grid tensorized neural operator (MG-TFNO). MG-TFNO scales to… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  29. arXiv:2309.15325  [pdf, other

    cs.LG physics.comp-ph

    Neural Operators for Accelerating Scientific Simulations and Design

    Authors: Kamyar Azizzadenesheli, Nikola Kovachki, Zongyi Li, Miguel Liu-Schiaffini, Jean Kossaifi, Anima Anandkumar

    Abstract: Scientific discovery and engineering design are currently limited by the time and cost of physical experiments, selected mostly through trial-and-error and intuition that require deep domain expertise. Numerical simulations present an alternative to physical experiments but are usually infeasible for complex real-world domains due to the computational requirements of existing numerical methods. Ar… ▽ More

    Submitted 4 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  30. arXiv:2309.05693  [pdf, other

    quant-ph

    Near-Term Distributed Quantum Computation using Mean-Field Corrections and Auxiliary Qubits

    Authors: Abigail McClain Gomez, Taylor L. Patti, Anima Anandkumar, Susanne F. Yelin

    Abstract: Distributed quantum computation is often proposed to increase the scalability of quantum hardware, as it reduces cooperative noise and requisite connectivity by sharing quantum information between distant quantum devices. However, such exchange of quantum information itself poses unique engineering challenges, requiring high gate fidelity and costly non-local operations. To mitigate this, we propo… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  31. arXiv:2309.00583  [pdf, other

    cs.LG math.NA

    Geometry-Informed Neural Operator for Large-Scale 3D PDEs

    Authors: Zongyi Li, Nikola Borislavov Kovachki, Chris Choy, Boyi Li, Jean Kossaifi, Shourya Prakash Otta, Mohammad Amin Nabian, Maximilian Stadler, Christian Hundt, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We propose the geometry-informed neural operator (GINO), a highly efficient approach to learning the solution operator of large-scale partial differential equations with varying geometries. GINO uses a signed distance function and point-cloud representations of the input shape and neural operators based on graph and Fourier architectures to learn the solution operator. The graph neural operator ha… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  32. arXiv:2308.08794  [pdf, other

    cs.LG math.DS

    Tip** Point Forecasting in Non-Stationary Dynamics on Function Spaces

    Authors: Miguel Liu-Schiaffini, Clare E. Singer, Nikola Kovachki, Tapio Schneider, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Tip** points are abrupt, drastic, and often irreversible changes in the evolution of non-stationary and chaotic dynamical systems. For instance, increased greenhouse gas concentrations are predicted to lead to drastic decreases in low cloud cover, referred to as a climatological tip** point. In this paper, we learn the evolution of such non-stationary dynamical systems using a novel recurrent… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 29 pages, 15 figures

  33. arXiv:2308.04556  [pdf, other

    cs.CV

    FocalFormer3D : Focusing on Hard Instance for 3D Object Detection

    Authors: Yilun Chen, Zhiding Yu, Yukang Chen, Shiyi Lan, Animashree Anandkumar, Jiaya Jia, Jose Alvarez

    Abstract: False negatives (FN) in 3D object detection, {\em e.g.}, missing predictions of pedestrians, vehicles, or other obstacles, can lead to potentially dangerous situations in autonomous driving. While being fatal, this issue is understudied in many current 3D detection methods. In this work, we propose Hard Instance Probing (HIP), a general pipeline that identifies \textit{FN} in a multi-stage manner… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  34. arXiv:2308.02236  [pdf, other

    cs.CV

    FB-BEV: BEV Representation from Forward-Backward View Transformations

    Authors: Zhiqi Li, Zhiding Yu, Wenhai Wang, Anima Anandkumar, Tong Lu, Jose M. Alvarez

    Abstract: View Transformation Module (VTM), where transformations happen between multi-view image features and Bird-Eye-View (BEV) representation, is a crucial step in camera-based BEV perception systems. Currently, the two most prominent VTM paradigms are forward projection and backward projection. Forward projection, represented by Lift-Splat-Shoot, leads to sparsely projected BEV features without post-pr… ▽ More

    Submitted 17 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: Accept to ICCV 2023, camera-ready version

  35. arXiv:2307.15034  [pdf, other

    cs.LG math.NA

    Guaranteed Approximation Bounds for Mixed-Precision Neural Operators

    Authors: Renbo Tu, Colin White, Jean Kossaifi, Boris Bonev, Nikola Kovachki, Gennady Pekhimenko, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Neural operators, such as Fourier Neural Operators (FNO), form a principled approach for learning solution operators for PDEs and other map**s between function spaces. However, many real-world problems require high-resolution training data, and the training time and limited GPU memory pose big barriers. One solution is to train neural operators in mixed precision to reduce the memory requirement… ▽ More

    Submitted 5 May, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: ICLR 2024

  36. arXiv:2307.14988  [pdf, other

    cs.LG cs.CL stat.ML

    Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs

    Authors: Or Sharir, Anima Anandkumar

    Abstract: Deep learning often faces the challenge of efficiently processing dynamic inputs, such as sensor data or user inputs. For example, an AI writing assistant is required to update its suggestions in real time as a document is edited. Re-running the model each time is expensive, even with compression techniques like knowledge distillation, pruning, or quantization. Instead, we take an incremental comp… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  37. arXiv:2307.08423  [pdf, other

    cs.LG physics.comp-ph

    Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

    Authors: Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, YuQing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence , et al. (38 additional authors not shown)

    Abstract: Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences. Today, AI has started to advance natural sciences by improving, accelerating, and enabling our understanding of natural phenomena at a wide range of spatial and temporal scales, giving rise to a new area of research known as AI for science (AI4Science). Being an emerging research paradigm, AI4Sc… ▽ More

    Submitted 15 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  38. arXiv:2306.15742  [pdf, other

    cs.CV

    Differentially Private Video Activity Recognition

    Authors: Zelun Luo, Yuliang Zou, Yi** Yang, Zane Durante, De-An Huang, Zhiding Yu, Chaowei Xiao, Li Fei-Fei, Animashree Anandkumar

    Abstract: In recent years, differential privacy has seen significant advancements in image classification; however, its application to video activity recognition remains under-explored. This paper addresses the challenges of applying differential privacy to video activity recognition, which primarily stem from: (1) a discrepancy between the desired privacy level for entire videos and the nature of input dat… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  39. arXiv:2306.15626  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

    Authors: Kaiyu Yang, Aidan M. Swope, Alex Gu, Rahul Chalamala, Peiyang Song, Shixing Yu, Saad Godil, Ryan Prenger, Anima Anandkumar

    Abstract: Large language models (LLMs) have shown promise in proving formal theorems using proof assistants such as Lean. However, existing methods are difficult to reproduce or build on, due to private code, data, and large compute requirements. This has created substantial barriers to research on machine learning methods for theorem proving. This paper removes these barriers by introducing LeanDojo: an op… ▽ More

    Submitted 27 October, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 (Datasets and Benchmarks Track) as an oral presentation. Data, code, and models available at https://leandojo.org/

  40. arXiv:2306.11250  [pdf, other

    cs.LG

    InRank: Incremental Low-Rank Learning

    Authors: Jiawei Zhao, Yifei Zhang, Beidi Chen, Florian Schäfer, Anima Anandkumar

    Abstract: The theory of greedy low-rank learning (GLRL) aims to explain the impressive generalization capabilities of deep learning. It proves that stochastic gradient-based training implicitly regularizes neural networks towards low-rank solutions through a gradual increase of the rank during training. However, there is a gap between theory and practice since GLRL requires an infinitesimal initialization o… ▽ More

    Submitted 31 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  41. arXiv:2306.09375  [pdf, other

    cs.LG physics.chem-ph q-bio.QM

    Symmetry-Informed Geometric Representation for Molecules, Proteins, and Crystalline Materials

    Authors: Shengchao Liu, Weitao Du, Yan**g Li, Zhuoxinran Li, Zhiling Zheng, Chenru Duan, Zhiming Ma, Omar Yaghi, Anima Anandkumar, Christian Borgs, Jennifer Chayes, Hongyu Guo, Jian Tang

    Abstract: Artificial intelligence for scientific discovery has recently generated significant interest within the machine learning and scientific communities, particularly in the domains of chemistry, biology, and material discovery. For these scientific problems, molecules serve as the fundamental building blocks, and machine learning has emerged as a highly effective and powerful tool for modeling their g… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  42. arXiv:2306.09305  [pdf, other

    cs.CV cs.AI cs.LG

    Fast Training of Diffusion Models with Masked Transformers

    Authors: Hongkai Zheng, Weili Nie, Arash Vahdat, Anima Anandkumar

    Abstract: We propose an efficient approach to train large diffusion models with masked transformers. While masked transformers have been extensively explored for representation learning, their application to generative learning is less explored in the vision domain. Our work is the first to exploit masked training to reduce the training cost of diffusion models significantly. Specifically, we randomly mask… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  43. arXiv:2306.08754  [pdf, other

    cs.LG physics.ao-ph

    ClimSim: A large multi-scale dataset for hybrid physics-ML climate emulation

    Authors: Sungduk Yu, Walter Hannah, Liran Peng, Jerry Lin, Mohamed Aziz Bhouri, Ritwik Gupta, Björn Lütjens, Justus Christopher Will, Gunnar Behrens, Julius Busecke, Nora Loose, Charles I Stern, Tom Beucler, Bryce Harrop, Benjamin R Hillman, Andrea Jenney, Savannah Ferretti, Nana Liu, Anima Anandkumar, Noah D Brenowitz, Veronika Eyring, Nicholas Geneva, Pierre Gentine, Stephan Mandt, Jaideep Pathak , et al. (31 additional authors not shown)

    Abstract: Modern climate projections lack adequate spatial and temporal resolution due to computational constraints. A consequence is inaccurate and imprecise predictions of critical processes such as storms. Hybrid methods that combine physics with machine learning (ML) have introduced a new generation of higher fidelity climate simulators that can sidestep Moore's Law by outsourcing compute-hungry, short,… ▽ More

    Submitted 6 February, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Outstanding Datasets and Benchmarks Track Paper

  44. arXiv:2306.03838  [pdf, other

    cs.LG math.NA physics.ao-ph physics.comp-ph

    Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere

    Authors: Boris Bonev, Thorsten Kurth, Christian Hundt, Jaideep Pathak, Maximilian Baust, Karthik Kashinath, Anima Anandkumar

    Abstract: Fourier Neural Operators (FNOs) have proven to be an efficient and effective method for resolution-independent operator learning in a broad variety of application areas across scientific machine learning. A key reason for their success is their ability to accurately model long-range dependencies in spatio-temporal data by learning global convolutions in a computationally efficient manner. To this… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  45. arXiv:2305.18246  [pdf, other

    cs.LG

    Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

    Authors: Haque Ishfaq, Qingfeng Lan, Pan Xu, A. Rupam Mahmood, Doina Precup, Anima Anandkumar, Kamyar Azizzadenesheli

    Abstract: We present a scalable and effective exploration strategy based on Thompson sampling for reinforcement learning (RL). One of the key shortcomings of existing Thompson sampling algorithms is the need to perform a Gaussian approximation of the posterior distribution, which is not a good surrogate in most practical settings. We instead directly sample the Q function from its posterior distribution, by… ▽ More

    Submitted 17 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Published in The Twelfth International Conference on Learning Representations (ICLR) 2024

  46. arXiv:2305.16291  [pdf, other

    cs.AI cs.LG

    Voyager: An Open-Ended Embodied Agent with Large Language Models

    Authors: Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar

    Abstract: We introduce Voyager, the first LLM-powered embodied lifelong learning agent in Minecraft that continuously explores the world, acquires diverse skills, and makes novel discoveries without human intervention. Voyager consists of three key components: 1) an automatic curriculum that maximizes exploration, 2) an ever-growing skill library of executable code for storing and retrieving complex behavio… ▽ More

    Submitted 19 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Project website and open-source codebase: https://voyager.minedojo.org/

  47. arXiv:2305.13220  [pdf, other

    cs.CV

    Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense Grids

    Authors: Wei Dong, Chris Choy, Charles Loop, Or Litany, Yuke Zhu, Anima Anandkumar

    Abstract: Indoor scene reconstruction from monocular images has long been sought after by augmented reality and robotics developers. Recent advances in neural field representations and monocular priors have led to remarkable results in scene-level surface reconstructions. The reliance on Multilayer Perceptrons (MLP), however, significantly limits speed in training and rendering. In this work, we propose to… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: CVPR 2023

  48. arXiv:2304.14554  [pdf, other

    physics.med-ph cond-mat.soft physics.bio-ph physics.flu-dyn

    AI-aided Geometric Design of Anti-infection Catheters

    Authors: Tingtao Zhou, Xuan Wan, Daniel Zhengyu Huang, Zongyi Li, Zhiwei Peng, Anima Anandkumar, John F. Brady, Paul W. Sternberg, Chiara Daraio

    Abstract: Bacteria can swim upstream due to hydrodynamic interactions with the fluid flow in a narrow tube, and pose a clinical threat of urinary tract infection to patients implanted with catheters. Coatings and structured surfaces have been proposed as a way to suppress bacterial contamination in catheters. However, there is no surface structuring or coating approach to date that thoroughly addresses the… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: maint text 4 figures, SI 5 figures

  49. arXiv:2304.06762  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study

    Authors: Boxin Wang, Wei **, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro

    Abstract: Large decoder-only language models (LMs) can be largely improved in terms of perplexity by retrieval (e.g., RETRO), but its impact on text generation quality and downstream task accuracy is unclear. Thus, it is still an open question: shall we pretrain large autoregressive LMs with retrieval? To answer it, we perform a comprehensive study on a scalable pre-trained retrieval-augmented LM (i.e., RET… ▽ More

    Submitted 20 December, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: EMNLP 2023

  50. arXiv:2303.02506  [pdf, other

    cs.LG cs.AI cs.CV

    Prismer: A Vision-Language Model with Multi-Task Experts

    Authors: Shikun Liu, Linxi Fan, Edward Johns, Zhiding Yu, Chaowei Xiao, Anima Anandkumar

    Abstract: Recent vision-language models have shown impressive multi-modal generation capabilities. However, typically they require training huge models on massive datasets. As a more scalable alternative, we introduce Prismer, a data- and parameter-efficient vision-language model that leverages an ensemble of task-specific experts. Prismer only requires training of a small number of components, with the maj… ▽ More

    Submitted 18 January, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: Published at TMLR 2024. Project Page: https://shikun.io/projects/prismer Code: https://github.com/NVlabs/prismer