Showing 1–50 of 154 results for author: Pal, C

  1. arXiv:2407.02362  [pdf, other]

    cs.AR cs.AI cs.LG

    Fast, Scalable, Energy-Efficient Non-element-wise Matrix Multiplication on FPGA

    Authors: Xuqi Zhu, Huaizhi Zhang, JunKyu Lee, Jiacheng Zhu, Chandrajit Pal, Sangeet Saha, Klaus D. McDonald-Maier, Xiaojun Zhai

    Abstract: Modern Neural Network (NN) architectures heavily rely on vast numbers of multiply-accumulate arithmetic operations, constituting the predominant computational cost. Therefore, this paper proposes a high-throughput, scalable and energy efficient non-element-wise matrix multiplication unit on FPGAs as a basic component of the NNs. We firstly streamline inter-layer and intra-layer redundancies of MAD…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.11811  [pdf, other]

    cs.CL cs.AI

    RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

    Authors: Joao Monteiro, Pierre-Andre Noel, Etienne Marcotte, Sai Rajeswar, Valentina Zantedeschi, David Vazquez, Nicolas Chapados, Christopher Pal, Perouz Taslakian

    Abstract: Large Language Models (LLMs) are trained on vast amounts of data, most of which is automatically scraped from the internet. This data includes encyclopedic documents that harbor a vast amount of general knowledge (e.g., Wikipedia) but also potentially overlap with benchmark datasets used for evaluating LLMs. Consequently, evaluating models on test splits that might have leaked into the training se…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.05630  [pdf, other]

    cs.CV

    Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion

    Authors: Ge Ya Luo, Zhi Hao Luo, Anthony Gosselin, Alexia Jolicoeur-Martineau, Christopher Pal

    Abstract: With recent advances in video prediction, controllable video generation has been attracting more attention. Generating high fidelity videos according to simple and flexible conditioning is of particular interest. To this end, we propose a controllable video generation model using pixel level renderings of 2D or 3D bounding boxes as conditioning. In addition, we also create a bounding box predictor…

    Submitted 21 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

  4. arXiv:2406.04940  [pdf, other]

    cs.LG cs.AI

    CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux Modelling

    Authors: Matthew Fortier, Mats L. Richter, Oliver Sonnentag, Chris Pal

    Abstract: Terrestrial carbon fluxes provide vital information about our biosphere's health and its capacity to absorb anthropogenic CO$_2$ emissions. The importance of predicting carbon fluxes has led to the emerging field of data-driven carbon flux modelling (DDCFM), which uses statistical techniques to predict carbon fluxes from biophysical data. However, the field lacks a standardized dataset to promote…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 9 content pages, 11 reference pages, 9 appendix pages

  5. arXiv:2405.13022  [pdf, other]

    cs.CL cs.LG

    LLMs can learn self-restraint through iterative self-reflection

    Authors: Alexandre Piché, Aristides Milios, Dzmitry Bahdanau, Chris Pal

    Abstract: In order to be deployed safely, Large Language Models (LLMs) must be capable of dynamically adapting their behavior based on their level of knowledge and uncertainty associated with specific topics. This adaptive behavior, which we refer to as self-restraint, is non-trivial to teach since it depends on the internal knowledge of an LLM. By default, LLMs are trained to maximize the next token likeli…

    Submitted 3 July, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  6. arXiv:2404.15420  [pdf, other]

    cs.CL cs.AI

    XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

    Authors: João Monteiro, Étienne Marcotte, Pierre-André Noël, Valentina Zantedeschi, David Vázquez, Nicolas Chapados, Christopher Pal, Perouz Taslakian

    Abstract: In-context learning (ICL) approaches typically leverage prompting to condition decoder-only language model generation on reference information. Just-in-time processing of a context is inefficient due to the quadratic cost of self-attention operations, and caching is desirable. However, caching transformer states can easily require almost as much space as the model parameters. When the right contex…

    Submitted 23 April, 2024; originally announced April 2024.

  7. arXiv:2403.19918  [pdf, other]

    cs.RO cs.AI cs.LG

    CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning

    Authors: Luke Rowe, Roger Girgis, Anthony Gosselin, Bruno Carrez, Florian Golemo, Felix Heide, Liam Paull, Christopher Pal

    Abstract: Evaluating autonomous vehicle stacks (AVs) in simulation typically involves replaying driving logs from real-world recorded traffic. However, agents replayed from offline data are not reactive and hard to intuitively control. Existing approaches address these challenges by proposing methods that rely on heuristics or generative models of real-world data but these approaches either lack realism or…

    Submitted 14 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 21 pages, 9 figures, 8 tables

  8. arXiv:2403.14443  [pdf, other]

    cs.AI cs.CL cs.GT cs.LG cs.MA cs.SI

    Language Models Can Reduce Asymmetry in Information Markets

    Authors: Nasim Rahaman, Martin Weiss, Manuel Wüthrich, Yoshua Bengio, Li Erran Li, Chris Pal, Bernhard Schölkopf

    Abstract: This work addresses the buyer's inspection paradox for information markets. The paradox is that buyers need to access information to determine its value, while sellers need to limit access to prevent theft. To study this, we introduce an open-source simulated digital marketplace where intelligent agents, powered by language models, buy and sell information on behalf of external participants. The c…

    Submitted 21 March, 2024; originally announced March 2024.

  9. arXiv:2403.05236  [pdf, other]

    eess.SY

    Fault Recovery and Transient Stability of Grid-Forming Converters Equipped with Current Saturation

    Authors: Ali Arjomandi-Nezhad, Yifei Guo, Bikash C. Pal, Guangya Yang

    Abstract: When grid-forming (GFM) inverter-based resources (IBRs) experience large grid disturbances (e.g., short-circuit faults), the current limiter may be triggered and GFM IBRs enter the current saturation mode, inducing nonlinear dynamical behaviors and imposing great challenges to the post-disturbance transient angle stability. This paper presents a systematic study to reveal the fault recovery behavi…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 10 pages, 22 figures

  10. arXiv:2402.06143  [pdf, other]

    cs.RO

    Reinforcement Learning for Blind Stair Climbing with Legged and Wheeled-Legged Robots

    Authors: Simon Chamorro, Victor Klemm, Miguel de la Iglesia Valls, Christopher Pal, Roland Siegwart

    Abstract: In recent years, legged and wheeled-legged robots have gained prominence for tasks in environments predominantly created for humans across various domains. One significant challenge faced by many of these robots is their limited capability to navigate stairs, which hampers their functionality in multi-story environments. This study proposes a method aimed at addressing this limitation, employing r…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Video: https://youtu.be/Ec6ar8BVJh4

  11. arXiv:2402.01788  [pdf, other]

    cs.CL cs.AI cs.IR

    LitLLM: A Toolkit for Scientific Literature Review

    Authors: Shubham Agarwal, Issam H. Laradji, Laurent Charlin, Christopher Pal

    Abstract: Conducting literature reviews for scientific papers is essential for understanding research, its limitations, and building on existing work. It is a tedious task which makes an automatic literature review generator appealing. Unfortunately, many existing works that generate such reviews using Large Language Models (LLMs) have significant limitations. They tend to hallucinate-generate non-actual in…

    Submitted 1 February, 2024; originally announced February 2024.

  12. arXiv:2312.13876  [pdf, other]

    cs.LG cs.CL stat.ML

    Capture the Flag: Uncovering Data Insights with Large Language Models

    Authors: Issam Laradji, Perouz Taslakian, Sai Rajeswar, Valentina Zantedeschi, Alexandre Lacoste, Nicolas Chapados, David Vazquez, Christopher Pal, Alexandre Drouin

    Abstract: The extraction of a small number of relevant insights from vast amounts of data is a crucial component of data-driven decision-making. However, accomplishing this task requires considerable technical skills, domain expertise, and human labor. This study explores the potential of using Large Language Models (LLMs) to automate the discovery of insights in data, leveraging recent advances in reasonin…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 14 pages, 1 figure, Foundation Models for Decision Making Workshop at NeurIPS 2023

  13. arXiv:2312.11556  [pdf, other]

    cs.CV cs.AI cs.CL

    StarVector: Generating Scalable Vector Graphics Code from Images

    Authors: Juan A. Rodriguez, Shubham Agarwal, Issam H. Laradji, Pau Rodriguez, David Vazquez, Christopher Pal, Marco Pedersoli

    Abstract: Scalable Vector Graphics (SVGs) have become integral in modern image rendering applications due to their infinite scalability in resolution, versatile usability, and editing capabilities. SVGs are particularly popular in the fields of web development and graphic design. Existing approaches for SVG modeling using deep learning often struggle with generating complex SVGs and are restricted to simple…

    Submitted 17 December, 2023; originally announced December 2023.

  14. arXiv:2310.13817  [pdf, other]

    eess.SY

    Deep Learning Based Forecasting-Aided State Estimation in Active Distribution Networks

    Authors: Malek Alduhaymi, Ravindra Singh, Firdous Ul Nazir, Bikash C. Pal

    Abstract: Operating an active distribution network (ADN) in the absence of enough measurements, the presence of distributed energy resources, and poor knowledge of responsive demand behaviour is a huge challenge. This paper introduces systematic modelling of demand response behaviour which is then included in Forecasting Aided State Estimation (FASE) for better control of the network. There are several inno…

    Submitted 20 October, 2023; originally announced October 2023.

  15. arXiv:2310.11382  [pdf]

    physics.flu-dyn

    Condensate droplet roaming on nanostructured superhydrophobic surfaces

    Authors: Cheuk Wing Edmond Lam, Kartik Regulagadda, Matteo Donati, Abinash Tripathy, Gopal Chandra Pal, Chander Shekhar Sharma, Athanasios Milionis, Dimos Poulikakos

    Abstract: Jumping of coalescing condensate droplets from superhydrophobic surfaces is an interesting phenomenon which yields marked heat transfer enhancement over the more explored gravity-driven droplet removal mode in surface condensation, a phase change process of central interest to applications ranging from energy to water harvesting. However, when condensate microdroplets coalesce, they can also spont…

    Submitted 17 October, 2023; originally announced October 2023.

  16. arXiv:2309.11592  [pdf, other]

    cs.CE

    Parallel-mentoring for Offline Model-based Optimization

    Authors: Can Chen, Christopher Beckham, Zixuan Liu, Xue Liu, Christopher Pal

    Abstract: We study offline model-based optimization to maximize a black-box objective function with a static dataset of designs and scores. These designs encompass a variety of domains, including materials, robots and DNA sequences. A common approach trains a proxy on the static dataset to approximate the black-box objective function and performs gradient ascent to obtain new designs. However, this often re…

    Submitted 10 October, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted by NeurIPS 2023

  17. arXiv:2308.01020  [pdf, other]

    eess.SY

    A Model Predictive Approach for Enhancing Transient Stability of Grid-Forming Converters

    Authors: Ali Arjomandi-Nezhad, Yifei Guo, Bikash C. Pal, Damiano Varagnolo

    Abstract: A model predictive control (MPC) method for enhancing post-fault transient stability of a grid-forming (GFM) inverter based resources (IBRs) is developed in this paper. This proposed controller is activated as soon as the converter enters into the post-fault current-saturation mode. It aims at mitigating the instability arising from insufficient deceleration due to current saturation and thus impr…

    Submitted 8 November, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 14 pages, 19 figures

  18. arXiv:2306.09539  [pdf, other]

    cs.CL cs.LG

    Block-State Transformers

    Authors: Mahan Fathi, Jonathan Pilault, Orhan Firat, Christopher Pal, Pierre-Luc Bacon, Ross Goroshin

    Abstract: State space models (SSMs) have shown impressive results on tasks that require modeling long-range dependencies and efficiently scale to long sequences owing to their subquadratic runtime complexity. Originally designed for continuous signals, SSMs have shown superior performance on a plethora of tasks, in vision and audio; however, SSMs still lag Transformer performance in Language Modeling tasks.…

    Submitted 30 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: NeurIPS'23 - Thirty-seventh Conference on Neural Information Processing Systems

  19. arXiv:2306.04620  [pdf, other]

    cs.LG q-bio.BM

    Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design

    Authors: Julien Roy, Pierre-Luc Bacon, Christopher Pal, Emmanuel Bengio

    Abstract: In recent years, in-silico molecular design has received much attention from the machine learning community. When designing a new compound for pharmaceutical applications, there are usually multiple properties of such molecules that need to be optimised: binding energy to the target, synthesizability, toxicity, EC50, and so on. While previous approaches have employed a scalarization scheme to turn…

    Submitted 29 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 14 pages

  20. arXiv:2306.01729  [pdf, other]

    cs.CL cs.AI

    Improving Generalization in Task-oriented Dialogues with Workflows and Action Plans

    Authors: Stefania Raimondo, Christopher Pal, Xiaotian Liu, David Vazquez, Hector Palacios

    Abstract: Task-oriented dialogue is difficult in part because it involves understanding user intent, collecting information from the user, executing API calls, and generating helpful and fluent responses. However, for complex tasks one must also correctly do all of these things over multiple steps, and in a specific order. While large pre-trained language models can be fine-tuned end-to-end to create multi-…

    Submitted 2 June, 2023; originally announced June 2023.

  21. arXiv:2306.00637  [pdf, other]

    cs.CV

    Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

    Authors: Pablo Pernias, Dominic Rampas, Mats L. Richter, Christopher J. Pal, Marc Aubreville

    Abstract: We introduce Würstchen, a novel architecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models. A key contribution of our work is to develop a latent diffusion technique in which we learn a detailed but extremely compact semantic image representation used to guide the diffusion process. This highly…

    Submitted 29 September, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Corresponding to "Würstchen v2"

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR), 2024

  22. arXiv:2305.16397  [pdf, other]

    cs.CV cs.AI cs.CL

    Are Diffusion Models Vision-And-Language Reasoners?

    Authors: Benno Krojer, Elinor Poole-Dayan, Vikram Voleti, Christopher Pal, Siva Reddy

    Abstract: Text-conditioned image generation models have recently shown immense qualitative success using denoising diffusion processes. However, unlike discriminative vision-and-language models, it is a non-trivial task to subject these diffusion-based generative models to automatic fine-grained quantitative evaluation of high-level phenomena such as compositionality. Towards this goal, we perform two innov…

    Submitted 2 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023

  23. arXiv:2305.00970  [pdf, other]

    cs.CV

    ArK: Augmented Reality with Knowledge Interactive Emergent Ability

    Authors: Qiuyuan Huang, Jae Sung Park, Abhinav Gupta, Paul Bennett, Ran Gong, Subhojit Som, Baolin Peng, Owais Khan Mohammed, Chris Pal, Yejin Choi, Jianfeng Gao

    Abstract: Despite the growing adoption of mixed reality and interactive AI agents, it remains challenging for these systems to generate high quality 2D/3D scenes in unseen environments. The common practice requires deploying an AI agent to collect large amounts of data for model training for every new task. This process is costly, or even impossible, for many domains. In this study, we develop an infinite a…

    Submitted 1 May, 2023; originally announced May 2023.

    Report number: EFI-94-11

  24. arXiv:2304.13722  [pdf, other]

    cs.CV

    Controllable Image Generation via Collage Representations

    Authors: Arantxa Casanova, Marlène Careil, Adriana Romero-Soriano, Christopher J. Pal, Jakob Verbeek, Michal Drozdzal

    Abstract: Recent advances in conditional generative image models have enabled impressive results. On the one hand, text-based conditional models have achieved remarkable generation quality, by leveraging large-scale datasets of image-text pairs. To enable fine-grained controllability, however, text-based models require long prompts, whose details may be ignored by the model. On the other hand, layout-based…

    Submitted 26 April, 2023; originally announced April 2023.

  25. arXiv:2304.05330  [pdf]

    physics.flu-dyn physics.app-ph

    Controlled coalescence-induced droplet jumping on flexible superhydrophobic substrates

    Authors: Gopal Chandra Pal, Siddharth SS, Manish Agarwal, Chander Shekhar Sharma

    Abstract: Sessile droplets coalescing on superhydrophobic substrates spontaneously jump from the surface. In this process, the excess surface energy available at the initiation of coalescence overcomes the minimal surface adhesion and manifests as sufficient kinetic energy to propel the droplets away from the substrate. Here, we show that the coalescence induced droplet jumping velocity is significantly cur…

    Submitted 11 April, 2023; originally announced April 2023.

  26. arXiv:2304.03866  [pdf, other]

    stat.ML cs.AI cs.LG

    Conservative objective models are a special kind of contrastive divergence-based energy model

    Authors: Christopher Beckham, Christopher Pal

    Abstract: In this work we theoretically show that conservative objective models (COMs) for offline model-based optimisation (MBO) are a special kind of contrastive divergence-based energy model, one where the energy function represents both the unconditional probability of the input and the conditional probability of the reward variable. While the initial formulation only samples modes from its learned dist…

    Submitted 7 April, 2023; originally announced April 2023.

  27. arXiv:2302.07400  [pdf, other]

    cs.LG math.FA stat.ML

    Score-based Diffusion Models in Function Space

    Authors: Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar

    Abstract: Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many…

    Submitted 22 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages

    MSC Class: 46B09 (Primary); 60J22 (Secondary) ACM Class: I.2.6; J.2

  28. arXiv:2302.05507  [pdf, other]

    cs.CL cs.AI cs.LG

    Language Decision Transformers with Exponential Tilt for Interactive Text Environments

    Authors: Nicolas Gontier, Pau Rodriguez, Issam Laradji, David Vazquez, Christopher Pal

    Abstract: Text-based game environments are challenging because agents must deal with long sequences of text, execute compositional actions using text and learn from sparse rewards. We address these challenges by proposing Language Decision Transformers (LDTs), a framework that is based on transformer language models and decision transformers (DTs). Our LDTs extend DTs with 3 components: (1) exponential tilt…

    Submitted 17 November, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: 19 pages, 6 figures, 5 tables

  29. arXiv:2212.01639  [pdf, other]

    stat.ML cs.CV cs.LG

    Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

    Authors: Christopher Beckham, Martin Weiss, Florian Golemo, Sina Honari, Derek Nowrouzezahrai, Christopher Pal

    Abstract: Different types of mental rotation tests have been used extensively in psychology to understand human visual reasoning and perception. Understanding what an object or visual scene would look like from another viewpoint is a challenging problem that is made even harder if it must be performed from a single image. We explore a controlled setting whereby questions are posed about the properties of a…

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted for publication to Pattern Recognition journal

  30. arXiv:2211.14487  [pdf, other]

    cs.CV cs.AI cs.LG

    Receptive Field Refinement for Convolutional Neural Networks Reliably Improves Predictive Performance

    Authors: Mats L. Richter, Christopher Pal

    Abstract: Minimal changes to neural architectures (e.g. changing a single hyperparameter in a key layer), can lead to significant gains in predictive performance in Convolutional Neural Networks (CNNs). In this work, we present a new approach to receptive field analysis that can yield these types of theoretical and empirical performance gains across twenty well-known CNN architectures examined in our experi…

    Submitted 26 November, 2022; originally announced November 2022.

  31. arXiv:2211.10747  [pdf, other]

    stat.ML cs.LG

    Exploring validation metrics for offline model-based optimisation with diffusion models

    Authors: Christopher Beckham, Alexandre Piche, David Vazquez, Christopher Pal

    Abstract: In model-based optimisation (MBO) we are interested in using machine learning to design candidates that maximise some measure of reward with respect to a black box function called the (ground truth) oracle, which is expensive to compute since it involves executing a real world process. In offline MBO we wish to do so without assuming access to such an oracle during training or validation, with mak…

    Submitted 13 January, 2024; v1 submitted 19 November, 2022; originally announced November 2022.

  32. arXiv:2211.02348  [pdf, other]

    cs.LG cs.AI cs.CY

    A General Purpose Neural Architecture for Geospatial Systems

    Authors: Nasim Rahaman, Martin Weiss, Frederik Träuble, Francesco Locatello, Alexandre Lacoste, Yoshua Bengio, Chris Pal, Li Erran Li, Bernhard Schölkopf

    Abstract: Geospatial Information Systems are used by researchers and Humanitarian Assistance and Disaster Response (HADR) practitioners to support a wide variety of important applications. However, collaboration between these actors is difficult due to the heterogeneous nature of geospatial data modalities (e.g., multi-spectral images of various resolutions, timeseries, weather data) and diversity of tasks…

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Presented at AI + HADR Workshop at NeurIPS 2022

  33. arXiv:2211.01233  [pdf, other]

    cs.CV cs.AI cs.LG

    Attention-based Neural Cellular Automata

    Authors: Mattie Tesfaldet, Derek Nowrouzezahrai, Christopher Pal

    Abstract: Recent extensions of Cellular Automata (CA) have incorporated key ideas from modern deep learning, dramatically extending their capabilities and catalyzing a new family of Neural Cellular Automata (NCA) techniques. Inspired by Transformer-based architectures, our work presents a new class of $\textit{attention-based}$ NCAs formed using a spatially localized$\unicode{x2014}$yet globally organized…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  34. arXiv:2210.15251  [pdf, other]

    math.OC

    Optimal control for production inventory system with various cost criterion

    Authors: Subrata Golui, Chandan Pal, Manikandan R., Abhay Sobhanan

    Abstract: In this article, we investigate a dynamic control problem of a production-inventory system. Here, demands arrive at the production unit according to a Poisson process and are processed in an FCFS manner. The processing time of the customers' demand is the exponential distribution. The production manufacturers produce the items on a make-to-order basis to meet customer demands. The production is ru…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 5 figures

    MSC Class: 93E20 (Primary) 49L20; 60J27 (Secondary)

  35. arXiv:2210.12282   

    cs.LG

    Bridging the Gap Between Target Networks and Functional Regularization

    Authors: Alexandre Piche, Valentin Thomas, Joseph Marino, Rafael Pardinas, Gian Maria Marconi, Christopher Pal, Mohammad Emtiyaz Khan

    Abstract: Bootstrapping is behind much of the successes of Deep Reinforcement Learning. However, learning the value function via bootstrapping often leads to unstable training due to fast-changing target values. Target Networks are employed to stabilize training by using an additional set of lagging parameters to estimate the target values. Despite the popularity of Target Networks, their effect on the opti…

    Submitted 3 January, 2024; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: The published version of this paper (TMLR 2023) is available at arXiv:2106.02613 and https://openreview.net/forum?id=BFvoemrmqX

  36. arXiv:2210.12272  [pdf, other]

    stat.ML cs.LG cs.RO

    Implicit Offline Reinforcement Learning via Supervised Learning

    Authors: Alexandre Piche, Rafael Pardinas, David Vazquez, Igor Mordatch, Chris Pal

    Abstract: Offline Reinforcement Learning (RL) via Supervised Learning is a simple and effective way to learn robotic skills from a dataset collected by policies of different expertise levels. It is as simple as supervised learning and Behavior Cloning (BC), but takes advantage of return information. On datasets collected by policies of similar expertise, implicit BC has been shown to match or outperform exp…

    Submitted 21 October, 2022; originally announced October 2022.

  37. arXiv:2210.12254  [pdf, other]

    cs.LG cs.CV

    Score-based Denoising Diffusion with Non-Isotropic Gaussian Noise Models

    Authors: Vikram Voleti, Christopher Pal, Adam Oberman

    Abstract: Generative models based on denoising diffusion techniques have led to an unprecedented increase in the quality and diversity of imagery that is now possible to create with neural generative models. However, most contemporary state-of-the-art methods are derived from a standard isotropic Gaussian formulation. In this work we examine the situation where non-isotropic Gaussian distributions are used.…

    Submitted 22 November, 2022; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 Workshop ; 4 pages, 1 page of references, 18 pages of appendix, 2 figures

    Journal ref: NeurIPS 2022 Workshop on Score-Based Methods

  38. arXiv:2210.08031  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Neural Attentive Circuits

    Authors: Nasim Rahaman, Martin Weiss, Francesco Locatello, Chris Pal, Yoshua Bengio, Bernhard Schölkopf, Li Erran Li, Nicolas Ballas

    Abstract: Recent work has seen the development of general purpose neural architectures that can be trained to perform tasks across diverse data modalities. General purpose models typically make few assumptions about the underlying data-structure and are known to perform well in the large-data regime. At the same time, there has been growing interest in modular neural architectures that represent the data us…

    Submitted 19 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: To appear at NeurIPS 2022

  39. arXiv:2210.07453  [pdf, ps, other]

    cs.LG

    Using Graph Algorithms to Pretrain Graph Completion Transformers

    Authors: Jonathan Pilault, Michael Galkin, Bahare Fatemi, Perouz Taslakian, David Vasquez, Christopher Pal

    Abstract: Recent work on Graph Neural Networks has demonstrated that self-supervised pretraining can further enhance performance on downstream graph, link, and node classification tasks. However, the efficacy of pretraining tasks has not been fully investigated for downstream large knowledge graph completion tasks. Using a contextualized knowledge graph embedding approach, we investigate five different pret…

    Submitted 27 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

  40. arXiv:2208.08274  [pdf, other]

    cs.GR cs.LG

    SMPL-IK: Learned Morphology-Aware Inverse Kinematics for AI Driven Artistic Workflows

    Authors: Vikram Voleti, Boris N. Oreshkin, Florent Bocquelet, Félix G. Harvey, Louis-Simon Ménard, Christopher Pal

    Abstract: Inverse Kinematics (IK) systems are often rigid with respect to their input character, thus requiring user intervention to be adapted to new skeletons. In this paper we aim at creating a flexible, learned IK solver applicable to a wide variety of human morphologies. We extend a state-of-the-art machine learning IK solver to operate on the well known Skinned Multi-Person Linear model (SMPL). We cal…

    Submitted 16 August, 2022; originally announced August 2022.

  41. arXiv:2208.02377  [pdf, other]

    cs.LG cs.AI stat.ML

    Improving Meta-Learning Generalization with Activation-Based Early-Stopping

    Authors: Simon Guiroy, Christopher Pal, Gonçalo Mordido, Sarath Chandar

    Abstract: Meta-Learning algorithms for few-shot learning aim to train neural networks capable of generalizing to novel tasks using only a few examples. Early-stopping is critical for performance, halting model training when it reaches optimal generalization to the new task distribution. Early-stopping mechanisms in Meta-Learning typically rely on measuring the model performance on labeled examples from a me…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted at CoLLAs 2022. To be published in Proceedings of Machine Learning Research (PMLR)

  42. arXiv:2206.12067  [pdf, ps, other]

    math.OC

    Nonzero-Sum Risk-Sensitive Stochastic Differential Games: A Multi-parameter Eigenvalue Problem Approach

    Authors: Mrinal K. Ghosh, K. Suresh Kumar, Chandan Pal, Somnath Pradhan

    Abstract: We study nonzero-sum stochastic differential games with risk-sensitive ergodic cost criterion. Under certain conditions, using multi-parameter eigenvalue approach, we establish the existence of a Nash equilibrium in the space of stationary Markov strategies. We achieve our results by studying the relevant systems of coupled HJB equations. Exploiting the stochastic representation of the principal e…

    Submitted 24 June, 2022; originally announced June 2022.

  43. arXiv:2206.09962  [pdf, other]

    eess.SY

    Model-Free Optimal Control of Inverter for Dynamic Voltage Support

    Authors: Yifei Guo, Bikash C. Pal, Rabih A. Jabr

    Abstract: Inverter-based resources (IBRs) are required to provide dynamic voltage support (DVS) during voltage dips to enhance the low-voltage ride-through capability. In this paper, we develop a model-free control method to achieve the optimal DVS (ODVS) without relying on the knowledge of grid parameters. Delving into the optimum trajectory of the ODVS problem, it is found that either the current constrai…

    Submitted 20 June, 2022; originally announced June 2022.

  44. arXiv:2205.11690  [pdf, other]

    cs.CL

    Workflow Discovery from Dialogues in the Low Data Regime

    Authors: Amine El Hattami, Stefania Raimondo, Issam Laradji, David Vazquez, Pau Rodriguez, Chris Pal

    Abstract: Text-based dialogues are now widely used to solve real-world problems. In cases where solution strategies are already known, they can sometimes be codified into workflows and used to guide humans or artificial agents through the task of helping clients. We introduce a new problem formulation that we call Workflow Discovery (WD) in which we are interested in the situation where a formal workflow ma…

    Submitted 11 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

  45. arXiv:2205.09853  [pdf, other]

    cs.CV cs.AI cs.LG

    MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation

    Authors: Vikram Voleti, Alexia Jolicoeur-Martineau, Christopher Pal

    Abstract: Video prediction is a challenging task. The quality of video frames from current state-of-the-art (SOTA) generative models tends to be poor and generalization beyond the training data is difficult. Furthermore, existing prediction frameworks are typically not capable of simultaneously handling other video-related tasks such as unconditional generation or interpolation. In this work, we devise a ge…

    Submitted 12 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022; 10 pages, 4 figures, 7 tables

  46. arXiv:2203.16662  [pdf, other]

    stat.ML cs.LG

    Overcoming challenges in leveraging GANs for few-shot data augmentation

    Authors: Christopher Beckham, Issam Laradji, Pau Rodriguez, David Vazquez, Derek Nowrouzezahrai, Christopher Pal

    Abstract: In this paper, we explore the use of GAN-based few-shot data augmentation as a method to improve few-shot classification performance. We perform an exploration into how a GAN can be fine-tuned for such a task (one of which is in a class-incremental manner), as well as a rigorous empirical investigation into how well these models can perform to improve few-shot classification. We identify issues re…

    Submitted 8 August, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: v3 of the paper, various changes including better figures, CIFAR-100 results, and precision-recall metrics

  47. arXiv:2201.03790  [pdf, ps, other]

    math.OC

    Discrete-time Zero-Sum Games for Markov chains with risk-sensitive average cost criterion

    Authors: Mrinal K. Ghosh, Subrata Golui, Chandan Pal, Somnath Pradhan

    Abstract: We study zero-sum stochastic games for controlled discrete time Markov chains with risk-sensitive average cost criterion with countable state space and Borel action spaces. The payoff function is nonnegative and possibly unbounded. Under a certain Lyapunov stability assumption on the dynamics, we establish the existence of a value and saddle point equilibrium. Further we completely characterize al…

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: 28 pages

    MSC Class: 91A15; 91A25

  48. arXiv:2201.01787  [pdf, other]

    cs.CL cs.AI cs.LG

    Does Entity Abstraction Help Generative Transformers Reason?

    Authors: Nicolas Gontier, Siva Reddy, Christopher Pal

    Abstract: We study the utility of incorporating entity type abstractions into pre-trained Transformers and test these methods on four NLP tasks requiring different forms of logical reasoning: (1) compositional language understanding with text-based relational reasoning (CLUTRR), (2) abductive reasoning (ProofWriter), (3) multi-hop question answering (HotpotQA), and (4) conversational question answering (CoQ…

    Submitted 21 November, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: TMLR 2022; 28 pages; 9 tables; 1 figure

  49. arXiv:2112.12228  [pdf, other]

    cs.LG

    Direct Behavior Specification via Constrained Reinforcement Learning

    Authors: Julien Roy, Roger Girgis, Joshua Romoff, Pierre-Luc Bacon, Christopher Pal

    Abstract: The standard formulation of Reinforcement Learning lacks a practical way of specifying what are admissible and forbidden behaviors. Most often, practitioners go about the task of behavior specification by manually engineering the reward function, a counter-intuitive process that requires several iterations and is prone to reward hacking by the agent. In this work, we argue that constrained RL, whi…

    Submitted 18 June, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

  50. arXiv:2112.07342  [pdf, other]

    cs.LG cs.AI cs.MA

    Learning to Guide and to Be Guided in the Architect-Builder Problem

    Authors: Paul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer

    Abstract: We are interested in interactive agents that learn to coordinate, namely, a $builder$ -- which performs actions but ignores the goal of the task, i.e. has no access to rewards -- and an $architect$ which guides the builder towards the goal of the task. We define and explore a formal setting where artificial agents are equipped with mechanisms that allow them to simultaneously learn a task while at…

    Submitted 11 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: International Conference on Learning Representations (2022)