Skip to main content

Showing 1–38 of 38 results for author: Nowrouzezahrai, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03154  [pdf, other

    cs.LG cs.AI q-bio.BM

    Reinforcement Learning for Sequence Design Leveraging Protein Language Models

    Authors: Jithendaraa Subramanian, Shivakanth Sujit, Niloy Irtisam, Umong Sain, Derek Nowrouzezahrai, Samira Ebrahimi Kahou, Riashat Islam

    Abstract: Protein sequence design, determined by amino acid sequences, are essential to protein engineering problems in drug discovery. Prior approaches have resorted to evolutionary strategies or Monte-Carlo methods for protein design, but often fail to exploit the structure of the combinatorial search space, to generalize to unseen sequences. In the context of discrete black box optimization over large se… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 22 pages, 7 figures, 4 tables

  2. arXiv:2406.09328  [pdf, other

    cs.GR

    Learnable Fractal Flames

    Authors: Jordan J. Bannister, Derek Nowrouzezahrai

    Abstract: This work presents a differentiable rendering approach that allows latent fractal flame parameters to be learned from image supervision. The approach extends the state-of-the-art in differentiable fractal rendering through support for color images, non-linear generator functions, and multi-fractal compositions. With these additions, differentiable rendering is now a viable tool for the generation… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2402.08273  [pdf, other

    cs.GR

    Regional Adaptive Metropolis Light Transport

    Authors: Hisanari Otsu, Killian Herveau, Johannes Hanika, Derek Nowrouzezahrai, Carsten Dachsbacher

    Abstract: The design of the proposal distributions, and most notably the kernel parameters, are crucial for the performance of Markov chain Monte Carlo (MCMC) rendering. A poor selection of parameters can increase the correlation of the Markov chain and result in bad rendering performance. We approach this problem by a novel path perturbation strategy for online-learning of state-dependent kernel parameters… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 14 pages, 12 figures

  4. arXiv:2312.04574  [pdf, other

    cs.LG cs.AI cs.GR cs.NE

    Differentiable Visual Computing for Inverse Problems and Machine Learning

    Authors: Andrew Spielberg, Fangcheng Zhong, Konstantinos Rematas, Krishna Murthy Jatavallabhula, Cengiz Oztireli, Tzu-Mao Li, Derek Nowrouzezahrai

    Abstract: Originally designed for applications in computer graphics, visual computing (VC) methods synthesize information about physical and virtual worlds, using prescribed algorithms optimized for spatial computing. VC is used to analyze geometry, physically simulate solids, fluids, and other media, and render the world via optical techniques. These fine-tuned computations that operate explicitly on a giv… ▽ More

    Submitted 21 November, 2023; originally announced December 2023.

  5. arXiv:2311.17190  [pdf, other

    cs.LG cs.AI cs.MA

    Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play

    Authors: Daniel Bairamian, Philippe Marcotte, Joshua Romoff, Gabriel Robert, Derek Nowrouzezahrai

    Abstract: Recent advances in Competitive Self-Play (CSP) have achieved, or even surpassed, human level performance in complex game environments such as Dota 2 and StarCraft II using Distributed Multi-Agent Reinforcement Learning (MARL). One core component of these methods relies on creating a pool of learning agents -- consisting of the Main Agent, past versions of this agent, and Exploiter Agents -- where… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  6. arXiv:2310.01775  [pdf, other

    cs.RO cs.AI

    STAMP: Differentiable Task and Motion Planning via Stein Variational Gradient Descent

    Authors: Yewon Lee, Philip Huang, Krishna Murthy Jatavallabhula, Andrew Z. Li, Fabian Damken, Eric Heiden, Kevin Smith, Derek Nowrouzezahrai, Fabio Ramos, Florian Shkurti

    Abstract: Planning for many manipulation tasks, such as using tools or assembling parts, often requires both symbolic and geometric reasoning. Task and Motion Planning (TAMP) algorithms typically solve these problems by conducting a tree search over high-level task sequences while checking for kinematic and dynamic feasibility. This can be inefficient as the width of the tree can grow exponentially with the… ▽ More

    Submitted 7 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 14 pages, 9 figures, Learning Effective Abstractions for Planning (LEAP) Workshop at CoRL 2023

    ACM Class: I.2.9

  7. arXiv:2309.08387  [pdf, other

    cs.GR cs.CV cs.LG

    Efficient Graphics Representation with Differentiable Indirection

    Authors: Sayantan Datta, Carl Marshall, Derek Nowrouzezahrai, Zhao Dong, Zhengqin Li

    Abstract: We introduce differentiable indirection -- a novel learned primitive that employs differentiable multi-scale lookup tables as an effective substitute for traditional compute and data operations across the graphics pipeline. We demonstrate its flexibility on a number of graphics tasks, i.e., geometric and image representation, texture map**, shading, and radiance field representation. In all case… ▽ More

    Submitted 17 November, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Project website: https://sayan1an.github.io/din.html

    Journal ref: SIGGRAPH Asia 2023 Conference Papers (SA Conference Papers '23), December 12--15, 2023, Sydney, NSW, Australia

  8. arXiv:2305.17198  [pdf, other

    cs.LG cs.AI cs.MA

    A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem

    Authors: Paul Barde, Jakob Foerster, Derek Nowrouzezahrai, Amy Zhang

    Abstract: Training multiple agents to coordinate is an essential problem with applications in robotics, game theory, economics, and social sciences. However, most existing Multi-Agent Reinforcement Learning (MARL) methods are online and thus impractical for real-world applications in which collecting new interactions is costly or dangerous. While these algorithms should leverage offline data when available,… ▽ More

    Submitted 18 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  9. arXiv:2303.08133  [pdf, other

    cs.GR cs.AI cs.CV cs.LG

    MeshDiffusion: Score-based Generative 3D Mesh Modeling

    Authors: Zhen Liu, Yao Feng, Michael J. Black, Derek Nowrouzezahrai, Liam Paull, Weiyang Liu

    Abstract: We consider the task of generating realistic 3D shapes, which is useful for a variety of applications such as automatic scene generation and physical simulation. Compared to other 3D representations like voxels and point clouds, meshes are more desirable in practice, because (1) they enable easy and arbitrary manipulation of shapes for relighting and simulation, and (2) they can fully leverage the… ▽ More

    Submitted 15 April, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 (Spotlight, Notable-top-25%)

  10. Neural Shadow Map**

    Authors: Sayantan Datta, Derek Nowrouzezahrai, Christoph Schied, Zhao Dong

    Abstract: We present a neural extension of basic shadow map** for fast, high quality hard and soft shadows. We compare favorably to fast pre-filtering shadow map**, all while producing visual results on par with ray traced hard and soft shadows. We show that combining memory bandwidth-aware architecture specialization and careful temporal-window training leads to a fast, compact and easy-to-train neural… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Project Page: https://sayan1an.github.io/neuralShadowMap**.html

    Journal ref: ACM SIGGRAPH 2022 Conference Proceedings

  11. arXiv:2212.01639  [pdf, other

    stat.ML cs.CV cs.LG

    Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

    Authors: Christopher Beckham, Martin Weiss, Florian Golemo, Sina Honari, Derek Nowrouzezahrai, Christopher Pal

    Abstract: Different types of mental rotation tests have been used extensively in psychology to understand human visual reasoning and perception. Understanding what an object or visual scene would look like from another viewpoint is a challenging problem that is made even harder if it must be performed from a single image. We explore a controlled setting whereby questions are posed about the properties of a… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted for publication to Pattern Recognition journal

  12. arXiv:2211.01233  [pdf, other

    cs.CV cs.AI cs.LG

    Attention-based Neural Cellular Automata

    Authors: Mattie Tesfaldet, Derek Nowrouzezahrai, Christopher Pal

    Abstract: Recent extensions of Cellular Automata (CA) have incorporated key ideas from modern deep learning, dramatically extending their capabilities and catalyzing a new family of Neural Cellular Automata (NCA) techniques. Inspired by Transformer-based architectures, our work presents a new class of $\textit{attention-based}$ NCAs formed using a spatially localized$\unicode{x2014}$yet globally organized… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  13. arXiv:2211.00519  [pdf, other

    cs.GR cs.CV

    Learning Neural Implicit Representations with Surface Signal Parameterizations

    Authors: Yanran Guan, Andrei Chubarau, Ruby Rao, Derek Nowrouzezahrai

    Abstract: Neural implicit surface representations have recently emerged as popular alternative to explicit 3D object encodings, such as polygonal meshes, tabulated points, or voxels. While significant work has improved the geometric fidelity of these representations, much less attention is given to their final appearance. Traditional explicit object representations commonly couple the 3D shape data with aux… ▽ More

    Submitted 25 June, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

  14. arXiv:2210.13583  [pdf, other

    cs.LG cs.AI stat.ME

    Learning Latent Structural Causal Models

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Nan Rosemary Ke, Tristan Deleu, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level causal variables are given. However, in machine learning tasks, one often operates on low-level data like image pixels or high-dimensional vectors. In such setti… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 21 pages, 19 figures

  15. arXiv:2210.00978  [pdf, other

    cs.CV

    Uncertainty-Driven Active Vision for Implicit Scene Reconstruction

    Authors: Edward J. Smith, Michal Drozdzal, Derek Nowrouzezahrai, David Meger, Adriana Romero-Soriano

    Abstract: Multi-view implicit scene reconstruction methods have become increasingly popular due to their ability to represent complex scene details. Recent efforts have been devoted to improving the representation of input information and to reducing the number of views required to obtain high quality reconstructions. Yet, perhaps surprisingly, the study of which views to select to maximally improve scene u… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

  16. arXiv:2207.05723  [pdf, other

    cs.LG cs.AI stat.ML

    Latent Variable Models for Bayesian Causal Discovery

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Learning predictors that do not rely on spurious correlations involves building causal representations. However, learning such a representation is very challenging. We, therefore, formulate the problem of learning a causal representation from high dimensional data and study causal recovery with synthetic data. This work introduces a latent variable decoder model, Decoder BCD, for Bayesian causal d… ▽ More

    Submitted 10 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 7 figures, Published at the ICML 2022 Workshop on Spurious Correlations, Invariance, and Stability

  17. arXiv:2203.16662  [pdf, other

    stat.ML cs.LG

    Overcoming challenges in leveraging GANs for few-shot data augmentation

    Authors: Christopher Beckham, Issam Laradji, Pau Rodriguez, David Vazquez, Derek Nowrouzezahrai, Christopher Pal

    Abstract: In this paper, we explore the use of GAN-based few-shot data augmentation as a method to improve few-shot classification performance. We perform an exploration into how a GAN can be fine-tuned for such a task (one of which is in a class-incremental manner), as well as a rigorous empirical investigation into how well these models can perform to improve few-shot classification. We identify issues re… ▽ More

    Submitted 8 August, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: v3 of the paper, various changes including better figures, CIFAR-100 results, and precision-recall metrics

  18. arXiv:2203.03570  [pdf, other

    cs.CV cs.GR cs.LG

    Kubric: A scalable dataset generator

    Authors: Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti, Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi , et al. (10 additional authors not shown)

    Abstract: Data is the driving force of machine learning, with the amount and quality of training data often being more important for the performance of a system than architecture and training details. But collecting, processing and annotating real data at scale is difficult, expensive, and frequently raises additional privacy, fairness and legal concerns. Synthetic data is a powerful tool with the potential… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 21 pages, CVPR2022

  19. arXiv:2112.07342  [pdf, other

    cs.LG cs.AI cs.MA

    Learning to Guide and to Be Guided in the Architect-Builder Problem

    Authors: Paul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer

    Abstract: We are interested in interactive agents that learn to coordinate, namely, a $builder$ -- which performs actions but ignores the goal of the task, i.e. has no access to rewards -- and an $architect$ which guides the builder towards the goal of the task. We define and explore a formal setting where artificial agents are equipped with mechanisms that allow them to simultaneously learn a task while at… ▽ More

    Submitted 11 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: International Conference on Learning Representations (2022)

  20. arXiv:2108.09593  [pdf, other

    cs.CV

    SSR: Semi-supervised Soft Rasterizer for single-view 2D to 3D Reconstruction

    Authors: Issam Laradji, Pau Rodríguez, David Vazquez, Derek Nowrouzezahrai

    Abstract: Recent work has made significant progress in learning object meshes with weak supervision. Soft Rasterization methods have achieved accurate 3D reconstruction from 2D images with viewpoint supervision only. In this work, we further reduce the labeling effort by allowing such 3D reconstruction methods leverage unlabeled images. In order to obtain the viewpoints for these unlabeled images, we propos… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

  21. arXiv:2108.05263  [pdf, other

    cs.GR

    Dynamic Diffuse Global Illumination Resampling

    Authors: Zander Majercik, Thomas Müller, Alexander Keller, Derek Nowrouzezahrai, Morgan McGuire

    Abstract: Interactive global illumination remains a challenge in radiometrically- and geometrically-complex scenes. Specialized sampling strategies are effective for specular and near-specular transport because the scattering has relatively low directional variance per scattering event. In contrast, the high variance from transport paths comprising multiple rough glossy or diffuse scattering events remains… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

  22. arXiv:2104.02646  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    gradSim: Differentiable simulation for system identification and visuomotor control

    Authors: Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo, Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jerome Parent-Levesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

    Abstract: We consider the problem of estimating an object's physical properties such as mass, friction, and elasticity directly from video sequences. Such a system identification problem is fundamentally ill-posed due to the loss of information during image formation. Current solutions require precise 3D labels which are labor-intensive to gather, and infeasible to create for many systems such as deformable… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: ICLR 2021. Project page (and a dynamic web version of the article): https://gradsim.github.io

  23. arXiv:2103.15163  [pdf, other

    cs.GR

    Countering Racial Bias in Computer Graphics Research

    Authors: Theodore Kim, Holly Rushmeier, Julie Dorsey, Derek Nowrouzezahrai, Raqi Syed, Wojciech Jarosz, A. M. Darke

    Abstract: Current computer graphics research practices contain racial biases that have resulted in investigations into "skin" and "hair" that focus on the hegemonic visual features of Europeans and East Asians. To broaden our research horizons to encompass all of humanity, we propose a variety of improvements to quantitative measures and qualitative practices, and pose novel, open research problems.

    Submitted 2 June, 2022; v1 submitted 28 March, 2021; originally announced March 2021.

    Comments: 2 pages

  24. arXiv:2102.04942  [pdf, other

    cs.CV cs.GR cs.LG

    Robust Motion In-betweening

    Authors: Félix G. Harvey, Mike Yurick, Derek Nowrouzezahrai, Christopher Pal

    Abstract: In this work we present a novel, robust transition generation technique that can serve as a new tool for 3D animators, based on adversarial recurrent neural networks. The system synthesizes high-quality motions that use temporally-sparse keyframes as animation constraints. This is reminiscent of the job of in-betweening in traditional animation pipelines, in which an animator draws motion frames b… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: Published at SIGGRAPH 2020

  25. arXiv:2101.10994  [pdf, other

    cs.CV cs.GR

    Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes

    Authors: Towaki Takikawa, Joey Litalien, Kangxue Yin, Karsten Kreis, Charles Loop, Derek Nowrouzezahrai, Alec Jacobson, Morgan McGuire, Sanja Fidler

    Abstract: Neural signed distance functions (SDFs) are emerging as an effective representation for 3D shapes. State-of-the-art methods typically encode the SDF with a large, fixed-size neural network to approximate complex shapes with implicit surfaces. Rendering with these large networks is, however, computationally expensive since it requires many forward passes through the network for every pixel, making… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

  26. arXiv:2011.03149  [pdf, other

    cs.CV

    Affinity LCFCN: Learning to Segment Fish with Weak Supervision

    Authors: Issam Laradji, Alzayat Saleh, Pau Rodriguez, Derek Nowrouzezahrai, Mostafa Rahimi Azghadi, David Vazquez

    Abstract: Aquaculture industries rely on the availability of accurate fish body measurements, e.g., length, width and mass. Manual methods that rely on physical tools like rulers are time and labour intensive. Leading automatic approaches rely on fully-supervised segmentation models to acquire these measurements but these require collecting per-pixel labels -- also time consuming and laborious: i.e., it can… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

  27. arXiv:2010.03691  [pdf, other

    cs.LG

    Regularized Inverse Reinforcement Learning

    Authors: Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

    Abstract: Inverse Reinforcement Learning (IRL) aims to facilitate a learner's ability to imitate expert behavior by acquiring reward functions that explain the expert's decisions. Regularized IRL applies strongly convex regularizers to the learner's policy in order to avoid the expert's behavior being rationalized by arbitrary constant rewards, also known as degenerate solutions. We propose tractable soluti… ▽ More

    Submitted 2 December, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: 26 pages, 7 figures

  28. arXiv:2009.09808  [pdf, other

    cs.GR cs.CG cs.CV

    On the Effectiveness of Weight-Encoded Neural Implicit 3D Shapes

    Authors: Thomas Davies, Derek Nowrouzezahrai, Alec Jacobson

    Abstract: A neural implicit outputs a number indicating whether the given query point in space is inside, outside, or on a surface. Many prior works have focused on _latent-encoded_ neural implicits, where a latent vector encoding of a specific shape is also fed as input. While affording latent-space interpolation, this comes at the cost of reconstruction accuracy for any _single_ shape. Training a specific… ▽ More

    Submitted 17 January, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

  29. arXiv:2007.07012  [pdf, other

    eess.IV cs.CV

    A Weakly Supervised Region-Based Active Learning Method for COVID-19 Segmentation in CT Images

    Authors: Issam Laradji, Pau Rodriguez, Frederic Branchaud-Charron, Keegan Lensink, Parmida Atighehchian, William Parker, David Vazquez, Derek Nowrouzezahrai

    Abstract: One of the key challenges in the battle against the Coronavirus (COVID-19) pandemic is to detect and quantify the severity of the disease in a timely manner. Computed tomographies (CT) of the lungs are effective for assessing the state of the infection. Unfortunately, labeling CT scans can take a lot of time and effort, with up to 150 minutes per scan. We address this challenge introducing a scala… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

  30. arXiv:2007.02180  [pdf, other

    eess.IV cs.CV

    A Weakly Supervised Consistency-based Learning Method for COVID-19 Segmentation in CT Images

    Authors: Issam Laradji, Pau Rodriguez, Oscar Mañas, Keegan Lensink, Marco Law, Lironne Kurzman, William Parker, David Vazquez, Derek Nowrouzezahrai

    Abstract: Coronavirus Disease 2019 (COVID-19) has spread aggressively across the world causing an existential health crisis. Thus, having a system that automatically detects COVID-19 in tomography (CT) images can assist in quantifying the severity of the illness. Unfortunately, labelling chest CT scans requires significant domain expertise, time, and effort. We address these labelling challenges by only req… ▽ More

    Submitted 7 July, 2020; v1 submitted 4 July, 2020; originally announced July 2020.

  31. arXiv:2006.13258  [pdf, other

    cs.LG cs.AI stat.ML

    Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

    Authors: Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

    Abstract: Adversarial Imitation Learning alternates between learning a discriminator -- which tells apart expert's demonstrations from generated ones -- and a generator's policy to produce trajectories that can fool this discriminator. This alternated optimization is known to be delicate in practice since it compounds unstable adversarial training with brittle and sample-inefficient reinforcement learning.… ▽ More

    Submitted 16 April, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  32. arXiv:2006.01659  [pdf, other

    cs.LG stat.ML

    Surprisal-Triggered Conditional Computation with Neural Networks

    Authors: Loren Lugosch, Derek Nowrouzezahrai, Brett H. Meyer

    Abstract: Autoregressive neural network models have been used successfully for sequence generation, feature extraction, and hypothesis scoring. This paper presents yet another use for these models: allocating more computation to more difficult inputs. In our model, an autoregressive model is used both to extract features and to predict observations in a stream of input observations. The surprisal of the inp… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

  33. arXiv:2003.14166  [pdf, other

    cs.CV cs.LG stat.ML

    Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation

    Authors: Sai Rajeswar, Fahim Mannan, Florian Golemo, Jérôme Parent-Lévesque, David Vazquez, Derek Nowrouzezahrai, Aaron Courville

    Abstract: We infer and generate three-dimensional (3D) scene information from a single input image and without supervision. This problem is under-explored, with most prior work relying on supervision from, e.g., 3D ground-truth, multiple images of a scene, image silhouettes or key-points. We propose Pix2Shape, an approach to solve this problem with four components: (i) an encoder that infers the latent 3D r… ▽ More

    Submitted 17 April, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: This is a pre-print of an article published in International Journal of Computer Vision. The final authenticated version is available online at: https://doi.org/10.1007/s11263-020-01322-1

    Journal ref: International Journal of Computer Vision, (2020), 1-16

  34. arXiv:2002.10525  [pdf, other

    cs.MA cs.LG

    Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

    Authors: Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai, Joelle Pineau

    Abstract: Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a recent approach that applies single-agent AIRL to multi-agent problems where we seek to recover both policies for our agents and reward functions that promote expert-like behavior. While MA-AIRL has promising results on cooperative and competitive tasks, it is sample-inefficient and has only been validated empirically for small… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

  35. arXiv:1911.03594  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Robo-PlaNet: Learning to Poke in a Day

    Authors: Maxime Chevalier-Boisvert, Guillaume Alain, Florian Golemo, Derek Nowrouzezahrai

    Abstract: Recently, the Deep Planning Network (PlaNet) approach was introduced as a model-based reinforcement learning method that learns environment dynamics directly from pixel observations. This architecture is useful for learning tasks in which either the agent does not have access to meaningful states (like position/velocity of robotic joints) or where the observed states significantly deviate from the… ▽ More

    Submitted 19 November, 2019; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: 4 pages, 3 figures. Version 2: added reference and acknowledgement

  36. arXiv:1910.13249  [pdf, other

    cs.CV cs.HC cs.LG

    Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments

    Authors: Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira E. Kahou, Joseph P. Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo, Chris Pal

    Abstract: Millions of blind and visually-impaired (BVI) people navigate urban environments every day, using smartphones for high-level path-planning and white canes or guide dogs for local information. However, many BVI people still struggle to travel to new places. In our endeavor to create a navigation assistant for the BVI, we found that existing Reinforcement Learning (RL) environments were unsuitable f… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at CoRL2019. Code & video available at https://mweiss17.github.io/SEVN/

  37. arXiv:1908.02269  [pdf, other

    cs.LG cs.MA stat.ML

    Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

    Authors: Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal

    Abstract: In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide t… ▽ More

    Submitted 9 November, 2020; v1 submitted 6 August, 2019; originally announced August 2019.

    Comments: 23 pages, 16 figures. This revised version contains additional results and minor edits

  38. arXiv:1808.02651  [pdf, other

    cs.LG cs.CV cs.GR stat.ML

    Beyond Pixel Norm-Balls: Parametric Adversaries using an Analytically Differentiable Renderer

    Authors: Hsueh-Ti Derek Liu, Michael Tao, Chun-Liang Li, Derek Nowrouzezahrai, Alec Jacobson

    Abstract: Many machine learning image classifiers are vulnerable to adversarial attacks, inputs with perturbations designed to intentionally trigger misclassification. Current adversarial methods directly alter pixel colors and evaluate against pixel norm-balls: pixel perturbations smaller than a specified magnitude, according to a measurement norm. This evaluation, however, has limited practical utility si… ▽ More

    Submitted 17 February, 2019; v1 submitted 8 August, 2018; originally announced August 2018.