
Showing 1–39 of 39 results for author: Kahou, S

  1. arXiv:2407.03154  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Reinforcement Learning for Sequence Design Leveraging Protein Language Models

    Authors: Jithendaraa Subramanian, Shivakanth Sujit, Niloy Irtisam, Umong Sain, Derek Nowrouzezahrai, Samira Ebrahimi Kahou, Riashat Islam

    Abstract: Protein sequence design, determined by amino acid sequences, is essential to protein engineering problems in drug discovery. Prior approaches have resorted to evolutionary strategies or Monte-Carlo methods for protein design, but often fail to exploit the structure of the combinatorial search space, to generalize to unseen sequences. In the context of discrete black box optimization over large se…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 22 pages, 7 figures, 4 tables

  2. arXiv:2406.01361  [pdf, other]

    cs.LG

    Learning to Play Atari in a World of Tokens

    Authors: Pranav Agarwal, Sheldon Andrews, Samira Ebrahimi Kahou

    Abstract: Model-based reinforcement learning agents utilizing transformers have shown improved sample efficiency due to their ability to model extended context, resulting in more accurate world models. However, for complex reasoning and planning tasks, these methods primarily rely on continuous representations. This complicates modeling of discrete properties of the real world such as disjoint object classe…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024

  3. arXiv:2405.18751  [pdf, other]

    cs.CV cs.AI

    On the Limits of Multi-modal Meta-Learning with Auxiliary Task Modulation Using Conditional Batch Normalization

    Authors: Jordi Armengol-Estapé, Vincent Michalski, Ramnath Kumar, Pierre-Luc St-Charles, Doina Precup, Samira Ebrahimi Kahou

    Abstract: Few-shot learning aims to learn representations that can tackle novel tasks given a small number of examples. Recent studies show that cross-modal learning can improve representations for few-shot classification. More specifically, language is a rich modality that can be used to guide visual learning. In this work, we experiment with a multi-modal architecture for few-shot learning that consists o…

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2312.00966  [pdf, other]

    cs.LG cs.AI

    Spectral Temporal Contrastive Learning

    Authors: Sacha Morin, Somjit Nath, Samira Ebrahimi Kahou, Guy Wolf

    Abstract: Learning useful data representations without requiring labels is a cornerstone of modern deep learning. Self-supervised learning methods, particularly contrastive learning (CL), have proven successful by leveraging data augmentations to define positive pairs. This success has prompted a number of theoretical studies to better understand CL and investigate theoretical bounds for downstream linear p…

    Submitted 7 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted to Self-Supervised Learning - Theory and Practice, NeurIPS Workshop, 2023

  5. arXiv:2311.11108  [pdf, other]

    cs.LG

    Auxiliary Losses for Learning Generalizable Concept-based Models

    Authors: Ivaxi Sheth, Samira Ebrahimi Kahou

    Abstract: The increasing use of neural networks in various applications has led to increasing apprehensions, underscoring the necessity to understand their operations beyond mere final predictions. As a solution to enhance model transparency, Concept Bottleneck Models (CBMs) have gained popularity since their introduction. CBMs essentially limit the latent space of a model to human-understandable high-leve…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023

  6. arXiv:2310.10702  [pdf, other]

    cs.LG

    Transparent Anomaly Detection via Concept-based Explanations

    Authors: Laya Rafiee Sevyeri, Ivaxi Sheth, Farhood Farahnak, Samira Ebrahimi Kahou, Shirin Abbasinejad Enger

    Abstract: Advancements in deep learning techniques have given a boost to the performance of anomaly detection. However, real-world and safety-critical applications demand a level of transparency and reasoning beyond accuracy. The task of anomaly detection (AD) focuses on finding whether a given sample follows the learned distribution. Existing methods lack the ability to reason with clear explanations for t…

    Submitted 1 November, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS XAI in Action workshop

  7. arXiv:2307.13081  [pdf, other]

    cs.LG cs.AI

    Fairness Under Demographic Scarce Regime

    Authors: Patrik Joslin Kenfack, Samira Ebrahimi Kahou, Ulrich Aïvodji

    Abstract: Most existing works on fairness assume the model has full access to demographic information. However, there exist scenarios where demographic information is partially available because a record was not maintained throughout data collection or due to privacy reasons. This setting is known as demographic scarce regime. Prior research has shown that training an attribute classifier to replace the mi…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 14 pages, 7 pages

  8. arXiv:2307.05979  [pdf, other]

    cs.LG cs.AI cs.CV

    Transformers in Reinforcement Learning: A Survey

    Authors: Pranav Agarwal, Aamer Abdul Rahman, Pierre-Luc St-Charles, Simon J. D. Prince, Samira Ebrahimi Kahou

    Abstract: Transformers have significantly impacted domains like natural language processing, computer vision, and robotics, where they improve performance compared to other neural networks. This survey explores how transformers are used in reinforcement learning (RL), where they are seen as a promising solution for addressing challenges such as unstable training, credit assignment, lack of interpretability,…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 35 pages, 11 figures

  9. arXiv:2306.11128  [pdf, other]

    cs.LG cs.MA

    CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning

    Authors: Nikunj Gupta, Somjit Nath, Samira Ebrahimi Kahou

    Abstract: Before taking actions in an environment with more than one intelligent agent, an autonomous agent may benefit from reasoning about the other agents and utilizing a notion of a guarantee or confidence about the behavior of the system. In this article, we propose a novel multi-agent reinforcement learning (MARL) algorithm CAMMARL, which involves modeling the actions of other agents in different situ…

    Submitted 8 February, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

  10. arXiv:2304.13892  [pdf, other]

    cs.LG cs.AI

    Discovering Object-Centric Generalized Value Functions From Pixels

    Authors: Somjit Nath, Gopeshh Raaj Subbaraj, Khimya Khetarpal, Samira Ebrahimi Kahou

    Abstract: Deep Reinforcement Learning has shown significant progress in extracting useful representations from high-dimensional inputs albeit using hand-crafted auxiliary tasks and pseudo rewards. Automatically learning such representations in an object-centric manner geared towards control and fast adaptation remains an open research problem. In this paper, we introduce a method that tries to discover mean…

    Submitted 27 June, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at ICML 2023

  11. arXiv:2304.02798  [pdf, other]

    cs.LG cs.CV

    Source-free Domain Adaptation Requires Penalized Diversity

    Authors: Laya Rafiee Sevyeri, Ivaxi Sheth, Farhood Farahnak, Alexandre See, Samira Ebrahimi Kahou, Thomas Fevens, Mohammad Havaei

    Abstract: While neural networks are capable of achieving human-like performance in many tasks such as image classification, the impressive performance of each model is limited to its own dataset. Source-free domain adaptation (SFDA) was introduced to address knowledge transfer between different domains in the absence of source data, thus, increasing data privacy. Diversity in representation space can be vit…

    Submitted 12 April, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  12. arXiv:2212.08131  [pdf, other]

    cs.LG

    Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies

    Authors: Shivakanth Sujit, Pedro H. M. Braga, Jorg Bornschein, Samira Ebrahimi Kahou

    Abstract: Reinforcement learning (RL) has shown great promise with algorithms learning in environments with large state and action spaces purely from scalar reward signals. A crucial challenge for current deep RL algorithms is that they require a tremendous amount of environment interactions for learning. This can be infeasible in situations where such interactions are expensive; such as in robotics. Offlin…

    Submitted 21 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: TMLR 2023

  13. arXiv:2211.15071  [pdf, other]

    cs.CV

    Pitfalls of Conditional Batch Normalization for Contextual Multi-Modal Learning

    Authors: Ivaxi Sheth, Aamer Abdul Rahman, Mohammad Havaei, Samira Ebrahimi Kahou

    Abstract: Humans have perfected the art of learning from multiple modalities through sensory organs. Despite their impressive predictive performance on a single modality, neural networks cannot reach human level accuracy with respect to multiple modalities. This is a particularly challenging task due to variations in the structure of respective modalities. Conditional Batch Normalization (CBN) is a popular…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted at ICBINB workshop @ NeurIPS 2022

  14. arXiv:2211.07941  [pdf, other]

    cs.RO cs.AI cs.LG

    Automatic Evaluation of Excavator Operators using Learned Reward Functions

    Authors: Pranav Agarwal, Marek Teichmann, Sheldon Andrews, Samira Ebrahimi Kahou

    Abstract: Training novice users to operate an excavator for learning different skills requires the presence of expert teachers. Considering the complexity of the problem, it is comparatively expensive to find skilled experts as the process is time-consuming and requires precise focus. Moreover, since humans tend to be biased, the evaluation process is noisy and will lead to high variance in the final score…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 11 pages, 5 figures, Accepted at Reinforcement Learning for Real Life (RL4RealLife) Workshop at NeurIPS 2022

  15. arXiv:2211.05610  [pdf, other]

    cs.CL

    BERT on a Data Diet: Finding Important Examples by Gradient-Based Pruning

    Authors: Mohsen Fayyaz, Ehsan Aghazadeh, Ali Modarressi, Mohammad Taher Pilehvar, Yadollah Yaghoobzadeh, Samira Ebrahimi Kahou

    Abstract: Current pre-trained language models rely on large datasets for achieving state-of-the-art performance. However, past research has shown that not all examples in a dataset are equally important during training. In fact, it is sometimes possible to prune a considerable fraction of the training set while maintaining the test performance. Established on standard vision benchmarks, two gradient-based s…

    Submitted 28 November, 2022; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: ENLSP @ NeurIPS2022

  16. arXiv:2210.13583  [pdf, other]

    cs.LG cs.AI stat.ME

    Learning Latent Structural Causal Models

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Nan Rosemary Ke, Tristan Deleu, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level causal variables are given. However, in machine learning tasks, one often operates on low-level data like image pixels or high-dimensional vectors. In such setti…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 21 pages, 19 figures

  17. arXiv:2209.09441  [pdf, other]

    cs.LG cs.AI

    Locally Constrained Representations in Reinforcement Learning

    Authors: Somjit Nath, Rushiv Arora, Samira Ebrahimi Kahou

    Abstract: The success of Reinforcement Learning (RL) heavily relies on the ability to learn robust representations from the observations of the environment. In most cases, the representations learned purely by the reinforcement learning loss can differ vastly across states depending on how the value functions change. However, the representations learned need not be very specific to the task at hand. Relying…

    Submitted 8 February, 2024; v1 submitted 19 September, 2022; originally announced September 2022.

  18. arXiv:2208.10483  [pdf, other]

    cs.LG cs.AI

    Prioritizing Samples in Reinforcement Learning with Reducible Loss

    Authors: Shivakanth Sujit, Somjit Nath, Pedro H. M. Braga, Samira Ebrahimi Kahou

    Abstract: Most reinforcement learning algorithms take advantage of an experience replay buffer to repeatedly train on samples the agent has observed in the past. Not all samples carry the same amount of significance and simply assigning equal importance to each of the samples is a naïve strategy. In this paper, we propose a method to prioritize samples based on how much we can learn from a sample. We define…

    Submitted 1 November, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: NeurIPS 2023

  19. arXiv:2207.05723  [pdf, other]

    cs.LG cs.AI stat.ML

    Latent Variable Models for Bayesian Causal Discovery

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Learning predictors that do not rely on spurious correlations involves building causal representations. However, learning such a representation is very challenging. We, therefore, formulate the problem of learning a causal representation from high dimensional data and study causal recovery with synthetic data. This work introduces a latent variable decoder model, Decoder BCD, for Bayesian causal d…

    Submitted 10 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 7 figures, Published at the ICML 2022 Workshop on Spurious Correlations, Invariance, and Stability

  20. arXiv:2206.00092  [pdf, other]

    cs.CV

    FHIST: A Benchmark for Few-shot Classification of Histological Images

    Authors: Fereshteh Shakeri, Malik Boudiaf, Sina Mohammadi, Ivaxi Sheth, Mohammad Havaei, Ismail Ben Ayed, Samira Ebrahimi Kahou

    Abstract: Few-shot learning has recently attracted wide interest in image classification, but almost all the current public benchmarks are focused on natural images. The few-shot paradigm is highly relevant in medical-imaging applications due to the scarcity of labeled data, as annotations are expensive and require specialized expertise. However, in medical imaging, few-shot learning research is sparse, lim…

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: Code available at: https://github.com/mboudiaf/Few-shot-histology

  21. arXiv:2109.03292  [pdf, other]

    cs.CV cs.LG eess.IV

    Simple Video Generation using Neural ODEs

    Authors: David Kanaa, Vikram Voleti, Samira Ebrahimi Kahou, Christopher Pal

    Abstract: Despite having been studied to a great extent, the task of conditional generation of sequences of frames, or videos, remains extremely challenging. It is a common belief that a key step towards solving this task resides in modelling accurately both spatial and temporal information in video signals. A promising direction to do so has been to learn latent variable models that predict the future in l…

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: 8 pages, 4 figures, NeurIPS 2019 workshop

    Journal ref: NeurIPS 2019 Workshop

  22. arXiv:2104.00563  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Latent Variable Sequential Set Transformers For Joint Multi-Agent Motion Prediction

    Authors: Roger Girgis, Florian Golemo, Felipe Codevilla, Martin Weiss, Jim Aldon D'Souza, Samira Ebrahimi Kahou, Felix Heide, Christopher Pal

    Abstract: Robust multi-agent trajectory prediction is essential for the safe control of robotic systems. A major challenge is to efficiently learn a representation that approximates the true joint distribution of contextual, social, and temporal information to enable planning. We propose Latent Variable Sequential Set Transformers which are encoder-decoder architectures that generate scene-consistent multi-…

    Submitted 10 February, 2022; v1 submitted 19 February, 2021; originally announced April 2021.

    Comments: 26 pages, 17 figures, 8 tables

  23. arXiv:2103.03098  [pdf, other]

    cs.LG stat.ML

    Accounting for Variance in Machine Learning Benchmarks

    Authors: Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

    Abstract: Strong empirical evidence that one machine-learning algorithm A outperforms another one B ideally calls for multiple trials optimizing the learning pipeline over sources of variation such as data sampling, data augmentation, parameter initialization, and hyperparameters choices. This is prohibitively expensive, and corners are cut to reach conclusions. We model the whole benchmarking process, reve…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to MLSys2021

  24. arXiv:2011.14371  [pdf, ps, other]

    cs.LG

    Predicting Regional Locust Swarm Distribution with Recurrent Neural Networks

    Authors: Hadia Mohmmed Osman Ahmed Samil, Annabelle Martin, Arnav Kumar Jain, Susan Amin, Samira Ebrahimi Kahou

    Abstract: Locust infestation of some regions in the world, including Africa, Asia and Middle East has become a concerning issue that can affect the health and the lives of millions of people. In this respect, there have been attempts to resolve or reduce the severity of this problem via detection and monitoring of locust breeding areas using satellites and sensors, or the use of chemicals to prevent the for…

    Submitted 12 November, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

  25. arXiv:2002.06460  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    HighRes-net: Recursive Fusion for Multi-Frame Super-Resolution of Satellite Imagery

    Authors: Michel Deudon, Alfredo Kalaitzis, Israel Goytom, Md Rifat Arefin, Zhichao Lin, Kris Sankaran, Vincent Michalski, Samira E. Kahou, Julien Cornebise, Yoshua Bengio

    Abstract: Generative deep learning has sparked a new wave of Super-Resolution (SR) algorithms that enhance single images with impressive aesthetic results, albeit with imaginary details. Multi-frame Super-Resolution (MFSR) offers a more grounded approach to the ill-posed problem, by conditioning on multiple low-resolution views. This is important for satellite monitoring of human impact on the planet -- fro…

    Submitted 15 February, 2020; originally announced February 2020.

    Comments: 15 pages, 5 figures

  26. arXiv:1910.13249  [pdf, other]

    cs.CV cs.HC cs.LG

    Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments

    Authors: Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira E. Kahou, Joseph P. Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo, Chris Pal

    Abstract: Millions of blind and visually-impaired (BVI) people navigate urban environments every day, using smartphones for high-level path-planning and white canes or guide dogs for local information. However, many BVI people still struggle to travel to new places. In our endeavor to create a navigation assistant for the BVI, we found that existing Reinforcement Learning (RL) environments were unsuitable f…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at CoRL2019. Code & video available at https://mweiss17.github.io/SEVN/

  27. arXiv:1908.00061  [pdf, other]

    cs.CV cs.LG

    An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation

    Authors: Vincent Michalski, Vikram Voleti, Samira Ebrahimi Kahou, Anthony Ortiz, Pascal Vincent, Chris Pal, Doina Precup

    Abstract: Batch normalization has been widely used to improve optimization in deep neural networks. While the uncertainty in batch statistics can act as a regularizer, using these dataset statistics specific to the training set impairs generalization in certain tasks. Recently, alternative methods for normalizing feature activations in neural networks have been proposed. Among them, group normalization has…

    Submitted 31 July, 2019; originally announced August 2019.

  28. arXiv:1902.06704  [pdf, other]

    cs.NE cs.LG stat.ML

    Towards Non-saturating Recurrent Units for Modelling Long-term Dependencies

    Authors: Sarath Chandar, Chinnadhurai Sankar, Eugene Vorontsov, Samira Ebrahimi Kahou, Yoshua Bengio

    Abstract: Modelling long-term dependencies is a challenge for recurrent neural networks. This is primarily due to the fact that gradients vanish during training, as the sequence length increases. Gradients can be attenuated by transition operators and are attenuated or dropped by activation functions. Canonical architectures like LSTM alleviate this issue by skipping information through a memory mechanism.…

    Submitted 22 January, 2019; originally announced February 2019.

    Comments: In Proceedings of AAAI 2019

  29. arXiv:1812.07617  [pdf, other]

    cs.LG cs.CL cs.IR stat.ML

    Towards Deep Conversational Recommendations

    Authors: Raymond Li, Samira Kahou, Hannes Schulz, Vincent Michalski, Laurent Charlin, Chris Pal

    Abstract: There has been growing interest in using neural networks and deep learning techniques to create dialogue systems. Conversational recommendation is an interesting setting for the scientific exploration of dialogue with natural language as the associated discourse involves goal-driven dialogue that often transforms naturally into more free-form chat. This paper provides two contributions. First, unt…

    Submitted 4 March, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: 17 pages, 5 figures, Accepted at 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Canada

  30. arXiv:1811.09845  [pdf, other]

    cs.CV

    Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction

    Authors: Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor

    Abstract: Conditional text-to-image generation is an active area of research, with many possible applications. Existing research has primarily focused on generating a single image from available conditioning information in one step. One practical extension beyond one-step generation is a system that generates an image iteratively, conditioned on ongoing linguistic input or feedback. This is significantly mo…

    Submitted 23 September, 2019; v1 submitted 24 November, 2018; originally announced November 2018.

    Comments: Accepted at ICCV 2019

    Journal ref: Proceedings of the 2019 IEEE International Conference on Computer Vision (ICCV)

  31. arXiv:1802.08216  [pdf, other]

    cs.CV

    ChatPainter: Improving Text to Image Generation using Dialogue

    Authors: Shikhar Sharma, Dendi Suhubdy, Vincent Michalski, Samira Ebrahimi Kahou, Yoshua Bengio

    Abstract: Synthesizing realistic images from text descriptions on a dataset like Microsoft Common Objects in Context (MS COCO), where each image can contain several objects, is a challenging task. Prior work has used text captions to generate images. However, captions might not be informative enough to capture the entire image and insufficient for the model to be able to understand which objects in the imag…

    Submitted 22 February, 2018; originally announced February 2018.

  32. arXiv:1710.07300  [pdf, other]

    cs.CV

    FigureQA: An Annotated Figure Dataset for Visual Reasoning

    Authors: Samira Ebrahimi Kahou, Vincent Michalski, Adam Atkinson, Akos Kadar, Adam Trischler, Yoshua Bengio

    Abstract: We introduce FigureQA, a visual reasoning corpus of over one million question-answer pairs grounded in over 100,000 images. The images are synthetic, scientific-style figures from five classes: line plots, dot-line plots, vertical and horizontal bar graphs, and pie charts. We formulate our reasoning task by generating questions from 15 templates; questions concern various relationships between plo…

    Submitted 22 February, 2018; v1 submitted 19 October, 2017; originally announced October 2017.

    Comments: workshop paper at ICLR 2018

  33. arXiv:1706.04261  [pdf, other]

    cs.CV

    The "something something" video database for learning and evaluating visual common sense

    Authors: Raghav Goyal, Samira Ebrahimi Kahou, Vincent Michalski, Joanna Materzyńska, Susanne Westphal, Heuna Kim, Valentin Haenel, Ingo Fruend, Peter Yianilos, Moritz Mueller-Freitag, Florian Hoppe, Christian Thurau, Ingo Bax, Roland Memisevic

    Abstract: Neural networks trained on datasets such as ImageNet have led to major advances in visual object classification. One obstacle that prevents networks from reasoning more deeply about complex scenes and situations, and from integrating visual knowledge with natural language, like humans do, is their lack of common sense knowledge about the physical world. Videos, unlike still images, contain a wealt…

    Submitted 15 June, 2017; v1 submitted 13 June, 2017; originally announced June 2017.

  34. arXiv:1612.02095  [pdf, other]

    cs.CV stat.ML

    ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events

    Authors: Evan Racah, Christopher Beckham, Tegan Maharaj, Samira Ebrahimi Kahou, Prabhat, Christopher Pal

    Abstract: The detection and identification of extreme weather events in large-scale climate simulations is an important problem for risk management, informing governmental policy decisions and advancing our basic understanding of the climate system. Recent work has shown that fully supervised convolutional neural networks (CNNs) can yield acceptable accuracy for classifying well-known types of extreme weat…

    Submitted 25 November, 2017; v1 submitted 6 December, 2016; originally announced December 2016.

  35. arXiv:1605.02688  [pdf, other]

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu…

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  36. arXiv:1603.05691  [pdf, other]

    stat.ML cs.LG

    Do Deep Convolutional Nets Really Need to be Deep and Convolutional?

    Authors: Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Ozlem Aslan, Shengjie Wang, Rich Caruana, Abdelrahman Mohamed, Matthai Philipose, Matt Richardson

    Abstract: Yes, they do. This paper provides the first empirical demonstration that deep convolutional models really need to be both deep and convolutional, even when trained with methods such as distillation that allow small or shallow models of high accuracy to be trained. Although previous research showed that shallow feed-forward nets sometimes can learn the complex functions previously learned by deep n…

    Submitted 3 March, 2017; v1 submitted 17 March, 2016; originally announced March 2016.

  37. arXiv:1510.08660  [pdf, other]

    cs.LG

    RATM: Recurrent Attentive Tracking Model

    Authors: Samira Ebrahimi Kahou, Vincent Michalski, Roland Memisevic

    Abstract: We present an attention-based modular neural framework for computer vision. The framework uses a soft attention mechanism allowing models to be trained with gradient descent. It consists of three modules: a recurrent attention module controlling where to look in an image or video frame, a feature-extraction module providing a representation of what is seen, and an objective module formalizing why…

    Submitted 28 April, 2016; v1 submitted 29 October, 2015; originally announced October 2015.

  38. arXiv:1503.01800  [pdf, other]

    cs.LG cs.CV

    EmoNets: Multimodal deep learning approaches for emotion recognition in video

    Authors: Samira Ebrahimi Kahou, Xavier Bouthillier, Pascal Lamblin, Caglar Gulcehre, Vincent Michalski, Kishore Konda, Sébastien Jean, Pierre Froumenty, Yann Dauphin, Nicolas Boulanger-Lewandowski, Raul Chandias Ferrari, Mehdi Mirza, David Warde-Farley, Aaron Courville, Pascal Vincent, Roland Memisevic, Christopher Pal, Yoshua Bengio

    Abstract: The task of the emotion recognition in the wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style movies. The videos depict acted-out emotions under realistic conditions with a large degree of variation in attributes such as pose and illumination, making it worthwhile to explore approaches which consider combinations of features from multiple…

    Submitted 29 March, 2015; v1 submitted 5 March, 2015; originally announced March 2015.

  39. arXiv:1412.6550  [pdf, ps, other]

    cs.LG cs.NE

    FitNets: Hints for Thin Deep Nets

    Authors: Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta, Yoshua Bengio

    Abstract: While depth tends to improve network performances, it also makes gradient-based training more difficult since deeper networks tend to be more non-linear. The recently proposed knowledge distillation approach is aimed at obtaining small and fast-to-execute models, and it has shown that a student network could imitate the soft output of a larger teacher network or ensemble of networks. In this paper…

    Submitted 27 March, 2015; v1 submitted 19 December, 2014; originally announced December 2014.