Skip to main content

Showing 1–13 of 13 results for author: Visin, F

.
  1. arXiv:2101.08616  [pdf, other

    cs.RO

    Learning rich touch representations through cross-modal self-supervision

    Authors: Martina Zambelli, Yusuf Aytar, Francesco Visin, Yuxiang Zhou, Raia Hadsell

    Abstract: The sense of touch is fundamental in several manipulation tasks, but rarely used in robot manipulation. In this work we tackle the problem of learning rich touch features from cross-modal self-supervision. We evaluate them identifying objects and their properties in a few-shot classification setting. Two new datasets are introduced using a simulated anthropomorphic robotic hand equipped with tacti… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

  2. arXiv:2010.02255  [pdf, other

    cs.AI cs.LG stat.ML

    Temporal Difference Uncertainties as a Signal for Exploration

    Authors: Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, Andre Barreto, Razvan Pascanu

    Abstract: An effective approach to exploration in reinforcement learning is to rely on an agent's uncertainty over the optimal policy, which can yield near-optimal exploration strategies in tabular settings. However, in non-tabular settings that involve function approximators, obtaining accurate uncertainty estimates is almost as challenging a problem. In this paper, we highlight that value estimates are ea… ▽ More

    Submitted 1 July, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, 11 figures, 5 tables

  3. arXiv:2009.12583  [pdf, other

    cs.LG stat.ML

    Small Data, Big Decisions: Model Selection in the Small-Data Regime

    Authors: Jorg Bornschein, Francesco Visin, Simon Osindero

    Abstract: Highly overparametrized neural networks can display curiously strong generalization performance - a phenomenon that has recently garnered a wealth of theoretical and empirical research in order to better understand it. In contrast to most previous work, which typically considers the performance as a function of the model size, in this paper we empirically study the generalization performance as th… ▽ More

    Submitted 26 September, 2020; originally announced September 2020.

    Journal ref: Proceedings of the International Conference on Machine (ICML 2020)

  4. arXiv:1910.14481  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Continual Unsupervised Representation Learning

    Authors: Dushyant Rao, Francesco Visin, Andrei A. Rusu, Yee Whye Teh, Razvan Pascanu, Raia Hadsell

    Abstract: Continual learning aims to improve the ability of modern learning systems to deal with non-stationary distributions, typically by attempting to learn a series of tasks sequentially. Prior art in the field has largely considered supervised or reinforcement learning tasks, and often assumes full knowledge of task labels and boundaries. In this work, we propose an approach (CURL) to tackle a more gen… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  5. arXiv:1909.00025  [pdf, other

    cs.LG cs.NE stat.ML

    Meta-Learning with Warped Gradient Descent

    Authors: Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Francesco Visin, Hujun Yin, Raia Hadsell

    Abstract: Learning an efficient update rule from data that promotes rapid learning of new tasks from the same distribution remains an open problem in meta-learning. Typically, previous works have approached this issue either by attempting to train a neural network that directly produces updates or by attempting to learn better initialisations or scaling factors for a gradient-based update rule. Both of thes… ▽ More

    Submitted 18 February, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: 28 pages, 13 figures, 3 tables. Published as a conference paper at ICLR 2020

  6. arXiv:1806.05510  [pdf, other

    cs.CV

    ReConvNet: Video Object Segmentation with Spatio-Temporal Features Modulation

    Authors: Francesco Lattari, Marco Ciccone, Matteo Matteucci, Jonathan Masci, Francesco Visin

    Abstract: We introduce ReConvNet, a recurrent convolutional architecture for semi-supervised video object segmentation that is able to fast adapt its features to focus on any specific object of interest at inference time. Generalization to new objects never observed during training is known to be a hard task for supervised approaches that would need to be retrained. To tackle this problem, we propose a more… ▽ More

    Submitted 18 June, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: CVPR Workshop - DAVIS Challenge 2018

  7. arXiv:1708.04907  [pdf, other

    cs.CV

    Multi-View Stereo with Single-View Semantic Mesh Refinement

    Authors: Andrea Romanoni, Marco Ciccone, Francesco Visin, Matteo Matteucci

    Abstract: While 3D reconstruction is a well-established and widely explored research topic, semantic 3D reconstruction has only recently witnessed an increasing share of attention from the Computer Vision community. Semantic annotations allow in fact to enforce strong class-dependent priors, as planarity for ground and walls, which can be exploited to refine the reconstruction often resulting in non-trivial… ▽ More

    Submitted 24 August, 2017; v1 submitted 16 August, 2017; originally announced August 2017.

    Comments: £D Reconstruction Meets Semantic, ICCV workshop

  8. arXiv:1611.05013  [pdf, other

    cs.LG

    PixelVAE: A Latent Variable Model for Natural Images

    Authors: Ishaan Gulrajani, Kundan Kumar, Faruk Ahmed, Adrien Ali Taiga, Francesco Visin, David Vazquez, Aaron Courville

    Abstract: Natural image modeling is a landmark challenge of unsupervised learning. Variational Autoencoders (VAEs) learn a useful latent representation and model global structure well but have difficulty capturing small details. PixelCNN models details very well, but lacks a latent code and is difficult to scale for capturing large structures. We present PixelVAE, a VAE model with an autoregressive decoder… ▽ More

    Submitted 15 November, 2016; originally announced November 2016.

  9. arXiv:1608.04980  [pdf, other

    cs.LG cs.NE

    Mollifying Networks

    Authors: Caglar Gulcehre, Marcin Moczulski, Francesco Visin, Yoshua Bengio

    Abstract: The optimization of deep neural networks can be more challenging than traditional convex optimization problems due to the highly non-convex nature of the loss function, e.g. it can involve pathological landscapes such as saddle-surfaces that can be difficult to escape for algorithms based on simple gradient descent. In this paper, we attack the problem of optimization of highly non-convex neural n… ▽ More

    Submitted 17 August, 2016; originally announced August 2016.

  10. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  11. arXiv:1603.07285  [pdf, other

    stat.ML cs.LG cs.NE

    A guide to convolution arithmetic for deep learning

    Authors: Vincent Dumoulin, Francesco Visin

    Abstract: We introduce a guide to help deep learning practitioners understand and manipulate convolutional neural network architectures. The guide clarifies the relationship between various properties (input shape, kernel shape, zero padding, strides and output shape) of convolutional, pooling and transposed convolutional layers, as well as the relationship between convolutional and transposed convolutional… ▽ More

    Submitted 11 January, 2018; v1 submitted 23 March, 2016; originally announced March 2016.

  12. arXiv:1511.07053  [pdf, other

    cs.CV cs.LG

    ReSeg: A Recurrent Neural Network-based Model for Semantic Segmentation

    Authors: Francesco Visin, Marco Ciccone, Adriana Romero, Kyle Kastner, Kyunghyun Cho, Yoshua Bengio, Matteo Matteucci, Aaron Courville

    Abstract: We propose a structured prediction architecture, which exploits the local generic features extracted by Convolutional Neural Networks and the capacity of Recurrent Neural Networks (RNN) to retrieve distant dependencies. The proposed architecture, called ReSeg, is based on the recently introduced ReNet model for image classification. We modify and extend it to perform the more challenging task of s… ▽ More

    Submitted 24 May, 2016; v1 submitted 22 November, 2015; originally announced November 2015.

    Comments: In CVPR Deep Vision Workshop, 2016

  13. arXiv:1505.00393  [pdf, other

    cs.CV

    ReNet: A Recurrent Neural Network Based Alternative to Convolutional Networks

    Authors: Francesco Visin, Kyle Kastner, Kyunghyun Cho, Matteo Matteucci, Aaron Courville, Yoshua Bengio

    Abstract: In this paper, we propose a deep neural network architecture for object recognition based on recurrent neural networks. The proposed network, called ReNet, replaces the ubiquitous convolution+pooling layer of the deep convolutional neural network with four recurrent neural networks that sweep horizontally and vertically in both directions across the image. We evaluate the proposed ReNet on three w… ▽ More

    Submitted 23 July, 2015; v1 submitted 3 May, 2015; originally announced May 2015.