Skip to main content

Showing 101–113 of 113 results for author: Cord, M

.
  1. arXiv:1805.00900  [pdf, other

    cs.AI cs.CL cs.CV cs.IR

    Images & Recipes: Retrieval in the cooking context

    Authors: Micael Carvalho, Rémi Cadène, David Picard, Laure Soulier, Matthieu Cord

    Abstract: Recent advances in the machine learning community allowed different use cases to emerge, as its association to domains like cooking which created the computational cuisine. In this paper, we tackle the picture-recipe alignment problem, having as target application the large-scale retrieval task (finding a recipe given a picture, and vice versa). Our approach is validated on the Recipe1M dataset, c… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

    Comments: Published at DECOR / ICDE 2018. Extended version accepted at SIGIR 2018, available here: arXiv:1804.11146

  2. arXiv:1804.11146  [pdf, other

    cs.CL cs.CV cs.IR

    Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings

    Authors: Micael Carvalho, Rémi Cadène, David Picard, Laure Soulier, Nicolas Thome, Matthieu Cord

    Abstract: Designing powerful tools that support cooking activities has rapidly gained popularity due to the massive amounts of available data, as well as recent advances in machine learning that are capable of analyzing them. In this paper, we propose a cross-modal retrieval model aligning visual and textual data (like pictures of dishes and their recipes) in a shared representation space. We describe an ef… ▽ More

    Submitted 30 April, 2018; originally announced April 2018.

    Comments: accepted at the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval, 2018

  3. arXiv:1804.10988  [pdf, other

    stat.ML cs.LG

    SHADE: Information Based Regularization for Deep Learning

    Authors: Michael Blot, Thomas Robert, Nicolas Thome, Matthieu Cord

    Abstract: Regularization is a big issue for training deep neural networks. In this paper, we propose a new information-theory-based regularization scheme named SHADE for SHAnnon DEcay. The originality of the approach is to define a prior based on conditional entropy, which explicitly decouples the learning of invariant representations in the regularizer and the learning of correlations between inputs and la… ▽ More

    Submitted 22 May, 2018; v1 submitted 29 April, 2018; originally announced April 2018.

  4. arXiv:1804.01852   

    cs.LG stat.ML

    GoSGD: Distributed Optimization for Deep Learning with Gossip Exchange

    Authors: Michael Blot, David Picard, Matthieu Cord

    Abstract: We address the issue of speeding up the training of convolutional neural networks by studying a distributed method adapted to stochastic gradient descent. Our parallel optimization setup uses several threads, each applying individual gradient descents on a local variable. We propose a new way of sharing information between different threads based on gossip algorithms that show good consensus conve… ▽ More

    Submitted 12 November, 2018; v1 submitted 4 April, 2018; originally announced April 2018.

    Comments: Correction to do, and difficulties to change the document

  5. arXiv:1804.01720  [pdf, other

    cs.CV cs.CL cs.LG

    Finding beans in burgers: Deep semantic-visual embedding with localization

    Authors: Martin Engilberge, Louis Chevallier, Patrick Pérez, Matthieu Cord

    Abstract: Several works have proposed to learn a two-path neural network that maps images and texts, respectively, to a same shared Euclidean space where geometry captures useful semantic relationships. Such a multi-modal embedding can be trained and used for various tasks, notably image captioning. In the present work, we introduce a new architecture of this type, with a visual path that leverages recent s… ▽ More

    Submitted 6 April, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: Accepted to CVPR2018

  6. arXiv:1707.06175  [pdf, other

    cs.CV cs.AI cs.LG

    Deformable Part-based Fully Convolutional Network for Object Detection

    Authors: Taylor Mordan, Nicolas Thome, Matthieu Cord, Gilles Henaff

    Abstract: Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those are highly non-rectangular. In this paper we introduce DP-FCN, a deep model for object detection which explicitly adapts to shapes of objects with deformable parts. Without additional annotations, it learns to focus on discriminative elements and to align them, and simultaneous… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

    Comments: Accepted to BMVC 2017 (oral)

  7. arXiv:1705.06676  [pdf, other

    cs.CV

    MUTAN: Multimodal Tucker Fusion for Visual Question Answering

    Authors: Hedi Ben-younes, Rémi Cadene, Matthieu Cord, Nicolas Thome

    Abstract: Bilinear models provide an appealing framework for mixing and merging information in Visual Question Answering (VQA) tasks. They help to learn high level associations between question meaning and visual concepts in the image, but they suffer from huge dimensionality issues. We introduce MUTAN, a multimodal tensor-based Tucker decomposition to efficiently parametrize bilinear interactions between v… ▽ More

    Submitted 18 May, 2017; originally announced May 2017.

  8. arXiv:1611.09726  [pdf, other

    cs.CV cs.LG stat.ML

    Gossip training for deep learning

    Authors: Michael Blot, David Picard, Matthieu Cord, Nicolas Thome

    Abstract: We address the issue of speeding up the training of convolutional networks. Here we study a distributed method adapted to stochastic gradient descent (SGD). The parallel optimization setup uses several threads, each applying individual gradient descents on a local variable. We propose a new way to share information between different threads inspired by gossip algorithms and showing good consensus… ▽ More

    Submitted 29 November, 2016; originally announced November 2016.

  9. arXiv:1610.07882  [pdf, other

    cs.CV

    Maxmin convolutional neural networks for image classification

    Authors: Michael Blot, Matthieu Cord, Nicolas Thome

    Abstract: Convolutional neural networks (CNN) are widely used in computer vision, especially in image classification. However, the way in which information and invariance properties are encoded through in deep CNN architectures is still an open question. In this paper, we propose to modify the standard convo- lutional block of CNN in order to transfer more information layer after layer while kee** some in… ▽ More

    Submitted 25 October, 2016; originally announced October 2016.

  10. arXiv:1610.05567  [pdf, other

    cs.CV

    Master's Thesis : Deep Learning for Visual Recognition

    Authors: Rémi Cadène, Nicolas Thome, Matthieu Cord

    Abstract: The goal of our research is to develop methods advancing automatic visual recognition. In order to predict the unique or multiple labels associated to an image, we study different kind of Deep Neural Networks architectures and methods for supervised features learning. We first draw up a state-of-the-art review of the Convolutional Neural Networks aiming to understand the history behind this family… ▽ More

    Submitted 18 October, 2016; originally announced October 2016.

  11. arXiv:1610.05541  [pdf, other

    cs.CV

    M2CAI Workflow Challenge: Convolutional Neural Networks with Time Smoothing and Hidden Markov Model for Video Frames Classification

    Authors: Rémi Cadène, Thomas Robert, Nicolas Thome, Matthieu Cord

    Abstract: Our approach is among the three best to tackle the M2CAI Workflow challenge. The latter consists in recognizing the operation phase for each frames of endoscopic videos. In this technical report, we compare several classification models and temporal smoothing methods. Our submitted solution is a fine tuned Residual Network-200 on 80% of the training set with temporal smoothing using simple tempora… ▽ More

    Submitted 2 December, 2016; v1 submitted 18 October, 2016; originally announced October 2016.

  12. Deep Neural Networks Under Stress

    Authors: Micael Carvalho, Matthieu Cord, Sandra Avila, Nicolas Thome, Eduardo Valle

    Abstract: In recent years, deep architectures have been used for transfer learning with state-of-the-art performance in many datasets. The properties of their features remain, however, largely unstudied under the transfer perspective. In this work, we present an extensive analysis of the resiliency of feature vectors extracted from deep models, with special focus on the trade-off between performance and com… ▽ More

    Submitted 23 May, 2016; v1 submitted 11 May, 2016; originally announced May 2016.

    Comments: This article corresponds to the accepted version at IEEE ICIP 2016. We will link the DOI as soon as it is available

  13. arXiv:1312.6594  [pdf, other

    cs.CV cs.LG

    Sequentially Generated Instance-Dependent Image Representations for Classification

    Authors: Gabriel Dulac-Arnold, Ludovic Denoyer, Nicolas Thome, Matthieu Cord, Patrick Gallinari

    Abstract: In this paper, we investigate a new framework for image classification that adaptively generates spatial representations. Our strategy is based on a sequential process that learns to explore the different regions of any image in order to infer its category. In particular, the choice of regions is specific to each image, directed by the actual content of previously selected regions.The capacity of… ▽ More

    Submitted 11 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.