Skip to main content

Showing 1–50 of 74 results for author: Picard, D

.
  1. arXiv:2405.20324  [pdf, other

    cs.CV cs.LG

    Don't drop your samples! Coherence-aware training benefits Conditional diffusion

    Authors: Nicolas Dufour, Victor Besnier, Vicky Kalogeiton, David Picard

    Abstract: Conditional diffusion models are powerful generative models that can leverage various types of conditional information, such as class labels, segmentation masks, or text captions. However, in many real-world scenarios, conditional information may be noisy or unreliable due to human annotation errors or weak alignment. In this paper, we propose the Coherence-Aware Diffusion (CAD), a novel method th… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024 as a Highlight. Project page: https://nicolas-dufour.github.io/cad.html

  2. arXiv:2404.13040  [pdf, other

    cs.CV cs.LG

    Analysis of Classifier-Free Guidance Weight Schedulers

    Authors: Xi Wang, Nicolas Dufour, Nefeli Andreou, Marie-Paule Cani, Victoria Fernandez Abrevaya, David Picard, Vicky Kalogeiton

    Abstract: Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weights throughout the diffusion process, reporting superior results but without providing any rationale or analysis. By conducting comprehensive experiments, this p… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  3. arXiv:2401.09629  [pdf, other

    cs.LG stat.ML

    Multiple Locally Linear Kernel Machines

    Authors: David Picard

    Abstract: In this paper we propose a new non-linear classifier based on a combination of locally linear classifiers. A well known optimization formulation is given as we cast the problem in a $\ell_1$ Multiple Kernel Learning (MKL) problem using many locally linear kernels. Since the number of such kernels is huge, we provide a scalable generic MKL training algorithm handling streaming kernels. With respect… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: This paper was written in 2014 and was originally submitted but rejected at ICML'15

  4. arXiv:2310.11265  [pdf, other

    eess.IV cs.CV

    Image Compression using only Attention based Neural Networks

    Authors: Natacha Luka, Romain Negrel, David Picard

    Abstract: In recent research, Learned Image Compression has gained prominence for its capacity to outperform traditional handcrafted pipelines, especially at low bit-rates. While existing methods incorporate convolutional priors with occasional attention blocks to address long-range dependencies, recent advances in computer vision advocate for a transformative shift towards fully transformer-based architect… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  5. arXiv:2308.11677  [pdf, other

    cs.LG cs.CV

    An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning

    Authors: Grégoire Petit, Michael Soumm, Eva Feillet, Adrian Popescu, Bertrand Delezoide, David Picard, Céline Hudelot

    Abstract: Class-Incremental Learning (CIL) aims to build classification models from data streams. At each step of the CIL process, new classes must be integrated into the model. Due to catastrophic forgetting, CIL is particularly challenging when examples from past classes cannot be stored, the case on which we focus here. To date, most approaches are based exclusively on the target dataset of the CIL proce… ▽ More

    Submitted 27 September, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  6. arXiv:2306.02928  [pdf, other

    cs.CV

    LRVS-Fashion: Extending Visual Search with Referring Instructions

    Authors: Simon Lepage, Jérémie Mary, David Picard

    Abstract: This paper introduces a new challenge for image similarity search in the context of fashion, addressing the inherent ambiguity in this domain stemming from complex images. We present Referred Visual Search (RVS), a task allowing users to define more precisely the desired similarity, following recent interest in the industry. We release a new large public dataset, LRVS-Fashion, consisting of 272k f… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 29 pages, 14 figures, 5 tables

    MSC Class: 68T07 (Primary) 68T45 (Secondary) ACM Class: I.2.10

  7. arXiv:2302.00384  [pdf, other

    cs.CV

    Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search

    Authors: Marie-Morgane Paumard, Hedi Tabia, David Picard

    Abstract: Solving jigsaw puzzles requires to grasp the visual features of a sequence of patches and to explore efficiently a solution space that grows exponentially with the sequence length. Therefore, visual deep reinforcement learning (DRL) should answer this problem more efficiently than optimization solvers coupled with neural networks. Based on this assumption, we introduce Alphazzle, a reassembly algo… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  8. arXiv:2212.10292  [pdf, other

    cs.CV cs.AI

    Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason?

    Authors: Monika Wysoczańska, Tom Monnier, Tomasz Trzciński, David Picard

    Abstract: Recent advances in visual representation learning allowed to build an abundance of powerful off-the-shelf features that are ready-to-use for numerous downstream tasks. This work aims to assess how well these features preserve information about the objects, such as their spatial location, their visual properties and their relative relationships. We propose to do so by evaluating them in the context… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  9. arXiv:2211.15692  [pdf, other

    cs.CV

    H3WB: Human3.6M 3D WholeBody Dataset and Benchmark

    Authors: Yue Zhu, Nermin Samet, David Picard

    Abstract: We present a benchmark for 3D human whole-body pose estimation, which involves identifying accurate 3D keypoints on the entire human body, including face, hands, body, and feet. Currently, the lack of a fully annotated and accurate 3D whole-body dataset results in deep networks being trained separately on specific body parts, which are combined during inference. Or they rely on pseudo-groundtruth… ▽ More

    Submitted 6 September, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted by ICCV 2023

  10. arXiv:2211.13131  [pdf, other

    cs.CV cs.AI cs.LG

    FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning

    Authors: Grégoire Petit, Adrian Popescu, Hugo Schindler, David Picard, Bertrand Delezoide

    Abstract: Exemplar-free class-incremental learning is very challenging due to the negative effect of catastrophic forgetting. A balance between stability and plasticity of the incremental process is needed in order to obtain good accuracy for past as well as new classes. Existing exemplar-free class-incremental methods focus either on successive fine tuning of the model, thus favoring plasticity, or on usin… ▽ More

    Submitted 28 November, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  11. arXiv:2210.04883  [pdf, other

    cs.CV cs.AI cs.LG

    SCAM! Transferring humans between images with Semantic Cross Attention Modulation

    Authors: Nicolas Dufour, David Picard, Vicky Kalogeiton

    Abstract: A large body of recent work targets semantically conditioned image generation. Most such methods focus on the narrower task of pose transfer and ignore the more challenging task of subject transfer that consists in not only transferring the pose but also the appearance and background. In this work, we introduce SCAM (Semantic Cross Attention Modulation), a system that encodes rich and diverse info… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted at ECCV 2022

  12. arXiv:2210.02231  [pdf, other

    cs.CV

    Decanus to Legatus: Synthetic training for 2D-3D human pose lifting

    Authors: Yue Zhu, David Picard

    Abstract: 3D human pose estimation is a challenging task because of the difficulty to acquire ground-truth data outside of controlled environments. A number of further issues have been hindering progress in building a universal and robust model for this task, including domain gaps between different datasets, unseen actions between train and test datasets, various hardware settings and high cost of annotatio… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted by ACCV 2022

  13. arXiv:2209.06606  [pdf, other

    cs.CV cs.LG

    PlaStIL: Plastic and Stable Memory-Free Class-Incremental Learning

    Authors: Grégoire Petit, Adrian Popescu, Eden Belouadah, David Picard, Bertrand Delezoide

    Abstract: Plasticity and stability are needed in class-incremental learning in order to learn from new data while preserving past knowledge. Due to catastrophic forgetting, finding a compromise between these two properties is particularly challenging when no memory buffer is available. Mainstream methods need to store two deep models since they integrate new classes using fine-tuning with knowledge distilla… ▽ More

    Submitted 4 July, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

  14. arXiv:2207.10541  [pdf, other

    cs.LG cs.AI stat.ML

    Unveiling the Latent Space Geometry of Push-Forward Generative Models

    Authors: Thibaut Issenhuth, Ugo Tanielian, Jérémie Mary, David Picard

    Abstract: Many deep generative models are defined as a push-forward of a Gaussian measure by a continuous generator, such as Generative Adversarial Networks (GANs) or Variational Auto-Encoders (VAEs). This work explores the latent space of such deep generative models. A key issue with these models is their tendency to output samples outside of the support of the target distribution when learning disconnecte… ▽ More

    Submitted 15 May, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

  15. arXiv:2207.08782  [pdf, other

    cs.CV

    Instance-Aware Observer Network for Out-of-Distribution Object Segmentation

    Authors: Victor Besnier, Andrei Bursuc, David Picard, Alexandre Briot

    Abstract: Recent works on predictive uncertainty estimation have shown promising results on Out-Of-Distribution (OOD) detection for semantic segmentation. However, these methods struggle to precisely locate the point of interest in the image, i.e, the anomaly. This limitation is due to the difficulty of finegrained prediction at the pixel level. To address this issue, we build upon the recent ObsNet approac… ▽ More

    Submitted 29 August, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

  16. Absolute measurements of state-to-state rotational energy transfer between CO and H2 at interstellar temperatures

    Authors: H. Labiad, M. Fournier, L. A. Mertens, A. Faure, D. Carty, T. Stoecklin, P. Jankowski, K. Szalewicz, S. D. Le Picard, I. R. Sims

    Abstract: Experimental measurements and theoretical calculations of state-to-state rate coefficients for rotational energy transfer of CO in collision with H$_2$ are reported at the very low temperatures prevailing in dense interstellar clouds (5 - 20 K). Detailed agreement between quantum state-selected experiments performed in cold supersonic flows using time-resolved infrared - vacuum-ultraviolet double… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: 6 pages, 4 figures, accepted for publication in Phys. Rev. A. Letter (21/12/2021)

  17. arXiv:2111.15264  [pdf, other

    cs.CV cs.LG

    EdiBERT, a generative model for image editing

    Authors: Thibaut Issenhuth, Ugo Tanielian, Jérémie Mary, David Picard

    Abstract: Advances in computer vision are pushing the limits of im-age manipulation, with generative models sampling detailed images on various tasks. However, a specialized model is often developed and trained for each specific task, even though many image edition tasks share similarities. In denoising, inpainting, or image compositing, one always aims at generating a realistic image from a low-quality one… ▽ More

    Submitted 21 July, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

  18. arXiv:2111.10248  [pdf, other

    stat.ML cs.LG math.OC

    Non asymptotic bounds in asynchronous sum-weight gossip protocols

    Authors: David Picard, Jérôme Fellus, Stéphane Garnier

    Abstract: This paper focuses on non-asymptotic diffusion time in asynchronous gossip protocols. Asynchronous gossip protocols are designed to perform distributed computation in a network of nodes by randomly exchanging messages on the associated graph. To achieve consensus among nodes, a minimal number of messages has to be exchanged. We provides a probabilistic bound to such number for the general case. We… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: Unpublished work done circa 2016

  19. arXiv:2110.09803  [pdf, other

    cs.LG cs.CV

    Latent reweighting, an almost free improvement for GANs

    Authors: Thibaut Issenhuth, Ugo Tanielian, David Picard, Jeremie Mary

    Abstract: Standard formulations of GANs, where a continuous function deforms a connected latent space, have been shown to be misspecified when fitting different classes of images. In particular, the generator will necessarily sample some low-quality images in between the classes. Rather than modifying the architecture, a line of works aims at improving the sampling quality from pre-trained generators at the… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

  20. arXiv:2109.08203  [pdf, other

    cs.CV

    Torch.manual_seed(3407) is all you need: On the influence of random seeds in deep learning architectures for computer vision

    Authors: David Picard

    Abstract: In this paper I investigate the effect of random seed selection on the accuracy when using popular deep learning architectures for computer vision. I scan a large amount of seeds (up to $10^4$) on CIFAR 10 and I also scan fewer seeds on Imagenet using pre-trained models to investigate large scale datasets. The conclusions are that even if the variance is not very large, it is surprisingly easy to… ▽ More

    Submitted 11 May, 2023; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: fixed typos

  21. arXiv:2108.08109  [pdf, other

    cs.CV

    Image Collation: Matching illustrations in manuscripts

    Authors: Ryad Kaoua, Xi Shen, Alexandra Durr, Stavros Lazaris, David Picard, Mathieu Aubry

    Abstract: Illustrations are an essential transmission instrument. For an historian, the first step in studying their evolution in a corpus of similar manuscripts is to identify which ones correspond to each other. This image collation task is daunting for manuscripts separated by many lost copies, spreading over centuries, which might have been completely re-organized and greatly modified to adapt to novel… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: accepted to ICDAR 2021

  22. arXiv:2108.01634  [pdf, other

    cs.CV

    Triggering Failures: Out-Of-Distribution detection by learning from local adversarial attacks in Semantic Segmentation

    Authors: Victor Besnier, Andrei Bursuc, David Picard, Alexandre Briot

    Abstract: In this paper, we tackle the detection of out-of-distribution (OOD) objects in semantic segmentation. By analyzing the literature, we found that current methods are either accurate or fast but not both which limits their usability in real world applications. To get the best of both aspects, we propose to mitigate the common shortcomings by following four design principles: decoupling the OOD detec… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  23. arXiv:2105.13688  [pdf, other

    cs.CV

    Learning Uncertainty For Safety-Oriented Semantic Segmentation In Autonomous Driving

    Authors: Victor Besnier, David Picard, Alexandre Briot

    Abstract: In this paper, we show how uncertainty estimation can be leveraged to enable safety critical image segmentation in autonomous driving, by triggering a fallback behavior if a target accuracy cannot be guaranteed. We introduce a new uncertainty measure based on disagreeing predictions as measured by a dissimilarity function. We propose to estimate this dissimilarity by training a deep neural archite… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  24. arXiv:2103.11409  [pdf, other

    eess.SP

    Deep Learning Based Detection for Spectrally Efficient FDM Systems

    Authors: David Picard, Arsenia Chorti

    Abstract: In this study we present how to approach the problem of building efficient detectors for spectrally efficient frequency division multiplexing (SEFDM) systems. The superiority of residual convolution neural networks (CNNs) for these types of problems is demonstrated through experimentation with many different types of architectures.

    Submitted 21 March, 2021; originally announced March 2021.

  25. arXiv:2103.02306  [pdf, ps, other

    eess.SP cs.IT

    Rate Analysis and Deep Neural Network Detectors for SEFDM FTN Systems

    Authors: Arsenia Chorti, David Picard

    Abstract: In this work we compare the capacity and achievable rate of uncoded faster than Nyquist (FTN) signalling in the frequency domain, also referred to as spectrally efficient FDM (SEFDM). We propose a deep residual convolutional neural network detector for SEFDM signals in additive white Gaussian noise channels, that allows to approach the Mazo limit in systems with up to 60 subcarriers. Notably, the… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

  26. arXiv:2012.07487  [pdf, other

    stat.AP cs.LG

    Clustering high dimensional meteorological scenarios: results and performance index

    Authors: Yamila Barrera, Leonardo Boechi, Matthieu Jonckheere, Vincent Lefieux, Dominique Picard, Ezequiel Smucler, Agustin Somacal, Alfredo Umfurer

    Abstract: The Reseau de Transport d'Electricité (RTE) is the French main electricity network operational manager and dedicates large number of resources and efforts towards understanding climate time series data. We discuss here the problem and the methodology of grou** and selecting representatives of possible climate scenarios among a large number of climate simulations provided by RTE. The data used is… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: 19 pages, 14 figures

  27. arXiv:2012.00374  [pdf, other

    physics.chem-ph astro-ph.IM

    A new instrument for kinetics and branching ratio studies of gas phase collisional processes at very low temperatures

    Authors: Olivier Durif, Michael Capron, Joey P. Messinger, Abdessamad Benidar, Ludovic Biennier, Jérémy Bourgalais, André Canosa, Jonathan Courbe, Gustavo A. Garcia, Jean-François Gil, Laurent Nahon, Mitchio Okumura, Lucile Rutkowski, Ian R. Sims, Jonathan Thiévin, Sébastien D. Le Picard

    Abstract: A new instrument dedicated to the kinetic study of low-temperature gas phase neutral-neutral reactions, including clustering processes, is presented. It combines a supersonic flow reactor with Vacuum Ultra-Violet (VUV) synchrotron photoionization time of flight mass spectrometry. A photoion-photoelectron coincidence detection scheme has been adopted to optimize the particle counting efficiency. Th… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  28. arXiv:2009.01998  [pdf, other

    cs.CV

    SSP-Net: Scalable Sequential Pyramid Networks for Real-Time 3D Human Pose Regression

    Authors: Diogo Luvizon, Hedi Tabia, David Picard

    Abstract: In this paper we propose a highly scalable convolutional neural network, end-to-end trainable, for real-time 3D human pose regression from still RGB images. We call this approach the Scalable Sequential Pyramid Networks (SSP-Net) as it is trained with refined supervision at multiple scales in a sequential manner. Our network requires a single training procedure and is capable of producing its best… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Under review at PR

  29. arXiv:2006.06611  [pdf, other

    cs.CV

    Improving Deep Metric Learning with Virtual Classes and Examples Mining

    Authors: Pierre Jacob, David Picard, Aymeric Histace, Edouard Klein

    Abstract: In deep metric learning, the training procedure relies on sampling informative tuples. However, as the training procedure progresses, it becomes nearly impossible to sample relevant hard negative examples without proper mining strategies or generation-based methods. Recent work on hard negative generation have shown great promises to solve the mining problem. However, this generation process is di… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

  30. Deepzzle: Solving Visual Jigsaw Puzzles with Deep Learning andShortest Path Optimization

    Authors: Marie-Morgane Paumard, David Picard, Hedi Tabia

    Abstract: We tackle the image reassembly problem with wide space between the fragments, in such a way that the patterns and colors continuity is mostly unusable. The spacing emulates the erosion of which the archaeological fragments suffer. We crop-square the fragments borders to compel our algorithm to learn from the content of the fragments. We also complicate the image reassembly by removing fragments an… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

    Journal ref: IEEE Transactions on Image Processing (2020)

  31. DIABLO: Dictionary-based Attention Block for Deep Metric Learning

    Authors: Pierre Jacob, David Picard, Aymeric Histace, Edouard Klein

    Abstract: Recent breakthroughs in representation learning of unseen classes and examples have been made in deep metric learning by training at the same time the image representations and a corresponding metric with deep networks. Recent contributions mostly address the training part (loss functions, sampling strategies, etc.), while a few works focus on improving the discriminative power of the image repres… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: Pre-print. Accepted for publication at Pattern Recognition Letters

  32. arXiv:2002.02250  [pdf, other

    stat.ML cs.LG nlin.CD physics.ao-ph

    Uncovering differential equations from data with hidden variables

    Authors: Agustín Somacal, Yamila Barrera, Leonardo Boechi, Matthieu Jonckheere, Vincent Lefieux, Dominique Picard, Ezequiel Smucler

    Abstract: SINDy is a method for learning system of differential equations from data by solving a sparse linear regression optimization problem [Brunton et al., 2016]. In this article, we propose an extension of the SINDy method that learns systems of differential equations in cases where some of the variables are not observed. Our extension is based on regressing a higher order time derivative of a target v… ▽ More

    Submitted 23 December, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

  33. Multi-task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition

    Authors: Diogo C Luvizon, Hedi Tabia, David Picard

    Abstract: Human pose estimation and action recognition are related tasks since both problems are strongly dependent on the human body representation and analysis. Nonetheless, most recent methods in the literature handle the two problems separately. In this work, we propose a multi-task framework for jointly estimating 2D or 3D human poses from monocular color images and classifying human actions from video… ▽ More

    Submitted 3 March, 2020; v1 submitted 14 December, 2019; originally announced December 2019.

    Comments: Accepted to TPAMI. arXiv admin note: text overlap with arXiv:1802.09232

  34. arXiv:1911.09245  [pdf, other

    cs.CV

    Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

    Authors: Diogo C Luvizon, Hedi Tabia, David Picard

    Abstract: 3D human pose estimation is frequently seen as the task of estimating 3D poses relative to the root body joint. Alternatively, we propose a 3D human pose estimation method in camera coordinates, which allows effective combination of 2D annotated data and 3D poses and a straightforward multi-view generalization. To that end, we cast the problem as a view frustum space pose estimation, where absolut… ▽ More

    Submitted 20 August, 2021; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: Source code is available at https://github.com/dluvizon/3d-pose-consensus

  35. arXiv:1908.02735  [pdf, other

    cs.CV

    Metric Learning With HORDE: High-Order Regularizer for Deep Embeddings

    Authors: Pierre Jacob, David Picard, Aymeric Histace, Edouard Klein

    Abstract: Learning an effective similarity measure between image representations is key to the success of recent advances in visual search tasks (e.g. verification or zero-shot learning). Although the metric learning part is well addressed, this metric is usually computed over the average of the extracted deep features. This representation is then trained to be discriminative. However, these deep features t… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: Camera-ready for our ICCV 2019 paper (poster)

  36. arXiv:1906.01972  [pdf, ps, other

    cs.CV

    Efficient Codebook and Factorization for Second Order Representation Learning

    Authors: Pierre Jacob, David Picard, Aymeric Histace, Edouard Klein

    Abstract: Learning rich and compact representations is an open topic in many fields such as object recognition or image retrieval. Deep neural networks have made a major breakthrough during the last few years for these tasks but their representations are not necessary as rich as needed nor as compact as expected. To build richer representations, high order statistics have been exploited and have shown excel… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: Accepted at IEEE International Conference on Image Processing (ICIP) 2019

  37. arXiv:1809.00898  [pdf, other

    cs.CV

    Image Reassembly Combining Deep Learning and Shortest Path Problem

    Authors: M. -M. Paumard, D. Picard, H. Tabia

    Abstract: This paper addresses the problem of reassembling images from disjointed fragments. More specifically, given an unordered set of fragments, we aim at reassembling one or several possibly incomplete images. The main contributions of this work are: 1) several deep neural architectures to predict the relative position of image fragments that outperform the previous state of the art; 2) casting the rea… ▽ More

    Submitted 4 September, 2018; originally announced September 2018.

    Comments: ECCV 2018

  38. arXiv:1807.03155  [pdf, other

    cs.CV cs.LG stat.ML

    Jigsaw Puzzle Solving Using Local Feature Co-Occurrences in Deep Neural Networks

    Authors: Marie-Morgane Paumard, David Picard, Hedi Tabia

    Abstract: Archaeologists are in dire need of automated object reconstruction methods. Fragments reassembly is close to puzzle problems, which may be solved by computer vision algorithms. As they are often beaten on most image related tasks by deep learning algorithms, we study a classification method that can solve jigsaw puzzles. In this paper, we focus on classifying the relative position: given a couple… ▽ More

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: ICIP 2018

  39. arXiv:1806.08991  [pdf, other

    cs.CV

    Leveraging Implicit Spatial Information in Global Features for Image Retrieval

    Authors: Pierre Jacob, David Picard, Aymeric Histace, Edouard Klein

    Abstract: Most image retrieval methods use global features that aggregate local distinctive patterns into a single representation. However, the aggregation process destroys the relative spatial information by considering orderless sets of local descriptors. We propose to integrate relative spatial information into the aggregation process by taking into account co-occurrences of local patterns in a tensor fr… ▽ More

    Submitted 23 June, 2018; originally announced June 2018.

    Comments: 8 pages, 2 figures and 1 table. Draft paper for conference, IEEE International Conference on Image Processing (ICIP) 2018

  40. arXiv:1805.04682  [pdf, ps, other

    math.PR math.ST

    Kernel and wavelet density estimators on manifolds and more general metric spaces

    Authors: G. Cleanthous, A. Georgiadis, G. Kerkyacharian, P. Petrushev, D. Picard

    Abstract: We consider the problem of estimating the density of observations taking values in classical or nonclassical spaces such as manifolds and more general metric spaces. Our setting is quite general but also sufficiently rich in allowing the development of smooth functional calculus with well localized spectral kernels, Besov regularity spaces, and wavelet type systems. Kernel and both linear and nonl… ▽ More

    Submitted 9 February, 2019; v1 submitted 12 May, 2018; originally announced May 2018.

    MSC Class: Primary 62G07; 58J35; Secondary 43A85; 42B35

  41. arXiv:1805.00900  [pdf, other

    cs.AI cs.CL cs.CV cs.IR

    Images & Recipes: Retrieval in the cooking context

    Authors: Micael Carvalho, Rémi Cadène, David Picard, Laure Soulier, Matthieu Cord

    Abstract: Recent advances in the machine learning community allowed different use cases to emerge, as its association to domains like cooking which created the computational cuisine. In this paper, we tackle the picture-recipe alignment problem, having as target application the large-scale retrieval task (finding a recipe given a picture, and vice versa). Our approach is validated on the Recipe1M dataset, c… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

    Comments: Published at DECOR / ICDE 2018. Extended version accepted at SIGIR 2018, available here: arXiv:1804.11146

  42. arXiv:1804.11146  [pdf, other

    cs.CL cs.CV cs.IR

    Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings

    Authors: Micael Carvalho, Rémi Cadène, David Picard, Laure Soulier, Nicolas Thome, Matthieu Cord

    Abstract: Designing powerful tools that support cooking activities has rapidly gained popularity due to the massive amounts of available data, as well as recent advances in machine learning that are capable of analyzing them. In this paper, we propose a cross-modal retrieval model aligning visual and textual data (like pictures of dishes and their recipes) in a shared representation space. We describe an ef… ▽ More

    Submitted 30 April, 2018; originally announced April 2018.

    Comments: accepted at the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval, 2018

  43. arXiv:1804.01852   

    cs.LG stat.ML

    GoSGD: Distributed Optimization for Deep Learning with Gossip Exchange

    Authors: Michael Blot, David Picard, Matthieu Cord

    Abstract: We address the issue of speeding up the training of convolutional neural networks by studying a distributed method adapted to stochastic gradient descent. Our parallel optimization setup uses several threads, each applying individual gradient descents on a local variable. We propose a new way of sharing information between different threads based on gossip algorithms that show good consensus conve… ▽ More

    Submitted 12 November, 2018; v1 submitted 4 April, 2018; originally announced April 2018.

    Comments: Correction to do, and difficulties to change the document

  44. arXiv:1802.09232  [pdf, other

    cs.CV

    2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning

    Authors: Diogo C. Luvizon, David Picard, Hedi Tabia

    Abstract: Action recognition and human pose estimation are closely related but both problems are generally handled as distinct tasks in the literature. In this work, we propose a multitask framework for jointly 2D and 3D pose estimation from still images and human action recognition from video sequences. We show that a single architecture can be used to solve the two problems in an efficient way and still a… ▽ More

    Submitted 21 March, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: To appear in CVPR 2018

  45. arXiv:1802.07617  [pdf, other

    math.ST

    Convergence rates for smooth k-means change-point detection

    Authors: Aurélie Fischer, Dominique Picard

    Abstract: In this paper, we consider the estimation of a change-point for possibly high-dimensional data in a Gaussian model, using a k-means method. We prove that, up to a logarithmic term, this change-point estimator has a minimax rate of convergence. Then, considering the case of sparse data, with a Sobolev regularity, we propose a smoothing procedure based on Lepski's method and show that the resulting… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

  46. arXiv:1710.02322  [pdf, other

    cs.CV

    Human Pose Regression by Combining Indirect Part Detection and Contextual Information

    Authors: Diogo C. Luvizon, Hedi Tabia, David Picard

    Abstract: In this paper, we propose an end-to-end trainable regression approach for human pose estimation from still images. We use the proposed Soft-argmax function to convert feature maps directly to joint coordinates, resulting in a fully differentiable framework. Our method is able to learn heat maps representations indirectly, without additional steps of artificial ground truth generation. Consequently… ▽ More

    Submitted 6 October, 2017; originally announced October 2017.

  47. arXiv:1701.00167  [pdf, ps, other

    stat.ML cs.LG

    Very Fast Kernel SVM under Budget Constraints

    Authors: David Picard

    Abstract: In this paper we propose a fast online Kernel SVM algorithm under tight budget constraints. We propose to split the input space using LVQ and train a Kernel SVM in each cluster. To allow for online training, we propose to limit the size of the support vector set of each cluster using different strategies. We show in the experiment that our algorithm is able to achieve high accuracy while having a… ▽ More

    Submitted 31 December, 2016; originally announced January 2017.

  48. arXiv:1611.09726  [pdf, other

    cs.CV cs.LG stat.ML

    Gossip training for deep learning

    Authors: Michael Blot, David Picard, Matthieu Cord, Nicolas Thome

    Abstract: We address the issue of speeding up the training of convolutional networks. Here we study a distributed method adapted to stochastic gradient descent (SGD). The parallel optimization setup uses several threads, each applying individual gradient descents on a local variable. We propose a new way to share information between different threads inspired by gossip algorithms and showing good consensus… ▽ More

    Submitted 29 November, 2016; originally announced November 2016.

  49. arXiv:1610.01000  [pdf, other

    stat.AP stat.ML

    Statistical learning for wind power : a modeling and stability study towards forecasting

    Authors: Aurélie Fischer, Lucie Montuelle, Mathilde Mougeot, Dominique Picard

    Abstract: We focus on wind power modeling using machine learning techniques. We show on real data provided by the wind energy company Ma{ï}a Eolis, that parametric models, even following closely the physical equation relating wind production to wind speed are outperformed by intelligent learning algorithms. In particular, the CART-Bagging algorithm gives very stable and promising results. Besides, as a step… ▽ More

    Submitted 12 January, 2018; v1 submitted 4 October, 2016; originally announced October 2016.

    Journal ref: Wind Energy, Wiley, 2017, 20 (12), pp.2037 - 2047

  50. arXiv:1603.08257  [pdf

    astro-ph.GA physics.chem-ph

    The C(3P) + NH3 reaction in interstellar chemistry: I. Investigation of the product formation channels

    Authors: Jeremy Bourgalais, Michael Capron, Ranjith Kumar Abhinavam Kailasanathan, David L. Osborn, Kevin M. Hickson, Jean-Christophe Loison, Valentine Wakelam, Fabien Goulay, Sébastien D. Le Picard

    Abstract: The product formation channels of ground state carbon atoms, C(3P), reacting with ammonia, NH3, have been investigated using two complementary experiments and electronic structure calculations. Reaction products are detected in a gas flow tube experiment (330 K, 4 Torr) using tunable VUV photoionization coupled with time of flight mass spectrometry. Temporal profiles of the species formed and phot… ▽ More

    Submitted 27 March, 2016; originally announced March 2016.