Skip to main content

Showing 1–30 of 30 results for author: Bailly, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.15806  [pdf, other

    cs.CV

    PIPE : Parallelized Inference Through Post-Training Quantization Ensembling of Residual Expansions

    Authors: Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: Deep neural networks (DNNs) are ubiquitous in computer vision and natural language processing, but suffer from high inference cost. This problem can be addressed by quantization, which consists in converting floating point perations into a lower bit-width format. With the growing concerns on privacy rights, we focus our efforts on data-free methods. However, such techniques suffer from their lack… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.14645

  2. arXiv:2311.10549  [pdf, other

    cs.CV

    Archtree: on-the-fly tree-structured exploration for latency-aware pruning of deep neural networks

    Authors: Rémi Ouazan Reboul, Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: Deep neural networks (DNNs) have become ubiquitous in addressing a number of problems, particularly in computer vision. However, DNN inference is computationally intensive, which can be prohibitive e.g. when considering edge devices. To solve this problem, a popular solution is DNN pruning, and more so structured pruning, where coherent computational blocks (e.g. channels for convolutional network… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 10 pages, 7 figures

  3. arXiv:2309.17361  [pdf, other

    cs.CV

    Network Memory Footprint Compression Through Jointly Learnable Codebooks and Map**s

    Authors: Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: The massive interest in deep neural networks (DNNs) for both computer vision and natural language processing has been sparked by the growth in computational power. However, this led to an increase in the memory footprint, to a point where it can be challenging to simply load a model on commodity devices such as mobile phones. To address this limitation, quantization is a favored solution as it map… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  4. arXiv:2309.05334  [pdf, other

    cs.CV

    MultIOD: Rehearsal-free Multihead Incremental Object Detector

    Authors: Eden Belouadah, Arnaud Dapogny, Kevin Bailly

    Abstract: Class-Incremental learning (CIL) refers to the ability of artificial agents to integrate new classes as they appear in a stream. It is particularly interesting in evolving environments where agents have limited access to memory and computational resources. The main challenge of incremental learning is catastrophic forgetting, the inability of neural networks to retain past knowledge when learning… ▽ More

    Submitted 9 April, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted at the archival track of the Workshop on Continual Learning in Computer Vision (CVPR 2024)

  5. arXiv:2308.07662  [pdf, other

    cs.LG cs.CV

    Gradient-Based Post-Training Quantization: Challenging the Status Quo

    Authors: Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: Quantization has become a crucial step for the efficient deployment of deep neural networks, where floating point operations are converted to simpler fixed point operations. In its most naive form, it simply consists in a combination of scaling and rounding transformations, leading to either a limited compression rate or a significant accuracy drop. Recently, Gradient-based post-training quantizat… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  6. arXiv:2308.05600  [pdf, other

    cs.LG cs.CV

    NUPES : Non-Uniform Post-Training Quantization via Power Exponent Search

    Authors: Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: Deep neural network (DNN) deployment has been confined to larger hardware devices due to their expensive computational requirements. This challenge has recently reached another scale with the emergence of large language models (LLMs). In order to reduce both their memory footprint and latency, a promising technique is quantization. It consists in converting floating point representations to low bi… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  7. arXiv:2308.04753  [pdf, other

    cs.CV

    SAfER: Layer-Level Sensitivity Assessment for Efficient and Robust Neural Network Inference

    Authors: Edouard Yvinec, Arnaud Dapogny, Kevin Bailly, Xavier Fischer

    Abstract: Deep neural networks (DNNs) demonstrate outstanding performance across most computer vision tasks. Some critical applications, such as autonomous driving or medical imaging, also require investigation into their behavior and the reasons behind the decisions they make. In this vein, DNN attribution consists in studying the relationship between the predictions of a DNN and its inputs. Attribution me… ▽ More

    Submitted 8 December, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

  8. arXiv:2306.17442  [pdf, other

    cs.CV

    Designing strong baselines for ternary neural network quantization through support and mass equalization

    Authors: Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: Deep neural networks (DNNs) offer the highest performance in a wide range of applications in computer vision. These results rely on over-parameterized backbones, which are expensive to run. This computational burden can be dramatically reduced by quantizing (in either data-free (DFQ), post-training (PTQ) or quantization-aware training (QAT) scenarios) floating point values to ternary values (2 bit… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Journal ref: ICIP 2023

  9. arXiv:2303.11803  [pdf, other

    cs.CV

    Fighting over-fitting with quantization for learning deep neural networks on noisy labels

    Authors: Gauthier Tallec, Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: The rising performance of deep neural networks is often empirically attributed to an increase in the available computational power, which allows complex models to be trained upon large amounts of annotated data. However, increased model complexity leads to costly deployment of modern neural networks, while gathering such amounts of data requires huge costs to avoid label noise. In this work, we st… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  10. arXiv:2303.02994  [pdf, other

    cs.CV

    Fighting noise and imbalance in Action Unit detection problems

    Authors: Gauthier Tallec, Arnaud Dapogny, Kevin Bailly

    Abstract: Action Unit (AU) detection aims at automatically caracterizing facial expressions with the muscular activations they involve. Its main interest is to provide a low-level face representation that can be used to assist higher level affective computing tasks learning. Yet, it is a challenging task. Indeed, the available databases display limited face variability and are imbalanced toward neutral expr… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  11. arXiv:2301.09858  [pdf, other

    cs.CV

    PowerQuant: Automorphism Search for Non-Uniform Quantization

    Authors: Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly

    Abstract: Deep neural networks (DNNs) are nowadays ubiquitous in many domains such as computer vision. However, due to their high latency, the deployment of DNNs hinges on the development of compression techniques such as quantization which consists in lowering the number of bits used to encode the weights and activations. Growing concerns for privacy and security have motivated the development of data-free… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

  12. arXiv:2208.03506  [pdf, ps, other

    cs.CV

    Multi-Task Transformer with uncertainty modelling for Face Based Affective Computing

    Authors: Gauthier Tallec, Jules Bonnard, Arnaud Dapogny, Kévin Bailly

    Abstract: Face based affective computing consists in detecting emotions from face images. It is useful to unlock better automatic comprehension of human behaviours and could pave the way toward improved human-machines interactions. However it comes with the challenging task of designing a computational representation of emotions. So far, emotions have been represented either continuously in the 2D Valence/A… ▽ More

    Submitted 12 December, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

  13. arXiv:2207.04089  [pdf, other

    cs.CV

    SInGE: Sparsity via Integrated Gradients Estimation of Neuron Relevance

    Authors: Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly

    Abstract: The leap in performance in state-of-the-art computer vision methods is attributed to the development of deep neural networks. However it often comes at a computational price which may hinder their deployment. To alleviate this limitation, structured pruning is a well known technique which consists in removing channels, neurons or filters, and is commonly applied in order to produce more compact mo… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  14. arXiv:2203.14646  [pdf, other

    cs.LG cs.CV

    To Fold or Not to Fold: a Necessary and Sufficient Condition on Batch-Normalization Layers Folding

    Authors: Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: Batch-Normalization (BN) layers have become fundamental components in the evermore complex deep neural network architectures. Such models require acceleration processes for deployment on edge devices. However, BN layers add computation bottlenecks due to the sequential operation processing: thus, a key, yet often overlooked component of the acceleration process is BN layers folding. In this paper,… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  15. arXiv:2203.14645  [pdf, other

    cs.CV

    REx: Data-Free Residual Quantization Error Expansion

    Authors: Edouard Yvinec, Arnaud Dapgony, Matthieu Cord, Kevin Bailly

    Abstract: Deep neural networks (DNNs) are ubiquitous in computer vision and natural language processing, but suffer from high inference cost. This problem can be addressed by quantization, which consists in converting floating point operations into a lower bit-width format. With the growing concerns on privacy rights, we focus our efforts on data-free methods. However, such techniques suffer from their lack… ▽ More

    Submitted 29 May, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

  16. arXiv:2203.14642  [pdf, other

    cs.CV

    SPIQ: Data-Free Per-Channel Static Input Quantization

    Authors: Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly

    Abstract: Computationally expensive neural networks are ubiquitous in computer vision and solutions for efficient inference have drawn a growing attention in the machine learning community. Examples of such solutions comprise quantization, i.e. converting the processing values (weights and inputs) from floating point into integers e.g. int8 or int4. Concurrently, the rise of privacy concerns motivated the s… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  17. arXiv:2203.12905  [pdf, other

    cs.CV

    Privileged Attribution Constrained Deep Networks for Facial Expression Recognition

    Authors: Jules Bonnard, Arnaud Dapogny, Ferdinand Dhombres, Kévin Bailly

    Abstract: Facial Expression Recognition (FER) is crucial in many research domains because it enables machines to better understand human behaviours. FER methods face the problems of relatively small datasets and noisy data that don't allow classical networks to generalize well. To alleviate these issues, we guide the model to concentrate on specific facial areas like the eyes, the mouth or the eyebrows, whi… ▽ More

    Submitted 30 June, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

  18. arXiv:2203.12531  [pdf, ps, other

    cs.CV

    Multi-label Transformer for Action Unit Detection

    Authors: Gauthier Tallec, Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

    Abstract: Action Unit (AU) Detection is the branch of affective computing that aims at recognizing unitary facial muscular movements. It is key to unlock unbiased computational face representations and has therefore aroused great interest in the past few years. One of the main obstacles toward building efficient deep learning based AU detection system is the lack of wide facial image databases annotated by… ▽ More

    Submitted 12 December, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

  19. arXiv:2202.00446  [pdf, other

    cs.CV

    Multi-Order Networks for Action Unit Detection

    Authors: Gauthier Tallec, Arnaud Dapogny, Kevin Bailly

    Abstract: Action Units (AU) are muscular activations used to describe facial expressions. Therefore accurate AU recognition unlocks unbiaised face representation which can improve face-based affective computing applications. From a learning standpoint AU detection is a multi-task problem with strong inter-task dependencies. To solve such problem, most approaches either rely on weight sharing, or add explici… ▽ More

    Submitted 6 March, 2023; v1 submitted 1 February, 2022; originally announced February 2022.

  20. arXiv:2110.01397  [pdf, other

    cs.LG cs.CV

    RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging

    Authors: Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly

    Abstract: Pruning Deep Neural Networks (DNNs) is a prominent field of study in the goal of inference runtime acceleration. In this paper, we introduce a novel data-free pruning protocol RED++. Only requiring a trained neural network, and not specific to DNN architecture, we exploit an adaptive data-free scalar hashing which exhibits redundancies among neuron weight values. We study the theoretical and empir… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: 18 pages, 10 figures

  21. arXiv:2105.14797  [pdf, other

    cs.CV eess.IV

    RED : Looking for Redundancies for Data-Free Structured Compression of Deep Neural Networks

    Authors: Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly

    Abstract: Deep Neural Networks (DNNs) are ubiquitous in today's computer vision land-scape, despite involving considerable computational costs. The mainstream approaches for runtime acceleration consist in pruning connections (unstructured pruning) or, better, filters (structured pruning), both often requiring data to re-train the model. In this paper, we present RED, a data-free structured, unified approac… ▽ More

    Submitted 31 May, 2021; originally announced May 2021.

  22. arXiv:2010.07614  [pdf, other

    cs.CV

    THIN: THrowable Information Networks and Application for Facial Expression Recognition In The Wild

    Authors: Estephe Arnaud, Arnaud Dapogny, Kevin Bailly

    Abstract: For a number of machine learning problems, an exogenous variable can be identified such that it heavily influences the appearance of the different classes, and an ideal classifier should be invariant to this variable. An example of such exogenous variable is identity if facial expression recognition (FER) is considered. In this paper, we propose a dual exogenous/endogenous representation. The form… ▽ More

    Submitted 14 January, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

  23. arXiv:2004.07098  [pdf, other

    cs.CV

    DeeSCo: Deep heterogeneous ensemble with Stochastic Combinatory loss for gaze estimation

    Authors: Edouard Yvinec, Arnaud Dapogny, Kévin Bailly

    Abstract: From medical research to gaming applications, gaze estimation is becoming a valuable tool. While there exists a number of hardware-based solutions, recent deep learning-based approaches, coupled with the availability of large-scale databases, have allowed to provide a precise gaze estimate using only consumer sensors. However, there remains a number of questions, regarding the problem formulation,… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: 7 pages, 6 figures, FG 2020

  24. arXiv:2004.06558  [pdf, other

    cs.CV cs.LG

    Deep Entwined Learning Head Pose and Face Alignment Inside an Attentional Cascade with Doubly-Conditional fusion

    Authors: Arnaud Dapogny, Kévin Bailly, Matthieu Cord

    Abstract: Head pose estimation and face alignment constitute a backbone preprocessing for many applications relying on face analysis. While both are closely related tasks, they are generally addressed separately, e.g. by deducing the head pose from the landmark locations. In this paper, we propose to entwine face alignment and head pose tasks inside an attentional cascade. This cascade uses a geometry trans… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: Accepted for publication as an oral session @IEEE FG2020

  25. arXiv:1910.09450  [pdf, other

    cs.CV

    Tree-gated Deep Mixture-of-Experts For Pose-robust Face Alignment

    Authors: Estephe Arnaud, Arnaud Dapogny, Kevin Bailly

    Abstract: Face alignment consists of aligning a shape model on a face image. It is an active domain in computer vision as it is a preprocessing for a number of face analysis and synthesis applications. Current state-of-the-art methods already perform well on "easy" datasets, with moderate head pose variations, but may not be robust for "in-the-wild" data with poses up to 90°. In order to increase robustness… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

  26. arXiv:1907.03248  [pdf, other

    cs.CV

    Tree-gated Deep Regressor Ensemble For Face Alignment In The Wild

    Authors: Estephe Arnaud, Arnaud Dapogny, Kevin Bailly

    Abstract: Face alignment consists in aligning a shape model on a face in an image. It is an active domain in computer vision as it is a preprocessing for applications like facial expression recognition, face recognition and tracking, face animation, etc. Current state-of-the-art methods already perform well on "easy" datasets, i.e. those that present moderate variations in head pose, expression, illuminatio… ▽ More

    Submitted 10 July, 2019; v1 submitted 7 July, 2019; originally announced July 2019.

  27. arXiv:1904.02549  [pdf, other

    cs.CV

    DeCaFA: Deep Convolutional Cascade for Face Alignment In The Wild

    Authors: Arnaud Dapogny, Kévin Bailly, Matthieu Cord

    Abstract: Face Alignment is an active computer vision domain, that consists in localizing a number of facial landmarks that vary across datasets. State-of-the-art face alignment methods either consist in end-to-end regression, or in refining the shape in a cascaded manner, starting from an initial guess. In this paper, we introduce DeCaFA, an end-to-end deep convolutional cascade architecture for face align… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

  28. arXiv:1703.01597  [pdf, other

    cs.CV

    Face Alignment with Cascaded Semi-Parametric Deep Greedy Neural Forests

    Authors: Arnaud Dapogny, Kévin Bailly, Séverine Dubuisson

    Abstract: Face alignment is an active topic in computer vision, consisting in aligning a shape model on the face. To this end, most modern approaches refine the shape in a cascaded manner, starting from an initial guess. Those shape updates can either be applied in the feature point space (\textit{i.e.} explicit updates) or in a low-dimensional, parametric space. In this paper, we propose a semi-parametric… ▽ More

    Submitted 5 March, 2017; originally announced March 2017.

    Comments: 10 pages, 1 page appendix, 5 figures

  29. arXiv:1607.06290  [pdf, other

    cs.CV

    Confidence-Weighted Local Expression Predictions for Occlusion Handling in Expression Recognition and Action Unit detection

    Authors: Arnaud Dapogny, Kévin Bailly, Séverine Dubuisson

    Abstract: Fully-Automatic Facial Expression Recognition (FER) from still images is a challenging task as it involves handling large interpersonal morphological differences, and as partial occlusions can occasionally happen. Furthermore, labelling expressions is a time-consuming process that is prone to subjectivity, thus the variability may not be fully covered by the training data. In this work, we propose… ▽ More

    Submitted 21 July, 2016; originally announced July 2016.

  30. arXiv:1607.06250  [pdf, other

    cs.CV

    Dynamic Pose-Robust Facial Expression Recognition by Multi-View Pairwise Conditional Random Forests

    Authors: Arnaud Dapogny, Kévin Bailly, Séverine Dubuisson

    Abstract: Automatic facial expression classification (FER) from videos is a critical problem for the development of intelligent human-computer interaction systems. Still, it is a challenging problem that involves capturing high-dimensional spatio-temporal patterns describing the variation of one's appearance over time. Such representation undergoes great variability of the facial morphology and environmenta… ▽ More

    Submitted 21 July, 2016; originally announced July 2016.

    Comments: Extension of an ICCV 2015 paper