Skip to main content

Showing 1–16 of 16 results for author: Tabia, H

Searching in archive cs. Search in all archives.
.
  1. RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone Network

    Authors: Haozhe Sun, Isabelle Guyon, Felix Mohr, Hedi Tabia

    Abstract: It has become mainstream in computer vision and other machine learning domains to reuse backbone networks pre-trained on large datasets as preprocessors. Typically, the last layer is replaced by a shallow learning machine of sorts; the newly-added classification head and (optionally) deeper layers are fine-tuned on a new task. Due to its strong performance and simplicity, a common pre-trained back… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Journal ref: 2023 International Joint Conference on Neural Networks (IJCNN), Jun 2023, Gold Coast, Australia. pp.1-9

  2. arXiv:2304.05405  [pdf, other

    cs.LG cs.AI cs.CV

    Efficient Automation of Neural Network Design: A Survey on Differentiable Neural Architecture Search

    Authors: Alexandre Heuillet, Ahmad Nasser, Hichem Arioui, Hedi Tabia

    Abstract: In the past few years, Differentiable Neural Architecture Search (DNAS) rapidly imposed itself as the trending approach to automate the discovery of deep neural network architectures. This rise is mainly due to the popularity of DARTS, one of the first major DNAS methods. In contrast with previous works based on Reinforcement Learning or Evolutionary Algorithms, DNAS is faster by several orders of… ▽ More

    Submitted 30 April, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Under review at ACM Computing Surveys. 36 pages, 16 figures

  3. arXiv:2302.10266  [pdf, ps, other

    cs.CV

    Kernel function impact on convolutional neural networks

    Authors: M. Amine Mahmoudi, Aladine Chetouani, Fatma Boufera, Hedi Tabia

    Abstract: This paper investigates the usage of kernel functions at the different layers in a convolutional neural network. We carry out extensive studies of their impact on convolutional, pooling and fully-connected layers. We notice that the linear kernel may not be sufficiently effective to fit the input data distributions, whereas high order kernels prone to over-fitting. This leads to conclude that a tr… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  4. arXiv:2302.00384  [pdf, other

    cs.CV

    Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search

    Authors: Marie-Morgane Paumard, Hedi Tabia, David Picard

    Abstract: Solving jigsaw puzzles requires to grasp the visual features of a sequence of patches and to explore efficiently a solution space that grows exponentially with the sequence length. Therefore, visual deep reinforcement learning (DRL) should answer this problem more efficiently than optimization solvers coupled with neural networks. Based on this assumption, we introduce Alphazzle, a reassembly algo… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  5. arXiv:2302.00059  [pdf, other

    cs.CV cs.AI

    NASiam: Efficient Representation Learning using Neural Architecture Search for Siamese Networks

    Authors: Alexandre Heuillet, Hedi Tabia, Hichem Arioui

    Abstract: Siamese networks are one of the most trending methods to achieve self-supervised visual representation learning (SSL). Since hand labeling is costly, SSL can play a crucial part by allowing deep learning to train on large unlabeled datasets. Meanwhile, Neural Architecture Search (NAS) is becoming increasingly important as a technique to discover novel deep learning architectures. However, early NA… ▽ More

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: 8 pages, 6 figures

  6. arXiv:2109.11844  [pdf, other

    cs.CV

    Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images

    Authors: Tarek Ben Charrada, Hedi Tabia, Aladine Chetouani, Hamid Laga

    Abstract: We propose a novel deep reinforcement learning-based approach for 3D object reconstruction from monocular images. Prior works that use mesh representations are template based. Thus, they are limited to the reconstruction of objects that have the same topology as the template. Methods that use volumetric grids as intermediate representations are computationally expensive, which limits their applica… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

  7. arXiv:2108.09306  [pdf, other

    cs.LG cs.CV

    D-DARTS: Distributed Differentiable Architecture Search

    Authors: Alexandre Heuillet, Hedi Tabia, Hichem Arioui, Kamal Youcef-Toumi

    Abstract: Differentiable ARchiTecture Search (DARTS) is one of the most trending Neural Architecture Search (NAS) methods. It drastically reduces search cost by resorting to weight-sharing. However, it also dramatically reduces the search space, thus excluding potential promising architectures. In this article, we propose D-DARTS, a solution that addresses this problem by nesting neural networks at the cell… ▽ More

    Submitted 1 November, 2022; v1 submitted 20 August, 2021; originally announced August 2021.

    Comments: Submitted to Pattern Recognition Letters

  8. arXiv:2009.10814  [pdf, ps, other

    cs.CV

    Kernelized dense layers for facial expression recognition

    Authors: M. Amine Mahmoudi, Aladine Chetouani, Fatma Boufera, Hedi Tabia

    Abstract: Fully connected layer is an essential component of Convolutional Neural Networks (CNNs), which demonstrates its efficiency in computer vision tasks. The CNN process usually starts with convolution and pooling layers that first break down the input images into features, and then analyze them independently. The result of this process feeds into a fully connected neural network structure which drives… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

  9. arXiv:2009.01998  [pdf, other

    cs.CV

    SSP-Net: Scalable Sequential Pyramid Networks for Real-Time 3D Human Pose Regression

    Authors: Diogo Luvizon, Hedi Tabia, David Picard

    Abstract: In this paper we propose a highly scalable convolutional neural network, end-to-end trainable, for real-time 3D human pose regression from still RGB images. We call this approach the Scalable Sequential Pyramid Networks (SSP-Net) as it is trained with refined supervision at multiple scales in a sequential manner. Our network requires a single training procedure and is capable of producing its best… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Under review at PR

  10. Deepzzle: Solving Visual Jigsaw Puzzles with Deep Learning andShortest Path Optimization

    Authors: Marie-Morgane Paumard, David Picard, Hedi Tabia

    Abstract: We tackle the image reassembly problem with wide space between the fragments, in such a way that the patterns and colors continuity is mostly unusable. The spacing emulates the erosion of which the archaeological fragments suffer. We crop-square the fragments borders to compel our algorithm to learn from the content of the fragments. We also complicate the image reassembly by removing fragments an… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

    Journal ref: IEEE Transactions on Image Processing (2020)

  11. Multi-task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition

    Authors: Diogo C Luvizon, Hedi Tabia, David Picard

    Abstract: Human pose estimation and action recognition are related tasks since both problems are strongly dependent on the human body representation and analysis. Nonetheless, most recent methods in the literature handle the two problems separately. In this work, we propose a multi-task framework for jointly estimating 2D or 3D human poses from monocular color images and classifying human actions from video… ▽ More

    Submitted 3 March, 2020; v1 submitted 14 December, 2019; originally announced December 2019.

    Comments: Accepted to TPAMI. arXiv admin note: text overlap with arXiv:1802.09232

  12. arXiv:1911.09245  [pdf, other

    cs.CV

    Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

    Authors: Diogo C Luvizon, Hedi Tabia, David Picard

    Abstract: 3D human pose estimation is frequently seen as the task of estimating 3D poses relative to the root body joint. Alternatively, we propose a 3D human pose estimation method in camera coordinates, which allows effective combination of 2D annotated data and 3D poses and a straightforward multi-view generalization. To that end, we cast the problem as a view frustum space pose estimation, where absolut… ▽ More

    Submitted 20 August, 2021; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: Source code is available at https://github.com/dluvizon/3d-pose-consensus

  13. arXiv:1809.00898  [pdf, other

    cs.CV

    Image Reassembly Combining Deep Learning and Shortest Path Problem

    Authors: M. -M. Paumard, D. Picard, H. Tabia

    Abstract: This paper addresses the problem of reassembling images from disjointed fragments. More specifically, given an unordered set of fragments, we aim at reassembling one or several possibly incomplete images. The main contributions of this work are: 1) several deep neural architectures to predict the relative position of image fragments that outperform the previous state of the art; 2) casting the rea… ▽ More

    Submitted 4 September, 2018; originally announced September 2018.

    Comments: ECCV 2018

  14. arXiv:1807.03155  [pdf, other

    cs.CV cs.LG stat.ML

    Jigsaw Puzzle Solving Using Local Feature Co-Occurrences in Deep Neural Networks

    Authors: Marie-Morgane Paumard, David Picard, Hedi Tabia

    Abstract: Archaeologists are in dire need of automated object reconstruction methods. Fragments reassembly is close to puzzle problems, which may be solved by computer vision algorithms. As they are often beaten on most image related tasks by deep learning algorithms, we study a classification method that can solve jigsaw puzzles. In this paper, we focus on classifying the relative position: given a couple… ▽ More

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: ICIP 2018

  15. arXiv:1802.09232  [pdf, other

    cs.CV

    2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning

    Authors: Diogo C. Luvizon, David Picard, Hedi Tabia

    Abstract: Action recognition and human pose estimation are closely related but both problems are generally handled as distinct tasks in the literature. In this work, we propose a multitask framework for jointly 2D and 3D pose estimation from still images and human action recognition from video sequences. We show that a single architecture can be used to solve the two problems in an efficient way and still a… ▽ More

    Submitted 21 March, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: To appear in CVPR 2018

  16. arXiv:1710.02322  [pdf, other

    cs.CV

    Human Pose Regression by Combining Indirect Part Detection and Contextual Information

    Authors: Diogo C. Luvizon, Hedi Tabia, David Picard

    Abstract: In this paper, we propose an end-to-end trainable regression approach for human pose estimation from still images. We use the proposed Soft-argmax function to convert feature maps directly to joint coordinates, resulting in a fully differentiable framework. Our method is able to learn heat maps representations indirectly, without additional steps of artificial ground truth generation. Consequently… ▽ More

    Submitted 6 October, 2017; originally announced October 2017.