Skip to main content

Showing 1–14 of 14 results for author: Langlois, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.10053  [pdf, ps, other

    cs.AR

    Fast and Low-Cost Approximate Multiplier for FPGAs using Dynamic Reconfiguration

    Authors: Shervin Vakili, Mobin Vaziri, Amirhossein Zarei, J. M. Pierre Langlois

    Abstract: Multipliers are widely-used arithmetic operators in digital signal processing and machine learning circuits. Due to their relatively high complexity, they can have high latency and be a significant source of power consumption. One strategy to alleviate these limitations is to use approximate computing. This paper thus introduces an original FPGA-based approximate multiplier specifically optimized… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 5 figures, 3 tables

  2. arXiv:2306.12266  [pdf, other

    astro-ph.IM astro-ph.EP cs.LG

    Combining multi-spectral data with statistical and deep-learning models for improved exoplanet detection in direct imaging at high contrast

    Authors: Olivier Flasseur, Théo Bodrito, Julien Mairal, Jean Ponce, Maud Langlois, Anne-Marie Lagrange

    Abstract: Exoplanet detection by direct imaging is a difficult task: the faint signals from the objects of interest are buried under a spatially structured nuisance component induced by the host star. The exoplanet signals can only be identified when combining several observations with dedicated detection algorithms. In contrast to most of existing methods, we propose to learn a model of the spatial, tempor… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: accepted to EUSIPCO 2023

  3. Design Principles for Packet Deparsers on FPGAs

    Authors: Thomas Luinaud, Jeferson Santiago da Silva, J. M. Pierre Langlois, Yvon Savaria

    Abstract: The P4 language has drastically changed the networking field as it allows to quickly describe and implement new networking applications. Although a large variety of applications can be described with the P4 language, current programmable switch architectures impose significant constraints on P4 programs. To address this shortcoming, FPGAs have been explored as potential targets for P4 applications… ▽ More

    Submitted 13 March, 2021; originally announced March 2021.

    Comments: Presented at ISFPGA'21, 2021 Source code available at : https://github.com/luinaudt/deparser/tree/FPGA_paper

    ACM Class: B.5.1

  4. arXiv:2010.00627  [pdf

    cs.AR

    CARLA: A Convolution Accelerator with a Reconfigurable and Low-Energy Architecture

    Authors: Mehdi Ahmadi, Shervin Vakili, J. M. Pierre Langlois

    Abstract: Convolutional Neural Networks (CNNs) have proven to be extremely accurate for image recognition, even outperforming human recognition capability. When deployed on battery-powered mobile devices, efficient computer architectures are required to enable fast and energy-efficient computation of costly convolution operations. Despite recent advances in hardware accelerator design for CNNs, two major pr… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: 12 pages

  5. arXiv:2009.03576  [pdf, other

    cs.IT

    Primal-dual splitting scheme with backtracking for handling with epigraphic constraint and sparse analysis regularization

    Authors: Laurence Denneulin, Nelly Pustelnik, Maud Langlois, Ignace Loris, Éric Thiébaut

    Abstract: The convergence of many proximal algorithms involving a gradient descent relies on its Lipschitz constant. To avoid computing it, backtracking rules can be used. While such a rule has already been designed for the forward-backward algorithm (FBwB), this scheme is not flexible enough when a non-differentiable penalization with a linear operator is added to a constraint. In this work we propose a ba… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: in Proceedings of iTWIST'20, Paper-ID: 12, Nantes, France, December, 2-4, 2020

  6. arXiv:2004.07733  [pdf, other

    cs.AR cs.NI

    Bridging the Gap: FPGAs as Programmable Switches

    Authors: Thomas Luinaud, Thibaut Stimpfling, Jeferson Santiago da Silva, Yvon Savaria, J. M. Pierre Langlois

    Abstract: The emergence of P4, a domain specific language, coupled to PISA, a domain specific architecture, is revolutionizing the networking field. P4 allows to describe how packets are processed by a programmable data plane, spanning ASICs and CPUs, implementing PISA. Because the processing flexibility can be limited on ASICs, while the CPUs performance for networking tasks lag behind, recent works have p… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: To be published in : IEEE International Conference on High Performance Switching and Routing 2020

    ACM Class: B.5.1

  7. arXiv:2002.09794  [pdf, other

    cs.LG stat.ML

    PoET-BiN: Power Efficient Tiny Binary Neurons

    Authors: Sivakumar Chidambaram, J. M. Pierre Langlois, Jean Pierre David

    Abstract: The success of neural networks in image classification has inspired various hardware implementations on embedded platforms such as Field Programmable Gate Arrays, embedded processors and Graphical Processing Units. These embedded platforms are constrained in terms of power, which is mainly consumed by the Multiply Accumulate operations and the memory accesses for weight fetching. Quantization and… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

    Comments: Accepted in MLSys 2020 conference

  8. arXiv:2002.07711  [pdf

    eess.SP cs.AR

    An Energy-Efficient Accelerator Architecture with Serial Accumulation Dataflow for Deep CNNs

    Authors: Mehdi Ahmadi, Shervin Vakili, J. M. Pierre Langlois

    Abstract: Convolutional Neural Networks (CNNs) have shown outstanding accuracy for many vision tasks during recent years. When deploying CNNs on portable devices and embedded systems, however, the large number of parameters and computations result in long processing time and low battery life. An important factor in designing CNN hardware accelerators is to efficiently map the convolution computation onto ha… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: 4 pages

  9. arXiv:1903.06693  [pdf, ps, other

    cs.DC

    Module-per-Object: a Human-Driven Methodology for C++-based High-Level Synthesis Design

    Authors: Jeferson Santiago da Silva, François-Raymond Boyer, J. M. Pierre Langlois

    Abstract: High-Level Synthesis (HLS) brings FPGAs to audiences previously unfamiliar to hardware design. However, achieving the highest Quality-of-Results (QoR) with HLS is still unattainable for most programmers. This requires detailed knowledge of FPGA architecture and hardware design in order to produce FPGA-friendly codes. Moreover, these codes are normally in conflict with best coding practices, which… ▽ More

    Submitted 9 April, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: 9 pages. Paper accepted for publication at The 27th IEEE International Symposium on Field-Programmable Custom Computing Machines, San Diego CA, April 28 - May 1, 2019

  10. arXiv:1711.09155  [pdf, other

    cs.DS

    SHIP: A Scalable High-performance IPv6 Lookup Algorithm that Exploits Prefix Characteristics

    Authors: Thibaut Stimpfling, Normand Bélanger, J. M. Pierre Langlois, Yvon Savaria

    Abstract: Due to the emergence of new network applications, current IP lookup engines must support high-bandwidth, low lookup latency and the ongoing growth of IPv6 networks. However, existing solutions are not designed to address jointly those three requirements. This paper introduces SHIP, an IPv6 lookup algorithm that exploits prefix characteristics to build a two-level data structure designed to meet fu… ▽ More

    Submitted 24 November, 2017; originally announced November 2017.

    Comments: Submitted to EEE/ACM Transactions on Networking

  11. P4-compatible High-level Synthesis of Low Latency 100 Gb/s Streaming Packet Parsers in FPGAs

    Authors: Jeferson Santiago da Silva, François-Raymond Boyer, J. M. Pierre Langlois

    Abstract: Packet parsing is a key step in SDN-aware devices. Packet parsers in SDN networks need to be both reconfigurable and fast, to support the evolving network protocols and the increasing multi-gigabit data rates. The combination of packet processing languages with FPGAs seems to be the perfect match for these requirements. In this work, we develop an open-source FPGA-based configurable architecture f… ▽ More

    Submitted 17 November, 2017; originally announced November 2017.

    Comments: Accepted for publication at the 26th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays February 25 - 27, 2018 Monterey Marriott Hotel, Monterey, California, 7 pages, 7 figures, 1 table

  12. arXiv:1612.09524  [pdf

    cs.CV cs.AR

    Memory Efficient Multi-Scale Line Detector Architecture for Retinal Blood Vessel Segmentation

    Authors: Hamza Bendaoudi, Farida Cheriet, J. M. Pierre Langlois

    Abstract: This paper presents a memory efficient architecture that implements the Multi-Scale Line Detector (MSLD) algorithm for real-time retinal blood vessel detection in fundus images on a Zynq FPGA. This implementation benefits from the FPGA parallelism to drastically reduce the memory requirements of the MSLD from two images to a few values. The architecture is optimized in terms of resource utilizatio… ▽ More

    Submitted 6 December, 2016; originally announced December 2016.

    Comments: This paper was accepted and presented at Conference on Design and Architectures for Signal and Image Processing - DASIP 2016

  13. arXiv:1611.05943  [pdf, ps, other

    cs.NI

    Extern Objects in P4: an ROHC Header Compression Scheme Case Study

    Authors: Jeferson Santiago da Silva, François-Raymond Boyer, Laurent-Olivier Chiquette, J. M. Pierre Langlois

    Abstract: P4 is an emergent packet-processing language with which the user can describe how the packets are to be processed in a switching element. This paper presents a way to implement complex operations that are not natively supported in P4. In this work, we explored two different methods to add extensions to P4: i) using new native primitives and ii) using extern instances. As a case study, an ROHC enti… ▽ More

    Submitted 21 March, 2018; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: 6 pages, 4 figures, 3 listings

  14. arXiv:1609.07750  [pdf, other

    cs.NE cs.LG

    Accurate and Efficient Hyperbolic Tangent Activation Function on FPGA using the DCT Interpolation Filter

    Authors: Ahmed M. Abdelsalam, J. M. Pierre Langlois, F. Cheriet

    Abstract: Implementing an accurate and fast activation function with low cost is a crucial aspect to the implementation of Deep Neural Networks (DNNs) on FPGAs. We propose a high-accuracy approximation approach for the hyperbolic tangent activation function of artificial neurons in DNNs. It is based on the Discrete Cosine Transform Interpolation Filter (DCTIF). The proposed architecture combines simple arit… ▽ More

    Submitted 25 September, 2016; originally announced September 2016.

    Comments: 8 pages, 6 figures, 5 tables, submitted for the 25th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (ISFPGA), 22-24 February 2017, California, USA