Skip to main content

Showing 1–3 of 3 results for author: Muhawenayo, G

.
  1. arXiv:2406.16168  [pdf, other

    cs.LG

    An All-MLP Sequence Modeling Architecture That Excels at Copying

    Authors: Chenwei Cui, Zehao Yan, Gedeon Muhawenayo, Hannah Kerner

    Abstract: Recent work demonstrated Transformers' ability to efficiently copy strings of exponential sizes, distinguishing them from other architectures. We present the Causal Relation Network (CausalRN), an all-MLP sequence modeling architecture that can match Transformers on the copying task. Extending Relation Networks (RNs), we implemented key innovations to support autoregressive sequence modeling while… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024 Next Generation of Sequence Modeling Architectures Workshop

  2. arXiv:2209.11002  [pdf, other

    eess.IV cs.CV cs.LG

    Entropic Descent Archetypal Analysis for Blind Hyperspectral Unmixing

    Authors: Alexandre Zouaoui, Gedeon Muhawenayo, Behnood Rasti, Jocelyn Chanussot, Julien Mairal

    Abstract: In this paper, we introduce a new algorithm based on archetypal analysis for blind hyperspectral unmixing, assuming linear mixing of endmembers. Archetypal analysis is a natural formulation for this task. This method does not require the presence of pure pixels (i.e., pixels containing a single material) but instead represents endmembers as convex combinations of a few pixels present in the origin… ▽ More

    Submitted 26 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  3. arXiv:2102.02896  [pdf, ps, other

    cs.CV cs.LG eess.IV

    Compressed Object Detection

    Authors: Gedeon Muhawenayo, Georgia Gkioxari

    Abstract: Deep learning approaches have achieved unprecedented performance in visual recognition tasks such as object detection and pose estimation. However, state-of-the-art models have millions of parameters represented as floats which make them computationally expensive and constrain their deployment on hardware such as mobile phones and IoT nodes. Most commonly, activations of deep neural networks tend… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.