
Showing 1–2 of 2 results for author: Elelimy, E

  1. arXiv:2402.13425 [pdf, other]

    cs.LG cs.AI stat.ML

    Investigating the Histogram Loss in Regression

    Authors: Ehsan Imani, Kai Luedemann, Sam Scholnick-Hughes, Esraa Elelimy, Martha White

    Abstract: It is becoming increasingly common in regression to train neural networks that model the entire distribution even if only the mean is required for prediction. This additional modeling often comes with performance gain and the reasons behind the improvement are not fully known. This paper investigates a recent approach to regression, the Histogram Loss, which involves learning the conditional distr…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 50 pages

  2. arXiv:2310.15719 [pdf, other]

    cs.LG cs.AI

    Recurrent Linear Transformers

    Authors: Subhojeet Pramanik, Esraa Elelimy, Marlos C. Machado, Adam White

    Abstract: The self-attention mechanism in the transformer architecture is capable of capturing long-range dependencies and it is the main reason behind its effectiveness in processing sequential data. Nevertheless, despite their success, transformers have two significant drawbacks that still limit their broader applicability: (1) In order to remember past information, the self-attention mechanism requires a…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: transformers, reinforcement learning, partial observability