Showing 1–3 of 3 results for author: Rotem, D

Searching in archive cs.
  1. arXiv:2306.02307  [pdf, other]

    cs.CL cs.AI cs.LG

    Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings

    Authors: Daniel Rotem, Michael Hassid, Jonathan Mamou, Roy Schwartz

    Abstract: Adaptive inference is a simple method for reducing inference costs. The method works by maintaining multiple classifiers of different capacities, and allocating resources to each test instance according to its difficulty. In this work, we compare the two main approaches for adaptive inference, Early-Exit and Multi-Model, when training data is limited. First, we observe that for models with the sam…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Proceedings of ACL 2023

  2. arXiv:2211.03495  [pdf, other]

    cs.CL cs.LG

    How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

    Authors: Michael Hassid, Hao Peng, Daniel Rotem, Jungo Kasai, Ivan Montero, Noah A. Smith, Roy Schwartz

    Abstract: The attention mechanism is considered the backbone of the widely-used Transformer architecture. It contextualizes the input by computing input-specific attention matrices. We find that this mechanism, while powerful and elegant, is not as important as typically thought for pretrained language models. We introduce PAPA, a new probing method that replaces the input-dependent attention matrices with…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Findings of EMNLP 2022

  3. arXiv:1103.0260  [pdf, ps, other]

    cs.DS cs.DM

    A Linear Approximation Algorithm for 2-Dimensional Vector Packing

    Authors: Ekow Otoo, Ali Pinar, Doron Rotem

    Abstract: We study the 2-dimensional vector packing problem, which is a generalization of the classical bin packing problem where each item has 2 distinct weights and each bin has 2 corresponding capacities. The goal is to group items into minimum number of bins, without violating the bin capacity constraints. We propose a Θ(n)-time approximation algorithm that is inspired by the O(n^2) algorithm proposed…

    Submitted 1 March, 2011; originally announced March 2011.