Skip to main content

Showing 1–3 of 3 results for author: Keivanloo, I

.
  1. arXiv:2111.00230  [pdf, other

    cs.CL

    Magic Pyramid: Accelerating Inference with Early Exiting and Token Pruning

    Authors: Xuanli He, Iman Keivanloo, Yi Xu, Xiang He, Belinda Zeng, Santosh Rajagopalan, Trishul Chilimbi

    Abstract: Pre-training and then fine-tuning large language models is commonly used to achieve state-of-the-art performance in natural language processing (NLP) tasks. However, most pre-trained models suffer from low inference speed. Deploying such large models to applications with latency constraints is challenging. In this work, we focus on accelerating the inference via conditional computations. To achiev… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: 8 pages

  2. arXiv:2110.08919  [pdf, other

    cs.IR cs.DB

    Low-Precision Quantization for Efficient Nearest Neighbor Search

    Authors: Anthony Ko, Iman Keivanloo, Vihan Lakshman, Eric Schkufza

    Abstract: Fast k-Nearest Neighbor search over real-valued vector spaces (KNN) is an important algorithmic task for information retrieval and recommendation systems. We present a method for using reduced precision to represent vectors through quantized integer values, enabling both a reduction in the memory overhead of indexing these vectors and faster distance computations at query time. While most traditio… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: 5 pages

  3. Recommending Insightful Comments for Source Code using Crowdsourced Knowledge

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy, Iman Keivanloo

    Abstract: Recently, automatic code comment generation is proposed to facilitate program comprehension. Existing code comment generation techniques focus on describing the functionality of the source code. However, there are other aspects such as insights about quality or issues of the code, which are overlooked by earlier approaches. In this paper, we describe a mining approach that recommends insightful co… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: The 15th IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM 2015), pp. 81--90, Bremen, Germany, September 2015

    Journal ref: Proc. SCAM 2015, pp. 81--90