Skip to main content

Showing 1–9 of 9 results for author: Daliri, M

.
  1. arXiv:2406.03482  [pdf, other

    cs.LG cs.AI cs.CL cs.PF

    QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead

    Authors: Amir Zandieh, Majid Daliri, Insu Han

    Abstract: Serving LLMs requires substantial memory due to the storage requirements of Key-Value (KV) embeddings in the KV cache, which grows with sequence length. An effective approach to compress KV cache is quantization. However, traditional quantization methods face significant memory overhead due to the need to store quantization constants (at least a zero point and a scale) in full precision per data b… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 13 pages

  2. arXiv:2309.16157  [pdf, other

    cs.DB cs.DS

    Sampling Methods for Inner Product Sketching

    Authors: Majid Daliri, Juliana Freire, Christopher Musco, Aécio Santos, Haoxiang Zhang

    Abstract: Recently, Bessa et al. (PODS 2023) showed that sketches based on coordinated weighted sampling theoretically and empirically outperform popular linear sketching methods like Johnson-Lindentrauss projection and CountSketch for the ubiquitous problem of inner product estimation. We further develop this finding by introducing and analyzing two alternative sampling-based methods. In contrast to the co… ▽ More

    Submitted 15 January, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 17 pages, 10 figures

  3. arXiv:2308.05907  [pdf, ps, other

    cs.DS cs.DB

    Simple Analysis of Priority Sampling

    Authors: Majid Daliri, Juliana Freire, Christopher Musco, Aécio Santos, Haoxiang Zhang

    Abstract: We prove a tight upper bound on the variance of the priority sampling method (aka sequential Poisson sampling). Our proof is significantly shorter and simpler than the original proof given by Mario Szegedy at STOC 2006, which resolved a conjecture by Duffield, Lund, and Thorup.

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: 6 pages

  4. arXiv:2302.02451  [pdf, other

    cs.LG cs.CV cs.DS

    KDEformer: Accelerating Transformers via Kernel Density Estimation

    Authors: Amir Zandieh, Insu Han, Majid Daliri, Amin Karbasi

    Abstract: Dot-product attention mechanism plays a crucial role in modern deep architectures (e.g., Transformer) for sequence modeling, however, naïve exact computation of this model incurs quadratic time and memory complexities in sequence length, hindering the training of long-sequence models. Critical bottlenecks are due to the computation of partition functions in the denominator of softmax function as w… ▽ More

    Submitted 29 June, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: 26 pages, 7 figures

  5. arXiv:2301.05811  [pdf, other

    cs.DB cs.DS

    Weighted Minwise Hashing Beats Linear Sketching for Inner Product Estimation

    Authors: Aline Bessa, Majid Daliri, Juliana Freire, Cameron Musco, Christopher Musco, Aécio Santos, Haoxiang Zhang

    Abstract: We present a new approach for computing compact sketches that can be used to approximate the inner product between pairs of high-dimensional vectors. Based on the Weighted MinHash algorithm, our approach admits strong accuracy guarantees that improve on the guarantees of popular linear sketching approaches for inner product estimation, such as CountSketch and Johnson-Lindenstrauss projection. Spec… ▽ More

    Submitted 5 May, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: 23 pages, 6 figures

    Journal ref: In Proceedings of the ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (PODS) 2023

  6. Brain Electrical Stimulation for Animal Navigation

    Authors: Amirmasoud Ahmadi, Sepideh Farakhor Seghinsara, Mohammad Reza Daliri, Vahid Shalchyan

    Abstract: The brain stimulation and its widespread use is one of the most important subjects in studies of neurophysiology. In brain electrical stimulation methods, following the surgery and electrode implantation, electrodes send electrical impulses to the specific targets in the brain. The use of this stimulation method is provided therapeutic benefits for treatment chronic pain, essential tremor, Parkins… ▽ More

    Submitted 1 December, 2018; originally announced January 2019.

    Comments: in Farsi

    Journal ref: Iranian Journal of Biomedical Engineering, 11(1), pp. 83-100

  7. A New Method for Epileptic Seizure Classification in EEG Using Adapted Wavelet Packets

    Authors: Amirmasoud Ahmadi, Vahid Shalchyan, Mohammad Reza Daliri

    Abstract: Electroencephalography (EEG), as the most common tool for epileptic seizure classification, contains useful information about different physiological states of the brain. Seizure related features in EEG signals can be better identified when localized in time frequency basis projections. In this work, a novel method for epileptic seizure classification based on wavelet packets (WPs) is presented in… ▽ More

    Submitted 12 May, 2018; originally announced May 2018.

    Comments: Electroencephalography, Wavelet packets transform (WPT), Support vector machines (SVMs), Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT), 2017

  8. arXiv:1805.01743  [pdf

    eess.SP q-bio.NC stat.ML

    Classification of Epileptic EEG Signals by Wavelet based CFC

    Authors: Amirmasoud Ahmadi, Mahsa Behroozi, Vahid Shalchyan, Mohammad Reza Daliri

    Abstract: Electroencephalogram, an influential equipment for analyzing humans activities and recognition of seizure attacks can play a crucial role in designing accurate systems which can distinguish ictal seizures from regular brain alertness, since it is the first step towards accomplishing a high accuracy computer aided diagnosis system (CAD). In this article a novel approach for classification of ictal… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: Electroencephalogram; Wavelet Decomposition; Cross Frequency Coupling;Quadratic Discriminant Analysis; T-test Feature Selection

    Journal ref: Electrical-Electronics & Biomedical Engineering and Computer Science in 2018 (EBBT 2018)

  9. arXiv:1411.1257   

    q-bio.NC

    Low Frequency LFP in Macaque MT Predicts Reaction Time in an Attentive Task

    Authors: Kourosh Maboudi, Moein Esghaei, Mohammad Reza Daliri

    Abstract: Neural oscillations are related to a wide variety of cognitive functions, including attention. However, there is still a controversy over the frequency bands that have functional roles in attention. In this study, using a spatial attention task we found that phase of low frequency oscillations could predict the reaction time of the monkey, when the monkey is attending to the target stimulus as opp… ▽ More

    Submitted 9 November, 2014; v1 submitted 5 November, 2014; originally announced November 2014.

    Comments: 20 pages, 4 figures The paper has been withdrawn by the author due to misinterpretation of output results of the classification algorithm and so the produced figures have some major problems and need more investigation