Skip to main content

Showing 1–29 of 29 results for author: Abdi, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02490  [pdf, other

    cs.CL cs.LG

    MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

    Authors: Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu

    Abstract: The computational challenges of Large Language Model (LLM) inference remain a significant barrier to their widespread deployment, especially as prompt lengths continue to increase. Due to the quadratic complexity of the attention computation, it takes 30 minutes for an 8B LLM to process a prompt of 1M tokens (i.e., the pre-filling stage) on a single A100 GPU. Existing methods for speeding up prefi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2404.02933  [pdf, other

    cs.DB cs.AI cs.CL

    NL2KQL: From Natural Language to Kusto Query

    Authors: Amir H. Abdi, Xinye Tang, Jeremias Eichelbaum, Mahan Das, Alex Klein, Nihal Irmak Pakis, William Blum, Daniel L Mace, Tanvi Raja, Namrata Padmanabhan, Ye Xing

    Abstract: Data is growing rapidly in volume and complexity. Proficiency in database query languages is pivotal for crafting effective queries. As coding assistants become more prevalent, there is significant opportunity to enhance database query languages. The Kusto Query Language (KQL) is a widely used query language for large semi-structured data such as logs, telemetries, and time-series for big data ana… ▽ More

    Submitted 15 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  3. Longest Common Substring in Longest Common Subsequence's Solution Service: A Novel Hyper-Heuristic

    Authors: Alireza Abdi, Masih Hajsaeedi, Mohsen Hooshmand

    Abstract: The Longest Common Subsequence (LCS) is the problem of finding a subsequence among a set of strings that has two properties of being common to all and is the longest. The LCS has applications in computational biology and text editing, among many others. Due to the NP-hardness of the general longest common subsequence, numerous heuristic algorithms and solvers have been proposed to give the best po… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

  4. arXiv:2206.11726  [pdf, other

    cs.DS

    Longest Common Subsequence: Tabular vs. Closed-Form Equation Computation of Subsequence Probability

    Authors: Alireza Abdi, Mohsen Hooshmand

    Abstract: The Longest Common Subsequence Problem (LCS) deals with finding the longest subsequence among a given set of strings. The LCS problem is an NP-hard problem which makes it a target for lots of effort to find a better solution with heuristics methods. The baseline for most famous heuristics functions is a tabular random, probabilistic approach. This approach approximates the length of the LCS in eac… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  5. arXiv:2206.09034  [pdf, other

    cs.LG cs.AI cs.CV

    Towards Better Selective Classification

    Authors: Leo Feng, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Amir Abdi

    Abstract: We tackle the problem of Selective Classification where the objective is to achieve the best performance on a predetermined ratio (coverage) of the dataset. Recent state-of-the-art selective methods come with architectural changes either via introducing a separate selection head or an extra abstention logit. In this paper, we challenge the aforementioned methods. The results suggest that the super… ▽ More

    Submitted 1 March, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

  6. arXiv:2206.04038  [pdf, other

    cs.LG

    Scaleformer: Iterative Multi-scale Refining Transformers for Time Series Forecasting

    Authors: Amin Shabani, Amir Abdi, Lili Meng, Tristan Sylvain

    Abstract: The performance of time series forecasting has recently been greatly improved by the introduction of transformers. In this paper, we propose a general multi-scale framework that can be applied to the state-of-the-art transformer-based time series forecasting models (FEDformer, Autoformer, etc.). By iteratively refining a forecasted time series at multiple scales with shared weights, introducing ar… ▽ More

    Submitted 6 February, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: ICLR 2023

  7. arXiv:2205.03454  [pdf, ps, other

    stat.ML cs.LG eess.SP

    Structure Learning in Graphical Models from Indirect Observations

    Authors: Hang Zhang, Afshin Abdi, Faramarz Fekri

    Abstract: This paper considers learning of the graphical structure of a $p$-dimensional random vector $X \in R^p$ using both parametric and non-parametric methods. Unlike the previous works which observe $x$ directly, we consider the indirect observation scenario in which samples $y$ are collected via a sensing matrix $A \in R^{d\times p}$, and corrupted with some additive noise $w$, i.e, $Y = AX + W$. For… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  8. arXiv:2204.01262  [pdf, ps, other

    cs.AR

    FT-EALU: Fault Tolerant Arithmetic and Logic Unit for Critical Embedded and Real time Systems

    Authors: Athena Abdi, Sina Shahoveisi

    Abstract: In this paper, a fault-tolerant approach to mitigate transient and permanent faults of arithmetic and logic operations of embedded processors called FT-EALU is proposed. In this method, each operation is replicated in time and the derived final results are voted to generate the final output. To consider the effect of permanent faults, replicating identical operations in time is not sufficient, and… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  9. A novel evolutionary-based neuro-fuzzy task scheduling approach to jointly optimize the main design challenges of heterogeneous MPSoCs

    Authors: Athena Abdi, Armin Salimi-Badr

    Abstract: In this paper, an online task scheduling and map** method based on a fuzzy neural network (FNN) learned by an evolutionary multi-objective algorithm (NSGA-II) to jointly optimize the main design challenges of heterogeneous MPSoCs is proposed. In this approach, first, the FNN parameters are trained using an NSGA-II-based optimization engine by considering the main design challenges of MPSoCs incl… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: in IEEE Transactions on Sustainable Computing

  10. arXiv:2201.09483  [pdf, other

    cs.LG cs.DC cs.IT eess.SP stat.ML

    A Machine Learning Framework for Distributed Functional Compression over Wireless Channels in IoT

    Authors: Yashas Malur Saidutta, Afshin Abdi, Faramarz Fekri

    Abstract: IoT devices generating enormous data and state-of-the-art machine learning techniques together will revolutionize cyber-physical systems. In many diverse fields, from autonomous driving to augmented reality, distributed IoT devices compute specific target functions without simple forms like obstacle detection, object recognition, etc. Traditional cloud-based methods that focus on transferring data… ▽ More

    Submitted 30 April, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

  11. arXiv:2106.10656  [pdf, other

    cs.LG cs.SI stat.ML

    TD-GEN: Graph Generation With Tree Decomposition

    Authors: Hamed Shirzad, Hossein Hajimirsadeghi, Amir H. Abdi, Greg Mori

    Abstract: We propose TD-GEN, a graph generation framework based on tree decomposition, and introduce a reduced upper bound on the maximum number of decisions needed for graph generation. The framework includes a permutation invariant tree generation model which forms the backbone of graph generation. Tree nodes are supernodes, each representing a cluster of nodes in the graph. Graph nodes and edges are incr… ▽ More

    Submitted 23 February, 2022; v1 submitted 20 June, 2021; originally announced June 2021.

  12. arXiv:2011.10529  [pdf

    q-bio.MN cs.IT q-bio.SC stat.CO

    Computation capacities of a broad class of signaling networks are higher than their communication capacities

    Authors: Iman Habibi, Effat S Emamian, Osvaldo Simeone, Ali Abdi

    Abstract: Due to structural and functional abnormalities or genetic variations and mutations, there may be dysfunctional molecules within an intracellular signaling network that do not allow the network to correctly regulate its output molecules, such as transcription factors. This disruption in signaling interrupts normal cellular functions and may eventually develop some pathological conditions. In this p… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: 51 pages, 8 figures

    Journal ref: Phys. Biol. 16 064001 (2019)

  13. arXiv:2008.08289  [pdf, other

    cs.LG cs.DC stat.ML

    Restructuring, Pruning, and Adjustment of Deep Models for Parallel Distributed Inference

    Authors: Afshin Abdi, Saeed Rashidi, Faramarz Fekri, Tushar Krishna

    Abstract: Using multiple nodes and parallel computing algorithms has become a principal tool to improve training and execution times of deep neural networks as well as effective collective intelligence in sensor networks. In this paper, we consider the parallel implementation of an already-trained deep model on multiple processing nodes (a.k.a. workers) where the deep model is divided into several parallel… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

  14. arXiv:1912.05184  [pdf, other

    cs.LG stat.ML

    Variational Learning with Disentanglement-PyTorch

    Authors: Amir H. Abdi, Purang Abolmaesumi, Sidney Fels

    Abstract: Unsupervised learning of disentangled representations is an open problem in machine learning. The Disentanglement-PyTorch library is developed to facilitate research, implementation, and testing of new variational algorithms. In this modular library, neural architectures, dimensionality of the latent space, and the training algorithms are fully decoupled, allowing for independent and consistent ex… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Disentanglement Challenge - 33rd Conference on Neural Information Processing Systems (NeurIPS) - NeurIPS 2019

  15. arXiv:1912.03120  [pdf, other

    eess.IV cs.LG stat.ML

    A Study into Echocardiography View Conversion

    Authors: Amir H. Abdi, Mohammad H. Jafari, Sidney Fels, Theresa Tsang, Purang Abolmaesumi

    Abstract: Transthoracic echo is one of the most common means of cardiac studies in the clinical routines. During the echo exam, the sonographer captures a set of standard cross sections (echo views) of the heart. Each 2D echo view cuts through the 3D cardiac geometry via a unique plane. Consequently, different views share some limited information. In this work, we investigate the feasibility of generating a… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: Workshop of Medical Imaging Meets NeurIPS, NeurIPS 2019

  16. arXiv:1912.00614  [pdf, other

    math.CO cs.DM

    Idealness of $k$-wise intersecting families

    Authors: Ahmad Abdi, Gérard Cornuéjols, Tony Huynh, Dabeen Lee

    Abstract: A clutter is \emph{$k$-wise intersecting} if every $k$ members have a common element, yet no element belongs to all members. We conjecture that, for some integer $k\geq 4$, every $k$-wise intersecting clutter is non-ideal. As evidence for our conjecture, we prove it for $k=4$ for the class of binary clutters. Two key ingredients for our proof are Jaeger's $8$-flow theorem for graphs, and Seymour's… ▽ More

    Submitted 3 October, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: 20 pages, 2 figures. An extended abstract under the same title appeared in the 21st Conference in Integer Programming and Combinatorial Optimization

    MSC Class: 90C10; 90C27; 05C21; 52B40

  17. arXiv:1911.11791  [pdf, other

    cs.LG stat.ML

    A Preliminary Study of Disentanglement With Insights on the Inadequacy of Metrics

    Authors: Amir H. Abdi, Purang Abolmaesumi, Sidney Fels

    Abstract: Disentangled encoding is an important step towards a better representation learning. However, despite the numerous efforts, there still is no clear winner that captures the independent features of the data in an unsupervised fashion. In this work we empirically evaluate the performance of six unsupervised disentanglement approaches on the mpi3d toy dataset curated and released for the NeurIPS 2019… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: Disentanglement Challenge - NeurIPS 2019

  18. arXiv:1911.02121  [pdf, other

    eess.IV cs.LG stat.ML

    GAN-enhanced Conditional Echocardiogram Generation

    Authors: Amir H. Abdi, Teresa Tsang, Purang Abolmaesumi

    Abstract: Echocardiography (echo) is a common means of evaluating cardiac conditions. Due to the label scarcity, semi-supervised paradigms in automated echo analysis are getting traction. One of the most sought-after problems in echo is the segmentation of cardiac structures (e.g. chambers). Accordingly, we propose an echocardiogram generation approach using generative adversarial networks with a conditiona… ▽ More

    Submitted 23 November, 2019; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: Workshop of Medical Imaging Meets NeurIPS, NeurIPS 2019

  19. arXiv:1911.00674  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    On Modelling Label Uncertainty in Deep Neural Networks: Automatic Estimation of Intra-observer Variability in 2D Echocardiography Quality Assessment

    Authors: Zhibin Liao, Hany Girgis, Amir Abdi, Hooman Vaseli, Jorden Hetherington, Robert Rohling, Ken Gin, Teresa Tsang, Purang Abolmaesumi

    Abstract: Uncertainty of labels in clinical data resulting from intra-observer variability can have direct impact on the reliability of assessments made by deep neural networks. In this paper, we propose a method for modelling such uncertainty in the context of 2D echocardiography (echo), which is a routine procedure for detecting cardiovascular disease at point-of-care. Echo imaging quality and acquisition… ▽ More

    Submitted 2 November, 2019; originally announced November 2019.

  20. Variational Shape Completion for Virtual Planning of Jaw Reconstructive Surgery

    Authors: Amir H. Abdi, Mehran Pesteie, Eitan Prisman, Purang Abolmaesumi, Sidney Fels

    Abstract: The premorbid geometry of the mandible is of significant relevance in jaw reconstructive surgeries and occasionally unknown to the surgical team. In this paper, an optimization framework is introduced to train deep models for completion (reconstruction) of the missing segments of the bone based on the remaining healthy structure. To leverage the contextual information of the surroundings of the di… ▽ More

    Submitted 15 July, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: Proceedings of Medical Image Computing and Computer Assisted Intervention - {MICCAI} 2019

  21. arXiv:1905.03567  [pdf, other

    cs.IT eess.SP

    Stochastic Fading Channel Models with Multiple Dominant Specular Components for 5G and Beyond

    Authors: Juan M. Romero-Jerez, F. Javier Lopez-Martinez, Juan P. Peña-Martin, Ali Abdi

    Abstract: We introduce a comprehensive statistical characterization of the multipath wireless channel built as a superposition of a number of scattered waves with random phases. We consider an arbitrary number $N$ of specular (dominant) components plus other diffusely propagating waves. Our approach covers the cases on which the specular components have constant amplitudes, as well as when these components… ▽ More

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: This work has been submitted to the IEEE for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  22. arXiv:1904.01197  [pdf, other

    cs.DC

    Nested Dithered Quantization for Communication Reduction in Distributed Training

    Authors: Afshin Abdi, Faramarz Fekri

    Abstract: In distributed training, the communication cost due to the transmission of gradients or the parameters of the deep model is a major bottleneck in scaling up the number of processing nodes. To address this issue, we propose \emph{dithered quantization} for the transmission of the stochastic gradients and show that training with \emph{Dithered Quantized Stochastic Gradients (DQSG)} is similar to the… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

  23. Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning

    Authors: Amir H. Abdi, Pramit Saha, Praneeth Srungarapu, Sidney Fels

    Abstract: Motor control is a set of time-varying muscle excitations which generate desired motions for a biomechanical system. Muscle excitations cannot be directly measured from live subjects. An alternative approach is to estimate muscle activations using inverse motion-driven simulation. In this article, we propose a deep reinforcement learning method to estimate the muscle excitations in simulated biome… ▽ More

    Submitted 3 May, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

    Comments: 9 pages, 3 figures. Computational Biomechanics for Medicine. MICCAI 2019. Springer, Cham

  24. arXiv:1806.06457  [pdf, other

    cs.LG stat.ML

    Fast Convex Pruning of Deep Neural Networks

    Authors: Alireza Aghasi, Afshin Abdi, Justin Romberg

    Abstract: We develop a fast, tractable technique called Net-Trim for simplifying a trained neural network. The method is a convex post-processing module, which prunes (sparsifies) a trained network layer by layer, while preserving the internal responses. We present a comprehensive analysis of Net-Trim from both the algorithmic and sample complexity standpoints, centered on a fast, scalable convex optimizati… ▽ More

    Submitted 25 February, 2019; v1 submitted 17 June, 2018; originally announced June 2018.

  25. arXiv:1611.05162  [pdf, other

    cs.LG stat.ML

    Net-Trim: Convex Pruning of Deep Neural Networks with Performance Guarantee

    Authors: Alireza Aghasi, Afshin Abdi, Nam Nguyen, Justin Romberg

    Abstract: We introduce and analyze a new technique for model reduction for deep neural networks. While large networks are theoretically capable of learning arbitrarily complex models, overfitting and model redundancy negatively affects the prediction accuracy and model variance. Our Net-Trim algorithm prunes (sparsifies) a trained network layer-wise, removing connections at each layer by solving a convex op… ▽ More

    Submitted 23 November, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

  26. arXiv:1502.03578  [pdf, ps, other

    cs.DM

    Lower Bounds for Cover-Free Families

    Authors: Ali Z. Abdi, Nader H. Bshouty

    Abstract: Let ${\cal F}$ be a set of blocks of a $t$-set $X$. $(X,{\cal F})$ is called $(w,r)$-cover-free family ($(w,r)-$CFF) provided that, the intersection of any $w$ blocks in ${\cal F}$ is not contained in the union of any other $r$ blocks in ${\cal F}$. We give new asymptotic lower bounds for the number of minimum points $t$ in a $(w,r)$-CFF when $w\le r=|{\cal F}|^ε$ for some constant $ε\ge 1/2$.

    Submitted 31 March, 2015; v1 submitted 12 February, 2015; originally announced February 2015.

  27. arXiv:1405.1535  [pdf, ps, other

    cs.LG

    Learning Boolean Halfspaces with Small Weights from Membership Queries

    Authors: Hasan Abasi, Ali Z. Abdi, Nader H. Bshouty

    Abstract: We consider the problem of proper learning a Boolean Halfspace with integer weights $\{0,1,\ldots,t\}$ from membership queries only. The best known algorithm for this problem is an adaptive algorithm that asks $n^{O(t^5)}$ membership queries where the best lower bound for the number of membership queries is $n^t$ [Learning Threshold Functions with Small Weights Using Membership Queries. COLT 1999]… ▽ More

    Submitted 7 May, 2014; originally announced May 2014.

  28. arXiv:cs/0604033  [pdf, ps, other

    cs.IT

    Statistical Properties of Eigen-Modes and Instantaneous Mutual Information in MIMO Time-Varying Rayleigh Channels

    Authors: Shuangquan Wang, Ali Abdi

    Abstract: In this paper, we study two important metrics in multiple-input multiple-output (MIMO) time-varying Rayleigh flat fading channels. One is the eigen-mode, and the other is the instantaneous mutual information (IMI). Their second-order statistics, such as the correlation coefficient, level crossing rate (LCR), and average fade/outage duration, are investigated, assuming a general nonisotropic scat… ▽ More

    Submitted 8 April, 2006; originally announced April 2006.

    Comments: 25 pages, 7 figures, 1 table, submitted to IEEE Trans. Inform. Theory, Apr., 2006

  29. arXiv:cs/0603027  [pdf, ps, other

    cs.IT

    On the Second-Order Statistics of the Instantaneous Mutual Information in Rayleigh Fading Channels

    Authors: Shuangquan Wang, Ali Abdi

    Abstract: In this paper, the second-order statistics of the instantaneous mutual information are studied, in time-varying Rayleigh fading channels, assuming general non-isotropic scattering environments. Specifically, first the autocorrelation function, correlation coefficient, level crossing rate, and the average outage duration of the instantaneous mutual information are investigated in single-input sin… ▽ More

    Submitted 7 March, 2006; originally announced March 2006.

    Comments: 11 pages, 6 figures, submitted to IEEE Trans. Inform. Theory, Dec. 2005