Skip to main content

Showing 1–15 of 15 results for author: Taheri, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16266  [pdf, other

    cs.RO cs.LG eess.SY

    Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot Navigation

    Authors: Hamid Taheri, Seyed Rasoul Hosseini

    Abstract: Collision-free motion is essential for mobile robots. Most approaches to collision-free and efficient navigation with wheeled robots require parameter tuning by experts to obtain good navigation behavior. This study investigates the application of deep reinforcement learning to train a mobile robot for autonomous navigation in a complex environment. The robot utilizes LiDAR sensor data and a deep… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2404.02348  [pdf, other

    eess.IV cs.CV

    COVID-19 Detection Based on Blood Test Parameters using Various Artificial Intelligence Methods

    Authors: Kavian Khanjani, Seyed Rasoul Hosseini, Hamid Taheri, Shahrzad Shashaani, Mohammad Teshnehlab

    Abstract: In 2019, the world faced a new challenge: a COVID-19 disease caused by the novel coronavirus, SARS-CoV-2. The virus rapidly spread across the globe, leading to a high rate of mortality, which prompted health organizations to take measures to control its transmission. Early disease detection is crucial in the treatment process, and computer-based automatic detection systems have been developed to a… ▽ More

    Submitted 28 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  3. arXiv:2310.12680  [pdf, other

    cs.LG math.OC stat.ML

    On the Optimization and Generalization of Multi-head Attention

    Authors: Puneesh Deora, Rouzbeh Ghaderi, Hossein Taheri, Christos Thrampoulidis

    Abstract: The training and generalization dynamics of the Transformer's core mechanism, namely the Attention mechanism, remain under-explored. Besides, existing analyses primarily focus on single-head attention. Inspired by the demonstrated benefits of overparameterization when training fully-connected networks, we investigate the potential optimization and generalization advantages of using multiple attent… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 48 page; presented in the Workshop on High-dimensional Learning Dynamics, ICML 2023

  4. arXiv:2307.08994  [pdf, other

    cs.CV

    Human Action Recognition in Still Images Using ConViT

    Authors: Seyed Rohollah Hosseyni, Sanaz Seyedin, Hasan Taheri

    Abstract: Understanding the relationship between different parts of an image is crucial in a variety of applications, including object recognition, scene understanding, and image classification. Despite the fact that Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationship between different parts of a… ▽ More

    Submitted 11 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

  5. arXiv:2305.13471  [pdf, other

    cs.LG

    Fast Convergence in Learning Two-Layer Neural Networks with Separable Data

    Authors: Hossein Taheri, Christos Thrampoulidis

    Abstract: Normalized gradient descent has shown substantial success in speeding up the convergence of exponentially-tailed loss functions (which includes exponential and logistic losses) on linear classifiers with separable data. In this paper, we go beyond linear models by studying normalized GD on two-layer neural nets. We prove for exponentially-tailed losses that using normalized GD leads to linear rate… ▽ More

    Submitted 26 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  6. arXiv:2302.09235  [pdf, ps, other

    stat.ML cs.LG

    Generalization and Stability of Interpolating Neural Networks with Minimal Width

    Authors: Hossein Taheri, Christos Thrampoulidis

    Abstract: We investigate the generalization and optimization properties of shallow neural-network classifiers trained by gradient descent in the interpolating regime. Specifically, in a realizable scenario where model weights can achieve arbitrarily small training error $ε$ and their distance from initialization is $g(ε)$, we demonstrate that gradient descent with $n$ training data achieves training error… ▽ More

    Submitted 27 March, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: With significant changes: Stating results without homogeneity assumption, Discussing results under NTK-separability in Section 4

  7. arXiv:2209.07116  [pdf, other

    cs.LG cs.DC cs.MA eess.SP

    On Generalization of Decentralized Learning with Separable Data

    Authors: Hossein Taheri, Christos Thrampoulidis

    Abstract: Decentralized learning offers privacy and communication efficiency when data are naturally distributed among agents communicating over an underlying graph. Motivated by overparameterized learning settings, in which models are trained to zero training loss, we study algorithmic and generalization properties of decentralized learning with gradient descent on separable data. Specifically, for decentr… ▽ More

    Submitted 27 March, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Minor changes: fixing typos, few more references. Title changed to the title of conference version

  8. arXiv:2102.12717  [pdf

    cs.DC cs.DL cs.NI

    Cloud Broker: A Systematic Map** Study

    Authors: Hoda Taheri, Faeze Ramezani, Neda Mohammadi, Parisa Khoshdel, Bahareh Taghavi, Neda Khorasani, Saeid Abrishami, Abbas Rasoolzadegan

    Abstract: In a cloud environment, a cloud broker is an important entity that works as an independent middleware between cloud customers and providers to address issues and conduct negotiations related to satisfying both customer preferences and service provider profits. In recent years, researchers have published many articles which directly or indirectly address this research area. A systematic method is v… ▽ More

    Submitted 1 January, 2023; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: 19 pages, 11 Tables, 16 figures

  9. arXiv:2010.13275  [pdf, other

    stat.ML cs.IT cs.LG eess.SP

    Asymptotic Behavior of Adversarial Training in Binary Classification

    Authors: Hossein Taheri, Ramtin Pedarsani, Christos Thrampoulidis

    Abstract: It has been consistently reported that many machine learning models are susceptible to adversarial attacks i.e., small additive adversarial perturbations applied to data points can cause misclassification. Adversarial training using empirical risk minimization is considered to be the state-of-the-art method for defense against adversarial attacks. Despite being successful in practice, several prob… ▽ More

    Submitted 13 July, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

    Comments: V3: additional theoretical results, extensions to correlated features

  10. arXiv:2006.08917  [pdf, other

    stat.ML cs.IT cs.LG eess.SP

    Fundamental Limits of Ridge-Regularized Empirical Risk Minimization in High Dimensions

    Authors: Hossein Taheri, Ramtin Pedarsani, Christos Thrampoulidis

    Abstract: Empirical Risk Minimization (ERM) algorithms are widely used in a variety of estimation and prediction tasks in signal-processing and machine learning applications. Despite their popularity, a theory that explains their statistical properties in modern regimes where both the number of measurements and the number of unknown parameters is large is only recently emerging. In this paper, we characteri… ▽ More

    Submitted 5 July, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

  11. arXiv:2002.09964  [pdf, other

    cs.DC cs.LG cs.MA eess.SP eess.SY

    Quantized Decentralized Stochastic Learning over Directed Graphs

    Authors: Hossein Taheri, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani

    Abstract: We consider a decentralized stochastic learning problem where data points are distributed among computing nodes communicating over a directed graph. As the model size gets large, decentralized learning faces a major bottleneck that is the heavy communication load due to each node transmitting large messages (model updates) to its neighbors. To tackle this bottleneck, we propose the quantized decen… ▽ More

    Submitted 28 December, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

  12. arXiv:2002.07284  [pdf, other

    math.ST cs.IT eess.SP stat.ML

    Sharp Asymptotics and Optimal Performance for Inference in Binary Models

    Authors: Hossein Taheri, Ramtin Pedarsani, Christos Thrampoulidis

    Abstract: We study convex empirical risk minimization for high-dimensional inference in binary models. Our first result sharply predicts the statistical performance of such estimators in the linear asymptotic regime under isotropic Gaussian features. Importantly, the predictions hold for a wide class of convex loss functions, which we exploit in order to prove a bound on the best achievable performance amon… ▽ More

    Submitted 26 February, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

  13. arXiv:1908.07568  [pdf

    eess.SP cs.IT

    Power-Efficient Resource Allocation in Massive MIMO Aided Cloud RANs

    Authors: Nahid Amani, Saeedeh Parsaeefard, Hassan Taheri, Hossein Pedram

    Abstract: This paper considers the power-efficient resource allocation problem in a cloud radio access network (C-RAN). The C-RAN architecture consists of a set of base-band units (BBUs) which are connected to a set of radio remote heads (RRHs) equipped with massive multiple input multiple output (MIMO), via fronthaul links with limited capacity. We formulate the power-efficient optimization problem in C-RA… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

  14. arXiv:1908.04433  [pdf, other

    math.ST cs.IT cs.LG eess.SP

    Sharp Guarantees for Solving Random Equations with One-Bit Information

    Authors: Hossein Taheri, Ramtin Pedarsani, Christos Thrampoulidis

    Abstract: We study the performance of a wide class of convex optimization-based estimators for recovering a signal from corrupted one-bit measurements in high-dimensions. Our general result predicts sharply the performance of such estimators in the linear asymptotic regime when the measurement vectors have entries IID Gaussian. This includes, as a special case, the previously studied least-squares estimator… ▽ More

    Submitted 23 January, 2020; v1 submitted 12 August, 2019; originally announced August 2019.

  15. arXiv:1907.10595  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Robust and Communication-Efficient Collaborative Learning

    Authors: Amirhossein Reisizadeh, Hossein Taheri, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani

    Abstract: We consider a decentralized learning problem, where a set of computing nodes aim at solving a non-convex optimization problem collaboratively. It is well-known that decentralized optimization schemes face two major system bottlenecks: stragglers' delay and communication overhead. In this paper, we tackle these bottlenecks by proposing a novel decentralized and gradient-based optimization algorithm… ▽ More

    Submitted 31 October, 2019; v1 submitted 24 July, 2019; originally announced July 2019.