Skip to main content

Showing 1–16 of 16 results for author: Karimi, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05189  [pdf

    cs.CL

    Enhancing Language Learning through Technology: Introducing a New English-Azerbaijani (Arabic Script) Parallel Corpus

    Authors: Jalil Nourmohammadi Khiarak, Ammar Ahmadi, Taher Ak-bari Saeed, Meysam Asgari-Chenaghlu, Toğrul Atabay, Mohammad Reza Baghban Karimi, Ismail Ceferli, Farzad Hasanvand, Seyed Mahboub Mousavi, Morteza Noshad

    Abstract: This paper introduces a pioneering English-Azerbaijani (Arabic Script) parallel corpus, designed to bridge the technological gap in language learning and machine translation (MT) for under-resourced languages. Consisting of 548,000 parallel sentences and approximately 9 million words per language, this dataset is derived from diverse sources such as news articles and holy texts, aiming to enhance… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: This paper is accepted and published at NeTTT 2024 Conf

  2. arXiv:2310.12667  [pdf, other

    stat.ML cs.LG

    STANLEY: Stochastic Gradient Anisotropic Langevin Dynamics for Learning Energy-Based Models

    Authors: Belhal Karimi, Jianwen Xie, ** Li

    Abstract: We propose in this paper, STANLEY, a STochastic gradient ANisotropic LangEvin dYnamics, for sampling high dimensional data. With the growing efficacy and potential of Energy-Based modeling, also known as non-normalized probabilistic modeling, for modeling a generative process of different natures of high dimensional data observations, we present an end-to-end learning algorithm for Energy-Based mo… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:1207.5938 by other authors

  3. arXiv:2210.07768  [pdf, other

    cs.IR cs.LG

    FeatureBox: Feature Engineering on GPUs for Massive-Scale Ads Systems

    Authors: Weijie Zhao, Xuewu Jiao, Xinsheng Luo, **gxue Li, Belhal Karimi, ** Li

    Abstract: Deep learning has been widely deployed for online ads systems to predict Click-Through Rate (CTR). Machine learning researchers and practitioners frequently retrain CTR models to test their new extracted features. However, the CTR model training often relies on a large number of raw input data logs. Hence, the feature extraction can take a significant proportion of the training time for an industr… ▽ More

    Submitted 25 September, 2022; originally announced October 2022.

  4. arXiv:2207.02722  [pdf, other

    stat.ML cs.AI cs.LG

    Variational Flow Graphical Model

    Authors: Shaogang Ren, Belhal Karimi, Dingcheng Li, ** Li

    Abstract: This paper introduces a novel approach to embed flow-based models with hierarchical structures. The proposed framework is named Variational Flow Graphical (VFG) Model. VFGs learn the representation of high dimensional data via a message-passing scheme by integrating flow-based functions through variational inference. By leveraging the expressive power of neural networks, VFGs produce a representat… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  5. arXiv:2205.05632  [pdf, other

    stat.ML cs.LG

    On Distributed Adaptive Optimization with Gradient Compression

    Authors: Xiaoyun Li, Belhal Karimi, ** Li

    Abstract: We study COMP-AMS, a distributed optimization framework based on gradient averaging and adaptive AMSGrad algorithm. Gradient compression with error feedback is applied to reduce the communication cost in the gradient transmission process. Our convergence analysis of COMP-AMS shows that such compressed gradient averaging strategy yields same convergence rate as standard AMSGrad, and also exhibits t… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  6. arXiv:2205.04188  [pdf, other

    cs.CV cs.MM

    Joint learning of object graph and relation graph for visual question answering

    Authors: Hao Li, Xu Li, Belhal Karimi, Jie Chen, Mingming Sun

    Abstract: Modeling visual question answering(VQA) through scene graphs can significantly improve the reasoning accuracy and interpretability. However, existing models answer poorly for complex reasoning questions with attributes or relations, which causes false attribute selection or missing relation in Figure 1(a). It is because these models cannot balance all kinds of information in scene graphs, neglecti… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 6 pages, 4 figures, Accepted by ICME 2022

  7. arXiv:2203.10186  [pdf, other

    stat.ML cs.LG

    A Class of Two-Timescale Stochastic EM Algorithms for Nonconvex Latent Variable Models

    Authors: Belhal Karimi, ** Li

    Abstract: The Expectation-Maximization (EM) algorithm is a popular choice for learning latent variable models. Variants of the EM have been initially introduced, using incremental updates to scale to large datasets, and using Monte Carlo (MC) approximations to bypass the intractable conditional expectation of the latent data for most nonconvex models. In this paper, we propose a general class of methods cal… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  8. arXiv:2110.00532  [pdf, other

    cs.LG

    Layer-wise and Dimension-wise Locally Adaptive Federated Learning

    Authors: Belhal Karimi, ** Li, Xiaoyun Li

    Abstract: In the emerging paradigm of Federated Learning (FL), large amount of clients such as mobile devices are used to train possibly high-dimensional models on their respective data. Combining (dimension-wise) adaptive gradient methods (e.g. Adam, AMSGrad) with FL has been an active direction, which is shown to outperform traditional SGD based FL in many cases. In this paper, we focus on the problem of… ▽ More

    Submitted 23 June, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

  9. arXiv:2109.03194  [pdf, ps, other

    cs.LG

    On the Convergence of Decentralized Adaptive Gradient Methods

    Authors: Xiangyi Chen, Belhal Karimi, Weijie Zhao, ** Li

    Abstract: Adaptive gradient methods including Adam, AdaGrad, and their variants have been very successful for training deep learning models, such as neural networks. Meanwhile, given the need for distributed computing, distributed optimization algorithms are rapidly becoming a focal point. With the growth of computing power and the need for using machine learning models on mobile devices, the communication… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

  10. arXiv:2008.04975  [pdf, ps, other

    stat.ML cs.DS cs.LG

    FedSKETCH: Communication-Efficient and Private Federated Learning via Sketching

    Authors: Farzin Haddadpour, Belhal Karimi, ** Li, Xiaoyun Li

    Abstract: Communication complexity and privacy are the two key challenges in Federated Learning where the goal is to perform a distributed learning through a large volume of devices. In this work, we introduce FedSKETCH and FedSKETCHGATE algorithms to address both challenges in Federated learning jointly, where these algorithms are intended to be used for homogeneous and heterogeneous data distribution sett… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

  11. arXiv:1910.12521  [pdf, other

    stat.ML cs.LG stat.ME

    On the Global Convergence of (Fast) Incremental Expectation Maximization Methods

    Authors: Belhal Karimi, Hoi-To Wai, Eric Moulines, Marc Lavielle

    Abstract: The EM algorithm is one of the most popular algorithm for inference in latent data models. The original formulation of the EM algorithm does not scale to large data set, because the whole data set is required at each iteration of the algorithm. To alleviate this problem, Neal and Hinton have proposed an incremental version of the EM (iEM) in which at each iteration the conditional expectation of t… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: 25 pages, Accepted at NeurIPS 2019

  12. arXiv:1903.01435  [pdf, other

    stat.ML cs.LG

    An Optimistic Acceleration of AMSGrad for Nonconvex Optimization

    Authors: Jun-Kun Wang, Xiaoyun Li, Belhal Karimi, ** Li

    Abstract: We propose a new variant of AMSGrad, a popular adaptive gradient based optimization algorithm widely used for training deep neural networks. Our algorithm adds prior knowledge about the sequence of consecutive mini-batch gradients and leverages its underlying structure making the gradients sequentially predictable. By exploiting the predictability and ideas from optimistic online learning, the pro… ▽ More

    Submitted 3 November, 2020; v1 submitted 4 March, 2019; originally announced March 2019.

  13. arXiv:1902.07332  [pdf, ps, other

    cs.IT

    Construction of QC-LDPC Codes with Low Error Floor by Efficient Systematic Search and Elimination of Trap** Sets

    Authors: Bashirreza Karimi, Amir. H Banihashemi

    Abstract: We propose a systematic design of protograph-based quasi-cyclic (QC) low-density parity-check (LDPC) codes with low error floor. We first characterize the trap** sets of such codes and demonstrate that the QC structure of the code eliminates some of the trap** set structures that can exist in a code with the same degree distribution and girth but lacking the QC structure. Using this characteri… ▽ More

    Submitted 17 June, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

    Comments: This paper has been submitted to IEEE Transactions on Communications on January 29, 2019

  14. arXiv:1902.00629  [pdf, ps, other

    stat.ML cs.LG math.OC

    Non-asymptotic Analysis of Biased Stochastic Approximation Scheme

    Authors: Belhal Karimi, Blazej Miasojedow, Eric Moulines, Hoi-To Wai

    Abstract: Stochastic approximation (SA) is a key method used in statistical learning. Recently, its non-asymptotic convergence analysis has been considered in many papers. However, most of the prior analyses are made under restrictive assumptions such as unbiased gradient estimates and convex objective function, which significantly limit their applications to sophisticated tasks such as online and reinforce… ▽ More

    Submitted 16 June, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: Accepted to COLT 2019; 32 pages. Minor updates in Section 3.2

  15. arXiv:1809.06244  [pdf, other

    cs.AI cs.RO

    A Virtual Testbed for Critical Incident Investigation with Autonomous Remote Aerial Vehicle Surveying, Artificial Intelligence, and Decision Support

    Authors: David L. Smyth, Sai Abinesh, Nazli B. Karimi, Brett Drury, Ihsan Ullah, Frank G. Glavin, Michael G. Madden

    Abstract: Autonomous robotics and artificial intelligence techniques can be used to support human personnel in the event of critical incidents. These incidents can pose great danger to human life. Some examples of such assistance include: multi-robot surveying of the scene; collection of sensor data and scene imagery, real-time risk assessment and analysis; object identification and anomaly detection; and r… ▽ More

    Submitted 25 January, 2019; v1 submitted 14 September, 2018; originally announced September 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1806.04497

    Journal ref: IWAISe, 2nd International Workshop on A.I. in Security, European Conference on Machine Learning 2018

  16. arXiv:1806.04497  [pdf, other

    cs.CY cs.AI

    A Virtual Environment with Multi-Robot Navigation, Analytics, and Decision Support for Critical Incident Investigation

    Authors: David L. Smyth, James Fennell, Sai Abinesh, Nazli B. Karimi, Frank G. Glavin, Ihsan Ullah, Brett Drury, Michael G. Madden

    Abstract: Accidents and attacks that involve chemical, biological, radiological/nuclear or explosive (CBRNE) substances are rare, but can be of high consequence. Since the investigation of such events is not anybody's routine work, a range of AI techniques can reduce investigators' cognitive load and support decision-making, including: planning the assessment of the scene; ongoing evaluation and updating of… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 27th International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden