Skip to main content

Showing 1–28 of 28 results for author: Karimi, R

.
  1. arXiv:2407.05189  [pdf

    cs.CL

    Enhancing Language Learning through Technology: Introducing a New English-Azerbaijani (Arabic Script) Parallel Corpus

    Authors: Jalil Nourmohammadi Khiarak, Ammar Ahmadi, Taher Ak-bari Saeed, Meysam Asgari-Chenaghlu, Toğrul Atabay, Mohammad Reza Baghban Karimi, Ismail Ceferli, Farzad Hasanvand, Seyed Mahboub Mousavi, Morteza Noshad

    Abstract: This paper introduces a pioneering English-Azerbaijani (Arabic Script) parallel corpus, designed to bridge the technological gap in language learning and machine translation (MT) for under-resourced languages. Consisting of 548,000 parallel sentences and approximately 9 million words per language, this dataset is derived from diverse sources such as news articles and holy texts, aiming to enhance… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: This paper is accepted and published at NeTTT 2024 Conf

  2. arXiv:2402.09603  [pdf, other

    cs.LG cs.AI

    Scalable Graph Self-Supervised Learning

    Authors: Ali Saheb Pasand, Reza Moravej, Mahdi Biparva, Raika Karimi, Ali Ghodsi

    Abstract: In regularization Self-Supervised Learning (SSL) methods for graphs, computational complexity increases with the number of nodes in graphs and embedding dimensions. To mitigate the scalability of non-contrastive graph SSL, we propose a novel approach to reduce the cost of computing the covariance matrix for the pre-training loss function with volume-maximization terms. Our work focuses on reducing… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  3. arXiv:2402.05944  [pdf, other

    cs.LG

    Todyformer: Towards Holistic Dynamic Graph Transformers with Structure-Aware Tokenization

    Authors: Mahdi Biparva, Raika Karimi, Faezeh Faez, Yingxue Zhang

    Abstract: Temporal Graph Neural Networks have garnered substantial attention for their capacity to model evolving structural and temporal patterns while exhibiting impressive performance. However, it is known that these architectures are encumbered by issues that constrain their performance, such as over-squashing and over-smoothing. Meanwhile, Transformers have demonstrated exceptional computational capaci… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  4. arXiv:2311.16706  [pdf, ps, other

    cs.LG math.PR stat.ML

    Sinkhorn Flow: A Continuous-Time Framework for Understanding and Generalizing the Sinkhorn Algorithm

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Andreas Krause

    Abstract: Many problems in machine learning can be formulated as solving entropy-regularized optimal transport on the space of probability measures. The canonical approach involves the Sinkhorn iterates, renowned for their rich mathematical properties. Recently, the Sinkhorn algorithm has been recast within the mirror descent framework, thus benefiting from classical optimization theory insights. Here, we b… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  5. arXiv:2311.02374  [pdf, other

    math.OC cs.LG

    Riemannian stochastic optimization methods avoid strict saddle points

    Authors: Ya-** Hsieh, Mohammad Reza Karimi, Andreas Krause, Panayotis Mertikopoulos

    Abstract: Many modern machine learning applications - from online principal component analysis to covariance matrix identification and dictionary learning - can be formulated as minimization problems on Riemannian manifolds, and are typically solved with a Riemannian stochastic gradient method (or some variant thereof). However, in many cases of interest, the resulting minimization problem is not geodesical… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 3 figures

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C48

  6. arXiv:2310.02862  [pdf, other

    cs.LG cs.AI eess.SP

    A novel asymmetrical autoencoder with a sparsifying discrete cosine Stockwell transform layer for gearbox sensor data compression

    Authors: Xin Zhu, Daoguang Yang, Hongyi Pan, Hamid Reza Karimi, Didem Ozevin, Ahmet Enis Cetin

    Abstract: The lack of an efficient compression model remains a challenge for the wireless transmission of gearbox data in non-contact gear fault diagnosis problems. In this paper, we present a signal-adaptive asymmetrical autoencoder with a transform domain layer to compress sensor signals. First, a new discrete cosine Stockwell transform (DCST) layer is introduced to replace linear layers in a multi-layer… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  7. arXiv:2212.06375  [pdf, ps, other

    nucl-th astro-ph.HE hep-ph

    Hybrid stars within the framework of the Sigma-Omega-Rho model combined with the MIT and NJL models

    Authors: Reza Karimi, H. R. Moshfegh

    Abstract: In this paper, we investigate the structure of hybrid stars consisting of hadrons (neutrons, protons, sigmas, lambdas), leptons (electrons, muons), and quarks (up, down, strange). We use a relativistic mean-field (RMF) model namely the Sigma-omega-rho model for the hadronic phase and the MIT bag model as well as the NJL model for the quark phase. In addition, Maxwell and Gibbs conditions are emplo… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 23 pages, 10 figures

  8. arXiv:2211.01689  [pdf, other

    stat.ML cs.LG

    Isotropic Gaussian Processes on Finite Spaces of Graphs

    Authors: Viacheslav Borovitskiy, Mohammad Reza Karimi, Vignesh Ram Somnath, Andreas Krause

    Abstract: We propose a principled way to define Gaussian process priors on various sets of unweighted graphs: directed or undirected, with or without loops. We endow each of these sets with a geometric structure, inducing the notions of closeness and symmetries, by turning them into a vertex set of an appropriate metagraph. Building on this, we describe the class of priors that respect this structure and ar… ▽ More

    Submitted 25 February, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  9. arXiv:2210.13867  [pdf, ps, other

    cs.LG math.PR math.ST

    A Dynamical System View of Langevin-Based Non-Convex Sampling

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Andreas Krause

    Abstract: Non-convex sampling is a key challenge in machine learning, central to non-convex optimization in deep learning as well as to approximate probabilistic inference. Despite its significance, theoretically there remain many important challenges: Existing guarantees (1) typically only hold for the averaged iterates rather than the more desirable last iterates, (2) lack convergence metrics that capture… ▽ More

    Submitted 13 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: typos corrected, references added

    MSC Class: 62D05

  10. arXiv:2206.06795  [pdf, other

    math.OC cs.LG math.DS

    Riemannian stochastic approximation algorithms

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Panayotis Mertikopoulos, Andreas Krause

    Abstract: We examine a wide class of stochastic approximation algorithms for solving (stochastic) nonlinear problems on Riemannian manifolds. Such algorithms arise naturally in the study of Riemannian optimization, game theory and optimal transport, but their behavior is much less understood compared to the Euclidean case because of the lack of a global linear structure on the manifold. We overcome this dif… ▽ More

    Submitted 27 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 33 pages, 2 figures; a one-page abstract of this paper was presented in COLT 2022

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C47; 90C48

  11. arXiv:2204.01172  [pdf, other

    cs.CL

    PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models

    Authors: Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, Marzieh Saeidi, Lambert Mathias, Veselin Stoyanov, Majid Yazdani

    Abstract: Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score. In this work, we propose PERFECT, a simple and efficient method for few-shot fine-tuning of PLMs without relying on any such handcrafting, which is highly effective given as few as… ▽ More

    Submitted 25 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: ACL, 2022

  12. The First Photometric Study of AH Mic Contact Binary System

    Authors: Atila Poro, Mark G. Blackford, Selda Ranjbar Salehian, Esfandiar Jahangiri, Meysam Samiei Dastjerdi, Mohammadjavad Gozarandi, Reihaneh Karimi, Tabassom Madayen, Elnaz Bakhshi, Farnad Hedayati

    Abstract: The first multi-color light curve analysis of the AH Mic binary system is presented. This system has very few past observations from the southern hemisphere. We extracted the minima times from the light curves based on the Markov Chain Monte Carlo (MCMC) approach and obtained a new ephemeris. To provide modern photometric light curve solutions, we used the Physics of Eclipsing Binaries (Phoebe) so… ▽ More

    Submitted 5 April, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: 6 figures, 4 tables, accepted by the Research in Astronomy and Astrophysics (RAA) journal

  13. arXiv:2201.00283  [pdf, other

    eess.SP cs.HC

    DF-SSmVEP: Dual Frequency Aggregated Steady-State Motion Visual Evoked Potential Design with Bifold Canonical Correlation Analysis

    Authors: Raika Karimi, Arash Mohammadi, Amir Asif, Habib Benali

    Abstract: Recent advancements in Electroencephalography (EEG) sensor technologies and signal processing algorithms have paved the way for further evolution of Brain Computer Interfaces (BCI). When it comes to Signal Processing (SP) for BCI, there has been a surge of interest on Steady-State motion-Visual Evoked Potentials (SSmVEP), where motion stimulation is utilized to address key issues associated with c… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

  14. arXiv:2107.05151  [pdf, other

    cs.AI

    Document Embedding for Scientific Articles: Efficacy of Word Embeddings vs TFIDF

    Authors: H. J. Meijer, J. Truong, R. Karimi

    Abstract: Over the last few years, neural network derived word embeddings became popular in the natural language processing literature. Studies conducted have mostly focused on the quality and application of word embeddings trained on public available corpuses such as Wikipedia or other news and social media sources. However, these studies are limited to generic text and thus lack technical and scientific n… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

  15. arXiv:2106.04647  [pdf, other

    cs.CL

    Compacter: Efficient Low-Rank Hypercomplex Adapter Layers

    Authors: Rabeeh Karimi Mahabadi, James Henderson, Sebastian Ruder

    Abstract: Adapting large-scale pretrained language models to downstream tasks via fine-tuning is the standard method for achieving state-of-the-art performance on NLP benchmarks. However, fine-tuning all weights of models with millions or billions of parameters is sample-inefficient, unstable in low-resource settings, and wasteful as it requires storing a separate copy of the model for each task. Recent wor… ▽ More

    Submitted 27 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: accepted in NeurIPS, 2021

  16. arXiv:2106.04489  [pdf, other

    cs.CL

    Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks

    Authors: Rabeeh Karimi Mahabadi, Sebastian Ruder, Mostafa Dehghani, James Henderson

    Abstract: State-of-the-art parameter-efficient fine-tuning methods rely on introducing adapter modules between the layers of a pretrained language model. However, such modules are trained separately for each task and thus do not enable sharing information across tasks. In this paper, we show that we can learn adapter parameters for all layers and tasks by generating them using shared hypernetworks, which co… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: accepted in ACL, 2021

  17. arXiv:2010.09818  [pdf, other

    cs.LG stat.ML

    Online Active Model Selection for Pre-trained Classifiers

    Authors: Mohammad Reza Karimi, Nezihe Merve Gürel, Bojan Karlaš, Johannes Rausch, Ce Zhang, Andreas Krause

    Abstract: Given $k$ pre-trained classifiers and a stream of unlabeled data examples, how can we actively decide when to query a label so that we can distinguish the best model from the rest while making a small number of queries? Answering this question has a profound impact on a range of practical scenarios. In this work, we design an online selective sampling approach that actively selects informative exa… ▽ More

    Submitted 17 April, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

  18. arXiv:2006.02464  [pdf, other

    cs.DC cs.LG

    Serving DNNs like Clockwork: Performance Predictability from the Bottom Up

    Authors: Arpan Gujarati, Reza Karimi, Safya Alzayat, Wei Hao, Antoine Kaufmann, Ymir Vigfusson, Jonathan Mace

    Abstract: Machine learning inference is becoming a core building block for interactive web applications. As a result, the underlying model serving systems on which these applications depend must consistently meet low latency targets. Existing model serving architectures use well-known reactive techniques to alleviate common-case sources of latency, but cannot effectively curtail tail latency caused by unpre… ▽ More

    Submitted 26 October, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: In Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI '20)

  19. Bandwidth-Aware Page Placement in NUMA

    Authors: David Gureya, João Neto, Reza Karimi, João Barreto, Pramod Bhatotia, Vivien Quema, Rodrigo Rodrigues, Paolo Romano, Vladimir Vlassov

    Abstract: Page placement is a critical problem for memoryintensive applications running on a shared-memory multiprocessor with a non-uniform memory access (NUMA) architecture. State-of-the-art page placement mechanisms interleave pages evenly across NUMA nodes. However, this approach fails to maximize memory throughput in modern NUMA systems, characterised by asymmetric bandwidths and latencies, and sensiti… ▽ More

    Submitted 19 May, 2023; v1 submitted 6 March, 2020; originally announced March 2020.

    Journal ref: 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), New Orleans, LA, USA, 2020 pp. 546-556

  20. arXiv:2001.04198  [pdf, ps, other

    eess.SY

    Predefined-time Terminal Sliding Mode Control of Robot Manipulators

    Authors: Chang-Duo Liang, Ming-Feng Ge, Zhi-Wei Liu, Yan-Wu Wang, Hamid Reza Karimi

    Abstract: In this paper, we present a new terminal sliding mode control to achieve predefined-time stability of robot manipulators. The proposed control is developed based on a novel predefined-time terminal sliding mode (PTSM) surface, on which the states are forced to reach the origin in a predefined time, i.e., the settling time is independent to the initial condition and can be explicitly user-defined v… ▽ More

    Submitted 25 April, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 10 pages, 9 figures, This draft is not intended for publication

  21. arXiv:1909.06321  [pdf, other

    cs.CL

    End-to-End Bias Mitigation by Modelling Biases in Corpora

    Authors: Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

    Abstract: Several recent studies have shown that strong natural language understanding (NLU) models are prone to relying on unwanted dataset biases without learning the underlying task, resulting in models that fail to generalize to out-of-domain datasets and are likely to perform poorly in real-world scenarios. We propose two learning strategies to train neural models, which are more robust to such biases… ▽ More

    Submitted 23 April, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: Accepted in ACL 2020 as a long paper

  22. arXiv:1711.01566  [pdf, other

    cs.LG cs.DM stat.ML

    Stochastic Submodular Maximization: The Case of Coverage Functions

    Authors: Mohammad Reza Karimi, Mario Lucic, Hamed Hassani, Andreas Krause

    Abstract: Stochastic optimization of continuous objectives is at the heart of modern machine learning. However, many important problems are of discrete nature and often involve submodular objectives. We seek to unleash the power of stochastic continuous optimization, namely stochastic gradient descent and its variants, to such discrete problems. We first introduce the problem of stochastic submodular optimi… ▽ More

    Submitted 5 November, 2017; originally announced November 2017.

    Comments: 31st Conference on Neural Information Processing Systems (NIPS 2017)

  23. arXiv:1705.07400  [pdf, other

    cs.PF cs.DC cs.OS

    MITHRIL: Mining Sporadic Associations for Cache Prefetching

    Authors: Juncheng Yang, Reza Karimi, Trausti Sæmundsson, Avani Wildani, Ymir Vigfusson

    Abstract: The growing pressure on cloud application scalability has accentuated storage performance as a critical bottle- neck. Although cache replacement algorithms have been extensively studied, cache prefetching - reducing latency by retrieving items before they are actually requested remains an underexplored area. Existing approaches to history-based prefetching, in particular, provide too few benefits… ▽ More

    Submitted 21 May, 2017; originally announced May 2017.

  24. arXiv:1608.01391  [pdf

    cs.CV

    Language free character recognition using character sketch and center of gravity shifting

    Authors: Masoud Nosrati, Fakhereh Rahimi, Ronak Karimi

    Abstract: In this research, we present a heuristic method for character recognition. For this purpose, a sketch is constructed from the image that contains the character to be recognized. This sketch contains the most important pixels of image that are representatives of original image. These points are the most probable points in pixel-by-pixel matching of image that adapt to target image. Furthermore, a t… ▽ More

    Submitted 3 August, 2016; originally announced August 2016.

    Comments: World Applied Programming, Vol (6), Issue (2), July 2016

  25. arXiv:1606.08789  [pdf

    physics.chem-ph physics.atm-clus

    Ultrafast molecular dynamics of dissociative ionization in OCS probed by soft X-ray synchrotron radiation

    Authors: Ali Ramadhan, Benji Wales, Isabelle Gauthier, Reza Karimi, Michael MacDonald, Lucia Zuin, Joe Sanderson

    Abstract: Soft X-rays (90-173 eV) from the 3rd generation Canadian Light Source have been used in conjunction with a multi coincidence time and position sensitive detection apparatus to observe the dissociative ionization of OCS. By varying the X-ray energy we can compare dynamics from direct and Auger ionization processes, and access ionization channels which result in two or three body breakup, from 2+ to… ▽ More

    Submitted 15 August, 2016; v1 submitted 28 June, 2016; originally announced June 2016.

    Comments: 24 pages, 8 figures, 1 table, 77 references

    Journal ref: J. Phys. B: At. Mol. Opt. Phys. 49 215602 (2016)

  26. arXiv:1605.06855  [pdf, other

    cs.SI cs.LG stat.ML

    Smart broadcasting: Do you want to be seen?

    Authors: Mohammad Reza Karimi, Erfan Tavakoli, Mehrdad Farajtabar, Le Song, Manuel Gomez-Rodriguez

    Abstract: Many users in online social networks are constantly trying to gain attention from their followers by broadcasting posts to them. These broadcasters are likely to gain greater attention if their posts can remain visible for a longer period of time among their followers' most recent feeds. Then when to post? In this paper, we study the problem of smart broadcasting using the framework of temporal po… ▽ More

    Submitted 22 May, 2016; originally announced May 2016.

    Comments: To appear in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco (CA, USA), 2016

  27. Latency Optimization for Resource Allocation in Cloud Computing System

    Authors: Masoud Nosrati, Abdolah Chalechale, Ronak Karimi

    Abstract: Recent studies in different fields of science caused emergence of needs for high performance computing systems like Cloud. A critical issue in design and implementation of such systems is resource allocation which is directly affected by internal and external factors like the number of nodes, geographical distance and communication latencies. Many optimizations took place in resource allocation me… ▽ More

    Submitted 4 May, 2016; originally announced May 2016.

    Comments: 12 pages, 5 figures, In proceeding of ICCSA 2015, published by Springer LNCS

  28. arXiv:1111.6539  [pdf

    cs.NI

    Secure Geographic Routing Protocols: Issues and Approaches

    Authors: Mehdi sookhak, Ramin Karimi, Mahboobeh Haghparast, Ismail Fauzi ISnin

    Abstract: In the years, routing protocols in wireless sensor networks (WSN) have been substantially investigated by researches. Most state-of-the-art surveys have focused on reviewing of wireless sensor network .In this paper we review the existing secure geographic routing protocols for wireless sensor network (WSN) and also provide a qualitative comparison of them.

    Submitted 28 November, 2011; originally announced November 2011.

    Comments: 8 pages

    Journal ref: International Journal of Computer Science Issues 8(4): 382-389 (2011)