Skip to main content

Showing 1–15 of 15 results for author: Salemi, A

.
  1. Properties of core-EP matrices and binary relationships

    Authors: Ehsan Kheirandish, Abbas Salemi, Néstor Thome

    Abstract: In this paper, various properties of core-EP matrices are investigated. We introduce the MPDMP matrix associated with $A$ and by means of it, some properties and equivalent conditions of core-EP matrices can be obtained. Also, properties of MPD, DMP, and CMP inverses are studied and we prove that in the class of core-EP matrices, DMP, MPD, and Drazin inverses are the same. Moreover, DMP and MPD bi… ▽ More

    Submitted 6 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 20 pages

    MSC Class: 15A09; 15A45

  2. arXiv:2405.00175  [pdf, other

    cs.CL cs.IR

    Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language Models

    Authors: Alireza Salemi, Hamed Zamani

    Abstract: This paper introduces uRAG--a framework with a unified retrieval engine that serves multiple downstream retrieval-augmented generation (RAG) systems. Each RAG system consumes the retrieval results for a unique purpose, such as open-domain question answering, fact verification, entity linking, and relation extraction. We introduce a generic training guideline that standardizes the communication bet… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  3. arXiv:2404.13781  [pdf, other

    cs.CL cs.IR

    Evaluating Retrieval Quality in Retrieval-Augmented Generation

    Authors: Alireza Salemi, Hamed Zamani

    Abstract: Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for retrieval models within these systems. Traditional end-to-end evaluation methods are computationally expensive. Furthermore, evaluation of the retrieval model's performance based on query-document relevance labels shows a small correlation with the RAG system's downstream performance. We propose a novel evaluatio… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  4. arXiv:2404.05970  [pdf, other

    cs.CL cs.IR

    Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation

    Authors: Alireza Salemi, Surya Kallumadi, Hamed Zamani

    Abstract: This paper studies retrieval-augmented approaches for personalizing large language models (LLMs), which potentially have a substantial impact on various applications and domains. We propose the first attempt to optimize the retrieval models that deliver a limited number of personal documents to large language models for the purpose of personalized generation. We develop two optimization algorithms… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  5. arXiv:2401.06466  [pdf, other

    cs.CL cs.AI

    PersianMind: A Cross-Lingual Persian-English Large Language Model

    Authors: Pedram Rostami, Ali Salemi, Mohammad Javad Dousti

    Abstract: Large language models demonstrate remarkable proficiency in various linguistic tasks and have extensive knowledge across various domains. Although they perform best in English, their ability in other languages is notable too. In contrast, open-source models, such as LLaMa, are primarily trained on English datasets, resulting in poor performance in non-English languages. In this paper, we introduce… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  6. arXiv:2306.16478  [pdf, other

    cs.IR cs.CL cs.CV

    Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering

    Authors: Alireza Salemi, Mahta Rafiee, Hamed Zamani

    Abstract: This paper studies a category of visual question answering tasks, in which accessing external knowledge is necessary for answering the questions. This category is called outside-knowledge visual question answering (OK-VQA). A major step in develo** OK-VQA systems is to retrieve relevant documents for the given multi-modal query. Current state-of-the-art asymmetric dense retrieval model for this… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  7. arXiv:2304.13649  [pdf, other

    cs.CV cs.CL cs.IR

    A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering

    Authors: Alireza Salemi, Juan Altmayer Pizzorno, Hamed Zamani

    Abstract: Knowledge-Intensive Visual Question Answering (KI-VQA) refers to answering a question about an image whose answer does not lie in the image. This paper presents a new pipeline for KI-VQA tasks, consisting of a retriever and a reader. First, we introduce DEDR, a symmetric dual encoding dense retrieval framework in which documents and queries are encoded into a shared embedding space using uni-modal… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  8. arXiv:2304.11406  [pdf, other

    cs.CL

    LaMP: When Large Language Models Meet Personalization

    Authors: Alireza Salemi, Sheshera Mysore, Michael Bendersky, Hamed Zamani

    Abstract: This paper highlights the importance of personalization in large language models and introduces the LaMP benchmark -- a novel benchmark for training and evaluating language models for producing personalized outputs. LaMP offers a comprehensive evaluation framework with diverse language tasks and multiple entries for each user profile. It consists of seven personalized tasks, spanning three text cl… ▽ More

    Submitted 4 June, 2024; v1 submitted 22 April, 2023; originally announced April 2023.

  9. arXiv:2304.01282  [pdf, other

    cs.CL

    PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation

    Authors: Alireza Salemi, Amirhossein Abaskohi, Sara Tavakoli, Yadollah Yaghoobzadeh, Azadeh Shakery

    Abstract: Multilingual pre-training significantly improves many multilingual NLP tasks, including machine translation. Most existing methods are based on some variants of masked language modeling and text-denoising objectives on monolingual data. Multilingual pre-training on monolingual data ignores the availability of parallel data in many language pairs. Also, some other works integrate the available huma… ▽ More

    Submitted 14 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 15 pages, 5 figures, 16 tables, 1 algorithm, LoResMT@EACL 2023

    Journal ref: https://aclanthology.org/2023.loresmt-1.3

  10. arXiv:2207.13477  [pdf, ps, other

    math.NA

    An improved bound on Legendre approximation

    Authors: M. Hamzehnejad, M. M. Hosseini, A. Salemi

    Abstract: In this paper, new relations between the derivatives of the Legendre polynomials are obtained, and by these relations, new upper bounds for the Legendre coefficients of differentiable functions are presented. These upper bounds are sharp and cover more categories of differentiable functions. Moreover, new and sharper bounds for the approximation error of the partial sums of Legendre polynomials ar… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: The paper contains 15 pages

    MSC Class: 41A25

  11. arXiv:2112.13430  [pdf, other

    cs.CR

    IoT Analytics and Blockchain

    Authors: Abbas Saleminezhadl, Manuel Remmele, Ravikumar Chaudhari, Rasha Kashef

    Abstract: The Internet of Things (IoT) is revolutionizing human life with the idea of interconnecting everyday used devices (Things) and making them smart. By establishing a communication network between devices, the IoT system aids in automating tasks and making them efficient and powerful. The sensors and the physical world, connected over a network, involve a massive amount of data. The data collection a… ▽ More

    Submitted 26 December, 2021; originally announced December 2021.

  12. arXiv:2109.04098  [pdf, other

    cs.CL

    ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization

    Authors: Alireza Salemi, Emad Kebriaei, Ghazal Neisi Minaei, Azadeh Shakery

    Abstract: Abstractive text summarization is one of the areas influenced by the emergence of pre-trained language models. Current pre-training works in abstractive summarization give more points to the summaries with more words in common with the main text and pay less attention to the semantic similarity between generated sentences and the original document. We propose ARMAN, a Transformer-based encoder-dec… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

  13. arXiv:2104.04770  [pdf, other

    cs.CL

    UTNLP at SemEval-2021 Task 5: A Comparative Analysis of Toxic Span Detection using Attention-based, Named Entity Recognition, and Ensemble Models

    Authors: Alireza Salemi, Nazanin Sabri, Emad Kebriaei, Behnam Bahrak, Azadeh Shakery

    Abstract: Detecting which parts of a sentence contribute to that sentence's toxicity -- rather than providing a sentence-level verdict of hatefulness -- would increase the interpretability of models and allow human moderators to better understand the outputs of the system. This paper presents our team's, UTNLP, methodology and results in the SemEval-2021 shared task 5 on toxic spans detection. We test multi… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

  14. DISCERN: Diversity-based Selection of Centroids for k-Estimation and Rapid Non-stochastic Clustering

    Authors: Ali Hassani, Amir Iranmanesh, Mahdi Eftekhari, Abbas Salemi

    Abstract: One of the applications of center-based clustering algorithms such as K-Means is partitioning data points into K clusters. In some examples, the feature space relates to the underlying problem we are trying to solve, and sometimes we can obtain a suitable feature space. Nevertheless, while K-Means is one of the most efficient offline clustering algorithms, it is not equipped to estimate the number… ▽ More

    Submitted 22 September, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: Int. J. Mach. Learn. & Cyber. (2020)

  15. arXiv:0807.5094  [pdf, ps, other

    math.FA

    The structure of strong linear preservers of gw-majorization on $ \mathbf{M}_{{n,m}

    Authors: A. Armandnejad, A. Salemi

    Abstract: Let M_{n,m} be the set of all n by m matrices with entries in F, where F is the field of real or complex numbers. A matrix R in M_{n} with the property Re=e, is said to be a g-row stochastic (generalized row stochastic) matrix. Let A,B in M_{n,m}, so B is said to be gw-majorized by A if there exists an n by n g-row stochastic matrix R such that B=RA. In this paper we characterize all linear oper… ▽ More

    Submitted 31 July, 2008; originally announced July 2008.

    Comments: 7 pages

    MSC Class: 15A03; 15A04; 15A51