Showing 1–21 of 21 results for author: Shakery, A

  1. arXiv:2304.01282  [pdf, other]

    cs.CL

    PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation

    Authors: Alireza Salemi, Amirhossein Abaskohi, Sara Tavakoli, Yadollah Yaghoobzadeh, Azadeh Shakery

    Abstract: Multilingual pre-training significantly improves many multilingual NLP tasks, including machine translation. Most existing methods are based on some variants of masked language modeling and text-denoising objectives on monolingual data. Multilingual pre-training on monolingual data ignores the availability of parallel data in many language pairs. Also, some other works integrate the available huma…

    Submitted 14 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 15 pages, 5 figures, 16 tables, 1 algorithm, LoResMT@EACL 2023

    Journal ref: https://aclanthology.org/2023.loresmt-1.3

  2. arXiv:2211.09728  [pdf, ps, other]

    cs.CL cs.LG

    Generative Adversarial Training Can Improve Neural Language Models

    Authors: Sajad Movahedi, Azadeh Shakery

    Abstract: While deep learning in the form of recurrent neural networks (RNNs) has caused a significant improvement in neural language modeling, the fact that they are extremely prone to overfitting is still a largely unresolved issue. In this paper we propose a regularization method based on generative adversarial networks (GANs) and adversarial training (AT) that can prevent overfitting in neural language…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: An extended abstract selected for poster presentation at the Eastern European Machine Learning Summer School 2019

  3. arXiv:2210.07998  [pdf, other]

    cs.LG cs.CV

    $Λ$-DARTS: Mitigating Performance Collapse by Harmonizing Operation Selection among Cells

    Authors: Sajad Movahedi, Melika Adabinejad, Ayyoob Imani, Arezou Keshavarz, Mostafa Dehghani, Azadeh Shakery, Babak N. Araabi

    Abstract: Differentiable neural architecture search (DARTS) is a popular method for neural architecture search (NAS), which performs cell-search and utilizes continuous relaxation to improve the search efficiency via gradient-based optimization. The main shortcoming of DARTS is performance collapse, where the discovered architecture suffers from a pattern of declining quality during search. Performance coll…

    Submitted 1 March, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Published as a conference paper at ICLR 2023

  4. arXiv:2109.04098  [pdf, other]

    cs.CL

    ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization

    Authors: Alireza Salemi, Emad Kebriaei, Ghazal Neisi Minaei, Azadeh Shakery

    Abstract: Abstractive text summarization is one of the areas influenced by the emergence of pre-trained language models. Current pre-training works in abstractive summarization give more points to the summaries with more words in common with the main text and pay less attention to the semantic similarity between generated sentences and the original document. We propose ARMAN, a Transformer-based encoder-dec…

    Submitted 9 September, 2021; originally announced September 2021.

  5. arXiv:2105.03775  [pdf, other]

    cs.CL cs.IR cs.IT cs.LG

    NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using the Long Document Transformer

    Authors: Hossein Basafa, Sajad Movahedi, Ali Ebrahimi, Azadeh Shakery, Heshaam Faili

    Abstract: This paper presents a technical report of our submission to the 4th task of SemEval-2021, titled: Reading Comprehension of Abstract Meaning. In this task, we want to predict the correct answer based on a question given a context. Usually, contexts are very lengthy and require a large receptive field from the model. Thus, common contextualized language models like BERT miss fine representation and…

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: 6 pages, 1 figure. Accepted in SemEval2021

  6. arXiv:2104.04770  [pdf, other]

    cs.CL

    UTNLP at SemEval-2021 Task 5: A Comparative Analysis of Toxic Span Detection using Attention-based, Named Entity Recognition, and Ensemble Models

    Authors: Alireza Salemi, Nazanin Sabri, Emad Kebriaei, Behnam Bahrak, Azadeh Shakery

    Abstract: Detecting which parts of a sentence contribute to that sentence's toxicity -- rather than providing a sentence-level verdict of hatefulness -- would increase the interpretability of models and allow human moderators to better understand the outputs of the system. This paper presents the methodology and results of our team, UTNLP, in the SemEval-2021 shared task 5 on toxic spans detection. We test multi…

    Submitted 10 April, 2021; originally announced April 2021.

  7. arXiv:2004.11045  [pdf, ps, other]

    cs.IR cs.CL

    Distilling Knowledge for Fast Retrieval-based Chat-bots

    Authors: Amir Vakili Tahami, Kamyar Ghajar, Azadeh Shakery

    Abstract: Response retrieval is a subset of neural ranking in which a model selects a suitable response from a set of candidates given a conversation history. Retrieval-based chat-bots are typically employed in information seeking conversational systems such as customer support agents. In order to make pairwise comparisons between a conversation history and a candidate response, two approaches are common: c…

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '20)

  8. arXiv:1911.02290  [pdf, other]

    cs.CL

    Enriching Conversation Context in Retrieval-based Chatbots

    Authors: Amir Vakili Tahami, Azadeh Shakery

    Abstract: Work on retrieval-based chatbots, like most sequence pair matching tasks, can be divided into Cross-encoders that perform word matching over the pair, and Bi-encoders that encode the pair separately. The former performs better; however, because candidate responses cannot be encoded offline, it is also much slower. Lately, multi-layer transformer architectures pre-trained as language models hav…

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: 8 pages, 1 figure, 3 tables

  9. arXiv:1910.12441  [pdf]

    cs.IR cs.CL cs.SI

    Online News Media Website Ranking Using User Generated Content

    Authors: Samaneh Karimi, Azadeh Shakery, Rakesh Verma

    Abstract: News media websites are important online resources that have drawn great attention from text mining researchers. The main aim of this study is to propose a framework for ranking online news websites from different viewpoints. The ranking of news websites is useful information, which can benefit many news-related tasks such as news retrieval and news recommendation. In the proposed framework, the ran…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: 35 pages, 4 Figures, 5 tables

  10. arXiv:1906.10607  [pdf, other]

    cs.IR cs.CL cs.SI

    Newswire versus Social Media for Disaster Response and Recovery

    Authors: Rakesh Verma, Samaneh Karimi, Daniel Lee, Omprakash Gnawali, Azadeh Shakery

    Abstract: In a disaster situation, first responders need to quickly acquire situational awareness and prioritize response based on the need, resources available and impact. Can they do this based on digital media such as Twitter alone, or newswire alone, or some combination of the two? We examine this question in the context of the 2015 Nepal Earthquakes. Because newswire articles are longer, effective summ…

    Submitted 25 June, 2019; originally announced June 2019.

  11. arXiv:1901.01183  [pdf, other]

    cs.CL

    Aspect Category Detection via Topic-Attention Network

    Authors: Sajad Movahedi, Erfan Ghadery, Heshaam Faili, Azadeh Shakery

    Abstract: E-commerce has started a new trend in natural language processing through sentiment analysis of user-generated reviews. Different consumers have different concerns about various aspects of a specific product or service. Aspect category detection, as a subtask of aspect-based sentiment analysis, tackles the problem of categorizing a given review sentence into a set of pre-defined aspect categor…

    Submitted 21 June, 2019; v1 submitted 4 January, 2019; originally announced January 2019.

  12. arXiv:1812.03361  [pdf, ps, other]

    cs.CL

    An Unsupervised Approach for Aspect Category Detection Using Soft Cosine Similarity Measure

    Authors: Erfan Ghadery, Sajad Movahedi, Heshaam Faili, Azadeh Shakery

    Abstract: Aspect category detection is one of the important and challenging subtasks of aspect-based sentiment analysis. Given a set of pre-defined categories, this task aims to detect categories which are indicated implicitly or explicitly in a given review sentence. Supervised machine learning approaches perform well on this subtask. Note that the performance of these methods depends on the av…

    Submitted 21 June, 2019; v1 submitted 8 December, 2018; originally announced December 2018.

  13. arXiv:1811.03569  [pdf, ps, other]

    cs.IR

    An Axiomatic Study of Query Terms Order in Ad-hoc Retrieval

    Authors: Ayyoob Imani, Amir Vakili, Ali Montazer, Azadeh Shakery

    Abstract: Classic retrieval methods use simple bag-of-words representations for queries and documents. This representation fails to capture the full semantic richness of queries and documents. More recent retrieval models have tried to overcome this deficiency by using approaches such as incorporating dependencies between query terms, using bi-gram representations of documents, proximity heuristics, and pass…

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: 7 pages, 1 figure

  14. arXiv:1811.03514  [pdf, other]

    cs.IR cs.CL

    Deep Neural Networks for Query Expansion using Word Embeddings

    Authors: Ayyoob Imani, Amir Vakili, Ali Montazer, Azadeh Shakery

    Abstract: Query expansion is a method for alleviating the vocabulary mismatch problem present in information retrieval tasks. Previous works have shown that terms selected for query expansion by traditional methods such as pseudo-relevance feedback are not always helpful to the retrieval process. In this paper, we show that this is also true for more recently proposed embedding-based query expansion methods…

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: 8 pages, 1 figure

  15. arXiv:1801.09936  [pdf]

    cs.CL

    PEYMA: A Tagged Corpus for Persian Named Entities

    Authors: Mahsa Sadat Shahshahani, Mahdi Mohseni, Azadeh Shakery, Heshaam Faili

    Abstract: The goal in the NER task is to classify proper nouns of a text into classes such as person, location, and organization. This is an important preprocessing step in many NLP tasks such as question-answering and summarization. Although many research studies have been conducted in this area in English and the state-of-the-art NER systems have reached performance higher than 90 percent in terms of…

    Submitted 30 January, 2018; originally announced January 2018.

    Comments: 2017, Signal and Data Processing Journal

  16. arXiv:1606.00615  [pdf, other]

    cs.IR

    Low-dimensional Query Projection based on Divergence Minimization Feedback Model for Ad-hoc Retrieval

    Authors: Javid Dadashkarimi, Masoud Jalili Sabet, Heshaam Faili, Azadeh Shakery

    Abstract: Low-dimensional word vectors have long been used in a wide range of applications in natural language processing. In this paper we shed light on estimating query vectors in ad-hoc retrieval where only limited information is available in the original query. Pseudo-relevance feedback (PRF) is a well-known technique for updating query language models and expanding the queries with a number of relevant te…

    Submitted 22 December, 2016; v1 submitted 2 June, 2016; originally announced June 2016.

  17. arXiv:1605.07852  [pdf, other]

    cs.IR cs.CL

    SS4MCT: A Statistical Stemmer for Morphologically Complex Texts

    Authors: Javid Dadashkarimi, Hossein Nasr Esfahani, Heshaam Faili, Azadeh Shakery

    Abstract: There have been multiple attempts to resolve various inflection matching problems in information retrieval. Stemming is a common approach to this end. Among many techniques for stemming, statistical stemming has been shown to be effective in a number of languages, particularly highly inflected languages. In this paper we propose a method for finding affixes in different positions of a word. Common…

    Submitted 20 June, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

  18. arXiv:1605.07844  [pdf, other]

    cs.IR cs.AI cs.CL

    Dimension Projection among Languages based on Pseudo-relevant Documents for Query Translation

    Authors: Javid Dadashkarimi, Mahsa S. Shahshahani, Amirhossein Tebbifakhr, Heshaam Faili, Azadeh Shakery

    Abstract: Using top-ranked documents in response to a query has been shown to be an effective approach to improve the quality of query translation in dictionary-based cross-language information retrieval. In this paper, we propose a new method for dictionary-based query translation based on dimension projection of embedded vectors from the pseudo-relevant documents in the source language to their equivalent…

    Submitted 8 October, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

  19. Regression and Learning to Rank Aggregation for User Engagement Evaluation

    Authors: Hamed Zamani, Azadeh Shakery, Pooya Moradi

    Abstract: User engagement refers to the amount of interaction an instance (e.g., tweet, news, and forum post) achieves. Ranking the items in social media websites based on the amount of user participation in them can be used in different applications, such as recommender systems. In this paper, we consider a tweet containing a rating for a movie as an instance and focus on ranking the instances of each use…

    Submitted 29 January, 2015; originally announced January 2015.

    Comments: In Proceedings of the 2014 ACM Recommender Systems Challenge, RecSysChallenge '14

    ACM Class: H.2.8; J.4

  20. arXiv:1411.1006  [pdf, other]

    cs.IR cs.CL

    A Probabilistic Translation Method for Dictionary-based Cross-lingual Information Retrieval in Agglutinative Languages

    Authors: Javid Dadashkarimi, Azadeh Shakery, Heshaam Faili

    Abstract: Translation ambiguity, out-of-vocabulary words, and translations missing from bilingual dictionaries make dictionary-based Cross-language Information Retrieval (CLIR) a challenging task. Moreover, in agglutinative languages which do not have reliable stemmers, missing various lexical formations in bilingual dictionaries degrades CLIR performance. This paper aims to introduce a probabilistic tran…

    Submitted 5 November, 2014; v1 submitted 4 November, 2014; originally announced November 2014.

    Comments: The 3rd conference of Computational Linguistic, Sharif University of Technology, November 2014

  21. arXiv:1405.5447  [pdf]

    cs.IR cs.CL

    Learning to Exploit Different Translation Resources for Cross Language Information Retrieval

    Authors: Hosein Azarbonyad, Azadeh Shakery, Heshaam Faili

    Abstract: One of the important factors that affect the performance of Cross Language Information Retrieval (CLIR) is the quality of translations being employed in CLIR. In order to improve the quality of translations, it is important to exploit available resources efficiently. Employing different translation resources with different characteristics has many challenges. In this paper, we propose a method for…

    Submitted 20 May, 2014; originally announced May 2014.

    Journal ref: International Journal of Information and Communication Technology Research, Volume 6, Issue 1, pp. 55-68, 2013