Showing 1–20 of 20 results for author: Faili, H

  1. arXiv:2401.00811  [pdf, other]

    cs.CL cs.HC

    PerSHOP -- A Persian dataset for shopping dialogue systems modeling

    Authors: Keyvan Mahmoudi, Heshaam Faili

    Abstract: Nowadays, dialogue systems are used in many fields of industry and research. There are successful instances of these systems, such as Apple Siri, Google Assistant, and IBM Watson. Task-oriented dialogue system is a category of these, that are used in specific tasks. They can perform tasks such as booking plane tickets or making restaurant reservations. Shopping is one of the most popular areas on…

    Submitted 1 January, 2024; originally announced January 2024.

  2. arXiv:2305.11731  [pdf]

    cs.CL cs.AI

    Persian Typographical Error Type Detection Using Deep Neural Networks on Algorithmically-Generated Misspellings

    Authors: Mohammad Dehghani, Heshaam Faili

    Abstract: Spelling correction is a remarkable challenge in the field of natural language processing. The objective of spelling correction tasks is to recognize and rectify spelling errors automatically. The development of applications that can effectually diagnose and correct Persian spelling and grammatical errors has become more important in order to improve the quality of Persian text. The Typographical…

    Submitted 5 May, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  3. arXiv:2208.00463  [pdf, other]

    cs.CL

    Mismatching-Aware Unsupervised Translation Quality Estimation For Low-Resource Languages

    Authors: Fatemeh Azadi, Heshaam Faili, Mohammad Javad Dousti

    Abstract: Translation Quality Estimation (QE) is the task of predicting the quality of machine translation (MT) output without any reference. This task has gained increasing attention as an important component in the practical applications of MT. In this paper, we first propose XLMRScore, which is a cross-lingual counterpart of BERTScore computed via the XLM-RoBERTa (XLMR) model. This metric can be used as…

    Submitted 3 March, 2024; v1 submitted 31 July, 2022; originally announced August 2022.

    Comments: Submitted to Language Resources and Evaluation

  4. arXiv:2203.06778  [pdf, other]

    cs.CL

    Pruned Graph Neural Network for Short Story Ordering

    Authors: Melika Golestani, Zeinab Borhanifard, Farnaz Tahmasebian, Heshaam Faili

    Abstract: Text coherence is a fundamental problem in natural language generation and understanding. Organizing sentences into an order that maximizes coherence is known as sentence ordering. This paper is proposing a new approach based on the graph neural network approach to encode a set of sentences and learn orderings of short stories. We propose a new method for constructing sentence-entity graphs of sho…

    Submitted 13 March, 2022; originally announced March 2022.

  5. arXiv:2112.13238  [pdf, other]

    cs.CL

    PerCQA: Persian Community Question Answering Dataset

    Authors: Naghme Jamali, Yadollah Yaghoobzadeh, Hesham Faili

    Abstract: Community Question Answering (CQA) forums provide answers for many real-life questions. Thanks to the large size, these forums are very popular among machine learning researchers. Automatic answer selection, answer ranking, question retrieval, expert finding, and fact-checking are example learning tasks performed using CQA data. In this paper, we present PerCQA, the first Persian dataset for CQA.…

    Submitted 25 December, 2021; originally announced December 2021.

  6. arXiv:2108.11994  [pdf, other]

    cs.CL cs.AI

    A New Sentence Ordering Method Using BERT Pretrained Model

    Authors: Melika Golestani, Seyedeh Zahra Razavi, Heshaam Faili

    Abstract: Building systems with capability of natural language understanding (NLU) has been one of the oldest areas of AI. An essential component of NLU is to detect logical succession of events contained in a text. The task of sentence ordering is proposed to learn succession of events with applications in AI tasks. The performance of previous works employing statistical methods is poor, while the neural n…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 7 pages, 4 figures, 2020 11th International Conference on Information and Knowledge Technology (IKT)

  7. arXiv:2108.10986  [pdf, other]

    cs.CL

    Using BERT Encoding and Sentence-Level Language Model for Sentence Ordering

    Authors: Melika Golestani, Seyedeh Zahra Razavi, Zeinab Borhanifard, Farnaz Tahmasebian, Hesham Faili

    Abstract: Discovering the logical sequence of events is one of the cornerstones in Natural Language Understanding. One approach to learn the sequence of events is to study the order of sentences in a coherent text. Sentence ordering can be applied in various tasks such as retrieval-based Question Answering, document summarization, storytelling, text generation, and dialogue systems. Furthermore, we can lear…

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: 12 pages, 2 figures, The 24th International Conference of Text, Speech and Dialogue (TSD2021)

  8. arXiv:2105.03775  [pdf, other]

    cs.CL cs.IR cs.IT cs.LG

    NLP-IIS@UT at SemEval-2021 Task 4: Machine Reading Comprehension using the Long Document Transformer

    Authors: Hossein Basafa, Sajad Movahedi, Ali Ebrahimi, Azadeh Shakery, Heshaam Faili

    Abstract: This paper presents a technical report of our submission to the 4th task of SemEval-2021, titled: Reading Comprehension of Abstract Meaning. In this task, we want to predict the correct answer based on a question given a context. Usually, contexts are very lengthy and require a large receptive field from the model. Thus, common contextualized language models like BERT miss fine representation and…

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: 6 pages, 1 figure. Accepted in SemEval2021

  9. arXiv:2003.10816  [pdf, other]

    cs.CL

    Cross-Lingual Adaptation Using Universal Dependencies

    Authors: Nasrin Taghizadeh, Heshaam Faili

    Abstract: We describe a cross-lingual adaptation method based on syntactic parse trees obtained from the Universal Dependencies (UD), which are consistent across languages, to develop classifiers in low-resource languages. The idea of UD parsing is to capture similarities as well as idiosyncrasies among typologically different languages. In this paper, we show that models trained using UD parse trees for co…

    Submitted 28 March, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

  10. arXiv:2003.09029  [pdf, other]

    cs.CL

    NSURL-2019 Task 7: Named Entity Recognition (NER) in Farsi

    Authors: Nasrin Taghizadeh, Zeinab Borhanifard, Melika GolestaniPour, Heshaam Faili

    Abstract: NSURL-2019 Task 7 focuses on Named Entity Recognition (NER) in Farsi. This task was chosen to compare different approaches to find phrases that specify Named Entities in Farsi texts, and to establish a standard testbed for future researches on this task in Farsi. This paper describes the process of making training and test data, a list of participating teams (6 teams), and evaluation results of th…

    Submitted 19 March, 2020; originally announced March 2020.

  11. arXiv:1901.01183  [pdf, other]

    cs.CL

    Aspect Category Detection via Topic-Attention Network

    Authors: Sajad Movahedi, Erfan Ghadery, Heshaam Faili, Azadeh Shakery

    Abstract: The e-commerce has started a new trend in natural language processing through sentiment analysis of user-generated reviews. Different consumers have different concerns about various aspects of a specific product or service. Aspect category detection, as a subtask of aspect-based sentiment analysis, tackles the problem of categorizing a given review sentence into a set of pre-defined aspect categor…

    Submitted 21 June, 2019; v1 submitted 4 January, 2019; originally announced January 2019.

  12. arXiv:1812.03361  [pdf, ps, other]

    cs.CL

    An Unsupervised Approach for Aspect Category Detection Using Soft Cosine Similarity Measure

    Authors: Erfan Ghadery, Sajad Movahedi, Heshaam Faili, Azadeh Shakery

    Abstract: Aspect category detection is one of the important and challenging subtasks of aspect-based sentiment analysis. Given a set of pre-defined categories, this task aims to detect categories which are indicated implicitly or explicitly in a given review sentence. Supervised machine learning approaches perform well to accomplish this subtask. Note that, the performance of these methods depends on the av…

    Submitted 21 June, 2019; v1 submitted 8 December, 2018; originally announced December 2018.

  13. arXiv:1801.09936  [pdf]

    cs.CL

    PEYMA: A Tagged Corpus for Persian Named Entities

    Authors: Mahsa Sadat Shahshahani, Mahdi Mohseni, Azadeh Shakery, Heshaam Faili

    Abstract: The goal in the NER task is to classify proper nouns of a text into classes such as person, location, and organization. This is an important preprocessing step in many NLP tasks such as question-answering and summarization. Although many research studies have been conducted in this area in English and the state-of-the-art NER systems have reached performances of higher than 90 percent in terms of…

    Submitted 30 January, 2018; originally announced January 2018.

    Comments: 2017, Signal and Data Processing Journal

  14. arXiv:1708.01891  [pdf]

    cs.SI physics.soc-ph

    A Behavioral Analysis on the Reselection of Seed Nodes in Independent Cascade Based Influence Maximization

    Authors: Ali Vardasbi, Heshaam Faili, Masoud Asadpour

    Abstract: Influence maximization serves as the main goal of a variety of social network activities such as viral marketing and campaign advertising. The independent cascade model for the influence spread assumes a one-time chance for each activated node to influence its neighbors. This reasonable assumption cannot be bypassed, since otherwise the influence probabilities of the nodes, modeled by the edge wei…

    Submitted 6 August, 2017; originally announced August 2017.

  15. arXiv:1704.03223  [pdf]

    cs.CL cs.LG stat.ML

    Persian Wordnet Construction using Supervised Learning

    Authors: Zahra Mousavi, Heshaam Faili

    Abstract: This paper presents an automated supervised method for Persian wordnet construction. Using a Persian corpus and a bi-lingual dictionary, the initial links between Persian words and Princeton WordNet synsets have been generated. These links will be discriminated later as correct or incorrect by employing seven features in a trained classification system. The whole method is just a classification sy…

    Submitted 11 April, 2017; originally announced April 2017.

  16. arXiv:1606.00615  [pdf, other]

    cs.IR

    Low-dimensional Query Projection based on Divergence Minimization Feedback Model for Ad-hoc Retrieval

    Authors: Javid Dadashkarimi, Masoud Jalili Sabet, Heshaam Faili, Azadeh Shakery

    Abstract: Low-dimensional word vectors have long been used in a wide range of applications in natural language processing. In this paper we shed light on estimating query vectors in ad-hoc retrieval where a limited information is available in the original query. Pseudo-relevance feedback (PRF) is a well-known technique for updating query language models and expanding the queries with a number of relevant te…

    Submitted 22 December, 2016; v1 submitted 2 June, 2016; originally announced June 2016.

  17. arXiv:1605.07852  [pdf, other]

    cs.IR cs.CL

    SS4MCT: A Statistical Stemmer for Morphologically Complex Texts

    Authors: Javid Dadashkarimi, Hossein Nasr Esfahani, Heshaam Faili, Azadeh Shakery

    Abstract: There have been multiple attempts to resolve various inflection matching problems in information retrieval. Stemming is a common approach to this end. Among many techniques for stemming, statistical stemming has been shown to be effective in a number of languages, particularly highly inflected languages. In this paper we propose a method for finding affixes in different positions of a word. Common…

    Submitted 20 June, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

  18. arXiv:1605.07844  [pdf, other]

    cs.IR cs.AI cs.CL

    Dimension Projection among Languages based on Pseudo-relevant Documents for Query Translation

    Authors: Javid Dadashkarimi, Mahsa S. Shahshahani, Amirhossein Tebbifakhr, Heshaam Faili, Azadeh Shakery

    Abstract: Using top-ranked documents in response to a query has been shown to be an effective approach to improve the quality of query translation in dictionary-based cross-language information retrieval. In this paper, we propose a new method for dictionary-based query translation based on dimension projection of embedded vectors from the pseudo-relevant documents in the source language to their equivalent…

    Submitted 8 October, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

  19. arXiv:1411.1006  [pdf, other]

    cs.IR cs.CL

    A Probabilistic Translation Method for Dictionary-based Cross-lingual Information Retrieval in Agglutinative Languages

    Authors: Javid Dadashkarimi, Azadeh Shakery, Heshaam Faili

    Abstract: Translation ambiguity, out of vocabulary words and missing some translations in bilingual dictionaries make dictionary-based Cross-language Information Retrieval (CLIR) a challenging task. Moreover, in agglutinative languages which do not have reliable stemmers, missing various lexical formations in bilingual dictionaries degrades CLIR performance. This paper aims to introduce a probabilistic tran…

    Submitted 5 November, 2014; v1 submitted 4 November, 2014; originally announced November 2014.

    Comments: The 3rd conference of Computational Linguistic, Sharif University of Technology, November 2014

  20. arXiv:1405.5447  [pdf]

    cs.IR cs.CL

    Learning to Exploit Different Translation Resources for Cross Language Information Retrieval

    Authors: Hosein Azarbonyad, Azadeh Shakery, Heshaam Faili

    Abstract: One of the important factors that affects the performance of Cross Language Information Retrieval (CLIR) is the quality of translations being employed in CLIR. In order to improve the quality of translations, it is important to exploit available resources efficiently. Employing different translation resources with different characteristics has many challenges. In this paper, we propose a method for…

    Submitted 20 May, 2014; originally announced May 2014.

    Journal ref: International Journal of Information and Communication Technology Research, Volume 6, Issue 1, pp. 55-68, 2013