Skip to main content

Showing 1–4 of 4 results for author: Janfada, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.10879  [pdf, ps, other

    cs.CL cs.AI

    Evaluating Persian Tokenizers

    Authors: Danial Kamali, Behrooz Janfada, Mohammad Ebrahim Shenasa, Behrouz Minaei-Bidgoli

    Abstract: Tokenization plays a significant role in the process of lexical analysis. Tokens become the input for other natural language processing tasks, like semantic parsing and language modeling. Natural Language Processing in Persian is challenging due to Persian's exceptional cases, such as half-spaces. Thus, it is crucial to have a precise tokenizer for Persian. This article provides a novel work by in… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  2. arXiv:2102.00395  [pdf, other

    cs.CL cs.AI

    An Unsupervised Language-Independent Entity Disambiguation Method and its Evaluation on the English and Persian Languages

    Authors: Majid Asgari-Bidhendi, Behrooz Janfada, Amir Havangi, Sayyed Ali Hossayni, Behrouz Minaei-Bidgoli

    Abstract: Entity Linking is one of the essential tasks of information extraction and natural language understanding. Entity linking mainly consists of two tasks: recognition and disambiguation of named entities. Most studies address these two tasks separately or focus only on one of them. Moreover, most of the state-of-the -art entity linking algorithms are either supervised, which have poor performance in… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: arXiv admin note: text overlap with arXiv:2004.10816

  3. arXiv:2005.06588  [pdf, ps, other

    cs.CL

    PERLEX: A Bilingual Persian-English Gold Dataset for Relation Extraction

    Authors: Majid Asgari-Bidhendi, Mehrdad Nasser, Behrooz Janfada, Behrouz Minaei-Bidgoli

    Abstract: Relation extraction is the task of extracting semantic relations between entities in a sentence. It is an essential part of some natural language processing tasks such as information extraction, knowledge extraction, and knowledge base population. The main motivations of this research stem from a lack of a dataset for relation extraction in the Persian language as well as the necessity of extracti… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  4. arXiv:2005.01879  [pdf, other

    cs.CL

    FarsBase-KBP: A Knowledge Base Population System for the Persian Knowledge Graph

    Authors: Majid Asgari-Bidhendi, Behrooz Janfada, Behrouz Minaei-Bidgoli

    Abstract: While most of the knowledge bases already support the English language, there is only one knowledge base for the Persian language, known as FarsBase, which is automatically created via semi-structured web information. Unlike English knowledge bases such as Wikidata, which have tremendous community support, the population of a knowledge base like FarsBase must rely on automatically extracted knowle… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: 39 pages, 6 figures