Skip to main content

Showing 1–3 of 3 results for author: Gooran, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.04845  [pdf, other

    cs.CL cs.AI

    SLPL SHROOM at SemEval2024 Task 06: A comprehensive study on models ability to detect hallucination

    Authors: Pouya Fallah, Soroush Gooran, Mohammad Jafarinasab, Pouya Sadeghi, Reza Farnia, Amirreza Tarabkhah, Zainab Sadat Taghavi, Hossein Sameti

    Abstract: Language models, particularly generative models, are susceptible to hallucinations, generating outputs that contradict factual knowledge or the source text. This study explores methods for detecting hallucinations in three SemEval-2024 Task 6 tasks: Machine Translation, Definition Modeling, and Paraphrase Generation. We evaluate two methods: semantic similarity between the generated text and factu… ▽ More

    Submitted 9 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

  2. arXiv:2308.10354  [pdf, other

    cs.AI cs.CL

    Imaginations of WALL-E : Reconstructing Experiences with an Imagination-Inspired Module for Advanced AI Systems

    Authors: Zeinab Sadat Taghavi, Soroush Gooran, Seyed Arshan Dalili, Hamidreza Amirzadeh, Mohammad Jalal Nematbakhsh, Hossein Sameti

    Abstract: In this paper, we introduce a novel Artificial Intelligence (AI) system inspired by the philosophical and psychoanalytical concept of imagination as a ``Re-construction of Experiences". Our AI system is equipped with an imagination-inspired module that bridges the gap between textual inputs and other modalities, enriching the derived information based on previously learned experiences. A unique fe… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 18 pages,

  3. arXiv:2208.13486  [pdf, other

    cs.CL

    naab: A ready-to-use plug-and-play corpus for Farsi

    Authors: Sadra Sabouri, Elnaz Rahmati, Soroush Gooran, Hossein Sameti

    Abstract: Huge corpora of textual data are always known to be a crucial need for training deep models such as transformer-based ones. This issue is emerging more in lower resource languages - like Farsi. We propose naab, the biggest cleaned and ready-to-use open-source textual corpus in Farsi. It contains about 130GB of data, 250 million paragraphs, and 15 billion words. The project name is derived from the… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: 6 pages, 2 figures