
Showing 1–14 of 14 results for author: Hosseini, P

Searching in archive cs.
  1. arXiv:2403.01643

    cs.LG cs.AI cs.CL cs.CV

    You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism

    Authors: Mehran Hosseini, Peyman Hosseini

    Abstract: Scaled Dot Product Attention (SDPA) is the backbone of many modern deep-learning models. It is so versatile that it has been used in natural language, vision, and multi-modal domains with very little change compared to its original formulation. This paper discusses why the current formulation is inefficient by delving into the mathematical details of the attention mechanism. We propose three impro…

    Submitted 30 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    MSC Class: 68T07 (Primary) 68T45; 68T50; 68T10; 15A03; 15A04 (Secondary) ACM Class: I.2.6; I.2.7; I.2.10; I.4.0; I.5.0; I.7.0

  2. arXiv:2402.18919

    cs.CV cs.LG

    Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation

    Authors: Fahimeh Hosseini Noohdani, Parsa Hosseini, Aryan Yazdan Parast, Hamidreza Yaghoubi Araghi, Mahdieh Soleymani Baghshah

    Abstract: While standard Empirical Risk Minimization (ERM) training is proven effective for image classification on in-distribution data, it fails to perform well on out-of-distribution samples. One of the main sources of distribution shift for image classification is the compositional nature of images. Specifically, in addition to the main object or component(s) determining the label, some other image comp…

    Submitted 2 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  3. arXiv:2401.01951

    cs.CV cs.AI cs.LG

    Can We Generate Realistic Hands Only Using Convolution?

    Authors: Mehran Hosseini, Peyman Hosseini

    Abstract: The enduring inability of image generative models to recreate intricate geometric features, such as those present in human hands and fingers, has been an ongoing problem in image generation for nearly a decade. While strides have been made by increasing model sizes and diversifying training datasets, this issue remains prevalent across all models, from denoising diffusion models to Generative Adver…

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: Contains 17 pages, 14 figures, and 6 tables

    MSC Class: 51 ACM Class: I.2.10; I.4.0; I.4.10

  4. Lon-ea at SemEval-2023 Task 11: A Comparison of Activation Functions for Soft and Hard Label Prediction

    Authors: Peyman Hosseini, Mehran Hosseini, Sana Sabah Al-Azzawi, Marcus Liwicki, Ignacio Castro, Matthew Purver

    Abstract: We study the influence of different activation functions in the output layer of deep neural network models for soft and hard label prediction in the learning with disagreement task. In this task, the goal is to quantify the amount of disagreement via predicting soft labels. To predict the soft labels, we use BERT-based preprocessors and encoders and vary the activation function used in the output…

    Submitted 3 January, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: Accepted in ACL 2023 SemEval Workshop as selected task paper

    ACM Class: I.2.7

  5. arXiv:2205.12484

    cs.CL cs.AI

    GisPy: A Tool for Measuring Gist Inference Score in Text

    Authors: Pedram Hosseini, Christopher R. Wolfe, Mona Diab, David A. Broniatowski

    Abstract: Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that individuals tend to rely on gist, or bottom-line meaning, in the text when making decisions. In this work, we delineate the process of developing GisPy, an open-source tool in Python for measuring the Gist Inference Score (GIS) in text. Evaluation of GisPy on documents in three benchmarks from the news and scientific text domai…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to the 4th Workshop on Narrative Understanding @ NAACL 2022

  6. arXiv:2112.08615

    cs.CL

    Knowledge-Augmented Language Models for Cause-Effect Relation Classification

    Authors: Pedram Hosseini, David A. Broniatowski, Mona Diab

    Abstract: Previous studies have shown the efficacy of knowledge augmentation methods in pretrained language models. However, these methods behave differently across domains and downstream tasks. In this work, we investigate the augmentation of pretrained language models with commonsense knowledge in the cause-effect relation classification and commonsense causal reasoning tasks. After automatically verbaliz…

    Submitted 1 June, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted to Commonsense Representation and Reasoning (CSRR) @ ACL 2022

  7. arXiv:2103.13606

    cs.CL cs.AI

    Predicting Directionality in Causal Relations in Text

    Authors: Pedram Hosseini, David A. Broniatowski, Mona Diab

    Abstract: In this work, we test the performance of two bidirectional transformer-based language models, BERT and SpanBERT, on predicting directionality in causal pairs in the textual content. Our preliminary results show that predicting direction for inter-sentence and implicit causal relations is more challenging. And, SpanBERT performs better than BERT on causal samples with longer span length. We also in…

    Submitted 25 March, 2021; originally announced March 2021.

  8. arXiv:2012.06154

    cs.CL cs.AI

    ParsiNLU: A Suite of Language Understanding Challenges for Persian

    Authors: Daniel Khashabi, Arman Cohan, Siamak Shakeri, Pedram Hosseini, Pouya Pezeshkpour, Malihe Alikhani, Moin Aminnaseri, Marzieh Bitaab, Faeze Brahman, Sarik Ghazarian, Mozhdeh Gheini, Arman Kabiri, Rabeeh Karimi Mahabadi, Omid Memarrast, Ahmadreza Mosallanezhad, Erfan Noury, Shahab Raji, Mohammad Sadegh Rasooli, Sepideh Sadeghi, Erfan Sadeqi Azer, Niloofar Safi Samghabadi, Mahsa Shafaei, Saber Sheybani, Ali Tazarv, Yadollah Yaghoobzadeh

    Abstract: Despite the progress made in recent years in addressing natural language understanding (NLU) challenges, the majority of this progress remains to be concentrated on resource-rich languages like English. This work focuses on Persian language, one of the widely spoken languages in the world, and yet there are few NLU datasets available for this rich language. The availability of high-quality evaluat…

    Submitted 13 July, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: To appear on Transactions of the Association for Computational Linguistics (TACL), 2021

  9. arXiv:2010.06671

    cs.CL cs.AI cs.CV

    A Multi-Modal Method for Satire Detection using Textual and Visual Cues

    Authors: Lily Li, Or Levi, Pedram Hosseini, David A. Broniatowski

    Abstract: Satire is a form of humorous critique, but it is sometimes misinterpreted by readers as legitimate news, which can lead to harmful consequences. We observe that the images used in satirical news articles often contain absurd or ridiculous content and that image manipulation is used to create fictional scenarios. While previous work has studied text-based methods, in this work we propose a multi-m…

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted to the Third Workshop on NLP for Internet Freedom (NLP4IF): Censorship, Disinformation, and Propaganda. Co-located with COLING 2020

  10. arXiv:2005.08400

    cs.SI cs.CL cs.CY

    Content analysis of Persian/Farsi Tweets during COVID-19 pandemic in Iran using NLP

    Authors: Pedram Hosseini, Poorya Hosseini, David A. Broniatowski

    Abstract: Iran, along with China, South Korea, and Italy, was among the countries that were hit hard in the first wave of the COVID-19 spread. Twitter is one of the widely-used online platforms by Iranians inside and abroad for sharing their opinion, thoughts, and feelings about a wide range of issues. In this study, using more than 530,000 original tweets in Persian/Farsi on COVID-19, we analyzed the topics…

    Submitted 17 May, 2020; originally announced May 2020.

  11. arXiv:2004.09745

    cs.SI

    Automatically Identifying Political Ads on Facebook: Towards Understanding of Manipulation via User Targeting

    Authors: Or Levi, Sardar Hamidian, Pedram Hosseini

    Abstract: The reports of Russian interference in the 2016 United States elections brought into the center of public attention concerns related to the ability of foreign actors to increase social discord and take advantage of personal user data for political purposes. It has raised questions regarding the ways and the extent to which data can be used to create psychographical profiles to determine what kind…

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: Accepted to the 2nd Multidisciplinary International Symposium on Disinformation in Open Online Media (MISDOOM 2020)

  12. arXiv:1911.05263

    cs.CL cs.LG

    LexiPers: An ontology based sentiment lexicon for Persian

    Authors: Behnam Sabeti, Pedram Hosseini, Gholamreza Ghassem-Sani, Seyed Abolghasem Mirroshandel

    Abstract: Sentiment analysis refers to the use of natural language processing to identify and extract subjective information from textual resources. One approach for sentiment extraction is using a sentiment lexicon. A sentiment lexicon is a set of words associated with the sentiment orientation that they express. In this paper, we describe the process of generating a general purpose sentiment lexicon for P…

    Submitted 12 November, 2019; originally announced November 2019.

  13. Identifying Nuances in Fake News vs. Satire: Using Semantic and Linguistic Cues

    Authors: Or Levi, Pedram Hosseini, Mona Diab, David A. Broniatowski

    Abstract: The blurry line between nefarious fake news and protected-speech satire has been a notorious struggle for social media platforms. Further to the efforts of reducing exposure to misinformation on social media, purveyors of fake news have begun to masquerade as satire sites to avoid being demoted. In this work, we address the challenge of automatically classifying fake news versus satire. Previous w…

    Submitted 5 November, 2019; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted to the 2nd Workshop on NLP for Internet Freedom (NLP4IF): Censorship, Disinformation, and Propaganda. Co-located with EMNLP-IJCNLP 2019

  14. arXiv:1801.07737

    cs.CL

    SentiPers: A Sentiment Analysis Corpus for Persian

    Authors: Pedram Hosseini, Ali Ahmadian Ramaki, Hassan Maleki, Mansoureh Anvari, Seyed Abolghasem Mirroshandel

    Abstract: Sentiment Analysis (SA) is a major field of study in natural language processing, computational linguistics and information retrieval. Interest in SA has been constantly growing in both academia and industry over the recent years. Moreover, there is an increasing need for generating appropriate resources and datasets in particular for low resource languages including Persian. These datasets play a…

    Submitted 1 January, 2021; v1 submitted 23 January, 2018; originally announced January 2018.

    Comments: This work is accepted to the 3rd Conference on Computational Linguistics, Sharif University of Technology