Showing 1–9 of 9 results for author: Monajatipoor, M

  1. arXiv:2404.17779

    cs.CL

    Medical Vision-Language Pre-Training for Brain Abnormalities

    Authors: Masoud Monajatipoor, Zi-Yi Dou, Aichi Chien, Nanyun Peng, Kai-Wei Chang

    Abstract: Vision-language models have become increasingly powerful for tasks that require an understanding of both visual and linguistic elements, bridging the gap between these modalities. In the context of multimodal clinical AI, there is a growing need for models that possess domain-specific knowledge, as existing models often lack the expertise required for medical applications. In this paper, we take b…

    Submitted 27 April, 2024; originally announced April 2024.

  2. arXiv:2404.07376

    cs.CL

    LLMs in Biomedicine: A study on clinical Named Entity Recognition

    Authors: Masoud Monajatipoor, Jiaxin Yang, Joel Stremmel, Melika Emami, Fazlolah Mohaghegh, Mozhdeh Rouhsedaghat, Kai-Wei Chang

    Abstract: Large Language Models (LLMs) demonstrate remarkable versatility in various NLP tasks but encounter distinct challenges in biomedicine due to medical language complexities and data scarcity. This paper investigates the application of LLMs in the medical domain by exploring strategies to enhance their performance for the Named-Entity Recognition (NER) task. Specifically, our study reveals the import…

    Submitted 10 April, 2024; originally announced April 2024.

  3. arXiv:2306.01311

    cs.CL

    MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models

    Authors: Masoud Monajatipoor, Liunian Harold Li, Mozhdeh Rouhsedaghat, Lin F. Yang, Kai-Wei Chang

    Abstract: Large-scale language models have shown the ability to adapt to a new task via conditioning on a few demonstrations (i.e., in-context learning). However, in the vision-language domain, most large-scale pre-trained vision-language (VL) models do not possess the ability to conduct in-context learning. How can we enable in-context learning for VL models? In this paper, we study an interesting hypothes…

    Submitted 2 June, 2023; originally announced June 2023.

  4. arXiv:2210.15230

    cs.CL cs.AI cs.LG cs.MM

    How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?

    Authors: Hritik Bansal, Da Yin, Masoud Monajatipoor, Kai-Wei Chang

    Abstract: Text-to-image generative models have achieved unprecedented success in generating high-quality images based on natural language descriptions. However, it is shown that these models tend to favor specific social groups when prompted with neutral text descriptions (e.g., 'a photo of a lawyer'). Following Zhao et al. (2021), we study the effect on the diversity of the generated images when adding eth…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 13 pages, 8 figures, 6 tables. Accepted as Oral Presentation at EMNLP 2022

  5. arXiv:2209.11549

    cs.CV cs.AI cs.LG

    MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

    Authors: Mozhdeh Rouhsedaghat, Masoud Monajatipoor, C. -C. Jay Kuo, Iacopo Masi

    Abstract: We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guarant…

    Submitted 30 June, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Accepted to the Thirty-Seventh Conference on Artificial Intelligence (AAAI) 2023 - 12 pages, 9 figures

  6. arXiv:2205.12247

    cs.CL

    GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models

    Authors: Da Yin, Hritik Bansal, Masoud Monajatipoor, Liunian Harold Li, Kai-Wei Chang

    Abstract: Recent work has shown that Pre-trained Language Models (PLMs) store the relational knowledge learned from data and utilize it for performing downstream tasks. However, commonsense knowledge across different regions may vary. For instance, the color of bridal dress is white in American weddings whereas it is red in Chinese weddings. In this paper, we introduce a benchmark dataset, Geo-Diverse Commo…

    Submitted 29 November, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: EMNLP 2022. Code and data are released at https://github.com/WadeYin9712/GeoMLAMA/

  7. arXiv:2108.12084

    cs.CL cs.AI cs.LG

    Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies

    Authors: Sunipa Dev, Masoud Monajatipoor, Anaelia Ovalle, Arjun Subramonian, Jeff M Phillips, Kai-Wei Chang

    Abstract: Gender is widely discussed in the context of language tasks and when examining the stereotypes propagated by language models. However, current discussions primarily treat gender as binary, which can perpetuate harms such as the cyclical erasure of non-binary gender identities. These harms are driven by model and dataset biases, which are consequences of the non-recognition and lack of understandin…

    Submitted 10 September, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

    Journal ref: EMNLP 2021

  8. arXiv:2108.04938

    cs.CV cs.AI cs.CL

    BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis

    Authors: Masoud Monajatipoor, Mozhdeh Rouhsedaghat, Liunian Harold Li, Aichi Chien, C. -C. Jay Kuo, Fabien Scalzo, Kai-Wei Chang

    Abstract: Vision-and-language (V&L) models take image and text as input and learn to capture the associations between them. Prior studies show that pre-trained V&L models can significantly improve the model performance for downstream tasks such as Visual Question Answering (VQA). However, V&L models are less effective when applied in the medical domain (e.g., on X-ray images and clinical notes) due to the do…

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: 10 pages, 8 figures, Accepted in ICCV workshop

  9. arXiv:2103.00121

    cs.CV

    Successive Subspace Learning: An Overview

    Authors: Mozhdeh Rouhsedaghat, Masoud Monajatipoor, Zohreh Azizi, C. -C. Jay Kuo

    Abstract: Successive Subspace Learning (SSL) offers a light-weight unsupervised feature learning method based on inherent statistical properties of data units (e.g. image pixels and points in point cloud sets). It has shown promising results, especially on small datasets. In this paper, we intuitively explain this method, provide an overview of its development, and point out some open questions and challeng…

    Submitted 26 February, 2021; originally announced March 2021.

    Comments: 4 pages, 1 figure