Skip to main content

Showing 1–7 of 7 results for author: Bader, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.14567  [pdf, other

    cs.CL

    WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models

    Authors: Ronald Xie, Steven Palayew, Augustin Toma, Gary Bader, Bo Wang

    Abstract: This paper outlines our submission to the MEDIQA2024 Multilingual and Multimodal Medical Answer Generation (M3G) shared task. We report results for two standalone solutions under the English category of the task, the first involving two consecutive API calls to the Claude 3 Opus API and the second involving training an image-disease label joint embedding in the style of CLIP for image classificati… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  2. arXiv:2308.05864  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    The Multi-modality Cell Segmentation Challenge: Towards Universal Solutions

    Authors: Jun Ma, Ronald Xie, Shamini Ayyadhury, Cheng Ge, Anubha Gupta, Ritu Gupta, Song Gu, Yao Zhang, Gihun Lee, Joonkee Kim, Wei Lou, Haofeng Li, Eric Upschulte, Timo Dickscheid, José Guilherme de Almeida, Yixin Wang, Lin Han, Xin Yang, Marco Labagnara, Vojislav Gligorovski, Maxime Scheder, Sahand Jamal Rahi, Carly Kempster, Alice Pollitt, Leon Espinosa , et al. (15 additional authors not shown)

    Abstract: Cell segmentation is a critical step for quantitative single-cell analysis in microscopy images. Existing cell segmentation methods are often tailored to specific modalities or require manual interventions to specify hyper-parameters in different experimental settings. Here, we present a multi-modality cell segmentation benchmark, comprising over 1500 labeled images derived from more than 50 diver… ▽ More

    Submitted 1 April, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: NeurIPS22 Cell Segmentation Challenge: https://neurips22-cellseg.grand-challenge.org/ . Nature Methods (2024)

  3. arXiv:2306.01859  [pdf, other

    cs.CV cs.AI

    Spatially Resolved Gene Expression Prediction from H&E Histology Images via Bi-modal Contrastive Learning

    Authors: Ronald Xie, Kuan Pang, Sai W. Chung, Catia T. Perciani, Sonya A. MacParland, Bo Wang, Gary D. Bader

    Abstract: Histology imaging is an important tool in medical diagnosis and research, enabling the examination of tissue structure and composition at the microscopic level. Understanding the underlying molecular mechanisms of tissue architecture is critical in uncovering disease mechanisms and develo** effective treatments. Gene expression profiling provides insight into the molecular processes underlying t… ▽ More

    Submitted 27 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  4. arXiv:2212.10526  [pdf, other

    cs.CL cs.AI

    Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval

    Authors: John Giorgi, Luca Soldaini, Bo Wang, Gary Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan

    Abstract: Multi-document summarization (MDS) assumes a set of topic-related documents are provided as input. In practice, this document set is not always available; it would need to be retrieved given an information need, i.e. a question or topic statement, a setting we dub "open-domain" MDS. We study this more challenging setting by formalizing the task and bootstrap** it using existing datasets, retriev… ▽ More

    Submitted 25 October, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted to EMNLP Findings 2023

  5. arXiv:2204.01098  [pdf, other

    cs.CL cs.AI

    A sequence-to-sequence approach for document-level relation extraction

    Authors: John Giorgi, Gary D. Bader, Bo Wang

    Abstract: Motivated by the fact that many relations cross the sentence boundary, there has been increasing interest in document-level relation extraction (DocRE). DocRE requires integrating information within and across sentences, capturing complex interactions between mentions of entities. Most existing methods are pipeline-based, requiring entities as input. However, jointly learning to extract entities a… ▽ More

    Submitted 10 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Camera-ready copy for BioNLP 2022 @ ACL 2022

  6. arXiv:2006.03659  [pdf, other

    cs.CL cs.LG

    DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations

    Authors: John Giorgi, Osvald Nitski, Bo Wang, Gary Bader

    Abstract: Sentence embeddings are an important component of many natural language processing (NLP) systems. Like word embeddings, sentence embeddings are typically learned on large text corpora and then transferred to various downstream tasks, such as clustering and retrieval. Unlike word embeddings, the highest performing solutions for learning sentence embeddings require labelled data, limiting their usef… ▽ More

    Submitted 27 May, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: ACL2021 Camera Ready V2

  7. arXiv:1912.13415  [pdf, other

    cs.CL cs.LG

    End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language Models

    Authors: John Giorgi, Xindi Wang, Nicola Sahar, Won Young Shin, Gary D. Bader, Bo Wang

    Abstract: Named entity recognition (NER) and relation extraction (RE) are two important tasks in information extraction and retrieval (IE \& IR). Recent work has demonstrated that it is beneficial to learn these tasks jointly, which avoids the propagation of error inherent in pipeline-based systems and improves performance. However, state-of-the-art joint models typically rely on external natural language p… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 12 pages, 2 figures