
Showing 1–9 of 9 results for author: Giorgi, J

  1. arXiv:2405.01796

    cs.CL cs.DL cs.IR

    TOPICAL: TOPIC Pages AutomagicaLly

    Authors: John Giorgi, Amanpreet Singh, Doug Downey, Sergey Feldman, Lucy Lu Wang

    Abstract: Topic pages aggregate useful information about an entity or concept into a single succinct and accessible article. Automated creation of topic pages would enable their rapid curation as information resources, providing an alternative to traditional web search. While most prior work has focused on generating topic pages about biographical entities, in this work, we develop a completely automated pr…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 10 pages, 7 figures, 2 tables, NAACL System Demonstrations 2024

  2. arXiv:2306.11167

    cs.CL cs.AI cs.LG

    Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset

    Authors: Saeid Naeini, Raeid Saqur, Mozhgan Saeidi, John Giorgi, Babak Taati

    Abstract: The quest for human imitative AI has been an enduring topic in AI research since its inception. The technical evolution and emerging capabilities of the latest cohort of large language models (LLMs) have reinvigorated the subject beyond academia to the cultural zeitgeist. While recent NLP evaluation benchmark tasks test some aspects of human-imitative behaviour (e.g., BIG-bench's 'human-like behav…

    Submitted 8 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: v4, v3: Minor cosmetic adjustments, typo fixes, etc. from V2. Fixed Fig. 2 caption overlapping with text in S2.2. V2: added OCW-Randomized and OCW-WordNet results in Section 4.3. 22 pages with Appendix

    ACM Class: I.2.7

  3. arXiv:2305.02220

    cs.CL cs.AI cs.LG

    WangLab at MEDIQA-Chat 2023: Clinical Note Generation from Doctor-Patient Conversations using Large Language Models

    Authors: John Giorgi, Augustin Toma, Ronald Xie, Sondra S. Chen, Kevin R. An, Grace X. Zheng, Bo Wang

    Abstract: This paper describes our submission to the MEDIQA-Chat 2023 shared task for automatic clinical note generation from doctor-patient conversations. We report results for two approaches: the first fine-tunes a pre-trained language model (PLM) on the shared task data, and the second uses few-shot in-context learning (ICL) with a large language model (LLM). Both achieve high performance as measured by…

    Submitted 3 June, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Camera-ready submission to ClinicalNLP @ ACL 2023

  4. arXiv:2212.10526

    cs.CL cs.AI

    Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval

    Authors: John Giorgi, Luca Soldaini, Bo Wang, Gary Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan

    Abstract: Multi-document summarization (MDS) assumes a set of topic-related documents are provided as input. In practice, this document set is not always available; it would need to be retrieved given an information need, i.e. a question or topic statement, a setting we dub "open-domain" MDS. We study this more challenging setting by formalizing the task and bootstrapping it using existing datasets, retriev…

    Submitted 25 October, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted to EMNLP Findings 2023

  5. arXiv:2211.05100

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  6. arXiv:2206.15076

    cs.CL

    BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

    Authors: Jason Alan Fries, Leon Weber, Natasha Seelam, Gabriel Altay, Debajyoti Datta, Samuele Garda, Myungsun Kang, Ruisi Su, Wojciech Kusa, Samuel Cahyawijaya, Fabio Barth, Simon Ott, Matthias Samwald, Stephen Bach, Stella Biderman, Mario Sänger, Bo Wang, Alison Callahan, Daniel León Periñán, Théo Gigant, Patrick Haller, Jenny Chim, Jose David Posada, John Michael Giorgi, Karthik Rangasai Sivaraman, et al. (18 additional authors not shown)

    Abstract: Training and evaluating language models increasingly requires the construction of meta-datasets: diverse collections of curated data with clear provenance. Natural language prompting has recently led to improved zero-shot generalization by transforming existing, supervised datasets into a diversity of novel pretraining tasks, highlighting the benefits of meta-dataset curation. While successful i…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Submitted to NeurIPS 2022 Datasets and Benchmarks Track

  7. arXiv:2204.01098

    cs.CL cs.AI

    A sequence-to-sequence approach for document-level relation extraction

    Authors: John Giorgi, Gary D. Bader, Bo Wang

    Abstract: Motivated by the fact that many relations cross the sentence boundary, there has been increasing interest in document-level relation extraction (DocRE). DocRE requires integrating information within and across sentences, capturing complex interactions between mentions of entities. Most existing methods are pipeline-based, requiring entities as input. However, jointly learning to extract entities a…

    Submitted 10 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Camera-ready copy for BioNLP 2022 @ ACL 2022

  8. arXiv:2006.03659

    cs.CL cs.LG

    DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations

    Authors: John Giorgi, Osvald Nitski, Bo Wang, Gary Bader

    Abstract: Sentence embeddings are an important component of many natural language processing (NLP) systems. Like word embeddings, sentence embeddings are typically learned on large text corpora and then transferred to various downstream tasks, such as clustering and retrieval. Unlike word embeddings, the highest performing solutions for learning sentence embeddings require labelled data, limiting their usef…

    Submitted 27 May, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: ACL 2021 Camera Ready V2

  9. arXiv:1912.13415

    cs.CL cs.LG

    End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language Models

    Authors: John Giorgi, Xindi Wang, Nicola Sahar, Won Young Shin, Gary D. Bader, Bo Wang

    Abstract: Named entity recognition (NER) and relation extraction (RE) are two important tasks in information extraction and retrieval (IE & IR). Recent work has demonstrated that it is beneficial to learn these tasks jointly, which avoids the propagation of error inherent in pipeline-based systems and improves performance. However, state-of-the-art joint models typically rely on external natural language p…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 12 pages, 2 figures