Skip to main content

Showing 1–50 of 52 results for author: Poon, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00902  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    From Introspection to Best Practices: Principled Analysis of Demonstrations in Multimodal In-Context Learning

    Authors: Nan Xu, Fei Wang, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: Motivated by in-context learning (ICL) capabilities of Large Language models (LLMs), multimodal LLMs with additional visual modality are also exhibited with similar ICL abilities when multiple image-text pairs are provided as demonstrations. However, relatively less work has been done to investigate the principles behind how and why multimodal ICL works. We conduct a systematic and principled eval… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.11839  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    mDPO: Conditional Preference Optimization for Multimodal Large Language Models

    Authors: Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: Direct preference optimization (DPO) has shown to be an effective method for large language model (LLM) alignment. Recent works have attempted to apply DPO to multimodal scenarios but have found it challenging to achieve consistent improvement. Through a comparative experiment, we identify the unconditional preference problem in multimodal preference optimization, where the model overlooks the ima… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.09411  [pdf, other

    cs.CV cs.AI cs.CL

    MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

    Authors: Fei Wang, Xingyu Fu, James Y. Huang, Zekun Li, Qin Liu, Xiaogeng Liu, Mingyu Derek Ma, Nan Xu, Wenxuan Zhou, Kai Zhang, Tianyi Lorena Yan, Wenjie Jacky Mo, Hsiang-Hui Liu, Pan Lu, Chunyuan Li, Chaowei Xiao, Kai-Wei Chang, Dan Roth, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: We introduce MuirBench, a comprehensive benchmark that focuses on robust multi-image understanding capabilities of multimodal LLMs. MuirBench consists of 12 diverse multi-image tasks (e.g., scene understanding, ordering) that involve 10 categories of multi-image relations (e.g., multiview, temporal relations). Comprising 11,264 images and 2,600 multiple-choice questions, MuirBench is created in a… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2405.12971  [pdf, other

    cs.CV

    BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

    Authors: Theodore Zhao, Yu Gu, Jianwei Yang, Naoto Usuyama, Ho Hin Lee, Tristan Naumann, Jianfeng Gao, Angela Crabtree, Jacob Abel, Christine Moung-Wen, Brian Piening, Carlo Bifulco, Mu Wei, Hoifung Poon, Sheng Wang

    Abstract: Biomedical image analysis is fundamental for biomedical discovery in cell biology, pathology, radiology, and many other biomedical domains. Holistic image analysis comprises interdependent subtasks such as segmentation, detection, and recognition of relevant objects. Here, we propose BiomedParse, a biomedical foundation model for imaging parsing that can jointly conduct segmentation, detection, an… ▽ More

    Submitted 4 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Project page: https://aka.ms/biomedparse-project

  5. arXiv:2404.11045  [pdf, other

    cs.CL

    Offset Unlearning for Large Language Models

    Authors: James Y. Huang, Wenxuan Zhou, Fei Wang, Fred Morstatter, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: Despite the strong capabilities of Large Language Models (LLMs) to acquire knowledge from their training corpora, the memorization of sensitive information in the corpora such as copyrighted, harmful, and private content has led to ethical and legal concerns. In response to these challenges, unlearning has emerged as a potential remedy for LLMs affected by problematic training data. However, previ… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  6. arXiv:2403.08002  [pdf, other

    cs.CL cs.CV

    Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation

    Authors: Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Akshay Chaudhari, Serena Yeung-Levy, Curtis P. Langlotz , et al. (2 additional authors not shown)

    Abstract: The scaling laws and extraordinary performance of large foundation models motivate the development and utilization of such models in biomedicine. However, despite early promising results on some biomedical benchmarks, there are still major challenges that need to be addressed before these models can be used in real-world clinics. Frontier general-domain models such as GPT-4V still have significant… ▽ More

    Submitted 26 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  7. arXiv:2403.01002  [pdf, other

    cs.CL cs.AI

    Attribute Structuring Improves LLM-Based Evaluation of Clinical Text Summaries

    Authors: Zelalem Gero, Chandan Singh, Yiqing Xie, Sheng Zhang, Tristan Naumann, Jianfeng Gao, Hoifung Poon

    Abstract: Summarizing clinical text is crucial in health decision-support and clinical research. Large language models (LLMs) have shown the potential to generate accurate clinical text summaries, but still struggle with issues regarding grounding and evaluation, especially in safety-critical domains such as health. Holistically evaluating text summaries is challenging because they may contain unsubstantiat… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 4 pages

  8. arXiv:2401.14637  [pdf, other

    cs.CL

    T-Rex: Text-assisted Retrosynthesis Prediction

    Authors: Yifeng Liu, Hanwen Xu, Tangqi Fang, Haocheng Xi, Zixuan Liu, Sheng Zhang, Hoifung Poon, Sheng Wang

    Abstract: As a fundamental task in computational chemistry, retrosynthesis prediction aims to identify a set of reactants to synthesize a target molecule. Existing template-free approaches only consider the graph structures of the target molecule, which often cannot generalize well to rare reaction types and large molecules. Here, we propose T-Rex, a text-assisted retrosynthesis prediction approach that exp… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  9. arXiv:2401.07654  [pdf, other

    cs.CV

    Foundation Models for Biomedical Image Segmentation: A Survey

    Authors: Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

    Abstract: Recent advancements in biomedical image analysis have been significantly driven by the Segment Anything Model (SAM). This transformative technology, originally developed for general-purpose computer vision, has found rapid application in medical image processing. Within the last year, marked by over 100 publications, SAM has demonstrated its prowess in zero-shot learning adaptations for medical im… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 22 pages, 4 figures, 7 tables

  10. arXiv:2312.03558  [pdf, other

    cs.CV

    When an Image is Worth 1,024 x 1,024 Words: A Case Study in Computational Pathology

    Authors: Wenhui Wang, Shuming Ma, Hanwen Xu, Naoto Usuyama, Jiayu Ding, Hoifung Poon, Furu Wei

    Abstract: This technical report presents LongViT, a vision Transformer that can process gigapixel images in an end-to-end manner. Specifically, we split the gigapixel image into a sequence of millions of patches and project them linearly into embeddings. LongNet is then employed to model the extremely long sequence, generating representations that capture both short-range and long-range dependencies. The li… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  11. arXiv:2311.16452  [pdf, other

    cs.CL

    Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

    Authors: Harsha Nori, Yin Tat Lee, Sheng Zhang, Dean Carignan, Richard Edgar, Nicolo Fusi, Nicholas King, Jonathan Larson, Yuanzhi Li, Weishung Liu, Renqian Luo, Scott Mayer McKinney, Robert Osazuwa Ness, Hoifung Poon, Tao Qin, Naoto Usuyama, Chris White, Eric Horvitz

    Abstract: Generalist foundation models such as GPT-4 have displayed surprising capabilities in a wide variety of domains and tasks. Yet, there is a prevalent assumption that they cannot match specialist capabilities of fine-tuned models. For example, most explorations to date on medical competency benchmarks have leveraged domain-specific training, as exemplified by efforts on BioGPT and Med-PaLM. We build… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 21 pages, 7 figures

    ACM Class: I.2.7

  12. arXiv:2311.09581  [pdf, other

    cs.CL

    DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation

    Authors: Yiqing Xie, Sheng Zhang, Hao Cheng, Pengfei Liu, Zelalem Gero, Cliff Wong, Tristan Naumann, Hoifung Poon, Carolyn Rose

    Abstract: Medical text generation aims to assist with administrative work and highlight salient information to support decision-making. To reflect the specific requirements of medical text, in this paper, we propose a set of metrics to evaluate the completeness, conciseness, and attribution of the generated text at a fine-grained level. The metrics can be computed by various types of evaluators including in… ▽ More

    Submitted 18 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  13. arXiv:2311.01301  [pdf, other

    cs.LG cs.AI stat.ME

    TRIALSCOPE: A Unifying Causal Framework for Scaling Real-World Evidence Generation with Biomedical Language Models

    Authors: Javier González, Cliff Wong, Zelalem Gero, Jass Bagga, Risa Ueno, Isabel Chien, Eduard Oravkin, Emre Kiciman, Aditya Nori, Roshanthi Weerasinghe, Rom S. Leidner, Brian Piening, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: The rapid digitization of real-world data offers an unprecedented opportunity for optimizing healthcare delivery and accelerating biomedical discovery. In practice, however, such data is most abundantly available in unstructured forms, such as clinical notes in electronic medical records (EMRs), and it is generally plagued by confounders. In this paper, we present TRIALSCOPE, a unifying framework… ▽ More

    Submitted 6 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 6 Figures, 22 Pages, 3 Tables

  14. arXiv:2310.14573  [pdf, other

    cs.CL

    Exploring the Boundaries of GPT-4 in Radiology

    Authors: Qianchu Liu, Stephanie Hyland, Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Maria Teodora Wetscherek, Robert Tinn, Harshita Sharma, Fernando Pérez-García, Anton Schwaighofer, Pranav Rajpurkar, Sameer Tajdin Khanna, Hoifung Poon, Naoto Usuyama, Anja Thieme, Aditya V. Nori, Matthew P. Lungren, Ozan Oktay, Javier Alvarez-Valle

    Abstract: The recent success of general-domain large language models (LLMs) has significantly changed the natural language processing paradigm towards a unified foundation model across domains and applications. In this paper, we focus on assessing the performance of GPT-4, the most capable LLM so far, on the text-based applications for radiology reports, comparing against state-of-the-art (SOTA) radiology-s… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main

  15. arXiv:2310.10765  [pdf, other

    cs.CV cs.AI cs.CL

    BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys

    Authors: Yu Gu, Jianwei Yang, Naoto Usuyama, Chunyuan Li, Sheng Zhang, Matthew P. Lungren, Jianfeng Gao, Hoifung Poon

    Abstract: Rapid progress has been made in instruction-learning for image editing with natural-language instruction, as exemplified by InstructPix2Pix. In biomedicine, such methods can be applied to counterfactual image generation, which helps differentiate causal structure from spurious correlation and facilitate robust image interpretation for disease progression modeling. However, generic image-editing mo… ▽ More

    Submitted 20 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Project page & demo: https://aka.ms/biomedjourney

  16. arXiv:2308.03279  [pdf, other

    cs.CL

    UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition

    Authors: Wenxuan Zhou, Sheng Zhang, Yu Gu, Muhao Chen, Hoifung Poon

    Abstract: Large language models (LLMs) have demonstrated remarkable generalizability, such as understanding arbitrary entities and relations. Instruction tuning has proven effective for distilling LLMs into more cost-efficient models such as Alpaca and Vicuna. Yet such student models still trail the original LLMs by large margins in downstream applications. In this paper, we explore targeted distillation wi… ▽ More

    Submitted 18 January, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: Accepted at ICLR 2024. Project page: https://universal-ner.github.io/

  17. arXiv:2308.02180  [pdf, other

    cs.CL cs.LG

    Scaling Clinical Trial Matching Using Large Language Models: A Case Study in Oncology

    Authors: Cliff Wong, Sheng Zhang, Yu Gu, Christine Moung, Jacob Abel, Naoto Usuyama, Roshanthi Weerasinghe, Brian Piening, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: Clinical trial matching is a key process in health delivery and discovery. In practice, it is plagued by overwhelming unstructured data and unscalable manual processing. In this paper, we conduct a systematic study on scaling clinical trial matching using large language models (LLMs), with oncology as the focus area. Our study is grounded in a clinical trial matching system currently in test deplo… ▽ More

    Submitted 18 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 24 pages, 5 figures, accepted at Machine Learning for Healthcare (MLHC) 2023

  18. arXiv:2307.06439  [pdf, other

    cs.CL cs.AI

    Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

    Authors: Yu Gu, Sheng Zhang, Naoto Usuyama, Yonas Woldesenbet, Cliff Wong, Praneeth Sanapathi, Mu Wei, Naveen Valluri, Erika Strandberg, Tristan Naumann, Hoifung Poon

    Abstract: Large language models (LLMs), such as GPT-4, have demonstrated remarkable capabilities across a wide range of tasks, including health applications. In this paper, we study how LLMs can be used to scale biomedical knowledge curation. We find that while LLMs already possess decent competency in structuring biomedical text, by distillation into a task-specific student model through self-supervised le… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  19. arXiv:2306.16564  [pdf, other

    cs.CL stat.ML

    Pareto Optimal Learning for Estimating Large Language Model Errors

    Authors: Theodore Zhao, Mu Wei, J. Samuel Preston, Hoifung Poon

    Abstract: Large Language Models (LLMs) have shown impressive abilities in many applications. When a concrete and precise answer is desired, it is important to have a quantitative estimation of the potential error rate. However, this can be challenging due to the text-in-text-out nature of generative models. We present a method based on Pareto optimization that generates a risk score to estimate the probabil… ▽ More

    Submitted 22 May, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  20. arXiv:2306.00890  [pdf, other

    cs.CV cs.CL

    LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

    Authors: Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, Jianfeng Gao

    Abstract: Conversational generative AI has demonstrated remarkable promise for empowering biomedical practitioners, but current investigations focus on unimodal text. Multimodal conversational AI has seen rapid progress by leveraging billions of image-text pairs from the public web, but such general-domain vision-language models still lack sophistication in understanding and conversing about biomedical imag… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 17 pages; Website: https://aka.ms/llava-med

  21. arXiv:2306.00024  [pdf, other

    cs.CL cs.LG

    Self-Verification Improves Few-Shot Clinical Information Extraction

    Authors: Zelalem Gero, Chandan Singh, Hao Cheng, Tristan Naumann, Michel Galley, Jianfeng Gao, Hoifung Poon

    Abstract: Extracting patient information from unstructured text is a critical task in health decision-support and clinical research. Large language models (LLMs) have shown the potential to accelerate clinical curation via few-shot in-context learning, in contrast to supervised learning which requires much more costly human annotations. However, despite drastic advances in modern LLMs such as GPT-4, they st… ▽ More

    Submitted 30 May, 2023; originally announced June 2023.

    Journal ref: IMLH 2023

  22. arXiv:2303.13386  [pdf, other

    cs.CL cs.LG

    Compositional Zero-Shot Domain Transfer with Text-to-Text Models

    Authors: Fangyu Liu, Qianchu Liu, Shruthi Bannur, Fernando Pérez-García, Naoto Usuyama, Sheng Zhang, Tristan Naumann, Aditya Nori, Hoifung Poon, Javier Alvarez-Valle, Ozan Oktay, Stephanie L. Hyland

    Abstract: Label scarcity is a bottleneck for improving task performance in specialised domains. We propose a novel compositional transfer learning framework (DoT5 - domain compositional zero-shot T5) for zero-shot domain transfer. Without access to in-domain labels, DoT5 jointly learns domain knowledge (from MLM of unlabelled in-domain free text) and task knowledge (from task training on more readily availa… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at TACL, pre-MIT Press publication version. 16 pages, 4 figures

  23. arXiv:2303.11315  [pdf, other

    cs.CL

    Context-faithful Prompting for Large Language Models

    Authors: Wenxuan Zhou, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: Large language models (LLMs) encode parametric knowledge about world facts and have shown remarkable performance in knowledge-driven NLP tasks. However, their reliance on parametric knowledge may cause them to overlook contextual cues, leading to incorrect predictions in context-sensitive NLP tasks (e.g., knowledge acquisition tasks). In this paper, we seek to assess and enhance LLMs' contextual f… ▽ More

    Submitted 22 October, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted at EMNLP 2023 Findings. Code and data are released at https://github.com/wzhouad/context-faithful-llm

  24. arXiv:2303.00915  [pdf, other

    cs.CV cs.CL

    BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs

    Authors: Sheng Zhang, Yanbo Xu, Naoto Usuyama, Hanwen Xu, Jaspreet Bagga, Robert Tinn, Sam Preston, Rajesh Rao, Mu Wei, Naveen Valluri, Cliff Wong, Andrea Tupini, Yu Wang, Matt Mazzola, Swadheen Shukla, Lars Liden, Jianfeng Gao, Matthew P. Lungren, Tristan Naumann, Sheng Wang, Hoifung Poon

    Abstract: Biomedical data is inherently multimodal, comprising physical measurements and natural language narratives. A generalist biomedical AI model needs to simultaneously process different modalities of data, including text and images. Therefore, training an effective generalist biomedical model requires high-quality multimodal data, such as parallel image-text pairs. Here, we present PMC-15M, a novel d… ▽ More

    Submitted 16 January, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: The models are released at https://aka.ms/biomedclip

  25. arXiv:2302.06860  [pdf, other

    cs.CL

    BLIAM: Literature-based Data Synthesis for Synergistic Drug Combination Prediction

    Authors: Cai Yang, Addie Woicik, Hoifung Poon, Sheng Wang

    Abstract: Language models pre-trained on scientific literature corpora have substantially advanced scientific discovery by offering high-quality feature representations for downstream applications. However, these features are often not interpretable, and thus can reveal limited insights to domain experts. Instead of obtaining features from language models, we propose BLIAM, a literature-based data synthesis… ▽ More

    Submitted 16 February, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  26. arXiv:2212.10823  [pdf, other

    cs.CL cs.AI cs.LG

    Continual Contrastive Finetuning Improves Low-Resource Relation Extraction

    Authors: Wenxuan Zhou, Sheng Zhang, Tristan Naumann, Muhao Chen, Hoifung Poon

    Abstract: Relation extraction (RE), which has relied on structurally annotated corpora for model training, has been particularly challenging in low-resource scenarios and domains. Recent literature has tackled low-resource RE by self-supervised learning, where the solution involves pretraining the entity pair embedding by RE-based objective and finetuning on labeled data by classification-based objective. H… ▽ More

    Submitted 31 May, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  27. BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

    Authors: Renqian Luo, Liai Sun, Yingce Xia, Tao Qin, Sheng Zhang, Hoifung Poon, Tie-Yan Liu

    Abstract: Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain. Among the two main branches of pre-trained language models in the general language domain, i.e., BERT (and its variants) and GPT (and its variants), the first one has been extensively studied in the biomedical domain, such as BioBERT and… ▽ More

    Submitted 3 April, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Published at Briefings in Bioinformatics. Code is available at https://github.com/microsoft/BioGPT

    Journal ref: Briefings in Bioinformatics, 2022;, bbac409

  28. arXiv:2208.14565  [pdf, other

    cs.CL cs.AI

    Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning

    Authors: Sheng Zhang, Hao Cheng, Jianfeng Gao, Hoifung Poon

    Abstract: We present a bi-encoder framework for named entity recognition (NER), which applies contrastive learning to map candidate text spans and entity types into the same vector representation space. Prior work predominantly approaches NER as sequence labeling or span classification. We instead frame NER as a representation learning problem that maximizes the similarity between the vector representations… ▽ More

    Submitted 23 February, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: ICLR 2023

  29. Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing

    Authors: Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Stephanie Hyland, Maria Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez-Valle, Hoifung Poon, Ozan Oktay

    Abstract: Multi-modal data abounds in biomedicine, such as radiology images and reports. Interpreting this data at scale is essential for improving clinical care and accelerating clinical research. Biomedical text with its complex semantics poses additional challenges in vision--language modelling compared to the general domain, and previous work has used insufficiently adapted models that lack domain-speci… ▽ More

    Submitted 21 July, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: To appear in ECCV 2022. Code: https://aka.ms/biovil-code Dataset: https://aka.ms/ms-cxr Demo Notebook: https://aka.ms/biovil-demo-notebook

    Journal ref: Computer Vision - ECCV 2022, LNCS vol 13696, pp 1-21

  30. arXiv:2203.10442  [pdf, other

    cs.CL cs.LG

    Towards Structuring Real-World Data at Scale: Deep Learning for Extracting Key Oncology Information from Clinical Text with Patient-Level Supervision

    Authors: Sam Preston, Mu Wei, Rajesh Rao, Robert Tinn, Naoto Usuyama, Michael Lucas, Roshanthi Weerasinghe, Soohee Lee, Brian Piening, Paul Tittel, Naveen Valluri, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: Objective: The majority of detailed patient information in real-world data (RWD) is only consistently available in free-text clinical documents. Manual curation is expensive and time-consuming. Develo** natural language processing (NLP) methods for structuring RWD is thus essential for scaling real-world evidence generation. Materials and Methods: Traditional rule-based systems are vulnerable… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

  31. arXiv:2203.04294  [pdf, other

    eess.IV cs.AI cs.CV

    NaviAirway: a Bronchiole-sensitive Deep Learning-based Airway Segmentation Pipeline

    Authors: Andong Wang, Terence Chi Chun Tam, Ho Ming Poon, Kun-Chang Yu, Wei-Ning Lee

    Abstract: Airway segmentation is essential for chest CT image analysis. Different from natural image segmentation, which pursues high pixel-wise accuracy, airway segmentation focuses on topology. The task is challenging not only because of its complex tree-like structure but also the severe pixel imbalance among airway branches of different generations. To tackle the problems, we present a NaviAirway method… ▽ More

    Submitted 16 June, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

  32. arXiv:2112.07887  [pdf, other

    cs.CL

    Knowledge-Rich Self-Supervision for Biomedical Entity Linking

    Authors: Sheng Zhang, Hao Cheng, Shikhar Vashishth, Cliff Wong, **feng Xiao, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

    Abstract: Entity linking faces significant challenges such as prolific variations and prevalent ambiguities, especially in high-value domains with myriad entities. Standard classification approaches suffer from the annotation bottleneck and cannot effectively handle unseen entities. Zero-shot entity linking has emerged as a promising direction for generalizing to new entities, but it still requires example… ▽ More

    Submitted 23 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

  33. arXiv:2112.07869  [pdf, other

    cs.CL cs.LG

    Fine-Tuning Large Neural Language Models for Biomedical Natural Language Processing

    Authors: Robert Tinn, Hao Cheng, Yu Gu, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

    Abstract: Motivation: A perennial challenge for biomedical researchers and clinical practitioners is to stay abreast with the rapid growth of publications and medical notes. Natural language processing (NLP) has emerged as a promising direction for taming information overload. In particular, large neural language models facilitate transfer learning by pretraining on unlabeled text, as exemplified by the suc… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  34. arXiv:2109.05362  [pdf, other

    cs.CL

    Modular Self-Supervision for Document-Level Relation Extraction

    Authors: Sheng Zhang, Cliff Wong, Naoto Usuyama, Sarthak Jain, Tristan Naumann, Hoifung Poon

    Abstract: Extracting relations across large text spans has been relatively underexplored in NLP, but it is particularly important for high-value domains such as biomedicine, where obtaining high recall of the latest findings is crucial for practical applications. Compared to conventional information extraction confined to short text spans, document-level relation extraction faces additional challenges in bo… ▽ More

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  35. arXiv:2107.12591  [pdf, other

    cs.LG cs.AI

    Combining Probabilistic Logic and Deep Learning for Self-Supervised Learning

    Authors: Hoifung Poon, Hai Wang, Hunter Lang

    Abstract: Deep learning has proven effective for various application tasks, but its applicability is limited by the reliance on annotated examples. Self-supervised learning has emerged as a promising direction to alleviate the supervision bottleneck, but existing work focuses on leveraging co-occurrences in unlabeled data for task-agnostic representation learning, as exemplified by masked language model pre… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: Book chapter. arXiv admin note: substantial text overlap with arXiv:2012.12474, arXiv:1808.08485, arXiv:2008.12878

  36. arXiv:2106.13375  [pdf, other

    cs.IR cs.CL cs.DL

    Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature

    Authors: Yu Wang, **chao Li, Tristan Naumann, Chenyan Xiong, Hao Cheng, Robert Tinn, Cliff Wong, Naoto Usuyama, Richard Rogahn, Zhihong Shen, Yang Qin, Eric Horvitz, Paul N. Bennett, Jianfeng Gao, Hoifung Poon

    Abstract: Information overload is a prevalent challenge in many high-value domains. A prominent case in point is the explosion of the biomedical literature on COVID-19, which swelled to hundreds of thousands of papers in a matter of months. In general, biomedical literature expands by two papers every minute, totalling over a million new papers every year. Search in the biomedical realm, and many other vert… ▽ More

    Submitted 16 September, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2021 Applied Data Science Track

  37. arXiv:2104.05847  [pdf, other

    cs.CL

    Targeted Adversarial Training for Natural Language Understanding

    Authors: Lis Pereira, Xiaodong Liu, Hao Cheng, Hoifung Poon, Jianfeng Gao, Ichiro Kobayashi

    Abstract: We present a simple yet effective Targeted Adversarial Training (TAT) algorithm to improve adversarial training for natural language understanding. The key idea is to introspect current mistakes and prioritize adversarial training steps to where the model errs the most. Experiments show that TAT can significantly improve accuracy over standard adversarial training on GLUE and attain new state-of-t… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 9 pages, 4 tables, 3 figurers, NAACL 2021

  38. arXiv:2012.12474  [pdf, other

    cs.LG stat.ML

    Self-supervised self-supervision by combining deep learning and probabilistic logic

    Authors: Hunter Lang, Hoifung Poon

    Abstract: Labeling training examples at scale is a perennial challenge in machine learning. Self-supervision methods compensate for the lack of direct supervision by leveraging prior knowledge to automatically generate noisy labeled examples. Deep probabilistic logic (DPL) is a unifying framework for self-supervised learning that represents unknown labels as latent variables and incorporates diverse self-su… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

    Comments: 12 pages, 2 figures

  39. arXiv:2011.01580  [pdf, other

    cs.IR cs.CL

    CMT in TREC-COVID Round 2: Mitigating the Generalization Gaps from Web to Special Domain Search

    Authors: Chenyan Xiong, Zhenghao Liu, Si Sun, Zhuyun Dai, Kaitao Zhang, Shi Yu, Zhiyuan Liu, Hoifung Poon, Jianfeng Gao, Paul Bennett

    Abstract: Neural rankers based on deep pretrained language models (LMs) have been shown to improve many information retrieval benchmarks. However, these methods are affected by their the correlation between pretraining domain and target domain and rely on massive fine-tuning relevance labels. Directly applying pretraining methods to specific domains may result in suboptimal search quality because specific d… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 5 pages, 3 figures, 2 tables

  40. arXiv:2007.15779  [pdf, other

    cs.CL cs.LG

    Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing

    Authors: Yu Gu, Robert Tinn, Hao Cheng, Michael Lucas, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, Hoifung Poon

    Abstract: Pretraining large neural language models, such as BERT, has led to impressive gains on many natural language processing (NLP) tasks. However, most pretraining efforts focus on general domain corpora, such as newswire and Web. A prevailing assumption is that even domain-specific pretraining can benefit by starting from general-domain language models. In this paper, we challenge this assumption by s… ▽ More

    Submitted 16 September, 2021; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: ACM Transactions on Computing for Healthcare (HEALTH)

  41. arXiv:2004.08994  [pdf, other

    cs.CL

    Adversarial Training for Large Neural Language Models

    Authors: Xiaodong Liu, Hao Cheng, Pengcheng He, Weizhu Chen, Yu Wang, Hoifung Poon, Jianfeng Gao

    Abstract: Generalization and robustness are both key desiderata for designing machine learning methods. Adversarial training can enhance robustness, but past work often finds it hurts generalization. In natural language processing (NLP), pre-training large neural language models such as BERT have demonstrated impressive gain in generalization for a variety of tasks, with further improvement from adversarial… ▽ More

    Submitted 29 April, 2020; v1 submitted 19 April, 2020; originally announced April 2020.

    Comments: 13 pages, 9 tables, 2 figures

  42. arXiv:2002.07972  [pdf, other

    cs.CL

    The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding

    Authors: Xiaodong Liu, Yu Wang, Jianshu Ji, Hao Cheng, Xueyun Zhu, Emmanuel Awa, Pengcheng He, Weizhu Chen, Hoifung Poon, Guihong Cao, Jianfeng Gao

    Abstract: We present MT-DNN, an open-source natural language understanding (NLU) toolkit that makes it easy for researchers and developers to train customized deep learning models. Built upon PyTorch and Transformers, MT-DNN is designed to facilitate rapid customization for a broad spectrum of NLU tasks, using a variety of objectives (classification, regression, structured prediction) and text encoders (e.g… ▽ More

    Submitted 15 May, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: 9 pages, 3 figures and 3 tables

    Journal ref: ACL 2020 demo

  43. arXiv:1906.04382  [pdf, other

    cs.CL

    DoubleTransfer at MEDIQA 2019: Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain

    Authors: Yichong Xu, Xiaodong Liu, Chunyuan Li, Hoifung Poon, Jianfeng Gao

    Abstract: This paper describes our competing system to enter the MEDIQA-2019 competition. We use a multi-source transfer learning approach to transfer the knowledge from MT-DNN and SciBERT to natural language understanding tasks in the medical domain. For transfer learning fine-tuning, we use multi-task learning on NLI, RQE and QA tasks on general and medical domains to improve performance. The proposed met… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: Proceedings of the BioNLP 2019 workshop, ACL 2019; 7 pages, 5 tables, 1 figure

  44. arXiv:1904.02347  [pdf, other

    cs.CL

    Document-Level $N$-ary Relation Extraction with Multiscale Representation Learning

    Authors: Robin Jia, Cliff Wong, Hoifung Poon

    Abstract: Most information extraction methods focus on binary relations expressed within single sentences. In high-value domains, however, $n$-ary relations are of great demand (e.g., drug-gene-mutation interactions in precision oncology). Such relations often involve entity mentions that are far apart in the document, yet existing work on cross-sentence relation extraction is generally confined to small te… ▽ More

    Submitted 26 June, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

    Comments: NAACL 2019

  45. arXiv:1808.08485  [pdf, other

    cs.CL

    Deep Probabilistic Logic: A Unifying Framework for Indirect Supervision

    Authors: Hai Wang, Hoifung Poon

    Abstract: Deep learning has emerged as a versatile tool for a wide range of NLP tasks, due to its superior capacity in representation learning. But its applicability is limited by the reliance on annotated examples, which are difficult to produce at scale. Indirect supervision has emerged as a promising direction to address this bottleneck, either by introducing labeling functions to automatically generate… ▽ More

    Submitted 25 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018 final version

  46. arXiv:1711.03902  [pdf, other

    cs.AI

    Neural-Symbolic Learning and Reasoning: A Survey and Interpretation

    Authors: Tarek R. Besold, Artur d'Avila Garcez, Sebastian Bader, Howard Bowman, Pedro Domingos, Pascal Hitzler, Kai-Uwe Kuehnberger, Luis C. Lamb, Daniel Lowd, Priscila Machado Vieira Lima, Leo de Penning, Gadi Pinkas, Hoifung Poon, Gerson Zaverucha

    Abstract: The study and understanding of human behaviour is relevant to computer science, artificial intelligence, neural computation, cognitive science, philosophy, psychology, and several other areas. Presupposing cognition as basis of behaviour, among the most prominent tools in the modelling of behaviour are computational-logic systems, connectionist models of cognition, and models of uncertainty. Recen… ▽ More

    Submitted 10 November, 2017; originally announced November 2017.

    Comments: 58 pages, work in progress

  47. arXiv:1709.08600  [pdf, other

    cs.CL cs.LG

    EZLearn: Exploiting Organic Supervision in Large-Scale Data Annotation

    Authors: Maxim Grechkin, Hoifung Poon, Bill Howe

    Abstract: Many real-world applications require automated data annotation, such as identifying tissue origins based on gene expressions and classifying images into semantic categories. Annotation classes are often numerous and subject to changes over time, and annotating examples has become the major bottleneck for supervised learning methods. In science and other high-value domains, large repositories of da… ▽ More

    Submitted 1 July, 2018; v1 submitted 25 September, 2017; originally announced September 2017.

  48. arXiv:1708.03743  [pdf, other

    cs.CL

    Cross-Sentence N-ary Relation Extraction with Graph LSTMs

    Authors: Nanyun Peng, Hoifung Poon, Chris Quirk, Kristina Toutanova, Wen-tau Yih

    Abstract: Past work in relation extraction has focused on binary relations in single sentences. Recent NLP inroads in high-value domains have sparked interest in the more general setting of extracting n-ary relations that span multiple sentences. In this paper, we explore a general relation extraction framework based on graph long short-term memory networks (graph LSTMs) that can be easily extended to cross… ▽ More

    Submitted 12 August, 2017; originally announced August 2017.

    Comments: Conditional accepted by TACL in December 2016; published in April 2017; presented at ACL in August 2017

    Journal ref: Transactions of the Association for Computational Linguistics (TACL) 2017, Vol 5

  49. arXiv:1705.07086  [pdf, other

    cs.LG cs.AI stat.ML

    Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach

    Authors: Emmanouil A. Platanios, Hoifung Poon, Tom M. Mitchell, Eric Horvitz

    Abstract: We propose an efficient method to estimate the accuracy of classifiers using only unlabeled data. We consider a setting with multiple classification problems where the target classes may be tied together through logical constraints. For example, a set of classes may be mutually exclusive, meaning that a data instance can belong to at most one of them. The proposed method is based on the intuition… ▽ More

    Submitted 19 May, 2017; originally announced May 2017.

  50. arXiv:1609.04873  [pdf, other

    cs.CL

    Distant Supervision for Relation Extraction beyond the Sentence Boundary

    Authors: Chris Quirk, Hoifung Poon

    Abstract: The growing demand for structured knowledge has led to great interest in relation extraction, especially in cases with limited supervision. However, existing distance supervision approaches only extract relations expressed in single sentences. In general, cross-sentence relation extraction is under-explored, even in the supervised-learning setting. In this paper, we propose the first approach for… ▽ More

    Submitted 14 August, 2017; v1 submitted 15 September, 2016; originally announced September 2016.

    Comments: Presented at EACL 2017; 9 pages (12 pages including references)