Skip to main content

Showing 1–2 of 2 results for author: Meaney, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.08787  [pdf, other

    cs.CL

    Comparing Variation in Tokenizer Outputs Using a Series of Problematic and Challenging Biomedical Sentences

    Authors: Christopher Meaney, Therese A Stukel, Peter C Austin, Michael Escobar

    Abstract: Background & Objective: Biomedical text data are increasingly available for research. Tokenization is an initial step in many biomedical text mining pipelines. Tokenization is the process of parsing an input biomedical sentence (represented as a digital character sequence) into a discrete set of word/token symbols, which convey focused semantic/syntactic meaning. The objective of this study is to… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  2. arXiv:2204.07056  [pdf, other

    cs.CL stat.AP stat.CO

    A Comparative Evaluation Of Transformer Models For De-Identification Of Clinical Text Data

    Authors: Christopher Meaney, Wali Hakimpour, Sumeet Kalia, Rahim Moineddin

    Abstract: Objective: To comparatively evaluate several transformer model architectures at identifying protected health information (PHI) in the i2b2/UTHealth 2014 clinical text de-identification challenge corpus. Methods: The i2b2/UTHealth 2014 corpus contains N=1304 clinical notes obtained from N=296 patients. Using a transfer learning framework, we fine-tune several transformer model architectures on th… ▽ More

    Submitted 25 March, 2022; originally announced April 2022.

    Comments: 38 pages, 3 figures, 6 tables, arxiv pre-print