Skip to main content

Showing 1–2 of 2 results for author: Mozer, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.03687  [pdf, other

    cs.CL stat.AP stat.ME

    Leveraging text data for causal inference using electronic health records

    Authors: Reagan Mozer, Aaron R. Kaufman, Leo A. Celi, Luke Miratrix

    Abstract: In studies that rely on data from electronic health records (EHRs), unstructured text data such as clinical progress notes offer a rich source of information about patient characteristics and care that may be missing from structured data. Despite the prevalence of text in clinical research, these data are often ignored for the purposes of quantitative analysis due their complexity. This paper pres… ▽ More

    Submitted 20 May, 2024; v1 submitted 9 June, 2023; originally announced July 2023.

  2. arXiv:1801.00644  [pdf, other

    stat.ME cs.CL

    Matching with Text Data: An Experimental Evaluation of Methods for Matching Documents and of Measuring Match Quality

    Authors: Reagan Mozer, Luke Miratrix, Aaron Russell Kaufman, L. Jason Anastasopoulos

    Abstract: Matching for causal inference is a well-studied problem, but standard methods fail when the units to match are text documents: the high-dimensional and rich nature of the data renders exact matching infeasible, causes propensity scores to produce incomparable matches, and makes assessing match quality difficult. In this paper, we characterize a framework for matching text documents that decomposes… ▽ More

    Submitted 13 March, 2019; v1 submitted 2 January, 2018; originally announced January 2018.