Skip to main content

Showing 51–100 of 148 results for author: Lipton, Z

.
  1. arXiv:2207.13179  [pdf, other

    cs.LG stat.ML

    Unsupervised Learning under Latent Label Shift

    Authors: Manley Roberts, Pranav Mani, Saurabh Garg, Zachary C. Lipton

    Abstract: What sorts of structure might enable a learner to discover classes from unlabeled data? Traditional approaches rely on feature-space similarity and heroic assumptions on the data. In this paper, we introduce unsupervised learning under Latent Label Shift (LLS), where we have access to unlabeled data from multiple domains such that the label marginals $p_d(y)$ can shift across domains but the class… ▽ More

    Submitted 1 December, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2022. Manley Roberts and Pranav Mani contributed equally to this work

  2. arXiv:2207.13048  [pdf, other

    cs.LG

    Domain Adaptation under Open Set Label Shift

    Authors: Saurabh Garg, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: We introduce the problem of domain adaptation under Open Set Label Shift (OSLS) where the label distribution can change arbitrarily and a new class may arrive during deployment, but the class-conditional distributions p(x|y) are domain-invariant. OSLS subsumes domain adaptation under label shift and Positive-Unlabeled (PU) learning. The learner's goals here are two-fold: (a) estimate the target la… ▽ More

    Submitted 16 October, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted at NeurIPS 2022

  3. arXiv:2206.13648  [pdf, other

    stat.ML cs.LG

    Supervised Learning with General Risk Functionals

    Authors: Liu Leqi, Audrey Huang, Zachary C. Lipton, Kamyar Azizzadenesheli

    Abstract: Standard uniform convergence results bound the generalization gap of the expected loss over a hypothesis class. The emergence of risk-sensitive learning requires generalization guarantees for functionals of the loss distribution beyond the expectation. While prior works specialize in uniform convergence of particular functionals, our work provides uniform convergence for a general class of Hölder… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  4. arXiv:2206.10654  [pdf, other

    cs.LG stat.ML

    On the Maximum Hessian Eigenvalue and Generalization

    Authors: Simran Kaur, Jeremy Cohen, Zachary C. Lipton

    Abstract: The mechanisms by which certain training interventions, such as increasing learning rates and applying batch normalization, improve the generalization of deep networks remains a mystery. Prior works have speculated that "flatter" solutions generalize better than "sharper" solutions to unseen data, motivating several metrics for measuring flatness (particularly $λ_{max}$, the largest eigenvalue of… ▽ More

    Submitted 23 May, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Proceedings on "I Can't Believe It's Not Better! - Understanding Deep Learning Through Empirical Falsification" at NeurIPS 2022 Workshops, PMLR 187:51-65, 2023

  5. arXiv:2206.04039  [pdf, ps, other

    cs.CY cs.AI cs.CL cs.LG stat.ML

    Resolving the Human Subjects Status of Machine Learning's Crowdworkers

    Authors: Divyansh Kaushik, Zachary C. Lipton, Alex John London

    Abstract: In recent years, machine learning (ML) has relied heavily on crowdworkers both for building datasets and for addressing research questions requiring human interaction or judgment. The diverse tasks performed and uses of the data produced render it difficult to determine when crowdworkers are best thought of as workers (versus human subjects). These difficulties are compounded by conflicting polici… ▽ More

    Submitted 15 June, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

  6. arXiv:2205.09701  [pdf, other

    cs.HC cs.CY

    Homophily and Incentive Effects in Use of Algorithms

    Authors: Riccardo Fogliato, Sina Fazelpour, Shantanu Gupta, Zachary Lipton, David Danks

    Abstract: As algorithmic tools increasingly aid experts in making consequential decisions, the need to understand the precise factors that mediate their influence has grown commensurately. In this paper, we present a crowdsourcing vignette study designed to assess the impacts of two plausible factors on AI-informed decision-making. First, we examine homophily -- do people defer more to models that tend to a… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted at CogSci, 2022

  7. arXiv:2203.13423  [pdf, ps, other

    cs.LG cs.IR stat.ML

    Modeling Attrition in Recommender Systems with Departing Bandits

    Authors: Omer Ben-Porat, Lee Cohen, Liu Leqi, Zachary C. Lipton, Yishay Mansour

    Abstract: Traditionally, when recommender systems are formalized as multi-armed bandits, the policy of the recommender system influences the rewards accrued, but not the length of interaction. However, in real-world systems, dissatisfied users may depart (and never come back). In this work, we propose a novel multi-armed bandit setup that captures such policy-dependent horizons. Our setup consists of a fini… ▽ More

    Submitted 15 February, 2024; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted at AAAI 2022

  8. arXiv:2202.01336  [pdf, other

    cs.LG

    Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation

    Authors: Yi-Fan Zhang, Hanlin Zhang, Zachary C. Lipton, Li Erran Li, Eric P. Xing

    Abstract: Previous works on Treatment Effect Estimation (TEE) are not in widespread use because they are predominantly theoretical, where strong parametric assumptions are made but untractable for practical application. Recent work uses multilayer perceptron (MLP) for modeling casual relationships, however, MLPs lag far behind recent advances in ML methodology, which limits their applicability and generaliz… ▽ More

    Submitted 17 October, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

  9. arXiv:2201.04234  [pdf, other

    cs.LG stat.ML

    Leveraging Unlabeled Data to Predict Out-of-Distribution Performance

    Authors: Saurabh Garg, Sivaraman Balakrishnan, Zachary C. Lipton, Behnam Neyshabur, Hanie Sedghi

    Abstract: Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions that may cause performance drops. In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data. We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on… ▽ More

    Submitted 14 October, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: Accepted at ICLR 2022

  10. arXiv:2112.09669  [pdf, other

    cs.CL

    Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations

    Authors: Siddhant Arora, Danish Pruthi, Norman Sadeh, William W. Cohen, Zachary C. Lipton, Graham Neubig

    Abstract: In attempts to "explain" predictions of machine learning models, researchers have proposed hundreds of techniques for attributing predictions to features that are deemed important. While these attributions are often claimed to hold the potential to improve human "understanding" of the models, surprisingly little work explicitly evaluates progress towards this aspiration. In this paper, we conduct… ▽ More

    Submitted 21 August, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: AAAI 2022

  11. arXiv:2111.00980  [pdf, other

    cs.LG stat.ML

    Mixture Proportion Estimation and PU Learning: A Modern Approach

    Authors: Saurabh Garg, Yifan Wu, Alex Smola, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: Given only positive examples and unlabeled examples (from both positive and negative classes), we might hope nevertheless to estimate an accurate positive-versus-negative classifier. Formally, this task is broken down into two subtasks: (i) Mixture Proportion Estimation (MPE) -- determining the fraction of positive examples in the unlabeled data; and (ii) PU-learning -- given such an estimate, lea… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: Spotlight at NeurIPS 2021

  12. arXiv:2110.07566  [pdf, other

    cs.CL cs.AI cs.LG

    Practical Benefits of Feature Feedback Under Distribution Shift

    Authors: Anurag Katakkar, Clay H. Yoo, Weiqin Wang, Zachary C. Lipton, Divyansh Kaushik

    Abstract: In attempts to develop sample-efficient and interpretable algorithms, researcher have explored myriad mechanisms for collecting and exploiting feature feedback (or rationales) auxiliary annotations provided for training (but not test) instances that highlight salient evidence. Examples include bounding boxes around objects and salient spans in text. Despite its intuitive appeal, feature feedback h… ▽ More

    Submitted 17 October, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  13. arXiv:2109.04953  [pdf, other

    cs.CL cs.LG

    Does Pretraining for Summarization Require Knowledge Transfer?

    Authors: Kundan Krishna, Jeffrey Bigham, Zachary C. Lipton

    Abstract: Pretraining techniques leveraging enormous datasets have driven recent advances in text summarization. While folk explanations suggest that knowledge transfer accounts for pretraining's benefits, little is known about why it works or what makes a pretraining task or dataset suitable. In this paper, we challenge the knowledge transfer story, showing that pretraining on documents consisting of chara… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: Camera-ready for Findings of EMNLP 2021

  14. arXiv:2109.01443  [pdf, other

    cs.HC cs.AI

    The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies

    Authors: Riccardo Fogliato, Alexandra Chouldechova, Zachary Lipton

    Abstract: As algorithmic risk assessment instruments (RAIs) are increasingly adopted to assist decision makers, their predictive performance and potential to promote inequity have come under scrutiny. However, while most studies examine these tools in isolation, researchers have come to recognize that assessing their impact requires understanding the behavior of their human interactants. In this paper, buil… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: Proceedings of the ACM on Human-Computer Interaction 5, CSCW2, Article 428 (October 2021)

  15. arXiv:2108.09265  [pdf, other

    cs.LG econ.EM stat.ML

    Efficient Online Estimation of Causal Effects by Deciding What to Observe

    Authors: Shantanu Gupta, Zachary C. Lipton, David Childers

    Abstract: Researchers often face data fusion problems, where multiple data sources are available, each capturing a distinct subset of variables. While problem formulations typically take the data as given, in practice, data acquisition can be an ongoing process. In this paper, we aim to estimate any functional of a probabilistic model (e.g., a causal effect) as efficiently as possible, by deciding, at each… ▽ More

    Submitted 30 October, 2021; v1 submitted 20 August, 2021; originally announced August 2021.

    Comments: Accepted at NeurIPS 2021

  16. arXiv:2107.00441  [pdf, ps, other

    cs.CY

    When Curation Becomes Creation: Algorithms, Microcontent, and the Vanishing Distinction between Platforms and Creators

    Authors: Liu Leqi, Dylan Hadfield-Menell, Zachary C. Lipton

    Abstract: Ever since social activity on the Internet began migrating from the wilds of the open web to the walled gardens erected by so-called platforms, debates have raged about the responsibilities that these platforms ought to bear. And yet, despite intense scrutiny from the news media and grassroots movements of outraged users, platforms continue to operate, from a legal standpoint, on the friendliest t… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

  17. arXiv:2106.11342  [pdf

    cs.LG cs.AI cs.CL cs.CV

    Dive into Deep Learning

    Authors: Aston Zhang, Zachary C. Lipton, Mu Li, Alexander J. Smola

    Abstract: This open-source book represents our attempt to make deep learning approachable, teaching readers the concepts, the context, and the code. The entire book is drafted in Jupyter notebooks, seamlessly integrating exposition figures, math, and interactive examples with self-contained code. Our goal is to offer a resource that could (i) be freely available for everyone; (ii) offer sufficient technical… ▽ More

    Submitted 22 August, 2023; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: (HTML) https://D2L.ai (GitHub) https://github.com/d2l-ai/d2l-en/

  18. arXiv:2106.07041  [pdf, other

    cs.LG

    Correcting Exposure Bias for Link Recommendation

    Authors: Shantanu Gupta, Hao Wang, Zachary C. Lipton, Yuyang Wang

    Abstract: Link prediction methods are frequently applied in recommender systems, e.g., to suggest citations for academic papers or friends in social networks. However, exposure bias can arise when users are systematically underexposed to certain relevant items. For example, in citation networks, authors might be more likely to encounter papers from their own field and thus cite them preferentially. This bia… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

  19. arXiv:2106.00872  [pdf, other

    cs.CL cs.AI cs.LG

    On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study

    Authors: Divyansh Kaushik, Douwe Kiela, Zachary C. Lipton, Wen-tau Yih

    Abstract: In adversarial data collection (ADC), a human workforce interacts with a model in real time, attempting to produce examples that elicit incorrect predictions. Researchers hope that models trained on these more challenging datasets will rely less on superficial patterns, and thus be less brittle. However, despite ADC's intuitive appeal, it remains unclear when training on adversarial datasets produ… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted at ACL-IJCNLP 2021

  20. arXiv:2105.04953  [pdf, other

    stat.AP

    On the Validity of Arrest as a Proxy for Offense: Race and the Likelihood of Arrest for Violent Crimes

    Authors: Riccardo Fogliato, Alice Xiang, Zachary Lipton, Daniel Nagin, Alexandra Chouldechova

    Abstract: The risk of re-offense is considered in decision-making at many stages of the criminal justice system, from pre-trial, to sentencing, to parole. To aid decision makers in their assessments, institutions increasingly rely on algorithmic risk assessment instruments (RAIs). These tools assess the likelihood that an individual will be arrested for a new criminal offense within some time window followi… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: Accepted at AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2021

  21. arXiv:2105.00303  [pdf, other

    cs.LG stat.ML

    RATT: Leveraging Unlabeled Data to Guarantee Generalization

    Authors: Saurabh Garg, Sivaraman Balakrishnan, J. Zico Kolter, Zachary C. Lipton

    Abstract: To assess generalization, machine learning scientists typically either (i) bound the generalization gap and then (after training) plug in the empirical risk to obtain a bound on the true risk; or (ii) validate empirically on holdout data. However, (i) typically yields vacuous guarantees for overparameterized models. Furthermore, (ii) shrinks the training set and its guarantee erodes with each re-u… ▽ More

    Submitted 6 November, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: ICML 2021 (Long Talk)

  22. arXiv:2104.08977  [pdf, other

    cs.LG stat.ML

    Off-Policy Risk Assessment in Contextual Bandits

    Authors: Audrey Huang, Liu Leqi, Zachary C. Lipton, Kamyar Azizzadenesheli

    Abstract: Even when unable to run experiments, practitioners can evaluate prospective policies, using previously logged data. However, while the bandits literature has adopted a diverse set of objectives, most research on off-policy evaluation to date focuses on the expected reward. In this paper, we introduce Lipschitz risk functionals, a broad class of objectives that subsumes conditional value-at-risk (C… ▽ More

    Submitted 29 June, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

  23. arXiv:2103.02827  [pdf, other

    cs.LG cs.AI stat.ML

    On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk

    Authors: Audrey Huang, Liu Leqi, Zachary C. Lipton, Kamyar Azizzadenesheli

    Abstract: In order to model risk aversion in reinforcement learning, an emerging line of research adapts familiar algorithms to optimize coherent risk functionals, a class that includes conditional value-at-risk (CVaR). Because optimizing the coherent risk is difficult in Markov decision processes, recent work tends to focus on the Markov coherent risk (MCR), a time-consistent surrogate. While, policy gradi… ▽ More

    Submitted 5 March, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  24. arXiv:2103.02138  [pdf, ps, other

    cs.LG math.NA stat.ML

    Parametric Complexity Bounds for Approximating PDEs with Neural Networks

    Authors: Tanya Marwah, Zachary C. Lipton, Andrej Risteski

    Abstract: Recent experiments have shown that deep networks can approximate solutions to high-dimensional PDEs, seemingly esca** the curse of dimensionality. However, questions regarding the theoretical basis for such approximations, including the required network size, remain open. In this paper, we investigate the representational power of neural networks for approximating solutions to linear elliptic PD… ▽ More

    Submitted 6 July, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

  25. arXiv:2102.10264  [pdf, other

    cs.LG cs.RO stat.ML

    On Proximal Policy Optimization's Heavy-tailed Gradients

    Authors: Saurabh Garg, Joshua Zhanson, Emilio Parisotto, Adarsh Prasad, J. Zico Kolter, Zachary C. Lipton, Sivaraman Balakrishnan, Ruslan Salakhutdinov, Pradeep Ravikumar

    Abstract: Modern policy gradient algorithms such as Proximal Policy Optimization (PPO) rely on an arsenal of heuristics, including loss clip** and gradient clip**, to ensure successful learning. These heuristics are reminiscent of techniques from robust statistics, commonly used for estimation in outlier-rich (``heavy-tailed'') regimes. In this paper, we present a detailed empirical study to characteriz… ▽ More

    Submitted 12 July, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  26. arXiv:2012.04825  [pdf, other

    stat.AP

    Unpacking the Drop in COVID-19 Case Fatality Rates: A Study of National and Florida Line-Level Data

    Authors: Cheng Cheng, Helen Zhou, Jeremy C. Weiss, Zachary C. Lipton

    Abstract: Since the COVID-19 pandemic first reached the United States, the case fatality rate has fallen precipitously. Several possible explanations have been floated, including greater detection of mild cases due to expanded testing, shifts in age distribution among the infected, lags between confirmed cases and reported deaths, improvements in treatment, mutations in the virus, and decreased viral load a… ▽ More

    Submitted 11 December, 2020; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 24 pages, 13 figures

  27. arXiv:2012.00893  [pdf, other

    cs.CL cs.LG

    Evaluating Explanations: How much do explanations from the teacher aid students?

    Authors: Danish Pruthi, Rachit Bansal, Bhuwan Dhingra, Livio Baldini Soares, Michael Collins, Zachary C. Lipton, Graham Neubig, William W. Cohen

    Abstract: While many methods purport to explain predictions by highlighting salient features, what aims these explanations serve and how they ought to be evaluated often go unstated. In this work, we introduce a framework to quantify the value of explanations via the accuracy gains that they confer on a student model trained to simulate a teacher model. Crucially, the explanations are available to the stude… ▽ More

    Submitted 16 December, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: TACL 2021 (pre-MIT Press publication version)

  28. arXiv:2011.13477  [pdf, other

    cs.CL cs.LG

    Decoding and Diversity in Machine Translation

    Authors: Nicholas Roberts, Davis Liang, Graham Neubig, Zachary C. Lipton

    Abstract: Neural Machine Translation (NMT) systems are typically evaluated using automated metrics that assess the agreement between generated translations and ground truth candidates. To improve systems with respect to these metrics, NLP researchers employ a variety of heuristic techniques, including searching for the conditional mode (vs. sampling) and incorporating various training heuristics (e.g., labe… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: Presented at the Resistance AI Workshop, 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  29. arXiv:2011.06741  [pdf, other

    cs.LG stat.ML

    Rebounding Bandits for Modeling Satiation Effects

    Authors: Liu Leqi, Fatma Kilinc-Karzan, Zachary C. Lipton, Alan L. Montgomery

    Abstract: Psychological research shows that enjoyment of many goods is subject to satiation, with short-term satisfaction declining after repeated exposures to the same item. Nevertheless, proposed algorithms for powering recommender systems seldom model these dynamics, instead proceeding as though user preferences were fixed in time. In this work, we introduce rebounding bandits, a multi-armed bandit setup… ▽ More

    Submitted 27 October, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

  30. arXiv:2011.03654  [pdf, other

    cs.CY cs.LG stat.ML

    Fair Machine Learning Under Partial Compliance

    Authors: Jessica Dai, Sina Fazelpour, Zachary C. Lipton

    Abstract: Typically, fair machine learning research focuses on a single decisionmaker and assumes that the underlying population is stationary. However, many of the critical domains motivating this work are characterized by competitive marketplaces with many decisionmakers. Realistically, we might expect only a subset of them to adopt any non-compulsory fairness-conscious policy, a situation that political… ▽ More

    Submitted 26 September, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Presented at AIES 2021; previously at the NeurIPS 2020 Workshop on Consequential Decision Making in Dynamic Environments and the NeurIPS 2020 Workshop on ML for Economic Policy. Minor correction uploaded Sept. 2022

  31. arXiv:2011.01459  [pdf, other

    cs.CL cs.LG

    Weakly- and Semi-supervised Evidence Extraction

    Authors: Danish Pruthi, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton

    Abstract: For many prediction tasks, stakeholders desire not only predictions but also supporting evidence that a human can use to verify its correctness. However, in practice, additional annotations marking supporting evidence may only be available for a minority of training examples (if available at all). In this paper, we propose new methods to combine few evidence annotations (strong semi-supervision) w… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Accepted to the Findings of EMNLP 2020, to be presented at BlackBoxNLP

  32. arXiv:2010.11966  [pdf, other

    cs.CL cs.LG

    Unsupervised Data Augmentation with Naive Augmentation and without Unlabeled Data

    Authors: David Lowell, Brian E. Howard, Zachary C. Lipton, Byron C. Wallace

    Abstract: Unsupervised Data Augmentation (UDA) is a semi-supervised technique that applies a consistency loss to penalize differences between a model's predictions on (a) observed (unlabeled) examples; and (b) corresponding 'noised' examples produced via data augmentation. While UDA has gained popularity for text classification, open questions linger over which design decisions are necessary and over how to… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  33. arXiv:2010.03017  [pdf, other

    cs.CL cs.LG

    On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment

    Authors: Zirui Wang, Zachary C. Lipton, Yulia Tsvetkov

    Abstract: Modern multilingual models are trained on concatenated text from multiple languages in hopes of conferring benefits to each (positive transfer), with the most pronounced benefits accruing to low-resource languages. However, recent work has shown that this approach can degrade performance on high-resource languages, a phenomenon known as negative interference. In this paper, we present the first sy… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Published as a main conference paper at EMNLP 2020

  34. arXiv:2010.02114  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Explaining The Efficacy of Counterfactually Augmented Data

    Authors: Divyansh Kaushik, Amrith Setlur, Eduard Hovy, Zachary C. Lipton

    Abstract: In attempts to produce ML models less reliant on spurious patterns in NLP datasets, researchers have recently proposed curating counterfactually augmented data (CAD) via a human-in-the-loop process in which given some documents and their (initial) labels, humans must revise the text to make a counterfactual label applicable. Importantly, edits that are not necessary to flip the applicable label ar… ▽ More

    Submitted 23 March, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Published at ICLR 2021

  35. arXiv:2007.07151  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Extracting Structured Data from Physician-Patient Conversations By Predicting Noteworthy Utterances

    Authors: Kundan Krishna, Amy Pavel, Benjamin Schloss, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: Despite diverse efforts to mine various modalities of medical data, the conversations between physicians and patients at the time of care remain an untapped source of insights. In this paper, we leverage this data to extract structured information that might assist physicians with post-visit documentation in electronic health records, potentially lightening the clerical burden. In this exploratory… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  36. arXiv:2007.04082  [pdf, other

    q-fin.ST cs.LG cs.NE

    Uncertainty-Aware Lookahead Factor Models for Quantitative Investing

    Authors: Lakshay Chauhan, John Alberg, Zachary C. Lipton

    Abstract: On a periodic basis, publicly traded companies report fundamentals, financial data including revenue, earnings, debt, among others. Quantitative finance research has identified several factors, functions of the reported data that historically correlate with stock market performance. In this paper, we first show through simulation that if we could select stocks via factors calculated on future fund… ▽ More

    Submitted 15 July, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

  37. arXiv:2006.01898  [pdf, other

    stat.AP cs.LG stat.ML

    Predicting Mortality Risk in Viral and Unspecified Pneumonia to Assist Clinicians with COVID-19 ECMO Planning

    Authors: Helen Zhou, Cheng Cheng, Zachary C. Lipton, George H. Chen, Jeremy C. Weiss

    Abstract: Respiratory complications due to coronavirus disease COVID-19 have claimed tens of thousands of lives in 2020. Many cases of COVID-19 escalate from Severe Acute Respiratory Syndrome (SARS-CoV-2) to viral pneumonia to acute respiratory distress syndrome (ARDS) to death. Extracorporeal membranous oxygenation (ECMO) is a life-sustaining oxygenation and ventilation therapy that may be used for patient… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

  38. arXiv:2005.01795  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques

    Authors: Kundan Krishna, Sopan Khosla, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: Following each patient visit, physicians draft long semi-structured clinical summaries called SOAP notes. While invaluable to clinicians and researchers, creating digital SOAP notes is burdensome, contributing to physician burnout. In this paper, we introduce the first complete pipelines to leverage deep summarization models to generate these notes based on transcripts of conversations between phy… ▽ More

    Submitted 2 June, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Published at ACL 2021 Main Conference

  39. arXiv:2003.11991  [pdf, ps, other

    stat.ME cs.LG econ.EM stat.ML

    Estimating Treatment Effects with Observed Confounders and Mediators

    Authors: Shantanu Gupta, Zachary C. Lipton, David Childers

    Abstract: Given a causal graph, the do-calculus can express treatment effects as functionals of the observational joint distribution that can be estimated empirically. Sometimes the do-calculus identifies multiple valid formulae, prompting us to compare the statistical properties of the corresponding estimators. For example, the backdoor formula applies when all confounders are observed and the frontdoor fo… ▽ More

    Submitted 14 June, 2021; v1 submitted 26 March, 2020; originally announced March 2020.

  40. arXiv:2003.07554  [pdf, other

    cs.LG stat.ML

    A Unified View of Label Shift Estimation

    Authors: Saurabh Garg, Yifan Wu, Sivaraman Balakrishnan, Zachary C. Lipton

    Abstract: Under label shift, the label distribution p(y) might change but the class-conditional distributions p(x|y) do not. There are two dominant approaches for estimating the label marginal. BBSE, a moment-matching approach based on confusion matrices, is provably consistent and provides interpretable error bounds. However, a maximum likelihood estimation approach, which we call MLLS, dominates empirical… ▽ More

    Submitted 16 October, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: Accepted at Neurips 2020

  41. arXiv:2002.11096  [pdf, other

    stat.ML cs.LG math.OC

    Causal Inference With Selectively Deconfounded Data

    Authors: Kyra Gan, Andrew A. Li, Zachary C. Lipton, Sridhar Tayur

    Abstract: Given only data generated by a standard confounding graph with unobserved confounder, the Average Treatment Effect (ATE) is not identifiable. To estimate the ATE, a practitioner must then either (a) collect deconfounded data;(b) run a clinical trial; or (c) elucidate further properties of the causal graph that might render the ATE identifiable. In this paper, we consider the benefit of incorporati… ▽ More

    Submitted 6 March, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Journal ref: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021, San Diego, California, USA. PMLR: Volume 130. Copyright 2021 by the author(s)

  42. arXiv:2002.10021  [pdf, other

    cs.LG stat.ML

    How Transferable are the Representations Learned by Deep Q Agents?

    Authors: Jacob Tyo, Zachary Lipton

    Abstract: In this paper, we consider the source of Deep Reinforcement Learning (DRL)'s sample complexity, asking how much derives from the requirement of learning useful representations of environment states and how much is due to the sample complexity of learning a policy. While for DRL agents, the distinction between representation and policy may not be clear, we seek new insight through a set of transfer… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

  43. arXiv:2001.09773  [pdf, ps, other

    cs.CY cs.AI cs.LG stat.ML

    Algorithmic Fairness from a Non-ideal Perspective

    Authors: Sina Fazelpour, Zachary C. Lipton

    Abstract: Inspired by recent breakthroughs in predictive modeling, practitioners in both industry and government have turned to machine learning with hopes of operationalizing predictions to drive automated decisions. Unfortunately, many social desiderata concerning consequential decisions, such as justice or fairness, have no natural formulation within a purely predictive framework. In efforts to mitigate… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.

    Comments: Accepted for publication at the AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES) 2020

  44. arXiv:1912.06074  [pdf, other

    cs.LG cs.AI stat.ML

    Game Design for Eliciting Distinguishable Behavior

    Authors: Fan Yang, Liu Leqi, Yifan Wu, Zachary C. Lipton, Pradeep Ravikumar, William W. Cohen, Tom Mitchell

    Abstract: The ability to inferring latent psychological traits from human behavior is key to develo** personalized human-interacting machine learning systems. Approaches to infer such traits range from surveys to manually-constructed experiments and games. However, these traditional games are limited because they are typically designed based on heuristics. In this paper, we formulate the task of designing… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  45. arXiv:1910.08640  [pdf, other

    cs.LG cs.CV stat.ML

    Are Perceptually-Aligned Gradients a General Property of Robust Classifiers?

    Authors: Simran Kaur, Jeremy Cohen, Zachary C. Lipton

    Abstract: For a standard convolutional neural network, optimizing over the input pixels to maximize the score of some target class will generally produce a grainy-looking version of the original image. However, Santurkar et al. (2019) demonstrated that for adversarially-trained neural networks, this optimization produces images that uncannily resemble the target class. In this paper, we show that these "per… ▽ More

    Submitted 23 October, 2019; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: To appear in the "Science Meets Engineering of Deep Learning" Workshop at NeurIPS 2019

  46. arXiv:1910.00762  [pdf, other

    cs.LG stat.ML

    Accelerating Deep Learning by Focusing on the Biggest Losers

    Authors: Angela H. Jiang, Daniel L. -K. Wong, Giulio Zhou, David G. Andersen, Jeffrey Dean, Gregory R. Ganger, Gauri Joshi, Michael Kaminksy, Michael Kozuch, Zachary C. Lipton, Padmanabhan Pillai

    Abstract: This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward pass to decide whether to use that example to compute gradients and update parameters, or to skip immediately to the next example. By reducing the number of co… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  47. arXiv:1909.12434  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Learning the Difference that Makes a Difference with Counterfactually-Augmented Data

    Authors: Divyansh Kaushik, Eduard Hovy, Zachary C. Lipton

    Abstract: Despite alarm over the reliance of machine learning systems on so-called spurious patterns, the term lacks coherent meaning in standard statistical frameworks. However, the language of causality offers clarity: spurious associations are due to confounding (e.g., a common cause), but not direct or indirect causal effects. In this paper, we focus on natural language processing, introducing methods a… ▽ More

    Submitted 14 February, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Published at ICLR 2020

  48. arXiv:1909.07913  [pdf, other

    cs.CL cs.LG

    Learning to Deceive with Attention-Based Explanations

    Authors: Danish Pruthi, Mansi Gupta, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton

    Abstract: Attention mechanisms are ubiquitous components in neural architectures applied to natural language processing. In addition to yielding gains in predictive accuracy, attention weights are often claimed to confer interpretability, purportedly useful both for providing insights to practitioners and for explaining why a model makes its decisions to stakeholders. We call the latter use of attention mec… ▽ More

    Submitted 6 April, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted to ACL 2020 as a long paper. Updated version

  49. arXiv:1909.05356  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Entity Projection via Machine Translation for Cross-Lingual NER

    Authors: Alankar Jain, Bhargavi Paranjape, Zachary C. Lipton

    Abstract: Although over 100 languages are supported by strong off-the-shelf machine translation systems, only a subset of them possess large annotated corpora for named entity recognition. Motivated by this fact, we leverage machine translation to improve annotation-projection approaches to cross-lingual named entity recognition. We propose a system that improves over prior entity-projection methods by: (a)… ▽ More

    Submitted 13 September, 2019; v1 submitted 31 August, 2019; originally announced September 2019.

  50. arXiv:1908.04364  [pdf, other

    cs.CL cs.IR

    AmazonQA: A Review-Based Question Answering Task

    Authors: Mansi Gupta, Nitish Kulkarni, Raghuveer Chanda, Anirudha Rayasam, Zachary C Lipton

    Abstract: Every day, thousands of customers post questions on Amazon product pages. After some time, if they are fortunate, a knowledgeable customer might answer their question. Observing that many questions can be answered based upon the available product reviews, we propose the task of review-based QA. Given a corpus of reviews and a question, the QA system synthesizes an answer. To this end, we introduce… ▽ More

    Submitted 20 August, 2019; v1 submitted 12 August, 2019; originally announced August 2019.

    Comments: 8 pages, 7 figures; IJCAI-19; first three authors contribute equally. Data and code available at https://github.com/amazonqa/amazonqa